METHODS OF DIAGNOSIS OF CANCER, COMPOSITIONS AND METHODS OF 
SCREENING FOR MODULATORS OF CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/340,376, filed December 14, 2001; 
Attorney Docket Number 018501-006400US, filed February 8, 2002; USSN 60/347,211, filed 
January 8, 2002; USSN 60/334,393, filed November 29, 2001; USSN 60/335,394, filed 
November 15, 2001; USSN 60/347,349, filed January 10, 2002; USSN 60/368,809, filed March 
29, 2002; USSN 60/409,450, filed September 9, 2002; USSN 60/359,077, filed February 20, 
2002; USSN 60/386,614, filed June 5, 2002; USSN 60/356,714, filed February 13, 2002; USSN 
60/397,775 filed July 22, 2002; USSN 60/332,464, filed November 21, 2001; USSN 
60/397,845, filed July 22, 2002; USSN 60/370,110, filed April 4, 2002; USSN 60/396,839, filed 
July 16, 2002; USSN 60/350,666, filed November 13, 2001; and USSN 60/372,246, filed April 
12, 2002; each of which is incorporated herein by reference for all purposes. The application 
also incorporates by reference PCT/US02/29560; PCT/US02/02242; and PCT/US02/17594. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression profiles 
and nucleic acids, products, and antibodies thereto that are involved in cancer and other 
diseases; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of these conditions. The invention further relates to methods for 
identifying and using agents and/or targets that modulate these conditions. 

BACKGROUND OF THE INVENTION 
Cancer is a major cause of morbidity in the United States. For example, in 1996, the 
American Cancer Society estimated that 1,359,150 people were diagnosed with a malignant 
neoplasm and 554,740 died from one of these diseases. Cancer is responsible for 23.9 percent 
of all American deaths and is exceeded only by heart disease as a cause of mortality (33 
percent). Unfortunately, cancer mortality is increasing and sometime early in this century, 
cancer is expected to become the leading cause of mortality in the United States as it already is 
in Japan. 

Cancers share the characteristic of disordered control over normal cell division, growth, 
and differentiation. Their initial clinical manifestations are extremely heterogeneous, with over 
70 types of cancer arising in virtually every organ and tissue of the body. Moreover, some of 
those similarly classified cancer types may represent multiple different molecular diseases. 
Unfortunately, some cancers may be virtually asymptomatic until late in the disease course, 
when treatment is more difficult, and prognosis grim. 

Treatment for cancer typically includes surgery, chemotherapy, and/or radiation therapy. 
Although nearly 50 percent of cancer patients can be effectively treated using these methods, 
the current therapies all induce serious side effects which diminish quality of life. The 
identification of novel therapeutic targets and diagnostic markers will be important for 
improving the diagnosis, prognosis, and treatment of cancer patients. 

Recent advances in molecular medicine have increased the interest in tumor-specific 
antigens that could serve as targets for various immunotherapeutic or small molecule strategies. 
Antigens suitable for immunotherapeutic strategies should be highly expressed in cancer tissues, 
preferably accessible from the vasculature and at the cell surface, and ideally not expressed in 
normal adult tissues. Expression in tissues that are dispensable for life, however, may be 
tolerated, e.g., reproductive organs, especially those absent in one sex. Examples of antigens 
that are currently available for the detection and treatment of certain cancers include Her2/neu 
and the B-cell antigen CD20. Humanized monoclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer. See 
Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal antibodies 
(Rituxan®/rituximab) are used to effectively treat non-Hodgkin's lymphoma. See Maloney, et 
al. (1997) Blood 90:2188-2195; Leget and Czuczman (1998) Curr. Opin. Oncol. 10:548-551. 

The elucidation of a role for novel proteins and compounds in disease states for 
identification of therapeutic targets and diagnostic markers is valuable for improving the current 
treatment of cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
intervention in various defined cancers. Additionally, provided herein are methods that can be 
used in diagnosis and prognosis of cancer. Further provided are methods that can be used to 
screen candidate bioactive agents for the ability to modulate cancer. 

SUMMARY OF THE INVENTION 

The present invention provides methods for determining the presence or absence of a 
pathological cell in a patient, the method comprising detecting a nucleic acid comprising a 
sequence at least 80% identical to a sequence as described in Tables 2A-80 in a biological 
sample from the patient, thereby determining the presence or absence of the pathological cell. 



WO 03/042661 



PCT/US02/36810 



In certain embodiments of the method, the pathology is described in Table 1, including a 
cancer; the biological sample comprises isolated nucleic acids; the nucleic acids are mRNA; the 
biological sample is tissue from an organ which is affected by the pathology of Table 1, 
including a cancer; the method further comprises a step of amplifying nucleic acids before the step of detecting 
the nucleic acid; the detecting is of a protein encoded by the nucleic acid; the nucleic acid 
comprises a sequence as described in Tables 2A-80; the detecting step is carried out by using a 
labeled nucleic acid probe, utilizing a biochip comprising a sequence at least 80% identical to a 
sequence as described in Tables 2A-80, or detecting a polypeptide encoded by the nucleic acid; 
or the patient is undergoing a therapeutic regimen to treat the pathology of Table 1, or is 
suspected of having the pathology or cancer. 

Compositions are also provided, e.g., an isolated nucleic acid molecule comprising a 
sequence as described in Tables 2A-80, including, e.g., those which are labeled; an expression 
vector comprising such nucleic acid; a host cell comprising such expression vector; an isolated 
polypeptide which is encoded by such a nucleic acid molecule comprising a sequence as 
described in Tables 2A-80; or an antibody that specifically binds the polypeptide. In particular 
embodiments, the antibody is conjugated to an effector component, is conjugated to a 
detectable label (including, e.g., a fluorescent label, a radioisotope, or a cytotoxic chemical), is an 
antibody fragment, or is a humanized antibody. 

Additional methods are provided, including methods for specifically targeting a 
compound to a pathological cell in a patient, the method comprising administering to the patient 
an antibody, as described, thereby providing the targeting. Others include, e.g., methods for 
determining the presence or absence of a pathological cell in a patient, the methods comprising 
contacting a biological sample with an antibody, as described. In more particular methods, the 
antibody is: conjugated to an effector component, or to a fluorescent label; or the biological 
sample is a blood, serum, urine, or stool sample. 

Further methods include those for identifying a compound that modulates a pathology- 
associated polypeptide, the method comprising steps of: contacting the compound with a 
pathology-associated polypeptide, the polypeptide encoded by a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as described in Tables 2A-80; and 
determining the functional effect of the compound upon the polypeptide. Another drug 
screening assay method comprises steps of: administering a test compound to a mammal having 
a pathology of Table 1 or a cell isolated therefrom; and comparing the level of gene expression 
of a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as described in Tables 2A-80 in a treated cell or mammal with the level of gene expression of 
the polynucleotide in a control cell or mammal, wherein a test compound that modulates the 
level of expression of the polynucleotide is a candidate for the treatment of the pathology. 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and prognosis evaluation for various disorders, e.g., angiogenesis, 
fibrosis, and various defined forms of cancer, including metastatic cancer, as well as methods 
for screening for compositions which modulate such conditions. Also provided are methods for 
treating such disorders or cancers. See, e.g., American Society of Clinical Oncology (ed. 2001) 
ASCO Curriculum: Symptom Management Kendall/Hunt, ISBN: 0787277851; Bonadonna, et 
al. (2001) Textbook of Breast Cancer (2d ed.) Dunitz Martin, ISBN: 1853178241; Devita and 
Hellman (eds. 2001) Cancer Principles and Practice of Oncology (2 vols.), Lippincott Williams, 
ISBN: 0781723876; Howell, et al. (2001) Breast Cancer Isis Medical Media, ISBN: 
1901865584; Kaye and Laws (2001) Brain Tumours: An Encyclopedic Approach (2d ed.) 
Churchill Livingstone, ISBN: 0443064261; Mihm, et al. (2001) The Melanocytic Proliferation: 
A Comprehensive Textbook of Pigmented Lesions Wiley-Liss, ISBN: 0471252719; 
Montgomery and Aaron (2001) Clinical Pathology of Soft-Tissue Tumors Marcel Dekker, 
ISBN: 0824702905; Petrovich, et al. (eds. 2001) Combined Modality of Central Nervous 
System Tumors (Medical Radiology) Springer Verlag, ISBN: 3540660534; Rosen (2001) 
Rosen's Breast Pathology Lippincott Williams and Wilkins, ISBN: 0781723795; Shah, et al. 
(2001) Oral Cancer Isis Medical Media, ISBN: 189906687X; Weiss and Goldblum (2001) 
Enzinger and Weiss's Soft Tissue Tumors (4th ed.) Mosby, ISBN: 0323012000; Abeloff, et al. 
(eds. 2000) Clinical Oncology (2d ed.) Churchill Livingstone, ISBN: 044307545X; American 
Society of Clinical Oncology (ed. 2000) Cancer Genetics and Cancer Predisposition Testing 
Kendall/Hunt, ISBN: 0787276154; Fletcher (2000) Diagnostic Histopathology of Tumors (2 
vols. 2d ed.) Churchill Livingstone, ISBN: 0443079927; Vogelzang (ed. 2000) Comprehensive 
Textbook of Genitourinary Oncology (2d ed.) Lippincott Williams and Wilkins, ISBN: 
0683306456; Holland, et al. (eds. 2000) Holland-Frei Cancer Medicine (Book with CD-ROM 
5th ed.) Decker, ISBN: 1550091131; Turrisi, et al. (2000) Lung Cancer Isis Medical Media, 
ISBN: 1901865428; Bartolozzi and Lencioni (eds. 1999) Liver Malignancies: Diagnostic and 
Interventional Radiology (Medical Radiology) Springer Verlag, ISBN: 3540647562; Gasparini 
(ed. 1999) Prognostic Variables in Node-Negative and Node-Positive Breast Cancer Kluwer, 
ISBN: 0792384474; Hansen (ed. 1999) The IASLC Textbook of Lung Cancer: International 
Association for the Study of Lung Cancer Dunitz Martin, ISBN: 1853177083; Raghavan, et al. 
(eds. 1999) Textbook of Uncommon Cancer (2nd ed.) Wiley, ISBN: 0471929212; Thawley, et 
al. (eds. 1999) Comprehensive Management of Head and Neck Tumors (2 vols.) Saunders, 
ISBN: 0721655823; Whittaker and Holmes (eds. 1999) Leukemia and Related Disorders (3d 
ed.) Blackwell Science, ISBN: 0865426074; Aapro (ed. 1998) OncoMedia: Medical Oncology 
(CD-ROM) Elsevier Science, ISBN: 0080427480; Abeloff (1998) Clinical Oncology (Library 
Version 2 CD-ROM Individual Version 2.0 Windows and Macintosh) Harcourt Brace, ISBN: 
0443075557; Benson (ed. 1998) Gastrointestinal Oncology (Cancer Treatment and Research, 
CTAR 98) Kluwer, ISBN: 0792382056; Brambilla and Brambilla (eds. 1998) Lung Tumors: 
Fundamental Biology and Clinical Management (Vol 124) Marcel Dekker, ISBN: 0824701607; 
Canellos, et al. (eds. 1998) The Lymphomas Saunders, ISBN: 0721650309; Greenspan and 
Remagen (1998) Differential Diagnosis of Tumors and Tumor-Like Lesions of Bones and Joints 
Lippincott Williams and Wilkins Publishers, ISBN: 0397517106; Hiddemann (ed. 1998) Acute 
Leukemias VII: Experimental Approaches and Novel Therapies (Haematologie Und 
Bluttransfusion, Vol 39), Springer Verlag, ISBN: 3540635041; Husband and Reznek (1998) 
Imaging in Oncology (2 vols.) Mosby, ISBN: 1899066489; Leibel and Phillips (eds. 1998) 
Textbook of Radiation Oncology Saunders, ISBN: 0721653367; Maloney and Miller (eds. 
1998) Cutaneous Oncology: Pathophysiology, Diagnosis, and Management Blackwell Science, 
ISBN: 0865425175; Mittal, et al. (eds. 1998) Advances in Radiation Therapy Kluwer, ISBN: 
0792399811; Oldham (ed. 1998) Principles of Cancer Biotherapy (3d ed.) Kluwer, ISBN: 
0792335074; Ozols (ed. 1998) Gynecologic Oncology Kluwer, ISBN: 0792380703; Parkin, et 
al. (eds. 1998) Cancer Incidence in Five Continents (IARC Scientific Publications, No 143) 
Oxford University Press, ISBN: 9283221435; Perez and Brady (eds. 1998) Principles and 
Practice of Radiation Oncology Lippincott Williams and Wilkins, ISBN: 0397584164; Black, et 
al. (eds. 1997) Cancer of the Nervous System Blackwell Science, ISBN: 0865423849; 
Bonadonna, et al. (1997) Textbook of Breast Cancer: A Clinical Guide to Therapy Blackwell 
Science, ISBN: 1853173487; Pollock (ed. 1997) Surgical Oncology Kluwer, ISBN: 
0792399005; Sheaves, et al. (eds. 1997) Clinical Endocrine Oncology Blackwell Science, 
ISBN: 086542862X; Vahrson (1997) Radiation Oncology of Gynecological Cancers Springer 
Verlag, ISBN: 0387567682; Walterhouse and Conn (eds. 1997) Diagnostic and Therapeutic 
Advances in Pediatric Oncology Kluwer, ISBN: 0792399781; Aisner (ed. 1996) Comprehensive 
Textbook of Thoracic Oncology Lippincott, Williams and Wilkins, ISBN: 0683000624; 
Bertino, et al. (eds. 1996) Encyclopedia of Cancer (3 vols.) Academic, ISBN: 012093230X; 
Cavalli, et al. (1996) Textbook of Medical Oncology Dunitz Martin, ISBN: 1853172901; 
Peckham, et al. (eds. 1995) Oxford Textbook of Oncology (2 vols.) Oxford University Press, 
ISBN: 0192616854; and Freireich and Kantarjian (eds. 1996) Molecular Genetics and Therapy 
of Leukemia (Cancer Treatment and Research, V. 84) Kluwer, ISBN: 0792339126. 

In particular, identification of markers selectively expressed on defined cancers allows 
for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the 
invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and small 
molecule agonists/antagonists, which will be useful to selectively identify those markers. For 
example, therapeutic methods may take the form of protein therapeutics which use the marker 
expression for selective localization or modulation of function (for those markers which have a 
causative disease effect), for vaccines, identification of binding partners, or antagonism, e.g., 
using antisense or RNAi. The markers may be useful for molecular characterization of subsets 
of the diseases, e.g., as provided in Table 1, which subsets may actually require very different 
treatments. Moreover, the markers may also be important in related diseases to the specific 
disorders and cancers, e.g., which affect similar tissues in non-malignant diseases, or have 
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may also 
be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related but 
distinct diseases, or to determine treatment strategy. The detection methods may be based upon 
nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging, IHC, etc. 
The diagnosis may be qualitative or quantitative, and may detect increases or decreases in 
expression levels. 

Tables 2B-76B provide unigene cluster identification numbers for the nucleotide 
sequence of genes that exhibit increased or decreased expression in diseased samples (see 
Tables 1-3), particularly sequences involved in angiogenesis, arthritis, prostate cancer, breast 
cancer, colorectal cancer, cervical cancer, bladder cancer, head and neck cancer, esophageal 
cancer, lung cancer, ovarian cancer, pancreatic cancer, renal cancer, stomach cancer, skin 
cancer, testicular cancer, uterine cancer, glioblastoma, Ewing sarcoma, soft tissue sarcoma, and 
lung fibrosis. Tables 2A-80 also provide an exemplar accession number that provides a 
nucleotide sequence that is part of the unigene cluster. 

Definitions 

The term "cancer protein" or "cancer polynucleotide" or "cancer-associated transcript" 
refers to nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologues that: (1) have a nucleotide sequence that has greater than about 60% nucleotide 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably about 92%, 94%, 96%, 97%, 
98%, or 99% or greater nucleotide sequence identity, preferably over a region 
of at least about 25, 50, 100, 200, 500, 1000, or more nucleotides, to a nucleotide sequence of or 
associated with a gene of Tables 1-80; (2) bind to antibodies, e.g., polyclonal antibodies, raised 
against an immunogen comprising an amino acid sequence encoded by a nucleotide sequence of 
or associated with a gene of Tables 1-80, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or the 
complement thereof of Tables 1-80 and conservatively modified variants thereof; or (4) have an 
amino acid sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 
75%, 80%, 85%, preferably 90%, 91%, 93%, 95%, 97%, 98%, or 99% or greater amino acid 
sequence identity, preferably over a region of at least about 25, 50, 100, 200, 
500, 1000, or more amino acids, to an amino acid sequence encoded by a nucleotide sequence 
of or associated with a gene of Tables 1-80. A polynucleotide or polypeptide sequence is 
typically from a mammal including, but not limited to, primate, e.g., human; rodent, e.g., rat, 
mouse, hamster; cow, pig, horse, sheep, or other mammal. A "cancer polypeptide" and a 
"cancer polynucleotide" include both naturally occurring and recombinant forms. 

A "full length" cancer protein or nucleic acid refers to a cancer polypeptide or 
polynucleotide sequence, or a variant thereof, that contains elements normally contained in one 
or more naturally occurring, wild type cancer polynucleotide or polypeptide sequences. The 
"full length" may be prior to, or after, various stages of post-translational processing or splicing, 
including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that contains 
nucleic acids or polypeptides, e.g., of a cancer protein, polynucleotide, or transcript. Such 
samples include, but are not limited to, tissue isolated from primates, e.g., humans, or rodents, 
e.g., mice, and rats. Biological samples may also include sections of tissues such as biopsy and 
autopsy samples, frozen sections taken for histologic purposes, archival samples, blood, plasma, 
serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also include explants and 
primary and/or transformed cell cultures derived from patient tissues. A biological sample is 
typically obtained from a eukaryotic organism, most preferably a mammal such as a primate, 
e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a 
bird; reptile; or fish. Livestock and domestic animals are of interest. 

"Providing a biological sample" means to obtain a biological sample for use in methods 
described in this invention. Most often, this will be done by removing a sample of cells from an 
animal, but can also be accomplished by using previously isolated cells (e.g., isolated by 
another person, at another time, and/or for another purpose), or by performing the methods of 
the invention in vivo. Archival tissues or materials, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic acids or 
polypeptide sequences, refer to two or more sequences or subsequences that are the same or 
have a specified percentage of amino acid residues or nucleotides that are the same (e.g., about 
70% identity, preferably 75%, 80%, 85%, 90%, 91%, 93%, 95%, 97%, 98%, 99%, or higher 
identity over a specified region, when compared and aligned for maximum correspondence over 
a comparison window or designated region) as measured using, e.g., the BLAST or BLAST 2.0 
sequence comparison algorithms with default parameters described below, or by manual 
alignment and visual inspection (see, e.g., NCBI web site http://www.ncbi.nlm.nih.gov/BLAST/ 
or the like). Such sequences are then said to be "substantially identical." This definition also 
refers to, or may be applied to, the complement of a test sequence. The definition also includes 
sequences that have deletions and/or insertions, substitutions, and naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
algorithms can account for gaps and the like. Preferably, identity exists over a region that is at 
least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
about 50-100 amino acids or nucleotides in length. 
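
As an illustration of the percent identity definition above, the following is a minimal sketch, not a description of BLAST itself. It assumes the two sequences have already been aligned and padded with a gap character; the function names, the gap symbol "-", and the choice to count a one-sided gap as a mismatch are assumptions made for this example only. The default thresholds (80% identity, region of at least 25 positions) are taken from values recited in the text.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same residue.

    Columns where both sequences have a gap are ignored. A column with a gap in
    only one sequence is counted as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def is_substantially_identical(aligned_a, aligned_b,
                               min_identity=80.0, min_region=25, gap="-"):
    """True when identity is at least min_identity over at least min_region columns."""
    region = sum(1 for x, y in zip(aligned_a, aligned_b)
                 if not (x == gap and y == gap))
    identity = percent_identity(aligned_a, aligned_b, gap)
    return region >= min_region and identity >= min_identity


if __name__ == "__main__":
    a = "ACGT-ACGTA"
    b = "ACGTTACGAA"
    print(percent_identity(a, b))            # identity over 10 aligned columns
    print(is_substantially_identical(a, b))  # region shorter than 25 columns
```

Other conventions for gap columns exist, and the resulting percentage depends on which one is chosen, so the convention should be stated alongside any reported identity value.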

For sequence comparison, typically one sequence acts as a reference sequence, to which 
test sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are entered into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. Preferably, default program 
parameters can be used, or alternative parameters can be designated. The sequence comparison 
algorithm then calculates the percent sequence identities for the test sequences relative to the 
reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of contiguous 
positions selected from the group consisting typically of from about 20 to 600, usually about 50 
to 200, more usually about 100 to 150, in which a sequence may be compared to a reference 
sequence of the same number of contiguous positions after the two sequences are optimally 
aligned. Methods of alignment of sequences for comparison are well-known. Optimal 
alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm 
of Smith and Waterman (1981) Adv. Appl. Math. 2:482-489, by the homology alignment 
algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443-453, by the search for 
similarity method of Pearson and Lipman (1988) Proc. Nat'l. Acad. Sci. USA 85:2444-2448, by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI), or by manual alignment and visual inspection (see, e.g., Ausubel, et al. (eds. 
1995 and supplements) Current Protocols in Molecular Biology Wiley). 
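
The local alignment approach cited above can be sketched as follows. This is a simplified illustration in the spirit of the Smith and Waterman algorithm, not the GAP, BESTFIT, FASTA, or TFASTA implementations: it returns only the best local alignment score, uses a linear gap penalty, and its function name and scoring values (match 5, mismatch -4, gap -6) are assumptions chosen for the example.

```python
def local_alignment_score(a, b, match=5, mismatch=-4, gap=-6):
    """Best local alignment score between a and b by dynamic programming.

    Each cell holds the best score of any alignment ending at that pair of
    positions; scores are floored at zero so an alignment may start anywhere.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # The shared segment "ACGT" contributes four matches.
    print(local_alignment_score("TTACGTTT", "GGACGTGG"))
```

Only two rows of the scoring matrix are kept in memory because the sketch does not trace back the alignment itself; recovering the aligned residues would require retaining the full matrix.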

Preferred examples of algorithms that are suitable for determining percent sequence 
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 
described in Altschul, et al. (1997) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990) J. 
Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described 
herein, to determine percent sequence identity for the nucleic acids and proteins of the 
invention. Software for performing BLAST analyses is publicly available through the National 
Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves 
first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in 
the query sequence, which either match or satisfy some positive-valued threshold score T when 
aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold (Altschul, et al., supra). These initial neighborhood word 
hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are 
extended in both directions along each sequence for as far as the cumulative alignment score 
can be increased. Cumulative scores are calculated using, e.g., for nucleotide sequences, the 
parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score 
for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 
calculate the cumulative score. Extension of the word hits in each direction is halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring 
residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN 
program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) 
of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 
scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915-919), 
alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 
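
The word-seeding and extension steps described above can be sketched for nucleotide sequences as follows. This is a simplified illustration and not the NCBI BLAST implementation: seeds are exact word matches only (no neighborhood threshold T), extension is ungapped, and the function names and the drop-off value X=20 are assumptions made for the example. M=5 and N=-4 follow the BLASTN defaults recited in the text.

```python
def find_seeds(query, subject, w=11):
    """Yield (query_pos, subject_pos) for every exact shared word of length w."""
    index = {}
    for j in range(len(subject) - w + 1):
        index.setdefault(subject[j:j + w], []).append(j)
    for i in range(len(query) - w + 1):
        for j in index.get(query[i:i + w], []):
            yield i, j


def _extend(query, subject, qi, sj, step, m, n, x, start):
    """Extend in one direction; return (score gained, positions kept).

    Extension halts when the cumulative score falls X below its maximum,
    when it reaches zero or below, or when either sequence ends.
    """
    score = best = start
    kept = length = 0
    while 0 <= qi < len(query) and 0 <= sj < len(subject):
        score += m if query[qi] == subject[sj] else n
        length += 1
        if score > best:
            best, kept = score, length
        if best - score >= x or score <= 0:
            break
        qi += step
        sj += step
    return best - start, kept


def extend_hit(query, subject, qi, sj, w=11, m=5, n=-4, x=20):
    """Extend a seed at (qi, sj); return (score, query_start, query_end)."""
    seed = m * w
    right_gain, right_len = _extend(query, subject, qi + w, sj + w, 1, m, n, x, seed)
    left_gain, left_len = _extend(query, subject, qi - 1, sj - 1, -1, m, n, x, seed)
    return seed + left_gain + right_gain, qi - left_len, qi + w + right_len


if __name__ == "__main__":
    query, subject = "TTACGTACGG", "CCACGTACGA"
    for qi, sj in find_seeds(query, subject, w=4):
        print((qi, sj), extend_hit(query, subject, qi, sj, w=4))
```

A short word length of 4 is used in the usage lines only to keep the example small; with the default of 11 the same functions apply to longer sequences.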

The BLAST algorithm also performs a statistical analysis of the similarity between two 
sequences. See, e.g., Karlin and Altschul (1993) Proc. Nat'l. Acad. Sci. USA 90:5873-5877. 
One measure of similarity provided by the BLAST algorithm is the smallest sum probability 
(P(N)), which provides an indication of the probability by which a match between two 
nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a reference sequence if the smallest sum probability in a comparison of the 
test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than 
about 0.01, and most preferably less than about 0.001. Log values may be large negative 
numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
peptides differ only by conservative substitutions. Another indication that two nucleic acid 
sequences are substantially identical is that the two molecules or their complements hybridize to 
each other under stringent conditions. Yet another indication that two nucleic acid sequences 
are substantially identical is that the same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host cells 
may be cultured cells, explants, cells in vivo, and the like. Host cells may be prokaryotic cells 
such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells such as 
CHO, HeLa, and the like (see, e.g., the American Type Culture Collection catalog or web site, 
www.atcc.org). 

The terms "isolated," "purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid or 
protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means that the 
nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure, and most 
preferably at least 99% pure. "Purify" or "purification" in other embodiments means removing 
at least one contaminant or component from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogeneous, e.g., 100% pure. 

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 
one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and non-naturally occurring amino acid polymers. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well as 
amino acid analogs and amino acid mimetics that function similarly to the naturally occurring 
amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as 
those amino acids that are later modified, e.g., hydroxyproline, γ-carboxyglutamate, and 
O-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical 
structure as a naturally occurring amino acid, e.g., an α carbon that is bound to a hydrogen, a 
carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine 
sulfoxide, methionine methyl sulfonium. Such analogs may have modified R groups (e.g., 
norleucine) or modified peptide backbones, but retain the same basic chemical structure as a 
naturally occurring amino acid. Amino acid mimetic refers to a chemical compound that has a 
structure that is different from the general chemical structure of an amino acid, but that 
functions similarly to another amino acid. 

Amino acids may be referred to herein by either their commonly known three letter 
symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variant" applies to both amino acid and nucleic acid 
sequences. With respect to particular nucleic acid sequences, conservatively modified variants 
refers to those nucleic acids which encode identical or essentially identical amino acid 
sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially 
identical or associated, e.g., naturally contiguous, sequences. Because of the degeneracy of the 
genetic code, a large number of functionally identical nucleic acids encode most proteins. For 
instance, the codons GCA, GCC, GCG, and GCU each encode the amino acid alanine. Thus, at 
each position where an alanine is specified by a codon, the codon can be altered to another of 
the corresponding codons described without altering the encoded polypeptide. Such nucleic 
acid variations are "silent variations," which are one species of conservatively modified 
variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
silent variations of the nucleic acid. In certain contexts each codon in a nucleic acid (except 
AUG, which is ordinarily the only codon for methionine, and UGG, which is ordinarily the only
codon for tryptophan) can be modified to yield a functionally similar molecule. Accordingly, a 
silent variation of a nucleic acid which encodes a polypeptide is implicit in a described 
sequence with respect to the expression product, but not necessarily with respect to actual probe 
sequences. 
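
The degeneracy described above can be illustrated with a minimal Python sketch. It assumes the standard genetic code and treats T as U; the function names (translate, synonymous_codons, is_silent_variant) are hypothetical and are chosen only for illustration, not taken from this disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order; "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(seq):
    """Translate an RNA or DNA coding sequence (T is read as U) codon by codon."""
    rna = seq.upper().replace("T", "U")
    usable = len(rna) - len(rna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[rna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(codon):
    """Return every codon that encodes the same amino acid as `codon`."""
    aa = CODON_TABLE[codon.upper().replace("T", "U")]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def is_silent_variant(seq_a, seq_b):
    """True when two distinct sequences of equal length encode the same polypeptide."""
    a = seq_a.upper().replace("T", "U")
    b = seq_b.upper().replace("T", "U")
    return a != b and len(a) == len(b) and translate(a) == translate(b)


print(synonymous_codons("GCA"))  # the four alanine codons
print(is_silent_variant("AUGGCAUGG", "AUGGCUUGG"))  # GCA -> GCU leaves alanine unchanged
```

In this sketch the methionine and tryptophan codons each have a single-member synonym list, which is why they are the exceptions noted in the paragraph above.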

As to amino acid sequences, one of skill will recognize that individual substitutions,
deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alter,
add, or delete a single amino acid or a small percentage of amino acids in the encoded
sequence are "conservatively modified variants" where the alteration results in the substitution of
an amino acid with a chemically similar amino acid. Conservative substitution tables providing
functionally similar amino acids are well known. Such conservatively modified variants are in
addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the
invention. Typical conservative substitutions for one another include: 1) Alanine (A), Glycine
(G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine 
(R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine 
(F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), 
Methionine (M) (see, e.g., Creighton (1984) Proteins: Structure and Molecular Properties 
Freeman). 
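
The eight substitution groups listed above can be encoded directly, e.g., as in the following Python sketch. The group memberships are copied from the list above (methionine appears in groups 5 and 8); the names CONSERVATIVE_GROUPS, is_conservative, and substitutions are hypothetical and serve only as an illustration.

```python
# The eight groups of mutually conservative substitutions, in one-letter code.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative(original, replacement):
    """True when two different residues share at least one group."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def substitutions(reference, variant):
    """List (position, original, replacement, conservative?) for equal-length sequences."""
    if len(reference) != len(variant):
        raise ValueError("this sketch compares substitutions only, not insertions or deletions")
    return [
        (i + 1, r, v, is_conservative(r, v))
        for i, (r, v) in enumerate(zip(reference, variant))
        if r != v
    ]


print(substitutions("MKDL", "MRDW"))
```

The sketch handles substitutions only; deletions and additions, which the definition above also contemplates, would require an alignment step that is not shown.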

Macromolecular structures such as polypeptide structures can be described in terms of 
various levels of organization. For a general discussion of this organization, see, e.g., Alberts, 
et al. (eds. 2001) Molecular Biology of the Cell (4th ed.) Garland; and Cantor and Schimmel
(1980) Biophysical Chemistry Part I: The Conformation of Biological Macromolecules 
Freeman. "Primary structure" refers to the amino acid sequence of a particular peptide. 
"Secondary structure" refers to locally ordered, three dimensional structures within a 
polypeptide. These structures are commonly known as domains. Domains are portions of a 
polypeptide that often form a compact unit of the polypeptide and are typically 25 to 
approximately 500 amino acids long. Typical domains are made up of sections of lesser 
organization such as stretches of β-sheet and α-helices. "Tertiary structure" refers to the
complete three dimensional structure of a polypeptide monomer. "Quaternary structure" refers 
to the three dimensional structure formed, usually by the noncovalent association of 
independent tertiary units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents used 
herein means at least two nucleotides covalently linked together. Oligonucleotides are typically 
from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50, or more nucleotides in length, up to about 
100 nucleotides in length. Nucleic acids and polynucleotides are polymers of any length,
including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A
nucleic acid of the present invention will generally contain phosphodiester bonds, although in
some cases, nucleic acid analogs are included that may have at least one different linkage, e.g.,
phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages
(see Eckstein (1992) Oligonucleotides and Analogues: A Practical Approach Oxford Univ. 
Press); and peptide nucleic acid backbones and linkages. Other analog nucleic acids include 
those with positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7 of Sanghvi and 
Cook (eds. 1994) Carbohydrate Modifications in Antisense Research ACS Symposium Series 
580. Nucleic acids containing one or more carbocyclic sugars are also included within one 
definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done for a 
variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and
analogs can be made; alternatively, mixtures of different nucleic acid analogs may be made.

A variety of references disclose such nucleic acid analogs, including, e.g., 
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-1963 and references therein; 
Letsinger (1970) J. Org. Chem. 35:3800-3803; Sprinzl, et al. (1977) Eur. J. Biochem. 81:579-589;
Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett.
805; Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-4471; and Pauwels, et al. (1986)
Chemica Scripta 26:141-149), phosphorothioate (Mag, et al. (1991) Nucleic Acids Res.
19:1437-441; and U.S. Patent No. 5,644,048), phosphorodithioate (Brill, et al. (1989) J. Am.
Chem. Soc. 111:2321-2322), O-methylphosphoroamidite linkages (see Eckstein (1992)
Oligonucleotides and Analogues: A Practical Approach, Oxford Univ. Press), and peptide
nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. Soc. 114:1895-1897; 
Meier, et al. (1992) Chem. Int. Ed. Engl. 31:1008-1010; Nielsen (1993) Nature 365:566-568; 
Carlsson, et al. (1996) Nature 380:207, all of which are incorporated by reference). Other 
analog nucleic acids include those with positive backbones (Dempcy, et al. (1995) Proc. Natl.
Acad. Sci. USA 92:6097-101); non-ionic backbones (U.S. Patent Nos. 5,386,023, 5,637,684,
5,602,240, 5,216,141, and 4,469,863; Kiedrowski, et al. (1991) Angew. Chem. Intl. Ed. English 
30:423-426; Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-4471; Letsinger, et al. (1994) 
Nucleoside and Nucleotide 13:1597; Chapters 2 and 3 in Sanghvi and Cook (eds. 1994) 
Carbohydrate Modifications in Antisense Research ACS Symposium Series 580; Mesmaeker, et 
al. (1994) Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular 
NMR 34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743) and non-ribose backbones, including 
those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7 in Sanghvi 
and Cook (eds. 1994) Carbohydrate Modifications in Antisense Research ACS Symposium 
Series 580. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids (see Jenkins, et al. (1995) Chem. Soc. Rev. pp. 169-176). Several
nucleic acid analogs are described in Rawls (page 35, June 2, 1997) C&E News.

Particularly preferred are peptide nucleic acids (PNA), which include peptide nucleic
acid analogs. These backbones are substantially non-ionic under neutral conditions, in contrast
to the highly charged phosphodiester backbone of naturally occurring nucleic acids. This
results in at least two advantages. The PNA backbone exhibits improved hybridization kinetics.
PNAs have larger changes in the melting temperature (Tm) for mismatched versus perfectly
matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in Tm for an internal
mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C. Similarly, due to
their non-ionic nature, hybridization of the bases attached to these backbones is relatively
insensitive to salt concentration. In addition, PNAs are not degraded by cellular enzymes, and
thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or contain
portions of both double stranded and single stranded sequence. The depiction of a single strand
also defines the sequence of the complementary strand; thus the sequences described herein also
provide the complement of the sequence. The nucleic acid may be DNA, both genomic and
cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and
ribo-nucleotides, and combinations of bases, including uracil, adenine, thymine, cytosine,
guanine, inosine, xanthine, hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically
refers to a naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the
term "nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic acid, 
each containing a base, are referred to herein as a nucleoside. 
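
The statement that a depicted single strand also defines its complementary strand can be shown with a short Python sketch. It assumes only the four standard bases of DNA or RNA; modified bases such as inosine are not handled, and the function name reverse_complement is hypothetical.

```python
# Watson-Crick pairing tables for the four standard bases.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(seq, rna=False):
    """Return the complementary strand, written 5' to 3'."""
    table = RNA_COMPLEMENT if rna else DNA_COMPLEMENT
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return seq.upper().translate(table)[::-1]


print(reverse_complement("ATGC"))
print(reverse_complement("AUGC", rna=True))
```

Applying the function twice returns the original sequence, which is one simple check that a depicted strand and its complement carry the same information.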

A "label" or a "detectable moiety" is a composition detectable by spectroscopic,
photochemical, biochemical, immunochemical, physiological, chemical, or other physical
means. In general, labels fall into three classes: a) isotopic labels, which may be radioactive or
heavy isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags; and c)
colored or fluorescent dyes. The labels may be incorporated into the cancer nucleic acids,
proteins, and antibodies. For example, the label should be capable of producing, either directly
or indirectly, a detectable signal. The detectable moiety may be a radioisotope, such as ³H,
¹⁴C, ³²P, ³⁵S, or ¹²⁵I; electron-dense reagents; a fluorescent or chemiluminescent compound,
such as fluorescein isothiocyanate, rhodamine, or luciferin; or an enzyme (e.g., as commonly
used in an ELISA), biotin, digoxigenin, or haptens and proteins or other entities which can be
made detectable such as alkaline phosphatase, beta-galactosidase, or horseradish peroxidase.
Methods are known for conjugating the antibody to the label. See, e.g., Hunter, et al. (1962)
Nature 144:945; David, et al. (1974) Biochemistry 13:1014-1021; Pain, et al. (1981) J.
Immunol. Meth. 40:219-230; and Nygren (1982) J. Histochem. and Cytochem. 30:407-412.

An "effector" or "effector moiety" or "effector component" is a molecule that is bound
(or linked, or conjugated), either covalently, through a linker or a chemical bond, or
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody.
The "effector" can be a variety of molecules including, e.g., detection moieties including
radioactive compounds, fluorescent compounds, enzymes or substrates, tags such as epitope
tags, toxins; activatable moieties, chemotherapeutic agents; lipases; antibiotics; chemoattracting
moieties, immune modulators (micA/B), or radioisotopes, e.g., emitting "hard" beta radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, e.g., covalently, 
through a linker or a chemical bond, or noncovalently, through ionic, van der Waals, 
electrostatic, or hydrogen bonds to a label such that the presence of the probe may be detected 
by detecting the presence of the label bound to the probe. Alternatively, methods using high 
affinity interactions may achieve the same results where one of a pair of binding partners binds 
to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of 
binding to a target nucleic acid of complementary sequence through one or more types of 
chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond 
formation. As used herein, a probe may include natural (e.g., A, G, C, or T) or modified bases 
(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a linkage 
other than a phosphodiester bond, preferably one that does not functionally interfere with 
hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent bases
are joined by peptide bonds rather than phosphodiester linkages. Probes may bind target 
sequences lacking complete complementarity with the probe sequence depending upon the 
stringency of the hybridization conditions. The probes are preferably directly labeled, e.g., with 
isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with biotin to 
which a streptavidin complex may later bind. By assaying for the presence or absence of the 
probe, one can detect the presence or absence of the select sequence or subsequence. Diagnosis 
or prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein, or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant cells 
express genes that are not found within the native (non-recombinant) form of the cell or express 
native genes that are otherwise abnormally expressed, under expressed, or not expressed at all. 
By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed in vitro, 
in general, by the manipulation of nucleic acid, e.g., using polymerases and endonucleases, in a 
form not normally found in nature. In this manner, operable linkage of different sequences is
achieved. Thus an isolated nucleic acid, in a linear form, or an expression vector formed in
vitro by ligating DNA molecules that are not normally joined, are both considered recombinant 
for the purposes of this invention. It is understood that once a recombinant nucleic acid is made 
and reintroduced into a host cell or organism, it will replicate non-recombinantly, e.g., using the 
in vivo cellular machinery of the host cell rather than in vitro manipulations; however, such 
nucleic acids, once produced recombinantly, although subsequently replicated non- 
recombinantly, are still considered recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant techniques, e.g., 
through the expression of a recombinant nucleic acid as depicted above. A recombinant protein 
is distinguished from naturally occurring protein by at least one or more characteristics. The 
protein may be isolated or purified away from some or most of the proteins and compounds 
with which it is normally associated in its wild type host, and thus may be substantially pure. 
An isolated protein is unaccompanied by at least some of the material with which it is normally 
associated in its natural state, preferably constituting at least about 0.5%, more preferably at 
least about 5% by weight of the total protein in a given sample. A substantially pure protein 
comprises at least about 75% by weight of the total protein, with at least about 80% being 
preferred, and at least about 90% being particularly preferred. The definition includes the 
production of a cancer protein from one organism in a different organism or host cell. 
Alternatively, the protein may be made at a significantly higher concentration than is normally
seen, through the use of an inducible promoter or high expression promoter, such that the 
protein is made at increased concentration levels. Alternatively, the protein may be in a form 
not normally found in nature, as in the addition of an epitope tag or amino acid substitutions, 
insertions and deletions, as discussed below. 

The term "heterologous" when used with reference to portions of a nucleic acid indicates 
that the nucleic acid comprises two or more subsequences that are not normally found in the 
same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences, e.g., from unrelated genes arranged to 
make a new functional nucleic acid, e.g., a promoter from one source and a coding region from 
another source. Similarly, a heterologous protein will often refer to two or more subsequences 
that are not found in the same relationship to each other in nature (e.g., a fusion protein). 

A "promoter" is typically an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most environmental 
and developmental conditions. An "inducible" promoter is active under environmental or 
developmental regulation. The term "operably linked" refers to a functional linkage between a 
nucleic acid expression control sequence (such as a promoter, or array of transcription factor 
binding sites) and a second nucleic acid sequence, e.g., wherein the expression control sequence 
directs transcription of the nucleic acid corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be transcribed 
in operable linkage to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, duplexing, 
or hybridizing of a molecule selectively to a particular nucleotide sequence under stringent 
hybridization conditions when that sequence is present in a complex mixture (e.g., total cellular 
or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under which a probe 
will hybridize to its target subsequence, typically in a complex mixture of nucleic acids, but to 
no other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures. An 
extensive guide to the hybridization of nucleic acids is found in "Overview of principles of 
hybridization and the strategy of nucleic acid assays" in Tijssen (1993) Hybridization with 
Nucleic Probes (Laboratory Techniques in Biochemistry and Molecular Biology) (vol. 24) 
Elsevier. Generally, stringent conditions are selected to be about 5-10° C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is
the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50% of
the probes complementary to the target hybridize to the target sequence at equilibrium (as the
target sequences are present in excess, at Tm, 50% of the probes are occupied at equilibrium).
Stringent conditions will be those in which the salt concentration is less than about 1.0 M
sodium ion, typically about 0.01-1.0 M sodium ion concentration (or other salts) at pH 7.0 to
8.3 and the temperature is at least about 30° C for short probes (e.g., about 10-50 nucleotides)
and at least about 60° C for long probes (e.g., greater than about 50 nucleotides). Stringent 
conditions may also be achieved with the addition of destabilizing agents such as formamide. 
For selective or specific hybridization, a positive signal is typically at least two times 
background, preferably 10 times background hybridization. Exemplary stringent hybridization 
conditions can be as follows: 50% formamide, 5x SSC, and 1% SDS, incubating at 42° C, or,
5x SSC, 1% SDS, incubating at 65° C, with wash in 0.2x SSC, and 0.1% SDS at 65° C. For 
PCR, a temperature of about 36° C is typical for low stringency amplification, although 
annealing temperatures may vary between about 32°-48° C depending on primer length. For 
high stringency PCR amplification, a temperature of about 62° C is typical, although high 
stringency annealing temperatures can range from about 50-65° C, depending on the primer 
length and specificity. Typical cycle conditions for both high and low stringency amplifications 
include a denaturation phase of 90-95° C for 30-120 sec, an annealing phase lasting 30-120 sec, 
and an extension phase of about 72° C for 1-2 min. Protocols and guidelines for low and high 
stringency amplification reactions are provided, e.g., in Innis, et al. (1990) PCR Protocols: A 
Guide to Methods and Applications Academic Press, NY. 
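
The numerical guidelines above (a hybridization temperature about 5-10° C below the Tm, minimum temperatures of about 30° C and 60° C for short and long probes, and a positive signal of at least two times background) can be collected in a short Python sketch. The Tm is assumed to be supplied by the user, e.g., from a measurement; the function names and the 50-nucleotide cutoff used to separate short from long probes are illustrative assumptions.

```python
def stringent_temperature_window(tm_celsius):
    """Return (low, high) temperatures lying 10 and 5 degrees C below the given Tm."""
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def minimum_temperature(probe_length):
    """About 30 C for short probes (up to about 50 nt), about 60 C for longer probes."""
    return 30.0 if probe_length <= 50 else 60.0


def is_positive_signal(signal, background, fold=2.0):
    """True when the signal is at least `fold` times background (2x typical, 10x preferred)."""
    if background <= 0:
        raise ValueError("background must be positive")
    return signal >= fold * background


low, high = stringent_temperature_window(70.0)
print(f"hybridize between {low} and {high} C, and not below {minimum_temperature(25)} C")
print(is_positive_signal(250.0, 100.0))
```

The sketch does not estimate the Tm itself, which depends on sequence, ionic strength, pH, and nucleic acid concentration as stated above.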

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent hybridization 
conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 1% SDS at 37° C, 
and a wash in 1x SSC at 45° C. A positive hybridization is typically at least twice background.
Alternative hybridization and wash conditions can be utilized to provide conditions of similar 
stringency. Additional guidelines for determining hybridization parameters are provided in 
numerous references, e.g., Ausubel, et al. (eds. 1991 and supplements) Current Protocols in 
Molecular Biology Wiley. 

The phrase "functional effects" in the context of assays for testing compounds that 
modulate activity of a cancer protein includes the determination of a parameter that is indirectly 
or directly under the influence of the cancer protein or nucleic acid, e.g., a physiological, 
functional, physical, or chemical effect, such as the ability to decrease cancer. It includes ligand 
binding activity; cell viability; cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis; and other characteristics of cancer cells. "Functional effects" include in vitro, in
vivo, and ex vivo activities.

By "determining the functional effect" is meant assaying for a compound that increases
or decreases a parameter that is indirectly or directly under the influence of a cancer protein
sequence, e.g., physiological, functional, enzymatic, physical, or chemical effects. Such
functional effects can be measured by, e.g., measuring changes in spectroscopic characteristics (e.g.,
fluorescence, absorbance, refractive index), hydrodynamic (e.g., shape), chromatographic, or
solubility properties for the protein, measuring inducible markers or transcriptional activation of
the cancer protein, measuring binding activity or binding assays, e.g., binding to antibodies or
other ligands, and measuring growth, cellular proliferation, cell viability, cellular
transformation, growth factor or serum dependence, tumor specific marker levels, invasiveness
into Matrigel, tumor growth and metastasis in vivo, mRNA and protein expression, and other
characteristics of cancer cells. The functional effects can be evaluated by many means, e.g.,
microscopy for quantitative or qualitative measures of alterations in morphological features,
measurement of changes in RNA or protein levels for cancer-associated sequences,
measurement of RNA stability, identification of downstream or reporter gene expression (CAT,
luciferase, β-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence, colorimetric
reactions, antibody binding, inducible markers, and ligand binding assays.

"Inhibitors," "activators," and "modulators" of cancer polynucleotide and polypeptide
sequences are used to refer to activating, inhibitory, or modulating molecules or compounds
identified using in vitro and in vivo assays of cancer polynucleotide and polypeptide sequences.
Inhibitors are compounds that, e.g., bind to, partially or totally block activity, decrease, prevent,
delay activation, inactivate, desensitize, or down regulate the activity or expression, of cancer
proteins, e.g., antagonists. Antisense or inhibitory nucleic acids may seem to inhibit expression
and subsequent function of the protein. "Activators" are compounds that increase, open,
activate, facilitate, enhance activation, sensitize, agonize, or up regulate cancer protein activity.
Inhibitors, activators, or modulators also include genetically modified versions of cancer
proteins, e.g., versions with altered activity, as well as naturally occurring and synthetic ligands,
antagonists, agonists, antibodies, small chemical molecules, and the like. Such assays for
inhibitors and activators include, e.g., expressing the cancer protein in vitro, in cells, or cell
membranes, applying putative modulator compounds, and then determining the functional 
effects on activity, as described above. Activators and inhibitors of cancer can also be identified 
by incubating cancer cells with the test compound and determining increases or decreases in the 
expression of 1 or more cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more 
cancer proteins, such as cancer proteins encoded by the sequences set out in Tables 2A-80. 

Samples or assays comprising cancer proteins that are treated with a potential activator, 
inhibitor, or modulator are compared to control samples without the inhibitor, activator, or 
modulator to examine the extent of inhibition. Control samples (untreated with inhibitors) are 
assigned a relative protein activity value of 100%. Inhibition of a polypeptide is achieved when 
the activity value relative to the control is about 80%, preferably 50%, more preferably 25-0%. 
Activation of a cancer polypeptide is achieved when the activity value relative to the control 
(untreated with activators) is about 110%, more preferably 150%, more preferably 200-500% 
(e.g., two to five fold higher relative to the control), more preferably 1000-3000% higher. 
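
The comparison to control described above can be expressed as a short Python sketch. The default cutoffs of about 80% for inhibition and about 110% for activation are the outermost values stated above; the function names and the "no call" label for intermediate values are illustrative assumptions.

```python
def relative_activity(treated, control):
    """Express treated-sample activity as a percentage of the untreated control (100%)."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * treated / control


def classify(activity_pct, inhibit_at=80.0, activate_at=110.0):
    """Label a relative activity value using the cutoffs described above."""
    if activity_pct <= inhibit_at:
        return "inhibition"
    if activity_pct >= activate_at:
        return "activation"
    return "no call"


pct = relative_activity(treated=40.0, control=80.0)
print(pct, classify(pct))
```

Stricter calls corresponding to the preferred ranges can be obtained by passing, e.g., inhibit_at=50.0 or activate_at=150.0.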

The phrase "changes in cell growth" refers to any change in cell growth and proliferation 
characteristics in vitro or in vivo, such as cell viability, formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell morphology, 
gaining or losing immortalization, gaining or losing tumor specific markers, ability to form or 
suppress tumors when injected into suitable animal hosts, and/or immortalization of the cell. 
See, e.g., pp. 231-241 in Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique
(2d ed.) Wiley-Liss. 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy. See, Freshney (2000) 
Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley-Liss. 

"Antibody" refers to a polypeptide comprising a framework region from an
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon, 
and mu constant region genes, as well as the myriad immunoglobulin variable region genes. 
Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, 
mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, 
IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or its 
functional equivalent will be most critical in specificity and affinity of binding. See Paul (ed. 
1999) Fundamental Immunology (4th ed.) Raven. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each
tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light"
(about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a
variable region of about 100 to 110 or more amino acids primarily responsible for antigen
recognition. The terms variable light chain (VL) and variable heavy chain (VH) refer to these
light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an antibody 
below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself
is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild
conditions to break the disulfide linkage in the hinge region, thereby converting the F(ab)'2
dimer into an Fab' monomer. The Fab' monomer is essentially Fab with part of the hinge region
(see Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven). While various antibody
fragments are defined in terms of the digestion of an intact antibody, one of skill will appreciate 
that such fragments may be synthesized de novo either chemically or by using recombinant 
DNA methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies, or those synthesized de novo using 
recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage display 
libraries (see, e.g., McCafferty, et al. (1990) Nature 348:552-554). 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal antibodies, 
many techniques are known. See, e.g., Kohler and Milstein (1975) Nature 256:495-497; Kozbor, et
al. (1983) Immunology Today 4:72; Cole, et al. (1985) pp. 77-96 in Reisfeld and Sell (1985)
Monoclonal Antibodies and Cancer Therapy Liss; Coligan (1991) Current Protocols in
Immunology Lippincott; Harlow and Lane (1988) Antibodies: A Laboratory Manual CSH
Press; and Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic
Press. Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can be
adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or other
organisms such as other mammals, may be used to express humanized antibodies.
Alternatively, phage display technology can be used to identify antibodies and heteromeric Fab
fragments that specifically bind to selected antigens. See, e.g., McCafferty, et al. (1990) Nature
348:552-554; Marks, et al. (1992) Biotechnology 10:779-783.

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, and/or species, or an entirely 
different molecule which confers new properties to the chimeric antibody, e.g., an enzyme, 
toxin, hormone, growth factor, drug, effector function, chemoattractant, immune modulator, 
etc.; or (b) the variable region, or a portion thereof, is altered, replaced, or exchanged with a 
variable region having a different or altered antigen specificity. 
Identification of cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient samples 
for which diagnosis information is desired, to provide expression profiles. An expression 
profile of a particular sample is essentially a "fingerprint" of the state of the sample; while two 
states may have any particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is characteristic of the 
state of the cell. That is, normal tissue may be distinguished from cancerous or metastatic 
cancerous tissue, or cancer tissue or metastatic cancerous tissue can be compared with tissue 
from surviving cancer patients. By comparing expression profiles of tissue in known different 
cancer states, information regarding which genes are important (including both up- and 
down-regulation of genes) in each of these states is obtained. Molecular profiling may distinguish 
subtypes of a currently collective disease designation, e.g., different forms of a cancer. 

The identification of sequences that are differentially expressed in cancer versus non- 
cancer tissue allows the use of this information in a number of ways. For example, a particular 
treatment regime may be evaluated: does a chemotherapeutic drug act to down-regulate cancer, 
and thus tumor growth or recurrence, in a particular patient? Alternatively, a treatment step may 
induce other markers which may be used as targets to destroy tumor cells. Similarly, diagnosis 
and treatment outcomes may be done or confirmed by comparing patient samples with the 
known expression profiles. Malignant disease may be compared to non-malignant conditions. 
Metastatic tissue can also be analyzed to determine the stage of cancer in the tissue, or origin of 
primary tumor, e.g., metastasis from a remote primary site. Furthermore, these gene expression 
profiles (or individual genes) allow screening of drug candidates with an eye to mimicking or 
altering a particular expression profile; e.g., screening can be done for drugs that suppress the 
cancer expression profile. This may be done by making biochips comprising sets of the 
important cancer genes, which can then be used in these screens. These methods can also be 
done on the protein basis; that is, protein expression levels of the cancer proteins can be 
evaluated for diagnostic purposes or to screen candidate agents. In addition, the cancer nucleic 
acid sequences can be administered for gene therapy purposes, including the administration of 
antisense nucleic acids, or the cancer proteins (including antibodies and other modulators 
thereof) administered as therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that are 
differentially expressed in cancer relative to normal tissues and/or non-malignant disease, or in 
different types of related diseases, herein termed "cancer sequences." As outlined below, cancer 
sequences include those that are up-regulated (e.g., expressed at a higher level) in cancer, as 
well as those that are down-regulated (e.g., expressed at a lower level). In a preferred 
embodiment, the cancer sequences are from humans; however, cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other cancer 
sequences are provided, from vertebrates, including mammals, including rodents (rats, mice, 
hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, horses, 
etc.) and pets (e.g., dogs, cats, etc.). Cancer sequences from other organisms may be obtained 
using the techniques outlined below. 

Cancer sequences can include both nucleic acid and amino acid sequences. In a 
preferred embodiment, the skin cancer sequences are recombinant nucleic acids. These nucleic 
acid sequences are useful in a variety of applications, including diagnostic applications, which 
will detect naturally occurring nucleic acids, as well as screening applications; e.g., biochips 
comprising nucleic acid probes or PCR microtiter plates with selected probes to the cancer 
sequences. 

A cancer sequence can be initially identified by substantial nucleic acid and/or amino 
acid sequence homology to the cancer sequences outlined herein. Such homology can be based 
upon the overall nucleic acid or amino acid sequence, and is generally determined as outlined 
below, e.g., using homology programs or hybridization conditions. 

For identifying cancer-associated sequences, the cancer screen typically includes 
comparing genes identified in different tissues, e.g., normal and cancerous tissues, cancer and 
non-malignant conditions, non-malignant conditions and normal tissues, or tumor tissue 
samples from patients who have metastatic disease vs. non-metastatic tissue. Other suitable 
tissue comparisons include comparing cancer samples with metastatic cancer samples from 
other cancers, such as lung, stomach, gastrointestinal cancers, etc. Samples of different stages 
of cancer, e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, are 
applied to biochips comprising nucleic acid probes. The samples are first microdissected, if 
applicable, and treated for preparation of mRNA. Suitable biochips are commercially available, 
e.g., from Affymetrix, Santa Clara, CA. Gene expression profiles as described herein are 
generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, including, but not 
limited to lung, heart, brain, liver, stomach, kidney, muscle, colon, small intestine, large 
intestine, spleen, bone, and/or placenta. In a preferred embodiment, those genes identified 
during the cancer screen that are expressed in a significant amount in other tissues (e.g., 
essential organs) are removed from the profile, although in some embodiments, this is not 
necessary (e.g., where organs may be dispensable, e.g., female or male specific). That is, when 
screening for drugs, it is usually preferable that the target expression be disease specific, to 
minimize possible side effects on other organs in which the target might be expressed. 
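
By way of non-limiting illustration, the removal of candidate genes that are expressed in a significant amount in other normal tissues can be sketched as follows. The function name, the gene labels, the expression values, and the threshold are hypothetical and are chosen only for illustration; the specification does not prescribe a particular cutoff.

```python
from typing import Dict, List


def filter_disease_specific(
    candidates: List[str],
    normal_expression: Dict[str, Dict[str, float]],
    threshold: float,
) -> List[str]:
    """Keep candidates whose expression is below `threshold` in every listed normal tissue."""
    kept = []
    for gene in candidates:
        tissues = normal_expression.get(gene, {})
        # A gene is retained only if no normal tissue reaches the threshold.
        if all(level < threshold for level in tissues.values()):
            kept.append(gene)
    return kept


# Hypothetical values, not data from the specification.
normal_expression = {
    "GENE_A": {"heart": 5.0, "liver": 8.0, "brain": 3.0},
    "GENE_B": {"heart": 250.0, "liver": 12.0, "brain": 9.0},
}
specific = filter_disease_specific(
    ["GENE_A", "GENE_B"], normal_expression, threshold=100.0
)
```

In this sketch GENE_B is dropped because of its level in heart. Where an organ is considered dispensable, its entry could simply be omitted from the tissue dictionary before filtering.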

In a preferred embodiment, cancer sequences are those that are up-regulated in cancer; 
that is, the expression of these genes is higher in the cancer tissue as compared to non-cancer or 
non-malignant tissue. "Up-regulation" as used herein often means at least about a two-fold 
change, preferably at least about a three-fold change, with at least about five-fold or higher 
being preferred. Another embodiment is directed to sequences up-regulated in non-malignant 
conditions relative to normal. Uniformity among relevant samples is also preferred. 

Unigene cluster identification numbers and accession numbers herein are for the 
GenBank sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is available, see, e.g., Benson, et al. (1998) Nuc. Acids 
Res. 26:1-7; and http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, 
e.g., European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
In some situations, the sequences may be derived from assembly of available sequences or be 
predicted from genomic DNA using exon prediction algorithms, such as FGENESH. See 
Salamov and Solovyev (2000) Genome Res. 10:516-522. In other situations, sequences have 
been derived from cloning and sequencing of isolated nucleic acids. 

In another preferred embodiment, cancer sequences are those that are down-regulated in 
the cancer; that is, the expression of these genes is lower in cancer tissue as compared to non- 
cancerous tissue. "Down-regulation" as used herein often means at least about a two-fold 
change, preferably at least about a three-fold change, with at least about five-fold or higher 
being preferred. 
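
By way of non-limiting illustration, the fold-change criteria for up-regulation and down-regulation can be sketched as follows. The function name and the example expression levels are hypothetical; the default of two-fold reflects the "at least about a two-fold change" language above, and the more preferred three-fold or five-fold cutoffs can be supplied through the `min_fold` parameter.

```python
def classify_fold_change(
    cancer_level: float, normal_level: float, min_fold: float = 2.0
) -> str:
    """Label a gene by the ratio of its expression in cancer to that in normal tissue."""
    if cancer_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    ratio = cancer_level / normal_level
    if ratio >= min_fold:
        return "up-regulated"
    if ratio <= 1.0 / min_fold:
        return "down-regulated"
    return "unchanged"


# Hypothetical expression levels in arbitrary units.
label_up = classify_fold_change(600.0, 100.0)
label_down = classify_fold_change(100.0, 600.0)
label_strict = classify_fold_change(400.0, 100.0, min_fold=5.0)
```

The sketch treats a single pair of measurements; in practice, uniformity among relevant samples would also be considered before a gene is designated a cancer sequence.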
Informatics 

The ability to identify genes that are over- or under-expressed in cancer can additionally 
provide high-resolution, high-sensitivity datasets which can be used in the areas of diagnostics, 
therapeutics, drug development, pharmacogenetics, protein structure, biosensor development, 
and other related areas. For example, the expression profiles can be used in diagnostic or 
prognostic evaluation of patients with cancer or related diseases. See Tables 1-3. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson (June 11-12, 1998) Pharmaceutical Proteomics: Targets, 
Mechanism, and Function, paper presented at the IBC Proteomics conference, Coronado, CA). 
Subcellular toxicological information can also be utilized in a biological sensor device to 
predict the likely toxicological effect of chemical exposures and likely tolerable exposure 
thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue from datasets relevant to 
other biomolecules and bioactive agents (e.g., nucleic acids, saccharides, lipids, drugs, and the 
like). 

Thus, in another embodiment, the present invention provides a database that includes at 
least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can be 
maintained and transmitted, but is preferably an electronic database. The electronic database of 
the invention can be maintained on any electronic device allowing for the storage of and access 
to the database, such as a personal computer, but is preferably distributed on a wide area 
network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. Similar databases can be assembled for assay data acquired using an 
assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a biological 
sample representing cancer, e.g., the identification of cancer-associated sequences described 
herein, provide an abundance of information which can be correlated with pathological 
conditions, predisposition to disease, drug testing, therapeutic monitoring, gene-disease causal 
linkages, identification of correlates of immunity and physiological status, among others. 
Although the data generated from the assays of the invention is suited for manual review and 
analysis, in a preferred embodiment, data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is available. 
For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database system for 
storing biomolecular sequence information in a manner that allows sequences to be catalogued 
and searched according to one or more protein function hierarchies. U.S. Patent 5,953,727 
discloses a relational database having sequence records containing information in a format that 
allows a collection of partial-length DNA sequences to be catalogued and searched according to 
association with one or more sequencing projects for obtaining full-length sequences from the 
collection of partial length sequences. U.S. Patent 5,706,498 discloses a gene database retrieval 
system for making a retrieval of a gene sequence similar to a sequence data item in a gene 
database based on the degree of similarity between a key sequence and a target sequence. U.S. 
Patent 5,538,897 discloses a method using mass spectroscopy fragmentation patterns of 
peptides to identify amino acid sequences in computer databases by comparison of predicted 
mass spectra with experimentally-derived mass spectra using a closeness-of-fit measure. U.S. 
Patent 5,926,818 discloses a multi-dimensional database comprising a functionality for multi- 
dimensional data analysis described as on-line analytical processing (OLAP), which entails the 
consolidation of projected and actual data according to more than one consolidation path or 
dimension. U.S. Patent 5,295,261 reports a hybrid database structure in which the fields of each 
database record are divided into two classes, navigational and informational data, with 
navigational fields stored in a hierarchical topological map which can be viewed as a tree 
structure or as the merger of two or more such tree structures. See also Baxevanis, et al. (2001) 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins Wiley; Mount (2001) 
Bioinformatics: Sequence and Genome Analysis CSH Press, NY; Durbin, et al. (eds. 1999) 
Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids Cambridge 
University Press; Baxevanis and Ouellette (eds. 1998) Bioinformatics: A Practical Guide to the 
Analysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and Buehler (1999) 
Bioinformatics: Basic Applications in Biological Science and Medicine CRC Press; Setubal, et 
al. (eds. 1997) Introduction to Computational Molecular Biology Brooks/Cole; Misener and 
Krawetz (eds. 2000) Bioinformatics: Methods and Protocols Humana Press; Higgins and Taylor 
(eds. 2000) Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach Oxford 
University Press; Brown (2001) Bioinformatics: A Biologist's Guide to Biocomputing and the 
Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts and Techniques Morgan 
Kaufmann Pub.; and Waterman (1995) Introduction to Computational Biology: Maps, 
Sequences, and Genomes Chapman and Hall. 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, e.g., with 
data specifying the source of the target-containing sample from which each sequence specificity 
record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample is 
from a control tissue sample known to be free of pathological disorders. In a variation, at least 
one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or another 
tissue specimen to be analyzed for cancer. In another variation, the assay records cross-tabulate 
one or more of the following parameters for each target species in a sample: (1) a unique 
identification code, which can include, e.g., a target molecular structure and/or characteristic 
separation coordinate (e.g., electrophoretic coordinates); (2) sample source; and (3) absolute 
and/or relative quantity of the target species present in the sample. 
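
By way of non-limiting illustration, assay records carrying the three parameters enumerated above can be cross-tabulated by target and sample source as sketched below. The class name, field names, identifiers, and quantities are hypothetical and serve only to show one possible in-memory layout.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class AssayRecord:
    target_id: str      # (1) unique identification code
    sample_source: str  # (2) sample source
    quantity: float     # (3) absolute or relative quantity in the sample


def cross_tabulate(records: List[AssayRecord]) -> Dict[str, Dict[str, float]]:
    """Build a target-by-source table of quantities from flat assay records."""
    table: Dict[str, Dict[str, float]] = defaultdict(dict)
    for rec in records:
        table[rec.target_id][rec.sample_source] = rec.quantity
    return dict(table)


# Hypothetical records, not data from the specification.
records = [
    AssayRecord("T001", "control_tissue", 1.0),
    AssayRecord("T001", "neoplastic_lesion", 4.2),
    AssayRecord("T002", "control_tissue", 2.5),
]
table = cross_tabulate(records)
```

A lookup such as `table["T001"]` then returns the quantities of one target across the control and pathological sources.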

The invention also provides for the storage and retrieval of a collection of target data in 
a computer data storage apparatus, which can include magnetic disks, optical disks, magneto- 
optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic bubble 
memory devices, and other data storage devices, including CPU registers and on-CPU data 
storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor and 
a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 
encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a method 
for identifying related peptide or nucleic acid sequences, comprising performing a computerized 
comparison between a peptide or nucleic acid sequence assay record stored in or retrieved from 
a computer storage device or database and at least one other sequence. The comparison can 
include a sequence analysis or comparison algorithm or computer program embodiment thereof 
(e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may be of the relative amount 
of a peptide or nucleic acid sequence in a pool of sequences determined from a polypeptide or 
nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention in 
a file format suitable for retrieval and processing in a computerized sequence analysis, 
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 
linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line, ISDN line, 
wireless network, optical fiber, or other suitable signal transmission medium, whereby at least 
one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic domains 
(e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) composing a bit 
pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a database 
comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result obtained 
by the method of the invention, and ranking database targets based on the degree of identity and 
gap weight to the target data. A central processor is preferably initialized to load and execute 
the computer program for alignment and/or comparison of the assay results. Data for a query 
target is entered into the central processor via an I/O device. Execution of the computer 
program results in the central processor retrieving the assay data from the data file, which 
comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
the query target and results are output via an I/O device. For example, a central processor can 
be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, MIPS 
4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain molecular 
biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a data file can 
be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, SRAM, SGRAM, 
SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can be a terminal 
comprising a video display and a keyboard, a modem, an ISDN terminal adapter, an Ethernet 
port, a punched card reader, a magnetic strip reader, or other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a collection 
of peptide sequence specificity records obtained by the methods of the invention, which may be 
stored in the computer; (3) a comparison target, such as a query target; and (4) a program for 
alignment and comparison, typically with rank-ordering of comparison results on the basis of 
computed similarity values. See, e.g., Ewens and Grant (2001) Statistical Methods in 
Bioinformatics: An Introduction Springer-Verlag. Mathematical approaches can also be used to 
conclude whether similarities or differences in the gene expression exhibited by different 
samples are significant. See, e.g., Golub, et al. (1999) Science 286:531-537; Duda, et al. (2001) 
Pattern Classification Wiley; and Hastie, et al. (2001) The Elements of Statistical Learning: 
Data Mining, Inference, and Prediction Springer-Verlag. One approach to determining whether a 
sample is more similar to, or has maximum similarity with, a given condition is to compute the 
vector angle between the sample and each of one or more pools representing different conditions 
for comparison; the pool with the smallest vector angle is then chosen as the most similar to the 
biological sample among the pools compared. 
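
By way of non-limiting illustration, selection of the pool with the smallest vector angle can be sketched as follows. Each expression profile is treated as a vector of expression levels over the same ordered set of genes; the function names, pool labels, and values are hypothetical.

```python
import math
from typing import Dict, Sequence


def vector_angle(a: Sequence[float], b: Sequence[float]) -> float:
    """Angle in radians between two expression vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same genes")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("profiles must be non-zero")
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cosine = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
    return math.acos(cosine)


def most_similar_pool(
    sample: Sequence[float], pools: Dict[str, Sequence[float]]
) -> str:
    """Return the name of the pool whose profile makes the smallest angle with the sample."""
    return min(pools, key=lambda name: vector_angle(sample, pools[name]))


# Hypothetical three-gene profiles, not data from the specification.
pools = {"normal": [1.0, 1.0, 1.0], "cancer": [5.0, 1.0, 0.2]}
best = most_similar_pool([4.0, 1.2, 0.3], pools)
```

Because the angle depends only on the direction of each vector, two profiles that differ by an overall scale factor are treated as identical in this sketch.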
Characteristics of cancer-associated proteins 

Cancer proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins, or intracellular proteins. In one embodiment, the cancer protein is an 
intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the nucleus. 
Intracellular proteins are involved in all aspects of cellular function and replication (including, 
e.g., signaling pathways); aberrant expression of such proteins often results in unregulated or 
disregulated cellular processes (see, e.g., Alberts, et al. (eds. 1994) Molecular Biology of the 
Cell (3d ed.) Garland). For example, many intracellular proteins have enzymatic activity such 
as protein kinase activity, protein phosphatase activity, protease activity, nucleotide cyclase 
activity, polymerase activity, and the like. Intracellular proteins also serve as docking proteins 
that are involved in organizing complexes of proteins, or targeting proteins to various 
subcellular localizations, and are involved in maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 
proteins of one or more structural motifs to which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated targets 
in a sequence dependent manner. PTB domains, which are distinct from SH2 domains, also 
bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich targets. In addition, PH 
domains, tetratricopeptide repeats, and WD domains, to name only a few, have been shown to 
mediate protein-protein interactions. Some of these may also be involved in binding to 
phospholipids or other second messengers. These motifs can be identified on the basis of amino 
acid sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple sequence 
alignments and hidden Markov models covering many common protein domains. Versions are 
available via the internet from Washington University in St. Louis, the Sanger Center in 
England, and the Karolinska Institute in Sweden. See, e.g., Bateman, et al. (2000) Nuc. Acids 
Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420 ; Bateman, et al. (1999) Nuc. 
Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320-322. 

In another embodiment, the cancer sequences are transmembrane proteins. 
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular domain 
of transmembrane proteins serves both roles. For example, certain receptor tyrosine kinases 
have both protein kinase activity and SH2 domains. In addition, autophosphorylation of 
tyrosines on the receptor molecule itself creates binding sites for additional SH2 
domain-containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For 
example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases and 
receptor serine/threonine protein kinases contain a single transmembrane domain. However, 
various other proteins including channels and adenylyl cyclases contain numerous 
transmembrane domains. Many important cell surface receptors such as G protein coupled 
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain 7 
membrane spanning regions. Characteristics of transmembrane domains include approximately 
17 consecutive hydrophobic amino acids that may be followed by charged amino acids. 
Therefore, upon analysis of the amino acid sequence of a particular protein, the localization and 
number of transmembrane domains within the protein may be predicted (see, e.g., PSORT web 
site http://psort.nibb.ac.jp/). Important transmembrane protein receptors include, but are not 
limited to, the insulin receptor, insulin-like growth factor receptor, human growth hormone 
receptor, glucose transporters, transferrin receptor, epidermal growth factor receptor, low 
density lipoprotein receptor, leptin receptor, and interleukin 
receptors, e.g., IL-1 receptor, IL-2 receptor, etc. 
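
By way of non-limiting illustration, a scan for stretches of approximately 17 consecutive hydrophobic amino acids can be sketched as follows. This is a deliberately simple sketch and is not the method used by PSORT; practical predictors typically rely on hydropathy scales and sliding-window averages. The residue set, function name, and example sequence are assumptions made for illustration only.

```python
from typing import List, Tuple

# Assumption: one simple choice of residues treated as hydrophobic.
HYDROPHOBIC = set("AVLIMFWC")


def hydrophobic_runs(sequence: str, min_len: int = 17) -> List[Tuple[int, int]]:
    """Return 0-based half-open (start, end) spans of hydrophobic runs of at least min_len."""
    runs = []
    start = None
    for i, residue in enumerate(sequence.upper()):
        if residue in HYDROPHOBIC:
            if start is None:
                start = i  # a new run begins here
        else:
            if start is not None and i - start >= min_len:
                runs.append((start, i))
            start = None
    # Handle a run that extends to the end of the sequence.
    if start is not None and len(sequence) - start >= min_len:
        runs.append((start, len(sequence)))
    return runs


# Hypothetical sequence: a 17-residue hydrophobic stretch flanked by charged residues.
seq = "MKRS" + "LLVAIFLLAVALIVAAL" + "KRRDE"
candidate_domains = hydrophobic_runs(seq)
```

The number of spans returned gives a rough estimate of the number of candidate transmembrane domains, and their coordinates give the predicted localization within the protein.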

The extracellular domains of transmembrane proteins are diverse; however, conserved 
motifs are found repeatedly among various extracellular domains. Conserved structure and/or 
functions have been ascribed to different extracellular motifs. Many extracellular domains are 
involved in binding to other molecules. In one aspect, extracellular domains are found on 
receptors. Factors that bind the receptor domain include circulating ligands, which may be 
peptides, proteins, or small molecules such as adenosine and the like. For example, growth 
factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their cognate 
receptors to initiate a variety of cellular responses. Other factors include cytokines, mitogenic 
factors, neurotrophic factors, and the like. Extracellular domains also bind to cell-associated 
molecules. In this respect, they may mediate cell-cell interactions. Cell-associated ligands can 
be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) anchor, or may themselves 
be transmembrane proteins. Extracellular domains may also associate with the extracellular 
matrix and contribute to the maintenance of the cell structure. 

Cancer proteins that are transmembrane are particularly preferred in the present 
invention as they are readily accessible targets for immunotherapeutics, as are described herein. 
In addition, as outlined below, transmembrane proteins can also be useful in imaging 
modalities. Antibodies may be used to label such readily accessible proteins in situ. 
Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeabilized to provide access to intracellular proteins. In addition, some membrane 
proteins can be processed to release a soluble protein, or to expose a residual fragment. 
Released soluble proteins may be useful diagnostic markers; processed residual protein 
fragments may be useful lung markers of disease. 

It will also be appreciated that a transmembrane protein can be made soluble by 
removing transmembrane sequences, e.g., through recombinant methods. Furthermore, 
transmembrane proteins that have been made soluble can be made to be secreted through 
recombinant means by adding an appropriate signal sequence. 

In another embodiment, the cancer proteins are secreted proteins, the secretion of which 
can be either constitutive or regulated. These proteins may have a signal peptide or signal 
sequence that targets the molecule to the secretory pathway. Secreted proteins are involved in 
numerous physiological events; e.g., if circulating, they often serve to transmit signals to 
various other cell types. The secreted protein may function in an autocrine manner (acting on 
the cell that secreted the factor), a paracrine manner (acting on cells in close proximity to the 
cell that secreted the factor), an endocrine manner (acting on cells at a distance, e.g., secretion 
into the blood stream), or an exocrine manner (secretion, e.g., through a duct or to an adjacent 
epithelial surface, as by sweat glands, sebaceous glands, pancreatic ducts, lacrimal glands, mammary glands, 
wax producing glands of the ear, etc.). Thus secreted molecules often find use in modulating or 
altering numerous aspects of physiology. Cancer proteins that are secreted proteins are 
particularly preferred in the present invention as they serve as good targets for diagnostic 
markers, e.g., for blood, plasma, serum, or stool tests. Those which are enzymes may be 
antibody or small molecule targets. Others may be useful as vaccine targets, e.g., via CTL 
mechanisms. 
Use of cancer nucleic acids 

As described above, a cancer sequence is initially identified by substantial nucleic acid 
and/or amino acid sequence homology or linkage to the cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is generally 
determined as outlined below, using either homology programs or hybridization conditions. 
Typically, linked sequences on an mRNA are found on the same molecule. 

As detailed elsewhere, percent identity can be determined using an algorithm such as 
BLAST. A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default 
parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively. Alignment 
may include the introduction of gaps in the sequences to be aligned. In addition, for sequences 
which contain either more or fewer nucleotides than those of the nucleic acids described, the 
percentage of homology may be determined based on the number of homologous nucleosides in 
relation to the total number of nucleosides. Thus, e.g., homology of sequences shorter than 
those of the sequences identified will be determined using the number of nucleosides in the 
shorter sequence. 
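
As a rough illustration of the length normalization described above, the following sketch counts matching positions in a pre-computed gapped alignment and divides by the ungapped length of the shorter sequence. It is not the WU-BLAST-2 algorithm; the function name `percent_identity` and the use of `-` as the gap character are assumptions made for illustration only.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences, relative to the shorter one.

    Assumes the inputs are already aligned (equal length, gaps marked with
    `gap`); the alignment step itself is not performed here.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Count alignment columns where both sequences carry the same non-gap residue.
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != gap
    )
    # Normalize by the ungapped length of the shorter sequence.
    shorter = min(
        len(aligned_a.replace(gap, "")),
        len(aligned_b.replace(gap, "")),
    )
    if shorter == 0:
        raise ValueError("sequences must contain at least one residue")
    return 100.0 * matches / shorter
```

With this normalization, a four-base fragment that matches an eight-base sequence at every aligned position scores 100%, because only the shorter sequence sets the denominator.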

In one embodiment, the nucleic acid homology is determined through hybridization 
studies. Thus, e.g., nucleic acids which hybridize under high stringency to a described nucleic
acid, or its complement, or are also found on naturally occurring mRNAs are considered cancer
sequences. In another embodiment, less stringent hybridization conditions are used; e.g.,
moderate or low stringency conditions may be used; see Ausubel, supra, and Tijssen, supra.

The cancer nucleic acid sequences of the invention, e.g., the sequences in Tables 1-80, 
can be fragments of larger genes, e.g., they are nucleic acid segments. "Genes" in this context 
includes coding regions, non-coding regions, and mixtures of coding and non-coding regions. 
Accordingly, using the sequences provided herein, extended sequences, in either direction, of 
the cancer genes can be obtained, using techniques well known for cloning either longer 
sequences or the full length sequences; see Ausubel, et al., supra. Much can be done by 
informatics and many sequences can be clustered to include multiple sequences corresponding 
to a single gene, e.g., systems such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/).

Once a cancer nucleic acid is identified, it can be cloned and, if necessary, its constituent 
parts recombined to form the entire cancer nucleic acid coding regions or the entire mRNA 
sequence. Once isolated from its natural source, e.g., contained within a plasmid or other vector 
or excised therefrom as a linear nucleic acid segment, the recombinant cancer nucleic acid can 
be further used as a probe to identify and isolate other cancer nucleic acids, e.g., extended
coding regions. It can also be used as a "precursor" nucleic acid to make modified or variant 
cancer nucleic acids and proteins. 

The cancer nucleic acids of the present invention are used in several ways. In one 
embodiment, nucleic acid probes to the cancer nucleic acids are made and attached to biochips 
to be used in screening and diagnostic methods, as outlined below, or for administration, e.g., 
for gene therapy, vaccine, RNAi, and/or antisense applications. Alternatively, cancer nucleic 
acids that include coding regions of cancer proteins can be put into expression vectors for the 
expression of cancer proteins, again for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to cancer nucleic acids (both the nucleic 
acid sequences outlined in the figures and/or the complements thereof) are made. The nucleic 
acid probes attached to the biochip are designed to be substantially complementary to the cancer 
nucleic acids, e.g., the target sequence (either the target sequence of the sample or to other 
probe sequences, e.g., in sandwich assays), such that hybridization of the target sequence and 
the probes of the present invention occurs. As outlined below, this complementarity need not 
be perfect; there may be any number of base pair mismatches which will interfere with 
hybridization between the target sequence and the single stranded nucleic acids of the present 
invention. However, if the number of mutations is so great that no hybridization can occur 
under even the least stringent of hybridization conditions, the sequence is not a complementary 
target sequence. Thus, by "substantially complementary" herein is meant that the probes are 
sufficiently complementary to the target sequences to hybridize under normal reaction 
conditions, particularly high stringency conditions, as outlined herein. 
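
The notion of complementarity with a tolerated number of mismatches can be sketched as follows. The helper names `reverse_complement` and `count_mismatches` are hypothetical, and the sketch only counts non-pairing positions between a probe and an equal-length target site; it does not model stringency or predict whether hybridization actually occurs.

```python
# Translation table for DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_mismatches(probe: str, target_site: str) -> int:
    """Count positions at which `probe` fails to base-pair with `target_site`.

    Both sequences are written 5'->3' and must have the same length.
    A perfectly complementary probe returns 0.
    """
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must have the same length")
    # The perfectly pairing probe is the reverse complement of the target site.
    perfect_probe = reverse_complement(target_site).upper()
    return sum(1 for p, e in zip(probe.upper(), perfect_probe) if p != e)
```

A probe with a small mismatch count may still qualify as substantially complementary; the acceptable threshold depends on the hybridization conditions and is deliberately left out of the sketch.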

A nucleic acid probe is generally single stranded but can be partially single and partially 
double stranded. The strandedness of the probe is dictated by the structure, composition, and 
properties of the target sequence. In general, the nucleic acid probes range from about 8-100 
bases long, with from about 10-80 bases being preferred, and from about 30-50 bases being 
particularly preferred. That is, generally whole genes are not used. In some embodiments, 
much longer nucleic acids can be used, up to hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, three, 
four or more probes, with three being preferred, are used to build in a redundancy for a 
particular target. The probes can be overlapping (e.g., have some sequence in common), or
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 
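
One way to picture the redundancy described above is to spread several probe-length windows evenly along a target. The sketch below is illustrative only: `tile_probes` is a hypothetical helper, the defaults (40 bases, three probes) are simply taken from the preferred ranges stated above, and the returned windows are target subsequences, so an actual probe would be the complement of each window.

```python
def tile_probes(target: str, probe_len: int = 40, n_probes: int = 3) -> list[tuple[int, str]]:
    """Return `n_probes` (start, window) pairs spread evenly along `target`.

    Windows overlap whenever the target is too short to hold
    `n_probes` separate windows of length `probe_len`.
    """
    if probe_len <= 0 or n_probes <= 0:
        raise ValueError("probe_len and n_probes must be positive")
    if len(target) < probe_len:
        raise ValueError("target is shorter than the probe length")
    last_start = len(target) - probe_len
    if n_probes == 1:
        starts = [0]
    else:
        # Evenly spaced start positions from 0 to the last valid start.
        starts = [round(i * last_start / (n_probes - 1)) for i in range(n_probes)]
    return [(s, target[s:s + probe_len]) for s in starts]
```

For a 100-base target with the defaults, the windows start at positions 0, 30, and 60, so neighboring windows share 10 bases.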

Nucleic acids can be attached or immobilized to a solid support in a wide variety of 
ways. By "immobilized" and grammatical equivalents herein is meant the association or 
binding between the nucleic acid probe and the solid support is sufficient to be stable under the 
conditions of binding, washing, analysis, and removal as outlined. The binding can typically be 
covalent or non-covalent. By "non-covalent binding" and grammatical equivalents herein is 
meant one or more of electrostatic, hydrophilic, and hydrophobic interactions. Included in non- 
covalent binding is the covalent attachment of a molecule, e.g., streptavidin to the support and 
the non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" 
and grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds, and coordination 
bonds. Covalent bonds can be formed directly between the probe and the solid support or can 
be formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways. As 
described herein, the nucleic acids can either be synthesized first, with subsequent attachment to 
the biochip, or can be directly synthesized on the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified for the 
attachment or association of the nucleic acid probes and is amenable to at least one detection 
method. Often, the substrate may contain discrete individual sites appropriate for individual 
partitioning and identification. The number of possible substrates is very large, and includes, but
is not limited to, glass and modified or functionalized glass, plastics (including acrylics,
polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene,
polybutylene, polyurethanes, Teflon™, etc.), polysaccharides, nylon or nitrocellulose, resins,
silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic 
glasses, plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. See WO 0055627. 

Generally the substrate is planar, although other configurations of substrates may be 
used as well. For example, the probes may be placed on the inside surface of a tube for
flow-through sample analysis to minimize sample volume. Similarly, the substrate may be flexible,
such as a flexible foam, including closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be derivatized 
with chemical functional groups for subsequent attachment of the two. Thus, e.g., the biochip is 
derivatized with a chemical functional group including, but not limited to, amino groups, 
carboxy groups, oxo groups, and thiol groups, with amino groups being particularly preferred. 
Using these functional groups, the probes can be attached using functional groups on the probes. 
For example, nucleic acids containing amino groups can be attached to surfaces comprising 
amino groups, e.g., using linkers; e.g., homo- or hetero-bifunctional linkers as are well known
(see 1994 Pierce Chemical Company catalog, technical section on cross-linkers, pages 155- 
200). In addition, in some cases, additional linkers, such as alkyl groups (including substituted 
and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides are synthesized, and then attached to the surface of 
the solid support. Either the 5' or 3' terminus may be attached to the solid support, or 
attachment may be via linkage to an internal nucleoside. In another embodiment, the
immobilization to the solid support may be very strong, yet non-covalent. For example, 
biotinylated oligonucleotides can be made, which bind to surfaces covalently coated with 
streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface. For example, 
photoactivation techniques utilizing photopolymerization compounds and techniques are used. 
In a preferred embodiment, the nucleic acids can be synthesized in situ, using known 
photolithographic techniques, such as those described in WO 95/25116; WO 95/35505; U.S. 
Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of which are expressly 
incorporated by reference; these methods of attachment form the basis of the Affymetrix 
GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 
cancer-associated sequences. These assays are typically performed in conjunction with reverse 
transcription. In such assays, a cancer-associated nucleic acid sequence acts as a template in an 
amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a quantitative 
amplification, the amount of amplification product will be proportional to the amount of 
template in the original sample. Comparison to appropriate controls provides a measure of the 
amount of cancer-associated RNA. Methods of quantitative amplification are well known. 
Detailed protocols for quantitative PCR are provided, e.g., in Innis, et al. (1990) PCR Protocols:
A Guide to Methods and Applications Academic Press. 
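
The comparison to controls can be illustrated with the widely used comparative threshold-cycle calculation, which is not described in this document and is substituted here purely as an example. The sketch assumes a real-time readout that yields a threshold cycle (Ct) for the cancer-associated target and for a reference transcript, in both a test sample and a control sample, and assumes the product roughly doubles per cycle; the function and parameter names are hypothetical.

```python
def relative_expression(
    ct_target_sample: float,
    ct_ref_sample: float,
    ct_target_control: float,
    ct_ref_control: float,
    efficiency: float = 2.0,
) -> float:
    """Fold change of a target in a sample relative to a control sample.

    Each target Ct is first normalized to a reference transcript measured
    in the same sample; `efficiency` is the assumed fold amplification
    per cycle (2.0 means perfect doubling).
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    # A lower normalized Ct means more starting template, hence the negative exponent.
    return efficiency ** -(delta_sample - delta_control)
```

Under these assumptions, a target that crosses threshold three cycles earlier in the test sample than in the control, after normalization, corresponds to roughly an eight-fold higher starting amount.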

In some embodiments, a TaqMan based assay is used to measure expression. TaqMan 
based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent dye and a 3' 
quenching agent. The probe hybridizes to a PCR product, but cannot itself be extended due to a 
blocking agent at the 3' end. When the PCR product is amplified in subsequent cycles, the 5' 
nuclease activity of the polymerase, e.g., AmpliTaq, results in the cleavage of the TaqMan 
probe. This cleavage separates the 5' fluorescent dye and the 3' quenching agent, thereby 
resulting in an increase in fluorescence as a function of amplification (see, e.g., literature 
provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560-569, Landegren, et al. (1988) 
Science 241:1077-1080, and Barringer, et al. (1990) Gene 89:117-122), transcription
amplification (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173-1177), self-sustained
sequence replication (Guatelli, et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), dot 
PCR, linker adapter PCR, etc. 
Expression of cancer proteins from nucleic acids 

In a preferred embodiment, cancer nucleic acids, e.g., encoding cancer proteins, are used 
to make a variety of expression vectors to express cancer proteins which can then be used in 
screening assays, as described below. Expression vectors and recombinant DNA technology for
expressing proteins are well known (see, e.g., Ausubel, supra, and Fernandez and Hoeffler (eds. 1999)
Gene Expression Systems Academic Press). The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally,
these expression vectors include transcriptional and translational regulatory nucleic acid 
operably linked to the nucleic acid encoding the cancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence
if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a 
coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" 
means that the DNA sequences being linked are contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. 
Linking is typically accomplished by ligation at convenient restriction sites. If such sites do not 
exist, synthetic oligonucleotide adaptors or linkers are used in accordance with conventional 
practice. Transcriptional and translational regulatory nucleic acid will generally be appropriate 
to the host cell used to express the cancer protein. Numerous types of appropriate expression 
vectors and suitable regulatory sequences are known for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start and 
stop sequences. 

Promoter sequences may be either constitutive or inducible promoters. The promoters 
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
combine elements of more than one promoter, are also known, and are useful in the present 
invention. 

An expression vector may comprise additional elements. For example, the expression 
vector may have two replication systems, thus allowing it to be maintained in two organisms, 
e.g., in mammalian or insect cells for expression and in a prokaryotic host for cloning and
amplification. Furthermore, for integrating expression vectors, the expression vector often 
contains at least one sequence homologous to the host cell genome, and preferably two 
homologous sequences which flank the expression construct. The integrating vector may be 
directed to a specific locus in the host cell by selecting the appropriate homologous sequence 
for inclusion in the vector. Constructs for integrating vectors are available. See, e.g., 
Fernandez and Hoeffler, supra; and Kitamura, et al. (1995) Proc. Nat'l Acad. Sci. USA 92:9146- 
9150. 

In addition, in a preferred embodiment, the expression vector contains a selectable 
marker gene to allow the selection of transformed host cells. Selection genes are well known 
and will vary with the host cell used. 

The cancer proteins of the present invention are usually produced by culturing a host cell
transformed with an expression vector containing nucleic acid encoding a cancer protein, under 
the appropriate conditions to induce or cause expression of the cancer protein. Conditions 
appropriate for cancer protein expression will vary with the choice of the expression vector and 
the host cell, and will be easily ascertained through routine experimentation or optimization.
For example, the use of constitutive promoters in the expression vector will require optimizing 
the growth and proliferation of the host cell, while the use of an inducible promoter requires the 
appropriate growth conditions for induction. In addition, in some embodiments, the timing of 
the harvest is important. For example, the baculoviral systems used in insect cell expression are
lytic viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a
macrophage cell line), and various other human cells and cell lines.

In a preferred embodiment, the cancer proteins are expressed in mammalian cells. 
Mammalian expression systems may be used, and include retroviral and adenoviral systems. 
One expression vector system is a retroviral vector system such as is generally described in 
PCT/US97/01019 and PCT/US97/01048. Of particular use as mammalian promoters are the
promoters from mammalian viral genes, since the viral genes are often highly expressed and
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 
CMV promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 3'
to the translation stop codon and thus, together with the promoter elements, flank the coding
sequence. Examples of transcription terminator and polyadenylation signals include those
derived from SV40.

Methods of introducing exogenous nucleic acid into mammalian hosts, as well as other 
hosts, are available, and will vary with the host cell used. Techniques include dextran-mediated
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast
fusion, electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, and
direct microinjection of the DNA into nuclei. 

In a preferred embodiment, cancer proteins are expressed in bacterial systems.
Promoters from bacteriophage may also be used. In addition, synthetic promoters and hybrid 
promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac promoter
sequences. Furthermore, a bacterial promoter can include naturally occurring promoters of non- 
bacterial origin that have the ability to bind bacterial RNA polymerase and initiate transcription. 
In addition to a functioning promoter sequence, an efficient ribosome binding site is desirable. 
The expression vector may also include a signal peptide sequence that provides for secretion of 
the cancer protein in bacteria. The protein is either secreted into the growth media (gram- 
positive bacteria) or into the periplasmic space, located between the inner and outer membrane 
of the cell (gram-negative bacteria). The bacterial expression vector may also include a 
selectable marker gene to allow for the selection of bacterial strains that have been transformed. 
Suitable selection genes include genes which render the bacteria resistant to drugs such as 
ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin, and tetracycline. Selectable 
markers also include biosynthetic genes, such as those in the histidine, tryptophan, and leucine 
biosynthetic pathways. These components are assembled into expression vectors. Expression 
vectors for bacteria are well known, and include vectors for Bacillus subtilis, E. coli, 
Streptococcus cremoris, and Streptomyces lividans, among others (e.g., Fernandez and
Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells using 
techniques such as calcium chloride treatment, electroporation, and others. 

In one embodiment, cancer proteins are produced in insect cells using, e.g., expression 
vectors for the transformation of insect cells, and in particular, baculovirus-based expression 
vectors. 

In a preferred embodiment, a cancer protein is produced in yeast cells. Yeast expression 
systems are well known, and include expression vectors for Saccharomyces cerevisiae, Candida 
albicans and C. maltosa, Hansenula polymorpha, Kluyveromyces fragilis and K. lactis, Pichia 
guilliermondii and P. pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica.

The cancer protein may also be made as a fusion protein, using available techniques. 
Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope is small, the cancer 
protein may be fused to a carrier protein to form an immunogen. Alternatively, the cancer 
protein may be made as a fusion protein to increase expression, or for other reasons. For 
example, when the cancer protein is a cancer peptide, the nucleic acid encoding the peptide may 
be linked to other nucleic acid for expression purposes. Fusion with detection epitope tags can
be made, e.g., with FLAG, His6, myc, HA, etc. 

In a preferred embodiment, the cancer protein is purified or isolated after expression. 
Cancer proteins may be isolated or purified in a variety of ways depending on what other 
components are present in the sample and the requirements for purified product, e.g., natural 
conformation or denatured. Standard purification methods include ammonium sulfate 
precipitations, electrophoretic, molecular, immunological, and chromatographic techniques, 
including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and 
chromatofocusing. For example, the cancer protein may be purified using a standard anti- 
cancer protein antibody column. Ultrafiltration and diafiltration techniques, in conjunction with 
protein concentration, are also useful. See, e.g., Walsh (2002) Proteins: Biochemistry and 
Biotechnology Wiley; Hardin, et al. (eds. 2001) Cloning, Gene Expression and Protein 
Purification Oxford Univ. Press; Wilson, et al. (eds. 2000) Encyclopedia of Separation Science 
Academic Press; and Scopes (1993) Protein Purification Springer-Verlag. The degree of
purification necessary will vary depending on the use of the cancer protein. In some instances 
no purification will be necessary. 

Once expressed and purified if necessary, the cancer proteins and nucleic acids are 
useful in a number of applications. They may be used as immunoselection reagents, as vaccine 
reagents, as screening agents, therapeutic entities, for production of antibodies, as transcription 
or translation inhibitors, etc. 
Variants of cancer proteins 

Also included within one embodiment of cancer proteins are amino acid variants of the 
naturally occurring sequences, as determined herein. Preferably, the variants are
greater than about 75% homologous to the wild-type sequence, more preferably greater than 
about 80%, even more preferably greater than about 85%, and most preferably greater than 
90%. In some embodiments the homology will be as high as about 93-95% or 98%. As for 
nucleic acids, homology in this context means sequence similarity or identity, with identity 
being preferred. This homology will be determined using standard techniques, as are outlined 
above for nucleic acid homologies. 

Cancer proteins of the present invention may be shorter or longer than the wild type 
amino acid sequences. Thus, in a preferred embodiment, included within the definition of 
cancer proteins are portions or fragments of the wild type sequences herein. In addition, as 
outlined above, the cancer nucleic acids of the invention may be used to obtain additional
coding regions, and thus additional protein sequence. 

In one embodiment, the cancer proteins are derivative or variant cancer proteins as 
compared to the wild-type sequence. That is, as outlined more fully below, the derivative 
cancer peptide will often contain at least one amino acid substitution, deletion, or insertion, with 
amino acid substitutions being particularly preferred. The amino acid substitution, insertion, or 
deletion may occur at many residue positions within the cancer peptide. 

Also included within one embodiment of cancer proteins of the present invention are 
amino acid sequence variants. These variants typically fall into one or more of three classes: 
substitutional, insertional, or deletional variants. These variants ordinarily are prepared by site 
specific mutagenesis of nucleotides in the DNA encoding the cancer protein, using cassette or 
PCR mutagenesis or other techniques, to produce DNA encoding the variant, and thereafter 
expressing the DNA in recombinant cell culture as outlined above. However, variant cancer 
protein fragments having up to about 100-150 residues may be prepared by in vitro synthesis 
using established techniques. Amino acid sequence variants are characterized by the 
predetermined nature of the variation, a feature that sets them apart from naturally occurring 
allelic or interspecies variation of the cancer protein amino acid sequence. The variants 
typically exhibit a similar qualitative biological activity as a naturally occurring analogue, 
although variants can also be selected which have modified characteristics. 

While the site or region for introducing an amino acid sequence variation is often 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be conducted 
at the target codon or region and the expressed cancer variants screened for the optimal 
combination of desired activity. Techniques for making substitution mutations at predetermined 
sites in DNA having a known sequence are well known, e.g., M13 primer mutagenesis and PCR
mutagenesis. Screening of mutants is often done using assays of cancer protein activities. 
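
The idea of varying a predetermined site while leaving the specific mutation open can be sketched computationally by enumerating every single-residue substitution at one position of a protein sequence. This is a bookkeeping aid for planning a screen, not a mutagenesis protocol; the helper name `single_site_variants` and the `S3A`-style labels are assumptions for illustration.

```python
# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def single_site_variants(protein: str, position: int) -> dict[str, str]:
    """Return all 19 substitution variants at a 1-based `position`.

    Keys name the change as wild-type residue, position, new residue
    (e.g. "S3A"); values are the full variant sequences.
    """
    if not 1 <= position <= len(protein):
        raise ValueError("position is outside the protein sequence")
    wild = protein[position - 1]
    variants = {}
    for aa in AMINO_ACIDS:
        if aa == wild:
            continue  # skip the wild-type residue itself
        variants[f"{wild}{position}{aa}"] = (
            protein[: position - 1] + aa + protein[position:]
        )
    return variants
```

Each resulting sequence would then be expressed and screened for the desired activity, as described above.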

Amino acid substitutions are typically of single residues; insertions usually will be on 
the order of from about 1-20 amino acids, although considerably larger insertions may be 
tolerated. Deletions generally range from about 1-20 residues, although in some cases deletions 
may be much larger. 

Substitutions, deletions, insertions, or combinations thereof may be used to arrive at a
final derivative. Generally these changes are done on a few amino acids to minimize the 
alteration of the molecule. However, larger changes may be tolerated in certain circumstances.
When small alterations in the characteristics of the cancer protein are desired, substitutions are 
generally made in accordance with the amino acid substitution relationships described. 

The variants typically exhibit essentially the same qualitative biological activity and will 
elicit the same immune response as a naturally-occurring analog, although variants also are 
selected to modify the characteristics of cancer proteins as needed. Alternatively, the variant 
may be designed such that a biological activity of the cancer protein is altered. For example, 
glycosylation sites may be added, altered, or removed.

Substantial changes in function or immunological identity are sometimes made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; the 
charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
Substitutions which generally are expected to produce the greatest changes in the polypeptide's 
properties are those in which (a) a hydrophilic residue, e.g., serine or threonine, is substituted for
(or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, valine, or alanine; (b) a 
cysteine or proline is substituted for (or by) another residue; (c) a residue having an 
electropositive side chain, e.g., lysine, arginine, or histidine, is substituted for (or by) an 
electronegative residue, e.g., glutamic or aspartic acid; (d) a residue having a bulky side chain, 
e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine; or (e) a 
proline residue is incorporated or substituted, which changes the degree of rotational freedom of 
the peptidyl bond. 
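
Categories (a) through (e) above can be expressed as a simple lookup that flags which of them a proposed substitution falls under. The sketch is deliberately narrow: the residue sets contain only the examples named in the text (one-letter codes), the function name `drastic_substitution_rules` is hypothetical, and an empty result does not imply that a substitution is conservative.

```python
# Residue groups limited to the examples given in the text (one-letter codes).
HYDROPHILIC = set("ST")     # serine, threonine
HYDROPHOBIC = set("LIFVA")  # leucine, isoleucine, phenylalanine, valine, alanine
POSITIVE = set("KRH")       # lysine, arginine, histidine
NEGATIVE = set("ED")        # glutamic acid, aspartic acid
BULKY = set("F")            # phenylalanine
NO_SIDE_CHAIN = set("G")    # glycine


def _between(a: str, b: str, group1: set, group2: set) -> bool:
    """True if one residue is in group1 and the other in group2, in either order."""
    return (a in group1 and b in group2) or (a in group2 and b in group1)


def drastic_substitution_rules(wild: str, variant: str) -> list[str]:
    """Return the letters of categories (a)-(e) that a substitution matches."""
    wild, variant = wild.upper(), variant.upper()
    if wild == variant:
        return []
    rules = []
    if _between(wild, variant, HYDROPHILIC, HYDROPHOBIC):
        rules.append("a")
    if wild in {"C", "P"} or variant in {"C", "P"}:
        rules.append("b")
    if _between(wild, variant, POSITIVE, NEGATIVE):
        rules.append("c")
    if _between(wild, variant, BULKY, NO_SIDE_CHAIN):
        rules.append("d")
    if "P" in (wild, variant):
        rules.append("e")
    return rules
```

A lysine-to-glutamic-acid change, for instance, is flagged under (c), while any change that introduces or removes proline is flagged under both (b) and (e).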

Variants typically exhibit a similar qualitative biological activity and will elicit the same 
immune response as the naturally-occurring analog, although variants also are selected to 
modify the characteristics of the skin cancer proteins as needed. Alternatively, the variant may 
be designed such that the biological activity of the cancer protein is altered. For example, 
glycosylation sites may be altered or removed. 

Covalent modifications of cancer polypeptides are included within the scope of this 
invention. One type of covalent modification includes reacting targeted amino acid residues of 
a cancer polypeptide with an organic derivatizing agent that is capable of reacting with selected 
side chains or the N- or C-terminal residues of a cancer polypeptide. Derivatization with
bifunctional agents is useful, for instance, for crosslinking cancer polypeptides to a
water-insoluble support matrix or surface for use in a method for purifying anti-cancer polypeptide
antibodies or screening assays, as is more fully described below. Commonly used crosslinking
agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde,
N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic acid, homobifunctional
imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate),
bifunctional maleimides such as bis-N-maleimido-1,8-octane and agents such as
methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 
corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of serinyl, threonyl, or tyrosyl residues, methylation of the 
amino groups of the lysine, arginine, and histidine side chains (e.g., pp. 79-86, Creighton (1992) 
Proteins: Structure and Molecular Properties Freeman), acetylation of the N-terminal amine, 
and amidation of a C-terminal carboxyl group. 

Another type of covalent modification of the cancer polypeptide included within the 
scope of this invention comprises altering the native glycosylation pattern of the polypeptide. 
"Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one 
or more carbohydrate moieties found in native sequence cancer polypeptide, and/or adding one 
or more glycosylation sites that are not present in the native sequence cancer polypeptide. 
Glycosylation patterns can be altered in many ways. Using different cell types to express
cancer-associated sequences can result in different glycosylation patterns.

Addition of glycosylation sites to cancer polypeptides may also be accomplished by 
altering the amino acid sequence thereof. The alteration may be made, e.g., by the addition of, 
or substitution by, one or more serine or threonine residues to the native sequence cancer 
polypeptide (for O-linked glycosylation sites). The cancer amino acid sequence may optionally 
be altered through changes at the DNA level, particularly by mutating the DNA encoding the 
cancer polypeptide at preselected bases such that codons are generated that will translate into 
the desired amino acids. 
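
The DNA-level alteration described above can be sketched in a few lines of Python. This is only an illustration: the function name `substitute_codon`, the particular codons chosen (TCT for serine, ACT for threonine), and the four-codon example sequence are assumptions made for this sketch and are not taken from the specification.

```python
# Illustrative codons only; any synonymous codon for Ser or Thr would serve.
TARGET_CODONS = {"Ser": "TCT", "Thr": "ACT"}


def substitute_codon(coding_dna: str, codon_index: int, residue: str) -> str:
    """Return coding_dna with the codon at codon_index (0-based) replaced
    by a codon encoding the requested residue ("Ser" or "Thr")."""
    if len(coding_dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    if residue not in TARGET_CODONS:
        raise ValueError("residue must be 'Ser' or 'Thr'")
    start = codon_index * 3
    if codon_index < 0 or start >= len(coding_dna):
        raise IndexError("codon_index is outside the coding sequence")
    return coding_dna[:start] + TARGET_CODONS[residue] + coding_dna[start + 3:]


# Hypothetical sequence Met-Ala-Gly-Lys; the Gly codon is changed to Ser.
mutated = substitute_codon("ATGGCTGGTAAA", 2, "Ser")
```

Whether an introduced serine or threonine is actually glycosylated depends on the expression host and sequence context, which this sketch does not model.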

Another means of increasing the number of carbohydrate moieties on the cancer 
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. See, e.g., 
WO 87/05330; pp. 259-306 in Aplin and Wriston (1981) CRC Crit. Rev. Biochem. 

Removal of carbohydrate moieties present on the cancer polypeptide may be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding 
amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are applicable. See, e.g., Sojar and Bahl (1987) Arch. Biochem. Biophys. 259:52-57 
and Edge, et al. (1981) Anal. Biochem. 118:131-137. Enzymatic cleavage of carbohydrate 
moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases. 
See, e.g., Thotakura, et al. (1987) Meth. Enzymol. 138:350-359. 

Another type of covalent modification of the cancer polypeptide comprises linking the cancer 
polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337. 

Cancer polypeptides of the present invention may also be modified in a way to form 
chimeric molecules comprising a cancer polypeptide fused to another heterologous polypeptide 
or amino acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of a 
cancer polypeptide with a tag polypeptide which provides an epitope to which an anti-tag 
antibody can selectively bind. The epitope tag is generally placed at the amino- or 
carboxyl-terminus of the cancer polypeptide. The presence of such epitope-tagged forms of a cancer 
polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of 
the epitope tag enables the cancer polypeptide to be readily purified by affinity purification 
using an anti-tag antibody or another type of affinity matrix that binds to the epitope tag. In an 
alternative embodiment, the chimeric molecule may comprise a fusion of a cancer polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are available. Examples 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 and metal 
chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) Mol. 
Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and 9E10 antibodies 
thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); and the Herpes 
Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein 
Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al. 
(1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science 
255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163-15166); 
and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Natl. Acad. Sci. 
USA 87:6393-6397). 

Also included are other cancer proteins of the cancer family, and cancer proteins from 
other organisms, which are cloned and expressed as outlined below. Thus, probe or degenerate 
polymerase chain reaction (PCR) primer sequences may be used to find other related cancer 
proteins from humans or other organisms. Particularly useful probe and/or PCR primer 
sequences include the unique areas of the cancer nucleic acid sequence. Preferred PCR primers 
are from about 15-35 nucleotides in length, with from about 20-30 being preferred, and may 
contain inosine as needed. The conditions for PCR reaction have been well described (e.g., 
Innis, PCR Protocols, supra). 
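
As a rough illustration of the primer length ranges stated above, the following Python sketch classifies a candidate primer as falling in the preferred range (about 20-30 nucleotides), the broader acceptable range (about 15-35 nucleotides), or neither. The function name `classify_primer`, the use of the letter "I" to denote inosine, and the hard cut-offs (the text says "about") are assumptions made for this example.

```python
VALID_BASES = set("ACGTI")  # "I" denotes inosine; this notation is an assumption


def classify_primer(primer: str) -> str:
    """Return 'preferred' (20-30 nt), 'acceptable' (15-35 nt), or 'out of range'."""
    seq = primer.upper()
    if not seq or set(seq) - VALID_BASES:
        raise ValueError("primer must contain only A, C, G, T, or I")
    length = len(seq)
    if 20 <= length <= 30:
        return "preferred"
    if 15 <= length <= 35:
        return "acceptable"
    return "out of range"
```

A practical primer design would also weigh melting temperature, GC content, and uniqueness within the cancer nucleic acid sequence; none of those checks are attempted here.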

In addition, cancer proteins can be made that are longer than those encoded by the 
nucleic acids of the Tables, e.g., by the elucidation of extended sequences, the addition of 
epitope or purification tags, the addition of other fusion sequences, etc. 

Cancer proteins may also be identified as being encoded by cancer nucleic acids. Thus, 
cancer proteins are encoded by nucleic acids that will hybridize to the sequences of the 
sequence listings, or their complements, as outlined herein. 

Antibodies to cancer proteins 

In a preferred embodiment, when the cancer protein is to be used to generate antibodies, 
e.g., for immunotherapy or immunodiagnosis, the cancer protein should share at least one 
epitope or determinant with the full length protein. By "epitope" or "determinant" herein is 
typically meant a portion of a protein which will generate and/or bind an antibody or T-cell 
receptor in the context of MHC. Thus, in most instances, antibodies made to a smaller cancer 
protein will be able to bind to the full-length protein, particularly linear epitopes. In a preferred 
embodiment, the epitope is unique; that is, antibodies generated to a unique epitope show little 
or no cross-reactivity. In a preferred embodiment, the epitope is selected from a protein 
sequence set out in the tables. 

Methods of preparing polyclonal antibodies exist (e.g., Coligan, supra; and Harlow and 
Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or more injections 
of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent and/or 
adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. 
The immunizing agent may include a protein encoded by a nucleic acid of Tables 2A-80 or 
fragment thereof or a fusion protein thereof. It may be useful to conjugate the immunizing 
agent to a protein known to be immunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which 
may be employed include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). Various immunization 
protocols may be used. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies 
may be prepared using hybridoma methods, such as those described by Kohler and Milstein 
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce or 
are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of the tables or fragment thereof, or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are used if 
cells of human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line 
using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (e.g., pp. 
59-103 in Goding (1986) Monoclonal Antibodies: Principles and Practice Academic Press). 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of 
rodent, bovine, or human origin. Usually, rat or mouse myeloma cell lines are employed. The 
hybridoma cells may be cultured in a suitable culture medium that preferably contains one or 
more substances that inhibit the growth or survival of the unfused, immortalized cells. For 
example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase 
(HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the 
growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
monoclonal, preferably human or humanized, antibodies that have binding specificities for at 
least two different antigens or that have binding specificities for two epitopes on the same 
antigen. In one embodiment, one of the binding specificities is for a protein encoded by a 
nucleic acid of the tables or a fragment thereof, the other one is for another antigen, and 
preferably for a cell-surface protein or receptor or receptor subunit, preferably one that is tumor 
specific. Alternatively, tetramer-type technology may create multivalent reagents. 

In a preferred embodiment, the antibodies to cancer protein are capable of reducing or 
eliminating a biological function of a cancer protein, in a naked form or conjugated to an 
effector moiety, as is described below. That is, the addition of anti-cancer protein antibodies 
(either polyclonal or preferably monoclonal) to cancer tissue (or cells containing cancer) may 
reduce or eliminate the cancer. Generally, at least a 25% decrease in activity, growth, size, or 
the like is preferred, with at least about 50% being particularly preferred and about a 95-100% 
decrease being especially preferred. 

In a preferred embodiment, the antibodies to the cancer proteins are humanized 
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs, Inc.). 
Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 
or other antigen-binding subsequences of antibodies) which contain minimal sequence derived 
from non-human immunoglobulin. Humanized antibodies include human immunoglobulins 
(recipient antibody) in which residues from a complementarity determining region (CDR) of the 
recipient are replaced by residues from a CDR of a non-human species (donor antibody) such as 
mouse, rat, or rabbit having the desired specificity, affinity, and capacity. In some instances, Fv 
framework residues of a human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the 
recipient antibody nor in the imported CDR or framework sequences. In general, a humanized 
antibody will comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally will also 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones, et al. (1986) Nature 321:522-525; Riechmann, et al. (1988) Nature 
332:323-327; and Presta (1992) Curr. Op. Struct. Biol. 2:593-596). Humanization can be 
essentially performed following the method of Winter and co-workers (Jones, et al. (1986) 
Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988) 
Science 239:1534-1536), by substituting rodent CDRs or CDR sequences for corresponding 
sequences of a human antibody. Accordingly, such humanized antibodies are chimeric 
antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable 
domain has been substituted by corresponding sequence from a non-human species. 

Human antibodies can also be produced using phage display libraries (Hoogenboom and 
Winter (1992) J. Mol. Biol. 227:381-388; Marks, et al. (1991) J. Mol. Biol. 222:581-597) or 
human monoclonal antibodies (e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal 
Antibodies and Cancer Therapy Liss; and Boerner, et al. (1991) J. Immunol. 147:86-95). 
Similarly, human antibodies can be made by introducing human immunoglobulin loci into 
transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been 
partially or completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in nearly all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, e.g., in U.S. 
Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the 
following scientific publications: Marks, et al. (1992) Bio/Technology 10:779-783; Lonberg, et 
al. (1994) Nature 368:856-859; Morrison (1994) Nature 368:812-13; Fishwild, et al. (1996) 
Nature Biotechnology 14:845-851; Neuberger (1996) Nature Biotechnology 14:826; and 
Lonberg and Huszar (1995) Intern. Rev. Immunol. 13:65-93. 

By immunotherapy is meant treatment of cancer with an antibody raised against cancer 
proteins. As used herein, immunotherapy can be passive or active. Passive immunotherapy as 
defined herein is the passive transfer of antibody to a recipient (patient). Active immunization 
is the induction of antibody and/or T-cell responses in a recipient (patient). Induction of an 
immune response is the result of providing the recipient with an antigen to which antibodies are 
raised. The antigen may be provided by injecting a polypeptide against which antibodies are 
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an immune 
response. 

In a preferred embodiment the cancer proteins against which antibodies are raised are 
secreted proteins as described above. Without being bound by theory, antibodies used for 
treatment may bind and prevent the secreted protein from binding to its receptor, thereby 
inactivating the secreted cancer protein, e.g., in autocrine signaling. 

In another preferred embodiment, the cancer protein to which antibodies are raised is a 
transmembrane protein. Without being bound by theory, antibodies used for treatment may 
bind the extracellular domain of the cancer protein and prevent it from binding to other proteins, 
such as circulating ligands or cell-associated molecules. The antibody may cause 
down-regulation of the transmembrane cancer protein. The antibody may be a competitive, 
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
cancer protein. The antibody may also be an antagonist of the cancer protein. Further, the 
antibody may prevent activation of the transmembrane cancer protein, or may induce or 
suppress a particular cellular pathway. In one aspect, when the antibody prevents the binding of 
other molecules to the cancer protein, the antibody prevents growth of the cell. The antibody 
may also be used to target or sensitize the cell to cytotoxic agents, including, but not limited to, 
TNF-α, TNF-β, IL-1, IFN-γ, and IL-2, or chemotherapeutic agents including 5FU, vinblastine, 
actinomycin D, cisplatin, methotrexate, and the like. In some instances the antibody may 
belong to a sub-type that activates serum complement when complexed with the transmembrane 
protein, thereby mediating cytotoxicity or antibody-dependent cellular cytotoxicity (ADCC). Thus, 
cancer may be treated by administering to a patient antibodies directed against the 
transmembrane cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin 
payload, target a drug-loaded liposome, or otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. The 
effector moiety can be various molecules, including labeling moieties such as radioactive labels 
or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic moiety is a 
small molecule that modulates the activity of a cancer protein. In another aspect the therapeutic 
moiety may modulate the activity of molecules associated with or in close proximity to a cancer 
protein. The therapeutic moiety may inhibit enzymatic or signaling activity such as protease or 
collagenase or protein kinase activity associated with cancer, or be an attractant of other cells, 
such as NK cells. See, e.g., USSN 09/544,494. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In this 
method, targeting the cytotoxic agent to cancer tissue or cells results in a reduction in the 
number of afflicted cells, thereby reducing symptoms associated with cancer. Cytotoxic agents 
are numerous and varied and include, but are not limited to, cytotoxic drugs or toxins or active 
fragments of such toxins. Suitable toxins and their corresponding fragments include diphtheria 
A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, crotin, phenomycin, enomycin, 
saporin, auristatin, and the like. Cytotoxic agents also include radiochemicals made by 
conjugating radioisotopes to antibodies raised against cancer proteins, or binding of a 
radionuclide to a chelating agent that has been covalently attached to the antibody. Targeting 
the therapeutic moiety to transmembrane cancer proteins not only serves to increase the local 
concentration of therapeutic moiety in the cancer afflicted area, but also serves to reduce 
deleterious side effects that may be associated with the untargeted therapeutic moiety. 
Antibody fragments may be used to target toxin loaded liposomes. 

In another preferred embodiment, the cancer protein against which the antibodies are 
raised is an intracellular protein. In this case, the antibody may be conjugated to a protein 
which facilitates entry into the cell. In one case, the antibody enters the cell by endocytosis. In 
another embodiment, a nucleic acid encoding the antibody is administered to the individual or 
cell. Moreover, wherein the cancer protein can be targeted within a cell, e.g., the nucleus, an 
antibody thereto may contain a signal for that target localization, e.g., a nuclear localization 
signal. 

The cancer antibodies of the invention specifically bind to cancer proteins. By 
"specifically bind" herein is meant that the antibodies bind to the protein with a Kd of at least 
about 0.1 mM, more usually at least about 1 µM, preferably at least about 0.1 µM or better, and 
most preferably, 0.01 µM or better. Selectivity of binding to the specific target and not to 
related sequences is often also important. 
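
The affinity tiers in the definition of "specifically bind" can be expressed as a simple lookup. The sketch below assumes that the recited values are dissociation constants expressed in molar units and that "at least" and "or better" mean a value at or below the stated concentration (tighter binding); the function name `binding_tier` and the tier labels are illustrative only.

```python
def binding_tier(kd_molar: float) -> str:
    """Map a dissociation constant (mol/L) to the tier it satisfies.

    Assumes lower values mean tighter binding, so each tier is met when
    kd_molar is at or below the stated concentration.
    """
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    if kd_molar <= 1e-8:   # 0.01 uM or better
        return "most preferred"
    if kd_molar <= 1e-7:   # 0.1 uM or better
        return "preferred"
    if kd_molar <= 1e-6:   # 1 uM
        return "more usual"
    if kd_molar <= 1e-4:   # 0.1 mM
        return "specific binding"
    return "not specific binding"
```

For example, a hypothetical antibody with a measured value of 5 nM would fall in the most preferred tier, while one at 1 mM would not meet the definition at all. Selectivity against related sequences is a separate question that this lookup does not address.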

Detection of cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different cellular 
states in the cancer phenotype. Expression levels of genes in normal tissue (e.g., not 
undergoing cancer) and in cancer tissue (and in some cases, for varying severities of cancer that 
relate to prognosis, as outlined below), or in non-malignant disease are evaluated to provide 
expression profiles. A gene expression profile of a particular cell state or point of development 
is essentially a "fingerprint" of the state of the cell. While two states may have a particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the generation 
of a gene expression profile that is reflective of the state of the cell. By comparing expression 
profiles of cells in different states, information regarding which genes are important (including 
both up- and down-regulation of genes) in each of these states is obtained. Then, diagnosis may 
be performed or confirmed to determine whether a tissue sample has the gene expression profile 
of normal or cancerous tissue. This will provide for molecular diagnosis of related conditions. 
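
The idea of matching a sample's expression "fingerprint" to a reference profile can be illustrated with a short Python sketch. The specification does not prescribe a comparison metric; Pearson correlation is used here purely as one plausible choice, and the function names, state labels, and expression values are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("vectors must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("expression vector has no variation")
    return cov / (norm_x * norm_y)


def closest_state(sample, references):
    """Return the label of the reference profile best correlated with sample."""
    return max(references, key=lambda state: pearson(sample, references[state]))


# Hypothetical expression levels for four genes in each reference state.
references = {
    "normal": [1.0, 5.0, 2.0, 8.0],
    "cancer": [6.0, 1.0, 7.0, 2.0],
}
sample = [5.5, 1.5, 6.0, 2.5]
state = closest_state(sample, references)
```

Here the sample follows the same high-low-high-low pattern as the cancer reference, so it is assigned to that state. A real profile would span many more genes and would need normalization before comparison.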

"Differential expression," or grammatical equivalents as used herein, refers to qualitative 
or quantitative differences in the temporal and/or cellular gene expression patterns within and 
among cells and tissue. Thus, a differentially expressed gene can qualitatively have its 
expression altered, including an activation or inactivation, in, e.g., normal versus cancer tissue. 
Genes may be turned on or turned off in a particular state, relative to another state, thus 
permitting comparison of two or more states. A qualitatively regulated gene will exhibit an 
expression pattern within a state or cell type which is detectable by standard techniques. Some 
genes will be expressed in one state or cell type, but not in both. Alternatively, the difference in 
expression may be quantitative, e.g., in that expression is increased or decreased; e.g., gene 
expression is either upregulated, resulting in an increased amount of transcript, or 
downregulated, resulting in a decreased amount of transcript. The degree to which expression 
differs need only be large enough to quantify via standard characterization techniques as 
outlined below, such as by use of Affymetrix GeneChip™ expression arrays. See, e.g., Lockhart 
(1996) Nature Biotechnology 14:1675-1680. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis, and RNase protection. As outlined 
above, preferably the change in expression (e.g., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript or the protein level. The amount of gene 
expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final gene 
product itself (protein) can be monitored, e.g., with antibodies to the cancer protein and standard 
immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy assays, 2D gel 
electrophoresis assays, etc. Proteins corresponding to cancer genes, e.g., those identified as 
being important in a cancer or disease phenotype, can be evaluated in a cancer diagnostic test. 
In a preferred embodiment, gene expression monitoring is performed simultaneously on a 
number of genes. Multiple protein expression monitoring can be performed as well. 
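
The percentage thresholds for differential expression recited above (about 50%, 100%, 150%, 200%, and 300% or more) can be checked with a small calculation. The sketch below assumes the change is expressed as an unsigned percentage of the reference (e.g., normal tissue) level; the specification does not state the convention, and the function names are illustrative.

```python
# Thresholds recited in the text, highest first (percent change).
THRESHOLDS_PCT = (300.0, 200.0, 150.0, 100.0, 50.0)


def percent_change(reference: float, sample: float) -> float:
    """Unsigned percent change of sample relative to a positive reference level."""
    if reference <= 0:
        raise ValueError("reference expression level must be positive")
    return 100.0 * abs(sample - reference) / reference


def highest_threshold_met(reference: float, sample: float):
    """Return the largest recited threshold the change meets, or None."""
    change = percent_change(reference, sample)
    for threshold in THRESHOLDS_PCT:
        if change >= threshold:
            return threshold
    return None
```

Under this convention a decrease can never exceed 100% of the reference level, so the larger thresholds are reachable only for upregulation. A fold-change convention would treat both directions symmetrically and may be closer to what is intended for downregulated genes.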

In this embodiment, the cancer nucleic acid probes are attached to biochips as outlined 
herein for the detection and quantification of cancer sequences in a particular cell. The assays 
are further described below in the example. PCR techniques can be used to provide greater 
sensitivity. 

In a preferred embodiment, nucleic acids encoding the cancer protein are detected. 
Although DNA or RNA encoding the cancer protein may be detected, of particular interest are 
methods wherein an mRNA encoding a cancer protein is detected. Probes to detect mRNA can 
be a nucleotide/deoxynucleotide probe that is complementary to and hybridizes with the mRNA 
and includes, but is not limited to, oligonucleotides, cDNA, or RNA. Probes also should 
contain a detectable label, as defined herein. In one method the mRNA is detected after 
immobilizing the nucleic acid to be examined on a solid support such as nylon membranes and 
hybridizing the probe with the sample. Following washing to remove the non-specifically 
bound probe, the label is detected. In another method, detection of the mRNA is performed in 
situ. In this method permeabilized cells or tissue samples are contacted with a detectably 
labeled nucleic acid probe for sufficient time to allow the probe to hybridize with the target 
mRNA. Following washing to remove the non-specifically bound probe, the label is detected. 
For example, a digoxigenin-labeled riboprobe (RNA probe) that is complementary to the mRNA 
encoding a cancer protein is detected by binding the digoxigenin with an anti-digoxigenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-indolyl 
phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane, or intracellular proteins) are used in diagnostic 
assays. The cancer proteins, antibodies, nucleic acids, modified proteins, and cells containing 
cancer sequences are used in diagnostic assays. This can be performed on an individual gene or 
corresponding polypeptide level. In a preferred embodiment, the expression profiles are used, 
preferably in conjunction with high throughput screening techniques to allow monitoring for 
expression profile genes and/or corresponding polypeptides. 

As described and defined herein, cancer proteins, including intracellular, 
transmembrane, or secreted proteins, find use as markers of cancer, e.g., for prognostic or 
diagnostic purposes. Detection of these proteins in putative cancer tissue allows for detection, 
prognosis, or diagnosis of cancer or similar disease, and for selection of therapeutic strategy. In 
one embodiment, antibodies are used to detect cancer proteins. A preferred method separates 
proteins from a sample by electrophoresis on a gel (typically a denaturing and reducing protein 
gel, but may be another type of gel, including isoelectric focusing gels and the like). Following 
separation of proteins, the cancer protein is detected, e.g., by immunoblotting with antibodies 
raised against the cancer protein. 

In another preferred method, antibodies to the cancer protein find use in in situ imaging 
techniques, e.g., in histology. See, e.g., Asai, et al. (eds. 1993) Methods in Cell Biology: 
Antibodies in Cell Biology (vol. 37) Academic Press. In this method, cells are contacted with 
from one to many antibodies to the cancer protein(s). Following washing to remove 
non-specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the cancer protein(s) contains a 
detectable label, e.g., an enzyme marker that can act on a substrate. In another preferred 
embodiment each one of multiple primary antibodies contains a distinct and detectable label. 
This method finds particular use in simultaneous screening for a plurality of cancer proteins. 
Many other histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability to 
detect and distinguish emissions of different wavelengths. In addition, a fluorescence activated 
cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing cancer from blood, 
serum, plasma, stool, and other samples. Such samples, therefore, are useful as samples to be 
probed or tested for the presence of cancer proteins. Antibodies can be used to detect a cancer 
protein by previously described immunoassay techniques including ELISA, immunoblotting 
(western blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the 
presence of antibodies may indicate an immune response against an endogenous cancer protein. 

In a preferred embodiment, in situ hybridization of labeled cancer nucleic acid probes to 
tissue arrays is done. For example, arrays of tissue samples, including cancer tissue and/or 
normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then performed. 
When comparing the fingerprints between an individual and a standard, a diagnosis, a 
prognosis, or a prediction may be based on the findings. It is further understood that the genes 
which indicate the diagnosis may differ from those which indicate the prognosis and molecular 
profiling of the condition of the cells may lead to distinctions between responsive or refractory 
conditions or may be predictive of outcomes. 

In a preferred embodiment, the cancer proteins, antibodies, nucleic acids, modified 
proteins, and cells containing cancer sequences are used in prognosis assays. As above, gene 
expression profiles can be generated that correlate to cancer, clinical, pathological, or other 
information, in terms of long term prognosis. Again, this may be done on either a protein or 
gene level, with the use of genes being preferred. Single or multiple genes may be useful in 
various combinations. As above, cancer probes may be attached to biochips for the detection 
and quantification of cancer sequences in a tissue or patient. The assays proceed as outlined 
above for diagnosis. PCR methods may provide more sensitive and accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described 
herein are used in drug screening assays. The cancer proteins, antibodies, nucleic acids, 
modified proteins, and cells containing cancer sequences are used in drug screening assays or 
by evaluating the effect of drug candidates on a "gene expression profile" or expression profile 
of polypeptides. In a preferred embodiment, the expression profiles are used, preferably in 
conjunction with high throughput screening techniques, to allow monitoring for expression 
profile genes after treatment with a candidate agent (e.g., Zlokarnik, et al. (1998) Science 
279:84-88; Heid (1996) Genome Res. 6:986-994). 

In a preferred embodiment, the cancer proteins, antibodies, nucleic acids, modified 
proteins and cells containing the native or modified cancer proteins are used in screening 
assays. That is, the present invention provides novel methods for screening for compositions 
which modulate the cancer phenotype or an identified physiological function of a cancer 
protein. As above, this can be done on an individual gene level or by evaluating the effect of 
drug candidates on a "gene expression profile". In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques, to allow 
monitoring for expression profile genes after treatment with a candidate agent, see Zlokarnik, 
supra. 

Having identified the differentially expressed genes herein, a variety of assays may be 
performed. In a preferred embodiment, assays may be run on an individual gene or protein 
level. That is, having identified a particular gene as upregulated in cancer, test compounds can 
be screened for the ability to modulate gene expression or for binding to the cancer protein. 
"Modulation" thus includes both an increase and a decrease in gene expression. The preferred 
amount of modulation will depend on the original change of the gene expression in normal 
versus tissue undergoing cancer, with changes of at least 10%, preferably 50%, more preferably 
100-300%, and in some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold 
increase in cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, a 10-fold decrease in cancer tissue compared to normal tissue often provides 
a target value of a 10-fold increase in expression to be induced by the test compound. 
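
By way of illustration only, the relationship between the observed change in expression and the 
target modulation can be sketched as follows. The function names and the example levels are 
hypothetical and are not part of the disclosure; the sketch assumes that expression is reported 
as a positive number on a linear scale.

```python
def target_fold_change(cancer_level: float, normal_level: float) -> float:
    """Fold change a test compound would need to induce in cancer tissue
    to return expression to the level seen in normal tissue."""
    if cancer_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    return normal_level / cancer_level


def percent_change(before: float, after: float) -> float:
    """Percent change in expression from `before` to `after`."""
    if before <= 0:
        raise ValueError("baseline expression must be positive")
    return 100.0 * (after - before) / before
```

Under these assumptions, a gene expressed at 4-fold the normal level gives 
`target_fold_change(4.0, 1.0)` equal to 0.25, i.e., about a four-fold decrease, and a gene 
expressed at one tenth of the normal level gives a target of a 10-fold increase.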

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 
monitored, e.g., through the use of antibodies to the cancer protein and standard immunoassays. 
Proteomics and separation techniques may also allow quantification of expression. 

In a preferred embodiment, the gene expression or protein level of a number of 
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will typically 
involve a plurality of those entities described herein. 

In this embodiment, the cancer nucleic acid probes are attached to biochips as outlined 
herein for the detection and quantification of cancer sequences in a particular cell. 
Alternatively, PCR may be used. Thus, a series of, e.g., microtiter plates may be used with 
dispensed primers in desired wells. A PCR reaction can then be performed and analyzed for 
each well. 

Modulators of cancer 

Expression monitoring can be performed to identify compounds that modify the 
expression of one or more cancer-associated sequences, e.g., a polynucleotide sequence set out 
in the tables. Generally, in a preferred embodiment, a test modulator is added to the cells prior 
to analysis. Moreover, screens are also provided to identify agents that modulate cancer, 
modulate cancer proteins, bind to a cancer protein, or interfere with the binding of a cancer 
protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the cancer phenotype or the expression of a cancer sequence, e.g., a nucleic acid 
or protein sequence. In preferred embodiments, modulators alter expression profiles, or 
expression profile nucleic acids or proteins provided herein. In one embodiment, the modulator 
suppresses a cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In another 
embodiment, a modulator induces a cancer phenotype. Generally, a plurality of assay mixtures 
are run in parallel with different agent concentrations to obtain a differential response to the 
various concentrations. Typically, one of these concentrations serves as a negative control, e.g., 
at zero concentration or below the level of detection. 
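
A minimal sketch of how such a parallel concentration series might be reduced to a differential 
response is given below. The function name and the signal values are hypothetical; the sketch 
assumes that the zero-concentration mixture is the negative control and that each mixture yields 
a single numeric readout.

```python
from typing import Dict


def differential_response(signal_by_conc: Dict[float, float]) -> Dict[float, float]:
    """Subtract the negative-control (zero concentration) signal from the
    signal measured at every other agent concentration."""
    if 0.0 not in signal_by_conc:
        raise ValueError("a zero-concentration negative control is required")
    baseline = signal_by_conc[0.0]
    return {
        conc: signal - baseline
        for conc, signal in sorted(signal_by_conc.items())
        if conc != 0.0
    }
```

A result that grows in magnitude with concentration is consistent with a dose-dependent effect 
of the agent on the readout.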

Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of more 
than 100 and less than about 2,500 daltons. Preferred small molecules are less than 2000, or 
less than 1500, or less than 1000, or less than 500 daltons. Candidate agents comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen bonding, and 
typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two 
of the functional chemical groups. The candidate agents often comprise cyclical carbon or 
heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or more 
of the above functional groups. Candidate agents are also found among biomolecules including 
peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs, 
or combinations thereof. Particularly preferred are peptides. 

In one aspect, a modulator will neutralize the effect of a cancer protein. By "neutralize" 
is meant that the activity of a protein, and its consequent effect on the cell, is inhibited or blocked. 

In certain embodiments, combinatorial libraries of potential modulators will be screened 
for an ability to bind to a cancer polypeptide or to modulate activity. Conventionally, new 
chemical entities with useful properties are generated by identifying a chemical compound 
(called a "lead compound") with some desirable property or activity, e.g., inhibiting activity, 
creating variants of the lead compound, and evaluating the property and activity of those variant 
compounds. Often, high throughput screening (HTS) methods are employed for such an 
analysis. See, e.g., Janzen (2002) High Throughput Screening: Methods and Protocols 
Humana; Devlin (ed. 1997) High Throughput Screening: The Discovery of Bioactive 
Substances Dekker; and Mei and Czarnik (eds. 2002) Integrated Drug Discovery Techniques 
Dekker. 

In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate compounds). 
Such "combinatorial chemical libraries" are then screened in one or more assays to identify 
those library members (particular chemical species or subclasses) that display a desired 
characteristic activity. The compounds thus identified can serve as conventional "lead 
compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (e.g., the 
number of amino acids in a polypeptide compound). Millions of chemical compounds can be 
synthesized through such combinatorial mixing of chemical building blocks (Gallop, et al. 
(1994) J. Med. Chem. 37:1233-1251). 
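
The size of a linear combinatorial library of this kind can be illustrated with a short sketch. 
It assumes the twenty standard amino acids, written in one-letter code, as the building blocks; 
the function names are hypothetical.

```python
from itertools import product
from typing import Iterator

# One-letter codes of the 20 standard amino acids (assumed building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length: int, alphabet: str = AMINO_ACIDS) -> int:
    """Number of distinct linear sequences of the given length."""
    return len(alphabet) ** length


def enumerate_library(length: int, alphabet: str = AMINO_ACIDS) -> Iterator[str]:
    """Lazily yield every sequence of the given length over the alphabet."""
    for combo in product(alphabet, repeat=length):
        yield "".join(combo)
```

With these assumptions a pentapeptide library already contains 20^5 = 3,200,000 members, which 
is why the enumeration is written as a lazy generator rather than a list.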

Preparation and screening of combinatorial chemical libraries is well known. Such 
combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. 
Patent No. 5,010,175, Furka (1991) Pept. Prot. Res. 37:487-493, Houghton, et al. (1991) Nature 
354:84-88), peptoids (PCT Publication No WO 91/19735), encoded peptides (PCT Publication 
WO 93/20242), random bio-oligomers (PCT Publication WO 92/00091), benzodiazepines (U.S. 
Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs, et 
al. (1993) Proc. Natl. Acad. Sci. USA 90:6909-6913), vinylogous polypeptides (Hagihara, et al. 
(1992) J. Amer. Chem. Soc. 114:6568-570), nonpeptidal peptidomimetics with a Beta-D-Glucose 
scaffolding (Hirschmann, et al. (1992) J. Amer. Chem. Soc. 114:9217-9218), 
analogous organic syntheses of small compound libraries (Chen, et al. (1994) J. Amer. Chem. 
Soc. 116:2661-662), oligocarbamates (Cho, et al. (1993) Science 261:1303-1305), and/or 
peptidyl phosphonates (Campbell, et al. (1994) J. Org. Chem. 59:658). See, generally, Gordon, 
et al. (1994) J. Med. Chem. 37:1385-1401, nucleic acid libraries (see, e.g., Stratagene, Corp.), 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., 
Vaughn, et al. (1996) Nature Biotechnology 14(3):309-314, and PCT/US96/10287), 
carbohydrate libraries (see, e.g., Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent 
No. 5,593,853), and small organic molecule libraries (see, e.g., benzodiazepines, page 33 Baum 
(Jan 18, 1993) C&EN; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially available (see, 
e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, Rainin, Woburn, 
MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, Bedford, MA). 

A number of well known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca, 
Hewlett-Packard, Palo Alto, Calif.), which mimic manual synthetic operations performed by a 
chemist. The above devices are suitable for use with the present invention. The nature and 
implementation of modifications to these devices (if any) so that they can operate as discussed 
herein will be apparent. In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, RU, Tripos, 
Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek 
Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. Preferred 
assays thus detect enhancement or inhibition of cancer gene transcription, inhibition or 
enhancement of polypeptide expression, and inhibition or enhancement of polypeptide activity. 
High throughput assays for the presence, absence, quantification, or other properties of 
particular nucleic acids or protein products are well known. Binding assays and 
reporter gene assays are similarly well known. Thus, e.g., U.S. Patent No. 5,559,410 discloses 
high throughput screening methods for proteins, U.S. Patent No. 5,585,639 discloses high 
throughput screening methods for nucleic acid binding (e.g., in arrays), while U.S. Patent Nos. 
5,576,220 and 5,541,061 disclose high throughput methods of screening for ligand/antibody 
binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman Instruments, 
Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems typically 
automate entire procedures, including sample and reagent pipetting, liquid dispensing, timed 
incubations, and final readings of the microplate in detector(s) appropriate for the assay. These 
configurable systems provide high throughput and rapid start up as well as a high degree of 
flexibility and customization. The manufacturers of such systems provide detailed protocols for 
various high throughput systems. Thus, e.g., Zymark Corp. provides technical bulletins 
describing screening systems for detecting the modulation of gene transcription, ligand binding, 
and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way libraries 
of proteins may be made for screening in the methods of the invention. Particularly preferred in 
this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, with the latter 
being preferred, and human proteins being especially preferred. Particularly useful test 
compounds will be directed to the class of proteins to which the target belongs, e.g., substrates 
for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5-30 amino acids, 
with from about 5-20 amino acids being preferred, and from about 7-15 being particularly 
preferred. The peptides may be digests of naturally occurring proteins, random peptides, or 
"biased" random peptides. By "randomized" or grammatical equivalents herein is meant that 
each nucleic acid and peptide consists of essentially random nucleotides and amino acids, 
respectively. Since generally these random peptides (or nucleic acids, discussed below) are 
chemically synthesized, they may incorporate a nucleotide or amino acid at any position. The 
synthetic process can be designed to generate randomized proteins or nucleic acids, to allow the 
formation of all or most of the possible combinations over the length of the sequence, thus 
forming a library of randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is, some 
positions within the sequence are either held constant, or are selected from a limited number of 
possibilities. For example, in a preferred embodiment, the nucleotides or amino acid residues 
are randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines for cross-linking, prolines for SH-3 domains, serines, 
threonines, tyrosines, or histidines for phosphorylation sites, etc., or to purines, etc. 
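
The distinction between a fully randomized and a biased library can be sketched as follows. 
Each position is described by the set of residues allowed there: a single letter holds the 
position constant, while a longer string restricts it to a class. The residue grouping, the 
template, and the function names are hypothetical and serve only as an example; hydrophobicity 
classifications in particular vary between sources.

```python
import random
from typing import List, Sequence

ANY_RESIDUE = "ACDEFGHIKLMNPQRSTVWY"  # fully randomized position
HYDROPHOBIC = "AVLIMFW"               # illustrative class only (assumption)


def sample_biased_peptide(position_sets: Sequence[str], rng: random.Random) -> str:
    """Draw one peptide, choosing each residue from that position's allowed set."""
    return "".join(rng.choice(allowed) for allowed in position_sets)


def sample_biased_library(position_sets: Sequence[str], n: int,
                          rng: random.Random) -> List[str]:
    """Draw n peptides from the biased design (duplicates are possible)."""
    return [sample_biased_peptide(position_sets, rng) for _ in range(n)]
```

For instance, the template `["C", ANY_RESIDUE, ANY_RESIDUE, "P", HYDROPHOBIC, ANY_RESIDUE, "C"]` 
fixes a cysteine at each end and a proline at the fourth position, restricts the fifth position 
to the illustrative hydrophobic class, and leaves the remaining positions fully randomized.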

Modulators of cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For 
example, digests of prokaryotic or eukaryotic genomes may be used as is outlined above for 
proteins. 

In a preferred embodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the biochip. 
If required, the target sequence is prepared using known techniques. For example, the sample 
may be treated to lyse the cells, using known lysis buffers, electroporation, etc., with 
purification and/or amplification such as PCR performed as appropriate. For example, an in 
vitro transcription with labels covalently attached to the nucleotides is performed. Generally, 
the nucleic acids are labeled with biotin-FITC or PE, or with Cy3 or Cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the target 
sequence's specific binding to a probe. The label also can be an enzyme, such as alkaline 
phosphatase or horseradish peroxidase, which when provided with an appropriate substrate 
produces a product that can be detected. Alternatively, the label can be a labeled compound or 
small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or altered by the 
enzyme. The label also can be a moiety or compound, such as an epitope tag or biotin, which 
specifically binds to streptavidin. For the example of biotin, the streptavidin is labeled as 
described above, thereby providing a detectable signal for the bound target sequence. Unbound 
labeled streptavidin is typically removed prior to analysis. 

These assays can be direct hybridization assays or can comprise "sandwich assays", 
which include the use of multiple probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 
5,597,509, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 
5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246, and 5,681,697, all of which are hereby 
incorporated by reference. In this embodiment, in general, the target nucleic acid is prepared as 
outlined above, and then added to the biochip comprising a plurality of nucleic acid probes, 
under conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 
high, moderate, and low stringency conditions as outlined above. The assays are generally run 
under stringency conditions which allow formation of the label probe hybridization complex 
only in the presence of target. Stringency can be controlled by altering a step parameter that is a 
thermodynamic variable, including, but not limited to, temperature, formamide concentration, 
salt concentration, chaotropic salt concentration, pH, organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with preferred 
embodiments outlined below. In addition, the reaction may include a variety of other reagents. 
These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. which may be used 
to facilitate optimal hybridization and detection, and/or reduce non-specific or background 
interactions. Reagents that otherwise improve the efficiency of the assay, such as protease 
inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be used as appropriate, 
depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 
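
One simple way such assay data might be reduced to a profile is sketched below. The use of a 
log2 ratio, the two-fold cutoff, and the gene names are assumptions made for illustration and 
are not prescribed by the disclosure; levels are assumed to be positive, background-corrected 
intensities.

```python
import math
from typing import Dict


def expression_profile(normal: Dict[str, float], cancer: Dict[str, float],
                       min_fold: float = 2.0) -> Dict[str, float]:
    """Return log2(cancer/normal) for genes that differ by at least
    `min_fold` in either direction between the two states."""
    profile = {}
    for gene in sorted(normal.keys() & cancer.keys()):
        n, c = normal[gene], cancer[gene]
        if n <= 0 or c <= 0:
            continue  # a ratio is undefined without a positive signal
        ratio = c / n
        if ratio >= min_fold or ratio <= 1.0 / min_fold:
            profile[gene] = math.log2(ratio)
    return profile
```

Positive values mark genes that are upregulated in the cancer state and negative values mark 
genes that are downregulated; genes below the cutoff are omitted from the profile.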

Screens are performed to identify modulators of the cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 
important in a particular state, screens can be performed to identify modulators that alter 
expression of individual genes. In another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
expressed gene. Again, having identified the importance of a gene in a particular state, screens 
are performed to identify agents that bind and/or modulate the biological activity of the gene 
product. 

In addition, screens can be done for genes that are induced in response to a candidate 
agent or treatment process. After identifying a modulator based upon its ability to suppress a 
cancer expression pattern leading to a normal expression pattern (or its converse), or to 
modulate a single cancer gene expression profile so as to mimic the expression of the gene from 
normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent. Comparing expression profiles between normal 
tissue and agent treated cancer tissue reveals genes that are not expressed in normal tissue or 
cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences can be 
identified and used by methods described herein for cancer genes or proteins. In particular, 
these sequences and the proteins they encode find use in marking or identifying agent treated 
cells. In addition, antibodies can be raised against the agent induced proteins and used to target 
novel therapeutics, e.g., toxin loaded liposomes, to the treated cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of cancer 
cells that have an associated cancer expression profile. By "administration" or "contacting" 
herein is meant that the candidate agent is added to the cells in such a manner as to allow the 
agent to act upon the cell, whether by uptake and intracellular action, or by action at the cell 
surface. In some embodiments, nucleic acid encoding a proteinaceous candidate agent (e.g., a 
peptide) may be put into a viral construct such as an adenoviral or retroviral construct, and 
added to the cell, such that expression of the peptide agent is accomplished (see, e.g., PCT 
US97/01019). Regulatable gene therapy systems can also be used. 

Once a test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some period 
of time. The cells are then harvested and a new gene expression profile is generated, as outlined 
herein. 

Thus, e.g., cancer or non-malignant tissue may be screened for agents that modulate, 
e.g., induce or suppress a cancer phenotype. A change in at least one gene, preferably many, of 
the expression profile indicates that the agent has an effect on cancer activity. By defining such 
a signature for the cancer phenotype, screens for new drugs that alter the phenotype can be 
devised. With this approach, the drug target need not be known and need not be represented in 
the original expression screening platform, nor does the level of transcript for the target protein 
need to change. 
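
The comparison of a new expression profile against the cancer signature can be sketched as 
follows. The distance measure (absolute log2 ratio to the normal level), the gene names, and the 
function names are hypothetical; all levels are assumed to be positive.

```python
import math
from typing import Dict, List, Tuple


def signature_shift(cancer: Dict[str, float], treated: Dict[str, float],
                    normal: Dict[str, float]) -> Dict[str, Tuple[float, float]]:
    """For each gene present in all three profiles, return its distance from
    the normal level (absolute log2 ratio) before and after treatment."""
    shift = {}
    for gene in sorted(cancer.keys() & treated.keys() & normal.keys()):
        before = abs(math.log2(cancer[gene] / normal[gene]))
        after = abs(math.log2(treated[gene] / normal[gene]))
        shift[gene] = (before, after)
    return shift


def genes_moved_toward_normal(cancer: Dict[str, float], treated: Dict[str, float],
                              normal: Dict[str, float]) -> List[str]:
    """Genes whose treated level is closer to normal than the cancer level."""
    shift = signature_shift(cancer, treated, normal)
    return [gene for gene, (before, after) in shift.items() if after < before]
```

A non-empty result indicates that at least one gene of the signature responded to the agent; 
the longer the list, the more of the cancer signature has moved toward the normal fingerprint.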

In a preferred embodiment, as outlined above, screens may be done on individual genes 
and gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of either the expression of the 
gene or the gene product itself can be done. The gene products of differentially expressed genes 
are sometimes referred to herein as "cancer proteins" or a "cancer modulatory protein". The 
cancer modulatory protein may be a fragment, or alternatively, be the full length protein to the 
fragment encoded by the nucleic acids of the Tables. Preferably, the cancer modulatory protein 
is a fragment. In a preferred embodiment, the cancer amino acid sequence which is used to 
determine sequence identity or similarity is encoded by a nucleic acid of the Tables. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of the Tables. In another embodiment, the sequences are sequence variants as 
further described herein. 

Preferably, the cancer modulatory protein is a fragment about 14-24 amino acids 
long. More preferably the fragment is a soluble fragment. Preferably, the fragment includes a 
non-transmembrane region. In a preferred embodiment, the fragment has an N-terminal Cys to 
aid in solubility. In one embodiment, the C-terminus of the fragment is kept as a free acid and 
the N-terminus is a free amine to aid in coupling, e.g., to cysteine. 

In one embodiment the cancer proteins are conjugated to an immunogenic agent as 
discussed herein. In one embodiment the cancer protein is conjugated to BSA. 

Measurements of cancer polypeptide activity, or of cancer or the cancer phenotype can 
be performed using a variety of assays. For example, the effects of the test compounds upon the 
function of the cancer polypeptides can be measured by examining parameters described above. 
A suitable physiological change that affects activity can be used to assess the influence of a test 
compound on the polypeptides of this invention. When the functional consequences are 
determined using intact cells or animals, one can also measure a variety of effects such as, in the 
case of cancer associated with tumors, tumor growth, tumor metastasis, neovascularization, 
hormone release, transcriptional changes to both known and uncharacterized genetic markers 
(e.g., northern blots), changes in cell metabolism such as cell growth or pH changes, and 
changes in intracellular second messengers such as cGMP. In the assays of the invention, 
mammalian cancer polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. For 
example, a cancer polypeptide is first contacted with a potential modulator and incubated for a 
suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the cancer polypeptide 
levels are determined in vitro by measuring the level of protein or mRNA. The level of protein 
is typically measured using immunoassays such as western blotting, ELISA, and the like with 
an antibody that selectively binds to the cancer polypeptide or a fragment thereof. For 
measurement of mRNA, amplification, e.g., using PCR, LCR, or hybridization assays, e.g., 
northern hybridization, RNAse protection, dot blotting, are preferred. The level of protein or 
mRNA is typically detected using directly or indirectly labeled detection agents, e.g., 
fluorescently or radioactively labeled nucleic acids, radioactively or enzymatically labeled 
antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using a cancer protein promoter 
operably linked to a reporter gene such as luciferase, green fluorescent protein, CAT, or β-gal. 
The reporter construct is typically transfected into a cell. After treatment with a potential 
modulator, the amount of reporter gene transcription, translation, or activity is measured 
according to standard techniques. 

In a preferred embodiment, as outlined above, screens may be done on individual genes 
and gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of the expression of the gene or 
the gene product itself can be done. The gene products of differentially expressed genes are 
sometimes referred to herein as "cancer proteins." The cancer protein may be a fragment, or 
alternatively, the full length protein to a fragment shown herein. 

WO 03/042661 PCT/US02/36810 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes is evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially expressed 
proteins. These compounds are then evaluated for the ability to modulate differentially 
expressed activity. Moreover, once initial candidate compounds are identified, variants can be 
further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed nucleic 
acids are made. For example, antibodies are generated to the protein gene products, and 
standard immunoassays are run to determine the amount of protein present. Alternatively, cells 
comprising the cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a cancer protein and 
a candidate compound, and determining the binding of the compound to the cancer protein. 
Preferred embodiments utilize the human cancer protein, although other mammalian proteins 
may also be used, e.g., for the development of animal models of human disease. In some 
embodiments, as outlined herein, variant or derivative cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the cancer protein or the 
candidate agent is non-diffusably bound to an insoluble support, preferably having isolated 
sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, that is readily separated from 
soluble material, and that is otherwise compatible with the overall method of screening. The surface 
of such supports may be solid or porous and of a convenient shape. Examples of suitable 
insoluble supports include microtiter plates, arrays, membranes, and beads. These are typically 
made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, teflon™, etc. 
Microtiter plates and arrays are especially convenient because a large number of assays can be 
carried out simultaneously, using small amounts of reagents and samples. The particular 
manner of binding of the composition is typically not crucial so long as it is compatible with the 
reagents and overall methods of the invention, maintains the activity of the composition, and is 
nondiffusable. Preferred methods of binding include the use of antibodies (which do not 
sterically block either the ligand binding site or activation sequence when the protein is bound 
to the support), direct binding to "sticky" or ionic supports, chemical crosslinking, the synthesis 
of the protein or agent on the surface, etc. Following binding of the protein or agent, excess 
unbound material is removed by washing. The sample receiving areas may then be blocked 
through incubation with bovine serum albumin (BSA), casein, or other innocuous protein or 
other moiety. 

In a preferred embodiment, the cancer protein is bound to the support, and a test 
compound is added to the assay. Alternatively, the candidate agent is bound to the support and 
the cancer protein is added. Novel binding agents include specific antibodies, non-natural 
binding agents identified in screens of chemical libraries, peptide analogs, etc. Of particular 
interest are screening assays for agents that have a low toxicity for human cells. A wide variety 
of assays may be used for this purpose, including labeled in vitro protein-protein binding assays, 
electrophoretic mobility shift assays, immunoassays for protein binding, functional assays 
(phosphorylation assays, etc.), and the like. 

The determination of the binding of the test modulating compound to the cancer protein 
may be done in a number of ways. In a preferred embodiment, the compound is labeled, and 
binding determined directly, e.g., by attaching all or a portion of the cancer protein to a solid 
support, adding a labeled candidate agent (e.g., a fluorescent label), washing off excess reagent, 
and determining whether the label is present on the solid support. Various blocking and 
washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compounds) can be labeled. Alternatively, more than one component 
can be labeled with different labels, e.g., 125I for the proteins and a fluorophore for the 
compound. Proximity reagents, e.g., quenching or energy transfer reagents are also useful. 

In one embodiment, the binding of the test compound is determined by competitive 
binding assay. The competitor may be a binding moiety known to bind to the target molecule 
(e.g., a cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under certain 
circumstances, there may be competitive binding between the compound and the binding 
moiety, with the binding moiety displacing the compound. In one embodiment, the test 
compound is labeled. Either the compound, or the competitor, or both, is added first to the 
protein for a time sufficient to allow binding, if present. Incubations may be performed at a 
temperature which facilitates optimal activity, typically between about 4-40° C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 
between 0.1-1 hour will be sufficient. Excess reagent is generally removed or washed away. 
The second component is then added, and the presence or absence of the labeled component is 
followed, to indicate binding. 
5 In a preferred embodiment, the competitor is added first, followed by a test compound. 

Displacement of the competitor is an indication that the test compound is binding to the cancer 
protein and thus is capable of binding to, and potentially modulating, the activity of the cancer 
protein. In this embodiment, either component can be labeled. Thus, e.g., if the competitor is 
labeled, the presence of label in the wash solution indicates displacement by the agent. 
Alternatively, if the test compound is labeled, the presence of the label on the support indicates
displacement. 

In an alternative embodiment, the test compound is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the cancer protein with a higher affinity. Thus, if the test
compound is labeled, the presence of the label on the support, coupled with a lack of competitor
binding, may indicate that the test compound is capable of binding to the cancer protein. 

In a preferred embodiment, the methods comprise differential screening to identify
agents that are capable of modulating the activity of the cancer proteins. In one embodiment,
the methods comprise combining a cancer protein and a competitor in a first sample. A second
sample comprises a test compound, a cancer protein, and a competitor. The binding of the
competitor is determined for both samples, and a change, or difference, in binding between the
two samples indicates the presence of an agent capable of binding to the cancer protein and
potentially modulating its activity. That is, if the binding of the competitor is different in the
second sample relative to the first sample, the agent is capable of binding to the cancer protein.
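
The two-sample comparison described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the background subtraction, and the 20% cutoff are assumptions chosen for the example and are not specified in this description.

```python
def competitor_binding_change(first_sample, second_sample, background=0.0):
    """Fractional change in competitor binding between the two samples.

    first_sample:  competitor signal with cancer protein + competitor only
    second_sample: competitor signal with test compound also present
    background:    signal measured with no competitor (assumed input)
    """
    control = first_sample - background
    test = second_sample - background
    if control <= 0:
        raise ValueError("first-sample signal must exceed background")
    return (test - control) / control


def indicates_binding(first_sample, second_sample, cutoff=0.2, background=0.0):
    """True if competitor binding differs by at least `cutoff` (hypothetical 20%)."""
    change = competitor_binding_change(first_sample, second_sample, background)
    return abs(change) >= cutoff
```

A negative value means the competitor bound less in the presence of the test compound. Any cutoff used in practice would have to be set from the variability of the particular assay.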

Alternatively, differential screening is used to identify drug candidates that bind to the
native cancer protein, but cannot bind to modified cancer proteins. The structure of the cancer
protein may be modeled, and used in rational drug design to synthesize agents that interact with
that site. Drug candidates that affect the activity of a cancer protein are also identified by
screening drugs for the ability to either enhance or reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably, control
and test samples are performed in at least triplicate to obtain statistically significant results.
Incubation of all samples is for a time sufficient for the binding of the agent to the protein.
Following incubation, samples are washed free of non-specifically bound material and the
amount of bound, generally labeled, agent is determined. For example, where a radiolabel is
employed, the samples may be counted in a scintillation counter to determine the amount of
bound compound.
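
As a rough illustration of how replicate scintillation counts might be summarized, the following Python sketch subtracts the mean negative-control count from the mean test count. The function name and the background-subtraction step are assumptions made for the example; the description requires only that samples be run in at least triplicate.

```python
from statistics import mean, stdev


def summarize_counts(test_cpm, negative_control_cpm):
    """Return (specific signal, standard deviation of test replicates).

    Both arguments are lists of counts per minute; at least three
    replicates of each are required, following the text above.
    """
    if len(test_cpm) < 3 or len(negative_control_cpm) < 3:
        raise ValueError("at least triplicate measurements are required")
    background = mean(negative_control_cpm)  # non-specific signal
    specific = mean(test_cpm) - background   # signal attributed to bound agent
    return specific, stdev(test_cpm)
```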

A variety of other reagents may be included in the screening assays. These include
reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used to
facilitate optimal protein-protein binding and/or reduce non-specific or background interactions.
Also reagents that otherwise improve the efficiency of the assay, such as protease inhibitors,
nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture of components may
be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a cancer protein. The methods comprise 
adding a test compound, as defined above, to a cell comprising cancer proteins. Preferred cell 
types include almost any cell. The cells contain a recombinant nucleic acid that encodes a
cancer protein. In a preferred embodiment, a library of candidate agents is tested on a plurality
of cells.

In one aspect, the assays are evaluated in the presence or absence of, or with previous or
subsequent exposure to, physiological signals, e.g., hormones, antibodies, peptides, antigens,
cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogens, or other cells (e.g., cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate cancer agents are identified. Compounds with 
pharmacological activity are able to enhance or interfere with the activity of the cancer protein. 
Once identified, similar structures are evaluated to identify critical structural features of the
compound.

In one embodiment, a method of inhibiting cancer cell division is provided. The method 
comprises administration of a cancer inhibitor. In another embodiment, a method of inhibiting 
cancer is provided. The method may comprise administration of a cancer inhibitor. In a further 
embodiment, methods of treating cells or individuals with cancer are provided, e.g., comprising 
administration of a cancer inhibitor.

In one embodiment, a cancer inhibitor is an antibody as discussed above. In another 
embodiment, the cancer inhibitor is an antisense molecule.

A variety of cell growth, proliferation, viability, and metastasis assays are available, as
described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, such 
as semi-solid or soft agar. The transformed cells, when transfected with tumor suppressor
genes, regenerate a normal phenotype and require a solid substrate to attach and grow. Soft agar
growth or colony formation in suspension assays can be used to identify modulators of cancer
sequences, which when expressed in host cells, inhibit abnormal cellular proliferation and
transformation. A therapeutic compound would reduce or eliminate the host cells' ability to
grow in stirred suspension culture or suspended in semi-solid media, such as semi-solid or soft
agar.

Techniques for soft agar growth or colony formation in suspension assays are described,
e.g., in Freshney (1998) Culture of Animal Cells: A Manual of Basic Technique (3d ed.)
Wiley-Liss; Freshney (2000) Culture of Animal Cells: A Manual of Basic Technique (4th ed.)
Wiley-Liss; and Garkavtsev, et al. (1996) Nature Genet. 14:415-20.

Contact inhibition and density limitation of growth

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and continue 
to grow to high densities in disorganized foci. Thus, the transformed cells grow to a higher 
saturation density than normal cells. This can be detected morphologically by the formation of 
a disoriented monolayer of cells or rounded cells in foci within the regular pattern of normal 
surrounding cells. Alternatively, labeling index with (³H)-thymidine at saturation density can
be used to measure density limitation of growth. See Freshney (2000), supra. The transformed
cells, when transfected with tumor suppressor genes, regenerate a normal phenotype and
become contact inhibited and would grow to a lower density.

In this assay, labeling index with (³H)-thymidine at saturation density is a preferred
method of measuring density limitation of growth. Transformed host cells are transfected with
a cancer-associated sequence and are grown for 24 hours at saturation density in non-limiting
medium conditions. The percentage of cells labeling with (³H)-thymidine is determined
autoradiographically. See, Freshney (1998), supra. 
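
The labeling index is simply the percentage of scored cells that carry label. A minimal Python sketch follows; the function name and the input checks are assumptions added for illustration.

```python
def labeling_index(labeled_cells, total_cells):
    """Percentage of scored cells that incorporated the thymidine label."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells
```

For example, 45 labeled cells out of 300 scored gives a labeling index of 15 percent. A lower index at saturation density would be consistent with restored density limitation of growth.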

Growth factor or serum dependence

Transformed cells typically have a lower serum dependence than their normal
counterparts (see, e.g., Temin (1966) J. Natl. Cancer Inst. 37:167-175; Eagle, et al. (1970) J.
Exp. Med. 131:836-879; Freshney, supra). This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host cells
can be compared with that of control.

Tumor specific marker levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor specific
markers") than their normal counterparts. For example, plasminogen activator (PA) is released
from human glioma at a higher level than from normal brain cells (see, e.g., Gullino
"Angiogenesis, tumor vascularization, and potential interference with tumor growth" pp. 178-184
in Mihich (ed. 1985) Biological Responses in Cancer Plenum). Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than in their normal
counterparts. See, e.g., Folkman (1992) Sem. Cancer Biol. 3:89-96.

Various techniques which measure the release of these factors are described in Freshney
(1998), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; Strickland and
Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 42:305-312;
Gullino "Angiogenesis, tumor vascularization, and potential interference with tumor growth"
pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum; Freshney (1985)
Anticancer Res. 5:111-130.

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix constituent
can be used as an assay to identify compounds that modulate cancer-associated sequences.
Tumor cells exhibit a good correlation between malignancy and invasiveness of cells into
Matrigel or some other extracellular matrix constituent. In this assay, tumorigenic cells are
typically used as host cells. Expression of a tumor suppressor gene in these host cells would
decrease invasiveness of the host cells.

Techniques described in Freshney (1994), supra, can be used. Briefly, the level of
invasion of host cells can be measured by using filters coated with Matrigel or some other
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the
filter, is rated as invasiveness, and rated histologically by number of cells and distance moved,
or by prelabeling the cells with ¹²⁵I and counting the radioactivity on the distal side of the filter
or bottom of the dish. See, e.g., Freshney (1984), supra.

Tumor growth in vivo

Effects of cancer-associated sequences on cell growth can be tested in transgenic or 
immune-suppressed mice. Knock-out transgenic mice can be made, in which the cancer gene is 
disrupted or in which a cancer gene is inserted. Knock-out transgenic mice can be made by 
insertion of a marker gene or other heterologous gene into the endogenous cancer gene site in 
the mouse genome via homologous recombination. Such mice can also be made by substituting 
the endogenous cancer gene with a mutated version of the cancer gene, or by mutating the 
endogenous cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells containing 
the newly engineered genetic lesion are injected into a host mouse embryo, which is
re-implanted into a recipient female. Some of these embryos develop into chimeric mice that
possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic lesion 
(see, e.g., Capecchi, et al. (1989) Science 244:1288-1292). Chimeric targeted mice can be 
derived according to Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory 
Manual CSH Press; and Robertson (ed. 1987) Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be
used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J.
Natl. Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated mouse
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J. Cancer
41:52-61) can be used as a host. Transplantable tumor cells (typically about 10⁶ cells) injected
into isogenic hosts will produce invasive tumors in a high proportion of cases, while normal
cells of similar origin will not. In hosts which developed invasive tumors, cells expressing a
cancer-associated sequence are injected subcutaneously. After a suitable length of time,
preferably about 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest
dimensions) and compared to the control. Tumors that show a statistically significant reduction
(using, e.g., Student's t test) are said to have inhibited growth.
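
The measurement and comparison steps above can be illustrated with a short Python sketch. Two points are assumptions of the example rather than statements of this description: the volume estimate V = length × width² / 2 is a commonly used caliper approximation, and the test shown is the pooled-variance (equal variance) form of Student's t test. The function names are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def tumor_volume(length_mm, width_mm):
    """Estimate tumor volume (mm^3) from its two largest dimensions.

    Uses the caliper approximation V = L * W^2 / 2 (an assumption here).
    """
    return length_mm * width_mm ** 2 / 2.0


def student_t(control, treated):
    """Two-sample Student's t statistic with pooled variance.

    Returns (t, degrees_of_freedom). The caller compares |t| with the
    critical value from a t table for the chosen significance level.
    """
    n1, n2 = len(control), len(treated)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two measurements")
    df = n1 + n2 - 2
    pooled = ((n1 - 1) * variance(control) + (n2 - 1) * variance(treated)) / df
    if pooled == 0:
        raise ValueError("no variation in the data; t is undefined")
    t = (mean(control) - mean(treated)) / sqrt(pooled * (1.0 / n1 + 1.0 / n2))
    return t, df
```

With 4 degrees of freedom, for instance, the two-tailed 5% critical value is about 2.776, so a t statistic above that value would be reported as a statistically significant difference between control and treated tumors.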

Polynucleotide modulators of cancer
Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a cancer-associated protein is down-regulated, or
entirely inhibited, by the use of an inhibitory or antisense polynucleotide, e.g., a nucleic acid 
complementary to, and which can preferably hybridize specifically to, a coding mRNA nucleic 
acid sequence, e.g., a cancer protein mRNA, or a subsequence thereof. Binding of the antisense 
polynucleotide to the mRNA reduces the translation and/or stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their
close homologs. Antisense polynucleotides may also have altered sugar moieties or inter-sugar
linkages. Exemplary among these are the phosphorothioate and other sulfur-containing species.
Analogs are comprehended by this invention so long as they function effectively to hybridize
with the cancer protein mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitur, Inc.,
Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant means, or 
can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, including 
Applied Biosystems. The preparation of other oligonucleotides such as phosphorothioates and 
alkylated derivatives is also well known. 

Antisense molecules as used herein include antisense or sense oligonucleotides. Sense 
oligonucleotides can, e.g., be employed to block transcription by binding to the anti-sense 
strand. The antisense and sense oligonucleotides comprise a single-stranded nucleic acid
sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA (antisense)
sequences for cancer molecules. A preferred antisense molecule is for a cancer sequence in the
Tables, or for a ligand or activator thereof. Antisense or sense oligonucleotides, according to
the present invention, comprise a fragment generally at least about 14 nucleotides, preferably
from about 14-30 nucleotides. The ability to derive an antisense or a sense oligonucleotide,
based upon a cDNA sequence encoding a given protein is described in, e.g., Stein and Cohen
(1988) Cancer Res. 48:2659-2668; and van der Krol, et al. (1988) BioTechniques 6:958-976.
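
As an illustration of deriving candidate antisense oligonucleotides from a cDNA sequence, the Python sketch below takes each window of the coding (sense) strand and returns its reverse complement. The function name, the default length of 20, and the exhaustive sliding window are assumptions for the example; the sketch does not model target accessibility, specificity, or chemistry, all of which affect whether an oligonucleotide is useful in practice.

```python
# DNA complement table for the four standard bases
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligos(cdna, length=20):
    """List every antisense DNA oligo of `length` nt for a sense-strand cDNA.

    `length` is restricted to the 14-30 nucleotide range given in the text.
    Each oligo is the reverse complement of one window of the cDNA.
    """
    cdna = cdna.upper()
    if not 14 <= length <= 30:
        raise ValueError("length must be between 14 and 30 nucleotides")
    if set(cdna) - set("ACGT"):
        raise ValueError("cDNA may contain only A, C, G and T")
    return [
        cdna[i:i + length].translate(COMPLEMENT)[::-1]
        for i in range(len(cdna) - length + 1)
    ]
```

Calling `antisense_oligos("ATGGCCAAGCTTGGA", 14)` yields two 14-mers, the first being `CCAAGCTTGGCCAT`.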

RNA interference is a mechanism to suppress gene expression in a sequence specific
manner. See, e.g., Brummelkamp, et al. (2002) Sciencexpress (21 March 2002); Sharp (1999)
Genes Dev. 13:139-141; and Carthew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to be
effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 411:494-498.
The mechanism may be used to downregulate expression levels of identified genes, e.g., for
treatment of, or validation of relevance to, disease.

Ribozymes

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of cancer-associated nucleotide sequences. A ribozyme is an RNA molecule that 
catalytically cleaves other RNA molecules. Different kinds of ribozymes have been described, 
including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, RNase P, and axhead 
ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 25: 289-317 for a general 
review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 
5,254,678. Methods of preparation are described in, e.g., WO 94/26877; Ojwang, et al. (1993) 
Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) Human Gene Therapy 1:39-45;
Leavitt, et al. (1995) Proc. Natl. Acad. Sci. USA 92:699-703; Leavitt, et al. (1994) Human
Gene Therapy 5:1151-120; and Yamada, et al. (1994) Virology 205:121-126.

Polynucleotide modulators of cancer may be introduced into a cell containing the target 
nucleotide sequence by formation of a conjugate with a ligand binding molecule, as described in 
WO 91/04753. Suitable ligand binding molecules include, but are not limited to, cell surface 
receptors, growth factors, other cytokines, or other ligands that bind to cell surface receptors. 
Preferably, conjugation of the ligand binding molecule does not substantially interfere with the 
ability of the ligand binding molecule to bind to its corresponding molecule or receptor, or 
block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 
Alternatively, a polynucleotide modulator of cancer may be introduced into a cell containing the 
target nucleic acid sequence, e.g., by formation of a polynucleotide-lipid complex, as described
in WO 90/10448. It is understood that the use of antisense molecules or knock out and knock in 
models may also be used in screening assays as discussed above, in addition to methods of 
treatment. 

Thus, in one embodiment, methods of modulating cancer in cells or organisms are 
provided. In one embodiment, the methods comprise administering to a cell an anti-cancer 
antibody that reduces or eliminates the biological activity of an endogenous cancer protein. 
Alternatively, the methods comprise administering to a cell or organism a recombinant nucleic 
acid encoding a cancer protein. This may be accomplished in any number of ways. In a
preferred embodiment, e.g., when the cancer sequence is down-regulated in cancer, such state 
may be reversed by increasing the amount of cancer gene product in the cell. This can be 
accomplished, e.g., by overexpressing the endogenous cancer gene or administering a gene 
encoding the cancer sequence, using known gene-therapy techniques. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene using 
enhanced homologous recombination (EHR), e.g., as described in PCT/US93/0386. 
Alternatively, e.g., when the cancer sequence is up-regulated in cancer, the activity of the 
endogenous cancer gene is decreased, e.g., by the administration of a cancer antisense or other 
inhibitor, e.g., RNAi. 

In one embodiment, the cancer proteins of the present invention may be used to generate 
polyclonal and monoclonal antibodies to cancer proteins. Similarly, the cancer proteins can be 
coupled, using standard technology, to affinity chromatography columns. These columns may 
then be used to purify cancer antibodies useful for production, diagnostic, or therapeutic 
purposes. In a preferred embodiment, the antibodies are generated to epitopes unique to a 
cancer protein; that is, the antibodies show little or no cross-reactivity to other proteins. The 
cancer antibodies may be coupled to standard affinity chromatography columns and used to 
purify cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the cancer protein.

Methods of identifying variant cancer-associated sequences

Without being bound by theory, expression of various cancer sequences is correlated 
with cancer. Accordingly, disorders based on mutant or variant cancer genes may be 
determined. In one embodiment, the invention provides methods for identifying cells 
containing variant cancer genes, e.g., determining all or part of the sequence of at least one 
endogenous cancer gene in a cell. In a preferred embodiment, the invention provides methods 
of identifying the cancer genotype of an individual, e.g., determining all or part of the sequence 
of at least one cancer gene of the individual. This is generally done in at least one tissue of the 
individual, and may include the evaluation of a number of tissues or different samples of the 
same tissue. The method may include comparing the sequence of the sequenced cancer gene to 
a known cancer gene, e.g., a wild-type gene. 

The sequence of all or part of the cancer gene can then be compared to the sequence of a 
known cancer gene to determine if any differences exist. This can be done using known 
homology programs, such as Bestfit, etc. In a preferred embodiment, the presence of a
difference in the sequence between the cancer gene of the patient and the known cancer gene 
correlates with a disease state or a propensity for a disease state, as outlined herein. 
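
Once the sequences have been aligned, listing the differences between a sample sequence and a known sequence is straightforward. The Python sketch below assumes the two sequences are already aligned and of equal length; it is not a substitute for an alignment program such as Bestfit, and the function name is hypothetical.

```python
def sequence_differences(known, sample):
    """Return (position, known_base, sample_base) for each mismatch.

    Positions are 1-based. Both sequences must already be aligned and
    of equal length (an assumption of this sketch).
    """
    if len(known) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i + 1, k, s)
        for i, (k, s) in enumerate(zip(known.upper(), sample.upper()))
        if k != s
    ]
```

For example, comparing `ATGGCC` with `ATGACC` reports a single difference at position 4 (G in the known sequence, A in the sample).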

In a preferred embodiment, the cancer genes are used as probes to determine the number 
of copies of the cancer gene in the genome. 

In another preferred embodiment, the cancer genes are used as probes to determine the 
chromosomal localization of the cancer genes. Information such as chromosomal localization 
finds use in providing a diagnosis or prognosis in particular when chromosomal abnormalities 
such as translocations, and the like are identified in the cancer gene locus.

Administration of pharmaceutical and vaccine compositions

In one embodiment, a therapeutically effective dose of a cancer protein or modulator 
thereof, is administered to a patient. By "therapeutically effective dose" herein is meant a dose 
that produces effects for which it is administered. The exact dose will depend on the purpose of 
the treatment, and will be ascertainable using known techniques. See, e.g., Ansel, et al. (1999) 
Pharmaceutical Dosage Forms and Drug Delivery Lippincott; Lieberman (1992) Pharmaceutical 
Dosage Forms (vols. 1-3) Dekker, ISBN 0824770846, 082476918X, 0824712692, 0824716981; 
Lloyd (1999) The Art, Science and Technology of Pharmaceutical Compounding Amer. 
Pharmaceut. Assn.; and Pickar (1998) Dosage Calculations Thomson. Adjustments for cancer 
degradation, systemic versus localized delivery, and rate of new protease synthesis, as well as 
the age, body weight, general health, sex, diet, time of administration, drug interaction, and the 
severity of the condition may be necessary. U.S. Patent Application No. 09/687,576 further
discloses the use of compositions and methods of diagnosis and treatment in cancer. 

A "patient" for the purposes of the present invention includes both humans and other 
animals, particularly mammals. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the cancer proteins and modulators thereof of the present 
invention can be done in a variety of ways, including, but not limited to, orally, subcutaneously, 
intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, intrapulmonary, 
vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment of wounds and 
inflammation, the cancer proteins and modulators may be directly applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise a cancer protein in a
form suitable for administration to a patient. In the preferred embodiment, the pharmaceutical 
compositions are in a water soluble form, such as being present as pharmaceutically acceptable 
salts, which is meant to include both acid and base addition salts. "Pharmaceutically acceptable 
acid addition salt" refers to those salts that retain the biological effectiveness of the free bases 
and that are not biologically or otherwise undesirable, formed with inorganic acids such as 
hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid, and the like, and 
organic acids such as acetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic 
acid, malonic acid, succinic acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic 
acid, mandelic acid, methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic 
acid, and the like. "Pharmaceutically acceptable base addition salts" include those derived from 
inorganic bases such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, 
zinc, copper, manganese, aluminum salts, and the like. Particularly preferred are the 
ammonium, potassium, sodium, calcium, and magnesium salts. Salts derived from 
pharmaceutically acceptable organic non-toxic bases include salts of primary, secondary, and 
tertiary amines, substituted amines including naturally occurring substituted amines, cyclic 
amines and basic ion exchange resins, such as isopropylamine, trimethylamine, diethylamine,
triethylamine, tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: carrier 
proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, lactose, corn 
and other starches; binding agents; sweeteners and other flavoring agents; coloring agents; and 
polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage forms 
depending upon the method of administration. For example, unit dosage forms suitable for oral 
administration include, but are not limited to, powder, tablets, pills, capsules and lozenges. It is 
recognized that cancer protein modulators (e.g., antibodies, antisense constructs, ribozymes, 
small organic molecules, etc.) when administered orally, should be protected from digestion. 
This is typically accomplished either by complexing the molecule(s) with a composition to 
render it resistant to acidic and enzymatic hydrolysis, or by packaging the molecule(s) in an 
appropriately resistant carrier, such as a liposome or a protection barrier. Means of protecting 
agents from digestion are available.

The compositions for administration will commonly comprise a cancer protein
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. A 
variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions are 
sterile and generally free of undesirable matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents, and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate, and 
the like. The concentration of active agent in these formulations can vary widely, and will be 
selected primarily based on fluid volumes, viscosities, body weight, and the like in accordance 
with the particular mode of administration selected and the patient's needs (see, e.g., (1980)
Remington's Pharmaceutical Science (18th ed.) Mack, and Hardman and Limbird (eds. 2001)
Goodman and Gilman: The Pharmacological Basis of Therapeutics (10th ed.) McGraw-Hill).

Thus, a typical pharmaceutical composition for intravenous administration would be 
about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per
day may be used, particularly when the drug is administered to a secluded site and not into the 
blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 
administrable compositions will be known or apparent. 

The compositions containing modulators of cancer proteins can be administered for 
therapeutic or prophylactic treatments. In therapeutic applications, compositions are 
administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to cure
or at least partially arrest the disease and its complications. An amount adequate to accomplish
this is defined as a "therapeutically effective dose." Amounts effective for this use will depend
upon the severity of the disease and the general state of the patient's health. Single or multiple
administrations of the compositions may be administered depending on the dosage and
frequency as required and tolerated by the patient. In any event, the composition should
provide a sufficient quantity of the agents of this invention to effectively treat the patient. An
amount of modulator that is capable of preventing or slowing the development of cancer in a
mammal is referred to as a "prophylactically effective dose." The particular dose required for a
prophylactic treatment will depend upon the medical condition and history of the mammal, the 
particular cancer being prevented, as well as other factors such as age, weight, gender, 
administration route, efficiency, etc. Such prophylactic treatments may be used, e.g., in a
mammal who has previously had cancer to prevent a recurrence of the cancer, or in a mammal 
who is suspected of having a significant likelihood of developing cancer based, at least in part, 
upon gene expression profiles. Vaccine strategies may be used, in either a DNA vaccine form, 
or protein vaccine. 

It will be appreciated that the present cancer protein-modulating compounds can be 
administered alone or in combination with additional cancer modulating compounds or with 
other therapeutic agents, e.g., other anti-cancer agents or treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides comprising 
nucleic acid sequences set forth in the Tables, such as RNAi, antisense polynucleotides or 
ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of cancer-associated polypeptides 
and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or organism-based) 
recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for introducing 
foreign nucleotide sequences into host cells may be used. These include the use of calcium 
phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, plasmid
vectors, viral vectors, and other well known methods for introducing cloned genomic DNA,
cDNA, synthetic DNA, or other foreign genetic material into a host cell (see, e.g., Berger and
Kimmel (1987) Guide to Molecular Cloning Techniques from Methods in Enzymology (vol.
152) Academic Press; Ausubel, et al. (eds. 1999 and supplements) Current Protocols Lippincott;
and Sambrook, et al. (2001) Molecular Cloning: A Laboratory Manual (3d ed., Vol. 1-3) CSH
Press).

In a preferred embodiment, cancer proteins and modulators are administered as 
therapeutic agents, and can be formulated as outlined above. Similarly, cancer genes (including 
the full-length sequence, partial sequences, or regulatory sequences of the cancer coding 
regions) can be administered in a gene therapy application. These cancer genes can include 
inhibitory applications, e.g., as inhibitory RNA, gene therapy (e.g., for incorporation into the 
genome), or antisense compositions. 

Cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions can 
include, e.g., lipidated peptides (see, e.g., Vitiello, et al. (1995) J. Clin. Invest. 95:341-349), 
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 
(see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 
12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions contained in 
immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) Nature 
344:873-875; Hu, et al. (1998) Clin Exp Immunol. 113:235-243), multiple antigen peptide systems 
(MAPs) (see, e.g., Tam (1988) Proc. Natl. Acad. Sci. USA 85:5409-5413; Tam (1996) J. 
Immunol. Methods 196:17-32), peptides formulated as multivalent peptides; peptides for use in 
ballistic delivery systems, typically crystallized peptides, viral delivery vectors (Perkus, et al., p. 
379, in Kaufmann (ed. 1996) Concepts in Vaccine Development de Gruyter; Chakrabarti, et al. 
(1986) Nature 320:535-537; Hu, et al. (1986) Nature 320:537-540; Kieny, et al. (1986) 
Bio/Technology 4:790-795; Top, et al. (1971) J. Infect. Dis. 124:148-154; Chanda, et al. (1990) 
Virology 175:535-547), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) J. 
Immunol. Methods 192:25-35; Eldridge, et al. (1993) Sem. Hematol. 30:16-24; Falo, et al. 
(1995) Nature Med. 1:649-653), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 
4:369-388; Gupta, et al. (1993) Vaccine 11:293-306), liposomes (Reddy, et al. (1992) J. Immunol. 
148:1585-1589; Rock (1996) Immunol. Today 17:131-137), or, naked or particle absorbed 
cDNA (Ulmer, et al. (1993) Science 259:1745-1749; Robinson, et al. (1993) Vaccine 
11:957-960; Shiver, et al., p 423, in Kaufmann (ed. 1996) Concepts in Vaccine Development de 
Gruyter; Cease and Berzofsky (1994) Annu. Rev. Immunol. 12:923-989; and Eldridge, et al. 
(1993) Sem. Hematol. 30:16-24). Toxin-targeted delivery technologies, also known as receptor 
mediated targeting, such as those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) 
may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or mineral 
oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis, or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available as, 
e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); 
Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, 
Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum 
phosphate; salts of calcium, iron, or zinc; an insoluble suspension of acylated tyrosine; acylated 
sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF, 
interleukin-2, -7, -12, and other like growth factors, may also be used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465-1468, as well 
as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO 
98/04720; and in more detail below. Examples of DNA-based delivery technologies include 
"naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery, cationic lipid 
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. 
Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention can 
be expressed by viral or bacterial vectors. Examples of expression vectors include attenuated 
viral hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus, e.g., 
as a vector to express nucleotide sequences that encode cancer polypeptides or polypeptide 
fragments. Upon introduction into a host, the recombinant vaccinia virus expresses the 
immunogenic peptide, and thereby elicits an immune response. Vaccinia vectors and methods 
useful in immunization protocols are described in, e.g., U.S. Patent No. 4,722,848. Another 
vector is BCG (Bacille Calmette Guerin). BCG vectors are described in Stover, et al. (1991) 
Nature 351:456-460. A wide variety of other vectors are available for therapeutic 
administration or immunization, e.g., adeno and adeno-associated virus vectors, retroviral 
vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like. See, e.g., 
Shata, et al. (2000) Mol Med Today 6:66-71; Shedlock, et al. (2000) J. Leukoc. Biol. 68:793- 
806; Hipp, et al. (2000) In Vivo 14:571-85. 

Methods for the use of genes as DNA vaccines are well known, and include placing a 
cancer gene or portion of a cancer gene under the control of a regulatable promoter or a tissue- 
specific promoter for expression in a cancer patient. The cancer gene used for DNA vaccines 
can encode full-length cancer proteins, but more preferably encodes portions of the cancer 
proteins including peptides derived from the cancer protein. In one embodiment, a patient is 
immunized with a DNA vaccine comprising a plurality of nucleotide sequences derived from a 
cancer gene. For example, cancer-associated genes or sequences encoding subfragments of a 
cancer protein are introduced into expression vectors and tested for their immunogenicity in the 
context of Class I MHC and an ability to generate cytotoxic T cell responses. This procedure 
provides for production of cytotoxic T cell responses against cells which present antigen, 
including intracellular epitopes. 

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase the 
immunogenic response to the cancer polypeptide encoded by the DNA vaccine. Additional or 
alternative adjuvants are available. 

In another preferred embodiment, cancer genes find use in generating animal models of 
cancer. When the cancer gene identified is repressed or diminished in cancer tissue, gene 
therapy technology, e.g., inhibitory or antisense RNA directed to the cancer gene, will 
also diminish or repress expression of the gene. Animal models of cancer find use in screening 
for modulators of a cancer-associated sequence or modulators of cancer. Similarly, transgenic 
animal technology, including gene knockout technology, e.g., as a result of homologous 
recombination with an appropriate gene targeting vector, will result in the absence or increased 
expression of the cancer protein. When desired, tissue-specific expression or knockout of the 
cancer protein may be necessary. 

It is also possible that the cancer protein is overexpressed in cancer. As such, transgenic 
animals can be generated that overexpress the cancer protein. Depending on the desired 
expression level, promoters of various strengths can be employed to express the transgene. 
Also, the number of copies of the integrated transgene can be determined and compared for a 
determination of the expression level of the transgene. Animals generated by such methods will 
find use as animal models of cancer and are additionally useful in screening for modulators to 
treat cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In diagnostic and research applications, such kits may include 
at least one of the following: assay reagents, buffers, cancer-specific nucleic acids or antibodies, 
hybridization probes and/or primers, antisense polynucleotides, ribozymes, dominant negative 
cancer polypeptides or polynucleotides, small molecule inhibitors of cancer-associated 
sequences, etc. A therapeutic product may include sterile saline or another pharmaceutically 
acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing instructions (e.g., 
protocols) for the practice of the methods of this invention. While the instructional materials 
typically comprise written or printed materials, they are not limited to such. A medium capable 
of storing such instructions and communicating them to an end user is contemplated by this 
invention. Such media include, but are not limited to, electronic storage media (e.g., magnetic 
discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such media may 
include addresses to internet sites that provide such instructional materials. 

The present invention also provides for kits for screening for modulators of cancer- 
associated sequences. Such kits can be prepared from readily available materials and reagents. 
For example, such kits can comprise one or more of the following materials: a cancer-associated 
polypeptide or polynucleotide, reaction tubes, and instructions for testing cancer-associated 
activity. Optionally, the kit contains biologically active cancer protein. A wide variety of kits 
and components can be prepared according to the present invention, depending upon the 
intended user of the kit and the particular needs of the user. Diagnosis would typically involve 
evaluation of a plurality of genes or products. The genes will typically be selected based on 
correlations with important parameters in disease which may be identified in historical or 
outcome data. 

EXAMPLES 

Example 1: Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and 
analyzed using gene chips. RNA was isolated and gene chip analysis was performed as 
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981- 
993). 
... abnormalities of tissues or organs listed in Tables 2A and 3A, for which markers have been identified, and other related medical 
conditions (including various stages and/or metastases) in which those markers will also be useful, e.g., in therapeutic, diagnostic, prognostic, subsetting, vaccine, and other uses. 



- — r~ - — • >-»•• »"••»> « * ww*. »iu;, oiiiuiuauidiuiia, eiiuoi.ieniL.3is, presumed ocular histoplasmosis synarome, hypoxia, solid tumors 

lymphomas lymphadenitis, lymphangitis, autoimmune diseases (e.g., RA, SLE, juvenile chronic arthritis, pigmented villonodular synovitis, etc.), retinal neovascularization 
uSe^broids 9 '' retln °P ath y' macular degeneration, presumed ocular histoplasmosis syndrome, etc.), scleritis/conjunctivitis, hypertrophic scars (keloid), birth control, 

pineocytoma, neurofibroma, neurofibrosarcoma, malignant peripheral nerve sheath tumors, granular cell tumors, plexosarcoma, ganglioneuroblastoma ne'uroecith*rna 
neuroma, ganglioneuroma 1 

breast ductal carcinoma in situ, lobular carcinoma in situ 

cervix: cancer of the cervix, vagina, or vulva 

colon/rectum: precancerous colorectal disease (e.g., neoplastic polyps (adenomas), familial adenomatous polyposis, ulcerative colitis), colon cancer, e.g., epithelial tumor (e g 
oJXJXS S ^ am ° US Ce " Ca ' dn0ma ' a <^asquamous carcinoma, undifferenta.ed ca " 

olitis Crohn's disease (granuiomatou colitis) dyspla ia), ecal 
carcinoma, papillary villous carcinoma, mucinous adenocarcinoma, melanoma) 
esophagus: premalignant or predisposing conditions (e.g., esophagitis), squamous cell cancers (e.g., cancers of the head and neck, lung, or cervix), gastrodigestive carcinomas 
(e.g., cancers of the stomach, colon, or rectum) 
fibrosis: lung fibrosis (idiopathic pulmonary fibrosis hype iv I n imonifs interstitial pneumonitis, nonspecific idiopathic pneumonitis), chronic obstructive pulmonary disease 
(e emphycm rhron asthma, bronchiect to t | e trma, wound healing , P aryQlsesse 

^^J^J^^^^-^^^-^^o^^ oralpharyn, l r ,ryn I ( opharynx, salivary glands, paragangliomas, esophagus 
kidney, clearcell (nonpapillary) carcinoma, papillary carcinoma, chromophobe renal carcinoma, hypernepl ima adenrx , c inon i p radii renal carcinomas heredterv renal 
carcinomas(vonHippel-Lindaudisease),carcinomaoftherenalpelvis,ureteralcarcncna ft papilla j yolip-oma on ma 

S^'r^h^ rh ° 1r ' " " 6 ' ' ! cu,a '^P homa . Newell lymphoma, Burkittlymphoma, plasma cell neoplasms, mantte cell 

ymphomalymphoplasmacyticlymF c r II I h * T I I em a/Iymphoma, Hodgkin disease, acute myelogenous leukemia, chronic myelogenous 
leukemia, thymic hyperplasia hairy cell leukemia malig reformat! propriale a-t,vat,on o abnormalities of leukocytes (e g , immature, precursor B (pri-B) or 
, monocytes, neutrophfls eosinophils, basophils, dendritic cells, lymphoblasls), arthritis, inflammation, leukocytosis, lymphadenitis, 
n r pe it ymphrjdcnit r r vound healing 



J T(pr< T)lyi iphoi ,tes mi i 
, ; .. r ,,jngitis, bacteremia, chronic noncpeunc iy ui-ji i^-r^i hhl., paouasis, wouna neanng 
" Ven h ga P Wadder 9 oV W?duct B ' C) ' ^ '"""^ ^ ^ * condilions ' prinraty ma " 9nailt epithelial lumors ' P™ ar V mali 9 nar,t mesenchymal tumors, tumors of the 

lung: lung cancer, small cell lung carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell carcinoma, adenocarcinoma, large cell lung carcinoma, carcinoid 
granulomas ) fb r ■ m ) t ncobstSen^narv 


ovary: ovarian carcinoma (e.g., epithelial (serous tumors, mucinous tumors, endometrioid tumors), germ cell (e.g., teratomas, choriocarcinomas, polyembryomas, embryomal 

Soma Somyoma ^ tUm ° r ' dyS9erm ™ ma ' 9°" ad °°'a-lama), stra ™- carcinomas (e.g., granulosal stromal cell tumors)), fallopian tube carcinoma, peritoneal 
pancreas: adenocarcinoma, ductal adenocarcinoma, mucinous cyst adenocarcinoma, acinar cell carcinoma, unclassified large cell carcinoma, small cell carcinoma, 
nmsi J^^hS i i ' uc !" ectali ' mucin hypensecretinj tu i iscysiadenc i, papilla ystic neopla ou denon diabct r.cliti hronc pancreatitis 

neoplasms (e.g., mesenchymal and lymphoma), germ cell tumors, prostatic intraepithelial neoplasia (PIN), hormone independent prostate cancer, benign prostate hyperplasia 



skin/melanoma: melanoma lentigo (common benign localized hyperplasia of melanocytes), nevocellular nevi (congenital or acquired neoplasm of melanocytes), actinic keratosis 
^vergrov4hofouterlayersofskin),basalcellcarcinoma,Merkelcellcarcinoma,benignfbou h , (d neo| fibroblasts and tocytes) 

dermatofibrosarcoma protuberans (well differentiated fibrosarcoma of the skin), xanthomas (tumor-like collections of foamy histiocytes within the dermis), dermal vascular 
umors, seborrheic keratoses (benigr i tumor), acanthosis nigricans (benign or malignant hypcrpl^a and hyperpigmentata ,1 ( ua:i u cell c rcinomasoftheskin 

lung, cervix, esophagus, uterus, head, neck, or bladder 

soft tissue: soft tissue tumors (e.g., fibrosarcoma, liposarcoma, leiomyosarcoma, histiocytoma, fibrohistiocytic sarcoma), smooth muscle tumors (e.g., rhabdomyoma, 
rhabdomyosarcoma), tumors of the blood and lymph vessels (e.g., angiosarcoma, lymphangioma, Kaposi's sarcoma), perivascular tumors (e.g., glomus tumors 
hemangiopenc) na) yno It ors 9 ., mesothelioma), neural tumor g., neurofib r ieurofibro! ,, malignant peripheral nerveshealt I tumors ; granu'la I 
tumors, plexosarcoma, ganglioneuroblastoma, neuroepitheli ra e, r let I Ewing's sarcoma, schwannorr neurorr ganqlior rorr.a) paraq Jroma extraskeletal 
cartilaginous and osseous tumors (e.g., chondrosarcoma, osteosarcoma), pluripotential mesenchymal tumors, epithelioid sarcomas, rhabdoid tumors, desmoplastic small cell 

^ mX> ^IWM'^^^^^^ m ' ade,loaoa " thoma ' caicinoid ' lei °'WosarCTma. SasMBs (chronic atrophic, H. pylori associated), hyperplastic polyps, lipoma, 
testicles: germ cell tumors (including seminomas, embryonal carcinomas, teratomas, choriocarcinomas, yolk sac tumors), sex cord stromal tumors (including Leydig cell tumors 
^^^^f^^^*™*}™*^** 1 mtr ' " r n-*blastomas),Wxaland P aratestolar^ 
tissue sarcomas, and adnexal of the rete testes), miscellaneous neop asm- inr iding r , hnphoma, and cysts) 

1 J - ™' J — etrioid. papillary serous, clearcell, rr ■' ■ " 

i (e.g., malignant mi 



Tables 2B-72B, 76B and 79B list accession numbers for Pkeys lacking UnigeneID's for Tables 2A-72A, 76A, and 79A, respectively. For each probeset is listed gene cluster number 
from which oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on 
sequence similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). Genbank accession numbers for sequences comprising each cluster are listed in the 

Tables 2C-72C, 76C, and 79C list genomic positioning for Pkeys lacking Unigene ID's and accession numbers in Tables 2A-72A, 76A, and 79A, respectively. For each predicted 
exon is listed genomic sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Table 2A, Disease Indications and Preferred Utilities for Selected Genes 

Table 2A provides preferred disease indications and preferred utilities for about 413 selected genes. These genes were identified using Eos/Affymetrix Genechip arrays. 

Pkey: Unique Eos | 

ExAccn: Exemplar Accession 
UnigeneID: Unigene ID number 
Unigene Title: Unigene gene title 

Disease: preferred diseases indicated for selected gene as described in Table 1 and abbreviated as follows: blad (bladder diseases), angio (blood vessel diseases), EWS 
(bone diseases), glio (brain diseases), breast (breast diseases), cerv (cervical diseases), colon (colorectal diseases), esoph (esophageal diseases), fibro (fibrosis 
diseases), headnk (head & neck diseases), leio (leiomyoma diseases), leuk (leukocyte diseases), hepC (liver diseases), lung (lung diseases), ovar (ovarian 
diseases), endo (ovarian endometrioid diseases), omuc (ovarian mucinous diseases), panc (pancreatic diseases), pros (prostate diseases), renal (renal diseases), 



Pkey; ExAccn; UnigeneID; Unigene Title; Disease; Utility 



he text and abbreviated as follows: CTL (DNA vaccine target), diag (diagnostic or prognostic target), mAb 
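
As an illustration only, the semicolon-delimited rows of Table 2A can be read programmatically. The following Python sketch assumes the six-field order stated above (Pkey; ExAccn; UnigeneID; Unigene Title; Disease; Utility), that disease abbreviations within a row are separated by commas, and that utility codes are joined by "+". The names `Row` and `parse_row` are hypothetical and are not part of the disclosure.

```python
from typing import List, NamedTuple


class Row(NamedTuple):
    """One Table 2A record (field names are illustrative assumptions)."""
    pkey: str
    ex_accn: str
    unigene_id: str
    title: str
    diseases: List[str]
    utilities: List[str]


def parse_row(line: str) -> Row:
    """Split a 'Pkey; ExAccn; UnigeneID; Unigene Title; Disease; Utility' row.

    The first three and last two fields are taken positionally, so a
    title that itself contains ';' is kept intact.
    """
    parts = line.split(";")
    if len(parts) < 6:
        raise ValueError(f"expected at least 6 fields, got {len(parts)}: {line!r}")
    pkey, ex_accn, unigene_id = (p.strip() for p in parts[:3])
    title = ";".join(parts[3:-2]).strip()
    # Disease abbreviations are comma-separated; utility codes are '+'-joined.
    diseases = [d.strip() for d in parts[-2].split(",") if d.strip()]
    utilities = [u.strip() for u in parts[-1].split("+") if u.strip()]
    return Row(pkey, ex_accn, unigene_id, title, diseases, utilities)


row = parse_row("118695; AK000465; Hs.50081; KIAA1199 protein; colon, lung; diag")
print(row.pkey, row.diseases, row.utilities)
```

Rows that lack an exemplar accession or Unigene ID (shown as empty fields in the table) parse to empty strings under this sketch. Rows damaged in reproduction may not have six fields and would raise an error rather than be silently misread.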



102892; BE440042; Hs.83326; matrix metalloproteinase 3 (st; headnk; mAb+diag+s.m. 
104865; T79340; Hs.22575; B-cell CLL/lymphoma 6, member ; angio; CTL 
104978; AI199268; Hs.19322; Homo sapiens, Similar to RIKEN; colon, lung, pros, blad, stom; CTL 
109424; NM_005329; Hs.85962; hyaluronan synthase 3; blad, lung; mAb+s.m. 
110765; AK000322; Hs.18457; hypothetical protein FLJ20315; colon, pros, stom, uter; mAb+diag 
110906; AA035211; Hs.17404; SOX7 SRY (sex determining regi; angio, blad; CTL 
115522; BE614387; Hs.333893; c-Myc target JP01; colon, lung, blad, panc; CTL 
116176; AA311152; Hs.288708; hypothetical protein FLJ21562; colon; CTL 
118695; AK000465; Hs.50081; KIAA1199 protein; colon, lung; diag 
123049; BE047680; Hs.211869; dickkopf (Xenopus laevis) homo; EWS; mAb+diag 

131486; F06972; Hs.27372 nd itl ( li il vocine knase(E; angio; CTL+s.m. 

133370; AF245505; Hs.72157; Adlican; breast, lung, panc; diag 
310016; AW449512; Hs.152475; ESTs; colon; CTL 
322303; AI357412; Hs.157601; ESTs; colon, pros, fibro, breast; CTL+diag 

400289; X07820; Hs.2258; matrix metalloproteinase 10 (; angio, blad, lung, cerv, ovar, headnk, esoph; mAb+diag+s.m. 
400297; AI127076; Hs.288381; hypothetical protein DKFZp5640; breast, blad, colon, pros; mAb 
400303; AA242758; Hs.79136; LIV-1 protein, estrogen regula; breast, ovar, pros, stom, uter, blad, lung, headnk; mAb 
400843; ; ; NM_003105':Homo sapiens sorlil; blad; s.m. 
402075; ; ; ENSP00000251056*:Plasma membra; blad, lung, headnk, cerv, mela, esoph; mAb+diag 
402901; ; ; NM_025206*:Homo sapiens hypoth; blad; CTL 
404287; ; ; FGENESH predicted novel CUB-do; panc, lung, colon, uter, esoph; mAb+s.m. 
404682; ; ; ortholog of mouse polydomain p; panc; diag 
404875; ; ; NM_022819':Homo sapiens phosph; blad; CTL+s.m. 
404977; ; ; Insulin-like growth factor 2 (; blad, ovar, sarc; mAb+diag 
405033; ; ; C1002652':gi|544327|sp|Q04799|; blad; s.m. 
06401 f llikrein 8 (i u nan j/ar uter; diag 
406964; M21305; ; FGENES predicted novel secrete *u b i fit diag 
407603; AW955705; Hs.62604; Homo sapiens, clone IMAGE:4299; glio, blad; CTL 
407792; AI077715; Hs.39384; putative secreted ligand homol; ovar, uter, cerv, panc; mAb+diag 
407811; AW190902; Hs.40093; cysteine knot superfamily 1, B; blad, panc, stom, uter, lung, esoph; diag 
407836; T79340; Hs.22575; B-cell CLL/lymphoma 6, member ; angio; CTL 
407975; X89426; Hs.41716; endothelial cell-specific mole; angio, renal; diag 
408243; Y00787; Hs.624; interleukin 8; blad, stom, headnk, cerv, lung, angio, esoph, panc; diag 
408367; AK001178; Hs.44424; Homo sapiens orphan neurotrans; mela; mAb+s.m. 
408369; R38438; Hs.118747; SLC15A2 Solute carrier family ; pros, lung, fibro, uter, glio, cerv, ovar; mAb 
408380; AF123050; Hs.44532; diubiquitin; lung, blad, headnk, panc, stom, fibro, esoph, mela; CTL 
408482; NM_000676; Hs.45743; adenosine A2b receptor; lung, esoph, headnk, colon; mAb+s.m. 
408562; AI436323; Hs.31141; roundabout (axon guidance rece; uter, fibro, sarc; mAb+s.m. 
408790; AW580227; Hs.47860; neurotrophic tyrosine kinase, ; lung; mAb+s.m. 
408908; BE296227; Hs.250822; serine/threonine kinase 15; blad, lung, headnk, stom, colon; s.m. 
409041; AB033025; Hs.50081; Hypothetical protein, XPJ5186; uter, ovar, lung, colon, stom, headnk, breast, panc; CTL+diag 
409079; W87707; Hs.82065; interleukin 6 signal transduce; breast, pros; mAb+s.m. 
409103; AF251237; Hs.112208; XAGE-1 protein; lung; CTL 
409178; BE393948; Hs.50915; kallikrein 5; ovar, breast, mela; diag 

409220; BE24332 H 1 t r necrosis factor i angio, renal colon, stom; mAb^.m. 

409420; Z15008; Hs.54451; laminin, gamma 2 (nic ir,l lun hi Ink, pai n cerv esoph, blad; diag 

409632; W74001; Hs.55279; serine (or cysteine) proteinas; lung, blad, headnk; diag 

409663; AI743750; Hs.98306; KIAA1862 protein; renal; CTL 
409757; NM_001898; Hs.123114; cystatin SN; panc, stom, lung, blad; diag 
409889; AW630041; Hs.56937; suppression of tumorigenicity ; colon, ovar, pros; mAb+s.m. 

409893, AW247090 H 57101; minich mosom mainlenam defiling erv blad, tr t e oph; CTL+s.m. 

409956; AW103364; Hs.727; inhibin, beta A (activin A, x; breast, panc, ovar, colon, headnk, lung, blad, esoph; diag 

410001;AB041036;Hc r 7-'i,MiKr 11 ovar, pros, uter, cerv, lung ; diag 

410055; AJ250839; Hs.58241; gene for serine/threonine prot; renal; s.m. 

410153; BE31 1926; Hs. 15830; hypothetical proteir I Lj 26 1 renal.bl d; CTL 

410274;AA381807;Hs.336402, hyp xi i ducible pr I: n >; I ing renal; CTL 

410309, BE043077 H 2781 3 .I, siaiyllransferasell;panc;s.m. 

410407; X66839; Hs.53287; carbonic anhydrase IX; renal, lung, colon, stom, ovar, uter, blad, sarc; mAb+s.m. 
410418; D31382; Hs.63325; transmembrane protease, serine ; colon, blad, lung, ovar, panc 
411274; NM_002776; Hs.69423; kallikrein 10; colon, ovar, uter, cerv, headnk, panc; diag 
411411; M345241; Hs.55950; ESTs, Weakly similar to KIAA13; renal; mAb+s.m. 
411773; NM_006799; Hs.72026; protease, serine, 
411975; AI916058; Hs,144583; 3'UTRof; dead rin 
412078; X59699; Hs.73149; paired box gene 8; ovar; CTL 
412140; AA219691; Hs.73625; RAB6 interacting, kinesin-like; lung, blad, headnk, breast, ovar, panc, angio, test, mela; s.m. 
412314; M825247; Hs.355084; G protein-coupled receptor 27, ovai uter, t mAb+s.m 
412509; Z48804; Hs.74124; ocular albinism 1 (Nettleship-; mela; s.m. 
412628; AI972402; Hs.306051; hypothetical protein MGC2648; pros; diag 
412709; AL022327; Hs.74518; KIAA0027 protein; glio, sarc; mAb+s.m. 
412719; AW016610; Hs.816; ESTs; lung, headnk, blad, glio, cerv, sarc; s.m. 
412959; D87458; Hs.75090; KIAA0282 protein; glio; CTL+s.m. 
413048; M93221; Hs.75182; mannose receptor, C type 1; fibro, panc; mAb 
ist, cerv, ovar, headnk, esoph, mela; CTL+s.m. 

413385; M34455; Hs.840; indoleamine-pyrrole 2,3 dioxyg; blad, lung, mela, fibro, uter, sarc; s.m. 
413554; AA319146; Hs.75426; secretogranin II (chromogranin; panc, glio; diag 

413719; BE439580; Hs.75498; small r lucit cytokine subf; leul r t n , lung, headnk, cere, colon, uter.stom, esoph; diag 

414555; N98569; Hs.76422; phospholipase A2, group IIA (p; pros; s.m. 
414577; AI056548; Hs.378938; hypothetical protein FLJ20992 ; angio; CTL+diag 
414774; X02419; Hs.77274; plasminogen activator, urokina; lung, blad, headnk, panc, stom, ovar, esoph; diag 
414812; X72755; Hs.77367; monokine induced by gamma inte; breast, blad, lung, fibro, panc, colon, headnk, cerv, stom, renal, ovar, test, mela, esoph; diag 

414883;M926960;Hs,348669;CDC28 protein kma 1 u , va f jn. ^r i '-adnk Lst.sm 

414907; X90725; Hs.77597; polo (Drosophila)-like kinase; blad, lung, ovar, test; s.m. 
414991; C17898; ; Homo sapiens up-regulated by B; fibro, lung; mAb+diag 
415138; C18356; Hs.295944; tissue factor pathway inhibitor; angio, panc, stom, lung, uter; CTL+diag 
415539; AI733881; Hs.72472; NAME OMITTED ... receptor kinase; breast; mAb+s.m. 

415668; AW957684; Hs.306814; Homo sapiens lysyl oxidase*; mela; diag 

415669; NM_005025; Hs.78589; serine (or cysteine) proteinas; lung; mAb+diag+s.m. 

415817; U88967; Hs.78867; protein tyrosine phosphatase, ; lung, glio, headnk, cerv, mela, esoph, fibro; mAb+s.m. 

415929; AA724373; Hs.304950; Homo sapiens mucolipin-3 (MCOL; mela; mAb 

41 5091; AF295370; Hs.283082; defensin, beta 3; headnk, esoph, mela; CTL+diag 

416209; AA236776; Hs.79078; MAD2 (mitotic arrest deficient; lung, headnk, colon, uter, stom; CTL+s.m. 

416250; AA581386; Hs.73452; Kremen 2; esoph, lung, cerv, ovar; mAb+s.m. 

416530; U62801; Hs.79361; kallikrein 6 (neurosin, zyme); ovar, uter; diag 
416636; N32536; Hs.42645; solute carrier family 16 (mono; breast, panc, uter, mela; mAb+s.m. 
416558; U03272; Hs.79432; fibrillin 2 (congenital contra; lung, ovar, uter, blad, angio, test, sarc; diag 
416836; D54745; Hs.80247; cholecystokinin; pros, EWS, glio; diag 
41685?; AA138775; Hs.292453; FGENESH predicted TM containin; glio; mAb+s.m. 
416965; N26223; Hs.160436; MDAC1; fibro, ovar, uter; mAb 
417034; NM_006183; Hs.80962; neurotensin; lung, headnk, cerv; diag 

417079; U65590; Hs.81134; interleukin 1 receptor antagon; blad, lung, headnk, cerv, esoph; diag 

417166; AA431323; Hs.42146; Paired box protein Pax-3; mela, sarc; CTL 

417389; BE260964; Hs.82045; midkine (neurite growth-promot; ovar, lung, blad, uter, cerv, panc, stom, mela, test, colon, sarc; mAb+diag 
417433; BE270266; Hs.82128; 5T4 oncofetal trophoblast glyc; panc, breast, blad, lung, headnk, cerv, uter, ovar, stom, renal; mAb 
417771; AA804698; Hs.82547; retinoic acid receptor respond; blad, cerv, panc, pros, ovar; mAb 
417866; AW067903; Hs.82772; collagen, type XI, alpha 1; lung, panc, breast, ovar, headnk, stom, sarc; CTL 
417931; W95642; Hs.82961; trefoil factor 3 (intestinal); ovar, panc, stom, colon, uter, pros; diag 

4179 3; X02303; H 82962 nyind/latesynthelase, blad, lung, angio, colon, pane esoph; s.m. 

418007; M13509; Hs.83169; matrix metalloproteinase 1 (In; lung, blad, fibro, headnk, panc, stom, colon, ovar, esoph, mela; mAb+diag+s.m. 
418030; BE207573; Hs.83321; neuromedin B; glio, panc; diag 

418064; BE387287; Hs.83384; S100 calcium-binding protein, ; mela; diag 

418281; U09550; Hs.1154; oviductal glycoprotein 1, 120k; uter, ovar; CTL+diag 

418478; U38945; Hs.1174; cyclin-dependent kinase inhibi; lung, blad, ovar, headnk, panc, cerv, mela, sarc; s.m. 
418506; AA084248 l+ 72651 Unknown proieii fo iT( 23~43 aigio, ova-, glio, uter, lung, blad, pane, mela, sarc mAb+diag 
418526; BE019020; Hs.85838; solute carrier family 16 (mono; lung, blad, renal, panc, stom, colon, ovar; mAb+s.m. 
418558; AW082266; Hs.86131; Fas (TNFRSF6)-associated via d; esoph, headnk; s.m. 

18678; NM_001327; H I I bled torn ovar, pane, esoph, cerv, sarc; CTL 

418738; AW388633; Hs.6682; solute carrier family 7, (cati; angio, lung, ovar, blad, colon, stom, panc, uter, leuk; mAb+s.m. 
418830; BE513731; Hs.348874; hypothetical protein MGC4816; lung; CTL 
418867; D31771; Hs.89404; msh (Drosophila) homeo box hom; blad; s.m. 
418870; AF147204; Hs.89414; chemokine (C-X-C motif), recep; leuk, ovar, breast, blad, renal; mAb+s.m. 

I19080;AW15 835 Hs I3378 hypothetical protcii 162 1 I mrutor lung « Tl 

419121; AA374372; Hs.89626; parathyroid hormone-like hormo; lung, esoph, headnk, blad; diag 

419171, NMJJ02846 H' protein tyi e phosph e lung mAb+s.m. 

419172. AW338 25 F 20 I T nilar R *i Mr IE \H mo renal mAb+s.m. 
419183; U60G6S; Hs.89663; cytochrome P450, subfamily XXI; blad, lung, headnk, panc; CTL+s.m. 
419216; AU076718; Hs.164021; small inducible cytokine subfa; panc, lung, stom, cerv, pros, headnk, esoph; diag 
419235; AW470411; Hs.288433; neurotrimin; panc, fibro, headnk, lung; mAb+diag 
419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; ovar, pros, lung, breast, uter, test, panc, stom, sarc; mAb+s.m. 
419508; AW997938; Hs.90786; ATP-binding cassette, sub-fami; glio, omuc, stom, lung, panc, colon, renal, uter; mAb+s.m. 
419556; U29615; Hs.91093; chitinase 1 (chitotriosidase); lung, fibro, test; mAb+diag 

419704; AA429104; Hs.45057; ESTs; glio; CTL+s.m. 

419723; AL120193; Hs.339810; longevity assurance (LAG1, S. ; glio; mAb+diag 

419741; NM_007019; Hs.93002; ubiquitin carrier protein E2-C; blad, lung, colon, ovar, test, esoph, mela, sarc; CTL+s.m. 
419833; AA251131; Hs.220697; Homo sapiens tryptophanyl-tRNA; fibro, stom, blad, esoph, uter; diag 
420159; AI572490; Hs.99785; Homo sapiens cDNA: FLJ21245 S; blad, stom; mAb 

420162; BE378432; Hs.95577; cyclin-dependent kinase 4; lung, mela, sarc; s.m. 

420370; Y13645; Hs.97234; uroplakin 2; blad; mAb 

420440; NM_002407; Hs.97644; mammaglobin 2; ovar, uter, cerv; diag 

420602; AF060877; Hs.99236; regulator of G-protein signall; headnk, glio, cerv, mela; CTL+s.m. 

420610; AI683183; Hs.99348; distal-less homeo box 5; uter, endo, lung; CTL 

420737; L08096; Hs.99899; CD70 ; tumor necrosis factor ; renal; mAb+s.m. 

420876; AA918425; Hs.177744; FGENES predicted novel protein; panc, blad; s.m. 
421110; AJ250717; Hs.1355; cathepsin E; blad, panc, stom, lung, fibro, ovar, esoph; s.m.+diag 
421340; F07783; Hs.1369; decay accelerating factor for ; angio, panc, stom; diag 
421379; Y15221; Hs.103982; small inducible cytokine subfa; breast, panc, headnk, lung, stom, blad, cerv, colon, leuk, fibro, test, mela, esoph; diag 
421471; U90545; Hs.327179; solute carrier family 17 (sodi; renal; mAb+s.m. 
421474; U76362; Hs.104637; solute carrier family 1 (gluta; lung; mAb+s.m. 
421524; AA312082; Hs.105445; GDNF family receptor alpha 1; breast; mAb+s.m. 
421552; AF026692; Hs.105700; secreted frizzled-related prot; breast, ovar, panc, cerv, uter, pros, lung, stom, headnk; diag 

421563 NM_0064SC h 10 80! ui,sm fibro, dag 

421574; AJ000152; Hs.105924; defensin, beta 2; headnk, lung; CTL+diag 

421582; AI910275; Hs.350470; trefoil factor 1 (breast cance; breast, pane, lung, omuc; diag 
421659; NM_014459; Hs.106511; protocadherin 17; fibro; mAb
421753; BE314828; Hs.107911; ATP-binding cassette, sub-fami; lung; mAb+s.m.
421817; AF146074; Hs.108660; ATP-binding cassette, sub-fami; lung, cerv, headnk, blad; mAb+s.m.
421829; AB018330; Hs.108708; calcium/calmodulin-dependent p; pros; s.m.
422048; NM_012445; Hs.288126; spondin 2, extracellular matri; pane, pros, sarc; diag
422083; NM_001141; Hs.111256; arachidonate 15-lipoxygenase, ; pros; s.m. 
422109; S73265; Hs.1473; gastrin-releasing peptide; pane, lung, colon, fibro; diag 
422158; L10343; Hs.112341; protease inhibitor 3, skin der; headnk, blad, lung, cerv, stom, esoph; diag 
422192; M305159; Hs.113019; 1!s485; mela; s.m.

422260; AA315993; Hs.105484; regenerating gene type IV; colon, omuc, stom, pane; mAb+diag 
422282; AF019225; Hs.114309; apolipoprotein L; blad, lung, headnk, renal; diag 
422283; AW411307; Hs.114311; CDC45 (cell division cycle 45,; lung, blad, test, cerv, headnk, esoph; s.m.
422330; D30783; Hs.115263; epiregulin; pane, colon, blad; mAb+diag

422337; AJ223366; Hs.116051; MYEOV Myeloma overexpressed ge; pane, stom, colon, esoph, renal, blad; CTL+s.m.
422424; AI186431; Hs.296638; prostate differentiation facto; blad, pane, pros, angio, colon, stom, lung, mela; diag 
422578; AF239666; Hs.1545; caudal type homeo box transcri; colon; CTL
422627; BE335857; Hs.118787; transforming growth factor, be; colon, renal, sarc; mAb+diag 
422765; AW409701; Hs.1578; baculoviral IAP repeat-contain; lung, blad; s.m.
422809; AK001379; Hs.121028; hypothetical protein FLJ10549; blad, cerv, lung, uter, angio, stom, test; s.m.
422867; L32137; Hs.1584; cartilage oligomeric matrix pr; breast, ovar, pros, pane, lung, colon, uter, sarc; diag 
422956; BE545072; Hs.122579; ECT2 protein (Epithelial cell ; ovar, blad, pane, lung, headnk, colon, stom; CTL+s.m.
423161; AL049227; Hs.124776; downstream of cadherin 6 (by 3; renal, ovar, blad; mAb+s.m.
423184; NM_004428; Hs.1624; ephrin-A1; pros, pane, renal, colon; mAb+s.m.
423242; AL039402; Hs.125783; DEME-6 protein; breast, renal, ovar, pros, colon; CTL 
423508; AW604297; Hs.129711; hepatitis A virus cellular rec; renal, colon; mAb
423583; AL122055; Hs.129836; KIAA1028 protein; pros; s.m. 
423634; AW959908; Hs.1690; heparin-binding growth factor ; lung, blad, headnk, pane; diag

423673; BE003054; Hs.1695; matrix metalloproteinase 12 (m; blad, lung, headnk, ovar, pane, colon, stom, uter, cerv, esoph, test; mAb+diag+s.m. 
423936; U77629; Hs.135639; achaete-scute complex (Drosoph; colon, stom, ovar; CTL

423961; D13666; Hs.136348; periostin (OSF-2os); breast, colon, blad, lung, fibro, pane, headnk, ovar, mela, sarc; mAb+diag 
424008; R02740; Hs.137555; putative chemokine receptor; G; blad, headnk, stom, cerv, esoph; mAb+s.m. 
424046; AF027866; Hs.138202; serine (or cysteine) proteinas; headnk, lung, cerv; diag 
424321; W74048; Hs.1765; lymphocyte-specific protein ty; mela, fibro; s.m.

424381; AA285249; Hs.146329; protein kinase Chk2 (CHEK2); tur

424411; NM_005209; Hs.146549; crystallin, beta A2; pane, sarc; s.m.
424562; AF242388; Hs.149585; lengsin; lung; s.m.

424503; NM_002205; Hs.149609; integrin, alpha 5 (fibronectin; pane, pros, angio, blad, lung; mAb+s.m.
424620; AA101043; Hs.151254; kallikrein 7 (chymotryptic, st; ovar; diag

424687; J05070; Hs.151738; matrix metalloproteinase 9 (ge; headnk, pane, lung, blad, uter, cerv, colon, stom, test, mela, sarc; diag

424735; U31875; Hs.272499; short-chain alcohol dehydrogen; blad, breast; CTL+s.m.

424825; AF207069; Hs.153357; procollagen-lysine, 2-oxogluta; mela; CTL+s.m.

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a); ovar, blad, lung, headnk, pane, stom; s.m.

424917; AI636208; Hs.96901; hypothetical protein FLJ23049; fibro, uter, ovar; CTL

424943; AU077260; Hs.153924; death-associated protein kinas; fibro; s.m.

425009; X58288; Hs.154151; protein tyrosine phosphatase, ; renal, fibro; mAb+s.m. 

425071; NM_013989; Hs.154424; deiodinase, iodothyronine, typ; pros, colon, stom, uter, cerv, headnk, esoph, pane; diag
425115; R44664; Hs.123956; downstream of. G protein-coup; glio; mAb+s.m. 

425247; NM_005940; Hs.155324; matrix metalloproteinase 11 (s; breast, ovar, lung, colon, pane, headnk, stom, uter, cerv, blad, esoph, sarc; mAb+diag+s.m.
425263; NM_001197; Hs.155419; BCL2-interacting killer (apopt; pros; s.m.
425322; U63630; Hs.155637; protein kinase, DNA-activated,; lung, headnk; s.m.
425535; AB007937; Hs.158287; syndecan 3; mela, glio; mAb+s.m.

425650; NM_001944; Hs.1925; desmoglein 3 (pemphigus vulgar; lung, headnk, cerv, esoph, blad; mAb
425721; AC002115; Hs.159309; uroplakin 1A; blad; mAb

425723; NM_014420; Hs.159311; dickkopf (Xenopus laevis) homo; endo, uter, colon; CTL+diag
425734; AF056209; Hs.159396; peptidylglycine alpha-amidatin; lung; s.m.
425776; U25128; Hs.159499; parathyroid hormone receptor 2; ovar, uter, lung; mAb+diag 
425842; AI587490 Hs.1C 623 Nk (Drosophila) hon kj B; pan lio; s.m 
425852; AK001504; Hs.159651; death receptor 6, TNF superfam; blad, lung, headnk; mAb+s.m. 
425883; AL137708; Hs.151031; Homo sapiens mRNA; cDNA DKFZp4; blad, pane; mAb
425998; AU076629; Hs.165950; fibroblast growth factor recep; renal; mAb+s.m. 
425028; NM_001110; Hs.172028; a disintegrin and metalloprote; blad; mAb+diag
426215; AW96341^ 1 r r «atan 2; breast lung, renal, colon, ovar, ute . 

426227; U67058; Hs.154299; Human proteinase activated rec; pane, lung, colon, esoph, stom; mAb+s.m. 
426322; J05068; Hs.2012; transcobalamin I (vitamin B12 ; pane, blad, stom; diag 
426344; H41821; Hs.322469; transcriptional activator of t; glio; CTL+s.m.

426427; M86699; Hs.169840; TTK protein kinase; ovar, lung, headnk, cerv, colon, uter, stom, test; CTL+s.m.
426451; AI908165; Hs.169946; GATA-binding protein 3 (T-cell; blad, breast; s.m. 
426514; BE616633; Hs.170195; bone morphogenetic protein 7 (; ovar, colon, blad, lung, cerv; mAb+diag 
426600; NM_003378; Hs.171014; VGF nerve growth factor induci; mela, sarc; diag

426761; AI015709; Hs.172089; PORIMIN Pro-oncosis receptor i; lung, esoph, pros, uter, pane, colon, ovar, headnk; mAb+s.m.
426812; AF105365; Hs.172613; solute carrier family 12 (pota; renal; mAb+s.m.
426890; AA393167; Hs.41294; ESTs; renal, colon, ovar, uter, stom; CTL 
427239 BE270447 ^ i t Tier protein; King bind test, mela, sarc; CTL+s.m. 
427335; AA448542; Hs.278444; G antigen 7B; lung, headnk, blad, mela, esoph, sarc; CTL 
427343; AI880044; Hs.176977; protein kinase C binding prote; glio; CTL+s.m.
427722; AK000123; Hs.180479; hypothetical protein FLJ20116; colon, stom, pane; CTL
427747; AW411425; Hs.180655; serine/threonine kinase 12; blad, lung, ovar, stom, test, esoph, sarc; s.m. 
427923; AW274357; Hs.301406; FGENESH predicted 11 TM protei; mela; mAb 
427969; NM_001963; Hs.2230; epidermal growth factor (beta-; pane; mAb+diag
428093; AW594506; Hs.104830; ESTs; ovar, pane; CTL 
428179; AI127772; Hs.380877; serum/glucocorticoid regulated; breast; s.m. 
428187; AI687303; Hs.285529; G protein-coupled receptor 49; ovar, uter, colon, stom; mAb+s.m. 
428242; H55709; Hs.2250; leukemia inhibitory factor (ch; ovar, pane, leuk, lung; diag
428296; NM_003058; Hs.183572; solute carrier family 22 (orga; renal; mAb+s.m.

428330; L22524; Hs.2256; matrix metalloproteinase 7 (ma; uter, ovar, fibro, pros, pane, lung, blad, headnk, esoph, mela; mAb+diag+s.m.

428368; BE440042; Hs.83326; matrix metalloproteinase 3 (st; headnk, stom, esoph, colon; diag 

428392; H10233; Hs.2265; secretory granule, neuroendocr; pane; diag

428450; NM_014791; Hs.184339; KIAA0175 gene product; ovar, cerv, pane, lung, blad, mela; s.m.

I o Hs.334 , i j 

428484; AF104032; Hs.184601; solute carrier family 7 (catio; lung, blad, headnk, cerv, esoph, glio, uter, stom, colon, mela; mAb+s.m.

428486; AW583497; Hs.184604; pancreatic polypeptide; pane; diag 

428505; AL035461; Hs.2281; chromogranin B (secretogranin ; pane, lung; diag 

428513; BE220806; Hs.184697; plexin C1; mela, pane, breast, stom, headnk; mAb

428579; NM_005756; Hs.184942; G protein-coupled receptor 64; ovar, EWS, uter; mAb+s.m.

428664; AK001666; Hs.189095; similar to SALL1 (sal (Drosoph; blad, ovar, pros, lung, stom, test; CTL+s.m.

428698; M852773; Hs.334838; KIAA1866 protein; breast, colon, lung, pane, stom, headnk, ovar, EWS; mAb

428748; AW593206; Hs.98785; Ksp37 protein; lung, sarc; diag 

428758; AA433988; Hs.98502; CA125 antigen; mucin 16; ovar, cerv, lung, pane, stom, renal; diag 
428778; AK000530; Hs.193326; fibroblast growth factor recep; ovar; mAb+s.m. 

428953; M306610; Hs.348183; tumor necrosis factor receptor; cerv, pane, colon, stom, headnk, renal; mAb+diag 
428969; AF120274; Hs.194689; artemin; lung, cerv; diag 

428970; BE276891; Hs.194691; retinoic acid induced 3 (RAIG; stom, pane, colon, ovar, fibro; mAb+s.m.
429149; AW193360; Hs.197962; Homolog of mouse ADP-ribosylat; glio; mAb+s.m.

429211; AF052693; Hs.198249; gap junction protein, beta 5 (; lung, blad, headnk, cerv, esoph, stom, mela; mAb+s.m.

429263; M019004; Hs.198396; ATP-binding cassette, sub-fami; lung; mAb+s.m. 

429276; AF056085; Hs.198612; G protein-coupled receptor 51; angio, blad, glio; mAb+s.m.

429353; AL117406; Hs.335891; ATP-binding cassette transport; breast, pros; mAb+s.m.

429547; AW009166; Hs.99376; FGENESH predicted novel secret; pane, headnk, lung, ovar; diag 

ill 024937; Hs.211 LUNXpro PUJNCI lu; lu t Ab+Oiag 

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, reg; lung, mela; s.m. 

429910; NM_000867; Hs.2507; 5-hydroxytryptamine (serotonin; leio; mAb+s.m. 

430147; R60704; Hs.234434; hairy/enhancer-of-split relate; glio; s.m. 

430178; AW449612; Hs.152475; 3'UTR of achaete-scute comple; colon, stom, ovar; CTL

430377; NM_001922; Hs.301865; dopachrome tautomerase (dopach; mela; CTL

430413; AW842182; Hs.241392; small inducible cytokine A5 (R; fibro, esoph, mela; diag

430486; BE062109; Hs.241551; chloride channel, calcium acti; lung, blad, headnk, cerv, esoph; mAb+s.m. 

430822; AJ005371; Hs.248017; glyceraldehyde-3-phosphate deh; mela, sarc; s.m.

431130; NM_006103; Hs.2719; HE4; epididymis-specific, whey; ovar, uter; diag 

431462; AW583672; Hs.256311; granin-like neuroendocrine pep; pane, lung, glio, test; diag

431515; NM_012152; Hs.258583; EDG-7 (endothelial differenti; ovar, pros, lung, blad; mAb+s.m.

431620; AA126109; Hs.264981; 2'-5'-oligoadenylate synthetas; esoph, cerv; CTL+s.m.

431629; AU077025; Hs.265827; interferon, alpha-inducible pr; pane, uter, cerv, stom, esoph, mela; mAb+diag

431630; NM_002204; Hs.265829; integrin, alpha 3 (antigen CD4; ovar, pane, blad, headnk, mela, renal; mAb+s.m.

431745; AW972448; Hs.163425; Novel FGENESH predicted cadher; fibro, ovar, uter; mAb 

431840; M534908; Hs.2860; POU domain, class 5, transcrip; test, renal, blad; CTL

431846; BE019924; Hs.271580; uroplakin 1B; lung, blad, headnk, uter, cerv, stom, ovar; mAb+diag

431958; X63629; Hs.2877; cadherin 3, type 1, P-cadherin; lung, blad, cerv, headnk, ovar, colon, pros, pane, breast, esoph, test, mela; mAb+diag 

432101; AI918950; Hs.123642; EphA3; pros, pane, EWS, sarc; s.m.

432179; X75208; Hs.2913; EphB3; ovar, colon; mAb+s.m. 

432196; AW300888; Hs.273230; hypothetical protein FLJ10830; renal; CTL

432201; AI538613; Hs.298241; Transmembrane protease, serine; breast, colon, ovar, stom, pane, uter, cerv, lung; mAb+diag+s.m.
432579; AF043244; Hs.278439; nucleolar protein 3 (apoptosis; renal; CTL 
432596; AJ224741; Hs.278461; matrilin 3; pane, breast, sarc; diag

432S06; NM_002104; Hs.3066; granzyme K (serine protease, g; renal, breast, lung, stom, hepC, fibro, leuk; CTL 
432829; W60377; Hs.57772; ESTs; blad; CTL+s.m. 

432874; W94322; Hs.279651; melanoma inhibitory activity; pane, stom, mela, sarc; diag

432990; AL036071; Hs.279899; tumor necrosis factor receptor; pros, renal; mAb+s.m. 

433001; AF217513; Hs.279905; clone HQ0310 PRO0310p1; colon, breast, lung, blad, cerv, uter, test, mela; s.m.

433447; U29195; Hs.3281; neuronal pentraxin II; mela, esoph, colon, renal; diag 

433848; AF095719; Hs.93764; carboxypeptidase A4; headnk, esoph, lung; s.m.

433867; AK000596; Hs.3618; hippocalcin-like 1; renal; CTL 

434206; AW136973; Hs.362915; ESTs, Weakly similar to S69890; colon, lung, stom; CTL+s.m. 
434276; AF123659; Hs.93605; leucine zipper, putative tumor; mela; s.m. 
434293; NM_004445; Hs.3796; EphB6; blad, pros; s.m.

435013; H91923; Hs.110024; NM_020142:Homo sapiens NADH:ub; renal, lung, sarc; CTL

435472; AW972330; Hs.283022; triggering receptor expressed ; glio; mAb 

435505; AF200492; Hs.211238; interleukin-1 homolog 1; lung, headnk; diag

436456; AW292677; Hs.248122; melanin-concentrating hormone ; mela, glio; mAb+s.m.

436480; AJ271643; Hs.87469; putative acid-sensing ion chan; glio; mAb+s.m. 

436481; AA379597; Hs.5199; HSPC150 protein similar to ubi; lung, blad, colon, ovar, uter, headnk, test; s.m.

436576; AI458213; Hs.77542; ESTs; renal, pane, headnk, lung; mAb+s.m. 

436608; AA628980; Hs.192371; Down syndrome critical region ; blad, lung, sarc; CTL+s.m.

436895; AF037335; Hs.5338; carbonic anhydrase XII; breast, renal, ovar, glio; mAb+s.m.

436961; AW375974; Hs.156704; ESTs; lung, pane, renal, uter, colon; CTL

436982; AB018305; Hs.5378; spondin 1, (f-spondin) extrace; ovar, fibro; diag 

437016; AU076916; Hs.5398; guanine monphosphate synthetas; lung, blad, cerv, esoph, colon, headnk; s.m. 

437044; AL035864; Hs.69517; differentially expressed in Fa; headnk, cerv, lung, blad, breast, pros, ovar, stom, esoph; CTL 

437789; AI581344; Hs.127812; ESTs, Weakly similar to T17330; lung; CTL 

437852; BE001836; Hs.256897; putative GPCR; blad, lung; mAb+s.m. 

438380; T06430; Hs.6194; chondroitin sulfate proteoglyc; glio, mela; diag 

438549; BE386801; Hs.21858; trinucleotide repeat containin; mela, sarc; CTL+diag 

439018; AW300887; Hs.26538; membrane-spanning 4-domains, s; uter, stom, pros, fibro; mAb 

439223; AW238299; Hs.250618; UL16 binding protein 2; lung, headnk, cerv, esoph, leuk, blad, colon; mAb 

439477; W69813; Hs.58042; ESTs, Moderately similar to 6F; lung; mAb+s.m. 

439569, AW6021&O i 9 °1 1 ot n breast, pros, blad; diag 

439606; W79123; Hs.58561; G protein-coupled receptor 87; lung, blad, headnk, cerv, esoph; mAb+s.m. 
439738; BE246502; Hs.9598; sema domain, immunoglobulin do; blad, lung, cerv, renal; mAb+s.m.

439979; AW600291; Hs.6823; hypothetical protein FLJ10430; renal, cerv, pros, headnk, colon, test, sarc; mAb

440006; AK000517; Hs.6844, NALP, in fRIN rl n; blad mg iittstsm 

440065; W03476; Hs.266331; Homo sapiens Fc receptor homol; mela; diag

440304; BE159984; Hs.125395; hepatitis A virus cellular rec; renal, colon, blad; mAb+s.m.

440516; S42303; Hs.161; cadherin 2, type 1, N-cadherin; glio, ovar, uter, renal, hepC; mAb+diag

440672; AF083811; Hs.7345; MAD1 (mitotic arrest deficient; mela; s.m. 

441362; BE614410; Hs.23044; RAD51 (£ cere^ - horn ' - lung blad, h „ ,1 lesl mela esoph;s.m. 

442117; AW664954; Hs.128899; ESTs; hypothetical protein for; breast, lung, blad, pane, headnk, stom, ovar, pros, sarc; mAb+s.m.

442133; AW874138; Hs.129017; ESTs; type Ia transmembrane p; ovar, uter; mAb

442275; AW449457; Hs.54795; Homo sapiens secretoglobin, fa; fibro; diag 

442652; AI005163; Hs.201378; Homo sapiens cDNA FLJ40427 fis; fibro, ovar, uter; CTL

443105; X96753, H c 30( i chcndroitii mate proteoglye; mela; mAb+diag 

443247; BE614387; Hs.333893; c-Myc target JP01; colon, lung, blad, pane; CTL 

443324; R44013; Hs.184225; ESTs; fibro; mAb+diag

443426; AF098158; Hs.9329; chromosome 20 open reading fra; colon, lung, blad, stom, test, mela, sarc; CTL 
443595; AF169312; Hs.9613; PPAR(gamma) angiopoietin relat; renal; diag
443646; AI085198; Hs.164226; Thrombospondin 1; angio, pane, uter; diag
443859; NM_013409; Hs.9914; follistatin; lung, cerv, headnk, blad, esoph; diag
443987; AW163123; Hs.10071; seven transmembrane protein TM; renal; mAb+s.m.

444006; BE395085; Hs.334762; type I transmembrane protein F; pane, colon, lung, ovar, renal, esoph, mela, blad, stom, cerv; mAb 
444090; S69115; Hs.10306; natural killer cell group 7 se; fibro, renal, mela; diag
444371; BE540274; Hs.239; forkhead box M1; lung, headnk, blad, glio, test, mela; s.m.

444381; BE387335; Hs.283713; hypothetical protein BC014245; breast, colon, blad, lung, pane, headnk, ovar, stom, uter, renal, angio, test, mela, esoph, sarc; CTL+diag

444488, AW192879 h nscrved do p I mAb+s.m. 

444527; NM_005408; Hs.11383; small inducible cytokine subfa; fibro, esoph; diag

444781; NM_014400; Hs.11950; GPI-anchored metastasis-associ; lung, blad, headnk, cerv; mAb+diag

444783; AK001468; Hs.62180; anillin (Drosophila Scraps hom; ovar, lung, blad, headnk, pane, cerv, stom, uter, colon, esoph; CTL+s.m.

445417; AK001058; H 12680 a di ntegnn Ike ard metal o pan h k.stoi lui it i d ag 

445537; AJ245671; Hs.12844; EGF-like-domain, multiple 6; ovar, blad, uter, breast, lung, headnk, renal, fibro, pane, cerv, sarc; mAb+diag

445891; AW391342; Hs,19S460; DPCR1 protein, ston , par cs >pl , ™c, esoph, mAb 

445895; D29954; Hs.13421; KIAA0056 protein; pros; CTL 

446051; BE048061; Hs.37054; ephrin-A3; colon, breast; mAb+diag 

445163; AA026880; Hs.25252; prolactin receptor; breast, cerv, uter; mAb+s.m.

445341; AL040763; Hs.310735; FGENESH prediction similar to ; mela; mAb+s.m.

446619; AU076643; Hs.313; secreted phosphoprotein 1 (ost; headnk, lung, colon, blad, mela, esoph, uter, sarc; diag

15 U )l b I 1 i ' < e i it , 2 " g i t 
445921 ABD121f H I I I I cytol- ne sjbfc breast, pane, headnk, lung, fibro, mela; diag 

447033; AI357412; Hs.157601; Predicted gene; Eos cloned; se; colon, pros, fibro, breast, ovar, lung, pane, sarc; CTL+diag
' D51594 h 1/ J tyr Iprofein sulfotranofcrao; glio, f m 

447131; NM_004585; Hs.17466; retinoic acid receptor respond; renal, breast, stom, lung, mela, ovar; mAb+s.m.

447208; BE315291; Hs.237971; hypothetical protein MGC5627; esoph, stom, colon; CTL+diag 

447269; NM_004861; Hs.17958; cerebroside (3'-phosphoadenyly; renal; CTL

447342; AI139268; Hs.19322; Homo sapiens, Similar to RIKEN; colon, blad, pros, lung, stom, ovar; CTL 

447400; AK000322; Hs.18457; hypothPtird ;n|, >n 12031 colon, p tern, uter, mAb+diag 

1 7 1 H' > IV H 1 11 2 /c i d p r en t a 2;mcl n, 
448243; AW369771; Hs.367688; inlegrin, beta 8, ovar u lung torn, I J il ylio pane ; mAb+s.m. 
448610; NM_006157; Hs.21602; nel (chicken)-like 1; mela, sarc; diag
448733; NM_005629; Hs.187958; solute carrier family 6 (neuro; lung, renal; mAb+s.m.
1 1 164; FGE j lorn, omuc mAb+s.m. 

449032; AA045573; Hs.22900; nuclear factor (erythroid-deri; colon, test, stom; CTL+s.m.
449043; Z45051; Hs^2920; similar to S68401 (cattle) glu, paa- o> i ut i glio he uak lung, sarc; mAb 
449444; AW818436; Hs.351306; solutecarrierfani'l/ iC imii no, rena p n i Ab+ m 

449523; NMJ00579; Hs.544^ chem i ( r motif) re r I ng p renal stom, hepC, fibro, leuk, mela; mAb+s.m. 

449720; AA311152; Hs.288708; hypothetical protein FLJ21562; colon; CTL

449722; BE280074; Hs.23960; cyclin B1; headnk, blad, lung, pane, angio, test, mela, esoph; s.m.

450001; NM_001044; Hs.406; solute carrier family 6 (neuro; renal; mAb+s.m.

450375; AA009647; Hs.352537; a disintegrin and metalloprote; breast, ovar, headnk, pane, lung, esoph, colon, sarc; mAb+diag+s.m.
450531; AW301032; Hs.203800; (BC017500) Similar to hypothet; colon; CTL

450701; H39960; Hs.288467; hypothetical protein XP_098151; lung, headnk, pane, breast, stom, ovar, esoph, colon, sarc; mAb+diag

450726; AW204600; Hs.355462; HUMPSPBA Human pulmonary surfa; fibro, lung; s.m. 

450931; N25155; Hs.25648; tumor necrosis factor receptor; lung, renal; mAb+s.m. 

450983; AA305384; Hs.25740; ERO1 (S. cerevisiae)-like; blad, lung, ovar, pane; diag

451310; AW250651; Hs.26213; Human DNA sequence from clone ; colon, pane; CTL 

451527; AF022813; Hs.26518; transmembrane 4 superfamily me; renal; mAb 

451537; R56631; Hs.26550; retinoid X receptor, gamma; mela; CTL+s.m.

451668; Z43948; Hs.326444; cartilage acidic protein 1; blad, ovar, lung; mAb+diag 

451933 U8045 Hs.2731 n i, d IL ph npo< CTL 

451979; F06972; Hs.27372; endothelial tyrosine kinase (E; angio; CTL+s.m.

451988; AF263928; Hs.27410; papillomavirus regulatory fact; renal; CTL 

452017; AF109302 I "4 ' prost rcer associated pro, pros, s m 

452097; AB002364; Hs.27916; a disintegrin-like and metallo; ovar; mAb+s.m.+diag

452190; H26735; Hs.91668; Homo sapiens clone PP1498 unkn; breast, stom, pane; mAb

452194; AI694413; Hs.373599; olfactory recepto f r is 2 , I i Ha fibro; mAb+s.m. 

452203; X57522; Hs.352018; transporter 1, ATP-binding cas; cerv, esoph, blad, stom, mela, renal; mAb+s.m.

452281; T93500; Hs.28792; Homo sapiens cDNA FLJ11041 fis; breast, headnk, pane, stom, lung, esoph, fibro; diag

452401; NM_0071 15; Hs.29352; tumor necrosis factor, atpha-i blad t t rc, headnk m.lung.leuk .at, esoph; diag 

452431; U88879; Hs.29499; toll-like receptor 3; renal, hepC; mAb 

452747; BE153855; Hs.61460; Igsur famil recepto lNIR, breast, blad, lung, headnk, ovar, stom, uter, pane; mAb 

452838; U65011; Hs.30743; preferentially expressed antig; lung, ovar, breast, mela, test, esoph, renal, sarc; CTL 

452862; AW378065 Hs 8687 ADAMT^ , tegrin-like a; headnk lor uk ung blad, esoph, stom, sarc; mAb+diag 
453837; AL138387; Hs.256126; baculoviral IAP repeat-contain; renal, mela, sarc; s.m.
453968; AA847843; Hs.62711; High mobility group (nonhiston; lung, uter, blad, test; CTL+s.m.

1690321; Hs.20 KCNK1 , mel, subf; ovar ; mAb*s.n 

456662; NM_002448, II, 14J4 msh(C o i <phila) homeo box horn; uter, ovar ; CTL 
457133; M54968; Hs.351221; v-Ki-ras2 Kirsten rat sarcoma ; pane; s.m.
457489; AI693815; Hs.127179; cryptic gene; pane, pros, lung; diag 
457819; AA057484; Hs 35406. FU20522 Hypctheti I proteii lun i tt headnk; mAb^iiag 
458079; AI796870; Hs.381220; Homo sapiens similar to RIKEN ; mela, fibro, sarc; mAb
458627; AW088642; Hs.S7984; SRY (sex determining region Y); ovar, uter, test; CTL

TABLE 2B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



414991 1785136.1 D78831 C17898 D78863 
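
As an illustration of how a Table 2B row can be read, the following is a minimal Python sketch. It assumes that each row is whitespace-delimited, with the Pkey first, the CAT number second, and one or more Genbank accessions after that, as the legend above suggests. The function name `parse_table_2b_row` is hypothetical and is not part of the specification.

```python
def parse_table_2b_row(line):
    """Split one Table 2B row into its Pkey, CAT number, and accession list.

    Assumed layout (from the Table 2B legend): Pkey, CAT number, then one or
    more Genbank accession numbers, all separated by whitespace.
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number, and at least one accession")
    pkey, cat_number, *accessions = fields
    return {"pkey": pkey, "cat_number": cat_number, "accessions": accessions}


row = parse_table_2b_row("414991 1785136.1 D78831 C17898 D78863")
print(row["pkey"], row["cat_number"], row["accessions"])
```

The identifiers are kept as strings because they are labels rather than quantities.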
TABLE 2C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

400843 9188605 Plus 5863-5970,7653-7784,8892-9023,9673-9807, 

402075 8117407 Plus 121907-122035,122804-122921,124019-12416 

402901 8894222 Minus 175426-176667

404287 2326514 Plus 63134-53281 

404682 9797231 Minus 40977-41150 

404875 9801324 Plus 96588-96732,97722-97831 

404977 3738341 Minus 43081-43229 

405033 7107731 Minus 142358-142546 

406400 9256298 Plus 1553-1712,1878-2140,4252-4385,5922-6077
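
The Nt_position column of Table 2C packs several predicted exons into one comma-separated field. The sketch below shows one way such a row might be decoded in Python. It assumes that each exon is written as `start-end` with inclusive coordinates and that the strand is spelled "Plus" or "Minus". The function names `parse_exon_ranges` and `parse_table_2c_row` are hypothetical.

```python
def parse_exon_ranges(nt_position):
    """Turn a field such as '96588-96732,97722-97831' into (start, end) tuples."""
    ranges = []
    for part in nt_position.split(","):
        part = part.strip()
        if not part:
            # Tolerate a trailing comma, which occurs where a row was cut off.
            continue
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError("exon start exceeds end: " + part)
        ranges.append((start, end))
    return ranges


def parse_table_2c_row(line):
    """Split one Table 2C row into Pkey, Ref, Strand, and exon ranges."""
    pkey, ref, strand, nt_position = line.split(None, 3)
    if strand not in ("Plus", "Minus"):
        raise ValueError("unexpected strand: " + strand)
    return {
        "pkey": pkey,
        "ref": ref,
        "strand": strand,
        "exons": parse_exon_ranges(nt_position),
    }


row = parse_table_2c_row("404875 9801324 Plus 96588-96732,97722-97831")
# Total predicted exon length, assuming inclusive coordinates.
total_length = sum(end - start + 1 for start, end in row["exons"])
print(row["exons"], total_length)
```

Rejecting a range whose start exceeds its end is a deliberate choice here: it flags rows damaged in transcription instead of silently accepting them.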

Table 3A. Disease Indications and Preferred Utilities for Selected Genes 

Table 3A provides preferred disease indications and preferred utilities for about 2709 selected genes. These genes were identified using Eos/Affymetrix GeneChip arrays.

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number 
UnigeneID: Unigene ID number
Unigene Title: Unigene gene title 

Disease: preferred diseases indicated for selected gene as described in Table 1 and abbreviated as follows: blad (bladder diseases), angio (blood vessel diseases), EWS
(bone diseases), glio (brain diseases), breast (breast diseases), cerv (cervical diseases), colon (colorectal diseases), esoph (esophageal diseases), fibro (fi
diseases), headnk (head & neck diseases), leio (leiomyoma diseases), leuk (leukocyte diseases), hepC (liver diseases), lung (lung diseases), ovar (ovarian
diseases), endo (ovarian endometrioid diseases), omuc (ovarian mucinous diseases), pane (pancreatic diseases), pros (prostate diseases), renal (renal diseases),
sarc (soft tissue and bone diseases), mela (skin diseases), stom (stomach diseases), test (testicular diseases), uter (uterine diseases)

Utility: preferred utilities for selected gene as described in the text and abbreviated as follows: CTL (DNA vaccine target), diag (diagnostic or prognostic target), mAb
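
The rows of this table follow the field order given in the legend (Pkey; ExAccn; UnigeneID; Unigene Title; Disease; Utility). The following Python sketch shows one way a row might be split into those fields. It rests on two assumptions drawn from the rows themselves: fields are separated by semicolons, and the Unigene Title may itself contain semicolons (for example "CA125 antigen; mucin 16"). The names `GeneRow` and `parse_gene_row` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GeneRow:
    pkey: str
    ex_accn: str
    unigene_id: str
    title: str
    diseases: List[str]
    utilities: List[str]


def parse_gene_row(line: str) -> GeneRow:
    """Parse 'Pkey; ExAccn; UnigeneID; Unigene Title; Disease; Utility'."""
    # Take the first three fields from the left ...
    head = line.strip().split(";", 3)
    if len(head) != 4:
        raise ValueError("row has too few fields")
    pkey, ex_accn, unigene_id, rest = head
    # ... and the last two from the right, so that semicolons inside the
    # title are preserved as part of the title.
    tail = rest.rsplit(";", 2)
    if len(tail) != 3:
        raise ValueError("row has too few fields")
    title, disease_field, utility_field = tail
    diseases = [d.strip() for d in disease_field.split(",") if d.strip()]
    utilities = [u.strip() for u in utility_field.split("+") if u.strip()]
    return GeneRow(pkey.strip(), ex_accn.strip(), unigene_id.strip(),
                   title.strip(), diseases, utilities)


row = parse_gene_row(
    "428758; AA433988; Hs.98502; CA125 antigen; mucin 16; "
    "ovar, cerv, lung, pane, stom, renal; diag"
)
print(row.title, row.diseases, row.utilities)
```

Splitting from both ends keeps the parser tolerant of titles with embedded semicolons, at the cost of requiring that the Disease and Utility fields are both present in every row.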



Pkey; ExAccn; UnigeneID; Unigene Title; Disease; Utility



100147; D13666; Hs.136348; periostin (OSF-2os); breast, colon, blad, lung, fibro, pane; mAb+diag 
100241; BE273648; Hs.32963; cadherin 6, type 2, K-cadherin; blad; mAb 
100299; D49493; Hs.2171; growth differentiation factor ; EWS; diag 
100335; AW247529; Hs.6793; platelet-activating factor ace; breast, lung, blad; s.m. 
100365; AI878927; Hs.79284; mesoderm specific transcript (; colon, pros; diag
100372; NM_014791; Hs.184339; KIAA0175 gene product; ovar, lung, cerv, pane; s.m.

I <t I n 2; anglo; d 

100420; D86983; Hs.118893; Melanoma associated gene; breast, pros, lung, colon, angio, leuk; diag
100448; AF234887; Hs.57652; cadherin, EGF LAG seven-pass G; breast; mAb+s.m.
100452; D87742; Hs.241552; KIAA0268 protein; pros; diag
100559; NM_000094; Hs.1640; collagen, type VII, alpha 1 (e; lung; CTL+s.m.
100654; A03758; Hs.184411; NM_000477:Homo sapiens albumi; pros; diag
100655; A03758; Hs.184411; Empirically selected from AFFX; pros; diag
100668; L05424; Hs.169610; CD44 antigen (homing function ; lung, breast; mAb 
100824; AI393237; Hs.193989; runt-related transcription fac; ovar; CTL+s.m.
100930; J04129; Hs.82269; progestagen-associated endomet; lung; diag
101063; D54745; Hs.80247; cholecystokinin; pros, EWS; diag 

101097; BE245301; Hs.89414; chemokine (C-X-C motif), recep; leuk, ovar, breast, blad; mAb+s.m. 
101104; AW862258; Hs.169266; neuropeptide Y receptor Y1; breast, EWS; mAb 
101192; BE247295; Hs.78452; solute carrier family 20 (phos; angio; mAb+s.m.
101193; L20861; Hs.152213; wingless-type MMTV integration; blad, lung; diag
101249; L18964; Hs.1904; protein kinase C, iota; ovar; s.m.
101261; D30857; Hs.82353; protein C receptor, endothelia; angio; mAb+s.m.
101389; AW951430; Hs.78888; diazepam binding inhibitor (GA; pros; mAb+s.m. 
101431; BE185289; Hs.1076; small proline-rich protein 1B ; lung, blad; diag
101447; M21305; ; gb:Human alpha satellite and s; angio, blad; diag
101461; N98569; Hs.76422; phospholipase A2, group IIA (p; pros; diag
101485; AA296520; Hs.89546; selectin E (endothelial adhesi; pros, ovar; mAb
101506; J02931; Hs.62192; coagulahon (thi ' 

101526; NM_002197; Hs.220529; at " 
101543; M31 166; Hs.2050; pentaxi 
101545; BE246154; Hs.154210; en 

101560; AW95B272;H 47 ler »ular adhesion molecul; angio; mAb 
101626; M57399; Hs.44; pleiotrophin (heparin binding ; lung; diag 
101649; AW959908; Hs.1690; heparin-binding growth factor ; lung, blad; diag 
101714; M68874; Hs.211587; phospholipase A2, group IVA (c; angio; s.m.
101724; L11690; Hs.198689; bullous pemphigoid antigen 1 (; breast, pros, blad, lung; mAb+CTL 
101741; NM_003199; Hs.326198; transcription factor 4; angio; CTL+s.m.
101748; NM_001944; Hs.1925; desmoglein 3 (pemphigus vulgar; lung, blad, headnk, cerv; mAb
101759; M80244; Hs.184601; solute carrier family 7 (catio; lung, glio, blad, headnk; mAb+s.m.
101791; M83822; Hs.62354; cell division cycle 4-like; pros; s.m. 
101804; M86699; Hs.169840; TTK protein kinase; ovar, lung, blad, cerv; CTL+s.m.
I0180M il 11 Hi i n r )P rot in t, blad; d 

101809; M85849; Hs.323733; gap junction protein, beta 2, ; colon, blad, lung, pane, headnk; mAb 
101839; AA446644, Hs GA733-2 antigei epithelial gl jrar, pros, nAb 
101845; U88967; Hs.78867; protein tyrosine phosphatase, ; lung, glio, headnk, cerv; mAb+s.m.
101851; BE260964; Hs.82045; midkine (neurite growth-promot; lung, blad, ovar, breast, pane; mAb+diag
102009; BE245149; Hs.82643; protein tyrosine kinase 9; ovar; s.m.
102012; BE259035; Hs.118400; singed (Drosophila)-like (sea ; angio; diag 
102024; AA301867; Hs.76224; EGF-containing fibulin-like ex; angio; diag
102048; U07225; Hs.339; purinergic receptor P2Y, 6-pro; blad; mAb 
102076; BE299197; Hs.179665; cyclin-dependent kinase inhibi; pros; CTL+s.m.
102125; NM_006456; Hs.288215; sialyltransferase; breast, lung, ovar; s.m.
102136; AA300576; Hs.85769; acidic 82 kDa protein mRNA; ovar; diag
102151; T27013; Hs.3132; steroidogenic acute regulatory; ovar; diag 
102154; U17760; Hs.75517; laminin, beta 3 (nicein (125kD; lung, blad, headnk; diag
102178; AW178761; Hs.227948; serine (or cysteine) proteinas; blad; mAb+diag
102187; U20325; Hs.1707; cocaine- and amphetamine-regul; breast; diag
102193; AL036335; Hs.313; secreted phosphoprotein 1 (ost; ovar, lung, fibro; diag
102200; AA232362; Hs.317432; branched chain aminotransferas; ovar; s.m.
102208; U22961; Hs.184411; gb:Human mRNA clone with simil; pros; diag
102211; BE314524; Hs.78776; putative transmembrane protein; breast, blad; mAb 

I02283;AW1 .1 8338 guanine i * ti J bindi. pro; ang "II - i 

102297; NM_001504; Hs.198252; G protein-coupled receptor 9; breast; mAb

102304; AF015224; Hs.46452; mammaglobin 1; breast; diag 

102305; AL043202; Hs.90073; chromosome segregation 1 (yeas; ovar, lung, blad; diag

102348; U37519; Hs.87539; aldehyde dehydrogenase 3 famil; lung, blad; s.m. 

102380; U40434; Hs.155981; mesothelin; ovar; diag

102394; NM_003816; Hs.2442; a disintegrin and metalloprote; pane; s.m.

102455; U48705; Hs.75562; discoidin domain receptor fami; breast; mAb

102457; NM r n pc I t ft n 4 t r 

102522; BE250944; Hs.183556; solute carrier family 1 (neutr; pros; mAb

102581; AU077228; Hs.77256; enhancer of zeste (Drosophila); blad, EWS, leuk; CTL+s.m.

102610; U65011; Hs.30743; preferentially expressed antig; lung, ovar; CTL

102623; AW249285; Hs.37110; melanoma antigen, family A, 9; lung, blad; mAb+CTL

102669; U71207; Hs.29279; eyes absent (Drosophila) homol; lung, pros; CTL+s.m.

102696; BE540274; Hs.239; forkhead box M1; lung, blad; s.m. 

102725; AB026187; Hs.374280; protocadherin 11; EWS; mAb 

102742; U79293; Hs.159264; Human clone 23948 mRNA sequenc; breast, ovar; diag

102745; AW753865; Hs.74376; olfactomedin related ER locali; EWS; diag

102803; H48299; Hs.26126; claudin 10; ovar; mAb

102829; NM_006183; Hs.80952; neurotensin; lung, ovar, headnk; diag

102836; U94320; Hs.158330; neuropeptide Y receptor Y5; EWS; mAb 

102852; V00571; Hs.75294; corticotropin releasing hormon; blad; diag 

102898; NM_002205; Hs.149609; integrin, alpha 5 (fibronectin; angio, blad, lung, pros; mAb+s.m.

102915; X07820; Hs.2258; matrix metalloproteinase 10 (s; angio, blad, lung, ovar; mAb+diag+s.m.

102917; AI016712; Hs.287797; integrin, beta 1 (fibronectin ; angio; mAb 

102927; BE512730; Hs.65114; keratin 18; ovar; diag

102968; AU076611; Hs.154672; methylene tetrahydrofolate deh; ovar; s.m.

102994; X51730; Hs.2905; progesterone receptor; blad; mAb+s.m.

103003; AI910275; Hs.350470; trefoil factor 1 (breast cance; breast, pane; diag 

103021, BE001596, Hs f i !~ r r b»l 4 lung blad; mAb 

103036; M13509; Hs.83169; matrix metalloproteinase 1 (in; angio, colon, blad, lung, leuk, ov; 

103037; BE018302; Hs.2894; placental growth factor, vascu; angio; diag

103060; NM_005940; Hs.155324; matrix metalloproteinase 11 (s; breast, lung, ovar, pane; mAb+ 

103080; AU077231; Hs.82932; cyclin D1 (PRAD1: parathyroid ; breast, EWS; diag 

103095; NM_005424; Hs.78824; tyrosine kinase with immunoglo; angio; mAb

103111; NM_006103; Hs.2719; epididymis-specific, whey-acid; ovar, uter; diag

103119; X63629; Hs.2877; cadherin 3, type 1, P-cadherin; lung, blad, ovar, colon, pros, panc, on

103206; X72755; Hs.77367; monokine induced by gamma inte; breast, lung ; diag 

103210; X72925; Hs.69752; desmocollin 1; pros; mAb

103280; U84722; Hs.76206; cadherin 5, type 2, VE-cadheri; angio, fibro; mAb+s.m. 

103299; NM_005756; Hs.184942; G protein-coupled receptor 64; ovar; mAb+s.m.

103312; Y12542; Hs.3185; lysosomal; lung, blad; mAb

103365; X90908; Hs.74126; fatty acid binding protein 6, ; blad; diag
103408; NM_001504; Hs.198252; G protein-coupled receptor 9; breast; mAb
103478; BE514982; Hs.38991; S100 calcium-binding protein A; lung, blad, headnk; diag
103587; BE270266; Hs.82128; 5T4 oncofetal trophoblast glyc; breast, blad, lung; mAb
103594; AI368680; Hs.816; SRY (sex determining region Y); lung, glio; s.m. 
103592;AW137912;Hs.227583;Hcmo • 
103739; AA115173; ;gb:zn30d02.s1 Sti 
103767; BE244667; Hs.348996; CGI-100 protein; angio; diag 
103989; AA315993; Hs.105484; regenerating gene type IV; colon, omuc; mAb+diag 
104352; NM_002407; Hs.97644; mammaglobin 2; ovar; diag
104115; AF183810; Hs.26102; opposite strand of trichorhi; breast; mAb 
1042 F002246; Hs.210863; c esior f I • r f 1, 
104301; AA768491; Hs.6783; hypothetical protein FLJ22724; ovar; diag
104308; N25117; Hs.355957; ribosomal protein S26; pros; diag 
104394; AA129551; Hs.172129; Homo sapiens cDNA: FLJ21409 fi; colon; diag
104542; R29657; ; gb:F1-1179D 22 week old human ; pros; diag 
104608; AF143867; Hs.337588; ESTs, Moderately similar to S6; blad; mAb 
104659; AW969769; Hs.100343; ESTs; EWS; diag 

104660; BE298665; Hs.14846; Homo sapiens mRNA; cDNA DKFZp5; uter, colon, pros; mAb
104667; AI239923; Hs.63931; dachshund (Drosophila) homolog; breast, pros, colon; diag 
104689; AA420450; Hs.380088; Plakophilin; lung; diag 

104691; U29690; Hs.37744; Homo sapiens beta-1 adrenergic; pros, EWS; mAb+s.m. 

104755; T49951; Hs.9029; DKFZP434G032 protein; breast, colon; diag 

104764; AI039243; Hs.278585; ESTs; angio; diag 

104786; AA027167; Hs.380438; KIAA0955 protein; angio; CTL+s.m. 

104877; AI138635; Hs.22968; intron of VEGFR; renal; diag 

104888; AW939591; Hs.5940; mucin 13, epithelial transmemb; colon, stom, uter; mAb+s.m. 

104919; AA026880; Hs.25252; Homo sapiens cDNA FLJ13603 fis; breast, cerv, uter; mAb+s.m.

104943; AF072873; Hs.114218; frizzled (Drosophila) homolog ; ovar; mAb+s.m.

104954; AW250651; Hs.26213; Human DNA sequence from clone ; colon; diag 

104971; BE311926; Hs.15830; hypothetical protein FLJ12691; blad; CTL 

105012; AF098158; Hs.9329; chromosome 20 open reading fra; colon, lung, blad; CTL

105038; AW503733; Hs.9414; KIAA1458 protein; breast, angio; CTL+s.m.

105039; AA907 I I75; ESTs; breast; diag 

105093; AL137566; Hs.32405; Homo sapiens mRNA; cDNA DKFZp5; blad; diag

105149; BE089288;Hs 8958; Homo- enscDNAFL 1 2024 fis; pros; diag 

105175; AA305384; Hs.25740; ERO1 (S. cerevisiae)-like; colon, lung; mAb

I0526 v 63 I j trie lily 7, (cati; angio i r, blad, pane; mAb+s.rr 

105298; BE387790; Hs.26369; hypothetical protein FLJ20287; ovar, lung; diag

105301; AW352357; Hs.7457; MAGE1 protein; EWS; diag 

105316; AI671245; Hs.24835; hypothetical protein FLJ14594; EWS; mAb

105329; AA234561; Hs.22862; ESTs; breast, pros; CTL+s.m.

105330; AW338625; Hs.22025; ESTs; similar to TRANSMEMBRANE; angio; mAb+s.m.

105370; AF179274; Hs.22791; transmembrane protein with EGF; pros; mAb+s.m.

105500; AW602166 Hs.222399 « EGP1 p iteln; breast, pros; diag 

105503; AW963624; Hs.31707; ESTs, Weakly similar to YEW4_Y; pros, breast, colon; CTL+s.m.

105507; BE268348; Hs.380963; CCR4-NOT transcription complex; colon; diag

105516; AK001269; Hs.30738; hypothetical protein FLJ10407; ovar; diag

105564; BE616694; Hs.288042; hypothetical protein FLJ14299; breast; diag

105645; AW294631; Hs.351270; ESTs; pros; diag 

105715; BE621800; Hs.29444; putative small membrane protei; colon; diag 

105743; BE246502; Hs.9598; sema domain, immunoglobulin do; breast, lung; mAb+s.m. 

I 4<- a' 1 1952 46 ' i, fi i ip einFL 20 bie CTL+ 1 
105777; R42755; Hs.23096; ESTs; breast; diag 

105782; H09748; Hs.57987; B-cell CLL/lymphoma 11B (zinc ; EWS; CTL+s.m.
105826; AA478756; Hs.194477; E3 ubiquitin ligase SMURF2; angio; s.m.
105990; AI690586; Hs.29403; hypothetical protein FLJ22060; breast; diag
106000; AW194426; Hs.20726; ESTs; breast; diag 
106012; AI240665; Hs.352537; ESTs; breast, lung; diag 
106014; AF123094; Hs.180566; mucosa associated lymphoid tis; leuk; diag
106063; BE260415; Hs.348198; hypothetical protein FLJ20262; pros; diag
106066; AW274357; Hs.301406; hypothetical protein PP3501; mela; CTL+s.m. 
106111; AW875398; Hs.6451; PRO0659 protein; EWS; CTL+s.m. 
106124; H93366; Hs.7567; branched chain aminotransferas; angio; s.m. 

II AA425414; lls.3 i I actor l/B; breast, pre r lag 
106373; AW503807; Hs.21907; histone acetyltransferase; breast; s.m. 
106400; BE397649; Hs.279607; Homo sapiens cDNA FLJ13634 fis; colon; diag
106414; BE568205; Hs.28827; mitogen-activated protein kina; breast; s.m.
106448; Z42061; Hs.27004; ESTs; pros; diag 

106533; AL134708; Hs.145998; ESTs; EWS; diag 

106574; BE044325; Hs.227280; U6 snRNA-associated Sm-like pr; colon; diag

106579; AA456135; Hs.23023; ESTs; pros; diag 

106632; NM_014400; Hs.11950; GPI-anchored metastasis-associ; lung, blad, headnk; mAb+diag 
106738; AW149266; Hs.25130; Homo sapiens cDNA FLJ14923 fis; ovar; diag
106793; H94997; Hs.16450; ESTs; angio; diag 

106844; AA485055; Hs.158213; sperm associated antigen 6; breast; mAb+CTL
106906, AA861271 I r pt or factor BMAL2; lung, blad; diag 

106990; AA280722; Hs.24758; ESTs, Weakly similar to I33022; breast; tfsq 
107036; AI973016; Hs.15725; hypothetical protein SBBI48; pros; diag 
107102; AB037765; Hs.30652; KIAA1344 protein; pros, breast; diag 
107105; AW963419; Hs.155223; stanniocalcin 2; breast; diag 
107136; AV661958; Hs.8207; GK001 protein; breast, colon; diag 

107151; AW378065; Hs.8687; ADAMTS2 (a disintegrin-like a; breast, colon, leuk, lung, blad; mAb+diag 
107216; D51069; Hs.211579; melanoma cell adhesion molecul; angio; diag
107248; AW263124; Hs.350547; nuclear receptor co-repressor/; breast, colon, pros; mAb+s.m.

107284; NM_005629; Hs.187958; solute carrier family 6 (neuro; lung; mAb+s.m.

10 3jr NM_00539 i 1642! pod lyxir lit ng i did. 

107901; L42612; Hs.335952; keratin 6B; breast, blad, lung; diag

107922; BE153855; Hs.61460; Ig superfamily receptor LNIR; breast, blad, lung; mAb 

107932; AW392555; Hs.18878; hypothetical protein FLJ21620; lung; CTL

108055; AJ404672; Hs.334483; hypothetical protein FLJ23571; breast, ovar; diag

108059; S69002; Hs.234773; Homo sapiens cDNA: FLJ22281 f; ovar; CTL+s.m.

108153; AW519204; Hs.40808; ESTs; pros; diag 

108186; AW068579; Hs.7780; Homo sapiens mRNA; cDNA DKFZp5; pros; diag
108242; AA062746; Hs.355244; gb:zm03g12.s1 Stratagene come; pros; diag
108282; AA065142; ; gb:im50h11.r1 Stratagene fibro; pros; diag
108505; AA083376; ; gb:zn09g08.s1 Stratagene hNT n; pros; diag 
108679; AA115963; Hs.323423; ESTs, Moderately similar to B ; pros; diag
108695; AB029000; Hs.70823; KIAA1077 protein; breast, colon, lung; diag 
108732; AA258888; Hs.107476; ATP synthase, H+ transporting,; pros; s.m.
108778; AF133123; Hs.90847; general transcription factor I; ovar; diag 
108828; AK001693; Hs.273344; DKFZP564O0463 protein; breast; diag 
108860; AA133334; Hs.816; ESTs; lung; s.m. 

109001; AI056548; Hs.72116; hypothetical protein FLJ20992 ; angio; CTL+diag
109032; AI219207; Hs.72222; hypothetical protein FLJ13459; blad; CTL
109077; AI732617; Hs.182362; ESTs; blad; diag 

109112; AW419196; Hs.257924; hypothetical protein FLJ13782; breast, pros, blad; diag
109141; AF174600; Hs.5978; ESTs, Highly similar to AF1746; colon; CTL+s.m. 

109166; AA219691; Hs.73625; RAB6 interacting, kinesin-like; lung, blad, breast, colon, ovar, headnk, EWS; s.m. 
109220; AW958181; Hs.189998; ESTs; pros; diag

109273; AA375752; Hs.348140; Homo sapiens mRNA; cDNA DKFZp5; breast; diag

109292; AW975746; Hs.188662; KIAA1702 protein; breast; diag 

109454; AA232255; Hs.295232; ESTs, Moderately similar to A4; ovar; diag 

109456; AW956580; Hs.42699; ESTs; angio, pane; diag 

109514; AA234087; Hs.262346; ESTs, Weakly similar to S72482; breast; diag

109530; AA908645; Hs.19597; KIAA1694 protein; pros; CTL+s.m.

109648; H17800; Hs.7154; ESTs; ovar; diag

109680; AB037734 H 1 IAA131 | ntem ovar; diag 

110009, BE075297 H 14; ESTs, V j ly imilar to A43932; breast, colon; diag 

110151 1118835 lis 1 hypothet rotei I 1041 pu s EWS diag 

110156; AA581322; Hs.4213; hypothetical protein MGC16207; lung, blad; diag 

110240; AI668594; Hs.176588; ESTs, Weakly similar to CP4Y_H; breast; diag

110278; AF061573; Hs.19492; protocadherin 8; EWS; mAb+s.m.

110675; H89355; Hs.249159; adrenergic, alpha-2A-, recepto; pros; mAb+s.m.

110728; AA737106; Hs.32250; ESTs, Moderately similar to 17; EWS; s.m.

110844; AI740792; Hs.167531; methylcrotonoyl-Coenzyme A car; pros, pros; s.m.

110915; BE092285; Hs.29724; hypothetical protein FLJ13187; breast, pros; diag

110971; AI760098; Hs.21411; ESTs; pros; diag 

111157; AL109729; Hs.99364; putative transmembrane protein; pros; mAb+s.m. 

111179; AK000136; Hs.10760; asporin (LRR class 1); breast, colon; CTL+s.m.

111185; AJ245671; Hs.12844; EGF-like-domain, multiple 6; ovar, blad; mAb+diag

1 1 1223; AA8G2773; I ls.334838; K!AA1866 protein, br( co'on, Itu rw mAb 

111299; AB033091; Hs.355925; KIAA1265 protein; ovar; diag

111357; BE314949; Hs.87128; hypothetical protein FLJ23309; breast; diag

111384; N94606; Hs.288969; HSCARG protein; breast; diag

111900; AF131784; Hs.25318; Homo sapiens clone 25194 mRNA ; breast; diag

111929; AF027208; Hs.112360; prominin (mouse)-like 1; colon, breast, fibro; mAb

112134; R4182C lls.7413;EST v i 2; breast EAS diag 

112244; AB029000; Hs.70823; KIAA1077 protein; breast, colon, blad, lung; diag

112280; AA863360; Hs.26040; ESTs, Weakly similar to fatty ; breast; s.m.

112283; L14561;lte.20952;ATPase,Ca++trjnsp.OTrij pi ov r mAb 

112287; AB033064; Hs.236463; KIAA1238 protein; breast; diag

112971; Z42387; Hs.83883; transmembrane, prostate androg; colon, pros, pros; mAb+s.m.

113003; AW292315; Hs.7215; ESTs; EWS; diag

113021; AL122055; Hs.129836; KIAA1028 protein; pros; s.m. 

11304 1571940; Hs.7549 ESTs, breast colon; diag 

113073; N39342; Hs.103042; microtubule-associated protein; pros; CTL+s.m.

113168; AW002393; Hs.337629; gb:wu61d05.x1 NCI_CGAP_GC6 Hom; ovar; diag

113195, H83265 h 1; ESTs, Weekly sim 11044; angio, lung; diag 

113230; T61430; ; gb:yc06a03.s1 Stratagene lung ; blad; diag

113361; T79589; Hs.63325; transmembrane protease, serine; colon, blad, lung, ovar, panc, headnk; mAb+diag+s.m.

11 1; T79925; H ) E" Voakly Gimil \LU1_H; leuk; d 

113443; AW083923; Hs.16098; claudin 2; colon, panc; mAb

113471; AI765890; Hs.16341; MAWD binding protein; pros; diag

113490; BE178110, Hs 17o ! m, pici s sDNA FU10500 fls; colon; diag 

113950; AI267652; Hs.246107; Homo sapiens mRNA; cDNA DKFZp4; breast, pros; diag

113970; W27249; Hs.8109; hypothetical protein FLJ21080; breast, lung, stom, uter; diag

114124; W57554; Hs.125019; lymphoid nuclear protein (LAF-; breast; diag 

114 51 Hi 51: H I94 F^T . , 

114292; AI815395; Hs.184641; fatty acid desaturase 2; breast; s.m. 

114334; AB037784; Hs.22941; KIAA1363 protein; ovar; diag 

114407; BE539976; Hs.103305; Homo sapiens mRNA; cDNA DKFZp4; breast, colon, lung; diag
114452; AI369275; Hs.243010; Homo sapiens cDNA FLJ14445 fis; angio; diag
114480; BE066778; Hs.151678; UDP-N-acetyl-alpha-D-galactosa; breast; s.m.

114531; AA053033; Hs.63325; transmembrane protease, serine; colon, blad, lung, ovar, panc, headnk; mAb+diag+s.m.

114540 AI9C4232 I 3; prohibit! b east diag 

114542; AW970128; Hs.91011; anterior gradient 2 (Xenopus l; breast, pros; diag
114587; AF086009; Hs.296398; gb:Homo sapiens full length in; colon; diag

114724; R64730.comp;Hs.155986;DEAD/H(Asp-GljAlH f c , CM i 

114768; AF212848; Hs.182339; ets homologous factor; pros, breast, colon; CTL+s.m.

114798; AA159181; Hs.54900; serologically defined colon ca; pros; CTL+s.m.
114908; AA454985; Hs.54973; cadherin-like protein VR20; pros; diag

114918; BE165762; Hs.23518; hypothetical protein from BCRA; pros; diag 

114965; AI733881 Hs 77 < ' t \ME MITTED re h F i, \ mase, breast mAb 

115050; AF052693; Hs.198249; gap junction protein, beta 5 (; lung, blad, headnk; mAb+s.m.
115221; AW365434; Hs.79741; hypothetical protein FLJ10116; ovar; diag
115239; BE251328; Hs.73291; hypothetical protein FLJ10881; colon; diag

115291; BE545072; Hs.1225 '9 hypot letical ,rote n rLJ104G1; ovar, lung; CTL«.m 

115412; AW131168; Hs.372382, EST^ V.' akly imllart 113322; pros; diag 

II %SS$S* m 'S u' in (Dr0S ° phila S ° rapS h0m; 0Var ' lu " 9 ' bIad ' headnk ' P anc ' cerv ' stom - uler ' col °": CTL+s.m. 
.._ 115674 A A c (0 Homo sapiens pyruvate dehydrog, ovar s m 

115575; W87707; Hs.82065; interleukin 6 signal transduce; breast, pros; mAb+s.m.

115683; Hs.546 n glo mAb ' 

115697; D31382; Hs.63325; transmembrane protease, serine; colon, blad, lung, ovar, panc, hea

115719; AW992405; Hs.352406; Homo sapiens, clone IMAGE:3507; pros, breast, colon; CTL+s.m.

115819; AA486620; Hs.41135; endomucin-2; angio; diag 

115827; AA428000; Hs.283072; actin related protein 2/3 comp; angio; diag

115844; AI373052; lis. 33293 hyp! I pr 1 i 5370 pros; diag 

115881; NM_005756; Hs.184942; G protein-coupled receptor 64; ovar, EWS, uter; mAb+s.m.

115892 AA2913 Hs. 50831; ESI v blad rq d.ja 
115909; AW872527; Hs.59761; ESTs, Weakly similar to DAP1_H; ovar, lung; diag

115947; R47479; Hs.94761; KIAA1691 protein; colon; diag

115978; AL035864; Hs.69517; cDNA for differentially expres; lung, blad, breast, pros, ovar, headnk; C

116003; DE27S469; lls.66493; Down by, omecril I region ; colon; m 

116011; AL359053; Hs.57664; Homo sapiens mRNA full length ; breast; diag

116028; H59799;Hs. 12644 th - i n-likf ■ lurq drag 

116107;AL133916; Hs 47860 hyp. Ihek rate I i H3 lung breast, drag 

116202; BE159.W Hs 194092 ESTs pros drag 

116238; AV660717; Hs.47144; DKFZP586N0819 protein; ovar; diag 

116301, AW969706 ,, 29333 _STb, EWS drag 

116334; AL038450; Hs.48948; ESTs; pros; diag 
116335; AK001100; Hs.41690; desmocollin 3; lung, blad, headnk; diag

116393 AI97?4 J I 1,1, , I , r t r MCC'648 pros; drag 

116399; AA889120; Hs.110637; homeo box A10; pros; CTL+s.m.

116401;AW89391r Hs.5 )3; EST m diag 

116416; AW753376; Hs.39982; ESTs; ovar; diag
116470; AI272141; Hs.351928; SRY (sex determining region Y); colon, breast, angio, blad; diag

116483; AI346201; Hs.75118; ubiquitin carboxyl-terminal es; angio, lung; s.m.

116610; D80449; Hs.184841; ESTs; pros; diag 

116732;AW152225 ,1 16590 L^f ffeatly similar to I38022; colon; diag 

4 5 US S 2 7 955 ; Hs v« 6 ^ 7; , H c omo sapiens cDNA FLJ14415 5s; pros ' breas1 ' colon ' M 

116962; H79677; ; gb:yu76g10.s1 Soares fetal liv; pros; diag
1 17027; AW085208 h I3009 ESI reasl diag 

117280; M18217; Hs.172129; Homo sapiens cDNA: FLJ21409 fi; breast, colon, pros; diag

117284; AK001701; Hs.183779; Homo sapiens cDNA FLJ10590 fis; pros; diag

117320; AB024937; Hs.211092; LUNX protein; PLUNC (palate lu; lung; mAb+diag
117367; AI041793; Hs.42502; ESTs; breast; diag

117412, N3253G Hs.4; ' r so t - , e family 16 (mono; breast, ovar ; mAb^m 

117425; AK000028; Hs.356100; ribosomal protein S24; pros; diag

117563; AF055634; Hs.44553; unc5 (C. elegans homolog) c; leuk; diag
<- 1 N35020 ' HC4 nc finger protein; EWS; C i L^.rr 
117921; AA021459; Hs.306480; Homo sapiens mRNA; cDNA DKFZp7; pros; diag

ISffl?? 72 * Hs ' 106778 : ATPase ' Ca« transporting, typ; pros, breast, colon; mAb 

118049; N5314S ; gb:yv55fD9.s1 Soares fetal liv; pros; diag 

118314; N48580; Hs.46692; ESTs; blad, lung; diag 

115S' BE -° 1 ^ 7ir ' ' T "1. breast ovar, blad, pros; CTL«.m. 
118368; N64339; Hs.48956; gap junction protein, beta 6 (; lung, blad; mAb

118417; AF080229; ; gb:Human endogenous retrovirus; pros; s.m.

118472; AL157545; Hs.173179; bromodomain and PHD finger con; breast; diag 

118511; N75620, 1 Is 43157 EST- ngio; drag 
118901; AW292577; Hs.94445; ESTs; breast; diag

118905; AW973708; Hs.201925; Homo sapiens cDNA FLJ13446 fis; breast; diag

119018; AA631143; Hs.278695; Homo sapiens prostein mRNA, co; pros, pros; diag 

119036; R95872; Hs.117572; chemokine binding protein 2; breast, ovar; mAb

119073; BE245360; Hs.45514; v-ets erythroblastosis virus E; angio, pros; CTL+s.m.

119082; AF252297; Hs.91546; cytochrome P450 retinoid metab; EWS; diag
119126; R45175; Hs.117183; ESTs; pros, breast, colon; diag

119279; N57568; Hs.48028; EST; breast; diag

119307; BE048061; Hs.37054; ephrin-A3; colon, breast; mAb+diag

119478; AI624342; Hs.179082; ESTs; breast; diag 

119617; AA516531; Hs.55999; NK homeobox (Drosophila), fami; pros; diag
119743; AA947552; Hs.58086; branched chain aminotransferas; ovar; s.m.

119771; AI905687; Hs.348419; AI905687:IL-BT095-190199-019 B; breast; diag

ISM 6 . 625 ' ~ 1n ' 1 hypoth tical protein; ovar, lung; CTUs.m. 

119789 BE39394 50915 I i 5 ovar diag 
m lfS, Hs ', 58561; G Protern^oupled receptor 87; lung, blad, headnk, cerv; mAb+s.m. 

«U 119940; AL050097;Hs.?7 531 DKFZP586G0319 protein uros, diag 

IS SIS? 0 ! 231 Hs ' 180479 : hypothetical protein FU20116; colon, lung; diag 
20 32; W57554; Hs.125019; lymphoid nuclear protein (LAF-; pros, breast diag 

120147; AI917116; Hs.348941; hemoglobin, beta; EWS; diag
120206; H26735; Hs.91668; Homo sapiens clone PP1498 unkn; breast; mAb

120242; AW969587; Hs.86366; ESTs; blad; diag 

120328; AA923278; Hs.290905; ESTs, Weakly similar to protea; pros; s.m.

120438; AW015242; Hs.99488; ESTs, Weakly similar to YK54_Y; ovar; diag

120471; AA251944; Hs.104058; CGI-29 protein; colon; diag 

120486; AW368377; Hs.137569; tumor protein 63 kDa with stro; lung, blad, headnk; diag 

120588; AA703226; Hs.16193; Homo sapiens mRNA; cDNA DKFZp5; pros; diag 

120624; AW407987; Hs.173518; M-phase phosphoprotein homolog; breast; s.m.

120830; AI568170; Hs.96886; ESTs; EWS; diag 

120977; AA398155; Hs.97600; ESTs; breast, ovar; diag 

121027; AI572490; Hs.99785; Homo sapiens cDNA; FLJ21245 fi; blad; mAb 

121231; AA814948; Hs.96343; ESTs, Weakly similar to ALUC_H; EWS; diag 

121335; AA404418; ; gb:zw37e02.s1 Soares_total_fet; angio; diag

121362; AF050147; Hs.97932; chondromodulin I precursor; EWS; mAb

121457; W07404; Hs.102558; hypothetical protein FLJ22055; colon; diag 

121619; AA528339; Hs.178062; ESTs, Weakly similar to phosph; EWS; s.m. 

121710; AF163474; Hs.96744; prostate androgen-regulated tr; pros; diag 

121721; AL047051; Hs.199961; ESTs, Weakly similar to ALU7_H; pros; diag 

121723; AA243499; Hs.104800; hypothetical protein FLJ10134; breast; diag

121748; BE536911; Hs.234545; hypothetical protein NUF2R; breast; diag 

121779 AW51314C Hs.93367; SRY ( Jet ng reg n>) ova diag 

121791; AA815378; Hs.253317; ESTs, Weakly similar to GGC1_H; blad, headnk, lung, ovar; mAb+CTL

121792; AW969726; Hs.98381; ESTs, Weakly similar to serine; EWS; diag

121913; AI249368; Hs.98558; ESTs; protease inhibitor 15 (; breast, pros; s.m.

121920; AA428300; ; gb:zw18b02.s1 Soares ovary tum; ovar, uter, cerv; diag

122041; AA677577; Hs.380213; Homo sapiens Chromosome 16 BAC; pros; diag 

122520; AW951324; Hs.173609; pregnancy specific beta-1-glyc; colon; diag

122797; AJ251027; Hs.99526; odorant-binding protein 2B (QB; breast; diag 

122802; AI687303; Hs.285529; G protein-coupled receptor 49; ovar, uter; mAb+s.m.

122969; AW821252; Hs.104336; hypothetical protein; ovar; diag 

123005; AW369771; Hs.367688; integrin, beta 8; ovar, lung, headnk, glio; mAb+s.m. 

123044; AK001035; Hs.130881; B-cell CLL/lymphoma 11A (zinc ; lung; diag

123137; AI073913; Hs.100686; ESTs, Weakly similar to JE0350; breast, colon, ovar, uter, lung, stom; diag

123158; AF161426; Hs.218329; hypothetical protein; breast; diag 

123160; AA488687; Hs.284235; ESTs, Weakly similar to I38022; lung; diag 

123169; AI950087; Hs.369628; gb:wqOSc02.x1 NCI_CGAP_Kid12 H; ovar; diag 

123209; AW968543; Hs.203270; ESTs, Weakly similar to ALU1_H; pros; diag 

123308; C14187; Hs.157208; ESTs; EWS; diag 

123339; AW188464; Hs.101515; ESTs; ovar; diag

123475; BE439553; Hs.12329; Homo sapiens, clone IMAGE:4098; pros; diag 

11 \W179019;Hs.112110 mil nominal rite nip - var no 

123520; AA608550; ; gb:ae53d12.s1 Stratagene lung ; pros; s.m. 

123533; AA608751; ; gb:ae56h07.s1 Stratagene lung ; colon; diag 

123619; AA602964; Hs.366318; gb:no97c02.s1 NCI_CGAP_Pr2 Hom; breast; CTL+s.m.

123689; AA399323; Hs.285130; Homo sapiens pinch-2 protein m; ovar; diag 

123709; AA706910; Hs.112742; ESTs; breast; diag

123829; AF251237; Hs.112208; XAGE-1 protein; lung, blad, test; CTL

123972; T46848; Hs.70337; immunoglobulin superfamily, me; ovar; diag

124006; AI147155; Hs.279727; ESTs; homologue of PEM-3 [Cion; breast, angio, lung, ovar, EWS; diag

124059; BE387335; Hs 233 13 ESTs We kiy similar to 11354, breast, colon, blad, lung; CTL+diag 

124153; AU077333; Hs.160483; erythrocyte membrane protein b; pros; mAb 

124352; AA640891; Hs.102406; ESTs; breast, pros, ovar, lung; diag

124526; N62096; Hs.293185; ESTs, Weakly similar to JC7328; pros; mAb+s.m.

124579; AI69381 5 Hs 1 71 ' pti, g, ne pane, diag 

124777; R41933; Hs.140237; ESTs, Weakly similar to ALU1_H; pros, breast; diag

125103; AA570056; Hs.122730; ESTs, Moderately similar to Kl; colon; mAb

125154; W38419; ; gb:zc78a07.s1 Pancreatic Islet; ovar; diag

125250; W26524; Hs.356686; protein phosphatase 4 regulato; ovar; CTL+s.m.

125266; W90022; Hs.186809; ESTs, Highly similar to LCT2_H; angio; diag 

125453; BE385523; Hs.18048; melanoma antigen, family A, 10; blad; mAb+CTL 

125666; AL390172; Hs.317432; Homo sapiens cDNA: FLJ21270 fi; ovar; diag

125770; AA143045 h 16 tHai Zuckerman 4 feline EWS, diag 

125976; AA436760; Hs.35552; gb:zv67d11.r1 Soares_total_fet; pros; diag

126399; AA088767; Hs.83883; transmembrane, prostate androg; panc; mAb+s.m.

126645; AA316181; Hs.61635; six transmembrane epithelial a; pros, breast, lung, panc, headnk, EWS; mAb+CTL

126758; AI559444; Hs.104679; ESTs; pros, breast; mAb

126799; AW753865; Hs.74376; olfactomedin related ER locali; EWS; diag 

126872; AW450979; ; gb:UI-H-BI3-ala-a-12-0-UT.s1 N; blad; diag 

126892; AF121856; Hs.284291; sorting nexin 6; ovar; diag 

126960; AL390172; Hs.317432; branched chain aminotransferas; ovar; s.m.

126966; R38438; Hs.118747; solute carrier family 15 (H+/p; pros; mAb

127003; AW816515; Hs.173540; ATPase, Class V, type 10D; pros; mAb 

127221; BE062109; Hs.241551; chloride channel, calcium acti; lung, blad, headnk, cerv; mAb+s.m.

127240; AJ005683; Hs.86998; nuclear factor of activated T-; pros; CTL+s.m.

127425; AF183810; Hs.26102; trichorhinophalangeal syndrome; breast; mAb 

127479; D31152; Hs.179729; collagen, type X, alpha 1 (Sch; breast, lung, headnk, panc; diag

7;f L 1 c I E ^ j 

127664; AA806164; Hs.116502; ESTs; EWS; diag

128046; AA873285; Hs.357313; gb:oh68h05.s1 NCI_CGAP_Kid5 Ho; pros, breast, colon; diag 

128305; AI954968; Hs.365706; matrix Gla protein; breast; diag

128478; AA708205; Hs.100343; ESTs; EWS; CTL+s.m.

128515; BE395085; Hs.334762; type I transmembrane protein F; panc; mAb

128595; U31875; Hs.272499; short-chain alcohol dehydrogen; blad, breast; CTL+s.m.
128610; N48373; Hs.10247; activated leucocyte cell adhes; breast, pros, lung, ovar; diag
128734; AB008390, H- I 457i kallikreir (ni ropsii ivasir ovar.diag 

128790; AF026692; Hs.105700; secreted frizzled-related prot; breast, colon, pros, ovar, uter, panc; diag
128797; NM_002975; Hs.105927; stem cell growth factor; lymph; EWS, leuk; diag
128854; BE159181; Hs.168232; hypothetical protein FLJ13855; breast; diag 
128925; R67419; Hs.21851; Homo sapiens cDNA FLJ12900 fis; breast; diag

128949; AA009647; Hs.352537; a disintegrin and metalloprote; breast, ovar, headnk, panc; mAb+diag+s.m.

128969; Z42047; Hs.107479; Homo sapiens PRO2751 mRNA, com; pros; diag

129041; BE382756; Hs.169902; solute carrier family 2 ( p" I ung bl d m b-»s.m 

129097; BE243933; Hs.108642; zinc finger protein 22 (KOX 15; ovar; CTL+s.m.

129099; AF146074; Hs.108660; ATP-binding cassette, sub-fami; lung, blad, headnk; mAb+s.m.

129184; AW161450; Hs.109201; CGI-86 protein; pros; mAb 

129260; AF077200; Hs.279813; hypothetical protein; colon; diag 

129284; AA318224; Hs.296141; ESTs; colon; diag 

129362; U30246; Hs.110736; solute carrier family 12 (sodi; colon, breast, pros; mAb 

129366; BE220806; Hs.184697; Homo sapiens clone 23785 mRNA ; breast; diag

129389; NM_012445; Hs.288126; spondin 2, extracellular matri; colon, pros; diag

129404; AI267700; Hs.351201; ESTs; pros, ovar, lung, blad, headnk, panc; diag

129455; L42583 II 34309; kei tin 6A ' irq blad drag 

129482; AA188185; Hs.289043; spindlin; breast; diag 

129534; AK002126; Hs.11260; hypothetical protein FLJ11264; pros; diag

129571; X51530; Hs.1145; Wilms tumor 1; ovar; CTL+s.m.

129605; AF061812; Hs.115947; keratin 16 (focal non-epidermo; lung, blad, headnk; diag 

129620; D79338; Hs.239720; CCR4-NOT transcription complex; breast, angio; diag 

129628; U38945; Hs.1174; cyclin-dependent kinase mhib. 3. blad / r he 1 

129650; AF109298; Hs.118258; prostate cancer associated pro; pros, EWS; diag 

129689; AW748482; Hs.77873; B7 homolog 3; breast; diag 

129703; BE388665; Hs.179999; Homo sapiens, clone IMAGE:3457; EWS, leuk; diag

129720; AA156214; Hs.12152; APMCF1 protein; breast; diag

129750; AF056085; Hs.198612; G protein-coupled receptor 51; angio, blad; mAb+s.m.

129869; AI222069; Hs.13015; hypothetical protein similar t; breast; diag 

129912; AF155096; Hs.107213; hypothetical protein FLJ20585; ovar; CTL+s.m.

129936; AJ250717; Hs.1355; cathepsin E; blad; s.m.+diag

129953; AA412195; Hs.13740; ESTs; breast; diag

129977; NM_000399; Hs.1395; early growth response 2 (Krox-; EWS; CTL+s.m.
130010; AA301116; Hs.142838; nucleolar phosphoprotein Nopp3; ovar; diag
130057; AF027153; Hs.324787; solute carrier family 5 (inosi; breast; mAb
130095; AK001635,Hs 14838 hyp r i< , pre tt n FL I0773 breast, diag 
130155; AA101043; Hs.151254; kallikrein 7 (chymotryptic, st; ovar; diag
130181, ArOE2119 ils.151508 Tomo apens clone 23522 mRNA; pros; diag 
130184; H58306; Hs.15165; retinoic acid induced 14; angio; diag 
130262; D63216; Hs.153684; frizzled-related protein; panc, EWS, stom, renal; diag
130343; AB040914; Hs.278628; KIAA1481 protein; breast; diag 

0376; R4087 I 155174; CDC5 ( I division cycle 5, S; ovan CTL+s.m. 
130385; AW067800; Hs.155223; stanniocalcin 2; breast, lung; mAb+diag
130455; D90041; Hs.155956; N-acetyltransferase 1 (arylami; breast; s.m.
130511; L32137; Hs.1584; cartilage oligomeric matrix pr; breast, ovar; diag
130558; BE564937; Hs45984; pp21 homolog; pros; CTL+s.m. 
130577; M69241, Hs 162 ir sulii - it e -.ir Vh factor bin; ovan diag 
130604; AA383256; Hs4657; estrogen receptor 1; breast; mAb+s.m.

130627; BE003054; Hs.1695; matrix metalloproteinase 12 (m; lung, colon, blad, headnk, ovar, panc; mAb+diag+s.m.

130637; AA356764; Hs.17109; integral membrane protein 2A; EWS; mAb+s.m.

130648; AI458165; Hs47296; hypothetical protein MGC2376; colon; diag 

130667; BE246961; Hs.17639; Homo sapiens ubiquitin protein; breast; s.m. 

130690, AB006625 Hs 13 3C p II, - r r e>sed 3,ovandiag 

130714; AI348274; Hs.18212; DNA segment on chromosome X (u; breast; diag 

130760; AW379130; Hs.18953; phosphodiesterase 9A; pros; CTL+s.m.

130800; AI187292; Hs.19574; hypothetical protein MGC5469; colon, lung; diag 

130839; AB011169; Hs.380875; similar to S. cerevisiae SSM4; angio; diag

130844; U76248; Hs.20191; seven in absentia (Drosophila); breast; diag 

130892; AL120837; Hs.20993; high-glucose-regulated protein; breast; CTL+s.m.

130941; NM_000869; Hs.2142; 5-hydroxytryptamine (serotonin; ovar; mAb

130967; AA393071; Hs.182579; leucine aminopeptidase; ovar; s.m.

130972; D81866; Hs.374468; Homo sapiens mRNA; cDNA DKFZpE; angio; diag

130987; BE613269; Hs.21893; hypothetical protein DKFZp761N; colon; diag 

131046; AA321649; Hs.2243; small inducible cytokine subfa; breast, lung, blad, ovar, fibro; diag 

131080; NM_001955; Hs.2271; endothelin 1; angio; diag

131083; Y09763; Hs.22785; gamma-aminobutyric acid (GABA); pros; mAb 

131148; AW953575; Hs.303125; p53-induced protein PIGPC1; breast, colon, angio; diag 

131216; AI815486; Hs.243S01; Homo sapiens cDNA FLJ20738 fis; colon, breast; diag 

131228; AW207469; Hs.24485; chondroitin sulfate proteoglyc; ovar; diag

131244; AI638429; Hs.24763; RAN binding protein 1; lung, blad, headnk; CTL+s.m.

131288; AA278482; Hs.25328; ESTs, Moderately similar to AL; pros; diag 

131289; AA296696; Hs.333418; FXYD domain-containing ion tra; colon; diag 

131307; NM_000025; Hs.2549; adrenergic, beta-3-, receptor; EWS; mAb

131313; R96290; Hs.75874; ribosomal protein L44; EWS; diag 

131492; AI452601; Hs.288869; nuclear receptor subfamily 2, ; pros; mAb+s.m. 

131544; AL355715; Hs.28555; programmed cell death 9 (PDCD9; breast; diag 

131559; AL078599; Hs.10784; hypothetical protein FLJ20037; breast; diag

131564; T93500; Hs.28792; Homo sapiens cDNA FLJ11041 fis; breast; diag

131603; X81334; Hs.2936; matrix metalloproteinase 13 (c; blad; s.m. 

131643; AW410601; Hs.30026; HSPC182 protein; breast; diag

131739; AF017986; Hs.31386; secreted frizzled-related prot; breast; mAb+s.m.
131817; U20536; Hs.3280; caspase 6, apoptosis-related c; fibro, breast, cerv, lung, blad, panc, glio;
131885; BE502341; Hs.3402; ESTs; breast; diag 

131919; T15803; Hs.272458; protein phosphatase 3 (formed; pros, breast; s.m.

131925; AF151048; Hs.183180; anaphase promoting complex sub; breast; diag 

131965; W79283; Hs.35962; ESTs; lung, ovar; diag

131985; AA503020; Hs.36563; hypothetical protein FLJ22418; breast, ovar; diag

132050; AI267615; Hs.38022; ESTs; angio; diag

132173; X89426; Hs.41716; endothelial cell-specific mole; angio; diag 

132180; NM_004460; Hs.418; fibroblast activation protein,; colon, panc, esoph; mAb

132191; AA507576; Hs.288361; Homo sapiens cDNA: FLJ22696 fi; ovar; diag

132349; AW975654; Hs.181286; serine protease inhibitor, Kaz; pros, blad; s.m.

132354; BE185289; Hs.1076; small proline-rich protein 1B ; lung; diag

132358; NM_003542; Hs.46423; H4 histone family, member G; pros; CTL+s.m.

132371; AA235448; Hs.222088; PRO2000 protein; breast; diag 

132454; BE296227; Hs.250822; serine/threonine kinase 15; blad, breast; s.m. 

132490; NM_001290; Hs.4980; LIM domain binding 2; angio; diag 

132520; AA257992; Hs.50651; Janus kinase 1 (a protein tyro; EWS; s.m. 

132528; T78736; Hs.50758; SMC4 (structural maintenance o; ovar; CTL+s.m.

132543; BE568452; Hs.344037; protein regulator of cytokines; colon, lung; diag 

132572; AI929659; Hs.237825; signal recognition particle 72; ovar; diag

132592; AW803564; Hs.288850; Homo sapiens cDNA: FLJ22528 f; colon; diag

132624; AA326108; Hs.33829; bHLH protein DEC2; ovar; diag

132632; AU076916; Hs.5398; guanine monphosphate synthetas; ovar, lung; s.m. 

132669; W38586; Hs.380933; guanine nucleotide binding pro; colon; diag 

132710; W74001; Hs.55279; serine (or cysteine) proteinas; lung, blad, colon, headnk; diag 

132725; NM_006276; Hs.184157; splicing factor, arginine/seri; ovar; CTL+s.m.

132767; BE182592; Hs.11261; small proline-rich protein 2A; lung; diag

132791; AB029561; Hs.7910; RING1 and YY1 binding protein; pros; CTL+s.m. 

132837; AA370362; Hs.57958; EGF-TM7-latrophilin-related pr; angio; diag

132856; NM_001448; Hs.58367; glypican 4; breast, colon, pros; mAb 

132888; NM_005476; Hs.5920; UDP-N-acetylglucosamine-2-epim; pros; s.m. 

132902; AI936442; Hs.59838; hypothetical protein FLJ10808; colon; diag 

132939; AB009284; Hs.61152; exostoses (multiple)-like 2; ovar; diag 

132964; AI362575; Hs.303171; ESTs; pros; diag 

132967; AA316181; Hs.61635; six transmembrane epithelial a; pros, pros; mAb+CTL 

132990; X77343; Hs.334334; transcription factor AP-2 alph; breast, lung; CTL+s.m. 

132994; AA112748; Hs.279905; clone HQ0310 PRO0310p1; colon, breast; s.m. 

133006; AW978436; Hs.62515; KIAA0494 gene product; colon; diag 

133015; AJ002744; Hs.246315; UDP-N-acetyl-alpha-D-galactosa; breast, colon, pros; s.m. 

133016; AI439688; Hs.6289; hypothetical protein FLJ20886; breast; diag 

133061; AI186431; Hs.296638; prostate differentiation facto; angio, pros, blad; diag 

133063; AI654133; Hs.356247; thyroid receptor interacting p; pros; mAb+s.m. 

133070; U92649; Hs.380136; a disintegrin and metalloprote; leuk; diag 

133179; U81599; Hs.66731; homeo box B13; pros; CTL+s.m. 

133199; AF231981; Hs.250175; homolog of yeast long chain po; breast, angio; CTL+s.m. 

133260; AA403045; Hs.6906; Homo sapiens cDNA: FLJ23197 fi; angio; diag 

133272; NM_002776; Hs.69423; kallikrein 10; colon, ovar; diag 

133314; AA102670; Hs.70725; gamma-aminobutyric acid (GABA); breast, panc; mAb 

133321; T79526; Hs.179516; integral type I protein; breast; diag 

133391; AW103364; Hs.727; inhibin, beta A (activin A, ac; breast, blad, lung; diag 

133415; X69699; Hs.73149; paired box gene 8; ovar; CTL 

133477; AW502935; Hs.740; PTK2 protein tyrosine kinase 2; breast; s.m. 

133579; X75346; Hs.75074; mitogen-activated protein kina; pros; diag 

133626; AW836130; Hs.75277; hypothetical protein FLJ13910; pros; diag 

133736; D49958; Hs.75819; glycoprotein M6A; pros; mAb 

133829; AW630088; Hs.76550; Homo sapiens mRNA; cDNA DKFZp5; ovar; diag 

133860; S78296; Hs.76888; hypothetical protein MGC12702; blad; diag 

133944; AW068579; Hs.7780; Homo sapiens mRNA; cDNA DKFZp5; pros; diag 

133975; C18356; Hs.295944; tissue factor pathway inhibito; angio, panc; CTL+diag 

133976; AI903165; Hs.169946; GATA-binding protein 3 (T-cell; breast, blad; mAb+s.m. 

134100; AA460085; Hs.171075; replication factor C (activate; pros; diag 

134110; U41060; Hs.79136; LIV-1 protein, estrogen regula; breast, blad, ovar, pros; mAb 

134169; AI690916; Hs.178137; transducer of ERBB2, 1; breast; CTL+s.m. 

134219; NM_000402; Hs.80206; glucose-6-phosphate dehydrogen; breast; s.m. 

134319; BE304999; Hs.285754; fumarate hydratase; colon; s.m. 

134326; AW903838; Hs.81800; chondroitin sulfate proteoglyc; ovar, breast, panc, lung; diag 

134348; AW291946; Hs.82065; interleukin 6 signal transduce; breast; mAb+s.m. 

134374; N22687; Hs.8236; ESTs; pros; diag 

134390; R35528; Hs.8258; DKFZP434D1335 protein; pros; CTL+s.m. 

134401; AI916662; Hs.211577; kinectin 1 (kinesin receptor); pros, breast; mAb+s.m. 

134405; AW067 3; H 2772 llogoi ,| Xl.atpl 1; brea lung, ovar, headnk; CI L 

134470; X54942; Hs.83758; CDC28 protein kinase 2; lung, blad, headnk; s.m. 

134520; BE091005; Hs.349506; activated RNA polymerase II tr; ovar; s.m. 

134529; AW411479; Hs.848; FK506-binding protein 4 (59kD); breast; diag 

134570; U66615; Hs.172280; SWI/SNF related, matrix associ; EWS; CTL+s.m. 

134654; AK001741; Hs.8739; hypothetical protein FLJ10879; breast; diag 

134666; BE391929; Hs.8752; transmembrane protein 4; breast; mAb+s.m. 

134691; AW382987; Hs.88474; prostaglandin-endoperoxide syn; ovar; s.m. 

134727; X80507; Hs.84520; yes-associated protein 55 kDa; blad; diag 

134731; D89377; Hs.89404; msh (Drosophila) homeo box hom; blad; s.m. 

134786; T29618; Hs.89640; TEK tyrosine kinase, endotheli; angio; s.m. 

134824; S78723; Hs.298623; 5-hydroxytryptamine (serotonin; blad; mAb 

134856; BE281128; Hs.9030; TONDU; blad; CTL+s.m. 
134868; AB020689; Hs.90419; KIAA0882 protein; breast; diag 
134924; BE294029; Hs.279903; Ras homolog enriched in brain; breast; mAb 
134972; AL033527; Hs.169252; v-myc avian myelocytomatosis v; ovar; CTL+s.m. 
134975; R50333; Hs.92186; Leman coiled-coil protein; breast; diag 
134989; AW968050 Hs.E 31; r i lix ,.1-cleoside diphosphate ; colon; diag 
135073; W55956; Hs.94030; Homo sapiens mRNA; cDNA DKFZp5; angio; diag 
135117; W52493; Hs.94694; Homo sapiens cDNA FLJ10561 fis; breast; diag 
135166; AA135867; Hs.280858; ESTs, Highly similar to A35551; pros; diag 
1 3 W29824 r r e I n i 
13524?; AI583137; Hs.9700; cyclin E1; ovar; CTL+s.m. 
135243; BE463721; Hs.97101; putative G protein-coupled rec; colon; mAb+s.m. 
135255; Y13645; Hs.97234; uroplakin 2; blad; mAb 
135309; AI564123; Hs.355689; ADP-ribosylation factor-like 5; pros; diag 
135315; H81136; Hs.334604; Homo sapiens mRNA for KIAA1870; pros; diag 
135389; U05237. Hs 9") 7 fetal Alzhe'n it i antigen, pros, breast, colon, CTL+s.m. 
135400; X78592; Hs.99915; androgen receptor (dihydrotest; pros; mAb+s.m. 
300254; AW183618; Hs.55610; solute carrier family 30 (zinc; pros, breast; mAb+s.m. 
300256; AW591433; Hs.298241; Transmembrane protease, serine; breast, colon, lung, o 
300318; AW444502; Hs.256982; ESTs, Weakly similar to NEL1_H; angio; CTL+diag 
300605; AI218847; Hs.162670; ESTs; pros; diag 

300921; AF146747; Hs.232165; polycythemia rubra vera 1; cel; pros; mAb+s.m. 

300923; AW136372; Hs.1852; acid phosphatase, prostate; pros; s.m. 

301042; AI659131 Hs.366( hyp. the al protein MGC2843; pros; mAb 

301043; AI160316; Hs.149155; voltage-dependent anion channe; pros; mAb+s.m. 

301050; AW136973; Hs.362915; ESTs, Weakly similar to S69890; colon, lung; CTL+s.m. 

301341; AA887801; Hs.208229; G protein-coupled receptor; breast, lung; mAb+s.m. 

302001; AB020711; Hs.374965; KIAA0904 protein; breast; CTL+s.m. 

302005; BE252922; Hs.123119; MAD (mothers against decapenta; pros; diag 

302067; BE542706; Hs.222399; CEGP1 protein; breast; diag 

302167; NM_006227; Hs.283007; phospholipid transfer protein; pros; mAb 

302225; NM.0C7231 Hs.16221 1 t o nriei family 6 .n-uro pane- mAt+s.m. 

302275; AW057736; Hs.323910; HER2 receptor tyrosine kinase ; breast; mAb+s.m. 

302290; AA179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp5; pros, breast; diag 

302372; AL117406; Hs.335891; ATP-binding cassette transport; breast, pros; mAb+s.m. 

302384; AI678069; Hs.202676; synaptonemal complex protein 2; breast, cerv; diag 

302410; NM_004917; Hs.218366; kallikrein 4 (prostase, enamel; pros; diag 

302468; AL133551; Hs.380155; DKFZP434B061 protein; pros; diag 

302562; BE149762; Hs.48956; gap junction protein, beta 6 (; lung, blad; mAb 

302881; AA508353; Hs.105314; relaxin 1 (H1); pros; diag 

303295; AA205625; Hs.208067; ESTs; blad; diag 

303380; AW962764; Hs.303171; olfactory receptor, family 51,; pros; mAb 

303506; AA340605; Hs.106887; ESTs, Weakly similar to Homolo; pros, breast, colon; diag 

303699; BE143707; Hs.19525; hypothetical protein FLJ22794; pros; diag 

303753; AW503733; Hs.9414; KIAA1488 protein; pros, breast, colon; CTL+s.m. 

305503; AA759177; Hs.298148; ESTs, Weakly similar to KIAA05; pros; diag 

306273; AA936290; ; gb:on70a01.s1 Soares_NFL_T_GBC; pros; diag 

306676; AI005603; ; gb:ov15o10.s1 NCI CGAP GC3 Hom; lung; diag 

306840; AI077477; Hs.307912; ESTs; angio; diag 

309177; AI951118; Hs.326736; Homo sapiens breast cancer ant; breast, pros; mAb+CTL 
309583; AW170035; Hs.326736; Homo sapiens breast cancer ant; breast; mAb+CTL 
309931; AW341683; Hs.343663; gb:hd13d01jc1 Soares NFL_T GBC; lung; mAb 
310382; AI734009; Hs.127699; KIAA1603 protein; pros; diag 
310431; AI420227 h 605 ESI \ lysimil t A4 010; pros; diag 

7 W 192 30 H 15314; ESTs, pi fag 
310636; AI814373; Hs.164175; ESTs; lung; diag 
310781; AI380797; Hs.158992; ESTs; breast; diag 
310955; AI476732; Hs.263912; ESTs; breast, angio; diag 
311034; BE567130; Hs.311389; ESTs, Highly similar to NKGD_H; lung; mAb+s.m. 
311166; AI821005; Hs.113599; intron of: EFF9 (GDNFRa).; breast; diag 
111251; AI355662;H 1 I7G93 ESTs iro: diag 
311557; AF200492; Hs.211238; interleukin-1 homolog 1; lung; diag 
311596; AI682088; Hs.79375; single-minded (Drosophila) hom; pros; CTL 
31163C AI915444 I 172 T li li i g war pane headnk, mAb+diag+s.m. 
311877; AA084248; Hs.372651; G protein-coupled receptor 39; ovar, angio, glio; mAb+s.m. 
311911; R19175; Hs.169793; ribosomal protein L32; pros; diag 
11928; TG221G; II o E c Ji i 
312182; T94344; Hs.326263; ESTs; pros; diag 
312252; AI128388; Hs.143655; ESTs; blad; diag 
312319; AA906997; Hs.180780; TERA protein; colon; CTL+s.m. 
312521; AI263307; Hs.356901; H2B histone family, member L; pros, breast, lung; diag 
312544; AA516420; Hs.352340; ESTs, Weakly similar to I38022; breast; diag 
312742; AI650363; Hs.116462; ESTs; colon; diag 
312795; AW975014; Hs.26; ferrochelatase (protoporphyria; pros; s.m. 
312857; BE083868; Hs.126914; KIAA1430 protein; colon, pros; CTL+s.m. 
312922; AA329256; Hs.378739; ESTs, Moderately similar to al; pros; diag 
313328; AW449211; Hs.105445; GDNF family receptor alpha 1; breast; mAb+s.m. 
313513;AV\ 8600; Hs.64313; ESI V, kly similar to S595C1 o; mAb+s.m 
313556; AA628517; Hs.118502; ESTs; angio; diag 
313665; AW751201; Hs.120932; ESTs; angio; diag 
313774; AI916058; Hs.144583; ESTs; colon; CTL 
313915; C18863; Hs.163443; intron of periostin (OSF-2os); breast; diag 
313978; AI870175; Hs.13957; ESTs; angio; diag 
314078; AW129357; Hs.329700; ESTs; breast; diag 
314097; AA648744; Hs.269493; ESTs; breast; diag 
314121; AI732083; Hs.187619; ESTs; pros, breast; diag 
314171; AI821895; Hs.193481; ESTs; pros; diag 

314506; AA833655; Hs.206868; Homo sapiens cDNA FLJ14056 fis; breast; diag 
314547; AA399272; Hs.144341; ESTs; breast; diag 

314558; AI873274; Hs.370280; ESTs; breast, pros; diag 

314589; AK001432; Hs.153408; Homo sapiens cDNA FLJ10570 fis; lung, blad; diag 

314691; AW207206; Hs.355962; ESTs; breast, pros; diag 
314785; AI538226; Hs.32976; guanine nucleotide binding pro; colon, pros; diag 
314907; AW971082; Hs.222336; ESTs, Weakly similar to TRHY_H; pros; diag 

315006; AI538613; Hs.298241; Transmembrane protease, serine; breast, colon, lung, ovar; mAb+diag+s.m. 

i1 03 AI49304G M 14G133 E T olon diag " 

315051; AW292425; Hs.153484; ESTs; breast, pros, blad; diag 

315052; AA876910; Hs.134427; ESTs; pros, breast; diag 
315196; AI367347; Hs.44898; Homo sapiens clone TCCCTA00151; breast; diag 

315368; AB037745; Hs.104696; KIAA1324 protein; pros; diag 

315108; AW273261; Hs.215292; ESTs; pros; diag 

315634; AA837085; Hs.372254; ESTs; breast, pros; diag 

315720; AA292998; Hs.163900; ESTs; blad; diag 
316177; AI904982; Hs.293102; downstream of breast cancer an; breast; mAb+CTL 

316442; AA760894; Hs.125350; ESTs; pros; diag 

316580; AA938198; Hs.146123; poly(A) polymerase gamma; breast, angio; s.m. 

316886; AA836331 Hs.17C 161; ESTs breast ; diag 

316943; AW014875; Hs.137007; ESTs; blad; diag 
317079; BE159984; Hs.125395; ESTs; blad; mAb+s.m. 

317140; AA885430; Hs.201925; Homo sapiens cDNA FLJ13446 fis; breast; diag 

317224; X73608; Hs.93029; sparc/osteonectin, cwcv and ka; pros, angio; diag 

317548; BE568568; Hs.159066; ESTs; pros; CTL+s.m. 

317803; AW664964; Hs.128899; ESTs; breast, lung, ovar, pros; mAb+s.m. 
317881; AI827248; Hs.224398; Homo sapiens cDNA FLJ11469 fis; breast, lung; diag 

318240" AI08' 377 h 143610; ESTs; lung; diag 

318524; AK001050; Hs.159066; hypothetical protein FLJ10188; pros, colon; CTL+s.m. 
318532; AW139377; Hs.127179; cryptic gene; panc; diag 
318744; AI793124; Hs.144479; ESTs; breast; diag 
318754; W21423; Hs.44222; CGI-90 protein; pros; diag 
319080; AW9670",, i I' 23023; ESTs; pros, diag 
319795; AB037821; Hs.146858; protocadherin 10; pros, glio; mAb+s.m. 
320066; BE305242 16( idir colon pane; diag 
320167; AA984373; Hs.90790; Homo sapiens cDNA: FLJ22930 fi; breast, pros; diag 
320203; AL049227; Hs.124776; downstream of cadherin 6 (by 3; renal, ovar; mAb+s.m. 
320211; AL039402; Hs.125783; DEME-6 protein; breast, pros; CTL 
320322; AF077371 Hs.139322 n i| lin , h pr< t™ 3 hng, diag 
320324; AF071202; Hs.139336; ATP-binding cassette, sub-fami; pros; mAb 
320561; AF085808; Hs.159330; uroplakin 3; pros, blad; diag 
43 "20590 o 39; Hi I tivat pros' mAb+s m 

320635; N50617; Hs.80506; small nuclear ribonucleoprotei; angio, lung; diag 
320736; AA315361; Hs.170195; bone morphogenetic protein 7 (; ovar; mAb+diag 
320796; AK001541; Hs.31218; secretory carrier membrane pro; pros, colon; diag 
320896; BE019924; Hs.271580; uroplakin 1B; lung, blad, ovar, headnk; mAb+diag 
M) 321023 AvV29'316 H r 2560 ESTs; colon; diag 

321107; AI732643; Hs.144151; downstream of breast cancer an; breast; mAb+CTL 
321412; AI674383; Hs.22691; solute carrier family 7 (catio; pros; mAb+s.m. 
321441; AF107493; Hs.201675; Homo sapiens LUCA-15 protein m; pros, breast; diag 
321644 7 ] ( ESTs; breast pros ' diag 

321717; AW956580; Hs.42699; ESTs; angio; diag 

321906; AW270608; Hs.170195; bone morphogenetic protein 7 (; ovar; mAb+diag 
321911; AF026944; Hs.293797; ESTs; angio, lung, blad; diag 
322035; AL137517; Hs.306201; hypothetical protein DKFZp5S40; breast, blad; mAb 
322521; AF147347; ; gb:Homo sapiens full length In; breast; diag 
322706; AA018899; Hs.127179; cryptic gene; panc; diag 
322782; AA056060; Hs.202577; Homo sapiens cDNA FLJ12166 fis; pros; diag 
322818; AW043782; Hs.293616; ESTs; pros, breast, angio, glio; diag 

322882; AW248508; Hs.279727; Homo sapiens cDNA FLJ14035 fis; breast, lung, ovar, angio, blad; diag 
322975; C15391; ; intron of breast cancer antige; breast; mAb+CTL 
323168; AL120862; Hs.124165; programmed cell death 9 (PDCD9; breast; diag 
323226; AF055019; Hs.355279; Homo sapiens clone 24670 mRNA; pros; diag 
323262; AL133990; Hs.190642; CEGP1 protein; breast, pros, blad; diag 
323287; AA639902; Hs.104215; ESTs, Moderately similar to SP; pros; diag 
323332; AI629520; ; gb:wl19c06.x1 NCI CGAP Ut1 Hom; breast; diag 
323335; AI655499; Hs.161712; ESTs; pros, breast; mAb 
323587; AI299709; Hs.131886; Homo sapiens cDNA: FLJ22113 fi; colon; diag 
323817; AA410943; ; NAME OMITTED ... receptor kinase; breast; mAb 
324261; BE069341; ; gb:QV3-BT0381-270100-073-c08 B; breast; diag 
324295; AA434579; Hs.143691; ESTs; pros; diag 
324338; AA927668; Hs.145078; regulator of differentiation (; colon; diag 

^°! 8 '' Hs - 335798 ; H °™ sapiens cDNA: FU23241 fi; pros, coin; diag 
324432; AA454510, Hs 1 ^ 31 Ejl ^ st lung, pane; diag 
324603; AW993522; Hs. r K, [S p breast, diag 
324617; AA508552; Hs.222874; ESTs, Weakly similar to I38022; pros; diag 
324658; AI694767; Hs.129179; Homo sapiens cDNA FLJ13581 fis; pros; diag 
324718; AI557019 H< 11 167; small ni ar protein PRAC; colon, pros; diag 
£M2!I£ 4 H » Small p ^nch protein SPR; lung L dial 
324871; AI890347; Hs.271923; Homo sapiens cDNA: FLJ22785 fi; colon; diag 
324987; AI375572; Hs.172534; ESTs; breast; diag 
325372; ; ; Phase 2 & 3 Exons; breast; CTL+s.m. 
325544; ; ; Phase 2 & 3 Exons; breast, angio; diag 
327036; ; ; Phase 2 & 3 Exons; lung, angio; diag 
327075; ; ; Phase 2 & 3 Exons; breast, lung; diag 
327414; ; ; Phase 2 & 3 Exons; angio; diag 
328700; ; ; Phase 2 & 3 Exons; breast, angio; diag 
330211; ; ; Phase 2 & 3 Exons; pros; CTL+s.m. 

330468; L10343; Hs.112341; protease inhibitor 3; skin-der; lung, colon, blad; diag 
330493; M27826; Hs.334372; endogenous retroviral protease; lung, colon; s.m. 

330630; NM_002902; Hs.79088; reticulocalbin 2; EF-hand calc; pros; diag 

330762; AW407332; Hs.13014; ADP-ribosylation factor GTPase; pros; CTL+s.m. 

330790; AI660243; Hs.318545; Hu01 Chip Redos; pros, blad; diag 

330814; AI955040; Hs.265398; PAR-6 beta (partitioning def; breast; diag 
330827; AI961486; Hs.219196; ESTs; lung, uter; diag 

330892; AF109298; Hs.118258; prostate cancer associated pro; pros; diag 

331014; AW770994; Hs.30340; hypothetical protein KIAA1165; colon; diag 
l r1 i- 1 I 602 ESTs, pros, breast; diag 

331183; T40769; Hs.8469; ESTs; colon; diag 
331237; W87874; Hs.25277; Homo sapiens cDNA FLJ10717 fis; angio; diag 

331490; AF216751; Hs.26813; CDA14; pros; diag 

331578; AI246482; Hs.243010; ESTs; angio; diag 

331614, N9229" I- .0, r L S Modp ately similar to ALU: breast; diag 

331811; AW885727; Hs.9914; Hu01 Chip Redos; lung; diag 
331889; AA677677; Hs.380213; Homo sapiens Chromosome 16 BAC; pros; diag 

331969; AA526911; Hs.82772; collagen; type XI; alpha 1; breast, lung; CTL 

332180; AF134160; Hs.7327; claudin 1; lung; mAb 

332247; AA669097; ; ESTs; pros, breast; diag 

332396; AW579342; Hs.380730; hypothetical protein FLJ10697; pros; diag 
332453; L42583; Hs.334309; Hu01 Chip Redos; lung; diag 

332466; AB018259; Hs.118140; KIAA0716 gene product; angio; diag 
332530; M31609; Hs.1735; inhibin; beta B (activin AB be; ovar, pros; diag 

332535; AF167706; Hs.19280; cysteine-rich repeat-containin; angio; diag 

332640; BE568452; Hs.344037; protein regulator of cytokines; blad, headnk; diag 
332645; AA284371; Hs 8064; ESTs; breast, colon; diag 

332686; X69699; Hs.73149; paired box gene 8; ovar; CTL+s.m. 

332697; X51405; Hs.75360; carboxypeptidase E; pros; diag 

332740; BE409669; Hs.286241; Homo sapiens cDNA: FLJ22698 fi; pros; diag 

332798; ; ; C22CC0007:gi|12314195|emb|CAB9; pros, breast; diag 
333769; ; ; NM_005940*:Homo sapiens matrix; breast, colon, lung; mAb+diag+s.m. 

333904; ; ; Chromosome 22; pros; diag 

334223; ; ; NM_005080*:Homo sapiens X-box ; pros, breast; diag 

334447; ; ; NM_012429*:Homo sapiens SEC14 ; pros; diag 

335115; ; ; NM_006498*:Homo sapiens lectin; pros; CTL+s.m. 
45 335809 NM_01450<-*Homosapien' en: b si FI rr 

335824; ; ; ENSP00000249072*:DJ222E13.1 (N; breast, pros; CTL+s.m. 

335825; ; ; ENSP00000249072*:DJ222E13.1 (N; breast; diag 

335936; ; ; Chromosome 22; lung; diag 

336034; ; ; NM_007172:Homo sapiens nucleop; breast, angio; CTL+s.m. 
336152; ; ; NM_014246:Homo sapiens cadheri; breast; mAb 

336636; ; ; C22000024*:gi|10645308|gb|AAG2; lung, breast; CTL+s.m. 

338008; ; ; NM_005940*:Homo sapiens matrix; lung, breast, colon; mAb+diag+s.m. 

338033; ; ; Chromosome 22; lung, angio; diag 

338158; ; ; NM_012399*:Homo sapiens phosph; lung, angio; diag 
338255; ; ; NM_014323*:Homo sapiens zinc f; pros, breast, colon; CTL+s.m. 

400195; ; Hs.42650; NM_007057*:Homo sapiens ZW10 i; lung; CTL+s.m. 

400269; ; Hs.253495; Eos Control; fibro; diag 

400285; ; ; Eos Control; lung; diag 

400287; S39329; Hs.181350; kallikrein 2, prostatic; pros; diag 
400288; X06256; Hs.149609; integrin, alpha5 (fibronectin; panc, pros, angio, blad, lung; mAb+s.m. 

400290; H18836; Hs.31608; hypothetical protein FLJ20041; pros, colon, EWS; mAb 

400294; N95796; Hs.278695; Homo sapiens prostein mRNA, co; pros, pros; mAb 

400295; W72838; Hs.348419; AI905687:IL-BT095-190199-O19 B; breast; diag 
400298; AA032279; Hs.61635; six transmembrane epithelial a; panc, lung, headnk, stom, EWS, ov 
400328; X87344; ; transporter 2, ATP-binding cas; lung; mAb+s.m. 

400409; AF153341; ; Homo sapiens winged helix/fork; blad; CTL+s.m. 

400419; AF084545; ; Target; lung, sarc; diag 

400440; X83957; Hs.83870; nebulin; sarc; diag 

400494; ; ; ENSP00000238970*:CIG30 (Fragme; angio; mAb 
400517; ; ; lengsin; stom, cerv, uter, lung, pros, colon, hepC; diag 

400651; ; ; ENSP00000228031*:COPPER CHAPER; sarc; s.m. 

400665; ; ; NM_002425:Homo sapiens matrix; lung; mAb+diag+s.m. 

400773; ; ; NM_003105*:Homo sapiens sortil; blad; mAb 

400844; ; ; NM_003105*:Homo sapiens sortil; blad; s.m. 
400846; ; ; sortilin-related receptor, L(D; blad; mAb+s.m. 

400881; ; ; NM_025080:Homo sapiens hypothe; ovar; diag 

401093; ; ; C12000586*:gi|6330167|dbj|BAA8; blad, lung; CTL+s.m. 

401234; ; ; mitogen-activated protein kina; angio; diag 

401424; ; ; NM_001172:Homo sapiens arginas; pros; s.m. 
401486; ; ; C4000647*:gi|4758508|ref|NP_00; headnk; mAb 

401704; ; ; NM_021195*:Homo sapiens claudi; test; mAb 

401732; ; ; NM_001176*:Homo sapiens Rho GD; panc; diag 

401747; ; ; Homo sapiens keratin 17 (KRT17; blad, lung, headnk, mela; diag 
401760; ; ; Target Exon; blad, lung, headnk, esoph; diag 

401780; ; ; NM_005557*:Homo sapiens kerati; lung, blad, headnk, esoph, mela; diag 

401781; ; ; Target Exon; lung, blad, headnk, esoph, cerv; diag 

401785; ; ; NM_002275*:Homo sapiens kerati; lung; diag 

401797; ; ; Target Exon; sarc; diag 

401994; ; ; Target Exon; lung; diag 

402145; ; ; Target Exon; test; CTL+s.m. 

402199; ; ; Target Exon; test; CTL+s.m. 

40223I Fgene h pr Jcted CYTOCHROME; blad; diag 

402239; ; ; Target Exon; blad; diag 

402260; ; ; NM_001436*:Homo sapiens fibril; blad; CTL+s.m. 

402265; ; ; Target Exon; lung; diag 

402305; ; ; C19000735*:gi|4508027|ref|NP_0; blad; CTL+s.m. 

402420; ; ; C1000823*:gi|10432400|emb|CAC1; lung; diag 

402424; ; ; NM_024901:Homo sapiens hypothe; blad; CTL+s.m. 

402447; ; ; C1000201:gi|204416|gb|AM02627; esoph; mAb 

402474; ; ; NM_004079:Homo sapiens catheps; lung, colon, stom, fibro; diag 

402550; ; ; Target Exon; fibro; diag 

402604; ; ; Target Exon; glio; diag 

402605; ; ; Target Exon; glio; diag 

402606; ; ; NM_024626:Homo sapiens hypothe; ovar, breast; mAb 

402680; ; ; Target Exon; test; mAb 

402777 31002652* | r M327|sp|Q04799|; blad; diag 

402860; ; ; ENSP00000239210:DJ50O24.4 (nov; mela; CTL+s.m. 

402888; ; ; Target Exon; sarc; diag 

402992; ; ; Target Exon; sarc; diag 

402994; ; ; NM_002463*:Homo sapiens myxovi; esoph; diag 

403046; ; ; NM_005656*:Homo sapiens transm; pros; mAb 

403047; ; ; NM_005656*:Homo sapiens transm; pros, blad, colon; mAb 

403071; ; ; NM_003319*:Homo sapiens titin ; sarc; diag 

403088; ; ; NM_003319*:Homo sapiens titin ; sarc; diag 

403171; ; ; C2001472*:gi|5809678|gb|AAB418; test; diag 

403328; ; ; Target Exon; mela; diag 

403329; ; ; unnamed protein product [Homo ; lung; diag 

403381; ; ; ENSP00000231844*:Ecotropic vir; blad; CTL+s.m. 

403409; ; ; NM_005929:Homo sapiens antigen; mela; mAb 

403433; ; ; NM_001622:Homo sapiens alpha-2; hepC; diag 

403478; ; ; NM_022342:Homo sapiens kinesin; lung; CTL+s.m. 

403715; ; ; Target Exon; lung; diag 

403740; ; ; NM_001076*:Homo sapiens UDP gl; pros, hepC; s.m. 
403776; ; ; ENSP00000226542*:Small inducib; panc; diag 
403903; ; ; C5001632*:gi|10645308|gb|AAG21; blad; CTL+s.m. 
404029; ; ; NM_018936*:Homo sapiens protoc; glio; mAb 
I ' 40 NMJ 1 89 7*:Hom p n p oio- lio 1 b 

404210; ; ; NM_005936:Homo sapiens myeloid; panc, uter, cerv, lung, ovar, pros, colon, stom; diag 

404240; ; ; NM_018950:Homo sapiens major h; fibro; mAb 

404253; ; ; NM_021058*:Homo sapiens H2B hi; lung; CTL+s.m. 

404286; ; ; C6001909:gi|704441|dbj|BAA1890; panc; diag 

404296; ; ; C6001238*:gi|121715|sp|P25697|; lung; s.m. 

404403;;; Target Exon; blad; diag 

404440; ; ; NM_02104ftHomo sapiens melanom; lung, blad; mAb+CTL 

404866; ; ; ENSP00000251112*:Sodium/potass; panc; s.m. 

404877; ; ; NM_005365:Homo sapiens melanom; lung, blad; CTL+s.m. 

404927; ; ; Target Exon; lung, headnk; diag 

404996; ; ; Target Exon; lung, headnk, esoph; diag 

405001; ; ; interleukin enhancer binding f; sarc; diag 

405025; ; ; Homo sapiens bone morphogeneti; angio; diag 

405121; ; ; mitogen-activated protein kina; angio, renal; s.m. 

405238; ; ; Target Exon; glio; diag 

405239; ; ; oxidative 3 alpha hydroxystero; glio; s.m. 

405451; ; ; Homo sapiens glutaminyl-peptid; mela; s.m. 

405545; ; ; Target Exon; cerv; mAb 

405546; ; ; NM_018833*:Homo sapiens transp; cerv; mAb 

405547; ; ; NM_018833*:Homo sapiens transp; cerv, mela; mAb 

405646; ; ; C12000200:gi|4557225|ref|NP_00; lung; diag 

405704; ; ; NM_001844*:Homo sapiens collag; sarc; diag 

405770; ; ; NM_002362:Homo sapiens melanom; lung, esoph; mAb+CTL 

405849; ; ; Target Exon; panc; diag 

405932; ; ; C15000305:gi|3806122|gb|AAC691; blad, lung, headnk, cerv; CTL+s.m. 

406081; ; ; Target Exon; blad; diag 

406137; ; ; NM_000179*:Homo sapiens mutS (; lung; CTL+s.m. 

406173; ; ; ENSP00000250148*:Growth hormon; panc; CTL+s.m. 

406348; ; ; Target Exon; breast; CTL+s.m. 

406360; ; ; Target Exon; lung, headnk; diag 

406399; ; ; NM_003122*:Homo sapiens serine; blad; diag 

406434; ; ; NM_030579*:Homo sapiens cytoch; blad; diag 

406467; ; ; Target Exon; lung, headnk, blad; diag 

406506; ; ; Target Exon; angio; diag 

406547; ; ; Target Exon; test; diag 

406627; T64904; Hs.163780; ESTs; angio; CTL+s.m. 

406671; AA129547; Hs.285754; met proto-oncogene (hepatocyte; panc; mAb 

406672; M26041 Hs.19 r " m is * i ipa bility< up! fibro mAb 

406685; M18728; ; gb:Human nonspecific crossreac; panc, colon, blad, headnk, stom, lung; mAb+CTL 
406687; M31126; Hs.352054; matrix metalloproteinase 11 (s; breast, lung, ovar, cerv, uter, panc, esoph, mela, sarc; mAb+diag+s.m. 
406690; M29540; Hs.220529; carcinoembryonic antigen-relat; lung, headnk, panc, stom, blad, colon, cerv; mAb+CTL 
406706; X03740; Hs.231581; myosin, heavy polypeptide 1, s; sarc; diag 
406850; AI624300; Hs.172928; collagen, type I, alpha 1; sarc; CTL+s.m. 
406906; Z25424; ; gb:H.sapiens protein-serine/th; blad, lung; s.m. 
406937; U14622; ; gb:Human transketolase-like pr; test; s.m. 
406967; M24349; ; gb:Human parathyroid hormone-l; lung; CTL+s.m. 
406974; M57293; ; gb:Human parathyroid hormone-r; lung, blad; diag 
407013; U35637; Hs.83870; gb:Human nebulin mRNA, partial; sarc; diag 
407034; U84540; ; gb:Human dystrobrevin isoform ; glio; diag 

407103; AA424881; Hs.256301; hypothetical protein MGC13170; pros; diag 
407118; AA156790; Hs.262036; ESTs, Weakly similar to 2223_H; pros; diag 
407122; H20276; Hs.31742; ESTs; pros; diag 

407137; T97307; ; gb:ye53h05.s1 Soares fetal liv; lung, blad, ovar, pros, panc, headnk; diag 
407168; R45175; Hs.117163; ESTs; pros, breast, colon; diag 

407178; AA195651; Hs.352312; AP-2 beta transcription factor; breast; CTL+s.m. 
407202; N58172; Hs.109370; ESTs; pros; diag 
40721G H9177 II lysyl oxidase p^nc dag 

407242; M18728; ; gb:Human nonspecific crossreac; panc, colon, blad, headnk, stom, lung, ovar, cerv; mAb 
407244; M10014; ; fibrinogen, gamma polypeptide; lung; diag 

407245; X90568; Hs.172004; titin; sarc; diag 

407251; U67611; Mm.29182; transaldolase 1; pros; s.m. 

407252; AA659037; Hs.163780; ESTs; angio; CTL+s.m. 
407276; AI951118; Hs.326736; Homo sapiens breast cancer ant; breast; mAb+CTL 
407289; AA135159; Hs.203349; Homo sapiens cDNA FLJ12149 fis; lung; diag 

407366; AF026942; Hs.17518; gb:Homo sapiens cig33 mRNA, pa; ovar, hepC, stom, mela, esoph; diag 

407581; R48402; Hs.173508; P3ECSL; blad; CTL+s.m. 

407601; AC002300; Hs.37129; sodium channel, nonvoltage-gat; blad; mAb 

407619; AL050341; Hs.37165; collagen, type IX, alpha 2; sarc; diag 
407634; AW016569; Hs.136414; UDP-GlcNAc:betaGal beta-1,3-N-; lung, headnk; s.m. 

407710; AW022727; Hs.23616; ESTs; test; diag 

407720; AB037776; Hs.38002; KIAA1355 protein; lung; mAb 

407746; AK001962; Hs.38114; hypothetical protein FLJ11100; lung; diag 

407756; AA116021; Hs.38260; ubiquitin specific protease 18; panc, lung, esoph, fibro, mela; CTL+s.m. 
407758; D50915; Hs.38365; KIAA0125 gene product; lung; diag 

407777; AA161071; Hs.71465; squalene epoxidase; panc, esoph; s.m. 

407782; AA608956; Hs.112619; ESTs, Moderately similar to PU; lung; diag 

407786; AA687538; Hs.38972; tetraspan 1; pros, colon, uter, stom, ovar, cerv; mAb 

407788 BC |J1 lls.38991 I i n tindrnq protein A; headnk, pane, blad, lung, fibre; diag 
407818; AL021938; Hs.40154; jumonji (mouse) homolog; test; CTL+s.m. 

407824; AA147884; Hs.9812; Homo sapiens cDNA FLJ14388 fis; sarc; diag 

407839; AA045144; Hs.161566; ESTs; blad, headnk; mAb 

407846; AA426202; Hs.40403; Cbp/p300-interacting transacti; mela; diag 

407853; AA336797; Hs.40499; dickkopf (Xenopus laevis) homo; colon, stom, renal, breast, ovar, uter, cerv; diag 
407856; AA045281; Hs.266175; phosphoprotein associated with; mela; diag 

407872; AB039723; Hs.40735; frizzled (Drosophila) homolog ; ovar; mAb 

407881; AW072003; Hs.40968; heparan sulfate (glucosamine) ; panc; s.m. 

407910; AA650274; Hs.41296; fibronectin leucine rich trans; fibro; mAb 

407944, R34008 H 2 7 lesm olli. lun Hn, esoph nAb 
407949; W21874; Hs.247057; ESTs, Weakly similar to 210926; fibro, blad; diag 

407952; A'133530; Ms.02930; ESTs, Weakly similar to S59501; angio; mAb+S.m. 

408000; L11690; Hs.198689; bullous pemphigoid antigen 1 (; breast, pros, blad, lung, headnk, cerv, esoph; mAb+CTL 

408015; AW136771; Hs.244349; epidermal differentiation comp; mela, sarc; diag 

408045; AW138959; Hs.245123; ESTs; breast; diag 
408056; AA312329; Hs.42331; ephrin-A4; ovar; diag 

408063; BE086548; Hs.381047; calcineurin-binding protein ca; pros, lung; diag 

408081; AW451597; Hs.167409; intron of basic helix-loop-he; ovar, glio; diag 

408101; AW968504; Hs.278346; CDC2-related protein kinase 7; breast, lung, stom; s.m. 

408122; AI432652; Hs.42824; hypothetical protein FU10718; lung; diag 
408209; NM_004454; Hs.43697; ets variant gene 5 (ets-relate; mela; CTL+s.m. 

408296; AL117452; Hs.44155; DKFZP586G1517 protein; angio; diag 

408308; AL033377; Hs.44197; hypothetical protein DKFZp564D; panc, renal, colon; mAb 

408353; BE439838; Hs.44298; mitochondrial ribosomal protei; lung; diag 

408430; S79876; Hs.44926; dipeptidylpeptidase IV (CD26, ; pros; mAb 
408522; AI541214; Hs.46320; Small proline-rich protein SPR; lung, blad, headnk, esoph, cerv; diag 

408561; AI308037; Hs.84120; hypothetical protein MGC13016; mela; CTL+s.m. 

408570; AL046406; Hs.103483; KIAA1798 protein; angio; CTL+s.m. 

408572; AA055611; Hs.226568; ESTs, Moderately similar to AL; lung; diag 

408591; AF015224; Hs.46452; mammaglobin 1; breast, cerv; diag 
408611; NM_004367; Hs.46468; chemokine (C-C motif) receptor; mela; mAb 

408633; AW963372; Hs.222088; PRO2000 protein; blad, lung, headnk, pros; diag 

408660; AA525775; Hs.89040; ESTs, Moderately similar to PC; ovar, panc, pros, esoph, sarc; diag 

408728; AL137379; Hs.47125; hypothetical protein FLJ13912; test; diag 

408758; NM_003686; Hs.47504; exonuclease 1; mela; CTL+s.m. 
408770; AW270608; Hs.170195; bone morphogenetic protein 7 (; ovar; mAb+diag 

408771; AW732573; Hs.47584; potassium voltage-gated channe; lung; mAb 

408780; D31797; Hs.652; tumor necrosis factor (ligand); leuk; diag 

408795; AW749126; Hs.170345; hypothetical protein FLJ13710; ovar; diag 

408826; AF216077; Hs.48376; Homo sapiens clone HB-2 mRNA s; panc, pros; diag 
8 0 408833 AV 6122 - M ESTs p os diag 

408877; AA479033; Hs.130315; ESTs, Weakly similar to A47582; breast; diag 

408915; NM_016651; Hs.48950; heptacellular carcinoma novel ; panc, sarc; diag 

408930; AA146721; Hs.334686; hypothetical protein FLJ21588; blad; CTL+s.m. 

408962; BE386436; Hs 
408992; AA059325; Hi 

408996; AI979168; Hs.82226; glycoprotein (transmembrane) n; mela; mAb+s.m. 
409012; AL117435; Hs.49725; DKFZP434I216 protein; sarc; CTL+s.m. 
409038; T97490; Hs.50002; small inducible cytokine subfa; mela; diag 
409051; AA080912; ; gb:zn04d03.r1 Stratagene hNT n; pros; s.m. 
409077; AA063037; Hs.66803; ESTs; lung; diag 
409093; BE243834; Hs.50441; CGI-04 protein; lung; diag 
409123; AA063403; ; gb:zm04d12.s1 Stratagene corne; pros; s.m. 
409142; AL136877; Hs.50758; SMC4 (structural maintenance o; ovar, lung, mela; diag 
409153; W03754; Hs.50813; hypothetical protein FLJ20022; fibro; diag 
409200; AL042914; Hs.51039; KIAA0076 gene product; sarc; CTL+s.m. 
409203; AA780473; Hs.687; cytochrome P450, subfamily IVB; fibro, blad, ovar; diag 
409228; R16811; Hs.22010; ESTs, Weakly similar to 210926; lung; mAb 

409231; AA446644; Hs.692; GA733-2 antigen; epithelial gl; pros, ovar, breast, uter, panc, colon, stom; mAb 
409243; AB037761; Hs.51743; KIAA1340 protein; test; diag 
409262; AK000631; Hs.52256; hypothetical protein FU20624; pros; CTL+s.m. 
409264; NMJ14937; Hs.52463; KIAA0963 protein; mela; CTL+s.m. 

'™"3;Hs.22972; steroid 5 alpha-redur se2-l bres ...ar, lung, pane, uter; mAb 
>2; Hs.53563; collagen, type IX, alpha 3; colon, pane, sarc; CTL+s.m. 
409340; BE174529; Hs.321130; hypothetical protein MGC2771; mela; CTL+s.m.
409342; AU077058; Hs.54089; BRCA1 associated RING domain 1; test; CTL+s.m. 
409348 AMC1 C 35 Hs.11 1 nal glio diag 

409361; NM_005982; Hs.54416; sine oculis homeobox (Drosophi; blad, lung, pros; CTL+s.m.

409389; AB007979; Hs.301281; Homo sapiens mRNA, chromosome ; glio; diag 

409395; U46745; Hs.336678; dystrobrevin, alpha; glio; diag 

409402; AF208234; Hs.695; cystatin B (stefin B); blad; diag

409415; AA579258; Hs.6083; Homo sapiens cDNA: FLJ21028 fi; mela; diag 

409421; AA199883; Hs.67624; ESTs; test; diag 

409430; R21945; Hs.546735; splicing factor, arginine/seri; mela; diag

409432; D49372; Hs.54460; small inducible cytokine subfa; stom, esoph; diag 

409433; AA074382; Hs.135255; ESTs; glio, sarc; diag

409509 AL03~923 lls.322710 T nqio d ag 

409512; AW979187; Hs.293591; melanoma differentiation assoc; mela, esoph; CTL+s.m.
409542; AA503020; Hs.36563; hypothetical protein FLJ22418; breast, ovar; diag
409582; R27430; Hs.271565; ESTs; lung; diag

409601; AF237621; Hs.80828; keratin 1 (epidermolytic hyper; headnk, mela, sarc; CTL+s.m. 
409633; AW449822; Hs.55200; ESTs; sarc; diag 

409637; AA323948; Hs.55407; Homo sapiens mRNA; cDNA DKFZp4; renal; diag

409638; AW450420; Hs.21335; ESTs; glio; diag

409670; AI368109; Hs.381163; KIAA1856 protein; test; CTL+s.m. 

409703; NM_006187; Hs.56009; 2'-5'-oligoadenylate synthetas; panc, esoph, mela; s.m.

409705; M37762; Hs.56023; brain-derived neurotrophic fac; lung; diag 

409719; AI769160; Hs.108681; Homo sapiens brain tumor assoc; lung; diag 

409731; AA125985; Hs.56145; thymosin, beta, identified in ; pros, sarc; CTL+s.m.

409745; AA077391; ;gb:7B14E12 Chromosome 7 Fetal ; ovar, renal; CTL+s.m. 

409935; AW511413; Hs.187393; ESTs; lung; diag 

409958; NM_001523; Hs.57697; hyaluronan synthase 1; panc, ovar; mAb

409988; N27687; Hs.334334; transcription factor AP-2 alph; mela; diag 

410006; AW732308; Hs.57783; eukaryotic translation initiat; test; diag

410037; AB020725; Hs.58009; KIAA0918 protein; pros; diag 

410044; BE566742; Hs.58169; highly expressed in cancer, ri; blad; diag 

410048; W76467; Hs.343874; proline oxidase homolog; test; s.m. 

410076; T05387; Hs.7991; ESTs; lung, pros; diag 

410079; U94362; Hs.380757; glycogenin 2; mela; diag 

410082; AA081594; Hs.158311; Musashi (Drosophila) homolog 1; pros; diag 

410102; AW248508; Hs.279727; ESI n I E U 3 [Don ovar breast, blad, lung, angio, sarc; diag
410174; AA306007; Hs.59461; DKFZP434C245 protein; mela; diag
410240; AL157424; Hs.61289; synaptojanin 2; angio; diag 



llal a; pane, pros, EWS; mAb+CTL 



410290; AA402307; Hs.322844; hypothetical protein DKFZp564A; mela; diag 
410310; J02931; Hs.62192; coagulation factor III (thromb; pros, panc; mAb
410361; BE391804; Hs.62661; guanylate binding protein 1, i; mela, esoph, hepC, fibro, uter; diag
410438; AW74801 2 H, M) lypothetical protein KIAA1 335; lung; CTL+s.m. 
410467; AF102546; Hs.63931; dachshund (Drosophila) homolog; breast, colon, uter, stom; diag
410480; R97457; Hs.63984; cadherin 13, H-cadherin (heart; angio; mAb 
410491; AA465131; Hs.64001; Homo sapiens clone 25218 mRNA ; mela, esoph; diag 
410530; M25809; Hs.64173; ATPase, H transporting, lysoso; ovar; mAb 
410553 AW016824 H 168; hypothetical | if 14P8 blad lung; diag 
410555; U92649; Hs.380136; a disintegrin and metalloprote; leuk, lung; mAb 
410561; BE540255; Hs.6994; Homo sapiens cDNA: FLJ22044 fi; lung; diag
410566; AA373210; Hs.43047; Homo sapiens cDNA FLJ13565 fis; panc; diag
410600; AW575742; Hs.351676; ESTs, Moderately similar to S6; mela; mAb+s.m. 

410681; AW246890 Hs.6 425 , i'l inrli 1 1, (28kO); lung; diag 
410687; U24389; Hs.65436; lysyl oxidase-like 1; panc; diag
410733; D84284; Hs.66052; CD38 antigen (p45); pros; mAb+CTL 
410763; AF279145; Hs.8966; hypothetical protein FLJ21776; panc; mAb
410855; X97795; Hs.66718; RAD54 (S.cerevisiae)-like; test; CTL+s.m.
410867; X63556; Hs.750; fibrillin 1 (Marfan syndrome); panc; diag
410870; U81599; Hs.66731; homeo box B13; pros; CTL+s.m.
410883; D43767; Hs.65742; CCL17 chemokine (TARC) (SCYA1; leuk; diag
410889; X91662; Hs.66744; twist (Drosophila) homolog (ac; sarc; CTL+s.m.
410929; H47233; Hs.30643; ESTs; ovar, test; diag
411078; AI222020; Hs.182364; CocoaCrisp; pros, glio, breast; diag
411089; AA456454; Hs.214291; cell division cycle 2 like 1 (; lung, fibro; CTL+s.m.
411243; AB039E66; Hs.69319; CA11; esoph; diag

411248; M551538; Hs.69321; Homo sapiens cDNA FLJ14408 fis; blad; diag
411257; AA628967; Hs.115274; Indian hedgehog protein (IHH); ovar; diag
411263; BE297802; Hs.69360; kinesin-like 6 (mitotic centra; lung, blad, headnk; CTL+s.m.
411296; BE207307; Hs.10114; growth suppressor 1; sarc; diag
411358; R47479; Hs.94761; KIAA1691 protein; mela, renal, sarc; mAb
411388; X72925; Hs.69752; desmocollin 1; headnk, mela; mAb
411393; AW797437; Hs.69771; B-factor, properdin (COMPLEMEN; ovar; diag

411558; AA102670; Hs.70725; gamma-aminobutyric acid (GABA); panc, pros, stom, breast, uter, cerv, ovar; mAb
411573; AB029000; Hs.70323; KIAA1077 protein; panc, headnk, lung, stom; diag
411579; AC005258; Hs.70830; U6 snRNA-associated Sm-like pr; lung; diag
411732; U47924; Hs.71642; guanine nucleotide binding pro; lung; diag 
41 1768; MM 0133 I H 1 terieukin 1 m. uter, cerv diag 

411789; AF245505; Hs.72157; Adlican; breast, panc, lung, stom, headnk, ovar, uter, esoph, sarc; diag

411825; AK000334; Hs.352415; solute carrier family 39 (zinc; colon, ovar; mAb
411828; AW161449; Hs.72290; wingless-type MMTV integration; ovar; diag

411869; W20027; Hs.23439; ESTs; angio; diag

411874; AA096106; Hs.20403; ESTs; blad; diag 

411880; AW872477; ; gb:hm30f03.x1 NCI_CGAP_Thy4 Ho; blad; diag 

411945; AL033527; Hs.92137; L-myc-2 protein (MYCL2); blad, ovar; CTL+s.m.
412006; AW451618; Hs.380683; ESTs; sarc; diag

412026; M383618; Hs.73073; testis-specific ankyrin motif ; test; diag 

412045; AA099802; Hs.83883; transmembrane, prostate androg; pros; mAb+s.m. 

412099; U64198; Hs.73165; interleukin 12 receptor, beta ; leuk, mela; mAb
412104; AW205197; Hs.240951; Homo sapiens, Similar to RIKEN; panc, fibro; diag
412115; AK001763; Hs.7 52 19; hypothetical protein FLJ10901; lung, blad; CTL+s.m.

412116; AW402166; Hs.784; Epstein-Barr virus induced gen; panc; mAb

412133; U83460; Hs.380728; solute carrier family 31 (copp; pros; mAb

412228; AW503785; Hs.73792; complement component (3d/Epste; mela; mAb 

412247; AF022375; Hs.73793; vascular endothelial growth fa; renal, glio, blad, colon; diag 
412265; M101325; Hs.86154; hypothetical protein FLJ12457; test; CTL+s.m.

412326; R07566; Hs.73817; small inducible cytokine A3 (h; pros, leuk; diag

412351; AL135960; Hs.73828; T-cell acute lymphocytic leuke; angio; CTL+s.m.

412420; AL035668; Hs.73853; bone morphogenetic protein 2; blad, glio, lung, stom, angio; diag
412448; L12964; Hs.73895; tumor necrosis factor receptor; leuk; mAb
40 1 1 I » ! i I growth factor; cerv, n h 

412490 AW803 r G< II / r I I •> pie Of FLJ22 28 fi mela diag 

412519; AA196241; Hs.73980; troponin T1, skeletal, slow; sarc; diag 

412530; AA766268; Hs.266273; hypothetical protein FLJ13346; blad, lung; diag

412564; X83703 Hsr M r <, n n repeat | r s nqio diag 
412580; AA113262; Hs.17901; similar to CABLES (Homo sapien; mela; diag

412610; X90908; Hs.74126; fatty acid binding protein 6, ; blad; diag 

412661; N32860; Hs.24611; ESTs, Weakly similar to I54374; blad; CTL+s.m.

412715; NM_000947; Hs.74519; primase, polypeptide 2A (58kD); pros; s.m. 

412723; AA648459; Hs.335951; hypothetical protein AF301222; lung, blad, headnk, colon, stom, uter; diag
412755; BE14430G; Hs.179891; ESTs, Weakly similar to P4IIAJ!; angio; s.m.

412811; H06382; Hs.349705; ESTs; lung; diag

412817; AL037159; Hs.74619; proteasome (prosome, macropain; lung; s.m. 

412843; AF007555; Hs.74624; protein tyrosine phosphatase, ; pros; mAb 

412856; BE386745; Hs.74631; basigin (OK blood group); mela; mAb
412926; AI879076; Hs.75061; macrophage myristoylated alani; mela; CTL+s.m.

412939; AW411491; Hs.75069; eukaryotic translation elongat; mela, renal; diag 

412970; AB026436; Hs.177534; dual specificity phosphatase 1; breast, mela; s.m. 

412986; X81120; Hs.75110; cannabinoid receptor 1 (brain); glio; mAb

413004; T35901; Hs.75117; interleukin enhancer binding f; lung; diag
413011; AW068115; Hs.821; biglycan; lung; CTL+s.m.

413049; NM_002151; Hs.823; hepsin (transmembrane protease; pros; mAb

413095; M494359; Hs.30715; potassium voltage-gated channe; panc, stom, renal, colon; mAb+s.m.

413125; BE244589; Hs.75207; glyoxalase I; pros; s.m. 
413126; AW419203; Hs.174174; ESTs; angio; diag
413129; AF292100; Hs.104613; RP42 homolog; lung; diag

413132; NM_006823; Hs.75209; protein kinase (cAMP-dependent; angio; CTL+s.m. 

413142; M81740; Hs.75212; ornithine decarboxylase 1; lung; s.m.

413163; Y00815; Hs.75216; protein tyrosine phosphatase, ; pros; mAb

413171; AA318325; Hs.75219; tyrosinase-related protein 1; mela; mAb
413190; AA151802; Hs.40368; adaptor-related protein comple; mela; diag

413219; AA878200; Hs.118727; Homo sapiens cDNA FLJ13692 fis; esoph, cerv; diag

413223; AI732182; Hs.191866; ESTs; lung; diag 

413268; AL039079; Hs.75256; regulator of G-protein signall; headnk; CTL+s.m.

H tion factor BMAL2; lung, blad, headn o , 

413313; NM_002047; Hs.293885; glycyl-tRNA synthetase; test; s.m.

413328; Y15723; Hs.75295; guanylate cyclase 1, soluble, ; pros; s.m. 

413335; AI613318; Hs.48442; ESTs; ovar; diag 

413364; BE536218; Hs.137516; fidgetin-like 1; lung; diag 

413372; H55532; Hs.349695; tubulin, alpha 2; test; diag 
413435; X51405; Hs.75360; carboxypeptidase E; pros, glio, panc, sarc; diag

413436; AF238083; Hs.68061; sphingosine kinase 1; sarc; s.m. 

413472; BE242870; Hs.75379; solute carrier family 1 (glial; glio; mAb 

413556; AW604451; Hs.381153; sprouty (Drosophila) homolog 4; sarc; CTL+s.m.
413573; AI733859; Hs.149089; ESTs; lung; diag

413582; AW295647; Hs.71331; hypothetical protein MGC5360; lung; diag

413597; AW302885; Hs.117183; ESTs; pros; diag
413623; AA825721; Hs.246973; intron of Bicaudal D homolog 1; ovar, pros; diag
413691; AB023173; Hs.75478; ATPase, Class VI, type 11B; lung; mAb

413711; AW291765; Hs.75486; heat shock transcription facto; renal; diag

413753; U17750: H= 517 minin.bet mh« i ?5kD lung, blad, headnk, pane, cerv, esoph, colore diag 

413762; AW411479; Hs.848; FK506-binding protein 4 (59kD); test; diag

413778; M090235; Hs.75535; myosin, light polypeptide 2, r; sarc; diag
413794; AF234532; Hs.61638; myosin X; mela; diag

413804; T64682; ; gb:yc48b02.r1 Stratagene liver; blad; diag

413808; J00287; Hs.350038; Homo sapiens mRNA for caldesmo; esoph; diag 

413813; M96956; Hs.75561; teratocarcinoma-derived growth; colon; diag

413833; Z15005; Hs.75573; centromere protein E (312kD); lung; CTL+s.m.
413842; M29383; Hs.856; interferon, gamma; leuk; diag

413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; fibro, renal; mAb

413880; AI660842; Hs.110915; interleukin 22 receptor; panc, colon; mAb+s.m.

413»24 AL1VVM l r l I dm 1 pros, breast, ovar, diag 

413943; AW294416; Hs.144687; Homo sapiens cDNA FLJ12981 fis; blad, lung; CTL+s.m.
413985; AI018666; Hs.75667; synaptophysin; glio, sarc; mAb

414004; AA737033; Hs.7155; ESTs, Moderately similar to 21; panc, mela; diag

414020; NM_002984; Hs.75703; CCL4 Chemokine (C-C motif) lig; pros; diag

414034; U89277; Hs.305985; early development regulator 1 ; test; CTL+s.m.
414035; Y00630; Hs.75716; serine (or cysteine) proteinas; lung, cerv, headnk, blad; s.m.
414053; BE391635; Hs.75725; transgelin 2; blad; diag

414061; NM_000699; Hs.300280; amylase, alpha 2A; pancreatic; ovar; diag 

414065; AW515373; Hs.271249; Homo sapiens cDNA FLJ13580 fis; pros; diag

414085; AA114016; Hs.75746; aldehyde dehydrogenase 1 famil; pros, panc, sarc; s.m.

414142; AW368397; Hs.334485; hemicentin (fibulin 6); fibro, panc, sarc; diag
414161; AA136106; Hs.184852; KIAA1553 protein; test; diag

414166; AW888941; Hs.75789; N-myc downstream regulated; pros, renal; diag 

414217; AI309298; Hs.279898; Homo sapiens cDNA: FLJ23165 fi; glio; diag

414219; W20010; Hs.75823; ALU -fused gene from chromosom; sarc; diag 

414221; AW450979; ; gb:UI-H-BI3-ala-a-12-0-UI.s1 N; blad; diag
414251; AL042306; Hs.97689; VASA protein; test; CTL+s.m.

414259; W44633; Hs.301296; integrin, beta-like 1 (with EG; panc; diag

414334; AA824298; Hs.21331; hypothetical protein FLJ10036; test; diag

414359; M62194; Hs.75929; cadherin 11, type 2, OB-cadher; breast, ovar, uter, pros, colon, panc, sarc; mAb

414368; W70171; Hs.75939; uridine monophosphate kinase; lung; s.m. 
414386; X00442; Hs.75990; haptoglobin; ovar; diag

414416; AW409985; Hs.76084; hypothetical protein MGC2721; blad, lung; CTL+s.m.

414420; AA043424; Hs.76095; immediate early response 3; panc, colon; diag

414430; AI346201; Hs.76118; ubiquitin carboxyl-terminal es; lung; s.m.

414443; AU077268; Hs.76144; platelet-derived growth factor; sarc, panc, headnk; mAb
414476; AA301857; Hs.76224; EGF-containing fibulin-like ex; angio; diag

414477; U41635; Hs.76228; amplified in osteosarcoma; sarc; diag

414509; AW161311; Hs.76294; CD63 antigen (melanoma 1 antig; mela; mAb

414521; D28124; Hs.76307; neuroblastoma, suppression of ; breast, ovar, uter, pros, blad, panc, colon, stom, fibro; diag

414565; AA502972; Hs.163390; hypotheti 3 I protein FU1359I p is, diag 
414569; AF109298; Hs.118258; prostate cancer associated pro; pros, EWS; diag

414575; H11257; Hs.375743; Homo sapiens clone IMAGE:45193; renal; diag 

414595; AA641726; Hs.289015; hypothetical protein MGC4171; blad; diag 

414602; AW630088; Hs.76550; Homo sapiens mRNA; cDNA DKFZp5; pros; mAb 

414683; S78296; Hs.76888; hypothetical protein MGC12702; blad, lung, test; diag 
55 414732; AWaIi'T' H 7/1 52 mimchromosornernaiulenaucedef;tesl,blad;diag 

414761; AU077228; Hs.77256; enhancer of zeste (Drosophila); lung, blad, test; CTL+s.m.

414776; AA155598; Hs.212839; hypothetical protein FLJ14195; angio; diag

414786; AI246482; Hs.243010; Homo sapiens cDNA FLJ14372 fis; angio; diag

414799; AI752416; Hs.77326; insulin-like growth factor bin; renal; diag 
414806; D14694; Hs.77329; phosphatidylserine synthase 1; lung; mAb

414807; AI738616; Hs.77348; hydroxyprostaglandin dehydroge; blad; s.m.

414809; AI434699; Hs.77356; transferrin receptor (p90, CD7; lung; mAb+s.m.

414825; X06370; Hs.77432; epidermal growth factor recept; glio, lung, renal, esoph, panc, headnk, leuk; mAb+s.m.

414915 N"_ >i 2 t 11; my i i iflueri c) resiitdn &>oph diaj 
414918; AI219207; Hs.72222; hypothetical protein FLJ13459; blad; CTL

414921; BE390551; Hs.77628; steroidogenic acute regulatory; breast; diag 

414945; BE076358; Hs.77667; lymphocyte antigen 6 complex, ; mela; mAb 

414987; AA524394; Hs.294022; hypothetical protein FLJ14950; blad, panc, esoph; diag

414998; NM_002543; Hs.77729; oxidised low density lipoprote; fibro, ovar, panc, colon; mAb
415003; M11437; Hs.77741; kininogen; panc; diag

415025; AW207091; Hs.72307; ESTs; blad; diag 

415091; AL044872; Hs.77910; 3-hydroxy-3-methylglutaryl-Coe; lung, headnk; s.m.

415178; D80503; Hs.46692; ESTs; blad; diag 

415214; AI445236; Hs.125124; EphB2; colon, stom; mAb 
75 415314 N i I' 5422 jly tein MSB mela mAb 

415457; AW081710; Hs.7369; Homo sapiens testes specific A; fibro, ovar, uter; CTL+s.m.

41 11; AI73261 182362; ESTs; blad, c ren dia 

415542; R13474; Hs.290253; ESTs, Weakly similar to I38022; blad; diag 

415 ,4 ffc I03 6X H 8687 n ul pliingoms in (N-SM , I Tl , m 
415752; BE314524; Hs.78776; putative transmembrane protein; endo, uter, breast, stom, blad, mela; mAb

4 ^ y H 324; hypothetical protein FL t pros, blad; d 

1' 7"7 01463 ,l 3! EST i diag 

415819; AU077330; Hs.360791; transcription elongation facto; test; CTL+s.m.
415829; AW450198; Hs.163742; ESTs; test; diag

415857; AA866115; Hs.127797; Homo sapiens cDNA FLJ11381 fis; lung, test; diag
415910; U20350; Hs.78913; chemokine (C-X3-C) receptor 1; glio; mAb
415947; U04045; Hs.78934; mutS (E. coli) homolog 2 (colo; test; diag
415989; AI267700; Hs.351201; ESTs; pros, ovar, blad, lung, headnk, panc, colon, sarc; diag
415992; C05837; Hs.145807; hypothetical protein FLJ13593; pros, fibro; mAb
415999; AA172179; Hs.294029; ESTs; pros, uter; diag
416018; AW138239; Hs.78977; proprotein convertase subtilis; colon, panc, lung; diag
416033 H152 1 H 2194 - ESTs bieast libra diag 
416065; BE267931; Hs.78996; proliferating cell nuclear ant; blad, lung, headnk, mela; CTL+s.m.
416111; AA033813; Hs.79018; chromatin assembly factor 1, s; lung, stom; CTL+s.m.
416188; BE157260; Hs.79070; v-myc avian myelocytomatosis v; pros; diag 
416201; AA467752; Hs.195161; ESTs; test; diag 
416208, AW29 H 11 15; EST \ I , imila tc MUC2_H; lung; diag 
416224; NM_002902; Hs.79088; reticulocalbin 2, EF-hand calc; ovar; diag

416225; AA577730; Hs.188684; ESTs, Weakly similar to PC4259; pros, blad; diag 
416350; AF188625; Hs.189507; phospholipase A2, group IID; test, mela, fibro; diag
416370; N90470; Hs.203697; CD38 antigen (p45); pros, glio; mAb+CTL
416373; AA195845; Hs.73680; ESTs, Weakly similar to S12658; sarc; diag
416402; NM_000715; Hs.1012; complement component 4-binding; fibro; diag

416448; L13210; Hs.79339; lectin, galactoside-binding, s; ovar, colon, stom; diag
416498; U33632; Hs.79351; potassium channel, subfamily K; panc, stom, breast, ovar, colon; mAb
416539; Y07909; Hs.79368; epithelial membrane protein 1; pros, headnk; diag
416602; NM_006159; Hs.367895; Protein kinase C-binding prote; breast; diag
416640; BE262478; Hs.13406; neuron-specific protein; mela; diag

416661; AA634543; Hs.79440; IGF-II mRNA-binding protein 3; blad, lung, headnk, cerv, panc, angio; diag
416773; AK000340; Hs.79828; hypothetical protein FLJ20333; test; CTL+s.m.
416815; U41514; Hs.80120; UDP-N-acetyl-alpha-D-galactosa; angio; s.m.
416819; U77735; Hs.80206; pim-2 oncogene; lung, test; diag
416881; N32520; Hs.141358; ESTs; mela; diag

416929; N20535; Hs.43265; melastatin 1; mela; diag 
416975; NM_004131; Hs.1051; granzyme B (granzyme 2, cytoto; mela; s.m.
417003; AL038170; Hs.80756; betaine-homocysteine methyltra; blad; s.m. 
417070; Z19077; Hs.172004; titin; sarc; diag
417105; X60992; Hs.81226; CD6 antigen; fibro; mAb

417115; AW952792; Hs.334612; small nuclear ribonucleoprotei; test; CTL+s.m.
417124; BE122762; Hs.25338; ESTs; angio; diag 
417148; AA359896; Hs.374554; hypothetical protein FLJ14902; panc; diag
417151; M194055; Hs.293858; ESTs; blad; diag
417153; X57010; Hs.81343; collagen, type II, alpha 1 (pr; pros, sarc; diag

417218; M005247; Hs.285754; met proto-oncogene (hepatocyte; lung; mAb+s.m.
417237; H86385; Hs.81737; palmitoyl-protein thioesterase; mela; s.m.
417259; AW903838; Hs.81800; chondroitin sulfate proteoglyc; panc, breast; diag
417275; X63578; Hs.295449; parvalbumin; blad; diag
417295; AW993524; Hs.43148; epithelial membrane protein 1; pros; diag

417308; H60720; Hs.81892; KIAA0101 gene product; lung, headnk, blad, cerv, angio, mela, sarc; diag 
417312; AW888411 h-V i I - x Jed phosphopro; blad; CTl+s.m. 
417333; AL157545; Hs.173179; bromodomain and PHD finger con; breast; diag 
417355; D13168; Hs.82002; endothelin receptor type B; glio, mela; mAb
50 417365, D50683, Hs E20? 3 trai cm „ nr. >*th factor, be;fibro, angio; mAb 

417366; BE185289; Hs.1076; small proline-rich protein 1B ; lung, blad, headnk, panc, esoph, mela; diag
417370; T28651; Hs.374466; tryptophanyl-tRNA synthetase; fibro, mela; diag
417391; AW291946; Hs.82065; interleukin 6 signal transduce; breast; mAb+s.m.
417400; AA663486; Hs.123072; RAB3B, member RAS oncogene fam; pros; diag 
417407; AA923278; Hs.290905; ESTs, Weakly similar to protea; test, pros; s.m.
417409; BE272506; Hs.82109; syndecan 1; blad; diag 
417412; X16896; Hs.82112; interleukin 1 receptor, type I; fibro, pros, panc; mAb
417426; NM_002291; Hs.82124; laminin, beta 1; angio; diag
417437; U52682; Hs.82132; interferon regulatory factor 4; mela; CTL+s.m.
417512; X76534; Hs.82226; glycoprotein (transmembrane) n; lung, mela, headnk, panc, breast; mAb
417515; L24203; Hs.82237; ataxia-telangiectasia group D-; lung, headnk, blad; diag
417542; J04129; Hs.82269; progestagen-associated endomet; lung, mela; diag
417592; AA204664; Hs.182437; ESTs, Weakly similar to I54383; test; diag
417599; AA204688; Hs.62954; ESTs; blad, esoph; diag 
417621; AV654694; Hs.82316; interferon-induced, hepatitis ; esoph; diag
417696; BE241624; Hs.82401; CD69 antigen (p60, early T-cel; pros; mAb 
417705; AW134952; Hs.175220; hypothetical protein FLJ14541; test; mAb
417715; AW969587; Hs.86366; ESTs; blad, lung, headnk; diag 
417720; M205625; Hs.208067; ESTs; blad, lung, esoph, headnk; diag 
417750; AI267720; Hs.260523; synovial sarcoma, translocated; sarc; diag
417777; AI823763; Hs.7055; ESTs, Weakly similar to I78885; test; s.m.
417791; AW965339; Hs.44269; ESTs; ovar, blad, lung, headnk; CTL+s.m.
417801; AA417383; Hs.82582; integrin, beta-like 1 (with EG; panc, fibro; diag
417805; U38545; Hs.82587; phospholipase D1, phophatidylc; angio; s.m.
417831; H16423; Hs.82685; CD47 antigen (Rh-related antig; ovar; mAb

417843; W07361; Hs.22545; Homo sapiens cDNA FLJ12935 fis; pros; diag
417847; AI521558; Hs.7331; hypothetical protein FLJ22316; ovar; diag
417849; AW291587; Hs.82733; nidogen 2; angio, headnk; diag
417874; BE616160; Hs.82829; protein tyrosine phosphatase, ; panc; mAb+s.m.
417880; BE241595; Hs.82848; selectin L (lymphocyte adhesio; mela; mAb
417BB5;AA214 r t ESI lesl, ovar diag 

417900; BE250127; Hs.82906; CDC20 (cell division cycle 20,; lung, stom, test, blad, headnk, cerv, esoph; CTL+s.m.
417911; AA333387; Hs 82916, chape oi r. containing TCP1 i lesl kj 






417944; AU077196; Hs.82985; collagen, type V, alpha 2; sarc; diag 
417975; AA641836; Hs.30085; hypothetical protein FLJ23186; colon, stom, lung; mAb
417976; BE565892; Hs.83077; interleukin 18 (interferon-gam; colon, stom, fbros; diag
418004; U37519; Hs.87539; aldehyde dehydrogenase 3 famil; lung, headnk, esoph; s.m.
418036; Z37976; Hs.83337; latent transforming growth fac; angio; diag
418064; NM_002318; Hs.83364; lysyl oxidase-like 2; lung, angio, test, sarc; diag
418067; AI127958; Hs.83393; cystatin E/M; headnk, panc, blad; diag
418068; AW971155; Hs.293902; ESTs, Weakly similar to ISHUSS; blad; s.m.

418113; AI272141; Hs.83484; SRY (sex determining region Y); blad, breast, uter, colon, lung, ovar, glio, test, sarc; CTL+s.m.

P1 3977 1 8661 ST 
418140; BE613836; Hs.83551; microfibrillar-associated prot; lung, headnk, esoph, ovar, sarc; diag
418156; W17056; Hs.83623; nuclear receptor subfamily 1, ; fibro; mAb+s.m. 
418203; X54942; Hs.83758; CDC28 protein kinase 2; lung, blad, test, mela, stom; s.m.
418216; AA662240; Hs.283099; AF15q14 protein; headnk, lung, blad; diag 
418245; AA088767; Hs.83883; transmembrane, prostate androg; panc; mAb+s.m.

418283; S79895; Hs.83942; cathepsin K (pycnodysostosis); breast, cerv, ovar, uter, pros, headnk, lung, panc, colon, stom; diag

418318; U47732; Hs.84072; transmembrane 4 superfamily me; panc, pros, colon, stom, omuc; mAb

418322; AA284166; Hs.84113; cyclin-dependent kinase inhibi; headnk, lung, blad; s.m.

418338; NM_002522; Hs.84154; neuronal pentraxin I; sarc; diag

418339; AA639902; Hs.104215; ESTs, Moderately similar to SP; pros; diag 

418345; AJ001696; Hs.241407; serine (or cysteine) proteinas; cerv, lung; s.m. 

418371; M13560; Hs.84298; CD74 antigen (invariant polype; renal; mAb 

418379; AA218940; Hs.137516; fidgetin-like 1; lung; diag

418394; AF132818; Hs.84728; Kruppel-like factor 5 (intesti; panc; CTL+s.m.

418396 AP6580 C I 1; SLC2A12 Solute cams nmily pros mAb 

418397; NM_001269; Hs.84746; chromosome condensation 1; lung; diag 

418399; AF131781; Hs.84753; hypothetical protein FLJ12442; test, blad, sarc; diag

418406; X73501; Hs.84905; cytokeratin 20; blad, colon; diag 

418432; M14156; Hs.85112; insulin-like growth factor 1 (; pros, fibro; diag

418460; M26315; Hs.85258; CD8 antigen, alpha polypeptide; fibro; mAb

418462; BE001596; Hs.85266; integrin, beta 4; lung, blad, cerv, headnk, ovar; mAb 

418543; NM_005329; Hs.85962; hyaluronan synthase 3; blad, lung; mAb 

418676; AW968159; Hs.302740; Epithelial calcium channel 2, ; pros; mAb+s.m.

1 I0;AW24 ! I I ti ht I protein MG< > 7 42 pr diag 

418641; BE243136; Hs.86947; a disintegrin and metalloprote; cerv, lung, panc, blad, headnk, stom; mAb
418655; AA226354; Hs.111240; ESTs; pros; diag

418661; NM_001949; Hs.1189; E2F transcription factor 3; ovar, lung, mela; CTL+s.m. 

418663; AK001100; Hs.41690; desmocollin 3; lung, blad, headnk, cerv, esoph; mAb

418683; U90908; Hs.87241; hypothetical protein from clon; angio; CTL+s.m.

418686; Z36830, Hs.8726£ nn )A8 blad lung diag 

418693; AI750878; Hs.87409; thrombospondin 1; angio, panc; diag

418696; AW959433; Hs.326290; hypothetical protein FLJ12581; test; diag

418739; AA310964; Hs.88012; SHP2 interacting transmembrane; mela; diag

418756; AA252254; Hs.226949; ESTs; test; diag

418825; AA228881; Hs.22394; hypothetical protein FLJ10893; angio; diag

418829; AA516531; Hs.55999; NK homeobox (Drosophila), fami; pros; diag

418882; NM_004996; Hs.89433; ATP-binding cassette, sub-fami; ovar, pros, breast, lung; diag

418883; BE387036; Hs.1211; acid phosphatase 5, tartrate r; fibro; s.m.

418888; AU076801; Hs.89436; cadherin 17, LI cadherin (live; colon, stom, ovar, uter, panc; mAb+s.m.

418918; X07871; Hs.89476; CD2 antigen (p50), sheep red b; mela, fibro; mAb 

418932; L34059; Hs.89484; cadherin 4, type 1, R-cadherin; glio; mAb

418941; AA452970; Hs.239527; E1B-55kDa-associated protein 5; angio, blad; diag 

418968; NM_000078; Hs.89538; cholesteryl ester transfer pro; mela; diag

418994; AA296520; Hs.89546; selectin E (endothelial adhesi; pros, angio; mAb

419038; AW134924; Hs.58290; ESTs; pros; diag 

419073; AW372170; Hs.183918; transmembrane receptor Unc5H2 ; ovar, renal, blad, lung; mAb 

419078; M93119; Hs.89584; insulinoma-associated 1; blad, lung, pane, sarc; diag 

419079; AW014836; Hs.18844; ESTs; esoph, lung, stom, colon; diag 

419086; NM_000216; Hs.89591; Kallmann syndrome 1 sequence; fibro; diag 

419092; J05581; Hs.89603; mucin 1, transmembrane; breast, panc, lung, blad, fibro; mAb

419222; AD001528; Hs.89718; spermine synthase; pros; s.m.

419223 X60111 HI ) i i snni* 

419231; AL046294; Hs. 136245 ESI v lysimil T1 7 227; fibro; diag 

419261; X07876; Hs.89791; wingless-type MMTV integration; panc; diag

419264; AA377104; Hs.293672; ESTs, Weakly similar to ALUB_H; pros; diag

419290; AI128114; Hs.112885; spinal cord-derived growth fac; panc; diag

419356; AI656166; Hs.7331; hypothetical protein FLJ22316; uter, ovar; diag

419359; AL043202; Hs.90073; chromosome segregation 1 (yeas; lung, blad, test; diag

419440; AB020689; Hs.90419; KIAA0882 protein; breast; diag 

419485; AA489023; Hs.99807; ESTs, Weakly similar to unname; mela; diag

419490; NM_006144; Hs.90708; granzyme A (granzyme 1, cytoto; fibro; s.m.

419519; AI198719; Hs.176376; ESTs; mela; diag 

419551; AW582256; Hs.91011; anterior gradient 2 (Xenopus l; panc, pros, breast; diag
419559; Y07828; Hs.91096; ring finger protein; blad, colon, stom; CTL+s.m.
419568; AB026116; Hs.283078; hOAT4; renal; mAb
419559; AI9716S1; Hs.91143; jagged 1 (Alagille syndrome); headnk, lung; diag
419628; H67546; Hs.49768; ESTs; mela, sarc; diag 

419657; AU077005; Hs.92208; a disintegrin and metalloprote; breast, cerv, angio; mAb

419593; AA133749; Hs.301350; FXYD domain-containing ion tra; pros, breast, ovar, panc, lung; mAb

419721; NM_001650; Hs.315369; aquaporin 4; glio, lung, fibro; mAb 

419743; AW408762; Hs.5957; Homo sapiens clone 24415 mRNA ; blad, headnk; diag

419749; X73608; Hs.93029; sparc/osteonectin, cwcv and ka; pros, panc, lung; diag

419752; AA249573; Hs.152618; ESTs, Moderately similar to ZN; lung; diag
419839; U24577; Hs.93304; phospholipase A2, group VII (p; pros, lung; diag

419870; AW403911; Hs.266175; phosphoprotein associated with; mela; diag 

419875;AA853410,h ) 57; j ni nalin, sarc, diag 

419948; AB041035; Hs.93847; NM_016931:Homo sapiens NADPH o; angio; mAb

419956, AL137939, h 10096; cadtierin 9 t/pe 2, mela; mAb 

419968; X04430; Hs.93913; interleukin 6 (interferon, bet; lung, panc, esoph; diag

419981; AA897581; Hs.128773; ESTs; angio; diag 

420005; AW271106; Hs.133294; ESTs; lung, test, blad, colon; diag

420062; AW411096; Hs.94785; TGF(beta)-induced transcriptio; test; CTL+s.m.

420067; T52431; Hs.94795; Homo sapiens mRNA; cDNA DKFZp5; sarc; diag

420137; AA306478; Hs.95327; CD3D antigen, delta polypeptid; fibro; mAb

420154; AI093155; Hs.95420; G anagen family C 1 protein); Dros, leio CTL+s.m 

420174; AI824144; Hs.199749; ESTs; angio; CTL+s.m.

420208; BE276055; Hs.95972; silver (mouse homolog) like; mela, sarc; mAb 

420209; AA256444 Hs 126485 h> r thel I pro. L 12604 angio diag 

420218; AW958037; Hs.381105; ribosomal protein L4; mela, pros; diag

420255; NM_007289; Hs.1298; membrane metallo-endopeptidase; pros; mAb

420256; U84722; Hs.76206; cadherin 5, type 2, VE-cadheri; angio, fibro; mAb

420267; N37030; Hs.173337; ESTs; mela, sarc; diag 

420281; AI623693; Hs.323494; Predicted cation efflux pump; lung, blad, ovar, panc; mAb
420301; AA767526; Hs.22030; paired box gene 5 (B-cell line; mela; diag
420338; AA825595; Hs.88269; Homo sapiens, clone MGC:17339,; mela; mAb
420340; NM_000734; Hs.97087; CD3Z antigen, zeta polypeptide; fibro; mAb

420344; BE463721; Hs.97101; putative G protein-coupled rec; colon, pros, blad, headnk, panc, stom, ovar; m

420347; AL033539; Hs.97124; Human DNA sequence from clone ; test; diag

420360, U83171, H- 9 2 1 mall inducit cy r, il Meuk; diag 

420367; AA259090; Hs.257028; ESTs; test; diag 

420376; AL137471; Hs.97266; protocadherin 18; sarc; mAb+s.m.

420378; NM.014143 Hs.9725 7-H1 p otem leuk, mAb 

420380, AA640891 H 1 406; EST ung diag 

420424; AB033036; Hs.97594; KIAA1210 protein; pros; diag

420462; AF050147; Hs.97932; chondromodulin I precursor; lung, EWS, sarc; mAb 

420474; L09753; Hs.1313; tumor necrosis factor (ligand); leuk; mAb 

420544; M677577; Hs.380213; Homo sapiens Chromosome 16 BAC; pros; diag 

420576; AA297634; Hs.54925; KIAA1858 protein; sarc; diag

420596; NM_002692; Hs.99185; polymerase (DNA directed), eps; test; CTL+s.m.

420633; NM_014581; Hs.274480; odorant-binding protein 2B (OB; breast, endo; diag

420656; AA279098; Hs.187636; ESTs; fibro; diag 

420710; NM_007009; Hs.99875; zona pellucida binding protein; test; diag

420729; AW964897; Hs.290825; ESTs; pros; diag 

420757; X78592; Hs.99915; androgen receptor (dihydrotest; pros; mAb+s.m.

420759; T11832; Hs.127797; Homo sapiens cDNA FLJ11381 fis; test; diag

420783; AI659838; Hs.99923; lectin, galactoside-binding, s; lung, blad, headnk; diag

420789; AI670057; Hs.199882; ESTs; renal; diag

420859; AW468397; Hs.100000; S100 calcium-binding protein A; sarc; diag 

420908; AL049974; Hs.100261; Homo sapiens mRNA; cDNA DKFZp5; panc; diag

420923; AF097021; Hs.273321; differentially expressed in he; blad, colon; diag 

420931; AF044197; Hs.100431; small inducible cytokine B sub; breast, lung, mela; diag

420981; L40904; Hs.100724; peroxisome proliferative activ; colon; mAb+s.m. 

421016; AA504583; Hs.10104 ; transcription factor 3 (E2A im; test; CTL+s.m.

421044; AF061871; Hs.101302; Human DNA sequence from clone ; panc; diag

421059; AI654133; Hs.356247; thyroid receptor interacting p; pros; mAb+s.m.

421064; AI245432; Hs.101382; tumor necrosis factor, alpha-i; blad, uter; diag

421070; AA283185; Hs.19327; ESTs; blad; diag

421100; AW351839; Hs.124660; Homo sapiens cDNA: FLJ21763 S; blad; diag

421133; AA814971; Hs.26410; ESTs; lung; diag 

421154; AA284333; Hs.287631; Homo sapiens cDNA FLJ14269 fis; BPH; diag

421155; H87879; Hs.102267; lysyl oxidase; headnk, panc, renal, sarc; diag

421218; NM_000499; Hs.72912; cytochrome P450, subfamily I (; blad, angic d .

421233; AA209534; Hs.284243; tetraspan NET-6 protein; pros, breast, ovar; mAb

421241; X91817; Hs.102866; transketolase-like 1; test; s.m.

421302; T34462; Hs.103291; neuritin; uter, endo; diag

421305; BE397354 h 124 0; ( 1 r - b<m resistance pro; ovar; diag 

421307; BE539976; Hs.103305; Homo sapiens mRNA; cDNA DKFZp4; breast, lung, angio, test, sarc; diag

Pi J n J2 1 H 2 %11 deleted in malignant brain turn; pane, lung; diag 

421350; AW301608; Hs.278188; ESTs, Moderately similar to 15; test; diag

421373; AA808229; Hs.222088; ESTs; blad; diag 

421433; AI829192; Hs.22380; ESTs; pros; diag

421451;AA291377, H.-'B.--' ESTs ovar, blad, luna;diaa 

421458; NM_003654; Hs.104576; carbohydrate (keratan sulfate ; sarc; s.m.

421478; AI683243; Hs.97258; ESTs, Moderately sir,, tu S2 ov i blad r^nal uter diag 

421481; AW391972; Hs.104696; KIAA1324 protein; pros; diag 

421502; AF111856; Hs.105039; solute carrier family 3" o< var.fib mAt

421506; BE302796; Hs.105097; thymidine kinase 1, soluble; lung, headnk, esoph; s.m.

ISC J 10 11 blad 

421535; AB002359; Hs.105478; phosphoribosylformylglycinamid; test; s.m.
421537; BE383488; Hs.105547; neural proliferation, differen; pros; diag
421566; NM_000399; Hs.1395; early growth response 2 (Krox-; pros; CTL+s.m.
421579; NM_002975; Hs.105927; stem cell growth factor; lymph; sarc; mAb 

1633 AF12 860 H 1062 riing n ir r, I jaq 
421650; AA781795; Hs.343800; ESTs; mela; diag
421666 AL01525U Hs.1406;ei i 3 mela diag 
421727; Y13153; Hs.107318; kynurenine 3-monooxygenase (ky; breast; s.m.
421773; W69233; Hs.112457; ESTs; mela, esoph, sarc; diag

421777; BE562088; Hs.108196; HSPC037 protein; esoph, cerv, lung; diag 

421779; AI879159; Hs.108219; wingless-type MMTV integration; colon, ovar; diag

421798; N74880; Hs.355462; N-acylsphingosine amidohydrola; fibro; s.m.
421814; L12 50; Hs 1l>-_:; thrombospondin 2; panc; diag

421831; AA298836; Hs.22026; ESTs; similar to TRANSMEMBRAN; angio; mAb+s.m.

421887; AW161450; Hs.109201; CGI-86 protein; pros; mAb

421896; N82293; Hs.45107; ESTs; pros; diag

421917; AB028943; Hs.109445; KIAA1020 protein; test; diag 
421920; BE551245; Hs.1433; gamma-aminobutyric acid (GABA); sarc; mAb

421924; BE514514; Hs.109606; coronin, actin-binding protein; fibro; diag

421948; L42583; Hs.334309; keratin 6A; lung, headnk, blad, esoph, cerv, mela; diag

421952; AA300900; Hs.98849; dynein light chain 2B (DNLC2B); fibro; diag
421991; NM_014918; Hs.110488; KIAA0990 protein; panc; diag
421996; AW583807; Hs.1460; glucagon; panc; diag

422002; X70070; Hs.110642; neurotensin receptor 1 (high a; colon; mAb

422027; AL043100; Hs.288828; fatty acid amide hydrolase; pros; s.m.

422033; AW245805; Hs.110903; claudin 5 (transmembrane prote; gtio; mAb

422087; X58968; Hs.111301; matrix metalloproteinase 2 (ge; s,rr; diag
422089; AA523172; Hs.103135; ESTs, Weakly similar to SFR4_H; pros; diag

422094; AF129535; Hs.272027; F-box only protein 5; blad, lung; CTL+s.m.

422095; AI868872; Hs.282804; hypothetical protein FLJ22704; lung, pane, ovar, breast; CTL+s.m. 

422099; AA156022; Hs.111518; hypothetical protein; angio; CTL+s.m.

422100; AI096988; Hs.111554; ADP-ribosylation factor-like 7; lung, esoph; CTL+s.m.

422110; AI376736; Hs.121555; secreted protein, acidic, cyst; panc; diag

422119; AI277829; Hs.111862; KIAA0590 gene product; blad; diag
422134; AW179019; Hs.112110; mitochondrial ribosomal protei; lung; diag
422152; AA909249; Hs.112282; solute carrier family 30 (zinc; blad; mAb+s.m.
422163; AF027208; Hs.112360; prominin (mouse)-like 1; colon, breast, fibro; mAb

422164; NM_014312; Hs.112377; cortical thymocyte receptor (; blad; mAb+s.m.

422168; AA586894; Hs.112408; S100 calcium-binding protein A; lung, blad, headnk, breast, mela, esoph, sarc, cerv; CTL+s.m.

422170; AI791949; Hs.112432; anti-Mullerian hormone; uter, blad; diag

422173; BE385828; Hs.250619; phorbolin-like protein MDS019 ; mela; diag

422247; U18244; Hs.113602; solute carrier family 1 (high ; blad; mAb
422278; AF072873; Hs.114218; frizzled (Drosophila) homolog ; ovar, headnk, blad, cerv, lung, panc, stom; mAb

422309; U79745; Hs.114924; solute carrier family 16 (mono; mela; mAb+s.m.

422311; AF073515; Hs.114948; cytokine receptor-like factor ; lung, fibro; diag

422355; AW403724; Hs.300697; coagulation factor VII (serum ; fibro; diag 

422363; T55979; Hs.115474; replication factor C (activato; mela, colon; diag
422398; AI476149; Hs.334489; hypothetical protein FLJ21992; fibro; CTL+s.m.

422406; AF025441; Hs.116206; Opa-interacting protein 5; blad, lung; diag

422420; U03398; Hs.1524; tumor necrosis factor (ligand); colon, panc, stom, leuk; mAb

422423; AF283777; Hs.116481; CD72 antigen; spleen, leuk; mAb
422440; NM_004812; Hs.116724; aldo-keto reductase family 1, ; lung, headnk; s.m.
422487; AJ010901; Hs.190267; mucin 4, tracheobronchial; lung, headnk, panc, sarc; diag

422532; AL008726; Hs.118126; protective protein for beta-ga; renal, mela; s.m.

422565; BE259035; Hs.118400; singed (Drosophila)-like (sea ; panc, test, mela; diag

422573; AW297935; Hs.295726; integrin, alpha V (vitronecti; panc; mAb+s.m.

422596; AF063611; Hs.118633; 2'-5'-oligoadenylate synthetas; esoph, mela; s.m.
422603; BE242587; Hs.118651; hematopoietically expressed ho; angio; CTL+s.m.

422633; X56832; Hs.118804; enolase 3, (beta, muscle); sarc; s.m.

422658; AF231981; Hs.250175; homolog of yeast long chain po; pros; diag

422689; AW856665; Hs.299797; gb:RC3-CT0297-290100-013-d03C; test; diag
422726; U11690; Hs.1572; faciogenital dysplasia (Aarsko; test; diag
422728; AW937825; Hs 10326?; MAD (mothers gair ( la; pros; diag

422789; AK001113; Hs.120842; hypothetical protein FLJ10251; test; CTL+s.m.

422835; BE218705; Hs.121378; metallothionein-like 5, testis; breast; diag 

422871; AL031228; Hs.121509; collagen, type XI, alpha 2; sarc; diag
422887; AI751848; Hs.49215; ESTs; sarc; CTL+s.m.
422938; NM_001809; Hs.1594; centromere protein A (17kD); lung, test; CTL+s.m.

422963; M79141; Hs.13234; ESTs; lung, panc; diag

422997; BE018212; Hs.122908; DNA replication factor; test; CTL+s.m.

423017; AW178761 K 1. , 18; serin, (oi o U ir - cte.nas blad, headnk, mela; mAb-diag 

423052; M28214; Hs.123072; RAB3B, member RAS oncogene fam; pros; diag 
423189; M59371; Hs.171596; EphA2; colon, ovar; mAb

423196; AK001866; Hs.125139; hypothetical protein FLJ11004; fibro; CTL+s.m.

423198; M81933; Hs.1634; cell division cycle 25A; test; CTL+s.m. 

423201; NM_000163; Hs.125180; growth hormone receptor; pros; mAb 
423217; NM_000094; Hs.1640; collagen, type VII, alpha 1 (e; lung, esoph; diag

423271; W47225; I 126256 , ul 1 1 tela, blad, stom, esoph; diag

423309; BE006775; Hs.126782; sushi-repeat protein; lung, colon; diag 

423354; AB011130; H 127 6; calcium channel, voltage-depen; test, fibro; mAb

423387; AJ012074; Hs.348500; vasoactive intestinal peptide ; pros; mAb

423397; NM_001838; Hs.1652; chemokine (C-C motif) receptor; blad, mela; mAb 
423412; AF109300; Hs.351615; prostate cancer associated pro; pros; diag

423422; AC005175; Hs.128425; NY-REN-24 antigen; glio; mAb+CTL 

423445; NM_014324; Hs.128749; alpha-methylacyl-CoA racemase; pros; s.m. 

423453; AW450737; Hs.128791; CGI-09 protein; lung; CTL+s.m.
423458; AI204212; Hs.351113; ESTs; test; CTL+s.m.

423511; AF036329; Hs.129715; gonadotropin-releasing hormone; lung; diag

423515; AA327017; Hs.176594; ESTs; ovar; diag 

423541; AA296922; Hs.129778; serine protease inhibitor, Kaz; colon, panc; diag

423575; C18863; Hs.163443; intron of periostin (OSF-2os); headnk, breast, panc, lung, fibro, esoph; diag
423605; AF047826; Hs.129887; cadherin 19, type 2; mela; mAb

423647 AW452ob0 n 15714 hypotheti ! ol n" f13204; lung; diag 

423662; AK001035; Hs.130881; B-cell CLL/lymphoma 11A (zinc ; lung; diag

423685; BE350494; Hs.49753; uveal autoantigen with coiled ; panc, uter, colon; CTL+s.m.

4 r 108; Hs. 132127; hypothetical | I lunq, headnk, blad; d 

423739; AA398155; Hs.97600; ESTs; breast, ovar, panc; diag

423761; NM_006194; Hs.132576; paired box gene 9; headnk; CTL+s.m. 

423765; R23858; Hs.143375; Homo sapiens, clone IMAGE3840; test; diag 

423778; Y09267; Hs.132821; flavin containing monooxygenas; fibro; s.m.

423779; AW071837; Hs.57971; TRANSCRIPTION FACTOR HES-5; glio; diag 

423787; AJ295745; Hs.736204; nuclear pore complex protein; test, esoph; diag

423798; AF047033; Hs.132904; solute carrier family 4, sodiu; angio; mAb

423799; AW026300; Hs.132906; 19A24 protein; mela; mAb 

423849; AL157425; Hs.133315; Homo sapiens mRNA; cDNA DKFZp7; lung; diag

423887; AL0802C7; Hs.134585; DKFZP434G232 protein; headnk, lung; diag

423899; NM_001427; Hs.134989; engrailed homolog 2; mela; CTL+s.m.

423905; AW579960; Hs.135150; lung type-I cell membrane-asso; test; mAb

423909; AJ223183; Hs.135194; immunoglobulin superfamily, me; test; diag

423934; U89995; Hs.159234; forkhead box E1 (thyroid trans; lung; CTL+s.m.

424001; W67883; Hs.137476; paternally expressed 10; breast, ovar; diag

424012; AW368377; Hs.137569; tumor protein 63 kDa with stro; lung, blad, headnk, esoph; diag

424036; AA770688; Hs.348495; H2A histone family, member L; panc, ovar; CTL+s.m.

424054; AA334511; Hs.26638; membrane-spanning 4-domains, s; pros, fibro; mAb 

424078; AB006625; Hs.139033; paternally expressed 3; ovar, uter; CTL+s.m.

424081; NM_006413; Hs.139120; ribonuclease P (30kD); test; s.m.

424098; AF077374; Hs.139322; small proline-rich protein 3; lung, blad, headnk, esoph, cerv; diag

424125; M31669; Hs.1735; inhibin, beta B (activin AB be; ovar, pros; diag 

424144; AA454033; Hs.41644; AKAP-associated sperm protein; fibro; diag 

424153; AA451737; Hs.141496; MAGE-like 2; mela; CTL+s.m.

424165; AW582904; Hs.142255; islet amyloid polypeptide; panc; mAb

424212; NM_005814; Hs.143131; glycoprotein A33 (transmembran; colon, stom, ovar; mAb

424218; AF031824; Hs.143212; cystatin F (leukocystatin); mela, fibro; diag

424244; AV647184; Hs.143601; hypothetical protein hCLA-iso; blad; diag

424252; AK000520; Hs.143811; hypothetical protein FLJ20513; colon, stom; diag

424264; D80400; Hs.239388; Human DNA sequence from clone ; blad; mAb 

124 IE f , r / r n H 1 143 ninichromosom mointcr ni dot; bl 1 Inn cs dia 

424310; AA338648; Hs.50334; testes development-related NYD; fibro; diag 

424337; AA338919; Hs.101615; ESTs; pros; diag

424339; BE257148; Hs.145416; endoglycan; pros, lung; diag

424343; AW956360; Hs.4748; adenylate cyclase activating p; glio, ovar, uter; mAb 

424364; AW383226; Hs.163834; ESTs, Weakly similar to G01763; lung, blad, headnk, cerv; diag

424399; AI905687; Hs.348419; AI905687:IL-BT035 190199-019 B; breast, uter, headnk; diag

424420; BE614743; Hs.146688; prostaglandin E synthase; lung, blad; s.m. 

424440; AA340743; Hs.133208; ESTs; sarc; diag 

424441; X14850; Hs.147097; H2A histone family, member X; lung; diag 

A I 14 n tc r 

424522; AL134847; Hs.149957; ribosomal protein S6 kinase, 9; breast, cerv, ovar, uter, blad, colon, stom; s.m.
1,' \W138558;Hc.33^873;ESTs,\ ally milar to 1 54374; fbr diag 
4 4 \K001973; Hs.150890; hypothetical protein; lest, CTLh m 
424581; M62052; Hs.150917; catenin (cadherin-associated p; glio, ovar, uter; mAb+s.m.
124586; NMJM3401 npl ri c 1L 

424629; M90656; Hs.151393; glutamate-cysteine ligase, cat; lung; CTL+s.m. 
424635; AA420687; Hs.115455; Homo sapiens cDNA FLJ14259 fis; glio; diag
424676; Y08565; Hs.151678; UDP-N-acetyl-alpha-D-galactosa; breast; s.m.
424704; AI263293; Hs.152096; cytochrome P450, subfamily IIJ; renal; s.m.
424711; NM^OC?/^; Hs.152175; calcitonin recepto; angio; mAb
424717; H03754; Hs.152213; wingless-type MMTV integ ation; bl lung, he >d, i diag 
424803 AU35583 H 153203 MyoD family i hit I . I st pros; diag 
424806; AA382523; Hs.105589; MSTP031 protein; angio; mAb

424834; AK001432; Hs.153408; Homo sapiens cDNA FLJ10570 fis; lung, blad, ovar, headnk, esoph, cerv, uter,

424846; AU077324; Hs.1832; neuropeptide Y; pros; diag

424897; D63216; Hs.153684; frizzled-related protein; panc, EWS, stom, renal; diag

424902; NM_003866; Hs.153687; inositol polyphosphate-4-phosp; panc, leuk, mela; CTL+s.m.

424954; NM_000546; Hs.1846; tumor protein p53 (Li-Fraumeni; mela, colon; CTL+s.m.

424971; AA479005; Hs.154036; tumor suppressing subtransfera; panc, mela; CTL+s.m.

424998; U58515; Hs.154138; chitinase 3-like 2; glio; diag

425023; AW956889; Hs.154210; EDG-1 (endothelial differentia; angio; mAb

425048; H05488; Hs.164502; ESTs; lung, blad; diag 

425057; AA826434; Hs.1619; achaete-scute complex (Drosoph; glio, lung; CTL+s.m. 

425088; AA663372; Hs.169395; hypothetical protein FLJ12015; glio, mela; diag

425154; NM_001851; Hs.154850; collagen, type IX, alpha 1; sarc; diag

425159; NM_004341; Hs.154868; carbamoyl-phosphate synthetase; lung, test; s.m. 

425200; BE255203.comp; Hs.155101; ATP synthase, H transporting, ; panc; s.m.

425206; NM_002153; Hs.155109; hydroxysteroid (17-beta) dehyd; blad; mAb 

425211; M18667; Hs.1867; progastricsin (pepsinogen C); I ut >, -soph, pros; diag

425234; AW152225; Hs.165909; ESTs, Weakly similar to I38022; lung, angio, blad, mela; diag 

425235; AA353113; Hs.105468; Homo sapiens cDNA: FLJ22743 fi; angio; diag

425237; U07695; Hs.155227; EphB4; test; mAb 

425245; AI751768; Hs.155314; KIAA0095 gene product; lung; diag 

425259; AL049280; Hs.145010; Homo sapiens mRNA; cDNA DKFZp5; pros; diag 

425262; D87119; Hs.155418; GS3955 protein; mela, renal; CTL+s.m.

425266; J00077; Hs.155421; alpha-fetoprotein; lung; diag

425274; BE281191; Hs.155462; minichromosome maintenance def; test; diag 
425289; AW139342; Hs.155530; interferon, gamma-inducible pr; mela; CTL+s.m.
425308; M97639; Hs.155585; receptor tyrosine kinase-like ; pros, sarc; mAb
425367; BE271188; Hs.155975; protein tyrosine phosphatase, ; fibro; diag
425371; D49441; Hs.155981; mesothelin; ovar, lung, fibro; mAb 

425397; J04088; Hs.156345; topoisomerase (DNA) II alpha (; lung, blad, panc, angio, test, mela, esoph; CTL+s.m.

425427; AI652662; Hs.317432; branched chain aminotransferas; test; s.m.

425428; AL110261; Hs.157211; DKFZP586B0621 protein; panc; diag

425465; L189B4; Hs.1904; protein kinase C, iota; ovar, pros, colon; s.m. 

425525; AA358883; Hs.23871; ESTs; sarc; diag

425545; N98529; Hs.158295; Homo sapiens, clone MGC:12401,; sarc; diag 

425555, AA359291, Hs 7;Homosapior b'l , FI .3553 fi fibro, CTL+s.m. 

425572; AB011076; Hs.158307; undifferentiated embryonic cel; test; CTL+s.m.

425601; AW629485; Hs.140720; GSK-3 binding protein FRAT2; test; CTL+s.m. 

425606 U52112, Ha 15P '1 r iir tndrq r'Otcm mela diag 

425628; NM_004476; Hs.1915; folate hydrolase (prostate-spe; pros; s.m. 

425638; NM_012337; Hs.158450; nasopharyngeal epithelium spec; fibro; CTL+s.m.

425679; X05997; Hs.160177; lipase, gastric; esoph; s.m.

425692; D90041; Hs.155956; N-acetyltransferase 1 (arylami; breast; s.m.

425695; NM_005401; Hs.159238; protein tyrosine phosphatase, ; lung; mAb+s.m.

425709; AA383076; Hs.159274; outer dense fibre of sperm tai; test; diag 

425710; AF030880; Hs.159275; solute carrier family, member ; pros; mAb 

425722; AI659076; Hs.97031; hypothetical protein MGC13047; mela; diag 

425726; AF085808; Hs.159330; uroplakin 3; pros, blad; diag

425769; U72513; Hs.159486; Human RPL13-2 pseudogene mRNA,; test; diag

425811; AL039104; Hs.159557; karyopherin alpha 2 (RAG cohor; test; diag

425849; AJ000512; Hs.296323; serum/glucocorticoid regulated; lung, headnk; s.m. 

425921; NM_007231; Hs.162211; solute carrier family 6 (neuro; stom, panc; mAb

425976; C75094; Hs.334514; NG22 protein; pros, ovar; mAb 

426027; NM_002608; Hs.1976; platelet-derived growth factor; sarc; diag 

426050; AF017307; Hs.166096; E74-like factor 3 (ets domain ; ovar, blad, stom; CTL+s.m.

426059; BE292842; Hs.166120; interferon regulatory factor 7; esoph, cerv; CTL+s.m.

126057 / 664 'I 3.97053 SI lung J 

426088; AF038007; Hs.166196; ATPase, Class I, type 8B, memb; blad, lung; mAb 
426094; AF034611; Hs.166206; cubilin (intrinsic factor-coba; renal; diag
426116; AA868729; Hs.144694; ESTs; fibro; diag

426125; X87241; Hs.166994; FAT tumor suppressor (Drosophi; colon, stom, panc, pros, renal, fibro, cerv; mAb
426156; BE244537; Hs.167382; natriuretic peptide receptor A; ovar; mAb

426158; NM_001982; Hs.199067; v-erb-b2 avian erythroblastic ; breast, ovar, uter, pros, blad, panc, colon, fibro, mela; diag
426172; AA371307; Hs.125056; ESTs; pros; diag 

426174; AA547959; Hs.115838; Homo sapiens similar to Echino; breast, pros, fibro; diag 

426212; 371 24; Hs.167988; neural cell adhesion molecule ; glio; mAb

426271; AF026547; Hs.169047; chondroitin sulfate proteoglyc; glio; diag

426274; D38122; Hs.2007; tumor necrosis factor (ligand); fibro, mela; mAb 

426300; U15979; Hs.169228; delta-like homolog (Drosophila; ovar, sarc; mAb 

426310; NM_000909; Hs.169266; neuropeptide Y receptor Y1; breast; mAb 

426312; AF026939; Hs.181874; interferon-induced protein wit; esoph, mela; diag 

426320; W47595; Hs.169300; transforming growth factor, be; ovar, pros, blad, panc; diag

426350; NM_003245; Hs.2022; transglutaminase 3 (E polypept; cerv; s.m.

426363; M58524; Hs.2025; transforming growth factor, be; pros; diag 

(26370 R9828E Is 170 tilin 1 ' diag 

426416; AW612744; Hs.169824; killer cell lectin-like recept; fibro; mAb

426440; BE382756; Hs.169902; solute carrier family 2 (facil; lung, panc, ovar, blad, headnk, esoph; mAb
426462; U59111; Hs.169993; dermatan sulphate proteoglycan; sarc; diag
426470; AA528794; Hs.128644; ESTs; mela; diag 

426471; M22440; Hs.170009; transforming growth factor, al; headnk, renal, panc; diag

426490; NM_001621; Hs.170087; aryl hydrocarbon receptor; panc; mAb+s.m.

426501; AW043782; Hs.293616; ESTs; pros, breast, glio, lung, mela; mAb

426502; Y07759; Hs.170157; myosin VA (heavy polypeptide 1; mela; diag 

426534; U58096; Hs.2051; testis specific protein, Y-lin; test; CTL+s.m.

426535 AU077C12 H 70279; ESTs, \\ mil i bqui angio; diag 

426555; NM_000372; Hs.2053; tyrosinase (oculocutaneous alb; mela, sarc; mAb 

426559; AB001914; Hs.170414; paired basic amino acid cleavi; hepC, breast, ovar, renal; diag 

426575; M74826; Hs.170808; glutamate decarboxylase 2 (pan; panc; s.m.

426627; AF012359; Hs.195685; ESTs; test; diag

426635; BE395109; Hs.129327; hypothetical protein MGC13057; ovar; CTL+s.m.

426682; AV660038; Hs.2056; UDP glycosyltransferase 1 fami; blad, lung; s.m. 

426691; NM_006201; Hs.171834; PCTAIRE protein kinase 1; ovar; CTL+s.m.

426696; AW363332; Hs.171844; Homo sapiens cDNA: FLJ22205 fi; angio; mAb

426721; AA383588; Hs.288545; ESTs, Weakly similar to T29012; fibro; diag

426726; AA488915; Hs.171955; trophinin associated protein (; test; diag

426747; AA535210; Hs.171995; kallikrein 3, (prostate specif; pros; diag

426752; X69490; Hs.172004; titin; sarc; diag

426759; AI590401; Hs.21213; ESTs; mela; diag

426793; X89887; Hs.172350; HIR (histone cell cycle regula; pros; CTL+s.m.

426828; NM_000020; Hs.172670; activin A receptor type II-lik; angio; mAb

426866; U02330; Hs.172816; neuregulin 1; esoph; CTL+s.m.

426897; AW976570; Hs.97387; ESTs; lung; diag

1 390( "A\ 1 ' U 75 E T blad p mAb 

426935; NM_000088; Hs.172928; collagen, type I, alpha 1; test, sarc; CTL+s.m. 

426966; AI493131; Hs.349204; sclerostin; lung; diag

426968; U07616; Hs.173034; amphiphysin (Stiff-Man syndro; blad; mAb+CTL

426991; AK001536; Hs.214410; Homo sapiens cDNA FLJ10674 fis; ovar, sarc; diag

427080; AW068287; Hs.301175; ras-related C3 botulinum toxin; mela; diag 
427099; AB032953; Hs.173560; odd Oz/ten-m homolog 2 (Drosop; headnk, esoph; diag

427122; AW057736; Hs.323910; HER2 receptor l>r ne in mAb 

427244; AA402400; Hs.178045; ESTs; esoph; diag

427260; AA663848; ; gb:ae70b06.s1 Stratagene schiz; lung; diag

427274; NM_005211; Hs.174142; colony stimulating factor 1 re; pros, sarc; mAb

427298; AA400495; ; ESTs; test; diag

427318; AF186081; Hs.175783; zinc transporter; pros; mAb

427333; AF067797; Hs.176658; aquaporin 8; panc, colon; mAb

4 J «; NM_000869; Hs.2142; 5-hydroxytryptamine (serotonin; ovar; mAb

427355; AW023482; Hs.97849; ESTs; ovar, breast, pros, blad, lung; diag

427398; AW390020; Hs.20415; chromosome 21 open reading fra; pros; diag

427427; AF077345; Hs.177936; lectin, superfamily member 1 (; breast; diag

427441; AA412605; Hs.293266; SPANX family, member C; lung, esoph; CTL+s.m.

427461; AA531527; Hs.332040; hypothetical protein MGC13010; pros; mAb

427474; U13192; Hs.2159; aggrecan 1 (chondroitin sulfat; sarc; diag

427486; AA974433; Hs.362432; fibroblast growth factor 4 (he; test; diag 

427510; Z47542; Hs.179312; small nuclear RNA activating c; lung; CTL+s.m.

427515; T79526; Hs.179516; integral type I protein; pros; diag 

427521 W97 52 E Ts test; diai 

427528; AU077143; Hs.179565; minichromosome maintenance def; mela; CTL+s.m. 
427535; R29543; Hs.2164; pro-platelet basic protein (in; fibro; diag
427546; AA405280; Hs.36793; hypothetical protein FLJ23188; lung; diag 
427550; BE242818; Hs.311609; nuclear RNA helicase, DECD var; mela; CTL+s.m.

427557; NM_002659; Hs.179657; plasminogen activator, urokina; panc, colon, stom, ovar, cerv, blad, lung, headnk, esoph; mAb
427578; AI5913C5; Hs.1690!; ESTs, Highly similar to TUL3_H; test; diag
427583; M82962; Hs.179704; meprin A, alpha (PABA peptide ; colon; mAb 
427584; BE410293; Hs.179718; v-myb avian myeloblastosis vir; test; CTL+s.m.

427585; D31152; Hs.179729; collagen, type X, alpha 1 (Sch; breast, lung, headnk, panc, stom, colon, ovar, cerv, sarc; diag

427615; BE410107; Hs.179817; CGI-82 protein, PSDRI; pros; diag 

427634; AI399745; Hs.18449; hypothetical protein MGC10820; mela, sarc; diag

427647; W19744; Hs.180059; Homo sapiens cDNA FLJ20653 fis; sarc; diag

17914 , nlhe skin protein (; breast, cerv, blad, lung, headnk, esoph; diag 

427667; AK001279; Hs.180171; Homo sapiens cDNA FLJ10417 fis; test; diag
427668; AA298760; Hs.180191; hypothetical protein FLJ14904; lung, test; diag
427681; AB018263; Hs.284232; tumor necrosis factor receptor; ovar; mAb+s.m.
427698 AW9725«4 H- (99; E t. ro OTL+S m 
427701; AA411101; Hs.243886; nuclear autoantigenic sperm pr; lung; mAb+CTL
427715; BE245274; Hs.180428; KIAA1181 protein; pros; diag
427719; AI393122; Hs.134726; ESTs; test, blad; diag
427730; AW250549; Hs.180577; granulin; mela; diag
427786; BE407863; Hs.256871; ESTs; esoph, blad; diag 
427809; M26380; Hs.180878; lipoprotein lipase; ovar; mAb
427811; M81057; Hs.180884; carboxypeptidase B1 (tissue); breast; s.m. 
427897; NM_017413; Hs.303084; apelin; peptide ligand for APJ; angio, renal, pros; diag
427912; AL022310; Hs.181097; tumor necrosis factor (ligand); angio; mAb 
427958; AA418000; Hs.376771; potassium intermediate/small c; pros, glio; mAb 
427961; AW293165; Hs.143134; ESTs; lung, sarc; diag
428001; H97428; Hs.219907; ESTs, Moderately similar to Tr; mela; diag
428004; AA449563; Hs.151393; glutamate-cysteine ligase, c; lung; s.m.
428023; AL038843; Hs.374530; Homo sapiens cDNA: FLJ23602 fi; lung; diag
428046; AW812795; Hs.337534; ESTs Mode atclyini i 13; lung, coion ;. 
428062; AA420683; Hs.98321; hypothetical protein FLJ14103; angio; diag
428087 AA100573 Hs.1 I; Iroponi fast; sarc; CTL+s.m. 
428141; D50402; Hs.182611; solute carrier family 11 (prot; glio; mAb
1 1 ' W51314 H ( I .nin.ng region, ovar diaj 

428169; AI928984; Hs.182793; golgi phosphoprotein 2; pros; diag

428182; BE386042 H .' 3317 ESTs Weakly ar - , GGC1_H; blad, headnk, lung, ovar, sarc; CTL+s.m. 

428183, AW969^26,H 1381 ESTs Weakly imila I j ^nne, EWS, diag 
428206; AB020643; Hs.183006; KIAA0835 protein; angio; mAb
428221; U96781; Hs.183075; ATPase, Ca transporting, cardi; sarc; s.m. 

428227; AA321649; Hs.2248; small inducible cytokine subfa; breast, lung, blad, ovar, headnk, fibro, colon, stom, cerv, leuk, renal, test, mela, esoph, hepC; diag

428248; AI126772; Hs.40479; ESTs; sarc; diag

428293; BE250944; Hs.183556; solute carrier family 1 (neutr; pros; mAb

428305; AA446628; Hs.2799; cartilage linking protein 1; sarc; diag

428329; AA426091; Hs.98453; ESTs, Moderately similar to R2; test; diag

428336; AA503115; Hs.183752; microseminoprotein, beta-; pros; diag 

428355; BE256452; Hs.2257; vitronectin (serum spreading f; colon; diag 

428398; AI249368; Hs.98558; ESTs; pros, breast; diag

428405; Y00762; Hs.2266; cholinergic receptor, nicotini; esoph, sarc; mAb 

428423; AU076517; Hs.184276; solute carrier family 9 (sodiu; ovar; CTL+s.m.

428434; AW363590; Hs.65551 ; Homo sapiens Simil Lit- ng, fib diag 

428467; AK002121; Hs.184465; hypothetical protein FLJ11259; fibro; mAb

428471; X57348; Hs.184510; stratifin; lung, headnk, colon, panc; diag

428645; AA431400; Hs.98729; ESTs, Weakly similar to 201750; lung; s.m.

428651; AF196478; Hs.188401; annexin A10; blad, stom, panc; diag

428667; AI375550; Hs.346868; nucleolar protein p40; homolog; fibro, uter; diag

428722 U76456, Hb 07£ tissue inhibit fi allop glio diag 

428728; NM_016625; Hs.191381; hypothetical protein; ovar, lung, BPH; CTL+s.m. 

428771; AB028992; Hs.193143; KIAA1069 protein; lung; CTL+s.m. 

428784; Y12851; Hs.193470; purinergic receptor P2X, ligan; glio, mela; mAb 

428800; M57627; Hs.193717; interleukin 10; fibro; diag

428801; AW277121; Hs.254381; ESTs; pros; diag

428804; AK000713; Hs.193736; hypothetical protein FLJ20706; mela; diag
428810; AF068236; Hs.193788; nitric oxide synthase 2A (indu; lung; s.m.
428819; AL135623; Hs.193914; KIAA0575 gene product; pros; CTL+s.m.
428824; W23624; Hs.173059; ESTs; panc; diag

428832; AA578229; Hs.324239; ESTs, Moderately similar to ZN; panc, uter; diag
428841; AI418430; Hs.104935; ESTs; renal; diag

428848; NM_000230; Hs.194236; leptin (murine obesity homolog; sarc; diag

428862; NM_000346; Hs.2316; SRY (sex determining region Y); pros, sarc; CTL+s.m.

428927; AA441837; Hs.90250; Homo sapiens hypothetical prot; fibro; mAb+diag

428928; BE409838; Hs.194657; cadherin 1, type 1, E-cadherin; pros, breast, stom, blad; mAb
428949; AA442153; Hs.104744; hypothetical protein DKFZp434J; sarc; diag

428957; NM_003881; Hs.194679; WNT1 inducible signaling pathw; cerv; diag

428959; AF100779; Hs.194680; WNT1 inducible signaling pathw; sarc; diag 

428977; AK001404; Hs.194698; cyclin B2; test; CTL+s.m. 
428981; BE313077; Hs.93135; ESTs, Weakly similar to ALU2_H; sarc; diag

429002; AW248439; Hs.2340; junction plakoglobin; blad; CTL+s.m.

429010; Y18198; Hs.194725; one cut domain, family member ; panc; diag

429038; AL023513; Hs.194766; seizure related gene 6 (mouse); lung; mAb

429058; AF138863; Hs.35254; hypothetical protein FLB6421; esoph; diag

429065; AI753247; Hs.29643; Homo sapiens cDNA FLJ13103 fis; lung; diag
429083; Y09397; Hs.227817; BCL2-related protein A1; mela; diag

429113; D28235; Hs.196384; prostaglandin-endoperoxide syn; angio, blad, stom; s.m.

429120; AK001673; Hs.196530; hypothetical protein FLJ10811; test; diag

429150; AF120103; Hs.197366; smoothened (Drosophila) homolo; ovar; mAb 

429163; AA884766; ; gb:am20a10.s1 Soares_NFL_T_GBC; pros; diag
429170; NM_001394; Hs.2359; dual specificity phosphatase 4; breast, panc, stom, lung, mela; s.m.

429201; X03178; Hs.198246; group-specific component (vita; panc; diag

429220; AW207206; Hs.356962; ESTs; breast, pros, BPH; diag 

429228; AI553633; Hs.356828; ESTs; lung, fibro, headnk, esoph; diag
429259; AA420450; Hs.380088; Plakophilin; lung, headnk; diag
429290; AF203032; Hs.198760; neurofilament, heavy polypepti; pros; CTL+s.m.

429299; AI620463; Hs.347408; hypothetical protein MGC13102; pros, cerv; diag

429329; AA456140; Hs.99235; Homo sapiens pannexin 3 (PANX3; sarc; mAb 

429345; R11141; Hs.199695; hypothetical protein; blad; diag

429359; W00482; Hs.2399; matrix metalloproteinase 14 (m; headnk, breast, cerv, ovar, blad, lung, esoph, mela, sarc; i
429413; NM_014058; Hs.201877; DESC1 protein; lung, blad; diag

429415; NM_002593; Hs.202097; procollagen C-endopeptidase en; sarc; diag
429423; AI016712; Hs.380983; integrin, beta 1 (fibronectin ; angio; mAb
429432; AI678059; Hs.202676; synaptonemal complex protein 2; breast, cerv; diag
429441; AJ224172; Hs.204096; lipophilin B (uteroglobin fami; breast, pros, ovar; diag
429466; M85835; Hs.12827; ESTs; glio, uter; CTL+s.m.

429469; M84590; Hs.389791; glycine dehydrogenase (decarbo; test; s.m.
429486; AF155827; Hs.203963; hypothetical protein FLJ10339; blad, lung, headnk, test; diag
429500; X78565; Hs.289114; hexabrachion (tenascin C, cyto; lung, glio, headnk, mela, sarc; diag
429504; X99133; Hs.204238; lipocalin 2 (oncogene 24p3) (N; ovar, lung, blad; diag
429505; AW820035; Hs.278679; a disintegrin and metalloprote; colon, leuk; mAb
429538; BE182592; Hs.139322; small proline-rich protein 2A; lung, esoph; diag 
429563, BE61941 3, H 24 7 ika lie tran t in in , t luna, diag 
429585; T73510; Hs.209153; angiopoietin-like 3; hepC; CTL+s.m.
429597; NM_003816; Hs.2442; a disintegrin and metalloprote; panc, colon, stom, lung; mAb
429699; AF992246; Hs.219363; cell adhesion molecule with ho; ovar, mela; diag

429612; AF062649; Hs.252587; pituitary tumor-transforming 1; lung, blad, headnk; diag 
429655 U48nbJH 11' ,) n, light | ypeptide lina pros s m 
429663; M68874; Hs.211587; phospholipase A2, group IVA (c; angio, lung; s.m.
429664; L20433; Hs.211588; POU domain, class 4, transcrip; sarc; CTL+s.m.
429736; AF125304; Hs.212680; tumor necrosis factor receptor; lung; mAb

429747; M87507; Hs.2490; caspase 1, apoptosis-related c; colon, stom, fibro; s.m.
429764; BE245076; Hs.216958; KIAA0194 protein; pros; mAb 
429769; NM_004917; Hs.218366; kallikrein 4 (prostase, enamel; pros; s.m. 
429784; M89796; Hs.30; membrane-spanning 4-domains, s; fibro; mAb 
429823; AA459443; Hs.181400; ESTs; sarc; diag

429859; NM_007050; Hs.225952; protein tyrosine phosphatase, ; breast; mAb+s.m. 
429918; AW873986; Hs.119383; ESTs; pros, glio; diag
429921; AA526911; Hs.82772; collagen, type XI, alpha 1; headnk, panc, sarc; CTL
429983; W92620; Hs.260855; ESTs; blad; diag 
429986; AF092047; Hs.227277; sine oculis homeobox (Drosophi; lung; CTL+s.m.
430014; H59354; Hs.374303; actinin, alpha 4; renal; diag 
430016; NM_004736; Hs.227656; xenotropic and polytropic retr; ovar; mAb
430044; AA464510; Hs.152812; ESTs; breast, lung, panc, headnk, ovar, stom, esoph; diag
430056; X97548; Hs.228059; KRAB-associated protein 1; test; CTL+s.m.
430129; BE301708; Hs.233955; hypothetical protein FLJ20401; angio; diag

430130; AL137311; Hs.234074; Homo sapiens mRNA; cDNA DKFZp7; pros; mAb 
430144; AI732722; Hs.98927; ERGL protein; ERGIC-53-like pr; pros; diag
430152; AB001325; Hs.234642; aquaporin 3; blad, fibro; mAb 
430154;AW5B3058;Hs.234,2> scrinr ^rr/cinnc) proteinas; pros; diag 
430157; BE348706; Hs.278543; ESTs; blad; diag

430168; AW968343; Hs.145582; DKFZP434I1735 protein; blad; diag
430223; NM_002514; Hs.235935; nephroblastoma overexpressed g; mela; diag 
430226 BE2455C/ I I p pro mAb 

430228, AW950939 M i ESI Highly sim toT00391; glio, diag 
SU 430 6 774; Hs.105 !; tost 1o >pmer cl od NYD I; CTL< n 
430253; AK001514; Hs.236844; hypothetical protein FLJ10652; test; CTL+s.m.
430255; AK000703; Hs.323822; Homo sapiens mRNA for KIAA1551; test; CTL+s.m.
430259; BE550182; Hs.375142; RalGEF-like protein 3, mouse h; ovar; CTL+s.m. 
430280; M361258; Hs.237868; interleukin 7 receptor; mela, lung, panc, stom, esoph, headnk, fibro; mAb+s.m.

430287; AW182469 H- 1 r T Woakl ml r bl FU5_H lest, diag 

430294; AI53822S H i ) , j leotide binding pro, pros, diag 

430337 M36707 H J I Ike 3 lung diag 

430354; AA954810; Hs.239784; human homolog of Drosophila So; ovar; diag

430378; Z29572; Hs.2556; tumor necrosis factor receptor; lung, fibro, breast, headnk, blad, breast, colon, stom; diag
430393; BE185030; Hs.241305; estrogen-responsive B box prot; lung; diag 
430396; D49742; Hs.241363; hyaluronan-binding protein 2; panc; diag
430407; H23551; Hs.30974; ESTs; panc; diag
430439; AL133561; Hs.380155; DKFZP434B061 protein; lung, test; diag
430451; AA836472; Hs.297939; cathepsin B; ovar, lung, headnk, panc, stom; diag
430454; AW469011; Hs.105635; ESTs; lung; diag 
430476; M447465; Hs.2563; tachykinin, precursor 1 (subst; sarc; diag
430487; D87742; Hs.241552; KIAA0268 protein; pros; diag
430491; AL109791; Hs.241559; Homo sapiens mRNA full length ; ovar; diag
430498; X02910; Hs.241570; tumor necrosis factor (TNF sup; leuk; diag 
430508; AI015435; Hd 04637; ESTs; lung; ra^.m. 
430521; NM_016383; Hs.242183; HOM-TES-85 tumor antigen; test; CTL+s.m.
430540; AW245422; Hs.106357; Homo sapiens cDNA: FLJ22105 fi; mela; mAb
430563; AA481269; Hs.348628; ATP-binding cassette, sub-fami; lung; diag 
430594; AK000790; Hs.246885; hypothetical protein FLJ20783; mela; diag
430634; AI860651; Hs.26665; calcyphosine; ovar; diag
430637; BE160081; Hs.256290; S100 calcium-binding protein A; mela; diag 
430647; AC003682; Hs.127988; ESTs, Weakly similar to Z211_H; test; diag
430676; AF084866; Hs.372585; gb:Homo sapiens envelope prote; test; diag
430677; Z26317; Hs.359784; desmoglein 2; lung, colon; mAb 
430678; AI458174; Hs.192855; ESTs; lung; diag
430686; NM_001942; Hs.2633; desmoglein 1; 

430691; C14187; Hs.157208; aristaless-related homeobox pr; bWB, ovar, panc; diag
430704; AW813091; Hs.335799; ESTs; stom; diag 
430770; AA765694; Hs.123296; ESTs; mela; diag 

430832; AI073913; Hs.100686; ESTs, Weakly similar to JE0350; breast, colon, ovar, uter, lung, stom, fibro; diag 

430838; N46664; Hs.169395; hypothetical protein FLJ12015; mela; CTL+s.m.

430890; X54232; Hs.2699; glypican 1; glio, lung, cerv, blad, esoph; mAb 

430985 AA4902^2 H 23; ESTs, \\ lyimilarto I78885; lung; mAb 

431009; BE149762; Hs.48956; gap junction protein, beta 6 (; lung, blad, headnk, esoph; mAb 

431053; S40369; Hs.249141; Glutamate receptor subunit; glio; mAb

431070; AW408164; Hs.249184; transcription factor 19 (SC1); blad; diag 

431089; BE041395; Hs.374629; ESTs, Weakly similar to unknow; blad, lung, pros, angio, fibro; diag 

431099; Y13367; Hs.249235; phosphoinositide-3-kinase, cla; pros; CTL+s.m.

431103; M57399; Hs.44; pleiotrophin (heparin binding ; sarc, mela; diag

431124; AF284221; Hs.59506; doublesex and mab-3 related tr; lung; CTL+s.m. 

431151; BE207083; Hs.366053; gb:ba10d10.y1 NIH_MGC_7 Homos; pros; mAb 

431164; M493650; Hs.94367; thyroid transcription factor 1; fibro; CTL+s.m.

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endopla; mela, pros, panc, colon, stom; mAb

431211; M86849; Hs.323733; gap junction protein, beta 2, ; colon, blad, lung, panc, headnk, esoph; mAb

431217; NM_013427; Hs.250830; Rho GTPase activating protein ; pros; CTL+s.m.

431221; AA449015; Hs.286145; SRB7 (suppressor of RNA polyme; lung; CTL+s.m.

431222; X56777; Hs.273790; zona pellucida glycoprotein 3A; pros; diag 

431250; BE264649; Hs.251377; taxol resistance associated ge; esoph; diag 

431322; AW970622; Hs.375626; gb:EST382704 MAGE resequences,; blad, ovar, uter; diag

431347; AI133461; Hs.251664; insulin-like growth factor 2 (; blad; mAb+diag 

431354; BE046956; Hs.251673; DNA (cytosine-5-)-methyltransf; test; CTL+s.m.

431360; NM_000427; Hs.251680; loricrin; mela, sarc; diag 

431362; AI874223; Hs.293560; ESTs; angio; diag 

431369; BE184455; Hs.251754; secretory leukocyte protease i; ovar, blad; diag 

431384; BE158000; Hs.334372; gb:MR2-HT0377-150200-202-e03 H; lung; diag 

431385; BE178536; Hs.11090; membrane-spanning 4-domains, s; panc; diag

431441; U81961; Hs.2794; sodium channel, nonvoltage-gat; ovar, pros, blad; mAb 

431448; AL137517; Hs.306201; hypothetical protein DKFZp5640; blad; mAb

431457; NM_012211; Hs.256297; integrin, alpha 11; headnk; mAb

431474; AL133990; Hs.190642; CEGP1 protein; breast, pros, blad; diag 

431494; AA991355; Hs.298312; hypothetical protein DKFZp434A; lung; diag 

431512; BE270734; Hs.2795; lactate dehydrogenase A; panc; s.m.

431548; AI834273; Hs.9711; novel protein; lung, angio, pros; diag 

431553; X78075; Hs.2799; cartilage linking protein 1; sarc; diag 

431579; AW971082; Hs.222886; ESTs, Weakly similar to TRHYJ; pros; diag

431616; AA508552; Hs.222874; ESTs, Weakly similar to I38022; pros, panc, colon; mAb

431674; AA098901; Hs.301642; G-protein coupled receptor; ovar; mAb+s.m.

431723; AW058350; Hs.278966; Homo sapiens mRNA cDNA DKFZp5; fibro; diag 

431728; NM_007351; Hs.268107; multimerin; angio; diag

431808; M30703; Hs.270833; amphiregulin (schwannoma-deriv; breast, headnk, panc, colon; diag
431836; AF178532; Hs.271411; beta-site APP-cleaving enzyme ; mela; mAb
431870; AW449902; Hs.105500; ESTs; renal; diag 

431890; X17033; Hs.2719S5; integrin, alpha 2 (CD49B, alph; blad, headnk, lung, panc, cerv, stom; mAb

431938; AA938471; Hs.54431; specific granule protein (28 k; panc; diag

431939; AW008061; Hs.231994; ESTs; renal, colon; diag

431941; AK000106; Hs.272227; Homo sapiens cDNA FLJ20099 fis; cerv, glio; diag

431989; AW972870; Hs.291069; ESTs; ovar; diag 

431992; NM_002742; Hs.2891; protein kinase C, mu; pros, glio; s.m.

432004; BE018302; Hs.2894; placental growth factor, vascu; renal; diag 

432015; AL157504; Hs.159115; Homo sapiens mRNA; cDNA DKFZp5; blad; diag

432023; AW273128; Hs.300268; EST; lung; diag 
432097; X51730; Hs.2905; progesterone receptor; blad; mAb+s.m.
432117; AL036195; Hs.2909; protamine 1; test; CTL+s.m.
432123; AA127221; Hs.66; ESTs; angio; diag 

432141; BE410964; Hs.272736; nuclear receptor binding prote; test; mAb+s.m.
432189; AA527941; ; gb:nh30c04.s1 NCI_CGAP_Pr3 Hom; pros; diag
432199; AI693815; Hs.127179; cryptic gene; panc; diag

432210; AI567421; Hs.273330; Homo sapiens, clone IMAGE3544; ovar, lung, blad; diag 
432222; AI204995; ; gb:an03c03.x1 Stratagene schiz; angio, blad, fibro; diag
432231; M339977, H 274127 LST112 protein libra, diag 
432239; X31334; Hs.2936; matrix metalloproteinase 13 (c; blad, lung, headnk, esoph, sarc; s.r 
432240 A6M767 H- 1291 9; Horn i cDN II 113 ,1 fc pros dag 
432305;M62402 H^-'J 1 ii r, I i-gruMh factor bin; cerv; diag 
432374; W68815; Hs.301885; Homo sapiens cDNA FU11346 fis; cerv, lu 
432407; AA221036; ; gb:zr03f12.r1 Stratagene NT2n; lung, tesl,colon; " 
432415; T16971; ; ESTs, Weakly similar to A43932; ovar, pros; diag



432435; BE218886; Hs.282070; ESTs; pros, uter, colon, stom, fibro; diag 

432441; AW292425; Hs.163484; intron of hepatocyte nuclear f; blad, fibro, pros; diag

432473; AI202703; Hs.152414; ESTs; pros; diag

432481; AW451645; Hs.151504; intron of collagen, type XI, a; sarc; diag 

432512; NM_003284; Hs.3017; transition protein 1 (during h; test; CTL+s.m.

432519; AI221311; Hs.130704; ESTs, Weakly similar to BCHUIA; fibro, ovar, uter; CTL+s.m.

432542; AW083920; Hs.16098; claudin 2; raio, panc; diag
432583; AW023624; Hs.162282; potassium channel TASK-4; pola; lung; mAb 
432615; AA557191; Hs.55028; ESTs, Weakly similar to I54374; pros; diag 
432621; AI298501; Hs.21192; ESTs, Weakly similar to T46428; pros; mAb 
432629; AW860548; Hs.280658; ESTs; ovar; diag 

432653; N62096,Hs 29 1 r_r Vv I } MrnilartoJC7328; pros, lung ; mAb+s.m. 
432666; AW204069; Hs.351118; ESTs, Weakly similar to unname; test; diag 

327 NM 013 )) < J 2 3124 024 tig II rnallcelllur , I o- c r pi n b+CTI 
432730; AI066520; Hs.131358; ESTs; test; diag
432731; R31178; Hs.287820; fibronectin 1; panc, fibro; diag
432788; AA521091; Hs.178499; Homo sapiens cDNA: FLJ23117 fi; lung, ovar; CTL+s.m.
432800; BE391046; Hs.278962; AIM-1 protein; mela, pros; mAb 

432842; AW674093; Hs.334822; hypothetical protein MGC4485; blad, lung, headnk; CTL+s.m.

432850; X87723; Hs.3110; angiotensin receptor 2 (AT2); leio; mAb

432855; AF017988; Hs.279565; secreted frizzled-related prot; panc; diag

432867; AW016936; Hs.233364; ESTs; stom, colon; diag 

432878; BE386490; Hs.279663; Pirin; mela; CTL+s.m.

432887; AI926047; Hs.162859; AK056805: Homo sapiens cDNA FL; pros; diag

432938; T27013; Hs.3132; steroidogenic acute regulatory; ovar; diag 

432966; M650114; Hs.325198; EST; pros; diag

4 ~> 1_004045; Hs 27 ATX 1 (ant ant protein 1 ri , 

433013; AI697890; Hs.127337; axin 2 (conductin, axil); colon; CTL+s.m.

433043 W57554 II 1 ^, ymphoid nuc'E r pre n (LAF pro breast; diag 

433068; NM_006456; Hs.288215; sialyltransferase; breast, ovar, mela; s.m. 

433078; AW015188; Hs.121575; Homo sapiens cDNA FLJ12231 fis; blad; diag

433091; Y12642; Hs.3185; lymphocyte antigen 6 complex, ; blad, lung, headnk, cerv; mAb 

433147; AF091434; Hs.43080; platelet derived growth factor; ovar, panc, fibro; diag

433159; AB035898; Hs.150587; kinesin-like protein 2; ovar, uter, colon, blad; diag 

433170; AB037816; Hs.8982; KIAA1395; angio; diag 

433183; AF231338; Hs.222024; transcription factor BMAL2; lung; diag

433228; F28212; Hs.14953; KIAA1491 protein; test; CTL+s.m.

433258; AI806626; Hs.207300; ESTs, Weakly similar to ALUB_H; lung; diag

433285; AW975944; Hs.237396; ESTs; breast, pros; diag

433293; AF007835; Hs.32417; hypothetical protein MGC2742; fibro, pros, stom, panc; CTL+s.m.

433323; AA805132; Hs.159142; ESTs; pros; diag

433334; AI927208; Hs.231958; matrix metalloproteinase 28; panc; s.m.



433376; AI249361; Hs.74122; caspase 4, apoptosis-related c; angio; s.m.
433388; AI432672; Hs.288539; hypothetical protein FLJ22191; ovar; CTL+s.m.
433404; T32982; Hs.352670; Homo sapiens cDNA FLJ32064 fis; pros; diag 

433437; U20536; Hs.3280; caspase 6, apoptosis-related c; fibro, breast, cerv, lung, blad, panc, glio, colon; s.m.

433444; AW975324; Hs.129816; ESTs; pros; diag 

433466; AA508353; Hs.105314; relaxin 1 (H1); pros; diag

433485; AI493076; Hs.306098; aldo-keto reductase family 1, ; lung; s.m.

433495; AW373784; Hs.71; alpha-2-glycoprotein 1, zinc; breast, pros; diag 

433576; BE080715; Hs.161091; ESTs; mela; diag 

433662; W07162; Hs.150826; RAB25 RAB25, member RAS oncoge; colon; diag 

433671; AW138797; Hs.132906; 19A24 protein; fibro; mAb 

433701; AW445023; Hs.15155; ESTs; test; diag 

433724; AI827749; Hs.144924; serine/threonine protein kinas; test; CTL+s.m. 

433764; AW753676; Hs.39982; zinc finger protein RINZF (NM_; pros, ovar; diag

433800; AI034361; Hs.135150; lung type-I cell membrane-asso; glio, lung, test; mAb

433914;AF108138;Hs.112160;HomosapiensDNAhfl as< h u I 

434011; AW953437; Hs.5486; clone FLB5214; pros; diag

434105; AW952124; Hs.13094; presenilins associated rhomboi; lung; diag

434217; AW014795; Hs.23349; ESTs; angio; diag 

434262; AF121858; Hs.12169; sorting nexin 8; mela; CTL+s.m. 

434274; AA628539; Hs.57783; ESTs, Moderately similar to AL; test; diag 
434334; AA912476; Hs.116750; Homo sapiens cDNA FLJ13221 fis; test; diag
434340; Al lls.35314C;E ! jng 

434360; AW015415; Hs.127780; ESTs; lung; diag 
434370; AF130988; Hs.58346; ectodysplasin 1, anhidrotic re; col 
434377; AW137148; I Is. 306593; in'.ron of periostin (OSF-?os)- 1 
434398; AA121098 Hs3838 serum in U m j reast; CTUs.r 

434411; AA632649; Hs.201372; ESTs; stom, leuk; diag
434414; AI798376; ; gb:tr34b07.x1 NCI_CGAP_Ov23 Ho; lung, test, colon; diag
434423; NM_006769; Hs.3844; LIM domain only 4; panc; diag
434449; AW953484; Hs.3849; hypothetical protein FLJ22C41 ; sarc; diag 
434487; AF143867; Hs.337588; ESTs, Moderately similar to S6; blad; mAb+s.m.
434596; T59538; ; gb:yb65g12.s1 Stratagene ovary; angio; s.m.
434608; AA805443; Hs.179909; hypothetical protein FLJ22995; test; CTL+s.m.
434609; R76593; ; gb:yi60o11.r1 Soares placenta ; pros; diag 
434636; AA083764; Hs.349208; hypothetical protein MGC3178; angio; diag 
434649; AA738254; Hs.165390; ESTs, Highly similar to A4035Q; test; diag
434665; AA642125, 1 fe '4502 qn rt30c01 s1 NCLCGAP_Lyrn3 Ho; pane; diag 
434666; AF151103; Hs.112259; T cell receptor gamma locus; pros; mAb+s.m.
434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis; panc; diag
434826; AF155661; Hs.22265; pyruvate dehydrogenase phospha; mela; s.m.
434846; AW295389; Hs.119768; ESTs; angio; diag 
434876; AF160477; Hs.61460; Ig superfamily receptor LNIR; lung, blad; mAb 
434927; H46612; Hs.293815; Homo sapiens HSPC285 mRNA, par; angio; diag
434973; AW449285 h 31 636; EST: pros; diag 
435045; BE297155; Hs.143698; ESTs; test; diag
435047; AA454985; Hs.54973; cadherin-like protein VR20; pros; mAb 
435066; BE261750; Hs.4747; dyskeratosis congenita 1, dysk; colon; CTL+s.m.
435080; AI831760; Hs.155111; hypothetical protein FLJ14428; renal; mAb
435094; AI560129; Hs.289008; EST; ovar, cerv; diag 

435099; AC004770; Hs.4756; flap structure-specific endonu; blad, test, mela; CTL+s.m.
435140; AA668123; Hs.134170; ESTs; fibro; diag 
435159; AA668879; Hs.116649; ESTs; lung; diag 
435206; AI432364; Hs.160594; ESTs; test; diag 

435243; AW292886; Hs.348932; hypothetical protein dJ434014.; cerv, headnk; diag 

435292; N20514; Hs.172965; ESTs; mela; diag 

435299; AI745458, Hs 343026 E r I ,\ n ilai o T20593; fibro; diag 

435479; AF197137; Hs.155101; ATP synthase, H+ transporting,; pros; s.m.

435496; AW840171; Hs.265398; PAR-6 beta; breast, panc, ovar; diag

435563; AF210317; Hs.95497; solute camerfamil, (facil ad; mAb* 

435575; AF213457; Hs.44234; triggering receptor expressed ; fibro; mAb+s.m.

435602; AF217515; Hs.283532; uncharacterized bone marrow pr; test; diag 

435615; Y15065; Hs.4975; potassium voltage-gated channe; glio; mAb 

435652; N32388; Hs.334370; uncharacterized hypothalamus p; panc; diag

435793; AB037734; Hs.4993; KIAA1313 protein, ovar, lung, ul d ^ 

435849; BE305242; Hs.16098; claudin 2; colon, panc; diag

435876; AW612586; Hs.160271; G protein-coupled receptor 43; pros; mAb

435897; AF269223; Hs.128322; t-complex 11 (a murine tcp hom; test; diag

435904; AF261655; Hs.8910; 1,2-alpha-mannosidase IC; blad; s.m. 

435918; AF263538; Hs.86232; growth differentiation factor ; test; diag 

435974; U29690; Hs.37744; Homo sapiens beta-1 adrenergic; pros, EWS; mAb+s.m.

436032; AA150797, Hs I092 76: late in prDfein, pane, angio; diag 

436063; AK000028; Hs.356100; ribosomal protein S24; pros; diag 

436120; AI248193; Hs.119860; ESTs; fibro; diag

436199; R38946; Hs.127951; hypothetical protein FLJ14503; renal; diag

436246; AW450963; Hs.119991; ESTs; blad; diag 

436251; BE515065; Hs.296585; nucleolar protein (KKE/D repea; colon, test, blad; CTL+s.m.
436278; BE396290; Hs.5097; synaptogyrin 2; pros; mAb 

436291; BE568452; Hs.344037; protein regulator of cytokines; lung, blad, headnk; diag 

436293; AI601188; Hs.306201; ESTs; blad; diag

436302; AL355841; Hs.99330; hypothetical protein FLJ23588; lung; diag

436315; BE390513; Hs.27935; hypothetical protein MGC4837; mela; diag

' 1683487; H 15 i, MIM\ i rr t 1 » n n d r 

436420; AA443966; Ho 1595; ESTs; ang ; o; mAb 
436476; AA326108; Hs.33829; bHLH protein DEC2; pane; diag 
436511; AA721252; Hs.291502; ESTs; lung; diag 
436553; AW407157; Hs.181125; immunoglobulin lambda locus; lung; diag
436569; BE439539; Hs.301961; glutathione S-transferase M2 (; blad; s.m. 
436614; AW104388; Hs.149091; ESTs; mela; CTL+s.m.
436700; AI693690; Hs.301406; hypothetical protein PP3501; mela; mAb 
436729; BE621807;Hs.351316; transmembrane 4 superfamily me- pane colon stom ovar lun 
436772; AW975688; Hs.348918; metallothionein 1E (functional; angio; diag 
436775; AA731111 I ?222 EST jter, ovar; diag 
436839; AA767346; Hs.372277; ESTs; lung; diag 
436856; AI469355; Hs.127310; ESTs; mela; diag 
436954; AA740151,Hs 130425 E! rs tib'o, uter, ovan diag 
436972; AA284679; Hs.25640; claudin 3; ovar, lung, pros; mAb 
437052; AA861697; Hs.120591; ESTs; pros; diag 
437099; N77793; Hs.48659; ESTs, Highly similar to S14458; test; diag
437100; AI761073; Hs.14535; Homo sapiens cDNA: FLJ22314 fi; panc, renal; diag
437119 AI379921 Hs 1 C J n fibro- diag 

437145; AF007216; Hs.5462; solute carrier family 4, sodiu; panc, pros, stom; mAb

'I 6600; Hs. 121 194; Homo sapiens cDNA: FLJ21 torn rial col i 
437181; AI306615; Hs.125343; ESTs, Weakly similar to KIAA07; blad; mAb+s.m.
437204; AL110216; Hs.355961; ESTs, Weakly similar to I55214; lung; CTL+s.m.

437212; AI765021; Hs.210775; ESTs; renal, uter, ovar; diag

437224; AL117628; Hs.97808; ESTs; test; diag

437259; AI377755; Hs.120695; ESTs; lung; diag 

437267; AW511443; Hs.258110; ESTs; BPH; diag

437269; AA334384; Hs.149420; ESTs; angio; diag 

437330; AL353944; Hs.50115; Homo sapiens mRNA; cDNA DKFZp7; sarc; diag 

437381; NM_003684; Hs.5591; MAP kinase-interacting serine/; glio; CTL+s.m.

437390; AI125859; Hs.112607; ESTs; lung; diag 

437412; BE069288; Hs.34744; Homo sapiens mRNA; cDNA DKFZp5; lung; diag 
437435; AA249439; Hs.27027; hypothetical protein DKFZp762H; lung; diag 
437437; AA226869; Hs.351623; hypothetical protein DKFZp762L; test; CTL+s.m.
437478; AL390172; Hs.317432; branched chain aminotransferas; angio; s.m. 
437553; AI829935; Hs.130497 EST; Weaklj slmilartoM/ 8 H; blad Wl 
437571; AA760894; Hs.125350; ESTs; pros; diag 
437623; D63880; Hs.5719; chromosome condensation-relate; test; diag 
437740; AA810265; Hs.122915; ESTs; mela; diag 
437802; AI475995; Hs.122910; ESTs; panc; diag

437862; AW978107; Hs.5884; Homo sapiens mRNA; cDNA DKFZp5; mela; CTL+s.m.
437908; AI082424; Hs.351043; ESTs; test; diag 

437915; AI637993; Hs.202312; Homo sapiens clone N11 NTera2D; lung, headnk, ovar, blad, uter; diag
437931; AI249468; Hs.124434; ESTs; blad; diag 

437935; AW939591; Hs.5940; mucin 13, epithelial transmemb; colon, stom, uter, panc; mAb+s.m.
437938; AI950087; Hs.369628; gb:wqOSc02.x1 NCI_CGAP_Kid12 H; renal, ovar, uter, cerv, blad; diag
137 till ly similar to S59501 angio mAb-^.rn. 

437960; AI669586; Hs.369312; ESTs; uter, ovar; diag 

438167; R28363; Hs.24286; chemokine binding protein 2 (C; ovar, breast, uter; mAb

438199; AW016531; Hs.122147; hypothetical protein FLJ13189; breast; diag

438209; AL120659; Hs.6111; aryl-hydrocarbon receptor nucl; mela; mAb+s.m.

438233; W52448; Hs.56147; ESTs; pros, cerv; diag

438274; AI918906; Hs.55080; ESTs; headnk; diag 

438403; AA806607; Hs.292206; ESTs; lung; mAb 

438438; AA257992; Hs.50651; Janus kinase 1 (a protein tyro; EWS; s.m. 

438450; AI05C866; Hs.65853; nodal, mouse, homolog; test; diag 

4 84J \9133 , lls.2797 ESI lesl di g 

438552; AJ245820; Hs.6314; type I transmembrane receptor ; pros, ovar; diag 

438670; AI275803; Hs.123428; ESTs; fibro; CTL+s.m.

438702; AI879064; Hs.7164; ESTs; lung; diag 

438707; L08239; Hs.5326; amino acid system N transporte; ovar; mAb 

438746; AI885815; Hs.184727; Human melanoma-ass.-'*.- 1 ant; pai c b ad, mela, ovar; mAb .CTL 

438817; AI023799; Hs.163242; ESTs; ovar, uter, blad, renal; diag 

438859; AI559626; Hs.93522; Homo sapiens mRNA for KIAA1647; renal; diag

438866; U44385; Hs.6441; tissue inhibitor of metallopro; mela; diag 

438873; AI302471; Hs.124292; Homo sapiens cDNA: FLJ23123 fi; fibro; diag

438898; AI81PS63; Hs.106243; ESTs; lung; diag

438915; AA280174; Hs.355711; Williams-Beuren syndrome chrom; lung, test, mela; diag

438929; AW195515; Hs.253177; ESTs; renal; diag

438956; W00847; Hs.135066; Human DNA sequence from clone ; lung; diag

438966; AW979074; ; gb:EST391184 MAGE resequences,; renal; diag

438983; AF085884; Hs.20029; proacrosin binding protein sp3; test; CTL+s.m.

438993, AA828951 3 t 177b0f I N( !_CGAP_Ov2 Horn; ovar; mAb+s.m. 

439053; BE244588; Hs.6456; chaperonin containing TCP1, su; test; diag

439092; AA830149; ; gb:oc44f08.s1 NCI_CGAP_GCB1 Ho; pros; diag

439176; AI446444; Hs.190394; ESTs, Weakly similar to B28096; pros; diag 

439180; AI393742; Hs.199067; v-erb-b2 avian erythroblastic ; breast, ovar, uter, pros, blad, panc, colon, fibro, mela; mAb
439221; AA737106 H 32250 ESTs Moderately mil I WS; s.m 
439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582; mela, fibro; diag
439239; AI031540; Hs.235331; ESTs; blad; diag 

439285; AL133916; Hs.47860; hypothetical protein FLJ20093; lung, breast; diag
439310; AF086120; Hs.102793; ESTs; mela; diag

439318; AW837046; Hs.6527; G protein-coupled receptor 56; colon, breast, ovar, uter, cerv, pros, lung, headnk, blad, mela; mAb+s.m.

439335; AA742697; Hs.62492; NM_052863:Homo sapiens secreto; fibro, uter; diag 

439366; AF100143; Hs.6540; fibroblast growth factor 13; pros; CTL+s.m. 

439382; BE247684; Hs.103070; ESTs; angio; diag 

439394; AA149250; Hs.56105; ESTs; lung; diag 

439410; AA632012; Hs.188746; ESTs; angio; diag 

439453; BE264974; Hs.6566; thyroid hormone receptor inter; lung, esoph, ovar; mAb+s.m. 
439496; BE616501 Hs 3. m m pl , 3 milarto RIKEN; mela, esoph; diag 
439659; AW970780; Hs.59483; leucine-rich repeat-containing; ovar, stom, mela, colon; mAb 
439668; A1091277; Hs.302634; frizzled (Drosophila) homolog ; ovar, uter; mAb 
439670; AF088076; Hs.59507; ESTs, Weakly similar to AC0048; lung, headnk, cerv; diag 
439702; AW085525; Hs.55964; ESTs; mela; diag 

439706; AW87252" H, i9761 EST Weakly in (c DAP1 H; ovar, lung, headnk; diag 

439735; AI635386; Hs.142846; hypothetical protein; pros; diag 

439737; AI75143S; Hs.41271; Homo sapiens mRNA full length ; panc; diag

439750; AL359053; Hs.57664; Homo sapiens mRNA full length ; panc, fibro, breast; diag

439755; AW748482; Hs.77873; B7 homolog 3; sarc; mAb 

439759; AL359055; Hs.67709; Homo sapiens mRNA full length ; colon, stom, panc, leuk, lung; diag

439778; AL109729; Hs.99364; putative transmembrane protein; pros; mAb+s.m.

439780; AL109688; ; gb:Homo sapiens mRNA full leng; blad, esoph; diag 

439820; AL360204; Hs.283853; Homo sapiens mRNA full length ; ovar, uter, cerv, breast, pros; diag

439864; AI720078; Hs.291997; ESTs, Weakly similar to A47582; test; diag

439867; AA847510; Hs.161292; ESTs; panc; diag
439920; H05430; Hs.288433; neurotrimin; panc; mAb+diag

439926; AW014875; Hs.137007; ESTs; blad, esoph, lung, cerv; diag

-139 3 AW247! ) L l 7£ pic.n t-adivaling fact > t, lung, blad' s m 
- 440342 AI073387 Hs.133898 "STs svar, CTL-H;.rn. 
5 440086; NM_005402;Hs69Ub .-ra! simian leukomi viral on; angio; diag 

440099; AL080058; Hs.6909; DKFZP564G202 protein; panc; diag

440119; AA865455; Hs.125331; ESTs, Moderately similar to un; test; diag

440138; AB033023; Hs.318127; hypothetical protein FLJ10201; lung; CTL+s.m.
440151; AA868167; ; gb:ak38e07.s1 Soares_testis_NH; sarc; diag
440207; AI371978; Hs.128326; ESTs; test; diag

440209; H05049; Hs.247837; neurexin 3; fibro; diag 

440210; AW6745'"' H 12212 [T Jl0 diag 

440225; BE295782; Hs.159; tumor necrosis factor receptor; glio; mAb 
440238; AW451970; Hs.155644; paired box gene 2; ovar; diag

15 1402 0 AI9723 lls.713 pine IV- pros- d'ag 

440273; AI805392 M 32533 Hom -piens cDNA: FU23523 (i; lung, fibro; diag 

440274; R24595; Hs.7122; scrapie responsive protein 1; sarc; diag 

440311; AI733079; Hs.125407; ESTs, Moderately similar to AL; renal; diag

440325; NM_003812; Hs.7164; a disintegrin and metalloprote; lung; mAb
440333; AI378424; Hs.288761; hypothetical protein FLJ21749; pros; CTL+s.m.

440449; AA885430; Hs.201925; Homo sapiens cDNA FLJ13446 fis; breast; diag

440452; AI925136; Hs.55150; ESTs, Weakly similar to CAYP_H; fibro; diag 

440457; BE387593; Hs.21321; Homo sapiens clone FLB9213 PRO; mela; diag 

440484 BE3281 Hs. 150356 ESTs pane diag 
440529; AW207640; Hs.16478; Homo sapiens cDNA: FLJ21718 fi; pros; diag

440659; AF134160; Hs.7327; claudin 1; lung; mAb

440704; M69241; Hs.162; insulin-like growth factor bin; lung, glio, ovar; diag

440773; AA352 I llomosaf I I RIHEN test diag 

440801; AA906366; Hs.370038; ESTs; pros; diag 
440819; AI809444; Hs.202108; ESTs; pros; diag

440901; AA909358; Hs.128612; ESTs; ovar, pros; diag 

440943; AW082298; Hs.146161; hypothetical protein MGC2408; lung; diag

440983; M20681; Hs.7594; solute carrier family 2 (facil; test; mAb

441020; W79283; Hs.35952; ESTs; lung, panc; diag
441031; AI110684; Hs.7645; fibrinogen, B beta polypeptide; lung, panc, colon; CTL+s.m.

441085; AW135551; Hs.181245; Homo sapier ;DNA FLJ12632 fis p.nu >va s>om ute Jung, dug 

441134; W29092; Hs.345950; cellular retinoic acid-binding; sarc; diag

441247; AW118681; Hs.128051 Hnrr ar. r hyrn jjmallj, pros; diag 
An 44132 H F B-cel associated protein q 
441345; AW063579; Hs.7780; Homo sapiens mRNA; cDNA DKFZp5; pros; diag

441350; AB020690;Hs 7782, F ,M«Tht< antiger MA2, pane; mAb+CTL 

441377; BE218239; Hs.202656; ESTs; uter, endo, lung; diag

441384; AA447849; Hs.288660; retinoic acid induced 3; ovar; mAb+s.m.
441392; AW451831; Hs.222119; ESTs, Weakly similar to S30433; renal; diag
45 441457, AW99 I; Hs.4383 ESTs angio diag 

441495;/W2nm u f 339 LjT blaj diag 

441525; AW241867; Hs.127728; ESTs; lung; diag 

441553; AA281219; Hs.121296; ESTs; lung, test, ovar; CTL+s.m.

50 t« hs&i; m rr.r-r^*^^^^^*" 

441801; AW242799; Hs.86366; ESTs; blad; diag 

441835; AB036432; Hs.184; advanced glycosylation end pro; fibro; mAb

441859; AW194364; Hs.380444; interleukin 4 induced gene-1 p; ovar, mela, fibro; mAb

441878 Al 0186 H 12 982 ESP te-l diag 
442006; AW975183; Hs.372210; ESTs, Weakly similar to S72482; fibro, angio; CTL+s.m.

442082; R41E2' H< H calsyn n.n 2 breast, pros, ovar; diag 

442104; L20971; Hs.188; phosphodiesterase 4B, cAMP-spe; angio; CTL+s.m. 

442108; AW452649; Hs.343259; ESTs; lung; diag 

442242; AV647908; Hs.90424; Homo sapiens cDNA: FLJ23285 fi; BPH; diag
442323; AW016669; Hs.29190; ESTs; breast; diag

442333; AI650877; Hs.129302; ESTs; test; diag

442432; BE093589; Hs.38178; hypothetical protein FLJ23468; blad, lung, panc, esoph, mela; CTL+s.m.
442438; AA995998; Hs.370007; gb:os26b03.s1 NCI_CGAP_Kid5 Ho; uter, ovar, renal; diag
442441; AI820662; Hs.129598; ESTs; breast; diag
442503; AF147078; Hs.375031; p53-responsive gene 5; mela; diag
442506; BE566411; Hs.41726; ESTs; angio; diag 

442573; H93366; Hs.7567; branched chain aminotransferas; ovar, panc, angio, test; s.m.

442577; AA292998; Hs.163900; ESTs; blad, panc, colon, stom, ovar; diag

442560; AI733682; Hs.130239; ESTs; breast; diag
/<J 442609; A L02099o H L 1u 3 rop iN mela diag 

442613; AI004002; Hs.130522; Kv channel-interacting protein; glio; diag

442622; NM_000435; Hs.8546; Notch (Drosophila) homolog 3; ovar; mAb

442711; AF151073; Hs.8645; hypothetical protein; angio, mela, sarc; diag 

442739; NM_iKr«i | 6 7 i>solicacy!<xx*zymeAthic;mela;s.m. 
I J 442757, AI73\..[ H 28345 ESTs mela,diag 

442818; AK001741; Hs.8739; hypothetical protein FLJ10879; breast; diag

442821; BE391929; Hs.8752; transmembrane protein 4; ovar; diag

442832; AW206560; Hs.253569; ESTs; pros, fibro; diag
on 442896 R3- t 5 F Ts pan- diag 

442994; AI026718; Hs.16954; ESTs; blad, fibro; diag

443054; AI745185; Hs.84520; yes-associated protein 65 kDa; blad; diag 

443162; T49951; Hs.9029; DKFZP434G032 protein; blad, lung; CTL+s.m.

443171; BE281128; Hs.9030; TONDU; blad, ovar; diag
443184; AI638728; Hs.135159; ESTs; sarc; diag

443211, AI128388 H 143 I ! o/ar lung, headnk, stom, colon; diag 
443216; W80487; Hs.324521; hypothetical protein DC50; test; diag
443257; AI334040; Hs.11614; HSPC065 protein; fibro; CTL+s.m.
443400; R28424; Hs.250648; ESTs; lung; diag 
443523; AK001575; Hs.9536; hypothetical protein FLJ10713; test; CTL+s.m.
443537; D13305; Hs.203; cholecystokinin B receptor; test; mAb 
443648; AI085377; Hs.143610; ESTs; lung, headnk; diag
443709; A 3269 Hs.1 3466?; ESTs; I J r 
443715; AI583187; Hs.9700; cyclin E1; lung, stom, ovar, colon; CTL+s.m. 
443785; AW449952; Hs.190125; basic-helix-loop-helix-PAS pro; glio, uter, ovar; CTL+s.m.
443802; AW504924; Hs.9805; KIAA1291 protein; sarc; diag 
443883; AA114212; Hs.9930; serine (or cysteine) proteinas; sarc; s.m.
443885; H91806; Hs.15284; ESTs; mela; diag 
443892; AI889572; Hs.246875; ESTs; lung; diag 
443950; NM_001425; Hs.9999; epithelial membrane protein 3; mela; mAb 
443968; AA287702; Hs.10031; KIAA0955 protein; angio; diag 
443983; H04482; Hs.163724; ESTs; mela; mAb 

443991; NM_002250; Hs.10082; potassium intermediate/small c; pros, colon, uter; mAb
444009; AI380792; Hs.135104; ESTs; angio; diag 
444151; AW972917; Hs.128749; alpha-methylacyl-CoA racemase; pros; mAb 
444159; AF1 16846; Hs.10431; dcrjd nngcr (Droccphih-l4= • test; CTb s m 
444163; AI126098; ; FGENESH predicted RNaseH domai; blad; s.m. 
444301;AK000136;Hs10T,0 ,, porin (LRR class 1); pane; diag 
444325,AW152t>18 K lw-7 3Tc -ph. diag 
144330 1 I ,r ESTs; angio; diag 

444342; NM_014398; Hs.10887; similar to lysosome-associated; hepC, lung, fibro, blad, esoph; diag
444378; R41339; Hs.47860; neurotrophic tyrosine kinase, ; lung, glio; mAb+s.m.
444409; AI792140; Hs.49265; ESTs; angio; diag 
444444; AI149332; Hs.14855; ESTs; blad; diag 
444471; AB020684; Hs.11217; KIAA0877 protein; glio, lung, colon; mAb
444476; AF020038; Hs.11223; isocitrate dehydrogenase 1 (NA; blad; s.m.
444484; AK002126; Hs.11260; hypothetical protein FLJ11264; pros; diag

444633; AF111713; Hs.12284; junctional adhesion molecule 1; ovar, uter, breast, cerv, blad, headnk; mAb
444649; AW207523; Hs.371001; ESTs; blad; diag
444670; H58373; Hs.332938; hypothetical protein MGC5370; sarc; diag 

444764; Tt 911; Hs.1 1861; transmembrane 4 superfamily me; pane, omuc, stom, lung, colon; mAtns.m 
444809; BE207568; Hs.208219; oculospanin; mela; mAb 
444823, BE262980 II I2045 putative protein test; diag 
444863; AW384082; Hs.104879; serine (or cysteine) proteinas; mela; s.m.
444895; AI674383; l is 22891, solute carrier family 7 (c • . mAb+s.m 
444995; AJ272265; Hs.12230; secreted phosphoprotein 2, 24k; hepC, panc; diag

I 540; Hs.281 295; E 111 

445070; NM_000677; Hs.258; adenosine A3 receptor; glio, renal; mAb
445076; AI206888; Hs.154131; ESTs; test; diag
445084; H38914; Hs.250848; hypothetical protein FLJ14761; sarc; mAb
445093; AI207197; Hs.156905; ESTs; test; diag
445109; AF039916; Hs.12330; ectonucleoside triphosphate di; pros; s.m. 
445119; AF035121; Hs.12337; kinase insert domain receptor ; angio; mAb
445160; AI299144; Hs.101937; sine oculis homeobox (Drosophi; sarc; CTL+s.m.
445182; AW189787; Hs.361778; ESTs; blad; diag 
445247; AW274290; Hs.153997; ESTs; mela; diag 
445279, R419CU h 245 EST aigio diag 
445363; NM_005993; Hs.12570; tubulin-specific chaperone d; test; diag
33 445413.AA15 )42 II 1 2 77 ( G 147 protein pros, colon, uter, ovar, lung, pane ; diag 
445418, AW1 39 I 12717 ryptic gene pane d.ag 
445424; AB028945; Hs.12696; cortactin SH3 domain-binding p; pros; diag
445443; AV653835; Hs.295131; ESTs; lung; diag 
445654; X91247; Hs.13046; thioredoxin reductase 1; lung; s.m.
445684; AK001696; Hs.13109; Ran binding protein 11; angio; diag

445701; AF055561; Hs.13131; lymphocyte adaptor protein; angio; CTL+s.m.
445784; A1253155, He ID", EST me , a CTL+5m
445885; AI734009; Hs.127699; KIAA1603 protein; pros, fibro; diag
445900; AF070526; Hs.125036; Homo sapiens clone 24787 mRNA; renal, leuk; mAb

445911; AI985987; Hs.145S45; ESTs, Moderately similar to AL; blad; diag
445982; BE410233; Hs.13501; pescadillo (zebrafish) homolog; mela; diag
446057; AI420227; Hs.366053; Trp-p8 transient receptor pote; pros; mAb
446082; AI274139; Hs.156452; ESTs; blad; diag
446098 AW072215;H 1 ESTs ngo dag
446100; AW967109; Hs.13804; hypothetical protein dJ462023.; pros; diag
446102; AW1680ar- H 317 4: EST lung dag 
446113; AW967553 Hs 3 apt is mRNA for FU00083; test; mAb 

446269; AW263155; Hs.14559; hypothetical protein FLJ10540; lung, headnk; CTL+s.m.
446291; BE397753; Hs.14623; interferon, gamma-inducible pr; mela; diag
446292; AF081497; Hs.279682; Rh type C glycoprotein; lung, cerv; mAb

446293; AI420213; Hs.149722; LIM domain transcription facto; ovar, test; diag
446320; AF126245; Hs.14791; acyl-Coenzyme A dehydrogenase; pros; s.m.
446332; AK001635; Hs.14838; hypothetical protein FLJ10773; breast; diag
446342; BE298665; Hs.14846; solute carrier family 7 (catio; uter, colon, pros, mela; mAb
446428; AW082270; Hs.12496; ESTs, Weakly similar to ALU4 H; fibro; diag
446528, AU076640 h 1 524 nucleolai rotein 1 (1 kD) lung test diag 
146608; N75217; H I75622;E T t r fibro, diag 
446626; AW292180; Hs.156142; ESTs; pros; diag
446636; AC002563; Hs.15767; citron (rho-interacting, serin; lung; CTL+s.m.
446644; NM_003272; Hs.21065; transmembrane 7 superfamily me; mela; mAb
446673; NM_016361; Hs.15871; LPAP for lysophosphatidic acid; blad; diag
446727; AB011095; Hs.16032; KIAA0523 protein; angio; CTL+s.m.
446733; AA863360; Hs.26040; ESTs, Weakly similar to fatty ; breast; s.m.
446755; AW451473; Hs.16134; serine/threonine kinase 10; mela; CTL+s.m.
446791; AI632278; Hs.196922; ESTs; test; diag
446839; BE091926 H 1t,2!l otk | i> coiled-coil re; test; diag 
446856; AI814373; Hs.164175; ESTs; lung; diag 
446868; AV660737; Hs.348297; ESTs; pane; diag 
446872; X97058; Hs.16362; pyrimidinergic receptor P2Y, G; lung; mAb 
446932; AA961459; Hs.125644; ESTs; fibro; diag 
446967; AI699629; Hs.156781; ESTs; fibro; diag 
446979; AI654443; Hs.197683; ESTs; test; diag 

446984; AB020722; Hs.16714; Rho guanine exchange factor (G; angio; CTL+s.m.
446989; AK001898; Hs.16740; hypothetical protein FLJ11036; lung, headnk; diag
446998; N99013; Hs.278966; Homo sapiens mRNA; cDNA DKFZp5; panc, fibro; diag
446999; AA151520; Hs.351416; hypothetical protein MGC4435; headnk; diag
447004; AW296968; Hs.157539; FGENESH predicted secreted pro; glio; diag
447078; AW885727; Hs.9914; ESTs; lung; diag 

447126; AW150632 H« 1 ) 0i i a! lujr r^ nucleotide exchang; angio; diag 

447164; AF02S941; Hs.17513; vipirin; similar to inflamma; colon, lung, breast, stom, hepC, esoph, mela; diag
447178; AW594641; Hs.192417; ESTs; mela; diag
447188; H65423; Hs.17631; hypothetical protein DKFZp434E; test; diag
447210; AF035263 H i 7 - , p a idyl enne specific ph; pros, mela; s.m.
447289; AW247017; Hs.36978; melanoma antigen, family A, 3; lung, mela; mAb+CTL
447334; AA515032; Hs.91109; ESTs; blad; diag 

447343; AA256641; Hs.236894; ESTs, Highly similar to S02392; lung, blad, panc, headnk, mela; mAb+s.m.
447350; AI375572; Hs.172634; v-erb-a avian erythroblastic I; breast, ovar, uter; diag
447377; X77343; Hs.334334; transcription factor AP-2 alph; breast, lung, mela; CTL+s.m.
447395; AI418412; Hs.184793; Homo sapiens cDNA: FLJ21880 fi; panc; diag
447437; U07225; Hs.339; purinergic receptor P2Y, G-pro; blad; mAb
447499; AW262580; Hs.147674; protocadherin beta 16; pros, glio, ovar; mAb+s.m.
447532; AK000614; Hs.18791; hypothetical protein FLJ20607; lung, blad; CTL+s.m.
447534; AW953935; Hs.288655; ESTs; lung, test; diag 
447578; AA912347; Hs.136585; ESTs, Weakly similar to JC5314; ovar; s.m. 
447595; AW379130; Hs.18953; phosphodiesterase 9A; pros; CTL+s.m.
147004 'U 139933 H 301342 hypolh tie I proteir MGC4342 mela; diag
447636; Y10043; Hs.19114; high-mobility group (nonhiston; lung; CTL+s.m.
447733; AF157482; Hs.19400; MAD2 (mitotic arrest deficient; test; diag
447749; T53260; Hs.8297; ESTs; renal; diag
447761; AF061573; Hs.19492; protocadherin 8; EWS, glio; mAb
447768; X86400; Hs.19520; FXYD domain-containing ion tra; renal; mAb
447818; W79940; Hs.355279; Homo sapiens clone 24670 mRNA ; renal; diag
447835; AW591623; Hs.154129; ESTs, Weakly similar to I38022; renal, ovar, uter; diag
447881; BE620886; Hs.355279; GCN1 (general control of amino; renal; diag 
447937; AL109716; Hs.20034; Homo sapiens mRNA full length ; mela; mAb 

447993; AW139525; Hs.170362; ESTs; colon, blad, lung, ovar, pane, headnk; n . 

448030; N30714; Hs.325960; membrane-spanning 4-domains, s; panc, leuk, renal, stom, lung; mAb
448045; AJ297436; Hs.20166; prostate stem cell antigen; blad, panc, pros; mAb
448105; AW591433; Hs.298241; Transmembrane protease, serine; breast, panc, colon, lung, ovar, stom; mAb+diag+s.m.
448133; AA723157; Hs.73769; folate receptor 1 (adult); ovar, fibro; mAb
448140; AF146761.HS 20450 BCM lite n mbi neproteii ee fit m la leuk; mAb
448153; Y10805; Hs.20621; HMT1 (hnRNP methyltransferase,; test; CTL+s.m.
448181; AF272833; Hs.279763; hypothetical protein FLJ10504; test; diag
448204; AI475124; Hs.170S61; ESTs; sarc; diag
448231; AI701916; Hs.202509; ESTs; angio; diag

448258; BE386983; Hs.343214; hypothetical protein FLJ20396; mela, ovar; mAb
448262; AW880830; Hs.186273; ESTs; blad; diag
448275; BE514434; Hs.20830; kinesin-like 2; ovar, esoph, mela; diag
448278; W07369; Hs.11732; ESTs; lung; diag

448290; AK002107; Hs.20843; Homo sapiens cDNA FLJ11245 fis; pros; diag
448321; NM_005883; Hs.20912; adenomatous polyposis coli lik; glio; CTL+s.m.
448357; N20169; Hs.108923; RAB38, member RAS oncogene fam; lung, mela; diag
448410; AK000227; Hs.21126; hypothetical protein FLJ20220; mela; diag
448437; AW470125; Hs.220529; gb:xw60c04.x1 NCI_CGAP_Pan1 Ho; panc, colon; diag
448499; BE613280; Hs.77550; p53-regulated DDA3; gfc; diag

448569; BE382657; Hs.21486; signal transducer and activato; panc, headnk, fibro, cerv, mela, renal; CTL+s.m.

448588; AI970276; Hs.156905; KIAA1676; test; CTL+s.m. 

448595; AB014544; Hs.21572; KIAA0644 gene product; breast, glio; mAb

448664; AI879317; Hs.334691; splicing factor 3a, subunit 1,; mela; CTL+s.m.

448674; W31178; Hs.154140; ovary-specific acidic protein; angio; diag

448692; AW013907; Hs.167531; methylcrotonoyl-Coenzyme A car; pros, pros; s.m.

448706; AW291095; Hs.21814; interleukin 20 receptor, alpha; pro- ite, blad, colon; mAb

448719; AA033627; Hs.21858; trinucleotide repeat containin; mela, sarc; CTL+diag

448775; AB025237; Hs.388; nudix (nucleoside diphosphate ; test; diag

448811; AI590371; Hs.199460; ESTs; esoph, panc; mAb

448939; BE267795; Hs.22595; hypothetical protein FLJ10637; test; CTL+s.m.

448966; AW372914; Hs.86149; phosphoinositol 3-phosphate-bi; mela; CTL+s.m.

44B98 1968719; Hs.1 37; EST "J 

448988; Y09763; Hs.22785; gamma-aminobutyric acid (GABA); pros, cerv, colon, lung, stom, blad, headnk, ovar, breast; mA 
448993; AI471630; Hs.355952; KIAA0144 gene product; lung, blad; diag
448999; AF179274; Hs.22791; transmembrane protein with EGF; pros, glio; mAb
449003; X76342; Hs.389; alcohol dehydrogenase 7 (class; lung, headnk; s.m.
449019; AI949095; Hs.67776; ESTs, Weakly similar to T22341; blad, lung; diag
449027; AJ271216; Hs.22880; dipeptidylpeptidase III; blad, colon, ovar; s.m.
449040; NM_012191; Hs.22919; putative tumor suppressor; lung; CTL+s.m.
449078; AK001256; Hs.22975; KIAA1576 protein; mela; diag
449101; AA205847; Hs.23016; G protein-coupled receptor; lung, headnk; mAb
449109; AW270992; Hs.120949; ESTs, Weakly similar to ALU7_H; sarc; diag
449156; AF103907; Hs.171353; prostate cancer antigen 3, non; pros; mAb+CTL
449207; AL044222; Hs.23255; nucleoporin 155kD; lung; diag
449228; AJ403107; Hs.148590; protein related with psoriasis; lung; diag

449230; BE613348; Hs.356392; melanoma cell adhesion molecul; lung, cerv, headnk, blad, ovar, colon; mAb 
449317; AW293413; Hs.132906; 19A24 protein; mela; mAb

449318; AW236021; Hs.78531; Homo sapiens, Similar to RIKEN; headnk, lung, angio; CTL+s.m.
449322; AI638616; Hs.196566; ESTs; test; diag
449338; H73444; Hs.394; adrenomedullin; renal; diag 
449394, AA004368, 181 ). Horn sapien cDNAFU11550fis; angio; mAb 
449437; AI702038; Hs.100057; Homo sapiens cDNA: FLJ22902 fi; test; diag
449448; D60730; Hs.57471; ESTs; blad, lung, headnk, breast; diag
449467; AW205006; Hs.197042; ESTs; lung; diag
449494; AW237014; Hs.315369; aquaporin 4; fibro; diag
449569; AI656634; Hs.195389; ESTs; test; diag
449592; AI655494; Hs.195718; ESTs; panc; diag
449618; AI076459; Hs.15978; KIAA1272 protein; angio; diag
449625; NM_014253; Hs.349094; odz (odd Oz/ten-m, Drosophila); pros; diag
449650; AF055575; Hs.23838; calcium channel, voltage-depen; pros; mAb 
449680; AI033821; Hs.12160; ESTs; renal; diag 
449961; AW265634; Hs.133100; ESTs; glio, esoph, lung, blad; diag
449976; 1 106350; Hs.1350G6; Human DNA sequence from clone ; lung; diag
450096; AI682088; Hs.79375; single-minded (Drosophila) hom; pros; CTL
450098; W27249; Hs.8109; hypothetical protein FLJ21080; breast, lung, stom, uter; diag
450149; AW969781; Hs.132863; Zic family member 2 (odd-paire; sarc; CTL+s.m.
450152; AI138635; Hs.22968; intron of VEGFR; renal; diag
450377; AB033091; Hs.355925; KIAA1265 protein; w , colon; diag
450382; AA397658; Hs.60257; Homo sapiens cDNA FLJ13598 fis; pros; diag
450400; AI694722; Hs.279744; ESTs; panc; diag
450431; AW136797; Hs.266041; ESTs; test; diag
450451; AW591528; Hs.202072; ESTs; uter, endo; diag
450506; NM_004460; Hs.418; fibroblast activation protein,; panc, esoph; diag
450534; AI57018S; Hs.25132; KIAA0470 gene product; angio; CTL+s.m.
c I 81 K I 1 i 

450635; AW403954; Hs.25237; mesenchymal stem cell protein ; blad; mAb 
450642; R39773; Hs.7130; copine IV; pros; diag

450656; AA010539; Hs.18912; unnamed protein product; fibro, uter; CTL+s.m.
450663; H43540; Hs.25292; ribonuclease HI, large subunit; mela; s.m.
450676; AI147155; Hs.279727; ESTs; sarc; diag

450684; AA872605; Hs.25333; interleukin 1 receptor, type I; blad, lung, headnk; mAb
t i 6696;Hs.333418;FXYDdomain-coi aini .on tra mela diag 
150 « 50 I H ' 3 E T pn, er; diaq 
450719 AI09683( H 13 Ei jrly similar lo RB8B_H; test; diag 

450737; AW007152; Hs.63325; transmembrane protease, serine; colon, blad, lung, ovar, panc, headnk; mAb+diag+s.m.
450785; AA852713; Hs.108885; Homo sapiens, alpha-1 (VI) col; sarc; CTL+s.m.
450632; AW970G02; Hs.105421; ESTs; lung; diag 

451027; AW519204; Hs.40808; Homo sapiens, Similar to RIKEN; pros, uter, glio; diag 
451035; AU076785; Hs.430; plastin 1 (I isoform); panc; diag
' r v\ 21 , J 51309; ESTs: n or 

451099; R52795; Hs.25954; interleukin 13 receptor, alpha; glio, fibro, mela; mAb 

451106; BE382701; Hs.25960; N-MYC oncogene; test, ovar; CTL+s.m.

451110; AI955040; Hs.265398; PAR-6beta (partitioning def; breast, ovar, lung, colon; CTL+s.m.

451181; AI796330; Hs.207461; ESTs; panc; diag

451253; H482SS; Hs.26126; claudin 10; lung, ovar, panc; mAb

451291; R39288; Hs.6702; ESTs; lung; diag 

451295; AI55721 2; Hs 17132 E Ts Vr"W_py similar to 15; pane; diag 

451320; AW118072; Hs.350251; diacylglycerol kinase, zeta (1; lung; s.m.

451346; NM_006338; Hs.2S312; glioma amplified on chromosome; ovar; mAb

451386; AB029006; Hs.26334; spastic paraplegia 4 (autosomal; lung; diag

451398; AI793124; Hs.144479; ESTs; breast, ovar; diag

451411; AA017492; Hs.135655; EST; pros; diag

451497; H83294; Hs.234122; Wnt inhibitory factor-1; uter, fibro, pros, colon, sarc; diag

451541; BE279383; Hs.25557; plakophilin 3; lung, blad, ovar; diag 

451592; AI805416; Hs.213897; ESTs; lung, headnk; diag 

451635; AA018899; Hs.127179; cryptic gene; panc; diag

451663; AI872360; Hs.209293; ESTs; pros; diag

451720; AW970985; Hs.290853; ESTs; pros; diag

451743; AW074266; Hs.336428; ESTs; lung; diag

451820; AW058357; Hs.199248; ESTs; panc; mAb

451844; T61430; ; gb:yc06a03.s1 Stratagene lung ; blad; diag

451982; F13036; Hs.27373; Homo sapiens mRNA; cDNA DKFZp5; pros, blad; mAb

451999; AW176401; Hs.360623; DEAD/H (Asp-Glu-Ala-Asp/His) b; test; CTL+s.m.

452046; AB018345; Hs.27657; KIAA0802 protein; lung, uter; CTL+s.m.

452208; AA024792; Hs.31895; hypothetical prate n MG 409 al; diag 

452240.AI591147 I 1 M ?;ESTs;bl luna headnk, pane, cerv; diag 

4 24 AL35571 I 2 r c pr amn -I II death u°DCD9: breast; diag 

452244; N33530; Hs.176674; ESTs; mela; diag
452291; AF015592; Hs.28853; CDC7 (cell division cycle 7, S; test; CTL+s.m.
452295; BE379936; Hs.28866; programmed cell death 10; lung; diag 
452298; AI039243; Hs.278585; ESTs; angio; diag 

452304; AA025386; Hs.61311; ESTs, Weakly similar to S10590; lung, panc, blad, stom, esoph, fibro, colon; s.m.

452316; AA298484; Hs.61265; ESTs, Moderately similar to G7; blad; diag

452340; NM_002202; Hs.505; ISL1 transcription factor, LIM; panc, pros; CTL+s.m.

452353; C18825; Hs.29191; epithelial membrane protein 2; pros, breast; mAb

452355; N54926; Hs.29202; G protein-coupled receptor 34; glio, fibro, panc; mAb

452367; U71207; Hs.29279; eyes absent (Drosophila) homol; lung, pros, ovar, uter; CTL+s.m.

452416; AA026115; Hs.114777; ESTs; fibro; diag

452461; N78223; Hs.108106; transcription factor; blad, lung, headnk, ovar, glio, stom, colon, cerv; CTL+s.m.
452571; W31518; Hs.34665; ESTs; stom, lung, panc, colon, fibro; diag
452594; AU076405; Hs.29981; solute carrier family 26 (sulf; ovar; mAb
452613; AA461599; Hs.23459; ESTs; lung; diag

452679; Z42387; Hs.83883; transmembrane, prostate androg; pros, colon, panc, pros; mAb

452705; H49805; Hs.246005; ESTs; panc; diag

452717; AW160399; Hs.30376; hypothetical protein; pros; diag 

452721; AJ269529 ' 1871 1 m f n I, 7 ( ql/c pros; mAb 

452732; BE300078; Hs.80449; Homo sapiens, clone IMAGE:3535; blad; diag 

452744; AI267652; Hs.246107; Homo sapiens mRNA; cDNA DKFZp4; mela; diag 

452792; AB037765; Hs.30652; KIAA134- protein; pros, uter, breast; diag

452795; AW392555; Hs.18878; hypothetical protein FLJ21620; renal, headnk, colon, lung, panc; CTL

452796; AB011100; Hs.30656; KIAA0528 gene product; test; diag

452833; BE559681; Hs.30736; KIAA0124 protein; lung, mela; CTL+s.m.

452865; AI924046; Hs.119567; ESTs, Weakly similar to A47582; lung; diag

452899; M96739; Hs.30956; nescient helix loop helix 1; sarc; CTL+s.m.

452924; AW580939; Hs.97199; complement component C1q recep; angio; diag

452933; AW391423; Hs.288555; Homo sapiens cDNA: FLJ22425 fi; angio; CTL+s.m.

452934; AA581322; Hs.4213; hypothetical protein MGC16207; lung, blad; diag

452955; AW390282; Hs.31130; transmembrane 7 superfamily me; pros; mAb+s.m.

453006; AI362575; Hs.303171; ESTs; pros; diag 

453028; AB006S32; Hs.31442; RecQ protein-like 4; blad, lung, test; CTL+s.m. 

453085; AW954243; Hs.351573; KIAA02S1 protein; angio; diag

453096; AW294631; Hs.351270; ESTs; pros; diag

453102; NM_007197; Hs.31664; frizzled (Drosophila) homolog ; lung, headnk, colon; mAb

453107; NM_016113; Hs.279746; vanilloid receptor-like protei; mela; mAb

453134; AA032211; Hs.118493; ESTs; blad; diag

453142; AA033648; Hs.7473; Homo sapiens gap junction prot; fibro; mAb

453160 AI26330 H 1 JJ H2BI 1 lomt L, lung p n pro Jog 

453210; AL133161; Hs.32360; hypothetical protein FLJ10867; lung; CTL+s.m.

453216; AL137566; Hs.32405; progesterone receptor (PR); blad; mAb+s.m.

453256; AI565587; Hs.32556; KIAA0379 protein; mela; diag

453310; X70697; Hs.553; solute carrier family 6 (neuro; fibro; mAb

453321; AI984381; Hs.232521; ESTs; blad; diag

453323; AF034102; Hs.32951; solute carrier family 29 (nucl; ovar; CTL+s.m.

453331; AI240005; Hs.352537; ESTs; breast, lung, panc, esoph; mAb+diag+s.m.

453344; BE349075; Hs.44571; ESTs; mela; diag

453348; BE272318; Hs.8595; hypothetical protein FLJ12438; test; CTL+s.m.

453365; AA035211; Hs.17404; SOX7 SRY (sex determining regi; angio, blad; CTL+s.m. 

453370; AI470523 H 1 t rP-binding cass D fami p os mAb 

453389; BE273648; Hs.32963; cadherin 6, type 2, K-cadherin; renal, ovar, blad; mAb+s.m.

4 ) " H 2 4 T i rmining region Y); ovar, lung, glio CTL+s.m 

453459; BE047032; Hs.257789; ESTs; ovar, cerv, blad, uter, panc, angio, lung; diag

453464; AI884911 H 3 r ( n o mAb 

453633; AA357001; Hs.34045; hypothetical protein FLJ20764; lung, esoph, test; diag

453637; NM_002589; Hs.34073; BH-protocadherin (brain-heart); headnk; mAb

453642; AI370936; Hs.34074; dipeptidylpeptidase VI; glio; mAb

453779; N35187; Hs.43388; 28kD interferon responsive pro; mela; diag
453789.AA628517.il 11 502 ESTs angio diag 

453857; AL080235; Hs.35861; Ras-induced senescence 1 (RIS1; glio, lung, uter, headnk, cerv, panc, pros, sarc; mAb

453883; AI638516; Hs.347524; cofactor required for Sp1 tran; blad, lung; diag

453884; AA355925; Hs.35232; KIAA0186 gene product; lung, ovar, test, esoph; diag

453912; AL121031; Hs.356843; SWI/SNF related, matrix associ; mela; diag

453922; AF053306; Hs.36708; budding uninhibited by benzimi; colon, stom, lung, test; CTL+s.m. 

453935; AI633770; Hs.42572; ESTs; panc; diag

453941; U39817; Hs.36820; Bloom syndrome; lung, cerv, headnk; CTL+s.m. 

453964; AI961486; Hs.249196; ESTs; lung; diag 

453966; BE148734; Hs.63325; transmembrane protease, serine; colon, blad, lung, ovar, panc, headnk; mAb+diag+s.m.
453985; N44545; Hs.251855; ESTs; test; diag

454034; NM_000691; Hs.575; aldehyde dehydrogenase 3 famil; lung, headnk; s.m.

454042; H22570; Hs.47860; hypothetical protein FLJ20093; lung; diag

454066; X00356;Hs 37i it i le tt n i/i c'i nnir related ; lung; diag 

454071; AI041793; Hs.42502; ESTs; breast; diag

454077; AC005952; Hs.37062; Insulin-like 3 (Leydig cell); test; diag

454098; W27953; Hs.217493; Plakophilin; lung; diag

454117; BE410100; Hs.40368; adaptor-related protein comple; mela; CTL+s.m. 

454360; L78207; Hs.54470; ATP-bmdmq i s i i'i r gl 10 mAb 

454429; BE273437; Hs.301406; hypothetical protein PP3501; mela; mAb 

454439; AW819152; Hs.154320; DKFZP56601646 protein; lung; diag 

454478; AW805749; Hs.372783; superoxide dismutase 2, mitoch; mela; s.m.

455601; AI368680; Hs.816; SRY (sex determining region Y); lung, cerv, esoph; s.m. 

456034; AW450979; ; gb:UI-H-BI3-ala-a-12-0J 1 N; blad, fibro; diag

456062; AI866286; Hs.71962; ESTs, Weakly similar to B36298; fibro, ovar, uter; diag
ic; CTL+s.m.



456177; NM_012391; Hs.79414; pi 
456266; L29073; Hs.198726; cold i 
456321; NM_001327; Hs.87225; cancer/testis antigen; lung; CTL
456553; AA721325; Hs.189058; ESTs, Highly similar to Simila; panc; diag
456723; Z43902; Hs.4748; adenylate cyclase activating p; glio; mAb+s.m.
456736; AW248217; Hs.1619; achaete-scute complex (Drosoph; lung; diag
456759; BE259150; Hs.127792; delta (Drosophila)-like 3; glio, lung; mAb
456847; AI360456; Hs.86088; ESTs; test; diag
456938; X52509; Hs.161640; tyrosine aminotransferase; breast; s.m.
456977; AK000252; Hs.169758; hypothetical protein FU2024S; angio; diag
457200; U33749; Hs.197764; thyroid transcription factor 1; fibro; CTL+s.m.
457211; AW972565; Hs.32399; ESTs, Weakly similar to S51797; mela, pros; CTL+s.m.
457292; AI921270; Hs.281462; hypothetical protein FLJ14251; blad; mAb
457313; AF047002; Hs.241520; transcriptional coactivator; test; CTL+s.m.
457411, AW085961 H 13009 k -c'ass homeobox protei; breast, fibro; diag 
457465; AW301344; Hs.122908; DNA replication factor; test, mela; diag 
457498; AI732230; Hs.191737; ESTs; pros; diag 
457561; AA331517; Hs.286055; chimerin (chimaerin) 2; glio; mAb 
457590; AI612809; Hs.5378; hypothetical protein MGC10724; ovar; diag 
457869; AU077186; Hs.108885; Homo sapiens, alpha-1 (VI) col; sarc; CTL+s.m.

458124; AW005548; Hs.124590; ESTs; fibro; diag

458435; AI418718; Hs.144121; ESTs, Weakly similar to T46916; glio; diag

458471; AV648609; Hs.194240; ESTs; renal, panc, hepC; diag

458933; AI638429; Hs.24763; RAN binding protein 1; lung, test; diag 

459373; BE408266; Hs.301406; hypothetical protein PP3501; mela; mAb 

459578; AW612538; Hs.304491; EST; mela; diag 

459702; AI204995; ; gb:an03c03.x1 Stratagene schiz; blad, fibro; diag 

459705; BE082764; Hs.270252; ESTs, Weakly similar to androg; fibro; mAb+s.m. 
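
The rows above share a semicolon-delimited layout. As a minimal sketch only, assuming the fields are a probeset key, an accession, a UniGene ID, a truncated title, a comma-separated tissue list, and a "+"-joined list of application codes (these field names are hypothetical labels chosen for illustration, not taken from the table legend), a row could be parsed in Python as follows. Titles that themselves contain a semicolon are rejoined, and rows too damaged to yield six fields are returned as None rather than guessed at.

```python
from dataclasses import dataclass


@dataclass
class Row:
    # Field names are illustrative assumptions, not the patent's own column names.
    pkey: str
    accession: str
    unigene: str
    title: str
    tissues: list
    uses: list


def parse_row(line):
    """Parse one semicolon-delimited row; return None if it has fewer than six fields."""
    fields = [f.strip() for f in line.split(";")]
    if len(fields) < 6:
        return None
    pkey, accession, unigene = fields[:3]
    tissues, uses = fields[-2:]
    # Anything between the third field and the last two is treated as the title.
    title = "; ".join(fields[3:-2])
    return Row(
        pkey,
        accession,
        unigene,
        title,
        [t.strip() for t in tissues.split(",") if t.strip()],
        [u.strip() for u in uses.split("+") if u.strip()],
    )
```

For instance, the row for probeset 447499 would yield the tissue list pros, glio, ovar and the codes mAb and s.m. under these assumptions.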

TABLE 3B 




322975 
323332 
2 817 



1784158J 
245301 1 
887879J 



409051 107934J 



427260 
427298 
427521 



AA115173 AA075709 AA076354 AA083101 AA075396 AA085391 AA070B84 AA083368 AA075779 AA075221 AA076395 AA650486 AA083500
AA065143 AA065142
AI820546 AI821336 T61430

AF080229 AF080232 U87593 U87592 U87591 U87590 AI63B743 AI633818 AW206802 AI58371B AF080231 AF080234 AF080233 AL535594
AI818326 AF080230 S46404 AI970376 AA463992 AW555466 BF512210 U87595 U87589 BE550633 AI672574 BE467547 AI680833 AW614951
N29986 N25695 H69001 U87596 BE573974 AI797496 AI701526 AA703396 AW139734 H92278 N66048 BE219539 BE671665 AI624817 BE466611
AI206344 AA574397 BF593413 BG231271 BF773517 U87594 BF062180 BE466420 AI887798 BF674385 AA204735 AW496808 AA204833
AA207155 BI004756 AA206262 AI365204 H77608 AW590511
AA404418 AI217248

AA136653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE168945
AF147347 T55503 T55426 
C16391 C16413 

AI829520 AI791832 AI791823 AA229315 AA228414 AA229211
AA410943 BF366582 AA334202 AA332882 BF371899 AW948953
AL044891 AI908240 AA393080 AW748403 BE059341 BF330573
M18728 

AA075419 AA082953 AA080912 AA062835 AA071252 AA084926 AA078992 AA113913 AA081881 AA070343 AA083821 AA062836 AA113892
AA075318 AA076594 AA078900 AA134801 AA063293 AA083403 AW974305
AA070050 AA070823 AA063403

BI030997 AA921 874 AW1 88822 Bl 027862 AI34751 8 AI361 453 AI08S754 AW207491 AA077391 BG012775 BG997382 AA286833 AA1 50722 
BI007625 BI027864 BI009100 BI005275 BI006270 BI031000 BI029864 BI006277 BI007627 BI006266 BI006991 BI006990 BI007763 BI007762 
BG997377 AA150780 BI033518 BI027818 BG015789 BI033807 AA341445 

BE088101 T05990AW872477 



AA136653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE168945
AA210987 D57294 AA214584 AA207006 D56572
AA401424 AA400100 AA663848
AA933717 BF061897 AW628327 AA641788 AA400495

AW973352 BF222929 AW016853 BF059130 AIS51829 BE551757 AA558414 AI339359 BF059601 AI961162 AI341422 AI206248 AI206165



AW974271 AA592975 AA447312 AA884766



BG207209 BE166299 AI204995 BG199355 AW969908 AA528756 AW440776 BI044354 

BG036675 BF772005 BF771866 BG960386 BG960381 NM.005712 AF110315 BE074534 BE182776 BE158000 BE157999 BE714315 AW818104 
AW847519 AA099426 AW817981 AW856396 BG951 122 AA224498 M308542 AW821833 BF902155 AI732411 BG778834 BG283641 BE748279 
BE748870 BG319540 BE748864 BF739224 BG986155 AK057283 BI861466 AA663341 AA457591 BG949294 AW392886 M071122AA227849 
AA584918 BG959570 BF773486 AL041698 BF959013 R87170 C16859 BF770411 BF771298 AI075321 L13823 AA216700 BF771864 AW861859 
BE537068 C18935M155719BF771172BR69107BF804964AW818172AW818143AW392930AW817057AW858044BF746211 AA179928 
AW861 687 AW821826 BI055726 BF242643 AA2071 8Q BF77041 2 BF771 1 57 BG430030 M055592 

BG166382AW161086 N42363 BE935013 BE9349S8 BG291451 AV700520 BG152773 AI224956 AI079635 AW054706 AA843979 AI744193 F04060 
T23457 F04044 AA723859 AA977643 AA283764 AM 23609 N21561 BF055052 BE856661 AI804220 AA843394 AI472045 AI740490 AA578830 
H09495 AI283334 AA609495 AI1 22773 AW1 62643 AW1617Q8 BF940077 AI808825 AI360866 AI123189 R40236 R20726 AW975899 BE764052 
N31709 N31 708 AI031947 AW194138 

AF134164 BF809407 AA218567 BF842863 AI267168 BF876178 BG999253 AW861851 AW858362 AI817548 BF771300 AA113928 AA223422
AA055556 BF773400 BF998869 BE081333 BE073424 BE142245 H59571 H59570 BF871558 BF871064 BE001132 BF826831 AW754298
14701_1 AF147374 T59538 T59589 T59598 T59542

14739_1 AF147390 R76593 R76594

1242593_1 AW979074 AA834841 AA828650

2580163_1 AI926361 AA834879 AA828995

919640_1 AW978407 AA830149 M85983 AW503637 BR

49082_1 AL109688 R23665 R26578

1879911_1 AA868167 F21558 F31418 F35624

682245_1 BG403189 AI148521 Al

2327174_1 AI820546 AI821336 T61430

685586_1 AA136653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE168945

539529_1 BG207209 BE166299 AI204995 BG199355 AW969908 AA528756 AW440776 BI044354



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



325544 
32703C 
327075 

328700 

332798 






336034 
336152 
336636 



5867750 Plus 



Dunham, I. et al.
Dunham, I. et al.
Dunham, I. et al.
Dunham,
Dunham,
Dunham,
Dunham, I. et al.



1117061-1117304 

171228-171286 

319951-320040 

4041318-4041431

102461-102586 

764089-764203 

59158-59215 

Plus 



8217374-8217261 

12734365-12734269 

14308764-14308824 

21388250-21388146 

26310772-26310909 



27360474-27360400 

29014404-29014590 

30156053-30155870 

988418-989185 

7697068-7697236 

8092128-8092271 

11794465-11794343 



81488-81646 
16879-17023 

44116-44238,48208-48321



401424 
401486 
401704 
401732 
401747 
401760 
401780 
401781 
401785 
401797 
401994 
402145 
402199 
402230 
402239 



402424 
402447 
402474 



8516137 Minus 

9929642 Plus 

8176894 Plus 

7341763 Plus 

3097841 Plus 



7249190 
7249190 
7249190 
6730720 



32585-32756,36281-36540,40791-40933,4401

19346-19525,19625-19708,19897-19973,2006
118596-118816,119119-119244,119609-11976
83126-83250,85320-85540,94719-95287



83215-83435,83531-83656,83740-33901,3423
165776-165996,166189-166314,166408-16656 
6973-7118 

42904-43124,43211-43336,44607-44763,4519

113086-114800 

84187-84744 

29782-29932 

38175-38304,42133-42266

113765-113910,115653-115765,116808-11694

21059-21168 

40832-41362 



47605-47729,51696-51821,52070-52257,5330
7767907
2996643 
3540153 
3540153 



76423-76560 
54727-54901 
42137-42515 
4727-4969 

37-55859,56369-56511 



136688-137096 

169894-170193,170504-170806 

74502-74703 

120428-120703 



404029 
404049 
404210 
404240 



404866 
404877 
404927 
404996 
405001 



406081 
406137 
406173 
406348 
406350 
406399 
406434 



7770611 
7710671 
7671252 
3688074 



7249119 
7622517 
1054740 

1054740 
4914350 
4204244 



1414-1513,1624-1756 

101165-102597 

108716-111112 

75765-78155 

169926-170121 

116132-116407,116653-116922 

51086-51301 

73591-73723 

72053-72238 

80430-81581 

11743-11929

1095-2107 



37999-38145,38652-38998,39727-39872,4055 
105267-105343,106184-106294,106387-10653



51728-51836 

144345-144464,144690-144835,151750-15188 
145949-146227 

118677-118807,119091-119296,121626-12182
124010-124183 

124361-124520,124914-125050 



138842-139051 
61057-62075 
17705-18287 



71754-71944 
7513-7673 
63448-63554 
17803-17931 
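
The nucleotide-position entries in the table above are comma-separated start-end ranges, one per predicted exon. The following Python sketch shows one way such an entry might be read, under two assumptions that the source does not state: that coordinates are inclusive, and that a range whose start exceeds its end simply runs in the reverse direction. The function names are hypothetical. Entries that were truncated or fused during extraction are skipped rather than repaired.

```python
def parse_exons(nt_position):
    """Parse 'start-end,start-end,...' into a list of (start, end) integer pairs."""
    exons = []
    for part in nt_position.split(","):
        start, sep, end = part.strip().partition("-")
        # Skip damaged pieces: no hyphen, or non-numeric text on either side.
        if not sep or not start.isdigit() or not end.isdigit():
            continue
        exons.append((int(start), int(end)))
    return exons


def exon_length(start, end):
    """Length of a range, assuming inclusive coordinates in either orientation."""
    return abs(end - start) + 1
```

Under these assumptions, the entry 169894-170193,170504-170806 describes two exons of 300 and 303 nucleotides.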



Table 4A lists about 425 genes up-regulated in breast cancer compared to normal adult tissues and to non-malignant breast tissues. These genes were selected from a starting
collection of about 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip® array. The "average" breast cancer level was set to the 90th percentile value amongst breast cancer
specimens. The "average" normal adult tissue level was set to the 75th percentile value amongst non-malignant adult tissues. Certain predicted protein domains are noted.
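
As an illustration of the selection statistic described above, the following Python sketch computes the ratio of the 90th percentile of tumor expression values to the 75th percentile of normal tissue values for a single probeset. The text does not say how percentiles were interpolated, so linear interpolation between ranked values is an assumption here, and the function names are hypothetical. A zero normal-tissue percentile would need separate handling.

```python
def percentile(values, p):
    """Percentile p (0 to 100) of a non-empty list, using linear interpolation (an assumption)."""
    xs = sorted(values)
    k = (len(xs) - 1) * p / 100.0
    lo = int(k)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (k - lo)


def r1_ratio(tumor, normal):
    """Ratio of the tumor 90th percentile to the normal-tissue 75th percentile."""
    return percentile(tumor, 90) / percentile(normal, 75)
```

A probeset would then be a candidate for the table when this ratio is high; the cutoff actually used is not given in this section.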

Table 5A lists about 231 genes up-regulated in breast cancer compared to normal adult tissues and to non-malignant breast tissues that are likely to encode proteins amenable to
modulation by small molecules, peptides, or antibodies. These were selected similarly as for Table 1. The "average" breast cancer level was set to the 90th percentile value amongst
breast cancer specimens. The "average" normal adult tissue level was set to the 50th percentile value amongst non-malignant adult tissues. The products of these genes often
contain one or more domains indicative of having oncogenic function, of transducing intracellular signals, or of being modulatable by small molecules, peptides, or antibodies (e.g.,
pkinase, death-domain, 7tm, phosphatase, or ion_transporter). Certain predicted protein domains are noted.

TABLE 4A: ABOUT 425 GENES UP-REGULATED IN BREAST CANCER COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar accession number, GenBank accession number

UniGeneID: UniGene number

Pred.Prot.Domains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M,

likely to contain; =N, less likely to contain. All other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280).

UniGene Title: UniGene gene title

R1 : Ratio of 90th percentile of tumor to 75th percentile of normal body tissue 
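
The predicted-domain column described in the legend above packs a comma-separated list of PFAM domain names together with TM and SS flags, separated by semicolons (for example, ig,pkinase;TM=Y;SS=M). A minimal Python sketch of how such an entry might be unpacked follows. The function name and the returned structure are hypothetical, and entries truncated in the table (such as a trailing "TM=" with no value) are reported with a missing flag value rather than completed.

```python
def parse_domains(entry):
    """Split 'dom1,dom2;TM=Y;SS=M' into (list of domain names, dict of TM/SS flags)."""
    domains, flags = [], {}
    for part in (p.strip() for p in entry.split(";")):
        if not part:
            continue
        key, sep, value = part.partition("=")
        if sep and key in ("TM", "SS"):
            # Y = very likely, M = likely, N = less likely; None if the value was cut off.
            flags[key] = value or None
        else:
            domains.extend(d.strip() for d in part.split(",") if d.strip())
    return domains, flags
```

An entry that carries only flags, such as TM=Y;SS=N, yields an empty domain list under this sketch.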

Pkey ExAccn UniGeneID Pred.Prot.Domains UniGeneTitle R1
409340
428471 
417931 
447966 

421814 
406867 
426104 



421552 
409453 
409632 
417515 






446051 
451982 
450334 
431890 
434449 



AA547959 Hs.115838

BE174629 Hs.321130

X57348 Hs.184510

W95642 Hs.82961

AA340605 Hs.105887

L12350 Hs.108623

AA157857 Hs.182265

AI204418 Hs.190080

AW391972 Hs.104696

AU076442 Hs.117938

AB011155 Hs.170290

AA133749 Hs.301350

AY007220 Hs.288998 
AA216387 

AA130986 Hs.271627

BE260964 Hs.82045

U33635 Hs.90572

AF026692 Hs.105700

AI885516 Hs.95612

L24203 Hs.82237

AB029000 Hs.70823

BE389014 Hs.372548

X58968 Hs.111301

AB024536 Hs.102171

AI088063 Hs.7882 

AU077196 Hs.82985 

AL157488 Hs.50150 

M36803 Hs.346935 

M97639 Hs.155585

BE546947 Hs.44276 

AI949095 Hs.67776 

AA351978 Hs.4943 

U24389 Hs.65436 

AI472111 Hs.278694 

AA147026 Hs.76704 

AI701162 Hs.331904 

BE502341 Hs.3402 

AA456454 Hs.355702

NM_005864 Hs.24587

AB040935 Hs.23954 

AI917901 Hs.208641 

AI964074 Hs.225838 

AI733848 Hs.71935 



anterior gradient 2 (Xenopus laevis
Homo sapiens similar to Echinoidin
melanophilin (MLPH), mRNA



i AJ243706 i 

i NM.000088 h 

i AA688292 h 

i M1 57857 Y 

i AC007954 h 

i AI612027 i 



,143323 



414822 
452683 
444784 
453857 
413859 
440369 
418140 
441384 
424464 
423582 
432562 
433320 
429165 
416984 



' AW451711 



Hs.324051 
Hs.21355 
ls.313760 



NM_006207 Hs.170040

AF254260 Hs.283009 

BE048061 Hs.37054 

F13036 Hs.27373 

AF035959 Hs.24879 

X17033 Hs.271986 

AW953484 Hs.3849 

BE410590 Hs.119257



SH3,PDZ,Guanylate_kin;TM= 

ATP1G1_PLM_MAT8;TM=Y;SS=M 

S_100;TM=M;SS=N 



PTN_MK;TM=M;SS=Y 

ig,pkinase;TM=Y;SS=M 

Fz,NTR;TM=N;SS=M 

cadherin,cadherin,Cadheri 

serpin;TM=N;SS=N 

zf-B_box,zf-UBR1;TM=M;SS= 



KIAA1324 protein
collagen, type XVII, alpha 1
discs, large (Drosophila) homolog 5
FXYD domain-containing ion transpor
S100-type calcium binding protein A
gb:nc15b02.s1 NCI_CGAP_Pr1 Homo sap



ngf 



SH2,none 
fn2,hemopexin,Peptidase_M 
ig,LRR,LRRNT,LRRCT;TM=M;S 

w»cCollagen,COLFI;TM=N;S 

hemopexin;TM=N;SS=M 
ig,kringlo,pkinase.Fz;TM= 
homeobox;TM=M;SS=N 

MAGE,Cys_knot,EGF,laminin 
LysyLoxidase;TM=N;SS=M 

PMP22_Claudin > PMP22_Claud 



EGFMfibrinogen C.toxi 
zf-C2H2;TM=M;SS=N 
PHD^RIDjmjCjmjN zF-CSl I 
vwc,Collagen,COLFI;TM=M;S
hormone_rec,zf-C4
filament,bZIP;TM=N;SS=M

TB2_DP1_HVA22;TM=Y;SS=M
efhand,efhand

ig,Ribosomal_S19;TM=M;SS= 



PTK7prt . 
secreted frizzled-related protein 4 
ESTs 

serine (or cysteine) proteinase inh 
ataxia-telangi
KIAA1077 protein



D12485 Hs.11951



AA447849 Hs.288660

R68537 Hs.17962 

BE000831 Hs.23837 

BE531048 Hs.278422 

D60647 Hs.250879 

AW009886 Hs.118258



Somatomedin_B,Endonucleas 

TM=Y;SS=M 

SAM_PNT,none



TGFb_propeptide,TGF-beta,



Flavodoxin_2;TM=M;SS=N 

rrm,zf-RanBP,pkinase,GST_- 

serpin;TM=M;SS=M 

TM=Y;SS=N 

SH3,ank;TM=M;SS=N 

pkinase,DCX;TM=M;SS=N 

homeobox,none 

ig;TM=N;SS=M 

bZIP;TM=M;SS=N 

Ephrin,A_deamin,dsrm,z-al 

NA;NA 

PAP2;TM=Y;SS=M
vwa,integrin_A,FG-GAP;TM=
efhand,FKBP;TM=M;SS=N
SH3,HS1_rep;TM=M;SS=N 



CD209 antigen 
ESTs 

hypothetical protein MGC11138



putative zinc finger protein from E 
putative DNA/chromatin binding moti 
collagen, type I, alpha 1 



hypothetical protein DKFZp434P0111
Homo sapiens, clone MGC:9381, mRNA, 
DKFZP434I1735 protein 
Thy-1 cell surface antigen 
ESTs 

progesterone membrane binding prote 
ectonucleotide pyrophosphatase/phos
Ras-induced senescence 1 (RIS1) 
Homo sapiens pyruvate dehydrogenase 



Homo sapiens cDNA FLJ11812 fis, clo
DKFZP586G1122 protein
ESTs, Highly similar to CTXN RAT CO 
prostate cancer associated protein 
diaphorase (NADH/NADPH) (cytochrome 
myosin VI 

serine (or cysteine) proteinase inh 
ENSP00000224716*:GTP-binding protei
RelA-associated inhibitor 
doublecortin and CaM kinase-like 1 
ESTs, Weakly similar to I38022 hypo 
platelet-derived growth factor rece



Homo sapiens mRNA; cDNA DKFZp564017 
phosphatidic acid phosphatase type
integrin, alpha 2 (CD49B, alpha 2 s
hypothetical protein FLJ22041 simil
ems1 sequence (mammary tumor and sq
Hs.130816
Hs.32366 
Hs.25318 
Hs.83942 



423057 AW961597 

452063 R53135 

450680 AF131784 

418283 S79895 

416361 AW204907 



408113 T82427 Hs.194101 

407792 AI077715 Hs.39384 

422765 AW409701 Hs.1578 

429359 W00482 Hs.2399 

442572 AI001922 Hs.135121 

448826 AI580252 

419548 T73551 

421485 AA243499 

440273 AI805392 

417353 AW129357 

451277 AK001123 

421823 N40850 Hs.28625 

452239 AW379378 Hs.356289 

444285 AI625304 Hs.201008

451541 BE279383 Hs.26557 

451304 M92642 Hs.26208 

429556 AW139399 Hs.314807 

441094 U33819 Hs.7647 

407788 BE514982 Hs.38991 

451292 AB037716 Hs.26204 

437762 T78028 Hs.154679

433399 N46406 Hs.84700 

408056 AA312329 Hs.42331 

404578 AF183810 Hs.26102 



Peptidase_C1;TM=N;SS=M 

GST_C,GST_N;TM=M;SS=N 
7tm_3,none
TM=M;SS=Y 
BIR;TM=M;SS=N 



Hs.91877 
Hs.104800 
Hs.325335 
Hs.329700 
Hs.26176 



445084 H38914 

453880 AI803166 

424125 M31669 

437377 AL359573 

422562 AI952060 

422320 AI745249 

433078 AW015188 

411894 M57609 

425976 C75094 

418113 AI272141 

418753 BE217818

452679 Z42387 

421030 AW161357

431567 N51357 



435818 AA700553 

420005 AW271106 

410867 X63556 

402531 AB037745 



443933 AI091631 

432952 AA813887 

424036 AA770688 

453828 AW970960 

407112 AA070801 

445669 AI570830

446091 AW022192 

424651 AI493206 

409178 BE393948 

417059 AL037672 

431194 D43704 

430397 AI924533 

418969 W33191 



418840 AI821614 

433573 AF234887 

433430 AI863735 

422491 AA338548

435114 AA775483 

416899 BE262645 

422110 AI376736 



Hs.135121 

Hs.1735 

Hs.124940 

Hs.118397

Hs.23650

Hs.121575

Hs.72916

Hs.83484 

Hs.87016 

Hs.83883 

Hs.101174

Hs.260855 

Hs.2025 

Hs.92679 

Hs.21851 

Hs.12079 

Hs.277728 

Hs.181131 

Hs.368614 

Hs.133294

Hs.750 

Hs.104696 

Hs.22891 

Hs.105689

Hs.203845

Hs.188173

Hs.348495 

Hs.293821 

Hs.51615 

Hs.174870 

Hs.200197

Hs.120785 

Hs.50915 

Hs.81071 

Hs.250712 

Hs.105607 

Hs.28907 

Hs.177556

Hs.137569

Hs.185831

Hs.57652 

Hs.369982 

Hs.117546

Hs.288936 

Hs.80420 

Hs.121555



414945 BE076358 Hs.77667 
422119 AI277829 Hs.111862
447335 BE617695 Hs.286192 



zf-C2H2,LIM,PHD,TFIIS;TM= 
efhand,S_100,S_100,efhand
SH3;TM=M;SS=N 

START;TM=M;SS=N 

Ephrin;TM=M;SS=M 

zf-C2H2,r j I" V 

serpin;TM=M;SS=M

TM=Y;SS=M 

HSP70,none 

TGF-beta,TGFb_propeptide,
ras;TM=M;SS=N
Zn_carbOpept,F5_F8_type_C



TM=Y;SS 

zf-C2H2;TM=N;SS=M

voltage_CLC;TM=Y;SS=M 

HMG_box,homeobox;TM=M;SS=

TM=Y;SS=M 

tubulin-binding;TM=N;SS=M 
TM=M;SS=N 

TGF-beta,TGFb_propeptide; 



EGF,TB,wnt,EB,TIL;TM=N;SS 
TM=M;SS=M 

aa_permeases;TM=Y;SS=M 



histone,CBFD_NFYB_HMF;TM= 

Pep_M12B_propep,Reprolysi 

hormone_rec,zf-C4



Ca_channel_B,RepB_protein 

HCO3_cotransp;TM=Y;SS=N

SH3;TM=M;SS=N 

MAGE;TM=M;SS=N 

SAM,P53;TM=M;SS=N



zf-RanBP,MDM2,Ndr 

UPAR_LY6;TM=M;SS=M 

WD40;TM=M;SS=N 



ESTs, Moderately similar to I38022 
ESTs, Weakly similar to TWST_HUMAN
Homo sapiens clone 25194 mRNA seque 



ESTs, Weakly similar to CA13_HUMAN




to putative p1 
onsive SPOT14 ( 
hypothetical protein FLJ10134
Homo sapiens cDNA: FLJ23523 fis, cl
ESTs

hypothetical protein FLJ10261
ESTs

protein tyrosine phosphatase, recep



MYC-associated zinc finger protein
S100 calcium-binding protein A2
KIAA1295 protein 
synaptotagmin I 

similar to phosphatidylcholine tran 
ephrin-A4 

trichorhinophalangeal syndrome I gene 
serine (or cysteine) proteinase inh 
hypothetical protein FLJ14761
ESTs, Weakly similar to I38022 hypo
inhibin, beta B (activin AB beta po
GTP-binding protein
AE-binding protein 1
ESTs, Weakly similar to AAB47496 NG
Homo sapiens cDNA FLJ12231 fis, clo
GLI-Kruppel family member GLI3 (Gre



Homo sapiens cDNA: FLJ21410 fis, cl

^mo^^mewkmU mRNA s 
Homo sapiens cDNA FLJ12900 fis, clo



SEC14 (S. cerevisiae)-like 2



fibrillin 1 (Marfan syndrome) 

KIAA1324 protein

solute carrier family 7 (cationic a

MSTP031 protein 

two pore potassium channel KT3.3 

Homo sapiens cDNA FLJ12187 fis, clo

H2A histone family, member L 



ESTs, Weakly similar to ALU7_HUMAN




bicarbonate transporter related pro 
hypothetical protein FLJ20258
melanoma antigen, family D, 1
tumor protein 63 kDa with strong ho
ESTs 

cadherin, EGF LAG seven-pass G-type 



hypothetical protein FLJ20940
419235
445033 
404394 
452222 



H43540 Hs.25292 

AW021102 Hs.21509 

AC005954 Hs.25527 

AW748482 Hs.77873 

BE159253 Hs.300638 

AI656707 Hs.48713 

AW470411 Hs.288433 

AV652402 Hs.72901 

AF332975 Hs.307004 

AW806287 Hs.21432 

Y13620 Hs.122607 

AW006352 Hs.159643 

AW245741 Hs.58461 

D59505 Hs.351344 

AA788946 Hs.101302

AW162667 Hs.106857 

AA418160 Hs.86043 

NM_001982 Hs.199067

AI498072 Hs.351474 



RNase_HII;TM=N;SS=M 
zf-C2H2,none 

PDZ,Guanylate_kin;TM=N;SS
ig;TM=Y;SS=M 



ank;TM=N;SS=N 
EGF,fn1,' 

Sema,TIG,PSI,GDI 
TM=M;SS=N 

zf-C2H2,TFIIS,KRAB;TM=M;S 
iq,SET,PHD,zf-CXXC,Adap_c
fn3,vwa,Collagen,TSPN;TM=



412703 AW984744 

427871 AW992405 Hs.352406 

444273 AI903474 Hs.230 

434936 AI285970 Hs.183817 

457869 AU077186 Hs.108885 

422575 AK000546 Hs.118552

428343 AL043021 Hs.12705 

426716 NM_006379 Hs.171921

423778 Y09267 Hs.132821

451558 NM_001089 Hs.26630 

407926 AW956382 Hs.59771 

447041 AL135480 Hs.250705

419073 AW372170 Hs.183918 

446945 AI193115 Hs.16611

416322 BE019494 Hs.79217 

447347 AA570056 Hs.122730

448984 AW751955 Hs.22753 

421778 AA428000 Hs.283072 

423363 BE544348 Hs.127562 

432545 X52486 Hs.3041 

408495 W68796 Hs.237731 

406851 AA609784 Hs.352392 

418736 T18979 Hs.87908 

410197 NM_005518 Hs.59889 

453597 BE281130 Hs.33713 

417259 AW903838 Hs.81800 

453985 N44545 Hs.251865 

412634 U55984 Hs.356531 

407204 R41933 Hs.140237 

444371 BE540274 Hs.239 

447334 AA515032 Hs.91109

426530 U24578 Hs.278625 

419749 X73608 Hs.93029 

423595 R82826 Hs.220702 

406673 M34996 Hs.198253 
434241 AF119913

412490 AW803564 Hs.288850 

452277 AL049013 Hs.28783 

431457 NM_012211 Hs.256297 

421777 BE562088 Hs.108196 

453082 H18835 Hs.31608 

414085 AA114016 Hs.75746

440300 N39760 Hs.8859 

400290 H18836 Hs.31608 

433339 AF019226 Hs.8036 

419301 AA236166 Hs.132957

414792 BE314949 Hs.87128 

451428 AW083384 Hs.11067

432210 AI567421 Hs.273330 

452242 R50956 Hs.159993

450676 AI147155 Hs.279727 
413014 AW250533 Hs.75139 
427919 AA173942 Hs.326416 
424005 AB033041 Hs.137507
422072 AB018255 Hs.111138
440995 T57773 Hs.10263
426150 NM_003658 Hs.167218
416877 BE386266 Hs.85658
452191 AU076408 Hs.28309
450273 AW296454 Hs.24743
456177 NM_012391 Hs.79414
423062 NM_003655 Hs.5637
421848 X15880 Hs.108885 



Furin-like,pkinase,Recep_ 

ank,pkinase,UPF0073;TM=N;

zf-C3HC4,AIP3;TM=M;SS=N 

TM=M;SS=N 

LRR,LRRNT;TM=M;SS=M 
UCH-2 

vwa,Collagen;TM=M;SS=M 

PTR2;TM=Y;SS=M 

WD40;TM=N;SS=M 

ig,Sema,PSI;TM=N;SS=M 

FMO-like,pyr_redox;TM=Y;S

ABC_tran,SRP54;TM=Y;SS=M 

TYA;TM=N;SS=M 

death,ZU5;TM=N;SS=M
TM=M;SS=N 

P5CR,NAD_Gly3P_dh,Octopin



ig,MHC_II_beta;TM=M;SS=Y
helicase_C,AT_hook,SNF2_N
HMG_CoA_synt;TM=N;SS=N
KH-domain,Ribosomal_S3_C,
EGF,ig,lectin_c,sushi,Xli
PH,none 



Fork_head;TM=M;SS=N 



ion_trans;TM=Y;SS=M
aldedh;TM=N;SS=M
TM=M;SS=N
Cys_knot

ras,arf;TM=M;SS=N

TM=Y;SS=M 

TM=Y;SS=M 

EGF,kazal,laminin_EGF,lam



ESTs 

neurotrimin 

cyclin-dependent kinase inhibitor 2

Zonadhesin 

SEX gene 

B-cell CLL/lymphoma 9
ESTs, Weakly similar to T32554 hypo
ESTs, Weakly similar to A35659 krue
ESTs, Weakly similar to K1CI_HUMAN
ESTs, Moderately similar to CA1C_RA
calbindin 2, (29kD, calretinin)
Homo sapiens cDNA FLJ13558 fis, clo
v-erb-b2 avian erythroblastic leuke
membrane-associated tyrosine- and t
95kDa retinoblastoma protein bindi
gb:RC1-HN0015-040400-011J03HN0015
Homo sapiens, clone IMAGE:3507281,



ATP-binding cassette, sub-family A
ESTs 

Homo sapiens cDNA FLJ11685 fis, clo
Homo sapiens cDNA FLJ12797 fis, clo
tumor protein D52-like 1
pyrroline-5-carboxylate reductase 1
ESTs, Moderately similar to KIAA121
hypothetical protein FLJ22318
actin related protein 2/3 complex,




heat shock 90kD protein 1, alpha
ESTs, Weakly similar to ALU1_HUMAN
forkhead box M1 



Homo sapiens PRO3077 mRNA, complete 
Homo sapiens cDNA: FLJ22528 fis, cl
KIAA1223 protein
integrin, alpha 11
HSPC037 protein
hypothetical protein FLJ20041
aldehyde dehydrogenase 1 family, me
Homo sapiens, Similar to RIKEN cDNA
hypothetical protein FLJ20041
glioblastoma overexpressed
tenomodulin protein
hypothetical protein FLJ23309
ESTs, Highly similar to T46395 hypo
Homo sapiens, clone IMAGE:3544662,



CTF_NFI,none
TM=Y;SS=N 
RhoGAP;TM=M;SS=N 

homeobox;TM=N;SS=M 
zf-C2H2;TM=M;SS=N 
UDPG_MGDP_dh,UDPG_MGDP_dt 

Ets,SAM_PNT;TM=M;SS=N

chromo;TM=N;SS=M 

vwa,Collagen;TM=M;SS=M 



partner of RAC1 (arfaptin 2) 
lorn apier 1 N/ OHi DKFZp IH19 
vang (van gogh, Drosophila)-like 2 
KIAA0712 gene product
ESTs

BarH-like homeobox 2

hypothetical protein FLJ23436

UDP-glucose dehydrogenase

hypothetical protein FLJ20171

prostate epithelium-specific Ets tr 

ESTs 

collagen, type VI, alpha 1 
433577 AW007080 Hs.284192

409636 AA305729 Hs.18272

404730 AA486704 Hs.33287

422940 BE077458 

410001 AB041036 Hs.57771 

427461 AA531527 Hs.332040

453468 W00712 Hs.32990 

443807 W52930 Hs.9822 



412755 
429690 
423472 






BE144306 Hs.179891 

AW956329 Hs.23721 

AF041260 Hs.129057

BE269041 Hs.140452

AW976035 Hs.292396 

437464 AA323296 Hs.97837 

433592 NM_004642 Hs.3436

434931 AW968941 Hs.166254

451691 AI809278 Hs.208152 

430433 AA478883 Hs.273766 

429343 AK000785 Hs.199480

450835 BE262773 Hs.25584 

414591 AI888490 Hs.248107 

452579 AA131657 Hs.23830



449933 AW157098 Hs.324104



434182 W20309 

422737 M26939 

408202 AA227710 

424971 AA479005 

407869 AI827976 

444734 NM_001360

426991 AK001536 

414081 AW969976 

408795 AW749126 

452345 AA293279 

437879 BE262082 

407872 AB039723 

427289 AI097346 

432375 BE536069 

429415 NM_002593

412774 AA120865 

445942 T80334 

439456 AI752409 

414774 X02419 

433336 AF017986 

439905 AW799755 

420251 AW374968 

413004 T35901 

418686 Z36830 

410279 BE271977 

424391 BE550112 

440409 AW294316 

452689 F33868 

418154 BE165866

434384 AA631910 

413436 AF238083 

431663 NM_016569

432874 W94322 

436252 AI539519 

421044 AF061871 

419102 AA234098 

419359 AL043202 

441859 AW194364 



Aa_trans;TM=Y;SS=N

CTF_NFI,none 

Sec7,PH,ANF_receptor,lig_ 

trypsin;TM=M;SS=M 

TM=Y;SS=M 

TM=M;SS=N 

HAT;TM=N;SS=M 



413076 U10564 Hs.75188 

407874 AI766311 Hs.289047 

448019 AW947164 Hs.195641 

427024 AA397572 Hs.348902

410281 AF076612 Hs.166186 

447205 BE617015 Hs.11006

434433 AWB29759 

439737 AI751438 Hs.41271 

450157 AW961576 Hs.60178 

445989 H97754 Hs.11108

442213 N36110 Hs.305971 



perilipin;TM=N;SS=M 
Frizzled,Fz 

TM=M;SS=N 



CN_hydrolase

vwc,Collagen,COLFI;TM=M;S 



Hs.24391 

Hs.11806 

Hs.214410

Hs.365706 

Hs.170345 

Hs.29173 

Hs.5894 

Hs.40735 

Hs.323878 

Hs.2962 

Hs.202097 

Hs.380149 

Hs.13479

Hs.109314 

Hs.77274 

Hs.31386 

Hs.110953

Hs.379829 

Hs.75117 

Hs.87268 



Hs.284176 
Hs.352403 
Hs.370133 
Hs.68061 
Hs.267182 
Hs.279651
Hs.142827 
Hs.101302 
Hs.42424 
Hs.90073 
Hs.9877 



fn3,Furin-like,pkinase,Re

KOW.Ribosomal 846,34,^; 

DUF176,efhand;TM=M;SS=N

TM=Y;SS=M 

TM=M;SS=N

Collagen,COLFI;TM=N;SS=M

OLF;TM=M;SS=N

PH;TM=M;SS=N 

efhand;TM=M;SS=N 

ERG4_ERG24;TM=Y;SS=M 



gla;TM=N;SS=Y 



DSPc;TM=M;SS=N 
TM=N;SS=N 

Fz,Frizzled,7tm_2,DUF81;T
aminotran_5,SDF,none 
efhand,S_100;TM=N;SS=M 
CUB,NTR;TM=N;SS=M 



TM=M;SS=N 
zf-C2H2;TM=N;SS=M 
kringle,trypsin,plantjhi 
Fz,NTR;TM=N;SS=M 
HLH;TM=M;SS=N

TM=M;SS=N 

annexin;TM=M;SS=N 

ras;TM=M;SS=N 



DAGKc;TM=M;SS=N 
T-box;TM=M;SS=N 
SH3;TM=M;SS=Y 

fn3,vwa,Collagen,TSPN;TM= 

CAS_CSE1;TM=M;SS=N 

Amino_oxidase,FAD_binding 

Collagen,C4,VPR;TM=N;SS=M

pkinase;TM=M;SS=N 

COQ7 



amino acid transporter system A1 
nuclear factor I/B

gb:RC1-BT0606-090500-015-b04 BT0606
hypothetical protein MGC13010



ESTs, Weakly similar to P4HA_HUMAN



ESTs, Weakly similar to A47582 B-ce 
Homo sapiens mRNA; cDNA DKFZp547J04 
deleted in oral cancer (mouse, homo 
hypothetical protein DKFZp566I133



Homo sapiens, Similar to epsin 3, c 

hypothetical protein FLJ10767

ESTs, Weakly similar to ALU8_HUMAN



collagen, type I, alpha 1 
collagen, type XIV, alpha 1 (unduli 
progesterone membrane binding prote
ESTs

insulin-like growth factor 1 recept
hypothetical protein FLJ22678
Human DNA sequence from clone RP1-6
ESTs, Weakly similar to T23273 hypo
G-protein gamma-12 subunit
collagen, type III, alpha 1 (Ehlers
DKFZP586L151 protein
tumor suppressing subtransferable c
hypothetical protein FLJ13612
7-dehydrocholesterol reductase
Homo sapiens cDNA FLJ10674 fis, clo
matrix Gla protein
hypothetical protein FLJ13710
hypothetical protein FLJ20515
hypothetical protein FLJ10305
frizzled (Drosophila) homolog 3

S100 calcium-binding protein P



secreted frizzled-related protein 2
retinoic acid induced 1

Human DNA sequence from clone RP5-1



nuclear receptor subfamily 1, group
ESTs 

sphingosine kinase 1 

TBX3-iso protein 

melanoma inhibitory activity 

Homo sapiens cDNA FLJ11562 fis, clo

Human DNA sequence from clone RP1-2 

ESTs, Weakly similar to 2004399A ch 



Homo sapiens cDNA FLJ14059 fis, clo
ESTs



PH,Band_41,RhoGEF,none
sugar_tr;TM=Y;SS=M 



chromosome 11 open reading frame 14
Homo sapiens clone 23928 mRNA seque
ESTs, Moderately similar to T17372
gb:hh70c05.y1 NCI_CGAP_GU1 Homo sap
Homo sapiens mRNA full length inser



WO 03/042661 



PCT/US02/36810 



438974 AF089816 Hs.6454 
439335 AA742697 Hs.62492
412276 BE262621 Hs.73798 



456157 
452753 
414420 
446229 
453143 






AI744964 
AA382234 
AL042355 



Hs.336881 
Hs.277728 
Hs.76095 
Hs.14449 



Hs.70202 
Hs.351625 
AW613841 Hs.301394 



444838 
427876 
413040 



411362 
418444 
423464 
424604 
420059 
453271 



Hs.179516 

AI207256 Hs.13766 

AI061301 Hs.164773

AA461599 Hs.23459

NM_014521 Hs.17667

T85247 Hs.351875 

BE244231 Hs.84038 

AK000517 Hs.6844

AI589941 Hs.8254

AA297634 Hs.54925

NM_002447 Hs.2942

AW973666 Hs.153850

BE382685 Hs.158549

AA306477 Hs.29379 

BE614410 Hs.23044 

AI902899 Hs.85155 

NM_016240 Hs.128856

AW865388 Hs.151076

AF161486 Hs.94769

AA903424 Hs.6786

NM_002776 Hs.69423

AA011117 Hs.3745 



PDZ;TM=N;SS=M 

TM=N;SS=M 

MIF,sugar_tr,none 



TM=M;SS=N 

serpin;TM=N;SS=M 

WD40;TM=M;SS=N 

AAA,hormone_rec,zf-C4

IRK;TM=Y;SS=N 

TM=Y;SS=M 

RFX_DNA_binding,none 

integrin_A,FG-GAP,none

Na_Ca_Ex;TM=Y;SS=M
EMP24_GP25L;TM=Y;SS=M
filament;TM=N;SS=N 
trypsin,kringle,UPAR_LY6 

SH3;TM=M;SS=N 

COX6C;TM=M;SS=M 

TM=Y;SS=N 

AAA,NB-ARC,PAAD_DAPIN;NA; 



413244 AW966951 

411756 BE294350 

409007 AL122107 

452547 AA335295 

414359 M62194 

433212 BE218049 

449123 D50920 

431176 AI026984 

419245 AI732742 

434493 AA635305 

449177 BE616594 

430449 AA352723 

452887 AI702223 

451678 AA374181 

445457 AF168793 

407597 AA043925 

431629 AU077025 

432302 AA345857 

442549 AI751601 

437959 AI472068 

447400 AK000322 

411734 AW374954 

443547 AW271273 

417000 BE277919 



401038 

418245 AA088767 

407688 W25317 

456906 AF117646

424744 AW175781 

452195 AA994712 

415988 BE407713 

418399 AF131781 



NM_052863:Homo sapiens secretoglobi 
macrophage migration inhibitory fac 
dermatopontin 



WD repeat domain 10 

Homo sapiens cDNA: FLJ23260 fis, cl

hypothetical protein MGC3101 



Homo sapiens mRNA for FLJ00074 prot



Hs.71891 
Hs.49599 
Hs.74120 
Hs.75929 
Hs.121820 
Hs.23106 
Hs.293662 
Hs.87440 
Hs.376591
Hs.288042
Hs.241471
Hs.107253
Hs.26799
Hs.12743



pkinase,Sema,PSI,TIG,A4_E 



TM=M;SS=N 

TM=N;SS=N 

zf-CCCH;TM=M;SS=N 

Collagen;TM=Y;SS=N 

TM=M;SS=N 

LIM;TM=M;SS=N 

trypsin;TM=M;SS=N

EGF,F5_F8_type_C;TM=N;SS=

ank;TM=M;SS=N

BTB,Pep_M12B_propep,Repro

pkinase,F5_F8_type_C;TM=Y



Hs.274307 
Hs.8375 
Hs.375604 
Hs.18457 

Hs.356487
Hs.306019 
Hs.80712 
Hs.149255 
Hs.355424 
Hs.296178 
Hs.3797 



Hs.152720
Hs.116878
Hs.78943 
Hs.84753 
Hs.247735 



fn3,ig;TM=Y;SS=M 

TM=M;SS=Y 

TIG;TM=M;SS=N 

MATH,zf-TRAF,zf-C3HC4;TM= 

eIF5_eIF2B,W2;TM=M;SS=N

zf-C3HC4;TM=Y;SS=M 



TM=Y;SS=M 

GTP_CDC;TM=N;SS=M

PIP5K;TM=N;SS=M 

pkinase,F5_F8_type_C,adh_ 

Occludin;TM=M;SS=N 

ras,arf;TM=M;SS=N 

TM=M;SS=N 



414152 NM_003248 Hs.75774
421307 BE539976 Hs.103305
444868 BE560471 Hs.12101



NALP2 protein; PYRIN-Containing APA
Homo sapiens, Similar to tumor diff
KIAA1858 protein

macrophage stimulating 1 receptor (
hypothetical protein C321D2.4
ESTs, Weakly similar to T2D3_HUMAN
hypothetical protein FLJ10687
RAD51 (S. cerevisiae) homolog (E co
butyrate response factor 1 (EGF-res 



ESTs 

kallikrein 10

milk fat globule-EGF factor 8 prote
NM_031419*:Homo sapiens molecule po
kruppel-related zinc finger protein
discoidin domain receptor family, m 
Homo sapiens mRNA; cDNA DKFZp434G08 
adipose specific 2 
cadherin 11, type 2, OF 



KIAA0130 gene product



hypothetical protein DKFZp751F241 



Homo sapiens brother of CDO (BOC) m 
interferon, alpha-indu
KIAA1442 protein
TNF receptor-associated factor 4
KIAA1856 protein
hypothetical protein FLJ20315
Homo sapiens DNA from chromosome 19
hypothetical protein FLJ12666
ESTs, Weakly similar to ALU7_HUMAN
KIAA0202 protein
phosphatidylinositol-4-phosphate 5-
ESTs 



hypothetical protein FLJ22637




C9000306':gi|12737280|ref|XP_006682 
thrombospondin 4 
Homo sapiens mRNA; 
hypothetical protein 






450214 
4526B4 
422105 
422278 
434067 



BE439763 Hs.227571 

AA398859 Hs.18397 

AI929700 Hs.111680 

AF072873 Hs.114218 

H18913 Hs.124023 

NM_000165 Hs.74471

AA486846 Hs.271795

AI252640 Hs.110364 

AI520463 Hs.347408 

AB011084 Hs.48924 

AI885815 Hs.184727 



RGS;TM=M;SS=N 

TM=M;SS=M 

TM=M;SS=N

Fz,Frizzled,7tm_2;TM=Y;SS
connexin,Connexin43;TM=Y; 



regulator of G-protein signalling 4 



TM=Y;SS=N 
Armadillo_seg;TM=M;SS=M
transferrin,Guanylate_kin



frizzled (Drosophila) homolog 6
Homo sapiens cDNA FLJ14218 fis, clo
gap junction protein, alpha 1, 43kD
ESTs, Weakly similar to I38022 hypo
peptidylprolyl isomerase C (cycloph
hypothetical protein MGC13102
KIAA0512 gene product; ALEX2



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers 

Pkey CAT Number Accession 



418344 245371.1

AA216387 T63548 AA228876
AW984759 AW984744
AF119913 AI207698 R57074

BC012771 BG397153 BF366196 AA337277 AA319235 AWB43252
AA136653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE168945
AA633408 AW749965 AW629759 AI651005



TABLE 4C



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
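
The Nt_position entries are comma-separated start-end ranges, one range per predicted exon. As an illustration only, the following minimal Python sketch reads such an entry into numeric pairs. The function names `parse_nt_position` and `total_exon_length` are hypothetical, and treating both endpoints as inclusive is an assumption that the legend does not state.

```python
def parse_nt_position(field):
    """Parse an Nt_position entry such as '8615-9103' into (start, end) pairs.

    Hypothetical helper: the name and return type are illustrative only.
    """
    exons = []
    for part in field.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    # Assumption: both endpoints are inclusive, so a-b covers b - a + 1 bases.
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("116229-116371,117512-117651")
    print(exons)
    print(total_exon_length(exons))
```

An entry with a single range yields a one-element list, so rows with one or several predicted exons can be handled uniformly.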



Ref Strand 
9256180 Plus 
9743564 Plus 



403439 9719679 
401038 7232177 
404661 9797073 



Nt_position

116229-116371,117512-117651

41515-41695

8615-9103 

91463-91632 

4277-4469 

33374-33676,33769-34008 



Unique Eos probeset identifier number 

Exemplar accession number, GenBank accession number 

UniGene number 

Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, 
likely to contain; =N, less likely to contain. All other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280). 
UniGene gene title 

Ratio of 90th percentile tumor to 50th percentile of normal body tissue 
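
The legend above defines two machine-readable conventions: the predicted protein domain entries (PFAM names followed by TM and SS codes) and a ratio of the 90th percentile of tumor expression to the 50th percentile of normal body tissue expression. The following Python sketch shows one way these could be read and computed. All function names are hypothetical, skipping the placeholder tokens `none` and `NA` is an assumption, and the linear-interpolation percentile is an assumed convention, since the document does not say how percentiles were calculated.

```python
# Likelihood codes as defined in the table legend.
LIKELIHOOD = {"Y": "very likely", "M": "likely", "N": "less likely"}
# Assumption: these tokens are placeholders rather than PFAM domain names.
PLACEHOLDERS = {"none", "NA"}


def parse_domain_field(field):
    """Split an entry such as 'ig,pkinase;TM=Y;SS=M' into its parts."""
    domains = []
    flags = {"TM": None, "SS": None}
    for part in field.split(";"):
        part = part.strip()
        if "=" in part:
            key, _, code = part.partition("=")
            # Truncated entries such as 'TM=' leave the flag as None.
            if key in flags and code in LIKELIHOOD:
                flags[key] = LIKELIHOOD[code]
        elif part:
            domains.extend(
                name for name in part.split(",")
                if name and name not in PLACEHOLDERS
            )
    return {"domains": domains, "TM": flags["TM"], "SS": flags["SS"]}


def percentile(values, pct):
    """Percentile by linear interpolation (an assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no values supplied")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(tumor, normal):
    # 90th percentile of tumor samples over the median of normal samples.
    return percentile(tumor, 90) / percentile(normal, 50)


if __name__ == "__main__":
    print(parse_domain_field("ig,pkinase;TM=Y;SS=M"))
    # Made-up expression values, for illustration only.
    print(tumor_to_normal_ratio(list(range(1, 12)), [2, 4, 5, 6, 8]))
```

Entries that were cut off at the column width (for example one ending in `TM=`) parse without error, and the missing code is reported as `None` rather than guessed.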



UniGeneID Pred.Prot.Dom.



417389 BE260964 

414521 D28124 

438091 AW373062 

413815 AL046341 

439180 AI393742 

452547 AA335295

452239 AW379378 

441384 AA447849 

419223 X60111 

413859 AW992356 

410687 U24389 

422699 BE410590 

419452 U33635 

427378 BE515037 

444784 D12485 

436972 AA284679 

412926 AI879076 

425280 U31519 

432636 AA340864

423778 Y09267 

424206 NM_003734 

444797 AB018333 



Hs.301350 
Hs.82045 
Hs.76307 
Hs.351546 
Hs.75562 



Hs.119257
Hs.90572
Hs.177556
Hs.11951
Hs.25640
Hs.75061
Hs.1872
Hs.278562
Hs.132821
Hs.198241
Hs.12002



aa_permeases,pyridoxal_de
TM=M;SS=M

ATP1G1_PLM_MAT8;TM=Y;SS=M

PTN_MK;TM=M;SS=Y

DAN;TM=M;SS=M

hormone_rec,zf-C4,none 

pkinase,F5_F8_type_C;TM=Y 

Furin-like,pkinase,Recep 

ASC;TM=Y;SS=N 

LEA;TM=M;SS=N 

7tm_3,none 

transmembrane4;TM=Y;SS=M 

SAM_PNT,none 

Lysyl_oxidase;TM=N;SS=M 

SH3,HS1_rep;TM=M;SS=N

ig,pkinase;TM=Y;SS=M

MAGE;TM=M;SS=N 



PMP22_Claudin;TM=Y;SS=M

MARCKS;TM=N;SS=M

PEPCK;TM=M;SS=N

PMP22_Claudin;TM=Y;SS=M

FMO-like,pyr_redox;TM=Y;S 



melanophilin (MLPH), mRNA
KIAA1324 protein 

FXYD domain-containing ion transpor 
midkine (neurite growth-promoting f 
neuroblastoma, suppression of tumor (DAN) 
nuclear receptor subfamily 1, group 
discoidin domain receptor family, m 
v-erb-b2 avian erythroblastic leuke 
sodium channel, nonvoltage-gated 1
adipose specific 2

protein tyrosine phosphatase, recep
retinoic acid induced 3
CD9 antigen (p24) 

Homo sapiens pyruvate dehydrogenase 



SH3,SAM;TM=M;SS=N 



flavin containing monooxygenase 2 
amine oxidase, copper containing 3 
KIAA0790 protein 






402559 AF043329 



400290 H18836 

438089 W05391 

426158 NM_001982

447191 NM_014521

439941 AI392640 

439318 AW837046 

442213 N36110 

412649 NM_002206

448913 AA194422 

420166 AW732276 

407102 AA007629

452516 AA058630 

413076 U10564 

443604 C03577 

429002 AW248439 

432562 BE531048 

426359 AA376409 

417733 AL048678 

451541 BE279383 

443951 F13272 

409960 BE261944 

423184 NM_004428 

405121 X04385 

438974 AF089816 

417771 AA304698 

424118 BE269041 

402705 X67951 

417115 AW952792

442572 AI001922 

447216 R75812 

422278 AF072873 

414657 AA424074 



444143 AW747996 

414443 AU077268 

418751 BE389014 

448479 H96115 

410552 X66945 

414883 AA926960 

417426 NM_002291

428179 AI127772 

443195 BE148235 

424512 X53002 

421733 AL119671 

428950 BE311879 

450172 NM_005864

416078 AL034349 

408912 AB011084 

428373 AI751656 

449029 N28989 

406621 X57809 

431629 AU077025 

428169 AI928984 

443337 Y07604 

451292 AB037716 

425976 C75094 

426539 AB011155 

417208 S67773 

438278 BE409248 

429455 AI472111 

431685 AW296135 

445033 AV652402 

411756 BE294350 

453902 BE502341 

418005 AI186220



453064 R40334 

448520 AB002367 

452683 AI089575 

402575 AF043329 

444672 Z95636 

450440 AB024334 



Hs.173717 

Hs.9973 

Hs.102171



Hs.296267 
Hs.31608 
Hs.351546 



PAP2;TM=Y;SS=M 
SH2,WW,PID,none 
ig,LRR,LRRNT,LRRCT;TM=M;S 
TM=M;SS=N 

death,TNFR_c6;TM=Y;SS=M 
ras,ABC_tran,arf;TM=M;SS=
SH3,Sorb;TM=M;SS=N 
efhand,kazal,arf,ras,7tm_ 
Cys_knot 

hormone_rec,zf-C4,none 
Furin-like,pkinase,Recep_



PPAP2B phosphatidic acid phosphatase type 2B



Aa_trans;TM=Y;SS=N 

7tmJ,CytC_asm,GPS;TM=Y;S 

sugar_tr;TM=Y;SS=M

integrin_A,FG-GAP;TM=M;SS

rrm,zf-RanBP,pkinase,GST_

transmembrane4;TM=Y;SS=M 

transport_protSWIB,ASC 

TM=N;SS=M 

pkinase;TM=M;SS=N 

efhand;TM=M;SS=N 

Armadillo_seg;TM=M;SS=N

zf-C2H2;TM=M;SS=N 



Hs.305971 
Hs.74369 
Hs.22564 
Hs.95583 
Hs.348601 
Hs.29759 
Hs.75188 



Hs.82503 NA;NA

Hs.26557 Armadillo_seg;TM=M;SS=N

Hs.356835 PMP22_Claudin,none
Hs.355264

Hs.1624 Ephrin;TM=M;SS=M

Hs.110802 Cys_knot,TGF-beta,vwa,vwi

Hs.6454 PDZ;TM=N;SS=M 
Hs.82547 

Hs.140452 perilipin;TM=N;SS=M 

Hs.180909 AhpC-TSA;TM=M;SS=M 

Hs.334612 Sm,pkinase;TM=N;SS=N 



cytochrome_c;NA;NA 

Fz,Frizzled,7tm_2;TM=Y;SS

TM=M;SS=N 

TB2_DP1_HVA22;TM=Y;SS=M 

TM=Y;SS=M 

TM=Y;SS=M 

ig,pkinase;TM=Y;SS=N

UDPGP;TM=M;SS=N

ig,pkinase,SH2,SH3,C2,PH,

CKS;TM=N;SS=N

laminin_EGF,laminin_Nterm

pkinase,PX,pkinase_C;TM=N 




glycerol-3-phosphate de



RNA POLYMERASE I AND TRANSCRIPT REL
wee1 (S. pombe) homolog
myosin regulatory light chain 2, sm
junction plakoglobin
DKFZP586G1122 protein
Homo sapiens cDNA: FLJ23313 fis, cl
H.sapiens mRNA for 3'UTR of unknown
plakophilin 3 
ferritin, light polypeptide 



Hs.114218 

Hs.76780 

Hs.76277 

Hs.351316 

Hs.181444 

Hs.160999 

Hs.76144 

Hs.372548 

Hs.21293 

Hs.748 

Hs.348669 

Hs.82124 

Hs.279696 

Hs.193063 

Hs.149846 



Hs.24587 SH3,hormone3;TM=M;SS=N 
Hs.79005 

Hs.48924 Armadillo_seg;TM=M;SS=M 
Hs.183986 ig;TM=Y;SS=M 
Hs.22891 aa_permeases;TM=Y;SS=M 
Hs.131125 ig,HSP70,Ppx-GppA;TM=M;SS 
Hs.265827
Hs.132793
Hs.9235
Hs.26204 SH3;TM=M;SS=N
Hs.334514 voltage_CLC;TM=Y;SS=M

SH3,PDZ,Guanylate_kin;TM=
ig,pkinase;TM=Y;SS=M
TFIIS,RNA_POL_M_15KD,UPF0
lectin_c

CH,DAG_PE-bind,PH,RhoGEF, 
ank;TM=N;SS=N 
pkinase,F5_F8_type_C;TM=Y 

Collagen,TSPN;TM=M;SS=M 
SH3,none 

aa_permeases,pyridoxal_de

pkinase,DCX;TM=M;SS=N
homeobox,none
PAP2;TM=Y;SS=M
laminin_EGF,laminin_G,EGF
14-3-3;TM=M;SS=N 



cargo selection protein (mannose 6 
peroxiredoxin 1 (PRDX1) 
small nuclear ribonucleoprotein pol 
hypothetical protein FLJ22415
p75NTR-associated cell death execut
frizzled (Drosophila) homolog 6
protein phosphatase 1, regulatory (
Homo sapiens, clone MGC:9381, mRNA,



UDP-N-acetylglucosamine pyrophospho
fibroblast growth factor receptor 1 
CDC28 protein kinase 1 



Homo sapiens cDNA FLJ14201 fis, c
integrin, beta 5

fibroblast growth factor receptor 3



photoRC,UPF0118;TM=Y;SS=N 



Hs.170290 
Hs.81665 
Hs.57988 
Hs.278694 
Hs.267659 
Hs.72901 
Hs.71891 
Hs.3402 
Hs.83164 



Hs.21355 
Hs.374574 
Hs.173717 
Hs.11669 
Hs.25001 



signal transduction protein (SH3 co 
protein tyrosine phosphatase, recep 
KIAA0512 gene product; ALEX2
poliovirus receptor-related 2 (herp
solute carrier family 7 (cationic a
immunoglobulin lambda locus
interferon, alpha-inducible protein
golgi phosphoprotein 2
non-metastatic cells 4, protein exp 
KIAA1295 protein 
NG22 protein 

discs, large (Drosophila) homolog 5 
v-kit Hardy-Zuckerman 4 feline sarc 
hypothetical protein FLJ22357 simil
CD209 antigen
vav 3 oncogene 




phosphatidic acid phosphatase type 2B






432314 AA533447

438564 AA381553 

444252 R21135 

425184 BE278288 

431890 X17033 

449475 AI348027 

449538 AI559444 

414496 W73853 

414217 AI309298 

445333 BE537641 

431183 NM_006855

409645 AI142265 

412276 BE262621 

416137 BE279513 

412969 AI373162 

414504 AW069181 

433573 AF234887 

436415 BE265254 

413900 AW409747 

441455 AJ271671 

444006 BE395085 

408269 AW888219 

411372 AI147861 

450825 AC005954 

456534 X91195 

451558 NM_001089

446812 AL042279 

424307 AW293399 

405484 XM_093451 

425367 BE271188 

444607 AW405635 

421456 AW579842 
412810 M21574 

450334 AF035959 

453880 AI803166 

439578 AW263124 

450954 AI904740 
414555 N98569 
409963 AA133590 
450463 AW952018 
425177 AF127577 
445496 AB007860 
428981 BE313077 
424441 X14850 
415662 AW972481 
422105 AI929700 



425205 NMJM5854 

444633 AF111713

431565 AF161470 

429655 U48959 

431886 L77964 

453143 AA382234 

451863 AL120634

422293 X94453 

432179 X75208 

408048 NM_007203 

448153 Y10805 

421251 Z28913 

439039 AI656707 

409882 AJ243191 

451295 AI557212 

442549 AI751601 

445930 AF055009 

453082 H18835 

426432 AF001601 



414774 X02419 
414368 W70171 
446051 BE048061 



447918 AI129320 

405517 AF000974 

413588 AA971014 

411089 AA456454 



Hs.285173 
Hs.198253
Hs.54985 



Hs.104679 

Hs.355424 

Hs.279898 

Hs.44278 

Hs.250696 

Hs.55498 

Hs.73798 

Hs.278607 

Hs.75103 

Hs.115175

Hs.57652 

Hs.343268 

Hs.75612 

Hs.7854 

Hs.334762 

Hs.44077 

Hs.213289 

Hs.25527 

Hs.100623

Hs.26630

Hs.16206

Hs.356377 



ig,MHC_II_alpha,none

ig;TM=Y;SS=M 
vwa,integrin_A,FG-GAP;TM= 
transmembrane4;TM=Y;SS=M 
TM=M;SS=M 

pkinase,F5_F8_type_C,adh_
NA;NA

ras,arf,TK;TM=N;SS=M

ER_lumen_recept;TM=M;SS=M

polyprenyl_synt;TM=M;SS=N 

MIF,sugar_tr,none 

pkinase,UBA,ThiF;TM=M;SS= 

14-3-3;TM=N;SS=M 

pkinase,SAM;TM=M;SS=N 

7tm_2,EGF,cadherin,lamini

Peptidase_M24,Furin-like, 

TPR,PDZ,WW,Guanylate_kin; 

Zip;TM=Y;SS=M 

ldl_recept_a,PKD,MHC_I;TM

CH;TM=M;SS=N

Glyco_transf_11,EGF,ldl_r

PDZ,Guanylate_kin;TM=N;SS

LIM,PDZ,pkinase;TM=N;SS=M

ABC_tran,SRP54;TM=Y;SS=M



Hs.350547 
Hs.25691 
Hs.76422 
Hs.377830 
Hs.201398 
Hs.155017
Hs.12802
Hs.93135
Hs.147097
Hs.170610
Hs.111680
Hs.314807
Hs.42331
Hs.155106
Hs.12284
Hs.260622 
Hs.211582 
Hs.271980 



Hs.114366
Hs.2913 
Hs.42322 
Hs.20521 



Hs.31608 

Hs.169857

Hs.78781 

Hs.25450 

Hs.77196 

Hs.284243 

Hs.77274 

Hs.75939 

Hs.37054 

Hs.249159 

Hs.7036 

Hs.2128 

Hs.208558 

Hs.115175

Hs.119498

Hs.75432 

Hs.355702 

Hs.342874 



TM=N;SS=M 

TM=M;SS=Y

PI-PLC-X,PH,PI-PLC-Y 

zf-C2H2,DUF18,efhand,C2,P 

ig,pkinase,DUF11;TM4,1;S3- 

PAP2;TM=Y;SS=M 

HSP70,none 

WD40;TM=M;SS=N 

TM=Y;SS=M 

phoslip;TM=M;SS=Y 

MBOAT,none

C1q,Collagen;TM=M;SS=Y 

TM=N;SS=M 

SH3,ank,PH,ArfGap;TM=M;SS 

histone,CBFD_NFYB_HMF;TM= 

pkinase,none 

TM=M;SS=N



Lutheran blood group (Auberger b an 
integrin, alpha 2 (CD49B, alpha 2 s
hypothetical protein PP1057 



tyrosine 3-monooxygenase/tryptophan 
sterile-alpha motif and leucine zip 
cadherin, EGF LAG seven-pass G-type 
proliferation-associated 2G4,38kD 
stress-induced-phosphoprotein 1 (Hs 



type" 

parvin, alpha

low density lipoprotein receptor (f
tight junction protein 3 (zona occl
phospholipase C, beta 3, neighbor p
ATP-binding cassette, sub-family A 



C30021 24':gi| 1 2737280|ref|XP_006682 
protein tyrosine phosphatase, recep 
ESTs 

hypothetical protein FLJ10697
platelet-derived growth factor rece
phosphatidic acid phosphatase type
ESTs, Weakly similar to I38022 hypo
nuclear receptor co-repressor/HDAC3
receptor (calcitonin) activity modi
phospholipase A2, group IIA (platel



G protein coupled receptor interact 
nuclear receptor interacting protei
development and differentiation enh

ESTs, Weakly similar to ALU2_HUMAN
H2A histone family, member X
ESTs, Highly similar to G01887 MEK



Ephrin;TM=M;SS=*l 
TM=Y;SS=N 
ig;TM=Y;SS=M 
TM=Y;SS=N 
pkinase,fn3,ig,none 
pkinase;TM=M;SS=N 
serpin;TM=N;SS=M 
cpn60_TCP1,E1-E2_ATPase,C 
aldedh,aakinase;TM=M;SS=N 
EPH_lbd,fn3,pkinase,SAM;T 
Paralemmin;TM=M;SS=N 
NusG;TM=N;SS=M 
LIM,PDZ;TM=N;SS=M 
pkinase,none 
HSP20;TM=N;SS=M 
pkinase,DAG_PE-bind,pkina 
MATH,zf-TRAF,zf-C3HC4;TM= 
DAGKc,DAGKa,ank,WD40,bZIP 
ion_trans;TM=Y;SS=M 
Arylesterase;TM=M;SS=N 
PDGF;TM=M;SS=M 
Nucleoside_tran;TM=Y;SS=M 
efhand,SH3,spectrin;TM=N; 
transmembrane4;TM=Y;SS=M 
kn'ngle.trypsin.plantjhi 
PRK,CoaE;TM=N;SS=N 
Ephrin,A_deamin,dsrm,z-al 
7tm_1,7tm_2;TM=Y;SS=M 
ROK;TM=M;SS=N 
Rhodanese,DSPc,Y_phosphat 
integrin_A,FG-GAP,none 
pkinase,SAM,none 
LIM;TM=M;SS=N 
IMPDH_C,CBS,IMPDH_N;TM=M; 

zona_pellucida;TM=Y;SS=M 



ESTs 



mitogen-activated protein kinase 6 
protein tyrosine phosphatase, recep 
ATPase, Ca transporting, plasma mem 
pyrroline-5-carboxylate synthetase 
EphB3 

A kinase (PRKA) anchor protein 2 
HMT1 (hnRNP methyltransferase, S. c 
enigma (LIM domain protein) 
ESTs 

heat shock 27kD protein family, mem 
ESTs, Moderately similar to I54374 
TNF receptor-associated factor 4 
Homo sapiens clone 24747 mRNA seque 
hypothetical protein FLJ20041 



vascular endothelial growth factor 
solute carrier family 29 (nucleosid 
spectrin, alpha, non-erythrocytic 1 
tetraspan NET-6 protein 
plasminogen activator, urokinase 
uridine monophosphate kinase 



ESTs, Highly similar to JC5818 gamm 
thyroid hormone receptor interactor 6 
IMP (inosine monophosphate) dehydro 
cell division cycle 2-like 1 (PITSL 
transforming growth factor, beta re 
422034 AC006486 

417098 AB017365 

430526 AF181862 

414176 BE140638 

416710 AI268325 

417896 AA379770 

413244 AW955951 

421837 AF135168 

429379 NM_014840 

429619 AL120751 

437275 AW976035 

421071 AI311238 

448581 NM_002709 

452568 M805634 

452069 AB028949 

437175 AW968078 



452345 M293279 

443412 W84893 

412853 M34175 

439866 AA280717 

439975 AW328081 

435523 T62849 



422765 AW409701 

427502 AI811865 

414166 AW888941 

424954 NM_000546 

422089 AA523172 

426636 BE242634 

410793 AW581906 

TABLE 5B 



Hs.38095 ABC_tran,PRK;TM=Y;SS=M 

Hs.42322 Paralemmin;TM=M;SS=N 

Hs.333069 Ets;TM=M;SS=N 

Hs.173859 Frizzled,Fz,7tm_2,toxin_2 

Hs.242407 7tm_3;TM=Y;SS=M 

Hs.75794 7tm_1,CRCB;TM=Y;SS=N 

Hs.54890 Peptidase_M49,EGF,ig,Neur 

Hs.82890 DAD;TM=Y;SS=M 

Hs.159265 BTB,Pep_M12B_propep,Repro 

Hs.108802 AAA,cdc48_N,cdc48_2,NB-AR 

Hs.200598 pkinase,RIO1;TM=M;SS=N 
Hs.211568 

Hs.292396 Frizzled,Fz 
Hs]21537 



Hs.25797 

Hs.301394 

Hs.172084 

Hs.29173 

Hs.9305 

Hs.74626 

Hs.6727 

Hs.6817 

Hs.8997 
Hs.74335 
Hs.285115 
Hs.6314 



spectrin,SH3,PH,CH 
rrm;TM=M;SS=N 
IRK;TM=Y;SS=N 
PHD;TM=M;SS=N 



frizzled (Drosophila) homolog 7 



KIAA0537 gene product 



ESTs, Weakly similar to A47582 B-ce 
ESTs, Weakly similar to CGHUIEcoll 
protein phosphatase 1, catalytic su 
Homo sapiens mRNA; cDNA DKFZp547M07 
KIAA1026 protein 

protein kinase, cAMP-dependent, cat 
gb:ok33a11.s1 Soares_NSF_F8_9W_OT_P 
splicing factor 3b, subunit 4, 49kD 
hypothetical protein MGC3101 
Homo sapiens, clone IMAGE:3627860, 



Hs.1578 

Hs.75789 

Hs.1846 

Hs.103135 

Hs.2055 

Hs.66392 



Adaptin_N,Alpha_adaptinC2 
rrm,NTF2;TM=M;SS=N 
Ham1p_like;TM=M;SS=N 
TM=Y;SS=M 

HSP70,ig,Ppx-GppA;TM=M;SS 
HSP90,HATPase_c;TM=M;SS=N 
fn3;TM=Y;SS=M 

BIR;TM=M;SS=N 
TM=M;SS=N 

DEAD,helicase_C,rrm,Ndr,C 
P53,WD40,IRK;TM=M;SS=N 
REJ,PLAT,PKD,WSC,LRR 
ThiF,UBACT;TM=M;SS=N 
SH3,efhand,C2,PH,RhoGEF,M 



adaptor-related protein complex 2, 
Ras-GTPase activating protein SH3 d 
inosine triphosphatase (nucleoside 



(in13rec 

type I transmembrane receptor (seiz 
baculoviral IAP repeat-containing 5 
Homo sapiens, clone IMAGE:3161564, 
N-myc downstream regulated 
tumor protein p53 (Li-Fraumeni synd 
ESTs, Weakly similar to SFR4_HUMAN 
ubiquitin-activating enzyme E1 (A1S 



437056 428504_3 AW976398 AI147061 AA765223 AA743380 AI803927 
TABLE 5C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand Nt_position 

405484 5922025 Plus 199214-199579,199672-199920,200262-20049 



TABLE 6A: 777 GENES UP-REGULATED IN COLON CANCER COMPARED TO NORMAL ADULT TISSUES 

Table 6A lists 777 genes up-regulated in colon cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such 
that the ratio of "average" colon cancer to "average" normal adult tissues was greater than or equal to 3.0. The "average" colon cancer level was set to the W< percentile amongst 95 
colon cancers. The "average" normal adult tissue level was set to the 90th percentile amongst 209 non-malignant tissues. In order to remove gene-specific background levels of 
non-specific hybridization, the 15th percentile value amongst the 209 non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 
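
The selection rule described for Table 6A can be illustrated with a short sketch. This is only an illustration under stated assumptions and not the procedure actually used to build the table: the function names are hypothetical, the percentile is computed by linear interpolation (the text does not say which percentile method was used), the tumor percentile is left as a required parameter because its value is not legible in the text above, and the handling of a non-positive denominator is an assumption.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (pct in 0..100).

    The interpolation scheme is an assumption; the source does not specify one.
    """
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(
    tumor: Sequence[float],
    normal: Sequence[float],
    tumor_pct: float,             # not legible in the source; caller must supply
    normal_pct: float = 90.0,     # "average" normal level per the Table 6A text
    background_pct: float = 15.0  # background level per the Table 6A text
) -> float:
    """Background-subtracted ratio of "average" tumor to "average" normal."""
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        # Assumption: treat a non-positive denominator as an unbounded ratio
        # when the tumor signal exceeds background, otherwise as zero.
        return float("inf") if numerator > 0 else 0.0
    return numerator / denominator


def is_up_regulated(tumor, normal, tumor_pct, threshold: float = 3.0) -> bool:
    """True when the ratio meets the >= 3.0 cutoff stated for Table 6A."""
    return tumor_to_normal_ratio(tumor, normal, tumor_pct) >= threshold


if __name__ == "__main__":
    # Synthetic expression values, for illustration only.
    normal_levels = [float(i) for i in range(101)]
    tumor_levels = [float(i) for i in range(200, 301)]
    print(tumor_to_normal_ratio(tumor_levels, normal_levels, tumor_pct=50.0))
    print(is_up_regulated(tumor_levels, normal_levels, tumor_pct=50.0))
```

Subtracting the same background value from both the numerator and the denominator, as the text describes, keeps a probeset with a high non-specific hybridization floor from appearing up-regulated only because of that floor.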
Pkey ExAccn UnigeneID Unigene Title 

447033 AI357412 Hs.157601 ESTs 

409041 AB033025 Hs.50081 KIAA1199 protein 

418007 M13509 Hs.83169 

422330 D30783 Hs.115263 

428187 AI687303 Hs.285529 G protein-coupled receptor 49 

428368 BE440042 Hs.83326 matrix metalloproteinase 3 (strc 

422956 BE545072 Hs.122579 hypothetical protein FLJ10461 

416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient, yeast, h 

444783 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 

415989 AI267700 Hs.317584 ESTs 

427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 

441031 AI110684 Hs.7645 fibrinogen, B beta polypeptide 

421470 R27496 Hs.1378 annexin A3 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length insert cDN 

449032 M045573 Hs.22900 nuclear factor (erythroid-derived 2)-lik 

450531 AW301032 Hs.203800 ESTs 

432867 AW016936 Hs.233364 ESTs 

443211 AI128388 Hs.143655 ESTs 

450221 AA328102 Hs.24641 cytoskeleton associated protein 2 

406964 M21305 gb:Human alpha satellite and satellite 3 

410355 S58544 Hs.153057 sperm associated antigen 1 

441377 BE218239 Hs.202656 ESTs 

413318 AU076607 Hs.75285 inter-alpha (globulin) inhibitor, H2 pol 

442409 BE208843 Hs.129544 hypothetical protein MGC15438 

440304 BE159984 Hs.125395 ESTs 

426427 M86699 Hs.169840 TTK protein kinase 

451561 N52812 Hs.177403 ESTs 

434032 AW009951 Hs.206892 ESTs 

428365 AA295331 Hs.183861 Homo sapiens cDNA FLJ20042 fis, clone CO 

422420 U03398 Hs.1524 tumor necrosis factor (ligand) superfami 

428664 AK001666 Hs.189095 similar to SALL1 (sal (Drosophila)-like 

446232 AI281848 Hs.194691 retinoic acid induced 3 

424252 AK000520 Hs.143811 hypothetical protein FLJ20513 

450149 AW969781 Hs.132863 Zic family member 2 (odd-paired Drosophi 

419752 AA249573 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN Z 

422011 U30246 Hs.11073B solute carrier family 12 (sodium/potassi 

452461 N78223 Hs.108106 transcription factor 

431808 M30703 Hs.270833 amphiregulin (schwan 
400534 

448706 AW291095 Hs.21814 interleukin 20 receptor, alpha 

453688 AW381270 Hs.194110 hypothetical protein PRO2730 
426890 AA393167 Hs.41294 ESTs 

408950 AA707814 Hs.14945 long fatty acyl-CoA synthetase 2 gene 

423020 AA383092 Hs.1608 replication protein A3 (14kD) 

453922 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 

447505 AL049266 Hs.18724 Homo sapiens mRNA; cDNA DKFZp564F093 (fr 

425761 AW664214 Hs.196729 ESTs 

404567 

428535 AI143139 Hs.2288 visinin-like 1 

414972 BE263782 Hs.77695 KIAA0008 gene product 

459504 BE514127 gb:601315974F1 NIH.MGCJ Homo sapiens cD 

438018 AK001160 Hs.5999 hypothetical protein FLJ10298 

447863 AL047611 Hs.288885 Homo sapiens cDNA FLJ14246 fis, clone OV 

442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 

424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 

451917 AW391351 Hs.50820 Homo sapiens unknown mRNA 

420900 AL045633 Hs.44269 ESTs 

438639 AI278360 Hs.31409 ESTs 

439521 AI808955 Hs.58248 ESTs 

445676 AI247763 Hs.16928 ESTs 

408489 AI082437 Hs.26690 ESTs 

418738 AW388633 Hs.6682 solute carrier family 7, (cationic amino 
446311 AW007294 Hs.149795 ESTs, Moderately similar to ALU1_HUMAN A 
423349 AF010258 Hs.127428 homeobox A9 
400195 NA NA 

411765 H43346 gb:yp09a04.M Soares breast 3NbHBst Homo 

418895 AA894638 Hs.14600 ESTs 

424653 AW977534 Hs.151469 calcium/calmodulin-dependent serine prot 
448776 BE302464 Hs.30057 MRS2 (S. cerevisiae)-like, magnesium hom 
417218 AA005247 Hs.285754 met proto-oncogene (hepatocyte growth fa 
414559 AV656184 Hs.76452 C-reacflu 
445436 AI224105 Hs.151408 ESTs 
403776 

433447 U29195 Hs.3281 neuronal pentraxin II 
407168 R45175 Hs.117183 ESTs 
419335 AW960146 Hs.284137 hypothetical protein FLJ12888 
422505 AL120862 Hs.124165 ESTs 

458242 BE299588 Hs.28465 Homo sapiens cDNA: FLJ21869 fis, clone H 
452943 BE247449 Hs.31082 hypothetical protein FLJ10525 
446155 AI553695 Hs.159422 Homo sapiens cDNA FLJ13997 fis, clone Y7 
409687 T51125 Hs.8493 ESTs 
416661 AA634543 Hs.79440 IGF-II mRNA-binding protein 3 

443614 AV655386 Hs.7645 fibrinogen, B beta polypeptide 

406360 NA NA 

443450 N66C45 Hs.133629 ESTs 

414422 AA147224 Hs.337232 ESTs 

442611 BE077155 Hs.177537 hypothetical protein DKFZp761B1514 

438604 M811896 Hs.44604 ESTs 

447254 NM_004153 Hs.17908 origin recognition complex, subunit 1 (y 

400250 NA NA 

420759 T11832 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 
404996 

450459 AI697193 Hs.299254 Homo sapiens cDNA: FLJ23597 fis, clone L 

445019 AI205540 Hs.281295 ESTs 

409269 M576953 Hs.22972 hypothetical protein FLJ13352 

448816 AB033052 Hs.22151 KIAA1226 protein 

444361 W76027 Hs.23920 hypothetical protein FLJ11105 

449370 AK002114 Hs.23495 hypothetical protein FLJ11252 

433859 AW896758 Hs.273789 ESTs 

416143 AI955650 Hs.79033 glutaminyl-peptide cyclotransferase (glu 

456120 AA535244 Hs.78305 RAB2, member RAS oncogene family 

419751 AW195581 Hs.93121 KIAA0761 protein 

417830 AW504786 Hs.122579 hypothetical protein FLJ10461 

456553 AA721325 Hs.189058 ESTs, Highly similar to Similar to a C.e 

421373 AA808229 Hs.167771 ESTs 

418763 AK000219 Hs.88367 hypothetical protein FLJ20212 

423248 AA380177 Hs.125845 ribulose-5-phosphate-3-epimerase 

444798 BE242144 Hs.12013 ATP-binding cassette, sub-family E (OABP 

429486 AF155827 Hs.203963 hypothetical protein FLJ10339 

413573 AI733859 Hs.149089 ESTs 

442660 AW138174 Hs.130651 ESTs 

427878 C05766 Hs.181022 CGI-07 protein 

438394 BE379623 Hs.27693 peptidylprolyl isomerase (cyclophilin)-l 

422711 D60641 Hs.21739 Homo sapiens mRNA; cDNA DKFZp586l1518 (f 

457030 AI301740 Hs.173381 dihydropyrimidinase-like 2 

417168 AL133117 Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1121 (f 

442973 BE567665 Hs.288550 Homo sapiens cDNA: FLJ23156 fis, clone L 

453102 NM_007197 Hs.31664 frizzled (Drosophila) homolog 10 

416018 AW138239 Hs.78977 proprotein convertase subtilisin/kexin t 

424051 AL110203 Hs.138411 Homo sapiens mRNA; cDNA DKFZp586J1922 (f 

401644 

450434 M166950 Hs.195870 hypothetical protein FLJ14991 

428479 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 
406747 AI925153 Hs.217493 annexin A2 
445191 AF048686 Hs.12393 dTDP-D-glucose 4,6-dehydratase 
424296 AI631874 Hs.155140 casein kinase 2, alpha 1 polypeptide 
428392 H10233 Hs.2265 secretory granule, neuroendocrine protei 
427072 H38046 Hs.303193 ESTs 
452588 AA889120 Hs.110637 homeobox A10 
439809 R41396 Hs.101774 hypothetical protein FLJ23045 
423123 NM_012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE ; I 
418454 AA315308 Hs.195870 hypothetical protein FLJ14991 
423685 BE350494 Hs.49753 uveal autoantigen with coiled coil domai 
447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 
410908 AA121686 Hs.10592 ESTs 

406671 AA129547 Hs.285754 met proto-oncogene (hepatocyte growth fa 
450638 AK001826 Hs.25245 hypothetical protein FLJ11269 
452838 U65011 Hs.30743 preferentially exp 
451389 N73222 Hs.279009 matrix Gla protein 
438202 AW169287 Hs.22588 ESTs 
452198 AI097560 Hs.51210 ESTs, Weakly similar to I38022 hypotheti 
425860 L29339 Hs.1964 solute carrier family 5 (sodium/glucose 
435538 AB011540 Hs.4930 low density lipoprotein receptor-related 
436539 AI005457 Hs.275048 ESTs 

417404 NM_007350 Hs.82101 pleckstrin homology-like domain, family 
430388 AA356923 Hs.240770 nuclear cap binding protein subunit 2, 2 
425905 AB032959 Hs.318584 novel C3HC4 type Zinc finger (ring finge 
407237 AA169872 Hs.6216 Homo sapiens hepatocellular carcinoma-as 
413597 AW302885 Hs.117183 ESTs 

429529 AA454190 Hs.24283 ESTs, Moderately similar to reduced expr 
409916 BE313625 Hs.57435 solute carrier family 11 (proton-coupled 
407746 AK001962 Hs.38114 hypothetical protein FLJ11100 
426921 AA037145 Hs.172865 cleavage stimulation factor, 3' pre-RNA, 
438050 BE262816 Hs.6061 protein kinase, AMP-activated, beta 1 no 
416857 AA188775 Hs.292453 ESTs 

409683 U33317 Hs.711 defensin, alpha 6, Paneth cell-specific 
452291 AF015592 Hs.28853 CDC7 (cell division cycle 7, S. cerevisi 
420096 AA775910 Hs.95011 syntrophin, beta 1 (dystrophin-associate 
448693 AW004854 Hs.228320 hypothetical protein FLJ23537 
433393 AF038564 Hs.98074 itchy (mouse homolog) E3 ubiquitin prote 
424745 AA214618 Hs.152759 activator of S plr " 
408771 AW732573 Hs.47584 
454438 AA224053 Hs.172405 cell division cycle 27 
407771 AL138272 Hs.62713 
416057 AI927382 Hs.29857 ESTs 

442917 AA314907 Hs.85950 ESTs 

451813 NM_016117 Hs.27182 phospholipase A2-activating protein 

453700 AB009426 Hs.560 apolipoprotein B mRNA editing enzyme, ca 

412246 AI160873 Hs.69233 zinc finger protein 

430899 BE018217 Hs.183528 hypothetical protein FLJ14906 

418668 AW407987 Hs.173518 M-phase phosphoprotein homolog 

416421 AA134006 Hs.79306 eukaryotic translation initiation factor 

455800 R22479 Hs.167073 Homo sapiens cDNA FLJ13047 fis, clone NT 

409913 BE243842 Hs.283077 centrosomal P4.1-associated protein; unc 

429201 X03178 Hs.198246 group-specific component (vitamin D bind 

408908 BE296227 Hs.250822 serine/threonine kinase 15 

413585 AI133452 Hs.75431 fibrinogen, gamma polypeptide 

432298 AL118812 Hs.274293 Homo sapiens mRNA; cDNA DKFZp761G1111 (f 

424345 AK001380 Hs.145479 Homo sapiens cDNA FLJ10518 fis, clone NT 

441645 AI222279 Hs.201555 ESTs, Weakly similar to T23406 hypotheti 

409187 AF154830 Hs.50966 carbamoyl-phosphate synthetase 1, mitoch 

406687 M31126 Hs.272620 pregnancy specific beta-1 glycoprotein 9 

452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 

429945 NM_006729 Hs.226483 diaphanous (Drosophila, homolog) 2 

459309 AA040620 Hs.5672 hypothetical protein AF140225 

410060 NM_001448 Hs.58367 glypican 4 

423806 M331247 Hs.86617 ESTs 

454036 M374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, 

430345 AK000282 Hs.239681 hypothetical protein FLJ20275 

411096 U80034 Hs.68583 mitochondrial intermediate peptidase 

429125 M446854 Hs.271004 ESTs, Weakly similar to I38022 hypotheti 

442957 AI949952 Hs.49397 ESTs 

426642 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37 

426518 Z43039 Hs.170198 KIAA0009 gene product 

441894 AA134329 Hs.24170 Homo sapiens, clone IMAGE:3685398, mRNA, 

415385 R17798 Hs.7535 COBW-like protein 

409757 NM_001898 Hs.123114 cystatin SN 

433687 AA743991 gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 

424492 AI133-182 gb:HA2093 Human fetal liver cDNA library 

452606 N45202 Hs.90012 hypothetical protein FLJ23441 

438777 AA825487 Hs.142179 ESTs 

417235 AA810278 Hs.24250 ESTs 

451177 AI969716 Hs.13034 ESTs 

415227 AW821113 Hs.72402 ESTs 

436217 T53925 Hs.107 fibrinogen-like 1 

452081 AW135220 Hs.241921 ESTs 

426010 AA136563 Hs.1975 

426235 AI631964 Hs.34447 

445640 AW969626 Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 

444743 M045648 Hs.301957 nudix (nucleoside diphosphate linked mol 

442980 AA857025 Hs.8878 kinesin-like 1 

418882 NM_004996 Hs.89433 ATP-binding cassette, sub-family C (CFTR 

453884 AA355925 Hs.36232 KIAA0186 gene product 

444478 W07318 Hs.240 M-phase phosphoprotein 1 

419502 AU076704 Hs.90765 fibrinogen, A alpha polypeptide 

420218 AW958037 Hs.286 ribosomal protein L4 

421155 H87879 Hs.102267 lysyl oxidase 

441421 AA356792 Hs.334824 hypothetical protein FLJ14825 

456435 AI880384 Hs.270747 ESTs, Weakly similar to ALU2_HUMAN ALU S 

428046 AW812795 Hs.155381 ESTs, Moderately similar to I38022 hypot 

446372 AB020644 Hs.14945 long fatty acyl-CoA synthetase 2 gene 

421477 AI904743 Hs.104650 hypothetical protein FLJ10292 

409564 AA045857 Hs.54943 fracture callus 1 (rat) homolog 

453080 AI423056 Hs.23921 hypothetical protein DKFZp547A023 
430217 N47863 Hs.336901 ribosomal protein S24 
417372 T99755 Hs.334728 ESTs 
415139 AW975942 Hs.48524 " 
412140 AA219691 Hs.73625 
424086 AI351010 Hs.102267 lysyl oxi 
409327 L41162 Hs.53563 collagen, type IX, alpha 3 
417576 AA339449 Hs.82285 phosphoribosylglycinamide formyltransfer 
452131 AI860677 Hs.72325 Human DNA sequence from clone RP1-187J11 
436016 AA806465 Hs.121536 Human DNA sequence from clone RP11-472E5 
449347 AV649748 Hs.295901 KIAA0493 protein 
445038 AI635444 Hs.143917 dJ467N11.1 protein 
453921 AI824009 Hs.44577 ESTs 
413582 AW295647 Hs.71331 hypothetical protein MGC5350 
421076 AW007988 Hs.233299 ESTs, Weakly similar to I38022 hypotheti 
407884 BE075315 Hs.95011 syntrophin, beta 1 (dystrophin-associate 
433384 AI021992 Hs.124244 ESTs 
422026 U80736 Hs.110826 trinucleotide repeat containing 9 
447020 T27308 Hs.16986 hypothetical protein FLJ11046 
441795 N58115 Hs.21137 AD024 protein 
449416 AI651016 Hs.246311 ESTs 
418379 AA218940 Hs.137516 fidgetin-like 1 
426753 T89832 Hs.170278 ESTs 
422109 S73265 Hs.1473 gastrin-releasing peptide 
replication factor C (activator 1) 2 (40 



424717 H03754 Hs.152213 wingless-type MMTV integration site fami 

427728 AJ245600 Hs.180545 Homo sapiens mRNA for hypothetical prote 

447713 AI420733 Hs.207083 ESTs 

425739 T19016 Hs.159410 molybdopterin synthase sulfurylase 

420170 U43374 Hs.95631 Human normal keratinocyte mRNA 

407244 M10014 Hs.75431 fibrinogen, gamma polypeptide 

441139 AW449009 Hs.126647 ESTs 

451121 AW973795 Hs.128927 Homo sapiens cDNA FLJ13903 fis, clone TH 

435202 AI971313 Hs.170204 KIAA0551 protein 

431699 NM_001173 Hs.267831 Rho GTPase activating protein 5 

418384 AW149266 Hs.25130 Homo sapiens cDNA FLJ14923 fis, clone PL 

422805 AA436989 Hs.121017 H2A histone family, member A 

411750 BE562298 Hs.71827 KIAA0112 protein; homolog of yeast ribos 

435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2_HUMAN MUCIN 

413293 AL047483 Hs.302498 GTP-binding protein homologous to Saccha 

445236 AK001676 Hs.12457 hypothetical protein FLJ10814 

415091 AL044872 Hs.77910 3-hydroxy-3-methylglutaryl-Coenzyme A sy 

441675 AI914329 Hs.5461 ESTs 

449802 AW901804 Hs.23984 hypothetical protein FLJ20147 

401480 NA NA 

408562 AI436323 Hs.31141 Homo sapiens mRNA for KIAA1568 protein, 

426269 H15302 Hs.168950 Homo sapiens mRNA; cDNA DKFZp5G6A104G (f 

414718 H95348 Hs.107987 ESTs 

419139 AI123517 Hs.269940 ESTs 

430789 AA632577 Hs.310235 ESTs, Weakly similar to I7888S serine/lh 

425420 BE536911 Hs.234545 hypothetical protein NUF2R 

408758 NM_003686 Hs.47504 exonuclease 1 

439741 BE379646 Hs.6904 Homo sapiens mRNA full length insert cDN 

418612 AB037788 Hs.224961 cleavage and polyadenylation specific fa 

433927 AI557019 Hs.116467 small nuclear protein PRAC 

450568 AL050078 Hs.25159 Homo sapiens cDNA FLJ10784 fis, clone NT 

455777 AA524285 Hs.154172 ESTs, Moderately similar to BCGF_HUMAN B 

428743 AL080060 Hs.301549 Homo sapiens mRNA; cDNA DKFZp564H172 (fr 

42112G M74587 Hs.102122 insulin-like growth factor binding prote 

411835 U29343 Hs.72550 hyaluronan-mediated motility receptor (R 

418396 AI765805 Hs.26691 ESTs 

430510 AW162916 Hs.241576 hypothetical protein PRO2577 

408430 S79876 Hs.44926 dipeptidylpeptidase IV (CD26, adenosine 



411901 AA166730 Hs.6966 Human DNA sequence from clone RP1-187J11 

404025 NA NA 

451807 W52854 Hs.27099 hypothetical protein FLJ23293 similar to 

436662 AI582393 Hs.126695 ESTs 

414140 AA281279 Hs.23317 hypothetical protein FLJ14681 

410044 BE566742 Hs.58159 highly expressed in cancer, rich in leuc 

431041 AA490967 Hs.197955 KIAA0704 protein 

417860 AW408557 Hs.235498 hypothetical protein FLJ14075 

410658 AW105231 Hs.192035 ESTs 

425895 AI269484 Hs.161427 zinc finger protein 215 

422892 AA988176 Hs.121553 hypothetical protein FLJ20641 

436397 AA715013 Hs.169835 ESTs 

439225 AA192669 Hs.45032 ESTs 

423197 T91418 Hs.125156 transcriptional adaptor 2 (ADA2, yeast, 

413374 NM_001034 Hs.75319 ribonucleotide reductase M2 polypeptide 

412723 AA648459 Hs.335951 hypothetical protein AF301222 

425745 U44060 Hs.14427 Homo sapiens cDNA: FLJ21800 fis, clone H 
452795 AW392555 Hs.18878 hypothetical protein FLJ21620 

430704 AW813091 Hs.335799 ESTs 

429682 NM_006306 Hs.211602 SMC1 (structural maintenance of chromoso 

433326 AI379486 Hs.159430 ESTs 

437958 BE139550 Hs.121668 ESTs, Moderately similar to PC4259 fern' 

410566 AA373210 Hs.43047 Homo sapiens cDNA FLJ13585 fis, clone PL 

423343 AA324643 Hs.246106 ESTs 

416467 H57585 Hs.37467 ESTs 

408867 AA437199 Hs.656 cell division cycle 25C 

419423 D26488 Hs.90315 KIAA0007 protein 

414132 AI801235 Hs.48480 ESTs 

423948 AW392342 Hs.283077 centrosomal P4.1-associated protein; unc 

425746 NM_001701 Hs.159440 bile acid Coenzyme A: amino acid N-acylt 
451009 AA013140 Hs.115707 ESTs 

431064 AI903735 gb:MR-BT035-2001S9-031 BT035Homosapien 

432725 AL137496 Hs.9001 ESTs 

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 

410486 AW235094 Hs.69233 zinc finger protein 

428532 AF157326 Hs.184786 TBP-interacting protein 

429782 NM_005754 Hs.220689 Ras-GTPase-activating protein SH3-domain 

408380 AF123050 Hs.44532 diubiquitin 

423936 U77629 Hs.135639 achaete-scute complex (Drosophila) homol 

434294 AJ271379 Hs.76194 ribosomal protein S5 
400115 
430967 
438078 
412359 
429774 
426214 
419790 
450480 
421211 
419261 
434414 



BE167434 Hs.98471 

M081924 Hs.124918 

AW977850 Hs.23856 

M814043 Hs.88045 



Hs.33790 ESTs 
ESTs 






424625 
448912 
442671 
411893 
456281 
421106 
451401 
404516 
414968 
421308 



AI522215 Hs.50883 

H59846 Hs.128355 

U79250 Hs.93201 

X82125 Hs.25040 

AA284966 Hs.266308 

X07876 Hs.89791 

M625207 Hs.264915 

AW162948 Hs.64542 

AI741122 Hs.101810 

AW369771 Hs.52620 
N80701 
AI292036 

AF086401 Hs.293847 

AK001581 Hs.334828 

NM_007115 Hs.29352 

BE218705 Hs.121378 

M852985 Hs.89232 

AW975998 Hs.58595 

R80061 Hs.164478 

M263143 Hs.24596 

AF161521 Hs.9081 

L00190 Hs.75599 

AW014795 Hs.23349 

AW885757 Hs.257862 

AA210704 Hs.190465 

AL135352 Hs.255883 

AL137163 Hs.57549 

AK000322 Hs.18457 

AF017445 Hs.150926 

M452934 Hs.279813 

AW904466 Hs.321197 

D83781 Hs.22559 
AI005668 



414538 
453931 
427718 



444188 
457059 
407162 



N63855 



Homo sapiens clone TCCCTA00142 mRNA sequ 
ESTs, Weakly similar to T18712 hypotheti 
KIAA1795 protein 



hypothetical 



gb:QV3-LT0048-140200-083-e05 LT0048 Hom 
KIAA1804 protein 

ESTs, Moderately similar to ALU7_HUMAN A 
glycerol-3-phosphate dehydrogenase 2 (mi 
zinc finger protein 239 



wingless-type MMTV integration site fami 
gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 
Homo sapiens cDNA FLJ12908 fis, clone NT 
cleavage and polyadenylation specific fa 
Homo sapiens cDNA FLJ14232 fis, clone NT 
integrin, beta 8 



AA284166 
AA877124 
AI793163 

C16096 Hs.22826 

AA687322 Hs.192843 

AI827237 Hs.282884 

AW205197 Hs.240951 

AW503857 Hs.4007 

AB035898 Hs.150587 

S65791 Hs.89764 

AA662910 Hs.42635 

AF151852 Hs.111449 

AI372822 Hs.110103 

AA151520 Hs.334822 

AW612228 Hs.107987 

AL121278 Hs.25144 
AI798580 

X02544 Hs.572 

H05049 Hs.22269 

AI918049 Hs.124961 

NM_015122 Hs.56148 

AW013907 Hs.167531 

AW628655 Hs.98440 



ESTs, Moderately similar to S65657 alpha 
hypothetical protein FLJ10719; KIAA1794 
tumor necrosis factor, alpha-induced pro 
metallothionein-like 5, testis-specific 
chromobox homolog 5 (Drosophila HP1 alph 
ESTs, Weakly similar to I38022 hypotheti 
hypothetical protein FLJ21939 similar to 
RAD51-interacting protein 
phenylalanyl-tRNA synthetase beta-subuni 
serine (or cysteine) proteinase inhibito 
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to I38022 hypotheti 
hypothetical protein dJ473B4 
hypothetical protein FLJ20315 
fucose-1-phosphate guanylyltransferase 
hypothetical protein 

PDZ domain protein (Drosophila inaD-like 

KIAA0197 protein 

EST 

ESTs 



gb:on52g03.y5 NCI_CGAP_Co8 Homo sapiens 



hypothetical protein DKFZp434K2435 
CGI-94 protein 

RNA polymerase I transcription factor RR 
hypothetical protein MGC4485 



Hs.25933 ESTs 



zinc finger protein 



434370 AF130988 Hs.58346 

427651 AW405731 Hs.18498 

448666 NM_014953 Hs.323346 KIAA1008 pre 

450375 AA009647 Hs.8850 

410467 AF102546 Hs.63931 

456030 AA136106 Hs.184852 

434082 AI373481 Hs.131715 hypothetical protein PRO1777 

443646 AI085198 Hs.164226 ESTs 

434265 AA846811 Hs.130554 Homo sapiens cDNA: FLJ23089 fis, clone L 

418524 AA300576 Hs.85769 acidic 82 kDa protein mRNA 

432619 AW291722 Hs.278526 related to the N terminus of tre 
447207 M442233 Hs.17731 hypothetical protein FLJ12892 

423673 BE003054 Hs.1695 matrix metalloproteinase 12 (maci 

414853 U31116 Hs.77501 sarcoglycan, beta (43kD dystrophin-assoc 

424176 AL137273 Hs.142307 hypothetical protein 

452259 AA317439 Hs.28707 signal sequence receptor, gamm; 

418203 X54942 Hs.83758 CDC28 protein kinase 2 

435420 AI928513 Hs.59203 ESTs 

406666 V00495 Hs.184411 albumin 

417655 AA780791 Hs.14014 hypothetical protein FLJ14813 

449448 D60730 Hs.57471 ESTs 

421037 AI684808 Hs.197653 ESTs 
448310 AI480316 

408155 AB014528 Hs.43133 

413841 M34276 Hs.75576 

400110 NA NA 

400289 X07820 Hs.2258 matrix metalloproteinase 10 (stromelysin 

443715 AI583187 Hs.9700 cyclin E1 

408296 AL117452 Hs.44155 DKFZP586G1517 protein 

450164 AI239923 Hs.30098 ESTs 

451592 AI805416 Hs.213897 ESTs 

402373 AL135225 Hs.301885 dopachrome tautomerase (dopachrome delta 

426199 AA371865 Hs.97090 ESTs 

414148 BE084049 gb:PMO-BT0651-270400-003-f02 BT0651 Homo 

417006 AW673606 Hs.80758 aspartyl-tRNA synthetase 

449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cel 

434551 BE387162 Hs.280858 ESTs, Highly similar to A35661 DNA excis 

436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 

423337 NM_004655 Hs.127337 axin 2 (conductin, axil) 

416185 AW975861 Hs.47367 KIAA1785 protein 

443054 AI745185 Hs.8939 yes-associated protein 65 kDa 

432596 AJ224741 Hs.278461 matrilin3 1 

451229 AW967707 Hs.48473 ESTs 

413583 AL120806 Hs.5888 ESTs 

432702 AW973953 Hs.293744 ESTs 

437207 T27503 Hs.15929 hypothetical protein FLJ12910 

434699 AA643687 Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HE 

423697 BE088697 Hs.131834 Homo sapiens mRNA; cDNA DKFZp434B0328 (f 

428822 W28418 Hs.30715 potassium voltage-gated channel, Isk-rel 

432289 AI860145 Hs.55118 ESTs 

413384 NM_000401 Hs.75334 exostoses (multiple) 2 

433042 AM93534 ^281895 Homo sapiens cDNA FLJ^1660 fis, clone HE 

410094 BE147897 Hs.58593 general transcription factor IIF, polype 

441826 AW503603 Hs.129915 phosphotriesterase related 

444059 R69743 Hs.116774 integrin, alpha 1 

426262 AI792141 Hs.196270 folate transporter/carrier 

452641 AW952893 Hs.237825 signal recognition particle 72kD 

454403 BE065985 gb:RC3-BT0319-120200-014-a09 BT0319 Hom 

448315 AW290912 Hs.20797 ESTs, Weakly similar to ALU1_HUMAN ALU S 

411343 U77949 Hs.69563 CDC6 (cell division cycle 6, S. cerevisi 

409734 BE161664 Hs.56155 hypothetical protein 

454014 AW016670 Hs.233275 ESTs 

453116 AI276680 Hs.146086 ESTs 

449508 AK001566 Hs.23618 hypothetical protein FLJ10704 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 

435040 AI932350 Hs.152825 ESTs 

426249 F05422 Hs.168352 nucleoporin-like protein 1 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 

431716 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 

437631 AA764749 Hs.267245 hypothetical protein FLJ14803 

429118 H20669 Hs.35406 ESTs, Highly similar to unr ' 
405769 

438295 AI394151 Hs.37932 ESTs 

453628 AW243307 Hs.83937 hypothetical protein 

450096 AI682088 Hs.79375 holocarboxylase synthetase (biotin-[prop 

449318 AW236021 Hs.78531 Homo sapiens, Similar to RIKEN cDNA 5730 

423881 AK001720 Hs.134403 hypothetical protein FLJ10858 

408728 AL137379 Hs.47125 hypothetical protein FLJ13912 

422219 AW978073 Hs.1010 regulator of mitotic spindle assembly 1 

418661 NM_001949 Hs.1189 E2F transcription factor 3 

420726 K02402 Hs.1330 coagulation factor IX (plasma thrombopla 

418413 R95735 Hs.117753 ESTs, Weakly similar to A48665 cell pro! 

443354 AW970672 Hs.9247 protein kinase, AMP-activated, alpha 1 c 

406667 M12523 Hs.184411 albumin 

436411 AW674352 gb:ba63o07.y1 NIH_MGC_12 Homo sapiens cD 

417246 AI760098 Hs.21411 ESTs 

410664 NM_006033 Hs.65370 lipase, endothelial 

432686 BE223007 Hs.152460 Homo sapiens cDNA FLJ12909 fis, clone NT 

442881 AI023175 Hs.167022 ESTs 

432356 AA831032 Hs.111670 ESTs, Highly similar to JC2257 prolyl ol 
450218 R02018 Hs.168640 ankylosis, progressive (mouse) homolog 

405460 NA NA 

452824 W27643 Hs.73965 splicing factor, arginine/serine-rich 2 
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AA250970 


Hs.251946 


poIy(A)-bindfng protein, cytoplasmic 1-i 








Hs.155485 








W847194 


Hs.232002 








AF039390 




tumor necrosis factor (ligand) superfami 






AB020684 




KIAA0877 protein 


3'69 






Hs.91096 






















hypothetical protein FU13159 




417791 


AW965339 


Hs!l 11471 


ESTs 


3^66 




AK000275 




protein kinase C binding protein 1 


3.66 


432023 




Hs.330144 








AI538226 


Hs.32976 




3.65 














AA278865 


Hs.88523 




















Hs.172405 








AV653742 




hypothetical protein DKFZp761J139 






BE1 58006 


Hs.212296 


















AW301344 












Hs.'l 59452 


3^63 






Hs. 181107 


RAB22A, member RAS oncogene family 






AB001106 


Hs.151413 








BE616633 


Hs.170195 


















AA740875 




ESTs, Moderately similar to I38022 hypot 
















BE280074 










AW1 35714 


Hs.283127 


ESTs, Weakly similar to T19201 hypotheti 






AW290975 


Hs.1 18923 












Homo sapiens mRNA; cDNA DKFZp586D0923 (f 






AW505139 




Homo sapiens mRNA; cDNA DKFZp547C244 [fr 






AW082279 






3^60 








hypothetical protein FLJ22995 






AW959075 




ESTs, Moderately similar to I38022 hypot 


Iso 




AB028992 




KIAA1069 protein 










Homo sapiens mRNA; cDNA DKFZp434P1 1 15 (f 










KIAA0074 protein 






AW375974 










AA318501 






























AW604297 




















AI743539 




















AB037837 




hypothetical protein FU20357 










Homo sapiens, clone IMAGE:4132043, mRNA, 










ESTs, Moderately similar to ALU6 HUMAN A 








Hs.31570 


ESTs, Weakly similar to KIAA1324 protein






Z47542 


Hs.179312








AV655701 


Hs.75183 






441085 


AW136551 


Hs.181245 


II m o s f i DNAFU125 ' li oik' J 


3.54 


434206 


AW136973 


Hs.288516 


ESTs A'( 1 ysi o S69890 mTogen i 


3.54 




AU076643 


Hs.313 








AW083920 


Hs.16098 








AW977653 


Hs.75319 








R68649 


Hs.278359 


absent in melanoma 1 like






AA029722 


Hs.2173 








AA994364




ESTs, Weakly similar to T25472 hypotheti






AI572438 


Hs.32976








AW369351 


Hs.287955 








NM_004529 Hs.404










Hs.122460 








AI127076 


Hs.334473 


hypothetical protein DKFZp56401278 






AW504944 


Hs.162990 


Homo sapiens cDNA FLJ14193 fis, clone NT






W33165 


Hs.22983 








AI080042 


Hs.336901 






41914C AI982647 Hs.215725 ESTs 3.44

446901 AI347274 gb:tc05d02.x1 NCI_CGAP_Co16 Homo sapiens

451806 NM_003729 Hs.27075 RNA 3'-terminal phosphate cyclase 3.42

411571 Hs.70811 hypothetical protein FLJ20516 3.42

442717 R88362 Hs.180591 ESTs, Weakly similar to T23976 hypotheti 3.41

443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.41

419131 AA406293 Hs.41167 ESTs 3.41

430264 AA470519 gb:nc71f10.s1 NCI_CGAP_Pr1 Homo sapiens 3.40

450159 AI702416 Hs.200771 ESTs, Moderately similar to A Chain A, T 3.40

453531 AA417940 Hs.271400 ESTs, Weakly similar to JC5795 CDEP prot 3.40

444826 AI674482 Hs.148441 ESTs 3.40

445354 AV653485 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 3.40

444078 BE246919 Hs.10290 U5 snRNP-specific 40 kDa protein (hPrp8- 3.40
418939 AW630803 Hs.89497 lamin B1

418134 AA397769 Hs.86617 ESTs 

430544 AA481066 Hs.105153 Homo sapiens, clone IMAGE:3461987, mRNA,

427927 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), 

410804 U64820 Hs.66521 Machado-Joseph disease (spinocerebellar

410406 AI969703 Hs.1466 glycerol kinase

408494 AA554714 Hs.187578 Homo sapiens cDNA FLJ11639 fis, clone HE

452930 AW195285 Hs.194097 ESTs, Weakly similar to I38022 hypotheti

446432 AI377320 Hs.150058 ESTs 

456653 AI807519 Hs.104520 Homo sapiens cDNA FLJ13694 fis, clone PL

425322 U63630 Hs.155637 protein kinase, DNA-activated, catalytic

428261 AW006855 Hs.118392 ESTs, Weakly similar to LNHUER IgE Fc re

421039 NM_003478 Hs.101299 cullin 5

407819 R42185 Hs.274803 ESTs 

424698 AA164366 Hs.151973 hypothetical protein FU23S11 

433361 AW469373 Hs.300141 ribosomal protein L39 

435022 AW183385 Hs.54627 ESTs, Weakly similar to FTDH_HUMAN 10-FO

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 

454018 AW016892 Hs.100855 ESTs 

439871 R88518 Hs.46736 hypothetical protein FLJ23476 

453941 U39817 Hs.36820 Bloom syndrome 

411560 AW851186 Hs.179909 hypothetical protein FLJ22995

435532 AW291488 Hs.117305 Homo sapiens, clone IMAGE:3582908, mRNA

438008 AA775026 Hs.203802 ESTs

421246 AW582962 Hs.102897 CGI-47 protein

451707 AW051061 Hs.60973 ESTs 

457574 H88717 Hs.27774 ESTs, Highly similar to AF161349 1 HSPCO

443613 AI079356 gb:oz39b09.s1 Soares NhHMPu S1 Homo sapi 

415857 AA866115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE

438746 AI885815 Hs.184727 ESTs

408660 AA525775 Hs.292523 ESTs, Moderately similar to PC4259 ferri

427738 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze

458855 AW361299 Hs.107000 hypothetical protein FLJ11294

417221 AW379029 Hs.118338 ESTs, Weakly similar to unnamed protein 

424770 AA425562 Hs.11065 Homo sapiens HDCME13P mRNA, partial cds

417720 AA205625 Hs.208067 ESTs

428571 NM_006531 Hs.2291 Probe hTg737 (polycystic kidney disease,

452862 AW378065 Hs.8687 ESTs 

414343 AL036166 Hs.323378 coated vesicle membrane protein 

437222 AL117588 Hs.12778 ESTs

422665 AJ011812 Hs.119018 transcription factor NRF

414706 AW340125 Hs.76989 KIAA0097 gene product

446565 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransf 

427576 BE242611 Hs.2173 fucosyltransferase 4 (alpha (1,3) fucosy

456132 BE219771 Hs.237146 hypothetical protein FLJ12752

407305 AA715284 gbTO35t03.r1 NO CGAP Br5 Homo sapiens 

419741 NM_007019 Hs.93002 ubiquitin carrier protein E2-C

424581 M62062 Hs.150917 catenin (cadherin-associated protein), a

445592 AV654382 Hs.17947 ESTs, Weakly similar to T16534 hypotheti

453320 AW450240 Hs.257274 ESTs 

451797 AW663858 Hs.333513 small inducible cytokine subfamily E, me 

413930 M86153 Hs.75618 RAB11A, member RAS oncogene family

410659 AI080175 Hs.68826 ESTs 

446202 AI279706 Hs.149474 ESTs 

432193 AA372264 Hs.273193 hypothetical protein FLJ10706

439262 AA832333 Hs.333045 ESTs 

401823 NA NA 

441264 AA927170 Hs.23290 ESTs 

424081 NM_006413 Hs.139120 ribonuclease P (30kD)

408321 AW405882 Hs.44205 cortistatin 

447432 AW958473 Hs.301957 nudix (nucleoside diphosphate linked moi 
404519 

443268 AI800271 Hs.129445 hypothetical protein FLJ12496

422660 AW297582 Hs.103267 hypothetical protein FLJ22548 similar to 

427961 AW293165 Hs.143134 ESTs 

427660 AI741320 Hs.114121 Homo sapiens cDNA: FLJ23228 fis, clone C

411643 AI924519 Hs.192570 hypothetical protein FLJ22028

458652 AW375610 Hs.117102 hypothetical protein FLJ13045 similar to

426472 BE246138 Hs.30853 ESTs 

410389 AW954049 Hs.8177 ESTs, Weakly similar to PIHUB6 salivary

439979 AW600291 Hs.6823 hypothetical protein FLJ10430

422363 T55979 Hs.115474 replication factor C (activator 1) 3 (38

443162 T49951 Hs.9029 DKFZP434G032 protein 

431678 AW072372 Hs.267446 hypothetical protein FLJ11184

430439 AL133561 Hs.241426 DKFZP434B061 protein

407201 N31998 Hs.164256 hypothetical protein FLJ20657

437905 AW363121 Hs.175596 ESTs, Weakly similar to T26935 hypotheti

434160 BE551196 Hs.114275 ESTs

407995 AI094748 Hs.100134 hypothetical protein FLJ12787

412966 BE243311 Hs.8024 IK cytokine, down-regulator of HLA II 

414386 X00442 Hs.75990 haptoglobin 

444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti
Hs.82302






423629 


AW021173 




Homo sapiens cDNA: FLJ21909 fis, clone H 






































Hs.282387 


Homo sapiens cDNA: FLJ21837 fis, clone H 






































Hs.28739 












jagged 1 (Alagille syndrome) 






AW015211
































Hs.180059


Homo sapiens cDNA FLJ20653 fis, clone KA










ESTs, Weakly similar to 2017205A dihydro










hypothetical protein FLJ22729






AW847814 


Hs.289005 


Homo sapiens cDNA: FLJ21532 fis, clone C














443584 AI807036 Hs.267245 hypothetical protein FLJ14803 3.13

406668 T62745 Hs.184411

AI694413 olfactory receptor, family 2, subfamily 3.13

437594 AA761431 Hs.74335 heat shock 90kD protein 1, beta 3.13

433759 AA680003 Hs.109363 Homo sapiens cDNA: FLJ23603 fis, clone L

420911 U77413 Hs.100293 O-linked N-acetylglucosamine (GlcNAc) tr 3.11

414080 AA135257 Hs.47783 B aggressive lymphoma gene

450209 AW008921 Hs.13138 Homo sapiens, clone IMAGE:3448343, mRNA, 3.11

441790 AW294909 Hs.132208 ESTs

439352 BE614347 Hs.169615 hypothetical protein FLJ20989 3.10

449664 R06212 Hs.127733 ESTs

435979 W03698 Hs.83513 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.10

424602 AK002055 Hs.151046 hypothetical protein FLJ11193 3.10

402963 3.10

42896? AW978441 ESTs 3.10



455838 BE145808 gb:MRO-HT0208-101299-103-f11 HT0208 Homo 

407502 U52096 gb:Human zinc finger protein (kr-znf1) m 

426853 U32974 Hs.172777 baculoviral IAP repeat-containing 4 

417845 AL117461 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 [f

421056 AI076890 Hs.146847 TRAF family member-associated NFKB activ

420617 AK001652 Hs.99423 ATP-dependent RNA helicase 

421841 AA908197 Hs.108850 MAK-related kinase 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell

408353 BE439838 Hs.44298 mitochondrial ribosomal protein S17 

433037 NM_014158 Hs.279938 HSPC067 protein

443183 R16258 Hs.6217 Homo sapiens cDNA FLJ12521 fis, clone NT

457726 AI217477 Hs.194591 ESTs 

415786 AW419196 Hs.257924 hypothetical protein FLJ13782

433013 AI697890 Hs.127337 axin 2 (conductin, axil)

417601 NM_014735 Hs.82292 KIAA0215 gene product

420276 AA290938 Hs.190561 ESTs, Highly similar to SORT_HUMAN SORTI

443323 BE560621 Hs.9222 estrogen receptor binding site associate

446223 BE300091 Hs.119699 hypothetical protein FLJ12969

425851 NM_001490 Hs.159642 glucosaminyl (N-acetyl) transferase 1, c

436203 BE384982 Hs.5076 Homo sapiens cDNA: FLJ22128 fis, clone H

416402 NM_000715 Hs.1012 complement component 4-binding protein,

436554 AI985810 Hs.301173 ESTs 

413801 M62246 Hs.35406 ESTs, Highly similar to unnamed protein 

416248 H99169 Hs.23450 mitochi 

445413 AA151342 Hs.12677 CGI-147 protein

452909 NM_015368 Hs.30985 pannexin 1

447048 AW393080 Hs.228320 hypothetical protein FLJ23537

425942 AU077195 Hs.164036 glucosamine (N-acetyl)-6-sulfatase (Sanf

428454 U55936 Hs.184376 synaptosomal-associated protein, 23kD

411864 AW948147 gb:RCO-MT0013-280300-03U03 MT0013 Hom

458632 AI744445 Hs.167073 Homo sapiens cDNA FLJ13047 fis, clone NT

448292 BE281316 Hs.47334 hypothetical protein FLJ14495

459055 N23235 Hs.30557 ESTs, Woakly m lai t< B i tJiP- patriot 

402167 

433133 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated pr 

437828 AW976806 Hs.73149 paired box gene 8 

404232 

418164 AI761820 Hs.41074 ESTs, Weakly similar to I39294 McLeod sy
412610 X90908 Hs.74126 fatty acid binding protein 6, ileal (gas 
452787 AW294022 Hs.222707 KIAA1718 protein 
425782 U66468 Hs.159525 cell growth regulatory with EF-hand doma 
410718 AI920783 Hs.191435 ESTs 
419196 AF110908 Hs.297660

446861 AI696519 Hs.14427 

425492 AL021918 Hs.158174 

452834 AI638627 Hs.105685 

410102 AW248508 Hs.279727 

416568 H64844 Hs.138558 

425834 NM_001639 Hs.1957

426711 AA383471 Hs.180669 

412719 AW016610 Hs.129911 



TNF receptor-associated factor 3 
Homo sapiens cDNA: FLJ21800 fis, clone H
zinc finger protein 184 (Kruppel-like)
KIAA1688 protein

Homo sapiens cDNA FLJ14035 fis, clone HE
ESTs 

amyloid P component, serum 



429703 T93154 



AI200540 Hs.14877 

NM_015836 Hs.227274

D26528 Hs.123058

AA312551 Hs.230157 

BE350295 Hs.15032 

AW170015 Hs.6594 

AK000492 Hs.98806

U80456 Hs.27311

AI370188 Hs.211454

BE155293 Hs.76064

AI867277 Hs.183733

AI365208 Hs.293606

NM_004458 Hs.81452



cartilage oligomeric matrix protein (pse 
ESTs 

ATP-binding cassette, sub-family C (CFTR 
ESTs 

ESTs, Weakly similar to (defline not ava
tryptophanyl tRNA synthetase 2 (mitochon 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 



RAN binding protein 17 



ribosomal protein L27a



gb:IL3-CT0214-291299-052-A12 CT0214 Homo



Pkey CAT number Accession 



411765 125700.1 

411864 1262055.1 

412359 129085.1 

413516 1374595.1 

414148 142133.1 

424492 240008.1

430264 315008.1 

431064 327472.1 

433687 373061.1



435447 406400.1

436411 419334.1 

443613 575391.1 

446901 697809.1 

448310 757918.1 

451401 868474.1 

454403 1170435.1 

454975 1247077.1 

455838 1374605.1 



H43346 AA248302 AA095182

AW948147 BE092318 AW948138 AW948130 AW946148 AW948129 AW948136 AW948152 AW948144 AW948137 AW948160 
AW837985 AW837938 AA101955 AW837913 AW837935 

BE145907 BE145796 BE145803 BE145851 BE145923 BE145812 BE145809 BE145852 BE145856 

BE084049 AW292907 AA135984 

AI133482 AI207619 AA341626

AA470519 BE303010 BE302954 BE384120

AI903735 AA491283 AI694953 AW976903 AA761362

AA743991 AA604852 AW272737 

AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 AA055556 AW858231
AW857M1 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 AA227092 AA227080 T12379 AA092174 T61139
AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730
AA157715 AA053524 AW849581 AW854566 C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE166909 AA226824 AI829309 AW991957
N66951 AA527374 H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659
BE081531 H59570 

AI872932 AA682306 BE220163 W88595 T81307 H91447

AW674352 AA715374 Z25205

AI079356 W23287

AI347274 AW844024

AI480316 AW847535

AI793163 AW875182 AW875178 AW875175 



AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW848140 AW848571 AW848009 AW848067 AW848069 AW848905
AW848214 

BE145808 BE145807 BE181883



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

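As an illustrative sketch only (not part of the original disclosure), the nucleotide-position entries for predicted exons in this table, for example 1414-1513,1624-1756, can be read as comma-separated start-end pairs. The function names `parse_nt_positions` and `total_exon_length` are hypothetical, and the sketch assumes that both coordinates of each pair are inclusive.

```python
def parse_nt_positions(field):
    """Split a position string such as '1414-1513,1624-1756' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray or trailing commas
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_positions("1414-1513,1624-1756")
    print(exons, total_exon_length(exons))
```

Entries damaged in extraction (for example, a period in place of a comma between two ranges) would need to be corrected before parsing.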





WO 03/042661 



PCT/US02/36810 



Pkey Ref Strand Nt_position



401480 7321503 Plus 

401644 8576138 Plus 

401823 2262095 Minus 

402167 8571795 Plus 

402963 5419653 Minus 

403137 9211494 Minus 

403776 7770611 Minus 

404025 7341444 Plus 

404232 8218045 Minus 

404253 9367202 Minus 

404477 8080699 Plus 

404516 8151967 Plus 

404519 8152000 Plus 

404567 7249169 Minus 

404996 6007890 Plus 

405394 6624123 Minus 



166120-166347,166451-166557,169651-169832



92349-92572,92958-93084,93579-93712,93949-94072,94591-94748,95214-95337 

1414-1513,1624-1756 

131740-131905 

71800-71956 



114153-114322 

12817-13000 

101320-101501 

37999-38145,38652-38998,39727-39872,40557-40674,42351-42450 
31900-32373 



76844-77193 
54304-54584 
64689-64798 
7513-7673 



TABLE 7A: 516 GENES UP-REGULATED IN COLON CANCER COMPARED TO NORMAL COLON 



denominator value and the ratio was equal to or greater than 5.( 



Pkey: Unique Eos probeset identifier number
Unigene Title: Unigene gene title
Ratio of tumor to normal colon
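As a rough sketch (an illustration, not part of the original disclosure), a row of the table that follows can be split into its Pkey, ExAccn, UnigeneID, and Unigene Title fields. The function name `parse_row` and the regular expression are assumptions; the pattern presumes a six-digit Pkey and a UnigeneID of the form Hs.<digits> or NA, and it treats every field after the Pkey as optional because some rows lack them.

```python
import re

# Hypothetical row pattern: Pkey, then optional ExAccn, UnigeneID, and title.
ROW = re.compile(
    r"^(?P<pkey>\d{6})"
    r"(?:\s+(?P<accn>(?!Hs\.)\S+))?"       # accession, but never an Hs.* token
    r"(?:\s+(?P<unigene>Hs\.\d+|NA))?"     # UniGene cluster ID, or NA
    r"(?:\s+(?P<title>.+))?$"              # remainder of the line is the title
)


def parse_row(line):
    """Return a dict of fields for one table row, or None if it is not a row."""
    match = ROW.match(line.strip())
    return match.groupdict() if match else None


if __name__ == "__main__":
    print(parse_row("453863 X02544 Hs.572 orosomucoid 1"))
    print(parse_row("401742 NA NA"))
```

Rows whose identifiers were damaged in extraction (for example, a space inside an Hs number) will not match cleanly and would need repair first.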



Pkey ExAccn UnigeneID Unigene Title



441031 
406667 
409041 
428330 
421552 
429201 
452281 



421462 
424252 
452823 
432340 
433447 

425260

444754

439518

443211



AI110684 Hs.7645 
M12523 Hs.184411 
AB033025 Hs.50081 
Hs.2256 
Hs.105700 
Hs.198246 
Hs.28792 
Hs.157601 
AI767756 Hs.82302 
AW075485 Hs.286049 
M34276 Hs.75576 
AI687303 Hs.285529
AW847814 Hs.289005 
AW378065 Hs.8687 
AI267700 Hs.317584 
□31152 Hs.179729 
R27496 Hs.1378 
AL110203 Hs.138411 
AL359055 Hs.67709 
AA045573 Hs.22900 
AF016495 Hs.104624 
Hs.143811



AB012124 



secreted frizzled-related protein 4
group-specific component (vitamin D bind
Homo sapiens cDNA FLJ11041 fis, clone PL
ESTs

Homo sapiens cDNA FLJ14814 fis, clone NT



Homo sapiens cDNA: FLJ21532 fis, clone C



collagen, type X, alpha 1 (Schmid metaph

Homo sapiens mRNA; cDNA DKFZp586J1922 (f
Homo sapiens mRNA full length insert cDN 
nuclear factor (erythroid-derived 2)-lik 



hypothetical protein FLJ20513
transcription factor-like 5 (basic helix
gb:nj21d02.s1 NCI_CGAP_AA1 Homo sapiens 
neuronal pentraxin II 
haptoglobin 

phenylalanine hydroxylase 
transmembrane 4 superfamily member 4 
gb:zd60d04.r1 Soares_fetal_heart_NbHH19W
ESTs 



C-reactive protein, pentraxin-related 



448974 AL049390 Hs.22689 Homo sapiens mRNA; cDNA DKFZp58601318 (f 
416402 NM_000715 Hs.1012

453863 X02544 Hs.572 orosomucoid 1 

441243 AI767056 Hs.193002 ESTs 

413318 AU076607 Hs.75285 inter-alpha (globulin) inhibitor, H2 pol 

451917 AW391351 Hs.50820 Homo sapiens unknown mRNA 

433213 AW665130 Hs.137190 ESTs 

428261 AW006855 Hs.118392 ESTs, Weakly similar to LNHUER IgE Fc re

438578 AA811244 Hs.164168 ESTs

429170 NM_001394 Hs.2359 dual specificity phosphatase 4

417006 AW673606 Hs.80758 aspartyl-tRNA synthetase

449199 AI990122 Hs.196988 ESTs 

436393 AW022213 Hs.143617 ESTs 

413585 AI133452 Hs.75431 fibrinogen, gamma polypeptide 

451561 N52812 Hs.177403 ESTs 

420734 AW972872 Hs.293736 ESTs 

422420 U03398 Hs.1524 tumor necrosis factor (ligand) superfami

441377 BE218239 Hs.202656 ESTs 

435981 H74319 Hs.188620 ESTs 

417296 L36196 Hs.81884 sulfotransferase family, cytosolic, 2A, 

428664 AK001666 Hs.189095 similar to SALL1 (sal (Drosophila)-like 

459370 AA889982 Hs.271826 ESTs, Weakly similar to I38022 hypotheti 

430290 AI734110 Hs.136355 ESTs 

450221 AA328102 Hs.24641 cytoskeleton associated protein 2 

450628 AW382884 Hs.204715 ESTs 

446232 AI281848 Hs.194691 retinoic acid induced 3 

428223 AA424313 Hs.98402 ESTs 

432582 AI623817 Hs.168457 ESTs 

407884 BE075316 Hs.95011 syntrophin, beta 1 (dystrophin-associate 

428493 AK001745 Hs.184628 hypothetical protein FLJ10883

407202 N58172 Hs.109370 ESTs 

422109 S73265 Hs.1473 gastrin-releasing peptide 

434699 AA643687 Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HE

447532 AK000614 Hs.18791 hypothetical protein FLJ20607 

443162 T49951 Hs.9029 DKFZP434G032 protein 

450149 AW969781 Hs.132863 Zic family member 2 (odd-paired Drosophi 

446372 AB020644 Hs.14945 long fatty acyl-CoA synthetase 2 gene 

453909 AW004045 Hs.203365 ESTs 

416655 AW968613 Hs.79428 BCL2/adenovirus E1B 19kD-interacting pro 

452903 AI953425 Hs.246911 ESTs, Weakly similar to I38022 hypotheti 

433011 H07960 Hs.306044 CGI-05 protein 

423575 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 

456777 AA524285 Hs.154172 ESTs, Moderately similar to BCGF_HUMAN B 

425745 U44060 Hs.14427 Homo sapiens cDNA: FLJ21800 fis, clone H

407168 R45175 Hs.117183 ESTs

407633 NM_007069 Hs.37189 similar to rat HREV107

400534

450434 AA166950 Hs.195870 hypothetical protein FLJ14991

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-l

440526 AI832243 Hs.211471 ESTs 

427544 AI767152 Hs.181400 ESTs, Weakly similar to I78885 serine* 

447974 R76886 gb:yi64b03.s1 Soares placenta Nb2HP Homo 

431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 

453922 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 

457065 AI476318 Hs.192480 ESTs 

408950 AA707814 Hs.14945 long fatty acyl-CoA synthetase 2 gene 

419852 AW503756 Hs.286184 hypothetical protein dJ551D2.5

414718 H95348 Hs.107987 ESTs 

447505 AL049266 Hs.18724 Homo sapiens mRNA; cDNA DKFZp564F093 (fr 
404567 

428536 AI143139 Hs.2288 visinin-like 1 

444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 

437267 AW511443 Hs.258110 ESTs 

420583 H77859 Hs.65450 reticulon 4

407244 M10014 Hs.75431 fibrinogen, gamma polypeptide

425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II

449655 AI021987 Hs.59970 ESTs 

459504 BE514127 gb:601315974F1 NIH_MGC_8 Homo sapiens cD 

434609 R76593 gb:yi60c11.r1 Soares placenta Nb2HP Homo

412104 AW205197 Hs.240951 Homo sapiens, Similar to RIKEN cDNA 2210
447863 AL047611 Hs.288885 Homo sapiens cDNA FLJ14246 fis, clone OV

448106 AI800470 Hs.171941 ESTs 
439192 AW970536 Hs.105413 ESTs 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin,

427535 R29543 Hs.2164 pro-platelet basic protein (includes pla
440591 AA431599 Hs.132799 hypothetical protein FLJ23451
440404 AI015881 Hs.324527 mitochondrial ribosomal protein S5
426283 NM_003937 Hs.169139 kynureninase (L-kynurenine hydrolase)

401742 NA NA 
416393 N54037 Hs.262869 plasminogen-like 
413339 AI818080 Hs.194290 ESTs 
437641 AA811452 Hs.291911 ESTs 

428743 AL080060 Hs.301549 Homo sapiens mRNA; cDNA DKFZp564H172 (fr 
431319 AA873350 Hs.302232 ESTs 
434008 AA740878 Hs.112982

446311 AW007294 Hs.149795 

419968 X04430 Hs.93913 

413597 AW302885 Hs.117183 

440527 AV657117 Hs.184164 

409327 L41162 Hs.53563

444107 T46839 Hs.10319 

456653 AI807519 Hs.104520 



432867 AW016936 Hs.233364 ESTs 



collagen, type IX, alpha 3 
UDP glycosyltransferase 2 family, polype 
Homo sapiens cDNA FLJ13694 fis, clone PL 



452712 AW838616 

419131 AA406293

444783 AK001468 

418895 AA894638 

422665 AJ011812 

409757 NM_001898

419752 AA249573 

412446 AI768015 

433285 AW975944 

414538 AW612228 

449318 AW236021 

430835 AI240006 

445467 AI239832 

445537 AJ245671 



Hs.92127 
Hs.237396 
Hs.107987 
Hs.78531 



transcription factor NRF 
cystatin SN

ESTs, Moderately similar to ZN91_HUMAN Z



Homo sapiens, Similar to RIKEN cDNA 5730 




418379 AA218940 

424560 AA158727

453116 AI276680 

419929 U90268 

409687 T51125 

407790 AI027274 

416661 AA634543 



similarity lo L 

TNFreceptor-as: 

Homo sapiens cDNA: FLJ21869 fis, clone H
transcription elongation factor A (SII)-
gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens 

' family 7, (c " ' 
Homo sapiens cDNA FLJ13997 fis, clone Y7
fidgetin-like 1

by clone 23733 



Hs.288941 
Hs.79440 
Hs.83337 



422762 AL03132C Hs.11 



Hs.106260 
Hs.16098 
Hs.21479 
Hs.90765 
Hs.184411 
Hs.83169 
Hs.58582 

Hs.272620 
Hs.112264 
Hs.191060 
Hs.133529 
Hs.15423 



fibrinogen, A alpha polypeptide 



Homo sapiens cDNA FLJ12789 fis, clone NT
pregnancy specific beta 1-glycoprotein 9 

hypothetical protein HDCMC04P 
KIAA0754 protein 

lipopolysaccharide specific response-7 p 
ESTs 
ESTs 

Homo sapiens cDNA FLJ23435 fis, clone H
ESTs 

gbdf27e02.y1 Morton Fetal Cochlea He 
ribosomal protein S5 
ESTs, Moderately similar to PC4259 ferri 
423952 AW877787 Hs.13S102 KIAA0853 protein

431193 AW749505 Hs.296770 KIAA1719 protein 

413530 AA130158 Hs.19977 ESTs, Moderately similar to ALU8_HUMAN A 

417479 AI057052 Hs.133554 ESTs, Weakly similar to Z195_HUMAN ZINC 

424613 AL079850 Hs.151236 highly charged protein 

417720 AA205525 Hs.208057 ESTs 

449347 AV649748 Hs.295901 KIAA0493 protein 

449444 AW818436 Hs.23590 solute carrier family 16 (monocarboxylic

447499 AW262580 Hs.147674 protocadherin beta 16

426890 AA393167 Hs.41294 ESTs

450568 AL050078 Hs.25159 Homo sapiens cDNA FLJ10784 fis, clone NT

445019 AI205540 Hs.281295 ESTs 

419474 AW968619 Hs.155849 ESTs 

417015 M83772 Hs.80876 flavin containing monooxygenase 3 

411765 H43346 gb:yp09a04.r1 Soares breast 3NbHBst Homo

448816 AB033052 Hs.22151 KIAA1226 protein

416143 AI955650 Hs.79033 glutaminyl-peptide cyclotransferase (glu

451815 AW974911 Hs.184793 Homo sapiens cDNA: FLJ21880 fis, clone H

420777 AA280223 Hs.130865 ESTs 

414463 T69078 Hs.76177 alpha-1-microglobulin/bikunin precursor 

409269 AA576953 Hs.22972 hypothetical protein FLJ13352

443614 AV655386 Hs.7645 fibrinogen, B beta polypeptide

448706 AW291095 Hs.21814 interleukin 20 receptor, alpha

441124 T97717 Hs.119563 ESTs 

414680 AA743331 Hs.272572 hemoglobin, alpha 2 

446432 AI377320 Hs.150058 ESTs

439295 AW206091 Hs.253536 ESTs 

436902 AW247145 Hs.192729 ESTs 

428679 AA431765 gb:zv80c03.s1 Soares_testis_NHT Homo sap

414865 AA157156 Hs.274414 hypothetical protein FLJ14457

421373 AA808229 Hs.167771 ESTs 

432435 BE218886 Hs.282070 ESTs 

427933 AW974643 Hs.190571 ESTs 

436330 NM_004413 Hs.109 dipeptidase 1 (renal)

433800 AI034361 Hs.135150 lung type-I cell membrane-associated gly

418763 AK000219 Hs.88367 hypothetical protein FLJ20212

446322 N23033 Hs.155814 ESTs

442577 AA292998 Hs.163900 ESTs

429486 AF155827 Hs.203963 hypothetical protein FLJ10339

412280 AW20511G Hs.272814 hypothetical protein DKFZp434E1723 

436679 AI127483 Hs.120451 ESTs, Weakly similar to unnamed protein 

429125 AA446854 Hs.271004 ESTs, Weakly similar to I38022 hypotheti 

453204 R10799 Hs.191990 ESTs 

431576 M76665 Hs.275215 hydroxysteroid (11-beta) dehydrogenase 1

427878 C05766 Hs.131022 CGI-07 protein

454438 AA224053 Hs.172405 cell division cycle 27

424402 M63108 Hs.1769 luteinizing hormone/choriogonadotropin r

438394 BE379623 Hs.27693 peptidylprolyl isomerase (cyclophilin)-l

414271 AK000275 Hs.75871 protein kinase C binding protein 1

417168 AL133117 Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1121 (f

419629 AB020695 Hs.91662 KIAA0888 protein 

451686 AA059246 Hs.110293 ESTs 

430829 AW451999 Hs.194024 ESTs 

446501 AI302616 Hs.150819 ESTs 

442973 BE567665 Hs.288550 Homo sapiens cDNA: FLJ23156 fis, clone L

457030 AI301740 Hs.173381 dihydropyrimidinase-like 2 

437773 U24186 Hs.283018 replication protein A complex 34 kd subu 

416018 AW138239 Hs.78977 proprotein convertase subtilisin/kexin t 

425478 AB007953 Hs.268840 ESTs 

411543 AI924519 Hs.192570 hypothetical protein FLJ22028

418555 AI417215 Hs.87159 hypothetical protein FLJ12577

453102 NM_007197 Hs.31664 frizzled (Drosophila) homolog 10

433615 AA732982 Hs.269607 ESTs, Weakly similar to ALU1_HUMAN ALU S

450638 AK001826 Hs.25245 hypothetical protein FLJ11269

450480 X82125 Hs.25040 zinc finger protein 239 

445191 AF048686 Hs.12393 dTDP-D-glucose 4,6-dehydratase 

414575 H11257 Hs.22968 Homo sapiens clone IMAGE:451939, mRNA se

432639 AW973785 gb:EST385886 MAGE resequences, MAGM Homo 

410116 AW630671 Hs.58636 squamous cell carcinoma antigen recogniz 

449894 AK001578 Hs.24129 CLLL7 protein 

442914 AW188551 Hs.99519 hypothetical protein FLJ14007

424745 AA214618 Hs.152759 activator of S phase kinase

441801 AW242799 Hs.86366 ESTs 

435542 AA687376 Hs.269533 ESTs 

427072 H38046 Hs.303193 ESTs 

418051 AW192535 Hs.19479 ESTs 

436217 T53925 Hs.107 fibrinogen-like 1 

439809 R41396 Hs.101774 hypothetical protein FU23045 

430704 AW813091 Hs.335799 ESTs 

410227 AB009284 Hs.61152 exostoses (multiple)-like 2

417067 AJ001417 Hs.81086 solute carrier family 22 (extraneuronal
428392 H10233 Hs.2265 secretory granule, neuroendocrine protei

452198 AI097560 Hs.61210 ESTs, Weakly similar to I38022 hypotheti

438202 AW169287 Hs.22588 ESTs 

458311 AF069478 gb:AF069478 Homo sapiens astrocytoma lib 

451389 N73222 Hs.279009 matrix Gla protein 

427899 AA829286 Hs.332053 serum amyloid A1 

448693 AW004854 Hs.228320 hypothetical protein FLJ23537

425492 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) 

435532 AW291488 Hs.117305 Homo sapiens, clone IMAGE:3582908, mRNA 

433908 AW298141 Hs.157975 ESTs

430114 AA847744 Hs.99640 ESTs 

434032 AW009951 Hs.206892 ESTs 

444656 AI277924 Hs.145199 ESTs 

433607 AA602004 Hs.23260 ESTs 

440659 AF134160 Hs.7327 claudin 1 

435663 AI023707 Hs.134273 ESTs 

409916 BE313625 Hs.57435 solute carrier family 11 (proton-coupled 

436547 AJ297351 Hs.30824 leucine zipper transcription factor-like

447500 AI381900 Hs.159212 ESTs

407237 AA169872 Hs.6216 Homo sapiens hepatocellular carcinoma-as

417715 AW969587 Hs.86366 ESTs 

423276 AC003034 Hs.126261 Homo sapiens Chromosome 16 BAC clone CIT 

438138 R98299 Hs.177502 ESTs 

424590 AW966399 Hs.46821 hypothetical protein FLJ20086

416857 AA188775 Hs.292453 ESTs

427667 AK001279 Hs.180171 Homo sapiens cDNA FLJ10417 fis, clone NT

— Hs.269710 ESTs 





AA037145






6.18 


438940 


AF075045 






6.18 










6.15 






Hs.59962 




6.12








CDC7 (cell division cycle 7, S. cerevisi 






AA775910 










AI476318 






6.10 








ESTs, Highly similar to T51146 ring-box


6.10 








ESTs, Weakly similar to PIHUB6 salivary
















AA347746 




ESTs, Weakly similar to ZN43_HUMAN ZINC
















NM_003528






6.08


452689 F33868 Hs.284176 transferrin

453804 AA300204 Hs.35276 KIAA0852 protein 6.06

Homo sapiens clone TCCCTA00142 mRNA sequ

408243 Y00787 Hs.624 interleukin 8 6.04

420721 AA927802 Hs.159471 ZAP3 protein 6.04

429393 AA383024 Hs.201603 Homo sapiens mRNA; cDNA DKFZp434D0917 (f 6.04

435420 AI928513 Hs.59203 ESTs 6.04

428046 AW812795 Hs.155331 ESTs, Moderately similar to I38022 hypot 6.04

407746 AK001962 Hs.38114 hypothetical protein FLJ11100 6.02

442116 AI884570 Hs.128813 ESTs 6.00

423508 NM_005256 Hs.129818 growth arrest-specific 2 6.00

422011 U30246 Hs.110736 solute carrier family 12 (sodium/potassi 5.99

441794 AW197794 Hs.253338 ESTs 5.99

434739 AA804487 Hs.144130 ESTs 5.98

449802 AW901804 Hs.23984 hypothetical protein FLJ20147 5.9C

420218 AW958037 Hs.286 5.96

425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 5.94

445546 AW468821 ESTs 5.94

439096 AA830185 Hs.269680 ESTs 5.94

452606 N45202 Hs.90012 hypothetical protein FLJ23441 5.94

427701 AA411101 Hs.243886 nuclear autoantigenic sperm protein (his 5.94

417246 AI760098 ESTs 5.94

433190 M26901 Hs.3210 5.92

418744 AI887288 Hs.196379 ESTs, Weakly similar to putative p150 [H 5.92

AI904743 Hs.104650 hypothetical protein FLJ10292

434342 Hs.116768 ESTs 5.92

406668 T62745 Hs.184411 albumin 5.92

418668 AW407987 M-phase phosphoprotein homolog 5.90

429984 AL050102 Hs.227209 hypothetical protein FLJ21617 5.90

408867 AA437199 Hs.656 cell division cycle 25C 5.90

BE243842 Hs.283077 centrosomal P4.1-associated protein; unc 5.88

450380 AI863675 Hs.114017 ESTs 5.88

413026 AA809485 Hs.124219 hypothetical protein FLJ12934 5.88

454653 AW812227 gb:RC2-ST0173-201099-011-s09 ST0173 Homo 5.87

457876 AI821940 Hs.264622 ESTs, Moderately similar to ALU8_HUMAN A 5.86

437222 AL117588 Hs.12778 ESTs 5.86

455630 AV655701 Hs.75183 cytochrome P450, subfamily IIE (ethanol- 5.86

426269 H15302 Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 5.84

409045 AA635062 Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f 5.84

441645 AI222279 Hs.201555 ESTs, Weakly similar to T23406 hypotheti 5.84

401352

419088 AI533323 Hs.52620 integrin, beta 8 5.84

431379 AA504264 Hs.182937 peptidylprolyl isomerase A (cyclophilin 5.83
439580 AF086401 Hs.293847 ESTs, Moderately similar to S65657 alpha
424345 AK001380 Hs.145479 Homo sapiens cDNA FLJ10518 fis, clone NT 
419220 AA811938 Hs.291759 ESTs 
439303 W00605 Hs.102784 ESTs 
415954 AA171850 Hs.42251 ESTs 

418849 AW474547 Hs.53565 Homo sapiens PIG-M mRNA for mannosyltran

429945 NM_006729 Hs.226483 diaphanous (Drosophila, homolog) 2 

439527 AW298119 Hs.202536 ESTs 

435380 AA679001 Hs.192221 ESTs

424086 AI351010 Hs.102267 lysyl oxidase 

425898 AA365649 Hs.269478 ESTs, Weakly similar to PC4259 ferritin 

428460 AA428865 Hs.98563 ESTs 

416623 N74925 Hs.38761 Homo sapiens cDNA: FLJ21564 fis, clone C

413982 BE503035 Hs.279193 ESTs

453240 AI969564 Hs.166254 hypothetical protein DKFZp566I133

410505 AW752139 Hs.314323 ESTs 

447072 D61594 Hs.17279 tyrosylprotein sulfotransferase 1 

433312 AI241331 Hs.131765 ESTs, Moderately similar to I38937 DNA/R

448966 AW372914 Hs.86149 phosphoinositol 3-phosphate-binding prot

433384 AI021992 Hs.124244 ESTs

415385 R17798 Hs.7535 COBW-like protein 

441894 AA134329 Hs.24170 Homo sapiens, clone IMAGE:3685398, mRNA, 

414696 AF002020 Hs.76918 Niemann-Pick disease, type C1 

448666 NM_014953 Hs.323346 KIAA1008 protein

412246 AI160873 Hs.69233 zinc finger protein 

426518 Z43039 Hs.170198 KIAA0009 gene product 

418269 AA806113 Hs.189025 ESTs 

443316 AI478463 Hs.18443 aldehyde dehydrogenase 8 family, member 
422805 AA436989 Hs.121017 H2A histone family, member A
442252 AI733395 Hs.129124 ESTs 

414372 AA143654 gb:zo65a02.r1 Stratagene pancreas (93720 

435040 AI932350 Hs.152825 ESTs 

438777 AA825487 Hs.142179 ESTs 

433849 BE465884 Hs.280728 ESTs 

438639 AI278360 Hs.31409 ESTs 

411274 NM_002776 Hs.69423 kallikrein 10

435008 AF150262 Hs.162898 ESTs

434194 AF119847 Hs.283940 Homo sapiens PRO1550 mRNA, partial cds

452881 AW135220 Hs.241921 ESTs

426010 AA136563 Hs.1975 hypothetical protein FLJ21007

424492 AI133482 gb:HA2093 Human fetal liver cDNA library 

418971 AA360392 Hs.87113 ESTs 

453716 AA037675 Hs.152675 ESTs 

406972 M32053 gb:Human H19 RNA gene, complete cds. 

417543 AA203620 Hs.1 10153 ESTs 

419423 D26488 Hs.90315 KIAA0007 protein 

434674 AA831879 Hs.136985 ESTs 

442980 AA857025 Hs.8878 kinesin-like 1 

418882 NM_004996 Hs.89433 ATP-binding cassette, sub-family C (CFTR 

404227 NA NA 

412766 BE544475 Hs.54347 ESTs 

441708 AI469911 Hs.26498 hypothetical protein FU21 657 

408432 AW195262 gb:xn67b05.x1 NCI_CGAP_CML1 Homo sapiens 

437440 AA846804 Hs.123694 ESTs 

410486 AW235094 Hs.69233 zinc finger protein 

456435 AI880384 Hs.270747 ESTs, Weakly similar to ALU2.HUMAN ALUS 

437378 AI198823 Hs.160473 ESTs 

436907 AA737171 Hs.131809 ESTs 

407300 AA102616 gb:zn43e07.s1 Stratagene HeLa cell s3 93 

413582 AW295647 Hs.71331 hypothetical protein MGC5350 
441795 N58115 Hs.21137 AD024 protein 
452449 AW063658 Hs.20943 ESTs 

439699 AF086534 Hs.187561 ESTs, Moderate'/ in il toALUUIUMAN/ 
431941 AK000106 Hs.272227 Homo sapiens cDNA FLJ20099 fis, clone CO
441217 AI922183 Hs.213246 ESTs
432383 AK000144 Hs.274449 Homo sapiens cDNA FLJ20137 fis, clone CO
453891 AB037751 Hs.36353 Homo sapiens mRNA full length insert cDN
408690 AW864542 gb:PM4-SN0016-120500-003-h02 SN0016 Homo
433759 AA680003 Hs.109363 Homo sapiens cDNA: FLJ23603 fis, clone L
446142 AI754693 Hs.145968 ESTs
408562 AI436323 Hs.31141 Homo sapiens mRNA for KIAA1568 protein,
433929 AI375499 Hs.27379 ESTs
421155 H87879 Hs.102267 lysyl oxidase
424853 BE549737 Hs.132967 Human EST clone 122887 mariner transposo
453931 AL121278 Hs.25144 ESTs
409091 AW970386 Hs.269423 ESTs
416057 AI927382 Hs.29857 ESTs
433647 AA813118 Hs.163230 ESTs
415091 AL044872 Hs.77910 3-hydroxy-3-methylglutaryl-Coenzyme A sy
445038 AI635444 Hs.143917 dJ467N11.1 protein
408622 AA056060 Hs.202577 Homo sapiens cDNA FLJ12166 fis, clone MA
403432
435820 AA700580 Hs.189000 ESTs
413151
414853 
417372 
443613 
412610 
408943 
415139 
447982 



AK001566 Hs.23618 

H47969 Hs.141971 

U31116 Hs.77501 

T99755 Hs.334728 
AI079356 

X90908 Hs.74126 

NM_007070 Hs.49105

AW975942 Hs.48524 

H22953 Hs.137551 

AA632577 Hs.310235 

AI824009 Hs.44577 

R27430 Hs.271565 

U77413 Hs.100293 

BE545072 Hs.122579 

NM_001949 Hs.1189

D82484 Hs.330994 



hypothetical protein FLJ10704

ESTs, Weakly similar to ALU1_HUMAN ALU

sarcoglycan, beta (43kD dystrophin-assoc



ESTs, Weakly similar to I78885 serine/th



O-linked N-acetylglucosamine (GlcNAc) tr
hypothetical protein FLJ10461
E2F transcription factor 3



N59650 Hs.27252 ESTs 



Human DNA sequence from clone RP11-472E5
KIAA1201 protein

Homo sapiens mRNA full length insert cDN
gb:oa13b11.s1 NCI_CGAP_GCB1 Homo sapiens



Hs.246311 ESTs 



426116 
441928 
432657 
438011 



AI283085 



419706 
416113 



Hs.145696 
Hs.250931
Hs.1722 
Hs.98074 
AA830854 Hs.187810 



U43374 
AI635113 
T19016 
AI216751 
C04649 



Hs.95631 



Hs.159410 
Hs.143977 
Hs.77899 
Hs.180545 
AA173525 Hs.118758 



AA610211 Hs.34244 ESTs 
AA328153 Hs.88201 ESTs, 



ESTs 

ESTs, Weakly similar t< 
splicing factor (CC1.3) 

ESTs, Weakly similar to YFJ7_YEAST HYPOT
interleukin 1, alpha

itchy (mouse homolog) E3 ubiquitin prote

Human normal keratinocyte mRNA
ESTs, Weakly similar to I78885 serine/th
molybdopterin synthase sulfurylase
ESTs

tropomyosin 1 (alpha)

Homo sapiens mRNA for hypothetical prote

ESTs, Weakly similar to RLF [H.sapiens]

hypothetical protein FLJ12969

ESTs 

es placenta Nb2HP Homo 



AI574253 
AI821940 
AI985810 



418217 
401480 
456179 



wingless-type MMTV integration site fami 
ESTs 

ESTs, Moderately similar to ALU8_HUMAN A



Hs.125759
Hs.302498
Hs.13442



trinucleotide repeat containing 12
origin recognition complex, subunit 3 (y
ESTs, Weakly similar to LEU5_HUMAN LEUKE
GTP-binding protein homologous to Saccha 



H75490 Hs.271930 ESTs 



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number



Pkey CAT number Accession
408432 1058667_1 AW195262 R27868 AW811262
408690 107490_1 AW864542 AA056567 AW882724
411765 125700_1 H43346 AA248302 AA095182
414372 143909_1 AA143654 AW753140 AA213770 AW970865 AA569075 AA492132
424281 237742_1 AA766243 AA338252 AA338213
424492 240008_1 AI133482 AI207619 AA341626
428679 294049_1 AA431765 AA432015
430848 324621_1 AW021726 AA487752 AA488085
432340 345248_1 AA534222 AA632632 T81234
432363 345469_1 AA534489 AW970240 AW970323
432639 351744_1 AW973785 H60163 AA557608
434609 38950_1 R76593 AF147390 R76594
439518 47334_1 W76326 AF086341 W72300
447197 711623_1 R36075 AI366546 R36167
447974 745643_1 R76886 AI453674 R77049
452712 928309_1 AW838616 AW838660 BE144343 AI914520 AW888910 BE184854 BE184784
454653 1228081_1 AW812227 AW812294 AW812092
458311 543550_1 AF069478 AF069479 AF069480



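Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.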



Pkey Ref Strand Nt_position

400534 6981826 Minus 278637-279292 

401352 9931258 Minus 26064-26208 

401480 7321503 Plus 166120-166347,166451-166557,169651-169832 

401714 6716702 Plus 96484-96681 

401742 2911728 Plus 64003-64147 

403432 9719611 Minus 68204-68392 

403776 7770611 Minus 1414-1513,1624-1756 

404227 7838233 Minus 93110-93259 

404567 7249169 Minus 101320-101501 
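The position entries in the table above are comma-separated start-end pairs on the referenced sequence. The following is a minimal sketch, not part of the original disclosure, of how such an entry could be parsed; the function names `parse_nt_position` and `total_length` are hypothetical, and treating the coordinates as inclusive is an assumption.

```python
def parse_nt_position(text):
    """Parse 'start-end,start-end,...' into a list of (start, end) integer pairs."""
    segments = []
    for part in text.split(","):
        start_str, end_str = part.strip().split("-")
        start, end = int(start_str), int(end_str)
        if start > end:
            raise ValueError(f"segment start exceeds end: {part!r}")
        segments.append((start, end))
    return segments


def total_length(segments):
    """Sum the nucleotides covered, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in segments)


# Entry listed for probeset 401480 in the table above.
segments = parse_nt_position("166120-166347,166451-166557,169651-169832")
print(segments)
print(total_length(segments))
```

A multi-segment entry such as the one for probeset 401480 yields one pair per segment, so the segments can be counted or measured individually.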



Table 8A shows 538 genes significantly down-regulated in colon cancer compared to normal colon. These were selected as for Table 7A, and the ratio was equal to or less than 0.33.
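The ratio cutoff described above can be illustrated with a short sketch. It is hypothetical and not part of the original disclosure: the tumor and normal expression levels are taken as given inputs (their derivation follows Table 7A, which is outside this section), and the names `ProbesetLevel` and `select_down_regulated` are introduced only for illustration.

```python
from typing import NamedTuple


class ProbesetLevel(NamedTuple):
    pkey: int       # Eos probeset identifier
    tumor: float    # summarized expression level in colon tumor (assumed input)
    normal: float   # summarized expression level in normal colon (assumed input)


def select_down_regulated(levels, max_ratio=0.33):
    """Return (pkey, ratio) pairs with tumor/normal <= max_ratio, lowest ratio first."""
    selected = []
    for row in levels:
        if row.normal <= 0:
            continue  # ratio is undefined without a positive normal level
        ratio = row.tumor / row.normal
        if ratio <= max_ratio:
            selected.append((row.pkey, ratio))
    return sorted(selected, key=lambda item: item[1])


# Made-up values, for illustration only.
example = [
    ProbesetLevel(pkey=1, tumor=10.0, normal=100.0),
    ProbesetLevel(pkey=2, tumor=50.0, normal=100.0),
    ProbesetLevel(pkey=3, tumor=30.0, normal=100.0),
]
print(select_down_regulated(example))
```

Sorting by ascending ratio mirrors the ordering of the R1 column in Table 8A, where the most strongly down-regulated genes appear first.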

TABLE 8A: 538 GENES SIGNIFICANTLY DOWN-REGULATED IN COLON CANCER COMPARED TO NORMAL COLON 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal colon 



429970 AK000072 Hs.227059 chloride channel, calcium activated, fam 



416889 AW250318 Hs.80395 

429050 X81333 Hs.194777 

427019 AA001732 Hs.173233 

446947 AF146747 Hs.232165 

429657 D13626 Hs.2465 

443506 H10661 Hs.192124 

415314 N88802 Hs.5422 

451181 AI796330 Hs.207461



ESTs 

hypothetical protein FLJ10970
polycythemia rubra vera 1; cell surface 
KIAA0001 gene product putative G-protei 
ESTs, Weakly similar to I38022 hypotheti 



0.0629 
0.0634 
0.0707 
0.0714 



ESTs 






WO 03/042661 



PCT/US02/36810 



411529 AA430348 Hs.317596 

441056 AW205427 Hs.190726 

431252 NM_005478 Hs.251380

437099 N77793 Hs.48659 

433546 AI075877 Hs.125461 

415154 D63175 

409921 AW600239 

432440 X63597 Hs.2996 

430468 NM_004673 Hs.241519

427167 AI239607 Hs.99196 

441212 AW242447 Hs.146182 

423605 AF047826 Hs.129887 

411381 AW841862 Hs.306831 

412639 AW961284 Hs.296235 

453399 Z70295 Hs.32966 
403548 

421913 AI934365 Hs.109439 

457982 AW856093 Hs.183617 

448835 BE277929 Hs.11081 

431292 AA370141 Hs.2281

441805 AA285136 Hs.301914

407639 AW205369 Hs.312830

AK001879 Hs.107527



421741 
454790 
451742 
425849 
411880 



T77609 Hs.117970

AJ000512 Hs.296323 
AW872477 

X54162 Hs.79386 

AF255910 Hs.54650 

AB020629 Hs.38095 



404767 



427359 AW020782 Hs.79881 

452768 AW069459 Hs.61539 

418692 AK000268 Hs.87383 

414831 M31158 Hs.77439 
407551 Y10516



453500 AI478427 

442080 AW444761 

431706 AI816086 

429545 AI824164 

418390 AF133820 

435056 AW023337 

426034 AI276989 

429609 AF002246 

408221 AA912183

425220 AW975317 



456064 AA256213

428133 AW167727 

447261 NM_006691 

437734 AA693951 

414290 AI568801 



442496 R55073 

450693 AW450461 

420736 AI263022 
405385 
404638 

427333 AF067797 



421666
452854
400514 
454186 



Hs.43125 

Hs.296341

Hs.84665 

Hs.5422 

Hs.56123 

Hs.210863

Hs.47447 

Hs.162987 

Hs.12409 

Hs.284450 

Hs.72010 

Hs.11873

Hs.17917 

Hs.180284 

Hs.71721 



Hs.124130 
Hs.203965 
Hs.82204 



Hs.23767 



436008 AI078428 



419746 
410677 
415672 
419050 
417342 
413714 



ESTs, Highly similar to S14458 laminin a 
hypothetical protein FLJ11539
gb:HUM501B09B Clontech human placenta po 
gb:EST00009 pGEM-T library Homo sapiens 



angiopoietin-like 1 
hypothetical protein MGC11324 
cytosolic beta-glucosidase



Homo sapiens cDNA: FLJ22549 fis, clone H




hypothetical protein FLJ11017
gb:RC2-ST0301-120200-011-f12 ST0301 Homo
ankyrin 2, neuronal 
serum/glucocorticoid regulated kinase 

n3 I ^P_Thy4 Homo sap 



ATP-binding cassette, sub-family A (ABC1 



hypothetical protein 

protein kinase, cAMP-dependent, regulato 
gb:H.sapiens mRNA for CD58 T3 protein. 



adenylyl cyclase-associated protein 2 
lymphocyte antigen 6 complex, locus E 
titin immunoglobulin domain protein (myo 
glycoprotein M6B 

Homo sapiens cDNA FLJ13443 fis, clone PL



carbonic anhydrase IV 



Hs.58785 
Hs.128433 
NM_007351 Hs.268107
AW867943 Hs.127216
NM_003278 Hs.65424
N53097 Hs.193579
NM_000036 Hs.89570
W40277 Hs.81994 
AI560944 Hs.71428 



gb:MRO-HT0067-201099-002-h11 HT0067 Homo 
nucleosome assembly protein 1-like 2 
hypothetical protein FLJ12666
gb:ni50o09.y5 NCI_CGAP_Ov2 Homo sapiens 



0.0951 
0.0957 
0.0985 



0.1067 
0.1107 
0.1114 
0.1147 
0.1167 
0.1190 
0.1211 
0.1239 
0.1240 
0.1248 
0.1274 
0.1277 
0.1277 
0.1291 
0.1309 
0.1315 
0.1325 
0.1328 
0.1335 
0.1379 
0.1393 

0.1456
0.1459 
0.1460 
0.1462 
0.1463 
0.1466 
0.1471 



0.1558 
0.1563 
0.1582 

0.1615
0.1637 
0.1638 
0.1656 
0.1660 
0.1676 
0.1698
0.1718
0.1745
0.1751
0.1757 
0.1763 
0.1767 
0.1769 
0.1781 
0.1784 
0.1795 
0.1805 
0.1808 
0.1812 
0.1812 
0.1812 
0.1820 
0.1828
0.1832
0.1835
0.1838 
0.1838
0.1838
0.1842
0.1845 
427061 AB032971 Hs.173392 KIAA1145 protein



412295 AW088826 Hs.117176 poly(A)-binding protein, nuclear 1 

447414 D82343 Hs.18551 neuroblastoma (nerve tissue) protein

407891 AA486620 

437140 AA312799 

431544 AK000770 



Hs.144464 
Hs.17518 
Hs.42746 
Hs.173373 
Hs.126083 
Hs.94694 
Hs.202639 
Hs.109012 
Hs.106642 
Hs.37129 
Hs.93173 



447164 AF026941 

453698 AA037615 

423743 AB023148 

428412 AA428240 

420058 AK001423 

434683 AW298724 

421865 AA609911 

452786 R61362 

407601 AC002300 

419758 U31973 

437740 AA810265 



437145 AF007216 Hs.5462 

416961 BE391476 Hs.80617 
401465 

437425 AW183714 Hs.20981 

416231 H30333 Hs.165062 
401753 

433430 AI863735 Hs.186755 

432150 AK000224 Hs.272789 

411644 H92064 Hs.278626 



419956 AL137939 Hs.40096 

418026 BE379727 Hs.83213 

423655 M72242S Hs.182785 
401381 

426452 AW614271 Hs.121647 

433476 AA594394 Hs.152616

423405 NM_014151 Hs.128155

442826 AI018777 Hs.131241 

427060 AW378993 Hs.90286 

437354 AA749215 Hs.291886 

447734 AI421412 Hs.163659 

424585 AA464840 Hs.131987

458016 AW188099 Hs.131813 

423893 AL031709 Hs.134846 



AL137311 Hs.234074 



401024 

414802 AI793107 Hs.27018 
441083
417355 
422440 






433575 
416035 
428415 



Homo sapiens cigS mRNA, partial sequence 



Homo sapiens cDNA FLJ10561 fis, clone NT
ESTs 

MAX dimerization protein 
ESTs, Weakly similar to T09052 hypotheti
sodium channel, nonvoltage-gated 1, beta 
phosphodiesterase 6C, cGMP-specific, oa 



hypothetical protein DKFZp547M236 



pancreatic lipase-related protein 2 



y similar to 1207289A rev 
ESTs, Highly similar to AC006014 8 simil 



D13168 Hs.82002 

NM_004812 Hs.116724

AW881975 Hs.213923 

BE041395 Hs.283676 

AV654020 Hs.184261 

F09747 Hs.268707 

AW295031 Hs.198671 

AA704687 Hs.191294 
AA625499 

NM_005771 Hs.179608

AA180363 Hs.118769
BE385725 

Z30300 Hs.281935 

AI656707 Hs.48713 

AA600175 Hs.39720 
H42314 

AA337211 Hs.184222

AI372588 Hs.8022 

NM_006846 Hs.331555 

AI743069 Hs.134736 



446294 AI284935 

414193 BE260069 

411514 AW850178 

453098 Z25935 Hs.86379 

430378 Z29572 Hs.2556

432374 W68815 Hs.301885 

443482 AW188093 Hs.250385 



Human DNA sequence from clone 316G12 on
Homo sapiens mRNA; cDNA DKFZp761G02121 (



gb:601336446F1 NIH_MGC_44 Homo sapiens c 

endothelin receptor type B 

aldo-keto reductase family 1, member B10



gb:af69g08.r1 Soares_NhHMPu_S1 Homo sapi



gb:601276347F1 NIH_MGC_20 Homo sapiens c 



serine protease inhibitor, Kazal type, 5 



gb:qk55g09.x1 NCI_CGAP_Co8 H
gb:601150964F1 NIH_MGC_19 Homo sapiens c 
gb:IL3-CT0219-271099-022-H12 CT0219 Homo 



0.1876 
0.1895 
0.1901 
0.1904 
0.1905 
0.1926 
0.1928 
0.1941 
0.1942 
0.1944 
0.1957 
0.1957 
0.1969 
0.1981 
0.2004 
0.2016 
0.2017 
0.2025 
0.2041 
0.20« 
0.2049 



0.2059 
0.2063 
0.2070 
0.2075 
0.2075 
0.2090 
0.2093 
0.2118 
0.2120 
0.2122 
0.2125 
0.2130 
0.2132 
0.2137 
0.2137 
0.2144 
0.2146 
0.2151 
0.2151 
0.2157 
0.2165 
0.2171 
0.2179 
0.2185 
0.2186 
0.2188 
0.2193 
0.2209 
0.2212 
0.2217 



0.2244 
0.2254 
0.2257 
0.2258 
0.2253 
0.2273 



0.2299 
0.2304
0.2305 
0.2309 
0.2315 

0.2319
0.2320 
0.2326 
432134 AI816782 Hs.122533

421539 AA292747 Hs.97296 

448520 AB002367 Hs.21355 

408001 AA046458 Hs.95296 

409331 M36634 Hs.53973 

431094 AW972276 Hs.116195 

429575 AA706003 Hs.99387 
404958 

439731 AI953135 Hs.45140 

452742 AW589945 Hs.97876 

417511 AL049176 Hs.82223 
404927 

430297 AW243166 Hs.129806 



AA989220 Hs.292100 
BE379046 

AW664708 Hs.171959 

AI082777 Hs.61384 

AW161284 Hs.79564 

AA383623 Hs.293616



hypothetical protein FLJ21934
ESTs 

doublecortin and CaM kinase-like 1



vasoactive intestinal peptide 



ESTs 

KIAA1233 protein 

ESTs 

ESTs 

mitochondrial carrier homolog 1 
ESTs 

gb:601236646F1 NIH_MGC_44 Homo sapiens c 
ESTs 

sema domain, seven thrombosp 
neuronal PAS domain protein 1 
ESTs 



0.2330 
0.2344 
0.2347 
0.2351 
0.2354 
0.2358 
0.2361 
0.2364 
0.2380 
0.2381 
0.2387 
0.2412 
0.2415 
0.2416 
0.2421 
0.2424 






AW500131 Hs.171763

AI676062 Hs.111902 

AW604804 Hs.151717 

AA884208 Hs.30484 

AA234223 Hs.139204

AV657547 Hs.236014
AA344576



446066 AI343931 

408345 R93851 

418358 L02840 

416950 AL049798 



Hs.149383 
Hs.63063 
Hs.84244 
Hs.80552 
Hs.178589 
Hs.106106
Hs.333352



459688 U72671 

450880 AK002183 

457330 AB013818 

451979 F06972 

440274 R24595 

430097 AI523245 

410626 BE407727 

453992 AW014995 

453888 AW450670 
401371 

456145 BE299427 

408134 AK000184 

422591 L07648 



Hs.21446 
Hs.42945 
Hs.118630 
Hs.29206 
Hs.147554 



405880 

420321 D78761 Hs.96657 

454415 AK000846 Hs.58679 

437032 AW867372 Hs.302063 

448025 BE502965 Hs.170426 

444304 AW628433 Hs.271296 

424885 AI333771 Hs.82204 

457413 AA743462 Hs.165337

452078 AA022620 Hs.52170 

450785 AA852713 Hs.25459 
401974 

411319 BE537094 

417761 R13727 

426132 AA370501 

455771 BE084820 

414349 BE512968 
402182 



Hs.21435 
Hs.186711 



potassium voltage-gated channel, Shab-re 



hepatocellular carcinoma antigen gene 52 
protein kinase (cAMP-dependent, catalyti
Homo sapiens brother of CDO (BOC) mRNA,
gb:Homo sapiens mucin (mucin) mRNA, part
ribosomal protein L3
intercellular adhesion molecule 5, telen
Homo sapiens cDNA FLJ11321 fis, clone PL



gb:601299771F1 NIH_MGC_21 Homo sapiens c 



hypothetical protein

solute carrier family 7, (cationic amino 
immunoglobulin heavy constant mu 
ESTs 

ESTs, Weakly similar to I54374 gene NF2 
ESTs 

paraoxonase 1 

ESTs 

ESTs 

Homo sapiens, alpha-1 (VI) collagen 

gb:601063333F1 NIHJJGCJ0 Homo sapiens c 
ESTs 

gb:EST82261 Prostate gland I Homo sapien 
hypothetical protein FLJ20070
gb:601172296F1 NIH_MGC_15 Homo sapiens c 



0.2462 
0.2471 
0.2474 
0.2478 
0.2481 
0.2494 
0.2495 
0.2499 
0.2500 
0.2506 
0.2506 
0.2508
0.2510 
0.2513 
0.2515 
0.2519 



0.2558 
0.2564 
0.2565 
0.2569 
0.2569 
0.2574 
0.2579 
0.2580 
0.2582
0.2584 
0.2586 
0.2333 
0.2595 
0.2602
0.2604 
0.2605 



0.2660 
0.2660 
0.2661 






408684 R61377 Hs.12727 

444209 AI753134 Hs.146494 

415022 X59960 Hs.77813 

416184 R48481 Hs.269177 

422909 AA533356 

412047 AA934589 Hs.49696 

426356 BE536836 Hs.98682 

AW135374 Hs.270949 



402425 
450545 
417118 
419850 
428020 
441493 
413541 
428470 
455597 
447809 
444195 






U38654 Hs.50477 
F06844 

L19058 Hs.181581 

AW070446 Hs.127037 
BE147036 

AC002301 Hs.184507

BE008545 Hs.156110

AW207605 Hs.164230

AB002351 Hs.10587



421823 N40850 



Hs.28625 



BE393856 Hs.66915 

U86136 Hs.232070 

AA524447 Hs.152377 
AW998605 

H53035 Hs.337620 

AI024834 Hs.131729 
AW811633 

AI219896 Hs.97592 

AI522109 Hs.171066 

AI349306 Hs.11782 
AA420712 
AI142379 

NM_005309 Hs.103502

AI880516 Hs.84630 

AW996722 Hs.125297 



403047 

412506 AW967159 

441042 AA077736 

434660 AA764768 Hs.121158 

444453 AW379394 Hs.145126 

457736 AK000390 Hs.4205 

454012 M76424 Hs.37014 

427448 BE246449 Hs.2157 



435021 
413344 
447787 



437483 
407938 
417332 



Hs.75302 

AA465293 Hs.105069

AI929453 Hs.122489
AL390174 

AA905097 Hs.85050 

AW972717 Hs.288462 

AW444451 Hs.134812 

H04607 Hs.9218 

AA352026 Hs.94319 



428746 AW503820 Hs.192861

431822 AA516049

441290 W27501 Hs.89605 

422033 AW245805 Hs.110903 

421935 AA131632 Hs.109672 

447955 BE544271 Hs.288390 

405364 

422165 AL041199 Hs.1481

431087 H12723 Hs.290791 

450610 AA010370

445627 AW818475

436144 AW881250 Hs.148367 



Hs.60385 



ESTs, Moderately similar to F41925 hypoi 



ESTs, Highly similar to JC7266 3',5-cyc 
KIAA0353 protein 

gb:yd38a04.r1 Soares fetal liver spleen 



ESTs, Weakly similar to 16.7Kd protein [ 



gb:RC2-ST0158-091099-011-d05ST0158 Homo 



430304 AL122071 Hs.238927 

455614 AI693369 Hs.202274 

419683 AA248897 Hs.48784 

411886 AL046810 Hs.20021 

430770 AA765694 Hs.123296

444459 AI680624 Hs.148676



ESTs, Weakly similar to 2004399A chromos 



gb:EST3G9229 MAGE resequences, MAGD Homo 
gb:7B48A07 Chromosome 7 Fetal Brain cDNA 
hypothetical protein DKFZp434J0113 
ESTs 

hypothetical protein FLJ20124



family 1 (neuronal/epithe 



gb:601483015F1 NIH_MGC_69 Homo sapiens c 
ESTs 

Homo sapiens cDNA FLJ13289 fis, clone OV
gb:Homo sapiens mRNA; cDNA DKFZp547J184
phospholamban
hypothetical protein FLJ21511



VPS10 domain receptor protein 

gb:RC3-CT0255-200100-024-a08CT0255 Homo 
Spi-B transcription factor (Spi-1/PU.1 r
gb:n 0 55d01.s1 NC ( 1 \Jip2 Homo sapiens 
cholinergic receptor, nicotinic, alpha p 
claudin 5 (transmembrane protein deleted 
CMP-NeuAC:(beta)-N-acetylgalactosaminide
hypothetical protein FLJ22795

histidine decarboxylase 



Homo sapiens mRNA; cDNA DKFZp434H1235 (f



gb:MR4-ST0062-180200-001-b04 ST0062 Homo
ESTs 




0.2677 
0.2681 
0.2681
0.2693 
0.2703 
0.2709 
0.2710 
0.2710 
0.2725 
0.2727 

0.2733 

0.2734
0.2740
0.2740
0.2743
0.2747
0.2755 
0.2755 
0.2756 

0.2763
0.2768 
0.2775 
0.2775 
0.2778 

0.2778



0.2799 
0.2804 
0.2804
0.2809 
0.280= 
0.2815 
0.2816 
0.2817 
0.2820 
0.2821 



0.2840 
0.2841 
0.2841 
0.2842 
0.2845 
0.2846 

0.2857
0.2857 



0.2872 
0.2874 
0.2877 
0.2878 
0.2880 
0.2881 



0.2886 
0.2891 
0.2891 
0.2899 
0.2900 
0.2904
0.2913 
0.2913 
0.2915 
0.2915 
0.2921 
413056 BE063031

443998 AI620661 Hs.296276 

424412 H15512 Hs.10043 

421204 AW081587 Hs.155051 

420931 AF044197 Hs.100431 
440507 H06994 

445555 AW974013 Hs.260809 

438570 AW888554 Hs.84293 

447195 T73745 Hs.279870 

423267 AL137416 Hs.125177 

421920 BE551245 Hs.1438 

412177 Z23091 Hs.73734

428042 AA419529 Hs.76391 

433745 AF075320 Hs.28980 

417935 R53697 Hs.170044 

NM_000055 Hs.1327 
BE146866 

AI215667 Hs.175044 

AW135965 Hs.246783 



gb:MRO-BTO265-231199-002-e09 BT0265 Homo 
ESTs 

hypothetical protein FLJ13074



ESTs 

CD74 antigen (in 
ESTs, Weakly similar to A46010 Winked 
Homo sapiens mRNA; cDNA DKFZp4340192 (fr 
gamma-aminobutyric acid (GABA) receptor, 
glycoprotein V (platelet) 
myxovirus (influenza) resistance 1, homo 
hypothetical protein FLJ14540



AV657778 Hs.3314 

AI610447 Hs.48778

BE256176 Hs.278712 
R29072 

BE047637 Hs.173739 

M16801 Hs.1790 

D90337 Hs.247916 

AA018311 Hs.114762

AI202492 HS.212S33 



413537 
445194 
454135 
403418 
457605 
408896 
448542 
417945 
412518 
424566 
430778 
451531 
444926 

407366 AF026S42 
459456 AA486036 Hs.190124
417111 AW016321 Hs.82306 
452975 M85521 Hs.244482 
451959 AA056203 Hs.27337 
410482 AW772187 Hs.191859 
417700 M36542 Hs.1101 



432247 AA531287 

453471 AL037887 

417481 AA203281 

432306 Y18207 

448744 AL135424

429223 BE264152 

404501 AW247262 

406829 AW419128 

438839 AW297945 

431848 AI378857 



Hs.303090 
Hs.9469

Hs.221994 



456373 BE247706 Hs.89751 

458789 AL157468 Hs.325825

443294 AI733625 

447023 AA356754 

458583 AI479646 

414567 BE281057 

445123 AI762911 

412682 AW983772 



Hs.86379 



Hs.133053 
Hs.17109
Hs.157081 
Hs.184519 
Hs.145369 

Hs.117772



eukaryotic translation initiation factor 
gb:F1-101D22weekold human fetal liver 
hypothetical protein FLJ10297
nuclear receptor subfamily 3, group C, m 
natriuretic peptide precursor C 
ESTs 



in, class 2, transcription facto 



protein phosphatase 1, regulatory (inhib 



nucleoside phosphorylase 



ESTs, Highly similar to AF175283_1 zinc
membrane-spanning 4-domains, subfamily A
Homo sapiens cDNA FLJ20848 fis, clone AD



Hs.8752 

Hs.115700 

Hs.250700 



414026 BE241713 

432149 AW614326 

408350 AW183350 
401042 

422585 AA312704 

438692 AB007950 

447452 BE618258 

444414 AW293214 

422373 AK001843 

430410 AF099144 

419299 AI311085 
400672 

444010 AW976457 

451699 AL118571

432471 BE244667 
405277 

456765 AI497900 

452090 AA022684 



423621 BE002904 

417919 AI928203 

414484 BE314385 

457439 AW410408 Hs.271167 

426449 AL134009 Hs.169936

419942 U25138 Hs.93841 

458544 AI631036 Hs.196843 



gb:RC3-HN0002-060400-012-h09 HN0002 Homo 



Jin FU22127 
KIAA0481 gene product 
Homo sapiens, clone IMAGE:3869590, mRNA



Homo sapiens cDNA: FLJ23515 fis, clone L
tryptase beta 1

hypothetical protein FLJ22573



0.2938 
0.2943 
0.2945 
0.2948 



0.2973
0.2973
0.2974
0.2976 
0.2986 



0.3012 
0.3012 
0.3012 
0.3012 
0.3012 
0.3013 
0.3018 
0.3019 
0.3023 
0.3028 
0.3029 
0.3032 
0.3033 
0.3034 
0.3037 
0.3039 
0.3039 
0.3042 
0.3045 
0.3048 
0.3050 
0.3052 
0.3056 
0.3057 
0.3064 
0.3065 
0.3071 
0.3072 
0.3073 
0.3074 

0.3077
0.3077 
0.3083 
0.3085 
0.3088 
0.3090 
0.3091 

0.3096 
0.3096 
0.3105 
0.3106 
0.3106 
0.3106 
0.3106 



gb:QV4-BN0090-07040O-163-cC7 BN0090 Homo 
ESTs 

gb:601154649F1 NIH_MGC_19 Homo sapiens c 
L-pipecolic acid oxidase 

" sapiens mRNA; cDNA DKFZpE86N1918 (f 
446616
409404 
417318 
443980 
459138 
414807 
434704 
414214 
446378 



438219 
412944 
431103 
426662 

432168 
411084 
425367 
448812 
411288 



BE271708 Hs.95110 

AW170591 Hs.13967 
AA017656 

AA019961 Hs.26216

R65964 Hs.241569 

BE220053 Hs.129056 

AW953937 Hs.12891 

AI459140 Hs.299087 
AI903291 

AI738616 Hs.77348 

AW135276 Hs.192311 

D49958 Hs.75819 

AI905699 Hs.239760 
AI939966

NM_004235 Hs.7934

BE394222 Hs.231444

AA382250 Hs.145601

AL037285 Hs.289848

AJ242724 

AW505582 Hs.130732

AI9161S1 Hs.257194 

AA384110 Hs.197143 

M57399 Hs.44 

AA879474 Hs.122710 

AL137597 Hs.11114 

AK000563 Hs.272805 

T18987 Hs.125472 

BE271188 Hs.155975 

H30775 Hs.22140 



422884 AW860975



414540 BE379050 Hs.306969 

428568 AC004755 Hs.184922

428106 BE620016 Hs.182470 

411856 H67899 Hs.4190 
445682 AW378397 

AI954795 Hs.156135 

AI608810 Hs.193288 

AL133573 Hs.272312 

AA235900 Hs.87500 

D83407 Hs.156007

424729 AF063012 Hs.152531 

440020 AI480204 Hs.177131 

429082 AL135682 Hs.22452

433663 AF083131 Hs.229535 
400641 
406140 

415280 R56473 Hs.268715 

447635 AI669669 Hs.195362 



448943 
431999 
419279 
405913 



ESTs, Weakly similar to S16506 hypotheti
ESTs, Weakly similar to A55943 1-phospha
ESTs, Weakly similar to PSM_HUMAN PROSTA
gb:ze39h01.r1 Soares retina N2b4HR Homo
LOC50627

ESTs, Weakly similar to ALU8_HUMAN ALU S



gb:RC-BT029-080199-047 BT029 Homo sapien
hydroxyprostaglandin dehydrogenase 15-(N 
ESTs 

glycoprotein M6A 
citrate synthase 

gb:MR0-CT0015-160799-002-b06 CT0015 Homo 
Kruppel-like factor 4 (gut)
Homosapier - - 
ESTs 



EST, Weakly sir 



"to ALU4_HUMAN ALU SU 



0.3121 
0.3125
0.3125
0.3125
0.3130
0.3132
0.3135
0.3139
0.3140
0.3142
0.3143
0.3143
0.3145 



0.3150 
0.3152 
0.3172 
0.3173 



0.3188 
0.3189 
0.3190 



pleiotrophin (heparin binding growth fac 
ESTs 

hypothetical protein dJ1181N3.1 
hypothetical protein FLJ20556
ESTs, Moderately similar to KIAA0877 pro 
protein tyrosine phosphatase, receptor i 
BM88 antigen 

gb:QVC4_T0015-180200-127-d02 LT0015 Homo 
ESTs 

tankyrase, TRF1-interacting ankyrin-rela 0.3195

gb:15d7 Human retina cDNA randomly prime 0.3195
Homo sapiens, clone MGC:10782, mRNA, com 0.3195
Homo sapiens chromosome 19, fosmid 37502 0.3195
PTD010 protein 0.3198
Homo sapiens cDNA: FLJ23269 fis, clone C 0.3202
gb:RC3-HT0220-031299-012-g06 HT0220 Homo 0.3205
ESTs 0.3205
ESTs 0.3205

Homo sapiens mRNA; cDNA DKFZp434J2235 (f 0.3207
ESTs 0.3208
0.3209 
0.3212 
0.3212 
0.3213 
0.3215 
0.3215 
0.3216 
0.3216 
0.3217 



432749 
408112 
433234 
422831 
403215 
451868 
446901 
430553 
445848 



H47495 Hs.13810 

AB018311 Hs.21917 

AW242684 Hs.243623 
AW847669 

NM_014438 Hs.278909 

AW451982 Hs.248613 

AB040928 Hs.65366 

R02504 Hs.332943 

R85962 Hs.221926 



gb:IL3-CT0213-280100-056-G10 CT0213 Homo



ESTs, Weakly similar to I38022 hypotheti 
gb:tc05d02.id NCI_CGAP_Co16 Homo sapiens 
gb:CM4-ST0275-021299-053-h09 ST0275 Homo 
Homo sapiens clone 23649 and 23755 unkno 



0.5273 
0.3226 
0.3231 
0.3231 
0.3231 
0.3234 
0.3236 
0.3236 
0.3242 
0.3254 



436777 
431651 
454117 



T62216 Hs.270840 

AA180256 Hs.210473 

AW844694 Hs.306752 

AA731199 Hs.293130 

BE250915 Hs.266914 

BE410100 Hs.40368 

AI768853 Hs.134478 

BE383234 Hs.25925 

AA393551 Hs.97450 



gb:EST02679 Hippocampus, Stratagene (cat 



hypothetical protein FLJ10355



Homo sapiens, clone MGC:15393, mRNA, coi 



0.3267 
0.3267 
0.3268 
413814
422818 
423634 
414002 
452164 
458477 
433197 
405701 



AA404290 Hs.97848 
AW959908 Hs.1690 
NM_006732 Hs.75678



heparin-binding growth factor binding pr 



gb:tz44b02.x1 NCI_CGAP_Bm52 Homo sapien 
phosphatase and tensin homolog (mutated 
KIAA1456 protein 

AI370875 Hs.79090 exportin 1 (CRM1, yeast, homolog)

AI761313 Hs.204605 ESTs 

AA598956 Hs.120439 

AF039918 Hs.80975 

BE270795 Hs.268864 

AA325225 Hs.124023

AA405093 Hs.126519 

AA601122 Hs.95655 

AI831648 Hs.143993 
AW837063 



ectonucleoside triphosphate diphosphohyd 
ESTs 

Homo sapiens cDNA FLJ14218 fis, clone NT



0.3275 
0.3275 
0.3277 
0.3278 



0.3280 
0.3282 
0.3284 
0.3286 
0.3269 
0.3290



gb:QV1-LT0037-150200-069-g08 LT0037 Homo 



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



AW600239 AW600255 AW505332 
BE407727 

AW81 IMS AW61 1652 AW811898 ^ AW8 ° 7285 AWS0727 ° AW8 ° 7372 AM0728 ° AW807283 

AW835511 AW835517 AW835513
BE537094 AW836542
AW837063 AW935882 AW935957
AW847669 AW847667 BE145799
AW850178 AW850233 AW850445 AW850446
AW85453C AW854417 AW854495 AW854355
AW872477 BE088101 T05990
AI791451 AI791288 BE019234 BE296601 AA111939
AW957159 H09937 T75143

AW983772 AW983730 AW983769 AW9B3836 AW983B35 AW983837 

M86151 BE061884 BE061883 BE061898 BE061882 BE061887 BE061891 BE061690 BE061896 BE061693 BE061695 BE061694 BE061885 BE007474 
BE007481 BE007553 



BE146866 BE146865 BE146867
BE147036 BE146951 BE146958 BE146966 BE146976 BE146955
BE169592 BE169421
BE241713 BE241912



Pkey CAT number 

409921 1159516 1 

410626 1212621_-1 

410845 1223881.1 

410950 1227728 1 

411288 1237709.1 

411319 1238595.1 

411328 1238987.1 

411466 1246771 1 

411514 1248638.1 

411620 1252014.1 

411880 1263110J 

412474 129869 1 

412506 1301336.1 

412682 1321572.1 

412888 1334784J 

413056 1347545 1 

413537 1375441.1 

413641 1375499 1 

413814 1391574J 

414026 1411109 1 

414193 14247061-2 

414349 1437515.-1 

414484 1452830 -1 

414539 1460320.1 

415154 1525577.1 

415160 1525766 1 

416035 1567254.1 

417945 1711126 1 

419850 188485.1 

422909 222858.1 

422954 223239J 

423621 230314 1 

424648 241947 1 

426132 261431J 

426497 268121 1 

430553 319868.1 

431822 338082_1 AA516049 AW004922
434098 380006_1 AA625499 AA625269 AA625184
437483 43756_1 AL390174 AW898817
440507 495677_1 H06994 BE147898
441042 50823_1 AA077736 AA078505 BE562497 Z17859
441083 50904_1 BE562611 AA436054
445682 647580_1 AW378397 AW378390 AW378358 AI247957
446294 670076_1 AI284935 AW409822 BE408182
446901 697809_1 AI347274 AW844024
447787 73719_1 BE620108 BE312062 AW896316 BE262546
451385 86787_1 AA017656 AA017374 AA019761
452164 902091_1 AI863171 BE047919

454186 1049791.1 BE141030 BE141474BE141467 BE141753 BE141024 BE141761 AW177583 AW177S79 AW177582 AW177585 AW177S87 AW807582 AW177581 
BE141749 AW177598 BE141 ° 28 BE141?75 8341489 BE141?51 AW1775 " BE141750 AW177597 BE14151 2 BE141460 



D63175 D78984 D63269



H42314 H43080 H45217 H15384 

R29072 R29717 R29699 R29709 R29751 R29609 1 

F06844 F06845 Z45488 AW748501 AW748591 AW752021
AA533356 AW468427 R67736 AA779031 AA614088 AI823404 AA318991 AA720986
AW998605 AW993131 BE514709 AA319445
BE002904 H64880 AA328679
AA344576 AA732430 AA344601
AA370501 AW962784 AA370727
AA379913 AA379981 AW963523
454790 1234752_1 AW820652 AW820773 AW821088
454947 1245953_1 AW846590 AW846615 AW846584 AW846592 AW846621 AV
455353 1284289_1 W26786 AW998612 AW902272
457115 286601_1 AA420712 AA469165 AA420737



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.





Pkey Ref Strand Nt_position


400514 


9796594 


Plus 


78844-79025,80850-80991,89754-89941,93750-93891 
4786-4992 


400672 


8118724 




148067-148503 


400767 


8131627 




80531-80629,82169-82278 




1945037 




44482-45526 


401024 






60551-60802 


401042 


8117611 


Plus 


151364-151606 


401371 


9650602 


Plus


80901-81283 


401381 
401465 


8570226 


ptT 


118629-119146,119392-119657 


6682292 






401521 


7705251 


Plus 


9127-9234 


401753 


9838183 


Minus 


155287-155529,159719-159997 


401776 


7229981 


Plus 


115535-115743,117746-117839,120290-120455 


401887 


Plus


93973-94120 
85330-85683 




3126777 


Plus


402076 


8117410 


Plus


128316-128627 


402182 






98298-98439 


402425 


9796347 




50224-50395 


402610 


9926549 


Minus 


22955-23124 




8569871 


Minus 


159927-160055 


403047 


3540153 


Minus 


59793-59968 


403418 






177270-177971 


6862692 




176202-176395 


403548 


8081591 


Minus 


38760-39352 


403957 


8076835 




81649-81754 


404070 


2996642 


Plus 


7210-7414,10043-10195 


404246 
404414 


7406725 


Plus 


82477-82628,82721-82817,82910-83071,83149-83387


7382165 


Plus 


143127-143398 


404605 


9212566 


Plus 


125032-125291 


404638 


9796751 




99433-99528,100035-100161 


404696 


9800109 




60037-60144,62675-63081 


404767 


7882827 




23244-23759 


404769 


8099713 




175801-176823 


404927 


7342002 




68690-69563 


404958 


7407941 




2731-4531 


405073 


7708797 




11115-11552 


7769921 


HuT 


31419-31774 


405138 


8576241 




90303-90516 


405277 


3980473 


Plus 


23471-23572 


405282 
405334 


3810573 
3135285 


Minus 


10482-10689 


405364 


2281075 


Plus 
Minus 


139386-139856 

48325-48491,49136-49252 

48332-48454 


405385 


6552772 


Plus 


405535 


9795658 


Plus 


63384-63545 


405610 


5757553 


Minus 


71907-72080 


405654 


4895155 


Minus 


53624-53759 


405691 


4508112 


Plus 


171350-171739 


405701 


4263751 


Plus 


93243-93364 


405880 


6758747 


Minus 


55673-56287 


405913 


7712139 


Minus 


7484-7678 


406140 


9168231 


Minus


49887-50219 


406592 


4567182 


Plus 


352560-352963 



Table 9A lists about 1,234 genes up-regulated in colon cancer compared to normal adult tissues and to non-malignant colon tissues that are likely to encode proteins amenable to

... greater ... was set to the 90th percentile value amongst colon primary ...

... equal to 50 ... was set to the ... percentile value amongst non-malignant tissues, the "average" colon ...
... intracellular ...

of being modulatable by small molecules, peptides, or antibodies (e.g. pkinase, death-domain, 7tm, phosphatase, or ion_transporter). In order to remove gene-specific
background levels of non-specific hybridization, the 15th percentile value amongst the over non-malignant tissues was sul
the ratio was evaluated.
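
The ratio described above can be illustrated with a short sketch. It assumes, since the surrounding text is truncated, that the 15th percentile background value is taken from the non-malignant tissue samples and is subtracted from both the numerator and the denominator before the ratio is evaluated. The function names `percentile` and `r1_ratio`, the linear-interpolation percentile rule, and the sample values are illustrative assumptions and are not part of the specification.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile (pct in 0..100) with linear interpolation between ranks."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must be non-empty")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def r1_ratio(tumor: Sequence[float], normal: Sequence[float],
             tumor_pct: float = 90.0, normal_pct: float = 70.0,
             background_pct: float = 15.0) -> float:
    """Background-corrected ratio of tumor to normal expression.

    Assumption: the background (15th percentile of the normal samples)
    is removed from both numerator and denominator.
    """
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        raise ValueError("normal level does not exceed background")
    return numerator / denominator


# Hypothetical expression values for one probeset.
normal_samples = [0, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
tumor_samples = [0, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
ratio = r1_ratio(tumor_samples, normal_samples)
```

A probeset would then be retained when this ratio meets the chosen cutoff; the cutoff itself is not legible in the text above and is therefore left unspecified here.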

TABLE 9A: 1,234 genes up-regulated in col 



ExAccn: Exemplar;. 
UnigeneID: Unigene number
Unigene Title: Unigene gene title 

R1: 90th percentile of tumor samples divided by the 70th percentile of normal body tissue samples, where the 15th per



UnigeneID Unigene Title






414386 X00442 Hs.75990 

428934 AF039401 Hs.194659 

416768 AA363733 Hs.1032 

446787 U67167 Hs.315 

431912 AI660552 Hs.356163 

437935 AW939591 Hs.5940 
407242 M18728 

AA296922 Hs.129778 

AH 10684 Hs.7645 









441031 
406685 
422578 



421582 
436217 
422260 



407007 
413719 
450685 
418007 
423673 
423371 
421964 
447400 
421100 
406741 



Hs.16098 
Hs.374281 
Hs.572 
Hs.350470 
Hs.107 



AJ243212 
X02544 
AI910275 
T53925 

AA315993 Hs.105484

AU076801 Hs.89436 

AA058357 Hs.74466 

NM_005814 Hs.143131 

T69078 Hs.76177 

U22961 Hs.184411 

BE439580 Hs.75498 

L15533 Hs.423 

M13509 Hs.83169 

BE003054 Hs.1695 

AU076819 Hs.1650 

X73079 Hs.288579 

AK000322 Hs.18457 

AW351839 Hs.124660

AA058357 Hs.74466

M82962 Hs.179704
M36803 
M31126 
W03754 
J05070 
AA315933 

AA025386 Hs.6131 

AF241254 Hs.17809( 

L00190 Hs.75599 



.50813 
151738 
120879 



422424 
428470 
417931 
435538 
430272 
451917 
421907 



411975 
408243 
422310 



430677 
422487 
444381 



AI186431 Hs.296638 

AC002301 Hs.184507 

W95642 Hs.82961 

AB011540 Hs.4930 

X04898 Hs.237658 

AW391351 Hs.50820 

BE018556 Hs.109358

AA298484 Hs.61265 

AU076405 Hs.29981 

NM_014479 Hs.145296 

AF098158 Hs.9329 

AI694413 Hs.373599 

AI916058 Hs.144583 

Y00787 Hs.624 

AA316622 Hs.98370 

X69532 Hs.2777 

BE463721 Hs.97101 

D30783 Hs.115263

AW205197 Hs.240951 

AU076785 Hs.430 

AW939252 Hs.192927

Z26317 Hs.359784 

AJ010901 Hs.198267

BE387335 Hs.283713 

W74001 Hs.55279 



NM_005143:Homo sapiens haptoglobin (HP), 
NM_001285:Homo sapiens chloride channel, 
NM_005507:Homo sapiens regenerating isle
NM_002457:Homo sapiens mucin 2, intestin
H< i r T // mi to " E4_H 
NM_033049:Homo sapiens mucin 13, epithet 
(locuslink)NM_002483:Homo sapiens carcin
NM_014471:Homo sapiens serine protease i
NM_005141:Homo sapiens fibrinogen, B bet
(locuslink)NM_002483:Homo sapiens carcin
NM_001804:Homo sapiens caudal type homeo
NM_020384:Homo sapiens claudin 2 (CLDN2) 
NM_007329:Homo sapiens deleted in malign 
Hs.572:orosomucoid 1
NM_003225:Homo sapiens trefoil factor 1 
NM_004467:Homo sapiens fibrinogen-^ i 
NM_032044:Homo sapiens regenerating gene 
NM_004063:Homo sapiens cadherin 17, Li c 
(lo uclink)NM_O0 690:Homo apiens carcii 
NM_005814:Homo sapiens glycoprotein A33 
NM_001633:Homo sapiens alpha-1-microglob
NM_000477:Homo sapiens albumin (ALB), mR
NM_004591:Homo sapiens small inducible c 



NM_002421:Homo sapiens matrix metallopro
NM_002426:Homo sapiens matrix metallopro
NM_000111:Homo sapiens solute carrier fa
NM_002644:Homo sapiens polymeric immunog
NM_017763:Homo sapiens hypothetical prot
Hs.124550:ESTs, Moderately similar to 21
(locuslink)NM_006690:Homo sapiens carcin
NM_005588:Homo sapiens meprin A, alpha ( 
NM_000613:Homo sapiens hemopexin (HPX), 
Hs.352054:pregnancy specific beta-1-glyc 
NM_017625:Homo sapiens intelectin (ITLN)
NM_004994:Homo sapiens matrix metallopro
Hs.120879:Homo sapiens, clone MGC:32C71
Hs.61311:ESTs, Weakly similar to S10590
NM_021804:Homo sapiens angiotensin I con 
(locuslink)NM_000488:Homo sapiens serine 

Hs.296638:prostate differentiation facto
Hs.184507:Homo sapiens, similar to Homol
Hs.82961:Homo sapiens, clone MGC:22588 1
Hs.4930:low density lipoprotein receptor 



Hs.50820:h : 



Hs.109358:ATPase, Class V, type 10B 
NM_138805:Homo sapiens family with seque
Hs.29981:solute carrier family 26 (sulfa
NM_014479:Homo sapiens ADAM-like, decysi
(locuslink)NM_012112:Homo sapiens chromo



NM_002215:Homosa| 



NM_014373:Homo sapiens putative G protei 
NM_001432:Homo sapiens epiregulin (EREG) 
(locuslink)NM_033120:Homo sapiens naked 
NM_002670:Homo sapiens plastin 1 (I isof
NM_017726:Homo sapiens protein phosphata 
NM_001943:Homo sapiens desmoglein 2 (DSG 
NM_018406:Homo sapiens mucin 4, tracheob 



NM_002639:Homo sapiens serine (or cystei 
417491 AW376842 Hs.1085
413936 AF113676 Hs.297681
422627 BE336857 Hs.118787
411825 AK000334 Hs.352415
446921 AB012113 Hs.16530
415214 AI445236 Hs.125124
414816 Y13709 Hs.77399
422106 D84239 Hs.111732
423803 NM_005709 Hs.132945
452281 T93500 Hs.28792 
447342 AI199268 Hs.19322 
443957 AA521049 Hs.353013 
403220 

408908 BE296227 Hs.250822 
449722 BE280074 Hs.23960 
425976 C75094 Hs.334514 
414617 AI339520 Hs.288817 
408983 NM_000492 Hs.663 
423445 NM_014324 Hs.128749
421379 Y15221 Hs.103982 
431301 AA502384 Hs.151529 
418318 U47732 Hs.84072 
415992 C05837 Hs.145807 
436972 AA284679 Hs.25640 
414987 M524394 Hs.294022 
431657 AI345227 Hs.105448 
424273 W40460 Hs.144442 
413916 N49813 Hs.75615 
409757 NM_001898 Hs.123114 
430204 AA618335 Hs.356664
426227 U67058 Hs.154299
420542 NM_000505 Hs.1321
414809 AI434699 Hs.77356
414639 X67055 Hs.76716
410418 D31382 Hs.63325
414590 NM_000506 Hs.76530
444151 AW972917 Hs.128749
438746 AI885815 Hs.184727
408704 M056635 Hs.5366
414798 AI286323 Hs.97411
436251 BE515065 Hs.296585
414753 AF158255 Hs.77225
428970 BE276891 Hs.194691
443991 NM_002250 Hs.10082
432978 AF126743 Hs.279884
425834 NM_001639 Hs.1957
432179 X75208 Hs.2913
408482 NM_000676 Hs.45743
430135 NM_000035 Hs.234234
426174 AA547959 Hs.115836 
403218 

411142 NM_014256 Hs.69009
449027 AJ271216 Hs.22880 
43X83 AL042759 Hs.191762 
431779 AW971178 Hs.266571 
421408 AI688223 Hs.91096 
430603 AA148164 Hs.247280 
422867 L32137 Hs.1584 
422539 AJ009936 Hs.118138 
424010 AL080188 Hs.137556 
428953 AA306610 Hs.348183 
457001 J03258 Hs.2062 
425983 AK000226 Hs.165619 
Hs.2253 



NM_004963:Homo sapiens guanylate cyclase 
NM_000295:Homo sapiens serine (or cystei 
Hs.118787:transforming growth factor, be
NM_017767:Homo sapiens solute carrier fa



418322 AA284166 Hs.8- 



AW630041 Hs.56937 

447472 AW207347 Hs.211101 

423164 AK000232 Hs.124835 

429345 R11141 Hs.199695

430680 AW138724 Hs.168974

414998 NM_002543 Hs.77729

417165 R80137 Hs.302738 
403221 

415000 AW025529 Hs.239812

433437 U20536 Hs.3280 

414052 AW578849 Hs.283552 

406673 M34996 Hs.198253 

418203 X54942 Hs.83758 

422714 AB018335 Hs.119387

410639 BE269047 Hs.65234 

411393 AW797437 Hs.69771 

431193 AW749505 Hs.296770 

428450 NM_014791 Hs.184339



NM_004442:Homo sapiens EphB2 (EPHB2), tr 
NM_001265:Homo sapiens caudal type homeo 
NM_003890:Homo sapiens IgG Fc binding pr 
(locuslink)NM_005709:Homo sapiens PDZ-73
Hs.28792:Homo sapiens cDNA FLJ11041 fis,
Hs.19322:Homo sapiens, Similar to RIKEN
Hs.353013:chromosome 20 open reading fra

(locuslink)NM_003158:Homo sapiens serine
Hs.23960:cyclin B1 

NM_025257:Homo sapiens chromosome 6 open 
(locuslink)NM_025130:Homo sapiens hypoth
NM_000492:Homo sapiens cystic fibrosis t 
NM_014324:Homo sapiens alp' 
NM_005409:Homosa|' 
Hs.151529:ESTs 
NM_004616:Homosa| 
Hs.145807:hypothetical protein FLJ13593 
Hs.25640:claudin 3 

NM_032865:Homo sapiens hypothetical prot 
Hs.105448:protein kinase, lysine deficie 
NM_003561:Homo sapiens phospholipase A2, 
NM_000483:Homo sapiens apolipoprotein C- 
NM_001898:Homo sapiens cystatin SN (CST1
Hs.356664:hypothetical protein FLJ32
(locuslink)NM_005242:Homo sapiens coagul
NM_000505:Homo sapiens coagulation facto
Hs.77356:transferrin receptor (p90, CD71
NM_002217:Homo sapiens pre-alpha (globul
NM_019894:Homo sapiens transmembrane pro
NM_000506:Homo sapiens coagulation facto
(locuslink)NM_014324:Homo sapiens alpha-
Hs.184727:ESTs, Weakly similar to T4573B
NM_139053:Homo sapiens epidermal growth
Hs.97411:hypothetical protein MGC12335
(locuslink)NM_006392:Homo sapiens nucleo
NM_006437:Homo sapiens ADP-ribosyltransf
NM_003979:Homo sapiens retinoic acid ind
NM_002250:Homo sapiens potassium interme
NM_013238:Homo sapiens DNAJ domain-conta
Hs.1957:amyloid P component, serum
NM_004443:Homo sapiens EphB3 (EPHB3), mR
NM_000676:Homo sapiens adenosine A2b rec
NM_000035:Homo sapiens aldolase B, fruct
Hs.115838:ESTs 

NM_014256:Homo sapiens UDP-GlcNAc:betaGa
NM_005700:Homo sapiens dipeptidylpeptida
Hs.191762:hypothetical protein MGC20256
(locuslink)NM_001645:Homo sapiens apolip
NM_052816:Homo sapiens tripartite motif-
Hs.247280:chromosome 20 open reading fra
Hs.1584:cartilage oligomeric matrix prot
NM_033013:Homo sapiens nuclear receptor
NM_033100:Homo sapiens MT-protocadherin
NM_003823:Homo sapiens tumor necrosis fa 

fi rtjinnD(125-dihydrox>vilami 
NM_031265:Homo sapiens mucin and cadheri
Hs.2253:complement component 2 
NM_005192:Homo sapiens cyclin-dependent 
NM_021978:Homo sapiens suppression of tu 
Hs.211101:ESTs

NM_019062:Homo sapiens hypothetical prot 
Hs.199695:hypothetical protein MAC30 
Hs.168974:ESTs

NM_002543:Homo sapiens oxidised low dens 
Hs.302738:Homo sapiens cDNA: FLJ21425 fi

Hs.239812:serologically defined breast c
NM_001226:Homo sapiens caspase 6, apopto
Hs.283552:hypothetical protein BC016153
Hs.198253:major histocompatibility compl
NM_001827:Homo sapiens CDC28 protein kin
NM_014698:Homo sapiens KIAA0792 gene pro
(locuslink)NM_017895:Homo sapiens DEAD/H
NM_001710:Homo sapiens B-factor, properd 
Hs.296770:KIAA1719 protein 
NM_014791:Ho 
425873 NM_013390 Hs.160417

422765 AW409701 Hs.1578 

445109 AF039915 Hs.12330 

422535 AA311914 Hs.154578 

422609 Z46023 Hs.118721 

431548 AI834273 Hs.9711 



NM_002447 Hs.2942 

AA601038 Hs.191797 

AF086401 Hs.293847 

BE548446 Hs.321579 

L40904 Hs.100724 

AI493046 Hs.146133 

X63629 Hs.2877 

AI492170 Hs.77917 

AF127026 Hs.5394 

AU076442 Hs.117938 

AK001455 Hs.5198 



408194 
4395B0 
443464 






Z29067 Hs.2236 

NM_014252 Hs.78457 

M447849 Hs.288660 

Y00272 Hs.334562 

R34008 Hs.239727 

AI267592 Hs.75761 

AI675419 Hs.164464

AW362B67 Hs.302738 

BE264974 Hs.6566 

M446644 Hs.692 

AW411425 Hs.180655

AW247529 Hs.6793 

AA088767 Hs.83883 

NM_003963 Hs.184194 

AF207069 Hs.153357 

NM_012079 Hs.288627

M628967 Hs.115274

AA878200 Hs.118727

AW290997 Hs.190153

AF027208 Hs.112360

NM_002153 Hs.155109

AW136551 Hs.181245 

AW328081 Hs.6817 

AI086138 Hs.204044 

AW952792 Hs.334612 

AL120168 Hs.78919 

AL117435 Hs.49725

NM_004605 Hs.376147

BE048051 Hs.37054

BE302795 Hs.105097

AW190902 Hs.40098

NM_002318 Hs.83354

AL136877 Hs.50758

AW997938 Hs.90786

AI015709 Hs.172089

T82427 Hs.194101

BE396495 Hs.159428



435327 BE3C1871 Hs.4867 



441384 
42G479 
407944 



424825 
429033 
411257 



409956 AW103364 

439659 AW970780 

425397 J04088 

413753 U17760 

428698 AA852773 



441623 AA315805 

417866 AW067903 

442013 AA506476 

425247 NM_005940 

428385 AF112213 

412612 NM_000047

425280 U31519 

411263 BE297802 

452017 AF109302 

452721 AJ269529 

450737 AW007152 

456906 AF117646

412974 R18978 



Hs.160417:transmembrane protein 2 
NM_001168:Homo sapiens baculoviral IAP r
NM_001247:Homo sapiens ectonucleoside tr
Hs.154578:Homo sapiens mRNA for FLJ00256
NM_000434:Homo sapiens sialidase 1 (lyso
NM_017515:Homo sapiens novel protein (HS
NM_005224:Homo sapiens dead ringer-like 



Hs.191797:ESTs 



NM_021095:Homo sapiens solute carrier fa 



i \ Jjni , 93 Homo sapiens cadherin 3, type 
NM_006002:Homo sapiens ubiquitin carboxy
NM_005379:Homo sapiens myosin IA (MYO1A)
NM_000494:Homo sapiens collagen, type XV
Hs.5198:Down syndrome critical region ge
Hs.12680:Homo sapiens cDNA FLJ10196 fis,
I i M.: MA (never in mitosis gene a)-r 
NM_014252:Homo sapiens solute carrier fa
Hs.288660:Homo sapiens cDNA: FLJ22182 fi
NM_001786:Homo sapiens cell division cyc
NM_024422:Homo sapiens desmocollin 2 (DS
NM_003137:Homo sapiens SFRS protein kina
Hs.164464:Homo sapiens, clone MGC:23656 
Hs.302738:Homo sapiens cDNA: FLJ21425 fi 



NM_002354:Homo sapiens tumor-associated 
Hs.194346:Spir-2 protein 
(locuslink)NM_004217:Homo sapiens serine
Hs.6793:platelet-activating factor acety
NM_020182:Homo sapiens transmembrane, pr
NM_003963:Homo sapiens transmembrane 4 s
NM_001084:Homo sapiens procollagen-lysin
NM_012079:Homo sapiens diacylglycerol O-
Hs.115274:Indian hedgehog homolog (Droso
Hs.118727:Homo sapiens cDNA FLJ33803 fis
Hs.190153:Homo sapiens cDNA FLJ33988 fis
Hs.112360:prominin-like 1 (mouse) 
NM_002153:Homo sapiens hydroxysteroid (1 
-ts.181 c , .n , pien cDHAFI II '2 i 
NM_033453:Homo sapiens inosine triphosph 
Hs.204044:ESTs 

NM_003094:Homo sapiens small nuclear rib 
NM_021083:Homo sapiens Kell blood group
Hs.49725:DKFZP434l215 protein
Hs.376147:Homo sapiens cDNA FLJ39099 fis
Hs.37054:ephrin-A3
Hs.105097:thymidine kinase 1, soluble



Hs.727 

Hs.59483 

Hs.156346 

Hs.75517 

Hs.334838 

Hs.179657 

Hs.348710 

Hs.82772 

Hs.375009 

Hs.155324 

Hs.184062

Hs.74131 

Hs.1872

Hs.69360 

Hs.27495 

Hs.301871 

Hs.63325 

Hs.156637

Hs.75105 

Hs.166468 



NM_002318:Homo sapiens lysyl oxidase-lik 
Hs.50758:SMC4 structural maintenance of 
NM_003786:Homo sapiens ATP-binding casse 
Hs.172089:pro-oncosis receptor inducing 
Hs.194101:Homo sapiens cDNA: FLJ20869 fi
NM_138761:Homo sapiens BCL2-associated X 
Hs.101282:Homo sapiens mRNA; cDNA DKFZp4 
Hs.4867:mannosyl (alpha-1,3-)-glycoprote

Hs.296301:hypothetical protein MGC13170
(locuslink)NM_001202:Homo sapiens bone m
NM_003491:Homo sapiens ARD1 homolog, N-a 
NM_002192:Homo sapiens inhibin, beta A ( 



NM_001067:Homo sapiens 
NM_000228:Ho 



in,beta3( 



Hs.334838:KIAA1866 protein 
NM_002659:Homo sapiens plasminogen activ 
Hs.348710:Homo sapiens, clone IMAGE4242 
Hs.82772:collagen, type XI, alpha 1 
Hs.375009:Homo sapiens mRNA; cDNA DKFZp6



_„i-Iike6(m 
Hs.27495:prostate cancer associated prot 
Hs.301871:solute carrier family 37 (glyc
Hs.63325:transmembrane protease, serine
NM_012116:Homo sapiens Cas-Br-M (murine)
NM_006579:Homo sapiens emopamil binding
NM_004708:Homo sapiens programmed cell d
416065 BE267931 Hs.78996
403739 

417576 AA339449 Hs.82285 
405484 

409162 H25530 

430514 AA318501 

414695 BE439915 

411165 NM_000169 

421975 AW961017 

431836 AF178532

412133 U83460 

412870 N22788 

413278 BE563085 
400130 

430696 AA531276 

443802 AW504924 

407777 AA161071 



NM_002592:Homo sapiens proliferating eel 
NM_000819:I 



Hs.50868 
Hs.241587 
Hs.76913 



Hs.271411 
Hs.104557 
Hs.82407 



421943 BE616520 

422293 X94453 

4194BB AA316241 

425123 AW205274 

447343 AA256641 

413254 U40272 

413950 AA249096 

409453 AI885516 

452888 AW955454 

421910 NM_014586 

434263 N34895 

444700 NM_003645 

433662 W07162 

418526 BE019020 

425998 AU076629 

422616 BE300330 



424441 X14850 

426378 U80082 

409636 M305729 

407786 AA687538 

431945 AW000827 



453082 H18835 
400529 

412869 AA290712 

427239 BE270447 

429638 AI916662 

445462 AA378776 

415003 M11437 

443639 BE269042 

400290 H18836 

431350 AI192528 

430154 AW583058 

435099 AC004770 

418216 AA662240 

414907 X90725 



Hs.367942 
Hs.343912 
Hs.114366 



427490 
427333 
434203 
414806 
456362 



Hs.109437 
Hs.79187 
Hs.11729 
Hs.150826 
Hs.86838 
Hs.165950 
Hs.118725
Hs.42331
Hs.50964
Hs.170195
Hs.26557
Hs.147097
Hs.169600
Hs.18272
Hs.38972
Hs.11962
Hs.195155 

Hs.31608 

Hs.82407 

Hs.356512 

Hs.211577 

Hs.288649 

Hs.77741 

Hs.9651 

Hs.31608 

Hs.164537

Hs.234726 

Hs.4756 

Hs.283099 

Hs.77597 

Hs.75184 

Hs.371581 



AF035959 Hs.24879 

BE395085 Hs.334762 

AI702038 Hs.100057

AW411491 Hs.75069 

295152 Hs.178695 

AF067797 Hs.176658 

BE262677 Hs.283558 

D14694 Hs.77329 

AW973003 Hs.179909

BE243834 Hs.50441 

AU076916 Hs.5398 

AW372884 Hs.240770 

AL038843 Hs.374530 

AW301003 Hs.51483 



428376 AF119665 Hs.184011

431890 X17033 Hs.271986 

446696 AF279265 Hs.298476 

419378 R24922 Hs.90078 

448140 AF145761 Hs.20450 



Hs.50868:solute carrier family 22 (organ 
NM_021246:Homo sapiens lymphocyte antige 
Hs.75913:proteasome (prosome, macropain)
NM_000169:Homo sapiens galactosidase, al 
(if, uslink)NM_0 (531 Homo apien polii 
NM_138992:Homo sapiens beta-site APP-cle
NM_001859:Homo sapiens solute carrier fa
NM_022059:Homo sapiens chemokine (C-X-C
Hs.833:interferon-stimulated protein, 15 
NM_001746:Homo sapiens calnexin (CANX), 
Hs.59509:ESTs, Weakly similar b similar 
Hs.9805:exportin 5 
Hs.71465:squalene epoxidase 
Hs.367942:Homo sapiens, clone IMAGE: 4701 
NM_033504:Homo sapiens CAC-1 (CAC-1), mR
Hs.114366:pyrroline-5-carboxylate synthe
NM_006993:Homo sapiens nucleophosmin/nuc 
NM_000303:Homo sapiens phosphomannomutas 
Hs.236894:ESTs, Highly similar to S02392 
NM_004135:Homo sapiens isocitrate dehydr
Hs.3279J.Homo sapiens cDNA FU31108 fa, 
Hs.95612:ESTs 

NM_004093:Homo sapiens ephrin-B2 (EFNB2) 
NM_014586:Homo sapiens hormonally upregu
Hs.79187:coxsackie virus and adenovirus 
NM_003645:Homo sapiens fatty-acid-Coenzy 
NM_020387:Homo sapiens RAB25, member RAS 
NM_004207:Homo sapiens solute carrier fa
NMJ0201 1:Homo sapiens fibroblast growth 
NM_012248:Homo sapiens selenophosphate s 
Hs.42331:ephrin-A4 

(locuslink)NM_001712:Homo sapiens carcin
Hs.170195:bone morphogenetic protein 7 (
NM_007183:Homo sapiens plakophilin 3 (PK
Hs.147097:H2A histone family, member X
Hs.169600:KIAA0826 protein
(locuslink)NM_030674:Homo sapiens solute
NM_005727:Homo sapiens tetraspan 1 (TSPA
NM_030766:Homo sapiens apoptosis regulat
Hs.195155:solute carrier family 38, mem

(locuslink)NM_017636:Homo sapiens transi 

Hs.82407:chemokine (C-X-C motif) ligand
Hs.356512:ESTs, Weakly similar to UBCA_A 
(locuslink)NM_004966:Homo sapiens klned 
(locuslink)NM_024051:Homo sapiens hypoth 
Hs.77741:kininogen



NM_004111:Homo sapiens flap structure-sp 
Hs.283099:AF15q14 protein 
NM_000998:Homo sapiens ribosomal protein 
Hs.75184:chitinase 3-like 1 (cartilage g
Hs.371581:ESTs 



NM_016086:Homo sapiens map kinase phosph 
Hs.24879:phosphatidic acid phosphatase t 
locuslink)NM_032832:l fens hypotl 

Hs.100057:serine/threonine kinase 35
Hs.75069:serine hydroxymethyltransferase
NM_002754:Homo sapiens mitogen-activated
NM_001169:Homo sapiens aquaporin 8 (AQP8
NM_018509:Homo sapiens hypothetical prot 
(locuslink)NM_014754:Homo sapiens phosph 
1 i , HNM_024B31 Homo sapiens nuclea 
NM_015936:Homo sapiens CGI-04 protein (L 
Hs.5398:guanine monophosphate synthetase
Hs.240770:nuclear cap binding protein su
Hs.374530:Homo sapiens cDNA: FLJ23602 fi
Hs.51483:Homo sapiens, Similar to RIKEN
NM_003212:Homo sapiens teratocarcinoma-d
Hs.184011:pyrophosphatase (inorganic)
NM_002203:Homo sapiens integrin, alph
NM_022911:Homo sapiens solute carrier fa
Hs.90078:nucleotide-sugar transporter si 
NM_020125:Homo sapiens E lymphocyte acti 
452679 Z42387 Hs.83883

432636 AA340864 Hs.278562 

433020 AI375726 Hs.227152 

425003 AF119046 Hs.154149 

413095 AA494359 Hs.30715 

417386 AL037228 Hs.301957 

409152 AA176585 Hs.194346 
404826 

453111 AB014598 Hs.31720 

409964 AW368226 Hs.67928 

446342 BE298665 Hs.14846 

452098 AI858183 

428072 BE258602 Hs.182366



(locuslink)NM_020162:Homo sapiens transm 
NM_001307:Hcmo sapiens claudin 7 (CLDN7) 
NM_016391:Homo sapiens hypothetical prot 
NM_014481:Homo sapiens APEX nuclease (ap 



449667 
406684 
424534 
426031 
417526 

424154 



416084 
420162 
413476 
412115 
413588 
431512 
435777 
431211 
453258 
414812 
423068 
443180 



444184 
412641 
413781 
409213 



AI469355 Hs.127310

AB023227 Hs.23860 

X16354 Hs.50964 

D87682 Hs.150275

AA295251 Hs.166066

AA568906 Hs.82240 

N28522 Hs.8935 

AF026004 Hs.141660 

BE159028 Hs.279704 

AI660842 Hs.110915

AK000609 Hs.103808 

H72187 Hs.356668 

L16991 Hs.79006 

BE378432 Hs.95577 

U25849 Hs.75393 

AK001763 Hs.73239 

AA971014 Hs.75432 

BE270734 Hs.2795 

AW419202 Hs.286192 

M86849 Hs.323733 

AW293134 Hs.32597 

X72755 Hs.77367 

M25629 Hs.123107 

R15875 Hs.258576 

U34038 Hs.154299 

AB037784 Hs.22941 

T87841 Hs.282990 



M16660 
J05272 
U61412 






74335 
Hs.850 
Hs.51133 
Hs.18720 
BE543146 Hs.281434 
R58665 Hs.46847 
NM_005764 Hs.271473 
AF102850 Hs.227933 



407971 AI469117 Hs.62918 

424865 AF011333 Hs.153563 

432211 BE274530 Hs.273333 

436014 AF281134 Hs.283741 



407722 BE252241 Hs.38041 

417129 AI381800 Hs.300684 

409463 AI458165 Hs.17296 

407137 T97307 

464390 AB020713 Hs.56966 

438485 W57578 Hs.378718 



AI272144 Hs.236522

W94824 Hs.11565

AW607831 Hs.38738

T28407 Hs.81564

BE267057 Hs.325321 

AW406289 Hs.96593 

AL049761 Hs.155140 

AA306007 Hs.59461 

NM_000312 Hs.2351

AF151812 Hs.169992 

BE277414 Hs.5947 

AW594506 Hs.104830



420319 
425209 
410174 
429023 



NM_014799:Homo sapiens hephaestin (HEPH) 
Hs.67928:ESTs 

Hs.14846:Homo sapiens mRNA; cDNA DKFZp56 
BF755039:QVO-CT0583-18100(W28-f07 CT058 
NM_016292:Homo sapiens heat shock protei 
NM_025217:Homo sapiens UL16 binding prot 
Hs.29002:KIAA1706 protein 
NM_030622:Homo sapiens cytochrome P450, 
(locuslink)NM_144624:Homo sapiens kinase 
Hs.23860:KIAA1010 protein
(locuslink)NM_001712:Homo sapiens carcin
Hs.150275:KIAA0241 protein
(locuslink)NM_006697:Homo sapiens oispla
Hs.82240:syntaxin 3A

NM_014298:Homo sapiens quinolinate phosp
NM_004366:Homo sapiens chloride channel
Hs.279704:chromatin accessibility comple
NM_021258:Homo sapiens interleukin 22 re
NM_017896:Homo sapiens chromosome 20 ope
(locuslink)NM_005274:Homo sapiens guanin
NM_012145:Homo sapiens deoxythymidylate
NM_052984:Homo sapiens cyclin-dependent
NM_004300:Homo sapiens acid phosphatase
Hs.73239:hypothetical protein FLJ10901
NM_000884:Homo sapiens IMP (inosine mono
Hs.2795:lactate dehydrogenase A
NMJ 92:Homc P r e hosphal 
ri 23 3:g pjun ion prot n beta 
NM_005977:Homo sapiens ring finger prote 
NM_002416:Homo sapiens monokine induced 
NM_002257:Homo sapiens kallikrein 1, ren
NM_012129:Homo sapiens claudin 12 (CLDN1
NM_005242:Homo sapiens coagulation facto
Hs.22941:KIAA1363 protein
(locuslink)NM_033550:Homo sapiens chromo
Hs.74335:heat shock 90kD protein 1, beta
(locuslink)NM_000883:Homo sapiens IMP (i
NM_005975:Homo sapiens PTK6 protein tyro
NM_004208:Homo sapiens programmed cell d
Hs.281434:Homo sapiens cDNA FLJ31373 fis
NM_016614:Homo sapiens TRAF and TNF rece
Hs.271473:epithelial protein up-regulate
NM_013338:Homo sapiens Alg5, E.cerevisi 
NM_003129:Homo sapiens squalene epoxidas
Hs.131810:Homo sapiens cDNA FLJ35976 fis
Hs.301350:FXYD domain-containing ion tra
lls.62918:CDC91ttil ti sionc d c 1 I 
NM_002349:Homo sapiens lymphocyte antige 
Hs.273333:hypothetical protein FLJ10966
NM_020158:Homo sapiens exosome component 
Hs.5097:synaptogyrin 2 
NM_003904:Homo sapiens zinc finger prote 
NM_003921:Homo sapiens B-cell CLL/lympho
(locuslink)NM_004358:Homo sapiens cell d



NM_O03681:Hc 



(locuslink)NM_024923:Homo sapiens hypoth
Hs.378718:Homo sapiens cDNA FLJ33433 fis
NM_012394:Homo sapiens prefoldin 2 (PFDN
NM_003286:Homo sapiens topoisomerase (DN
Hs.75323:prohibitin 
Hs.236522:DKFZP434P106 protein 
NM_080748:Homo sapiens chromosome 20 ope 
NM_014343:Homo sapiens claudin 15 (CLDN1
NM_002619:Homo sapiens platelet factor 4
Hs.325321:WD repeat domain 18
NM_019034:Homo sapiens ras homolog gene
NM_001895:Homo sapiens casein kinase 2, 
Hs.59461:DKFZP434C245 protein 
NM_000312:Homo sapiens protein C (inacti 
NM_015966:Homo sapiens serologically def 
NM_005370:Homo sapiens mel transforming






AU077141 Hs.374548

NM_001197 Hs.155419

AL080235 Hs.35861

AB023182 Hs.184523

AI638871 Hs.378965

AB020643 Hs.183006

NM_004460 Hs.418

N99692 Hs.75227

NM_004987 Hs.112378

AW296135 Hs.267659

AW377424 Hs.205126

AI796390 Hs.210667

R94038 Hs.374664

W21223 Hs.151734

AF111170 Hs.306165

BE244231 Hs.84038

AW052006 Hs.374973



AI678727 Hs.378970

U85768 Hs.247838

NM_014742 Hs.79305 

AA296363 Hs.121520 

BE243313 Hs.334851 
BE072881 

U63630 Hs.155637

BE397753 Hs.14623 

BE266322 Hs.211374 

M81933 Hs.1634

AW977382 Hs.15898 

NM_002951 Hs.75722 

U95031 Hs.102482 

R52557 Hs.91579 

AL118990 Hs.373554

U11700 Hs.84999 

AI580053 Hs.109007 

AA570256 Hs.348504 

AW411066 Hs.274351 

BE173977 Hs.10098 

T47364 Hs.278613 

T98226 Hs.171952 

AA194952 Hs.36093

AW015211 Hs.153799 

AW505308 Hs.75812 

AI055776 Hs.133397 

X59136 Hs.166110 

AA316181 Hs.61635 

NM_004341 Hs.154868

AL110291 Hs.99364 

L12350 Hs.108623 

AU076609 Hs.2934 

AF208234 Hs.695 

AL080192 Hs.101282 

AI754813 Hs.146428 

AB025237 Hs.388 

BE391929 Hs.8752 



BE314877 
AF213457 
X91195 



Hs.1499 

Hs.42644 

Hs.24553 

Hs.44234 

Hs.100623 

Hs.286192 

Hs.75939 



NM_006855 Hs.250696

AV660976 Hs.3569

AI732374 Hs.339827

D30857 Hs.82353

NM_005402 Hs.6906

AA151647 Hs.68877

AF034102 Hs.32951 

W79485 Hs.173980 

AI878909 Hs.17883 



Hs.923:single-stranded DNA binding prote



Hs.: 

NM_001197:Homo sapiens BCL2-interacting
Hs.35861:Ras-induced senescence 1
Hs.184523:serine/threonine kinase 38 lik
Hs.378965:Homo sapiens cDNA FLJ37658 fis
Hs.183006:likely homolog of mouse hepari
(locuslink)NM_004460:Homo sapiens fibrob
NM_005u02:Homo sapiens NADH dehydrogenas
(locuslink)NM_004987:Homo sapiens LIM an

NM_006113:Homo sapiens vav 3 oncogene (V
Hs.205126:Homo sapiens cDNA: FLJ22667 f
Hs.210667:ESTs 

NM_005538:Homo sapiens inhibin, beta C ( 
Hs.151734:nuclear transport factor 2
Hs.306165:ESTs, Highly similar to unknot 
NM_015937:Homo sapiens CGI-06 protein (L 
NM_004697:Homo sapiens PRP4 pre-mRNA pro 

Hs.378970:Homo sapiens cDNA FLJ35102 fis
NM_002991:Homo sapiens small inducible c
Hs.79305:KIAA0255 gene product
Hs.121520:Homo sapiens cDNA FLJ35792 fis
Hs.334861:LIM and SH3 protein 1
BE072881:RC2-BT0548-200300-012-e09 BT054
NM_006904:Homo sapiens protein kinase, D
Hs.14623:interferon, gamma-inducible pro
(!ocu ink)NMJ45051:l mc cm hvpur 
ls.1G3 1 n cycle 25^ 
Hs.15898:2,4-dienoyl CoA reductase 2, pe
NM_002951:Homo sapiens ribophorin II (RP
Hs.102482:mucin 5, subtype B, tracheobro
NM_033416:Homo sapiens similar to HYPOTH 
(:ocuslink)NM_13078&Homo sapiens alpha- 
NM_000053:Homo sapiens ATPase, Cu++ tran
Hs.109007:Homo sapiens, Similar to LOC16
Hs.348504:hypothetical protein BC014072
NM_016032:Homo sapiens zinc finger, DHHC
NM_019082:Homo sapiens putative nucleola



Hs.133397:ESTs 
Hs.1561 ^immunoglobulin kappa constant 
NM_012449:Homo sapiens six transmembrane
NM_004341:Homo sapiens carbamoyl-phospha



)mo sapiens rib 
i B (sfefin B) 

Hs.101282.Homo sapiens mRNA; cDNA DKFZp4 



NM_005526:Homo sapiens heat shock transc
Hs.42644:thioredoxin-like 2
> i J i ^ 3 H - jp hypotl 
NM_018965:Homo sapiens triggering recept 
NM_138689:Homo sapiens protein phosphata 
NM_032192:Homo sapiens protein phosphata 
NM_012474:Homo sapiens uridine monophosp 
Hs.118638:non-metastatic cells 1, protei
Hs.146170:hypothetical protein FLJ22969
NM_006655:Homo sapiens KDEL (Lys-Asp-Glu 



Hs.339627:ESTs, Weakly similar to pa
NM_006404:Homo sapiens protein C recepto 
NM_005402:Homo sapiens v-ral simian leuk 

Hs.68877:cytochrome b-245, alpha polypep 
NM_001532:Homo sapiens solute carrier fa
H I nwlo rn trix protein NMP200 
NM_002707:Homo sapiens protein phosphata 
NM_006114:Homo sapiens translocase of ou 
NM_000982:Homo sapiens ribosomal protein
NM_002266:Homo sapiens karyopherin alpha 
446356 AI816736 Hs.14896

419170 BE002798 Hs.287850 

426858 NM_004182 Hs.172791

418558 AW082266 Hs.86131 

444706 AK000398 Hs.11747 

444734 NM_001360 Hs.11806

424482 BE268621 Hs.149155 

438203 BE540090 Hs.7345 

409685 AK000002 Hs.55879 

418681 AA287786 Hs.23449 

419705 AW368634 Hs.154331 

420185 NM_015925 Hs.95697

413835 AI272727 Hs.249163 

448153 Y10805 Hs.20521 

425274 BE281191 Hs.155462 

435472 AW972330 Hs.283022 

451932 AA360954 Hs.27268 

420085 AI741909 Hs.44680 

412006 AW451618 Hs.290216 

424954 NM_000546 Hs.1846



AF141332 Hs.200333 

X57522 Hs.352018 

AW246428 Hs.75355 

AL359575 Hs.23765 

AI567839 Hs.151714

AF078866 Hs.284296

BE185030 Hs.241305 

AA122237 Hs.81874 

AA380731 Hs.84 

U52112 Hs.182018 

AF167572 Hs.12912
Hs.98614
Hs.82045
Hs.73798 

AI859390 Hs.288940 

NM_013442 Hs.3439

AA926960 Hs.348669 

T35958 Hs.107614

AI420156 Hs.326733

AL036662 Hs.144949 

X78342 Hs.77313 

AA173992 Hs.7956 

AF290512 Hs.58215 

AK002127 Hs.155313

AF104032 Hs.184601 

NM_014448 Hs.87435

BE265254 Hs.343258 

AI022813 Hs.92679 

BB509S4 Hs.89399 

AA088177 Hs.172870 

AI805664 Hs.31731 

BE265839 Hs.12126

AW451034 Hs.326525 

AI816209 Hs.95867 

AB012193 Hs.183874

AA296696 Hs.333418 

AW409747 Hs.75612 

X03068 Hs.73931 

AK000524 Hs.39850 

AW271217 Hs.281434 



BE260964 



424089 
414788 
442315 
449944 
425244 
428484 
418703 
436415 
447151 
418862 
410636 



BE386750 Hs.86978 

AI434780 Hs.4248 

AI064707 Hs.301226 

AL109729 Hs.99364 

R07566 Hs.73817 



421779 
400262 
418803 



AW732918 Hs.182490 

AI879159 Hs.108219
Hs.75309 

U50079 Hs.88556 

BE613348 Hs.356392 

2 AW138207 Hs.146170 
5 AA524545 Hs.224630 

3 AW362945 Hs.162459
t AU077013 Hs.28757 

1 BE243136 Hs.86947 

2 AF161254 Hs.106196
AA360154 Hs.177415
AF119666 Hs.23449



Hs.14896:zinc finger, DHHC domain contai
NM_002219:Homo sapiens integral membrane
NM_004182:Homo sapiens ubiquitously-expr
Hs.861 r (F FRSF6) >c j de 
'I J ii L j r J I77 )8:Homo pier in 
NM_001360:Homo sapiens 7-dehydrocholeste 
(locu k)NMJ r m n ;ol 
ri 7345:MAD1 mitol a defic it-li 
(locuslink)NM_033450:Homo sapiens multid 
Hs.23449:insulin receptor tyrosine kinas
Hs.154331:ESTs

Hs.95697:liver-specific bHLH-Zip transcr
NM_024306:Homo sapiens fatty acid hydrox 
NM_001536:Homo sapiens HMT1 hnRNP methyl 
Hs.155462:MCM6 minichromosome maintenanc 
NM_018643:Homo sapiens triggering recept 
Hs.27268:Homo sapiens cDNA: FLJ21933 fis
Hs.44680:hypothetical protein FLJ20979
Hs.290216:ESTs 

NM_000546:Homo sapiens tumor protein p53 
NM_020470:Homo sapiens putative transmem 
NM_052886:Homo sapiens mal, T-cell diffe
NM_001961:Homo sapiens eukaryotic transl
NM_018690:Homo sapiens apolipoprotein B4
NM_000593:Homo sapiens transporter 1, AT
NM_003348:Homo sapiens ubiquitin-conjuga 
Hs.23765: membrane metallo-endopeptidase- 
(locuslink)NM_033405:Homo sapiens peroxi
NM_033161:Homo sapiens surfeit 4 (SURF4) 
(b u i:nk)MM_00f 170:Homo sapiens tripar 
NM_002413:Homo sapiens microso t I 
NM_000206:Homo sapiens interleukin 2 rec 
Hs.182018:interleukin-1 receptor-associa 
NM_006109:Homo sapiens SKB1 homolog (E. 
NM_004587:Homo sapiens ribosome binding 
Hs.82045:midkine (neurite growth-promoti
Hs.73798:macrophage migration inhibitory
NM_021259:Homo sapiens transmembrane pro 
Hs.3439:stomatin (EPB72)-like 2
Hs.348669:CDC28 protein kinase 1
Hs.107614:DKFZP564l1171 protein 
NM_052858:Homo sapiens similar to RIKEN 



Hs.77313:cyclin-dependent kinase (CDC2-I 
Hs.7956:ESTs 

(locuslink)NM_033046:Homo sapiens rhotek 
NM_022105:Homo sapiens death associated 
(locuslink)NM_003486:Homo sapiens solute
Hs.87435:Rho guanine exchange factor (GE 
NM_006191:Homo sapiens proliferation-ass
(locu linl til'Jb- kHomo piens kine i 
NM_005176:Homo sapiens ATP synthase, H+ 
Hs.172870:KIAA1913 protein 
(locuslink)NM_012094:Homo sapiens peroxi
NM_018487:Homo sapiens hepatocellular ca 
NM_001669:Homo sapiens arylsulfatase D ( 
(locuslink)NM_024112:Homo sapiens chromo
NM_003589:Homo sapiens cullin 4A (CUL4A) 
(locuslink)NM_014164:Homo sapiens FXYD d
NM_006819:Homo sapiens stress-induced-ph
Hs.73931:major histocompatibility comple
Hs.39850:uridine kinase-like 1
Hs.281434:Homo sapiens cDNA FLJ31373 fis
Hs.131741:ESTs 
Hs.86978:prolyl endopeptidase
Hs.4248:Homo sapiens PP3781 mRNA, comple
Hs.301226:Homo sapiens, clone IMAGE3456 
Hs.99364:abhydrolase domain containing 1 
NM_002983:Homo sapiens small inducible c 
NM_003726:Homo sapiens src family associ
Hs.182490:leucine-rich PPR-motif contain 
NM_004626:Homo sapiens wingless-type MMT
NM_001961:Homo sapiens eukaryotic transl
NM_004964:Homo sapiens histone deacetyla 
! )2:ESTs, Highiy simil 3-| 
NM_022842:Homo sapiens hypothetical prot
Hs.224630:Homo sapiens cDNA FLJ33318 fis
Hs.162459:ESTs 



NM_001109:Homo sapiens a disintegrin and
(locuslin NM_016 0 >mc pis 6D6an 

M-00199 Homo jpion Finkcl 

NM_018842:Homo sapiens insulin receptor



AW749865 Hs.11 

418181 U37012 Hs.83727 

434826 AF155661 Hs.22265 

428734 BE303044 Hs.192023 

423022 AA320525 Hs.201076 

427648 AI376722 Hs.180062 
404240 

408989 AW361666 Hs.49500 

452835 AK001269 Hs.30738 

446506 AI123118 Hs.15159 
402260 

429671 BE379335 Hs.211594 

409267 NM_012453 Hs.52515

445937 AI452943 Hs.321231 

431243 U46455 Hs.252189 

422611 AA158177 Hs.118722 

427647 W19744 Hs.180059 

449644 AW960707 Hs.148324 

448719 AA033627 Hs.21858 

417457 AA378907 Hs.349326 

454128 AL031259 Hs.367900 



426410 BE298446 
426812 AF105365 Hs.172613 
435750 AB029012 Hs.4990 



421905 AI660247 



!s.32699 



444301 
451455 
417777 
417144 
450825 



448847 
431236 
414702 
430024 



. c finger protein 264 

013291:Homo sapiens cleavage and poly 
Hs.22265:pyruvate dehydrogenase phosphat
NM_003757:Homo sapiens eukaryotic transl 
Hs.201076:ESTs 



Hs.49500:KIAA0746 protein 
NM_018087:Homo sapiens hypothetical prot 
(locuslink)NM_016326:Homo sapiens chemok 

Hs.211594:proteasome (prosome, macropain



427268 X78520 Hs.174139 

412525 AA581439 Hs.152328

422813 AV656571 Hs.121068 

441406 Z45957 Hs.7837 

408806 AW847814 Hs.75608 

435730 AB020635 Hs.4984 

432871 NM_016142 Hs.279617

447783 AF054178 Hs.19561 



AF038961 Hs.6710 

AW237726 Hs.288S49 

AA340605 Hs.105887 

AI498072 Hs.351474 

AF001212 Hs.90744 

D15049 Hs.179770 

AI878842 Hs.237924 

M128808 Hs.179902 

AI924046 Hs.119567

NM_014312 Hs.112377

AK000136 Hs.10760 

AI937227 Hs.8821 . 

AI823763 Hs.7055 

AA382104 Hs.81337 

AC005954 Hs.25527 



NM_01 



(locuslink)NM_003779:Homo sapiens UDP-Ga
NM_002999:Homo sapiens syndecan 4 (amphi
(locuslink)NM_004480:Homo sapiens fucosy
Hs.180059:Homo sapiens cDNA FLJ31360 fis
Hs.148324:ESTs 

Hs.21858:serine (or cysteine) proteinase 
II W\ 6 Homo sapiens cDNAFU30577fis 
Hs.367900:programmed cell death 2 
Hs.153850:hypothetical protein C321D2.4 
H 1 >l I Ml </ Ic oc I Imcmbr 
NM_006516:Homo sapiens solute carrier fa
NM_138578:Homo sapiens BCL2-like 1 (BCL2
NM_006598:Homo sapiens solute carrier fa 
Hs.4990:KIAA1089 protein 
(locuslink)NM_016022:Homo sapiens CGI-78
Hs.32699:Homo sapiens, Similar to RIKEN
Hs.119122:ribosomal protein L13a 
NM_001829:Homo sapiens chloride channel
Hs.152328:ESTs

(locuslink)NM_003270:Homo sapiens transm
Hs.7837:phosphoprotein regulated by mito
Hs.75608:tight junction protein 2 (zona



Hs.279617:hydroxysteroid (17-beta) dehyd 
NM_005001:Homo sapiens NADH dehydrogenas 
NM_003576:Homo sapiens serine/threonine
NM_018698:Homo sapiens hypothetical prot
NM_000569:Homo sapiens Fc fragment of Ig 



R21945 Hs.346735 

AF134160 Hs.7327 

BE311856 Hs.274230 

AW843633 Hs.343261 

I AU076782 Hs.248267 

: U66615 Hs.172280

• NM_001761 Hs.1973

AI587180 Hs.110906

AV656840 Hs.285115 

L22005 Hs.76932 

AI808780 Hs.227730 

BE277024 Hs.146381

BE018682 Hs.166196 

X04588 Hs.85844 

AK000047 Hs.61960 



411531 AB014511 Hs.70604 

447032 AK000310 Hs.17138 

414249 AI797994 Hs.279929 

448440 AA173467 Hs.62402 

423184 NM_004428 Hs.1624

419452 U33635 Hs.90572 

417878 U90916 Hs.82845 



Hs.351474:Homo sapiens cDNA FLJ30002 fis
Hs.90744:proteasome (prosome, macropain)
NM_002842:Homo sapiens protein tyrosine
NM_016016:Homo sapiens CGI-69 protein (L 
(locjslink)NM 022100 Hoino sapiens CDv.92 
Hs.119567:ESTs, Weakly similar to ALU1_H
Hs.112377:cortical thymocyte receptor (X
NM_017680:Homo sapiens asporin (LRR clas
NM_ 1175 ! piens hepsidin i ici 
Hs.7055:Homo sapiens cDNA FLJ33420 fis,
Hs.81337:lectin, galactoside-binding, so
(a uJih';NM_014428:Hoit,o sapiens light 
NM_002658:Homo sapiens plasminogen activ
Hs.346735:Homo sapiens, clone IMAGE:3BB1 
NM_021101:Homo sapiens claudin 1 (CLDN1)
Hs.274230:3'-phosphoadenosine 5'-phospho
1 13261 locompatibility (mine 13 
(locuslink)NM_021126:Homo sapiens mercap
NM_003074:Homo sapiens SWI/SNF related, 
NM_001761:Homo sapiens cyclin F (CCNF), 
Hs.110906:hypothetical protein BC004501
NM_001560:Homo sapiens interferon 13 re 
NM_004359:Homo sapiens cell division cyc
NM_000210:Homo sapiens integrin, alpha 6
Hs.146381:RNA binding motif protein, X c
Hs.166196:ATPase, Class I, type 8B, memb 



NM_018992:Homo sapiens 
NM_031420:Homo sapiens mitochondrial rib
Hs.70604:ATPase, Class II, type 9A
(locuslink)NM_017755:Homo sapiens hypoth
I s nl Ni,1 317 i n > or, p CI 
Hs.62402:p21/Cdc42/Rac1-activated kinase
NM_004428:Homo sapiens ephrin-A1 (EFNA1) 
Hs.90572:PTK7 protein tyrosine kinase 7 
Hs.82845:Homo sapiens cDNA: FLJ21930 fis
431884


AA521246 


Hs.210792 


Hs.210792:Homo sapiens cDNA FLJ33591 fis


3.00 


425261 


BE385099 


Hs.355814 


H 14:Homo ne IMAGE:2933 


3.00 


445229 


BE276013 


Hs.343828 


Hs.343828:Homo sapiens mRNA; cDNA DKFZp7


3.00 


412146 


M92444 


Hs.73722 


Hs.73722:APEX nuclease (multifunctional 


2.99 




AA469369 


Hs.5831 


1- 831:1 i hibltc r> lloprot 


2.99 


406865 


AI025931 


Hs.181357 


H 181 <- I mil r epto 1 (67kD rilx 


2.99 






Hs. 159322 


(locuslink)NM_012243:Homo sapiens solute 




417259 


AW903838 


Hs.81800 


Hs.81800:chondroitin sulfate proteoglyca 


23 


453902 


BE502341 






2.98 


432396 


AW295956 




(locu tin 0 n no apit ; hy D I 


2.98 


411358 


R47479 


Hs.94761


Hs.94761:KIAA1691 protein 


2.98 


453518 


AW503205 


Hs.27268 


Hs.27268:Homo sapiens cDNA: FLJ21933 fis


2.98 




AA194422


Hs.22564 


NM_004999:Homo sapiens myosin VI (MYO6),


2.98 


440943 


AW082298 




NM_032331:Homo sapiens hypothetical prot


2.98 




AA766605 


Hs.47099


NM_024642:Homo sapiens hypothetical prot 


2.98 


452124 


AA454220 


Hs.61170 


Hs.61170:ESTs 


2.98 


416391 


AI878927 


Hs.79284 


NM_002402:Homo sapiens mesoderm specific


2.98 


419092 
426375 


AK000597 


Hs.89603 


NM_002456:Homo sapiens mucin 1, transmem


2.97 


437296 


AA350994 


Hs.20281


Hs.20281:MAPK phosphatase-7


2.97


407736 


N41744 


Hs.349326 


Hs.349326:Homo sapiens cDNA FLJ30677 fis




443303 


U67319 


Hs.9216 


NM_033340:Homo sapiens caspase 7, apopto 


2.97 


424756 


AW504657 


Hs.152931 


(locuslink)NM_002296:Homo sapiens lamin


2.97 


430354 


AA954810 


Hs.239784 


Hs.239784:scribble 


2.97 


417079 


U65590 


Hs.81134 


(locuslink)NM_000577:Homo sapiens interl


2.97 


425221 


AV649864 


Hs.155188 


NM_005642:Homo sapiens TAF7 RNA polymera 


2.97 


414186 


U33446 


Hs.75799 


Hs.75799:protease, serine, 8 (prostasin) 


2.97


432065 


AA401039 




Hs.2903:protein phosphatase 4 (formerly 


2.96 


452012 


AA307703 


Hs.279766 


(locuslink)NM_012310:Homo sapiens kinesi


2.96


412429 


AV650262 


Hs.75765 


NM_002089:Homo sapiens GRO2 oncogene (GR


2.96


452714 


AW770994 


Hs.30340 


Hs.30340:hypothetical protein KIAA1165


2.96 


429922 




Hs.226117 


NM_005318:Homo sapiens H1 histone family 


2.96


434931 


AW968941 


Hs.166254 


Hs.166254:likely ortholog of rat vacuole


2.96 


416293 


BE244454 


Hs.79162 


Hs.79162:structure specific recognition 


2.96 


428781 


AF164799 


Hs.193384 


Hs.193384:putative 28 kDa protein


2.96 


445350 


AF052112 


Hs.12540 


NM_006330:Homo sapiens lysophospholipase


2.96 


422396 


W21872 


Hs.7907 


(locuslink)NM_145059:Homo sapiens L-fuco


2.96 


429597 


NM_003816


Hs.2442


Hs.2442:a disintegrin and metalloprotein


2.95 




U72664 


Hs.148495


NM_002810:Homo sapiens proteasome (proso


2.95 


417691 


AU076610 


Hs.82399 


NM_007357:Homo sapiens component of olig


2.95 


427715 


BE245274 


Hs.180428


Hs.180428:KIAA1181 protein


2.95


430589 


AJ002744 


Hs.246315 


NM_017423:Homo sapiens UDP-N-acetyl-alph


2.95 


409220 


BE243323 


Hs.51233 


(locuslink)NM_003842:Homo sapiens tumor 


2.95 


443883 


AA114212


Hs.9930 


Hs.9930:serine (or cysteine) proteinase 


2.95 


445720 


AL040482 


Hs.286173 


Hs.286173:KIAA1595 protein


2.95 


429583 


NM_006412


Hs.209119


NM_006412:Homo sapiens 1-acylglycerol-3-


2.95 


427581 


NM_014788


Hs.179703


NM_014788:Homo sapiens tripartite motif-


2.94 


419193 


D29643 


Hs.34789 


NM_095216:Hom hyl-d 


2.94 


419152 


L12711 


Hs.89643 


(locuslink)NM_001064:Homo sapiens transk


2.94 


444824 


AA843575 


Hs.12056 


NM_001671:Homo sapiens asialoglycoprotei


2.94 




AU077025 


Hs.265827 


NM_022873:Homo sapiens interferon, alpha 


2.94 


425118 




Hs.154672


Hs.154672:methylene tetrahydrofolate deh


2.94 


422010 


AA302049 


Hs.31181 


Hs.31181:Homo sapiens cDNA: FLJ23230 fis


2.93 


436075 


BE090176 


Hs.179902 


NM_080546:Homo sapiens CDw92 antigen (CD 


2.93 


412338 


AA151527 


Hs.69485 


(locuslink)NM_024661:Homo sapiens hypoth


2.93 


413073 


AL038165 


Hs.75187 


NM_014765:Homo sapiens translocase of ou 


2.93 


412088 


AI689496 


Hs.108932 


Hs.108932:ESTs


2.93 


447140 


AF070537 


Hs.17481 


NM_138391:Homo sapiens hypothetical prot 


2.92 


426746 


J03626 


Hs.2057 


NM_000373:Homo sapiens uridine monophosp


2.92 


424291 


AL120051


Hs.144700 


NM_004429:Homo sapiens ephrin-B1 (EFNB1) 


2.92 


417944 


AU077196 


Hs.82985 


NM_000393:Homo sapiens collagen, type V,


2.92 


428343 


AL043021 


Hs.12705


lot link)MM 145294:Homo3af s n I 


2.92 


435640 


AF220053 


Hs.54960 


NM_018468:Homo sapiens uncharacterized h 


2.92 


451608 


AA384525 


Hs.26745 


NM_016499:Homo sapiens HSPC244 (MGC1337 


2.92 


434608 


AA805443 


Hs.179909


NM_024831:Homo sapiens nuclear receptor


2.92 


437186 


AA338305 


Hs.377816 


Hs.377816:Homo sapiens cDNA FLJ36808 fis




429574 


BE268321 


Hs.208912 


Hs.208912:hypothetical protein MGC861 


2.91


438549 


BE386801 


Hs.21858 


Hs.21858:serine (or cysteine) proteinase 


2.91 


440246 


W52010 




Hs.191379:ESTs 






BE222542 


Hs.128782


Hs.128782:Homo sapiens cDNA FLJ31512 fis


2.91 


444193 


Y17801 


Hs.10574 


Hs.10574:solute carrier family 2, (facil 


2.91 








(locuslink)NM_004451:Homo sapiens estrog




415938 


BE383507 


Hs.78921


NM_003488:Homo sapiens A kinase (PRKA) a


2.91


450167 


AA446404 


Hs.24563 


NM_013248:Homo sapiens NTF2-like export


2.91 


408815 


AW957974 


Hs.25485 


(locu k tJM_0?4599 Homosapfens hypoth 


2.91 




AA371931 




Hs.77422:proteolipid protein 2 (colonic 


2.91 


410013 


AF067173 


Hs.57904


Hs.57904:mago-nashi homolog, proliferati 


2.91 


444823 


BE262989 


Hs.12045 


Hs.12045:C2f protein 


2.91


422197 






Hs.111632:Lsm3 protein 


2.90 


432710 
405203 


AA609685 


Hs.278672


NM_005898:Homo sapiens membrane componen 


2.90 


432465 


D56165 


Hs.275163 




2.90 
2.90 


412926 


AI879076 


Hs.75061 


Hs.75061:macrophage myristoylated alanin


2.90 



L12535 Hs.75551 

D14697 Hs.77393 

NM_007274 Hs.8679 

AW979153 Hs.336881 

AW820035 Hs.278679 

NM_003028 Hs.244542 

AW450485 Hs.4437 

AA810141 Hs.192182 

NM_001982 Hs.199067 

AJ271671 Hs.7864 

AW732276 Hs.95583 

BE394784 Hs.78596 

AA532963 Hs.9100 

AW630656 Hs.83383 

BE397032 Hs.14468 

AW084791 Hs.133122 

M16038 Hs.80887 

BE281591 Hs.106768 

NM_005738 Hs.10706

BE387193 Hs.67896 

R08950 Hs.272044 

R52518 Hs.7967 

AJ224538 Hs.50732 

AW513684 Hs.7218 

AK001777 Hs.258551 

AW838306 Hs.129819 

BE382657 Hs.21486 

AK000461 Hs.26890 

U82756 Hs.374973 

AA114050 Hs.211610 

BE267994 Hs.102419 

BE262943 Hs.9234 

L13210 Hs.79339 

R63438 Hs.183454 

R26944 Hs.180777

D80001 Hs.152629 

X87241 Hs.166994 

AW296454 Hs.24743 
Z47055 

AA005169 Hs.188489

AW674193 Hs.227152 

H16423 Hs.82685 

AA084798 Hs.82646 

AI815206 Hs.100293 

AF161383 Hs.284207 

AA421404 Hs.346868 

AW954324 Hs.75790 

AA304961 Hs.699 

H05430 Hs.288433 

BE001596 Hs.85266 

BE277633 Hs.372542 

AI708347 Hs.184014

BE545277 Hs.340959 

M164687 Hs.177576 

AB032983 Hs.21894 

AF191018 Hs.279923 

i AK001735 Hs.22983 

; AI751438 Hs.41271 

i AA852504 Hs.125359 

i AB023179 Hs.9059 

AF052693 Hs.198249

I AA280722 Hs.24758 

' BE243982 Hs.83532 
I AI523875 

\ T50872 Hs.2001 

I AU076635 Hs.1478 

BE560672 Hs.13543

I AW501735 Hs.180059

AK000568 Hs.43654
Hs.11899
Hs.127007
Hs.31181
Hs.324507
Hs.155965



427640 AF058293 Hs.180015

433233 AB040927 Hs.301804 

415697 AI365603 Hs.279696 

441321 H17182 Hs.7771 

430040 AW503115 Hs.227823 

449954 AA641636 Hs.37477 



(locuslink)NM_012425:Homo sapiens Rs 



417018 
429404 



409132 
440490 
431498 



- 457274 
417831 
417824 



418462 
442199 
406710 



444758 AL044878 

423323 AI951628 

439720 AI935202 

435550 AI224456 

425907 AA365752 



(locuslink)NM_007274:Homo sapiens cytoso
Hs.336881:ESTs

NM_033274:Homo sapiens a disintegrin and
Hs.244542:Homo sapiens cDNA FLJ38908 fis
NM_000991:Homo sapiens ribosomal protein



NM_001982:Homo sapiens v-erb-b2 erythrob
NM_014437:Homo sapiens solute carrier fa
NM_012339:Homo sapiens transmembrane 4 s
NM_002797:Homo sapiens proteasome (proso
Hs.9100:hypothetical gene supported by A
NM_006406:Homo sapiens peroxiredoxin 4 ( 
NM_020230:Homo sapiens peter pan homolog 
1 I' X rypotl calpr ;;i J145 
Hs.80887:v-yes-1 Yamaguchi sarcoma viral
NM_018120:Homo sapiens hypothetical prot
NM_005738:Homo sapiens ADP-ribosylation 
(locuslink)NM_007343:Homo sapiens opioid 
H ' 04-1 _ 1 Veakly simitar to hypolh 
Hs.7967:ESTs, Weakly similar to extensin 
NM_005399:Homo sapiens protein kinase, A 
Hs.7218:acetyl-Coenzyme A synthetase 2 ( 
NM_012100:Homo sapiens aspartyl aminopep
NM_018344:Homo piens hy rl , I pro 
Hs.21486:signal transducer and activator
NM_017829:Homo sapiens cat eye syndrome
(locuslink)NM_004697:Homo sapiens PRP4 p
NM_001228:Homo sapiens caspase 8, apopto 
Hs.102419:zinc finger protein 
NM_032635:Homo sapiens seven transmembra 
NM_005567:Homo sapiens lectin, galactosi 
Hs.183454:Homo sapiens cDNA FLJ14883 fis
Hs.180777:Homo sapiens mRNA; cDNA DKFZp5
Hs.152629:KIAA0179 protein 
Hs.166994:FAT tumor suppressor homolog 1 
Hs.24743:hypothetical protein FLJ20171

Hs.188489:ESTs 

NM_016391:Homo sapiens hypothetical prot 
Hs.82685:CD47 antigen (Rh-related antige
NM_006145:Homo sapiens DnaJ (Hsp40) homo 
Hs.100293:O-linked N-acetylglucosamine (
Hs.284207:hypothetical protein BC003515 
NM_006824:Homo sapiens EBNA1 binding pro 
(locuslink)NM_002642:Homo sapiens phosph 
Hs.699:peptidylprolyl isomerase B (cyclo
NM_016522:Homo sapiens neurotrimin (HNT)
Hs.85266:integrin, beta 4
NM_004879:Homo sapiens i
Hs.184014:ribosomal protein L31 
NM_005726:Homo sapiens Ts translation el
Hs.177576:mannosyl(alpha-1,3-)-glycopro 
Hs.21894:KIAA1157 protein



NM_006288:Homo sapiens Thy-1 cell surfac 
Hs.9059:KIAA0562 protein
NM_005268:Homo sapiens gap junction prot
Hs.24758:Homo sapiens cDNA FLJ32068 fis,



R45782:Ha616-f Adult heart, Cbnlech Horn 
NM_001061:Homo sapiens thromboxane A syn 
NM_000185:Homo sapiens serine (or cystei
locu tJM_145214:Homo sapiens tripar 
Hs.180059:Homo sapiens cDNA FLJ31360 fis
NM_017882:Homo sapiens ceroid-lipofuscin
NM_000859:Homo sapiens 3-hydroxy-3-methy 
NM_003740:Homo sapiens potassium channel
Hs.31181:Homo sapiens cDNA: FLJ23230 fis
Hs.324507:hypothetical protein FLJ20986
Hs.155965:ESTs 

Hs.168159:bifunctional apoptosis regulat

NM_00135E H api( i s D-do[ achrorne i M 

Hs.301804:KIAA1494 protein 
Hs.279696:DKFZP566H024 protein 
NM_00 1 273:Homo sapiens repressor of estr 
'J" 014287:1 sap r F'"V ir I 
Hs.37477:ESTs, Weakly similar to T46220 
427022
410047 
400845 
419501 
418140 






AW843822 Hs.199961 

BE613836 Hs.83551 

AA476966 Hs.110857

T72104 Hs.93194 

AF201931 Hs.5268 

AA298741 Hs.102135 

BE018138 Hs.24447 

AW300887 Hs.26638 



Hs.2280 



423880 BE278111 



457329 
426437 
412627 
452695 
409531 



424867 
410600 
451452 



425978 
422753 
449051 

412890 
415752 
427609 



418478 
428157 
416178 
435025 
421917 
406621 
408196 
412600 



AA320160 Hs.171811 
AW748265 Hs.5741 

Hs.118964 



BE391959 Hs.74276 

AW780199 Hs.30327 

BE384319 Hs.54702 

Y09763 Hs.22785 

AF090922 Hs.152738

NM_015977 Hs.285681

AI373162 Hs.75103

AI024860 Hs.153591

AW575742 Hs.351676 

BE560065 Hs.26433 

AW073716 Hs.8037 

AA593867 Hs.300842 

AW470411 Hs.288433 

AA527348 Hs.288967 

AF019225 Hs.114309

AK000476 Hs.75798 
AW516565 

NM_002496 Hs.90443

NM_016929 Hs.283021

AW379030 Hs.41324 

AW875443 Hs.22209 

H61001 Hs.171802 

H54520 Hs.351327 

AA379770 Hs.82890 

AW327546 Hs.111024

AL050373 Hs.25213 

AA773820 Hs.63970 

AI376750 Hs.57600 

BE206939 Hs.42287 

H59354 Hs.374303 

AW451469 Hs.209990 

AA305800 Hs.5672 

AI565004 Hs.374415 

H52280 Hs.18612

BE621800 Hs.29444 

BE253927 Hs.24983

AI928995 Hs.1575

AW961400 Hs.333526 

H39960 Hs.288467 

T85247 Hs.351875 

BE314524 Hs.78776 

AK000436 Hs.179791

AA019924 Hs.28803 

AA013051 Hs.91417 

AF189062 Hs.285976 

BE242241 Hs.82542 

AA224760 Hs.1 53 

BE545684 Hs.343566 

AI333660 Hs.17558

AV652402 Hs.72901 

U38945 Hs.1174

AI738719 Hs.198427 

AI808527 Hs.192822

T08990 Hs.4742 

AB028943 Hs.109445

X57809 Hs.181125

AL034548 Hs.43627 

L28824 Hs.74101 

AW014549 Hs.58373 

AA748418 Hs.33368 

BE281170 Hs.106357 



Hs.199961:ESTs, We
(locuslink)NM_002403:Homo sapiens microf
NM_016310:Homo sapiens polymerase (RNA) 
Hs.93194:apolipoprotein A-I
Hs 52 it -mc finger, DHHC domain contain 
NM )06 Horn piei gnal secuencei 
(locuslink)NM_005866:Homo sapiens type I
NM_031457:Homo sapiens membrane-spanning 
(locuslink)NM_025233:Homo sapiens nucleo 
NM_002950:Homo sapiens ribophorin I (RPN
Hs.91202:Homo sapiens cDNA FLJ25946 fis,
Hs.61289:synaptojanin 2
Hs.134200:DKFZP564C186 protein
Hs.111497:neuronal protein 17.3
NM_001625:Homo sapiens adenylate kinase 
NM_016230:Homo sapiens flavohemoprotein 
NM_017660:Homo sapiens hypothetical prot 
(locuslink)NM_016442:Homo sapiens type 1 
Hs.169895:ubiquitin-conjugating enzyme E
Hs.74276:chlorideint 
NM_003668:Homo sapiens mto 
(loaislink)NM_007255:Hcmo sai 
NM_021987:Homosa t ' 
NM_016050:Homo sapiens mitochondrial rib



NM_005787:Homo sapiens Not56 (D. melanog
Hs.351676:Homo sapiens cDNA FLJ25921 fis
NM_001382:Homo sapiens dolichyl-phosphat 
(to Mr 1 M ) 23:Ho a| let 
NM_024820:Homo sapiens KIAA1608 protein
NM_016522:Homo sapiens neurotrimin (HNT)
Hs.288967:Homo sapiens, similar to RIKEN
' > jslinkJNM > 66 Hon a F fr i 
NM_016470:Homo sapiens chromosome 20 ope
AA229762:nc49l01.r1 NCI_CGAP_Pr3 Homo sa 
NM_002496:Homo sapiens NADH dehydrogenas 
NM_016929:Homo sapiens chloride intracel
Hs.41324:ESTs 

H 2209 secreted modular calclum-bindin 
Hs.171802:Homo sapiens, clone IMAGE:3956
(locuslink)NM_017828:Homo sapiens hypoth 
Hs.82890:defender against cell death 1 
(locuslink)NM_005984:Homo sapiens solute
NM_015677:Homo sapiens hypothetical prot
Hs.63970:ESTs 

NM_001283:Homo sapiens adaptor-related p 
NM_001952:Homo sapiens E2F transcription 
(locuslink)NM_144691:Homo sapiens hypoth
Hs.209990:ESTs 

(locuslink)NM_030799:Homo sapiens golgi 



...3l protein from EUROS 
Hs.1575:small nuclear ribonucleoprotein 
NM_032339:Homo sapiens hypothetical prot 
Hs.288467:Homo sapiens cDNA FLJ12280 fis 
NM_004374:Homo sapiens cytochrome c oxid 
NM_012342:Homo sapiens putative transmem 
NM_017817:Homo sapiens RAB20, member RAS 



(locuslink)NM_007027:Homo sapiens topois
(locuslink)NM_013384:Homo sapiens LAG1 l
NM_001637:Homo sapiens acyloxyacyl hydro
NM_000971:Homo sapiens ribosomal protein
Hs.343566:KIAA0251 protein 
Hs.17558:Homo sapiens, clone IMAGE:40704
NM_078487:Homo sapiens cyclin-dependent
NM_000077:Homo sapiens cyclin-dependent
NM_000189:Homo sapiens hexokinase 2 (HK2 

M 3949:1 i , r phosph 
Hs.4742:GPAA1P anchor attachment protein
Hs.109445:hypermethylated in cancer 2 
Hs.181125:immunoglobulin lambda locus
NM_006943:Homo sapiens SRY (sex determin
Hs.74101:spleen tyrosine kinase 
AW953575




Hs.303125:p53-induced protein PIGPC1






AF123303




13386 m | ypothetica! p 




422691 


BQ87593 5 


Hs.119251


t 1 ir ic tochr 


2.74 


440457 




Hs.21321 


(o k)NM..1458( r e r I 


2.74 




BE018517 


Hs.119140






428428 




Hs.184298 








BE247600 


Hs.377968


MHC02040ftHomo sapiens G protein-coupled 


2.73


428297 


AA236291 


Hs.183583


NM_030666:Homo sapiens serine (or cystei 


2.73 
















































































































































AL137723




Hs.5855:Homo sapiens mRNA; cDNA DKFZp434 
















AI878907 








































W42913 


Hs.78089










Hs.2280 
















































AF052692 






































































M93718 


Hs.166373








AW247252 


Hs.75514 


NM_000270:Homo sapiens nucleoside phosph






AI064821 


Hs.129953








AF284422 


Hs.119178








NM 00273; 


Hs.3136








M307211 


Hs.251531 








AA454501 


Hs.43666 










Hs.79768 








H59696 


Hs.18747








AF040704 


Hs.149443 








BE536554 


Hs.278270 










Hs.169476








NMJM533 Hs.2730 








AA458542 


Hs.10326 








X80695 


Hs.151134 








AA224827 




AA224827:nc32g04.s1 NCI_CGAP_Pr2 Homo sa






AW769844 


Hs.111222








D31365 


Hs.24220 








U37689 


Hs.3128 








AL080080 


Hs.24766 








BE299343 


Hs.2430 








BE336699 


Hs.185055 








AB037848 


Hs.12365 








NMJM2314 Hs.36566 


NM_016735:Homo sapiens LIM domain kinase 




418883 


BE387036 


Hs.1211 


NM_001611:Homo sapiens acid phosphatase


2.68 


420957 


X98743 


Hs.100555 


Hs.100555:DEAD/H (Asp-Glu-Ala-Asp/His) b


2.68 


418187 


NM_004604 Hs.83734 


NM_004604:Homo sapiens syntaxin 4A (plac 


2.68 


409533 


AW969543 


Hs.144609 


NM_080652:Homo sapiens similar to RIKEN


2.68 


433184 


AA147979 


Hs.285005 


NM_020243:Homo sapiens translocase of ou


2.68 


455303 


AW892049 




BE066891:PM3-BT0338-211299-002-e12BT033 


2.68 


452600 


AI910842 


Hs.103381 


Hs.103381:ESTs, Weakly similar to hypoth


2.68


415410 


AF037332 


Hs.278569 


NM_014748:Homo sapiens KIAA0064 gene pro


2.67


426432 


AF001601 


Hs.169857 


NM_000305:Homo sapiens paraoxonase 2 (PO


2.67 


435049 


AL122067 


Hs.4746 


NM_021941:Homo sapiens hypothetical prot


2.67


450528 


NM_014072 Hs.25063 


NM_031268:Homo sapiens PRO0461 protein ( 


2.67 


433339 


AF019226 


Hs.8036 


NM_004283:Homo sapiens RAB3D, member RAS 


2.67 


408783 


AF192522 


Hs.47701 


NM_013389:Homo sapiens NPC1 (Niemann-Pic 


2.67 



451798 BE297567 Hs.27047

427716 L38951 Hs.180446 

436319 H90727 Hs.5123 

415116 M160363 Hs.269956 



U68140 Hs.58927 

AI929659 Hs.237825 

M830797 Hs.184750 

J03060 Hs.247551 

W32474 Hs.301746 

AK001333 Hs.6216 

AI686379 Hs.110796

L76528 Hs.3260

NM_014521 Hs.17667

D87432 Hs.10315 



447191 
444099 
417821 
418529 






AW005695 Hs.250897 

AW138330 Hs.17558 

AK001714 Hs.95744 

BE620274 Hs.43112 

N66148 Hs.11125 

X56777 Hs.273790 

AA293303 Hs.356342 

AF038962 Hs.7381 

H61082 Hs.14743 

BE048146 Hs.75671 

AW513963 Hs.39143 

AF113008 Hs.8102 

AW972990 Hs.196270 

AW150807 Hs.356262 

AI557212 Hs.17132 

AF282874 Hs.21201 

BE178050 Hs.171271 

N77630 Hs.13895 
H96850 

U66839 Hs.180533 

AA857991 Hs.123106 

X57348 Hs.184510 

AK000429 Hs.26570 

BE244334 Hs.75249 

BE249915 Hs.293533 

BE259190 Hs.289721 

U82987 Hs.87246 

BE379794 Hs.159651

AI146327 Hs.334802

BE563042 Hs.118820

AA521091 Hs.178499 

M564512 Hs.372775 

450377 AB033091 Hs.355925 

434633 AI189587 Hs.120915 

424707 BE061914 Hs.10844 

427600 AW630918 Hs.179774 

446522 NM_003876 Hs.15196
Hs.181244 



440708 
416526 
413995 



426216 
407223 
427725 



410668 
436183 
441226 



410701 
410182 
406716 
430308 
431074 
412867 
440524 
435968 
422672 
447528 
406774 



Hs.27047:hypothetical protein FLJ20392
Hs.180446:karyopherin (importin) beta 1
Hs.5123:hypothetical protein BC008246 
Hs.269956:ESTs 

NM_014071:Homo sapiens nuclear receptor 
NM_006831:Homo sapiens ATP/GTP-binding p 
NM_013258:Homo sapiens apoptosis-associa
ocuslii t! 0 5 to apter nude 
Hs.237825:signal recognition particle 72 
NM_005760:Homo sapiens CCAAT-box-binding 
NM_002455:Homo sapiens metaxin 1 (MTX1), 
Hs.301746:Homo sapiens cDNA FLJ37267 fis
Hs.6216:DnaJ (Hsp40) homolog, subfamily 
I M_020150:H tens SARI 

NM 000021:H a . r r pr I 1 
(locuslink)NM_014521:Homo sapiens SH3-do
NM_003983:Homo sapiens solute carrier fa 



Hs.250897:TRK-fuse 
Hs.17558:Homo sapiens, clone IMAGE:40704
NM_019028:Homo sapiens hypothetical prot
Hs.43112:Homo sapiens mRNA; cDNA DKFZp43
(locuslink)NM_014041:Homo sapiens signal 
NM_007155:Homo sapiens zona pellucida gl 
Hs.356342:ESTs, Highly 6 fir.Bor to 21 1 320 
Hs.7381:voltage-dependent anion channel 
Hs.14743:ESTs 

NM_004603:Homo sapiens syntaxin 1A (brai 
Hs.39143:hypothetical protein MGC13125 
NM_001023:Homo sapiens ribosomal protein 
NM_030780:Homo sapiens folate transporte
Hs.356262:ESTs, Highly similar to A31233



NM_015480:Homo sapiens ne 

M_001904:Horr c Jherin 

Hs.13895:Homo sapiens cDNA FLJ11654 fis,
H96850:yw03b12.s1 Soares melanocyte 2NbH
NM_002756:Homo sapiens mitogen-activated



NM_017814:Homo sapiens hypothetical prot 
Hs.75249:ADP-ribosylation factor-like 6 
Hs.293533:Homo sapiens cDNA FLJ37093 fis
Hs.289721:growth arrest-specific 5 
NM_014417:Homo sapiens BCL2 binding comp
NM_014452:Homo sapiens tumor necrosis fa
(locuslink)NM_024718:Homo sapiens hypoth
Hs.1 16820:hypothetical protein BC307882 



Hs.372776:Homo sapiens, clone IMAGE:3946 



AF198620 Hs.10_. - 

NM_001983 HS5S544 

AW148546 Hs.169476 

BE540865 Hs.238990 

BE072772 Hs.8997 

AU076861 Hs.74637 

R71264 Hs.16798 

AW161481 Hs.111577

X12784 Hs.119129

AI612027 Hs.76277 

AW518383 Hs.177592 

AW748482 Hs.77873 

W86610 Hs.185736 

AW578252 Hs.190161 

BE540516 Hs.378825 

X63692 Hs.77462 

NM_001545 Hs.9078 

AW859679 Hs.21902 



407687 AK002011 Hs.37558 
442232 AI357813 Hs.337460 
419625 U91616 Hs.182885 



408452 AA054683 Hs.222728 
443142 AI696513 Hs.108705 
426152 BE299190 Hs.167246 



Hs.120915:ESTs
Hs.10844:leucine-rich alpha-2-glycoprote
Hs.179774:proteasome (prosome, macropain
NM_003876:Homo sapiens putative receptor 
Hs.181244:major histocompatibility compl
NM_005105:Homo sapiens RNA binding motif
NM_001983:Homo sapiens excision repair c 
Hs.169476:glyceraldehyde-3-phosphate deh
NM_004064:Homo sapiens cyclin-dependent 
Hs.8997:Sad1 unc-84 domain protein 1 
Hs.74637:testis enhanced gene transcript 
Hs.16798:Homo sapiens mRNA; cDNA DKFZp56 
(locuslink)NM_030926:Homo sapiens integr
NM_001845:Homo sapiens collagen, type IV 
NM_138393:Homo sapiens hypothetical prot
Hs.177592:ribosomal protein, large, P1
Hs.77873:B7 homolog 3 
Hs.185736:ESTs 
Hs.190161:LR8 protein 
Hs.378825:Homo sapiens cDNA FLJ37850 fis
(locuslink)NM_001379:Homo sapiens DNA (c



Hs.21902:Homo sapiens clone 25237 mRNA s 
Hs.166486:Homo sapiens cDNA FLJ11432 fis
\U 0183c Homo sapiens lypotha'.ical prol 
Hs.337460:ESTs, Highly similar to HYEP_H 
NM_004556:Homo sapiens nuclear factor of 
Hs.183868:glucuronidase, beta



Hs.222728:Homo sapiens cDNA FLJ39004 fis
Hs.108705:protein phosphatase 2 (formerl
Hs.167246:P450 (cytochrome) oxi
419667 AU077005 Hs.92208

415072 BE253687 Hs.77876 

406670 W79632 Hs.256301 
403399 

419579 W49529 Hs.296200 

437202 AA326110 Hs.374481 

414020 NM_002984 Hs.75703 

421295 AW081061 Hs.103180 

446488 AB037782 Hs.15119 

442504 BE503373 Hs.334335 

448204 AI475124 Hs.170561 

449175 AJ005892 Hs.23170 



424805 
425421 
422739 






AF230904 Hs.153260 

L11669 Hs.157145 

H20106 Hs.119591

C18458 Hs.25597 

BE148235 Hs.193063 

H52761 Hs.44095 

AW263124 Hs.350547 

AA345547 Hs.53263 

AA224483 Hs.27239 

BE262310 Hs.75782 

BE295918 Hs.112193

AA322859 Hs.284275 

AW293542 Hs.75309 

BE515018 Hs.107014 

BE275469 Hs.66493 

AF132959 Hs.7236 

M579668 Hs.41072 

BE244879 Hs.155939

: U46351 Hs.621 

: AI192838 Hs.173135 

! BE301909 Hs.279952 

i BE514743 Hs.379039 

W79410 Hs.82887 

! AK000860 Hs.6191 

i AA505112 Hs.282990 



416435 AI431301 

415444 BE247295 

452222 AW806287 
400541 

444309 U83236 

416116 H51847 

418625 BE247550

432996 AF105025 

426781 AL048967 

452636 BE615074 

406851 M609784 

447674 BE270640 

445647 AV654627 

444736 AA533491 



Hs.279901 
Hs.172207 
Hs.145279 
Hs.352392 
Hs.19192 
Hs.271808 
Hs.23317 



NM_003815:Homo sapiens a disintegrin and 
lls.7787! hypothet I gen 1.1 J 35 i 
Hs.256301:hypothetical protein MGC13170



s hypothetical prat 
Hs.374481:ESTs, Weakly similar to T34549
NM_002984:Homo sapiens small inducible c
Hs.103180:DC2 protein
Hs.15119:KIAA1361 protein 
NM_022484:Homo sapiens hypothetical prot 
Hs.170561:ESTs

(locuslink)NM_012280:Homo sapiens FtsJ h
Hs.8509:ESTs, Weakly similar to C3HU com 
NM_031892:Homo sapiens SH3-domain kinase 
Hs.157145:tetracycline transporter-like
(locuslink)NM_004069:Homo sapiens adapto
Hs.25597:elongation of very long chain f 
Hs.193053:Homo sapiens cDNA FLJ14201 fis
Hs.44095:cyclin M3 

NM_024665:Homo sapiens nuclear receptor 
(locuslink)NM_024647:Homo sapiens nucleo 
Hs.27239:zinc finger, DHHC domain contai
NM_001521:Homo sapiens general transcrip
(locuslink)NM_025259:Homo sapiens chromo 
Hs.284275:p21 (CDKN1A)-activated kinase
Hs.75309:eukaryotic translation elongati
NM_016641:Homo sapiens membrane interact 
NM_016430:Homo sapiens Down syndrome crt 
NM_015953:Homo sapiens eNOS interacting 
(locuslink)NM_004563:Homo sapiens serine 
NM_005541:Homo sapiens inositol polyphos 
Hs.621:lectin, galaclosiaa binding, sob 
Hs.1 73135:dual-specificity lyrosine-(Y)- 
NM_015917:Homo sapiens glutatiiione S-tra 
NM_005851:Homo sapiens tumor suppressor 
(taislink)NM_021959:Homo sapiens protei 
NM_020441:Homo sapiens coranin, adin-bi 
NM_033550:Homo sapiens chromoscme 20 ope 
Hs.2085S8:ESTs 

Hs.374897:Ho;ra sapiens cDNA FU36874 fis 
Hs.78452:solute carrier family 20 (phosp 
Hs.21432:SEXgene 

Hs.10803:calcium and integrin binding 1 
Hs.99858:ribosomal protein L7a 
(locuslink)NM_005310:Homo sapiens growth 
Hs.279901 :PTD009 protein 
(locuslink)NM_007363:Homo sapiens non-PO 
Hs.145279:SET translocation (myeloid leu 
Hs.352392:major histocompstibility comp! 
NM_001798:Hcmo sapiens cyclin-dependent 
Hs.271808:Homo sapiens cDNA FU38018 fis 
NM_032824:Homo sapiens hypothetical prot 



414029 BE297731 Hs.75709 

427700 AA262294 Hs.1 80383 

449961 AW265634 Hs.133100 

449378 AW664026 Hs.59892 

442599 AF078037 Hs.324051 

448633 AA311426 Hs.21635 

416078 AL034349 Hs.79005 

428044 AA093322 Hs.301404 

451564 AU076698 Hs.132760 

457601 AF041429 Hs.284265 



innose-6-phospha 
NM_001946:Homo sapiens dual specificity 
Hs.133100:ESTs 

^ i _ i_ 1 V I mil II 
(locuslink)NM_006663:Homo sapiens RelA-a 
NM_001070:Homo sapiens tubulin, gamma 1 
NM_002844:Homo sapiens protein tyrosine 
NM_006743;Homo sapiens RNA binding motif 
(to it NM 1014 on i- i r 
locuslinl M'4_11M 9:Homosaf mila 
I 8633:Homo picn cDN -LJ2 tSfe 
NM_000807:Homo sapiens gamma-aminobutyrl 
Hs.283655'.lysophosphollpase II 



421541 
434848 
424488 



AK000413 Hs.149227 

D78850 Hs.250465 

I AF052155 Hs.227949 

I AW753112 Hs.15514 

i AH 51 353 Hs.29742 

I NM_016361 Hs.15871 

i AA399671 Hs.71969 

i AW009480 Hs.943 

i BE258948 Hs.290874 



Hs.205736:KIAA1978 protein 
Hs.105584:ribosomal protein S6 kinase, 9 
NM_018445:Homo sapiens AD-015 protein (L 
(locuslink)NM_017806:Homo sapiens hypolh 
Hs.250465:Homo sapiens mRNA; cDNA DKFZp4 
NM_030673:Homo sapiens SEC13-like 1 (S. . 
Hs. 15514:hypothetical protein MGC3260 
Hs.29742:Homo sapiens cDNA FU32147 fis, 
NM_01 6361:Homo sapiens LPAP for lysophos 
Hs.71969:Homo sapiens mRNA; cDNA DKFZp66 
Hs.943:natural killer cell transcript 4 
Hs.290874:Homo sapiens, clone MGC31984 
417785 X59812 Hs.82568

443358 H65417 Hs.17757 

452349 AB028944 Hs.29189 

427721 AI582843 Hs.1 80455 

407559 AA313352 Hs.280858 

413426 U88837 Hs.75354 

425465 L18964 Hs.1904 

444152 AI125694 Hs.149305 

451820 AW058357 Hs.1 99248 

441356 BE384361 Hs.182885 

444410 BE387360 Hs.33719 

415200 AL040328 Hs.78202 
403955 

430361 AI033965 Hs.239926 

432401 NM_013330 Hs.274479 

446719 W39500 Hs.301872 

439941 AI392640 Hs.18272 

436685 W28661 Hs.5288 

424522 AL134847 Hs.149957 

442904 AW575008 Hs.1 1355 

422605 H16646 Hs.1 18666 

442069 AW664144 Hs.297007 

447362 AW176120 Hs.9D61 

416305 AU076628 Hs.79187 

422624 BE616678 Hs.76152 

447298 BE617527 Hs.239818 

412833 AW960547 Hs.298262 
404854 

415761 M132666 Hs.78802 

431104 AW970859 Hs.313503 

439180 AI393742 Hs.199067 

424250 AF073310 Hs.143648 

452878 AW081128 Hs.246374 

415742 BE410243 Hs.78769 
404140 

407255 M012992 Hs.256301 

422509 AA258513 Hs.1 17865 

434866 AW002565 Hs.355460 

429743 AA804398 Hs.288995 

433047 M86135 Hs.279946 
418945 BE246762 Hs.89499 
445926 AF054284 Hs.334826 
411353 BE383533 Hs.279784 
448252 BE622791 Hs.12199 
447365 BE383676 Hs.334 
414844 AA296874 Hs.77494 
444025 AA578364 Hs.349093 
416149 AA311965 Hs.79058 
418741 H83265 Hs.8881 
437952 D63209 Hs.5944 
445625 BE246743 Hs.353181 
431565 AF161470 Hs.260622 
410179 W27723 Hs.59498 
431476 BE612705 Hs.256697 
406672 M26041 Hs.198253 
418180 BE618087 Hs.83724 
428248 AI126772 Hs.40479 
419935 AB020980 Hs.93832 
446143 BE245342 Hs.30607S 
426691 NMJM6201 Hs.171834 
408124 U89337 Hs.42853 
456266 L29073 Hs.198726 
428921 Z43809 Hs.194638 
ls.77091 



'ome P450, 
nsplecks 



A( 



Hs.77515 



422607 Z45471 

421846 AA017707 

414874 D26351 

432956 AL037895 Hs.279861 

438393 M351815 Hs.50740 

418360 AW296974 Hs.84264 
401061 

426559 AB001914 Hs.170414 

412204 AI125507 Hs.24937 
Hs.9275 



NM_000784:Homo sapiens cytoch 
,I.Hii.-iiHIJM_021622:Homosapit 
Hs.29189:ATPase, Class VI, type 
NM 005053:Homo sapiens RAD2I 
I 30 8hc.no sapiens cDNAFU32370fis 
Hs.75354:GCN1 general control of amlno-a 
Hs.1904:protein kinase C, iota 
Hs.149305:liypolhetical protein MGC2603 
NM 000958:Homo sapiens prostaglandin E r 
(locuslink)NM_004556:Homo sapiens nuclea 
Hs.33719:Homo sapiens, similar to data s 
NM_003072: Homo sapiens SWI/SNF related, 



Hs.239926:slerot-C4-methyl oxi 
NM_013330:Homo sapiens NME7 (NME7), mRNA 
H ■ 1 1 27^ hypothetical protein MGC4840 
NM_030674;Homo sapiens solute earner fa 
Hs.5288:Homo sapiens mRNA; cDNA DKFZp434 
Hs.1499S7:ribosomal protein S6 kinase, 9 
Hs.11355:thymopcietin 
Hs.1 18666:hypotheHcal protein PP591 
Hs.297007:Homo sapiens cDNA FLJ32174 fis 
NM_024099:Homo sapiens hypothetical prot 
NM_001338:Homc sapiens coxsackie virus a 



(locuslink)rJM_002093:Homo sapiens glycog 
Hs.313503:ESTs 

Hs.199067:v-erb-b2 erythroblastic leukem 
NMJ03749:lcrr,osof ' "-" 



Hs.256301 -.hypothetical protein MGC13170 
Hs.1 1786S:so!ute carrier family 1 7 (anio 
Hs.355460:Homo sapiens cDNA: FU21763 f 
OocusJink)NW_017961:Horr,o sapiens hypoth 
NM 004990:Homo sapiens methionine-tRNA s 
Hs.89499:arachidonate lipoxygenase 
NM_012433:Homo sapiens splicing factor 3 
Hs.279784:prolactin regulatory elementb 
NM_030577:Horno sapiens hypothefcal prot 
(locuslink)NM_005435:Horno sapiens Rho gu 
NM_080916:Homo sapiens deoxyguanosine ki 
NM_015945:Homo capionc ovarian cancer ov 
NM_003168:Homo sapiens suppressor of Ty 
Hs.8881:Homo sapiens cDNA FU32163fs, 
NM_014585:Homosapien soluli c m'erl 
(locuslink)NM_025092:Homo sapiens hypoth 
Hs.260622:butyrate-lnduced transcript 1 
(loeuslink)NM_003718:Homo sapiens cell d 
(locuslink)NMJ05340:Homo sapiens histid 
(locusllnk)NM_002122:Homo sapiens major 
Hs.83724:hypothetical protein MGC5466 
Hs.40479:Homo sapiens cDNA FU25802 tls, 
Hs.93832:putative membrane protein 
NM 013336:Homo sapiens protein transport 
(locuslink)NrvL006201:Homo sapiens PCTAIR 
NM.004381 :Homo sapiens cAMP responsive e 
NM 003651:Homo sapiens cold shock domain 
I ls.1946: btpolymerase (RNA) II (DNA dlrec 
NM_006730:Homo sapiens deoxyribonuclease 
t'M_ )0 923 Homo sapiens stromal cell-deri 
NM_002743:Homo sapiens protein kinase C 
NM_002224-.Homo sapiens inositol 1,4,5-tr 
NM_01 5959:Homo sapiens CGI-31 protein (L 
Hs.50740:Homo sapiens mRNA; cDNA DKFZp76 
NM_006401:Homo sapiens acidic (leucine-r 

Hs.170414:paired basic amino acid cleavi 
Hs.24937:lransformer-2 alpha (hlra-2 alp 
NM_020410:Homo sapiens CGI-152 protein ( 
(locuslink)NM_018234:Homo sapiens duduli 
Hs.288968:RAB22A, member RAS oncogene fa 
(locuslink]NM_012289:Homosapiens Kelch- 



409936 AK001691 Hs.57655 

414675 R79015 Hs.288968 

409983 D50922 Hs.57729 

450914 AI743761 Hs.142528 

444630 AI753230 Hs.323562 (locuslink)NM_032121:Homosapiens hypoth 
401353 

441680 AW444598 Hs.7940 

406860 AA876469 

449163 AW161356 Hs.23119 



(locuslink)NM_021 1 59:Homo sapiens RAP1 , 
AA876469:oe48b04,s1 NCI_CGAP_Pr25 Homo s 
NM_003492:Homo sapiens chromosome X open 
432975 M331517
430600 AW950967
407584 W25945



Unique Eos probeset identifier number 



CAT Number Accession 

0_0 M18728 

161393J BG028348 BF772844 H83066 AW81 7969 H90985 BF755039 AI8581 83 

1495511J BE072881 AI762181 BE072946 

223120 -4 AW578452 

2189115J R45782 R45781 

12789 14 M229762 M230035 



(6_1 

455303 1 152492J BE066891 BE066895 AW892049 BE066897 BE903884 



Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled
"The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Indicates DNA strand from which exons were predicted.
Indicates nucleotide positions of predicted exons.



404140 
401353 



7630969 Plus 
7630969 Plus 



403218 
403221 
403219 

405484 
405556 
400529 
404826 
400750 
400847 



402829 8918414 



9256114 
7230116 
8119072 
7710730 



2814366 Minus 

7770475 Minus 

7143420 Plus 

9843520 Plus 

3242744 Minus 

9931296 Minus 



37-1 63623,1 6471 5-1 64968,1 65369-1 6550 
138232-138423 
47726-48046 



44643-44835 
177372-177674 

101532-101852,102006-102263 



113477-113893 

116132-116407,116653-116922 

1 13765-1 13910,1 1 5653-1 1 5765,1 1 6808-1 1 694 

14403-14602,17000-17147,17241-17368 



122409-122600 

72000-72290,72431-72700,72929-73199 
34428-34612 



144546-144854 
157909-156430 
61841-62145,62367-62756 



37761-38147 

99468-99549,100707-100848,1 00918-1 01 1 07, 
50831-51352 



Table 10A lists about 1,103 genes up-regulated in colon cancer compared to normal adult tissues, excluding non-malignant colon tissues (whole colon and colon epithelium). These
were selected from the starting collection of 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip® array as follows: the ratio of "average" colon to "average" normal adult tissues
was greater than or equal to 2.5; the "average" colon level was set to the 90th percentile value amongst colon primary cancer specimens and colon liver derived metastases; the
"average" normal adult tissue level was set to the 85th percentile value amongst non-malignant tissues, excluding the colon-derived samples; and the "average" colon value was greater than
or equal to 50 units. In order to remove gene-specific background levels of non-specific hybridization, the 15th percentile value amongst the non-malignant tissues was
subtracted from both the numerator and the denominator before the ratio was evaluated.
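The selection rule described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the procedure actually used to build the table: the function names, the linear-interpolation percentile, the use of the non-colon normal samples for the 15th percentile background, the application of the 50-unit cutoff before background subtraction, and the handling of a non-positive denominator are all assumptions made for the example.

```python
def percentile(values, pct):
    """Linear-interpolation percentile (pct in 0..100) of a non-empty list."""
    ordered = sorted(values)
    if len(ordered) == 1:
        return ordered[0]
    rank = (pct / 100) * (len(ordered) - 1)
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (rank - lo) * (ordered[hi] - ordered[lo])


def tumor_to_normal_ratio(colon_tumor, normal_non_colon):
    """Return (colon_level, ratio) for one probeset.

    colon_tumor      -- expression values in colon primaries and metastases
    normal_non_colon -- expression values in non-malignant, non-colon tissues
    """
    colon_level = percentile(colon_tumor, 90)        # "average" colon
    normal_level = percentile(normal_non_colon, 85)  # "average" normal
    background = percentile(normal_non_colon, 15)    # non-specific hybridization
    numerator = colon_level - background
    denominator = normal_level - background
    if denominator <= 0:
        # Assumption: treat a vanishing normal signal as an unbounded ratio.
        ratio = float("inf") if numerator > 0 else 0.0
    else:
        ratio = numerator / denominator
    return colon_level, ratio


def is_selected(colon_tumor, normal_non_colon, min_ratio=2.5, min_level=50):
    """Apply the two thresholds stated in the text (ratio >= 2.5, level >= 50)."""
    colon_level, ratio = tumor_to_normal_ratio(colon_tumor, normal_non_colon)
    return colon_level >= min_level and ratio >= min_ratio
```

For a probeset with hypothetical tumor values `[100, 200, 300, 400, 500]` and normal values `[10, 20, 30, 40, 50]`, `is_selected` returns `True`: the background-corrected ratio is well above 2.5 and the 90th percentile tumor level exceeds 50 units.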

TABLE 10A: 1,103 genes up-regulated in colon cancer compared to normal adult tissues excluding non-malignant colon tissues (whole colon and colon epithelium)

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal adult tissues 



Pkey ExAccn UnigeneID

436749 AA584890 Hs.5302 

428934 AF039401 Hs.194659 

446787 U67167 Hs.315 

431912 A1660552 Hs.356183 

423541 AA296922 Hs.1 29778 

406690 M29540 Hs.220S29 



418007 
422260 
421582 
441031 
424212 



Hs.1545 

! AU076801 Hs.89436 

M13509 Hs.83169 

M315993 Hs.1 05484 

AI910275 Hs.350470 

AI110684 Hs.7645 ' 

NM 005814 Hs.1 431 31 

X02544 Hs.572 

M058357 Hs.74466 

T53925 Hs.1 07 



430178 
423673 
447400 
409683 
431777 



410407 X66839 



AK000322 Hs.1 8457 

U33317 Hs.711 

AA570296 Hs.307047 

L15533 Hs.423 

M82962 Hs.1 79704 

T64297 Hs.351719 



Hs.75498 



412374 
407244 
419741 



413585 / 



NM_007019 Hs.93002 
M058357 Hs.74466 
X00442 Hs.75990 

Hs.75431 



414463 T69078 Hs.76177 

430828 AI763257 Hs.86327 

433927 AI557019 Hs.116467 

406687 M31126 Hs.352054 

423538 AW603823 Hs,1 46268 

434206 AW136973 Hs.362915 

409041 AB033025 Hs.50081 

432542 AW083920 Hs.1 6098 

422664 M315933 Hs.1 20879 

436330 NM.004413 Hs.109 

X73079 Hs.288579 

X04898 Hs.237658 

L00190 Hs.75599 

AF097021 Hs.273321 

ACO02301 Hs.1 84507 

U22376 Hs.1334 

M025386 Hs.61311 

AW293464 Hs.1 62031 

AJ243212 Hs.374281 

AW273128 Hs.300268 

AI357412 Hs.1 57601 

AW374954 Hs.71779 
M18728 

AW939252 Hs.192927 



421964 
430272 
413881 



447033 
411734 
406685 
428753 



409153 W03754 Hs.50813 

449388 H53191 Hs.36723 

428046 AW812795 Hs.337534 

433013 AI697890 Hs.127337 

419079 AW014836 Hs.18844 

428355 BE256452 Hs.2257 

422956 BE545072 Hs.1 22579 

424326 NM_014479 Hs.1 45296 

450543 AI394037 Hs.170296 

428187 AI687303 Hs.285529 

435538 AB011540 Hs.4930 

411825 AK000334 Hs.352415 

427722 AK000123 Hs.1 80479 

430569 AF241254 Hs.178098 

414816 Y13709 Hs.77399 

430577 Z26317 Hs.359784 



Unigene Title 

NM_006149:Homo sapiens lectin, galactosi
NM_001285:Homo sapiens chloride channel,
NM_002457:Homo sapiens mucin 2, intestin
H i6183.E U kly mi r to F H
NM_014471:Homo sapiens serine protease i
(locuslink)NM_004363:Homo sapiens carcin
Hs.84905:cytokeratin 20

NM_006507:Homo sapiens regenerating isle
NM_033049:Homo sapiens mucin 13, epithel
NM_001804:Homo sapiens caudal type homeo
NM_004063:Homo sapiens cadherin 17, LI c
NM_002421:Homo sapiens matrix metallopro
NM_032044:Homo sapiens regenerating gene
NM_003225:Homo sapiens trefoil factor 1
NM_005141:Homo sapiens fibrinogen, B bet
NM_005814:Homo sapiens glycoprotein A33
Hs.572:orosomucoid 1
(locuslink)NM_006890:Homo sapiens carcin
NM_004467:Homo sapiens fibrinogen-like 1
NM_000111:Homo sapiens solute carrier fa
Hs.152475:ESTs

NM_002426:Homo sapiens matrix metallopro
NM_017763:Homo sapiens hypothetical prot
NM.0019 II m sap sdef isi ' C
NM_032579:Homo sapiens colon and small i
NM_138938:Homo sapiens pancreatitis-asso
NM_005588:Homo sapiens meprin A, alpha (
NM_001443:Homo sapiens fatty acid bindin
NM_001216:Homo sapiens carbonic anhydras
NM_004591:Homo sapiens small inducible c
NM_000477:Homo sapiens albumin (ALB), mR
NM_000040:Homo sapiens apolipoprotein C-

Hs.93Q02:ubiquItin-CQnjugating enzyme E2 



NM.005143-.Homo sapiens haptoglobin (HP), 



NM_000613:Homo sapiens hemopexin (HPX), 
NM.001633:Homo sapiens alpha-1-microglob 
Hs.86327:homeoboxB9 



Hs.352054:pregnancy specific beta-1-glyc 
Hs.146268:ESTs, Weakly similar to C71400 
Hs.362915:Homo sapiens cDNA FU34876 fis 
Hs.50081:KIAA1199 protein 
NM_020384:Homo sapiens claudin 2 (CLDN2) 
Hs.120879:Homo sapiens, clone VGC-.32871 
NM_004413:Homo sapiens dipeplidase 1 (re 
NM_002644:Homo sapiens po'ymeric immunog 
Hs.237658:apolipoproteinA-ll 
(locoslinl NM.CC048B Homo sapiens serine 
NM 00641 8:Homo sapiens differentially ex 
I omosaf nilartoHomol 

NM_005375:Homo sapiens v-myb myeloblasto 
Hs.61311:ESTs, Weakly imilarto S10590 
Hs.1 62031 :ESTs 

NM_007329:Homo sapiens deleted in malign 

Hs.300268:EST 

Hs.1 57601 :ESTs 

lls.71779:ESl Weakly r arl S24 r x 
loc linl -I" 0 83 iorno ' r ii 
NM_017726:Homo sapiens protein phosphata 
Hs.333893:ce'l division cycle assxiated 
NM_017625:Homo sapiens intelectin (ITLN) 
Hs.35723:ESTs, Weakly similar to C05G5.5 
Hs.337534:Homo sapiens cDNA FU25241 fis 
(locuslink)NM_004655:Homo sapiens axin 2 
Hs.18844:ESTs 

NM_000638:Homo sapiens vitronectin (seru 
(locuslink)NM_01809B:Homo sapiens epithe 
NM_014479:Homo sapiens ADAM-like, decysi 
Hs.17 i h iLN rti ■<{ fti 
Hs.285529:G protein-coupled receptor 49 
Hs.4930:low density lipoprotein receptor 
NM_017767:Homo sapiens solute earner fa 
NM_017671:Homo sapiens chromosome 20 ope 
NM_021804:Hon 
447208 BE315291 Hs.237971 NM_02409S:Homo sapiens hypothetical prot



Hs.19322 
Hs.373599 
Hs.109358 



NM_005409:Homo sa P :ens small inducible c 
Hs.1932?:Homo sapiens, Similar to RIKEf 
Hs.373599:EST 

Hs.109358:ATPase, Class V, type 10B 



408380 
415214 
431330 
420344 



Hs.2777 
Hs.97101 
AU076405 Hs.29981 



441377 BE218239 Hs.202656 
444666 BE293347 Hs.11638 
415701 NM_003878 Hs.78619 



455630 
422310 
423337 
422330 



408704 
420005 
451035 
427506 
423445 



411975 
425371 
451917 



430294 AI538226 



NM_006398:Homo sapiens ubiquitin D (UBD) 
NM_004442:Homo sap'ens EphB2 (EPHB2), tr 
NM_002215:Homo sapiens inter-alpha (glob 
NM_014373:Homo sapiens putative G protei 



Hs.12 

AV655701 Hs.75183 

M316S22 Hs.98370 

NM_004655 Hs.127337 

□30783 Hs.115263 

BE296227 Hs.250822 

AA056635 Hs.53S6 

AW271106 Hs.133294 

AU076785 Hs.430 

AK000134 Hs.179100 

NMJ14324 Hs.128749 

M355925 Hs.36232 

AA502334 Hs.151529 

NM_000492 Hs.663 

AA045573 Hs.22900 

NM_016C45 Hs.3945 
M18728 

AI381530 Hs.371132 

AI745325 Hs.271923 

W40460 Hs.144442 

AI916058 Hs.144583 

D49441 Hs.155981 

AW391351 Hs.50820 



Hs.91096 
tls.32976 
Hs.69321 



X53463 Hs.2704 



414809 AI434699 Hs.77355 
400965 

430204 AA618335 Hs.356654 

432978 AF126743 Hs.279884 

417931 W95642 Hs.82961 

430832 AI073913 Hs.100586 

403482 NM_000676 Hs.45743 

422487 AJ010901 Hs.198267 



449720 AA311152 Hs.288708 

411142 NM.014256 Hs.69009 

421777 BE562088 Hs.108196 

419395 BE268326 Hs.90280 

443211 AI128388 Hs.143655 

430603 AA148184 Hs.247280 

409757 NM_001898 Hs.123114 

426227 U67058 Hs,154299 

421408 AI688223 Hs.91096 

421100 AW351839 Hs,124660 

440869 NM_014297 Hs.7486 

414075 U11862 Hs.75741 

444151 AW972917 Hs.128749 

440409 AW294316 Hs.125608 

445564 AB028957 Hs.12896 



Hs.202656:ESTs 
to lir ,HM 316234:1 lomo sapiens fatty- 
1 us ink NM_TO878:Homo sapiens gamma- 
NM_00C042:Homo sapiens apolipoprotcin H 
NM_000773:Homo sapiens cytochrome P450, 
(locuslink)NM_030622:Homo sapiens cytoch 
NM_004655:Homo sapiens axin 2 (conductin 
NM_001432:Homo sapiens epiregulin (EREG) 
(loaislink)NM_003158:Homc sapiens serine 
NM_005170:Hc 



NM_0C2670:Homo sapiens plastin 1 (I isof 
NM_017678:Homo sapiens hypothetical prot 
NM_014324:Homo sapiens alphMethylacyl- 
NM_021067:Homo sapiens KIAA0186 gene pro 



NM_004289:Horno sapiens nuclear factor (e 
NM_016045:Homo sapiens chromosome 20 ope 
(to i link NM_TO4B3 Homo sapiens carcin 



Hs.144583:Homo sapiens, clone IVAGE:3462 
NM_005823:Homo sapiens mesothelin (MSLN) 
Hs.50820:liypotheticai cardiac/skeletal m " 
Hs.233364:ESTs 

NM_007028:Homo sapiens tripartite motif- 
locu iiniWJ^J r Hj. 'i n i jnii 
Hs.69321:KIAA1359 protein 

NM_002083:Homo sapiens glutathione perox 
AF134163:Homo sapiens Human endogenous r 
(locuslink)NM_012112:Homo sapiens chromo 
NMJ3301 3.Homo sapiens nuclear re^ptor 



Hs.77356:transfem'n receptor (p90, CD71 

Hs.356664:hypothetical protein FU32334 
NM_013238:Homo sapiens DNAJ domain-conta 
Hs.82961:Homo sapiens, c'one MGC:22588 1 
Hs.100686.anterior gradient protein 3 
NM_000676:Homo sapiens adenosine A2b rec 
NM_018406:Homo sapiens mucin 4, tracheob 
liit ft i i i i i i h 



NM_021784:Homo sapiens hepatocyte nut „ 
(locu lint NM_D25113:Homo sapiens h/poth 
NM_014256:Homo sap'ens UDP-GlcNAc:belaGa 
NM_016095:Horno sapiens HSPC037 protein ( 
Hs.90280:5-aminoim!dazole-4H:arboxamide 
Hs.143655:ESTs 

Hs.247280:chromosome 20 open reading fia 
NM_0 1898:Homo ap ei , tatin SN { 
(lxuslink)NMJ)05242:Homo sapiens coagul 
NM_052816:Homo sapiens tripartite motif- 
Hs.124660:ESTs, Moderately similar to 21 
NM_014297:Homo sapiens protein expressed 
NM 0( 1091 Horn if i jmilorid inding 
(locuslink)NM_014324:Homo sapiens alpha* 





U58766 


Hs.264428 


Hs.264428:tissue specific transplantatio 
Hs.158989:Homo sapiens cDNA FU37936 fis 


4.10 


430187 


AI799909 


Hs.158989 


4.10 


414590 NM_000506 Hs.76530 NM_000506:Homo sapiens coagulation facto 4.08
449281 AI808699 Hs.162717 NM_032755:Homo sapiens hypothetical prot 4.08
449722 BE280074 Hs.23960 Hs.23960:cyclin B1 4.08
418318 U47732 Hs.84072 NM_004616:Homo sapiens transmembrane 4 s 4.06




BE261750 




NM_001363:Horno sapiens dyskeratosis cong 


4.05 


414753 AF158255 Hs.77225 NM_006437:Homo sapiens ADP-ribosyltransf 4.04


409889 
449027 


AW630041 


Hs.56937 


NM_021978:Homo sapiens suppression of lu 


4.04 
4.03 


AJ271216 


Hs.22880 


NM_005700:Homo sapiens dipeptldylpeptida 


4.03 


431657 


AI345227 


Hs.1 05448 


l lr I054 rr tei kin e, f> defi 


4.01 


439759 


AL359055 


Hs.67709 


Hs.67709:Homo sapiens mRNA full length i 




441362 


BE614410 


Hs.23044 


NM_080668:Homo sapiens similar to RIKEN 


3.99 


417900 




Hs.82906 


Hs.82906:CDC20 cell division cycie 20 ho 


3.99 


428987 NM_004751 Hs.194710 NM_004751:Homo sapiens glucosaminyl (N-a 3.99
456977 AK000252 Hs.169758 NM_017723:Homo sapiens hypothetical prot 3.99


445919 


T53519 




Hs.334692:hypothetical protein MGC14141 


3.98 


423164 


AK000232 




NM_019062:Homo sapiens hypothetical prot 


3.98 


422627 


BE336857 


Hs.1 18787 


Hs.118787:transforming growth factor, be 




422363 


T55979 


Hs.1 15474 


NM_002915:Homo sapiens replication facto 


3^97 


433437 




Hs.3280 


NM_001226:Homo sapiens caspase 6, apopto 


3.9? 


415992 


C05837 


Hs.1 45807 


Hs.145807:hypolhetlcal protein FU13593 


3.9G 


425834 


NMJ01639 


Hs.1 957 


Hs.1957:amyloid P component, serum 




415000 


AW025529 


Hs.239812 


Hs.239812:serologically defined breastc 


335 


432407 


AA221036 




AF134164:Homo sapiens Human endogenous r 


3.95 


408243 


Y00787 


Hs.624 


N Mt i. k 


3.94 


408494 AA554714 Hs.187578 Hs.187578:Homo sapiens cDNA FLJ11639 fis 3.94
412610 X90908 Hs.74126 NM_001445:Homo sapiens fatty acid bindin 3.94
433323 AA805132 Hs.159142 Hs.159142:lunatic fringe homolog (Drosop 3.94


422515 


AW500470 


Hs.1 17950 




3.92 


436543 


NM_002212 


Hs.5215 


I l5:integrinbE 1 binding protcii 


3.91 


418113 AI272141 Hs.83484 Hs.83484:SRY (sex determining region Y)- 3.91
433083 AL042759 Hs.191762 Hs.191762:hypothetical protein MGC20258 3.90
431939 AW008061 Hs.231994 Hs.231994:Homo sapiens, clone IMAGE:4341 3.90
453439 AI572438 Hs.32976 NM_004485:Homo sapiens guanine nucleotid 3.89


441888 


AI733306 


Hs.128071 


NM_022901:HomD sapiens hypothetical prot 




432150 AK000224 Hs.272789 NM_017716:Homo sapiens membrane-spanning 3.89
425234 AW152225 Hs.165909 Hs.165909:ESTs, Weakly similar to hypoth 3.89
423803 NM_005709 Hs.132945 (locuslink)NM_005709:Homo sapiens PDZ-73 3.87
410418 D31382 Hs.63325 NM_019894:Homo sapiens transmembrane pro 3.85
436251 BE515065 Hs.296585 (L i ri/IM 1 ! tn j| en < 3.85
422424 AI186431 Hs.296638 Hs.296638:prostate differentiation facto 3.84
432269 NM_002447 Hs.2942 Hs.2942:macrophage stimulating 1 recepto 3.84
424905 NM_002497 Hs.153704 NM_002497:Homo sapiens NIMA (never in mi 3.84
416209 AA236776 Hs.79078 NM_002358:Homo sapiens MAD2 mitotic arre 3.83




AW1 38724 


Hs.168974 


Hs.168974:ESTs 


3.83 


434370 
436481 


AF130988 


Hs.58346 


NM_022336:llomo sapiens ectodysplasin 1, 


3.82 


AA379597 




NM_014176:Homo sapiens HSPC150 protein s 


3.82 


453700 AB009426 Hs.560 NM_001644:Homo sapiens apolipoprotein B 3.81
410619 BE512730 Hs.65114 Hs.65114:keratin 18 3.81
409420 Z15008 Hs.54451 NM_005562:Homo sapiens laminin, gamma 2 3.79


422535 




Hs.154578 


Hs.154578:Homo sapiens mRNAfor FU00256 


3.79 


432179 X75208 Hs.2913 NM_004443:Homo sapiens EphB3 (EPHB3), t.R 3.78
453967 AW009077 Hs.232947 Hs.232947:ESTs 3.78
426106 AI678765 Hs.21812 Hs.21812:ESTs 3.78
434170 AA626509 Hs.159642 (locuslink)NM_001490:Homo sapiens glucos 3.78
418322 AA284166 Hs.84113 NM_005192:Homo sapiens cyclin-dependent 3.78
444381 BE387335 Hs.283713 NM_138455:Homo sapiens collagen triple h 3.74


41S229 


AI827237 


Hs.362919 


Hs.362919:ESTs 


3.74 


437 ICG 


AI916600 


Hs.121194 


Hs.121194:Homo sapiens cDNA: FLJ21569 fi 


3.74 


452833 


BE559681 


Hs.30736 


(locusl!nk)NM_015201:Homo sapiens block 


a73 


426831 


BE296216 


Hs.1 72673 


NM_000687:Homo sapiens S-adenosylhomccys 


3,73 


428970 


BE276891 


Hs.194691 


NM_003979:Homo sapiens retinoic acid ind 


3.72 


443957 


AA521049 


Hs.353013 






408832 


AW085690 


Hs.63428 


Hs.63428:Homo sapiens cDNA FU34457 fis, 


3,71 


440300 


N39760 


Hs.8859 


NM_138793:Homo sapiens apyrase (SHAPY), 


3.71 


425976 


C75094 




NM_025257:Homo sapiens chromosome 6 open 




432575 


Awfo5197 


Hs.1 94346 


Hs.1 94346:Spir-2 protein 


3.71 












417001 


AU076648 


HsJ0741 


NM_000282:Homo sapiens propionyl Coenzym 




421225 


AA463798 




Hs.102696:MCT-1 protein 


t<S9 


445109 


AF039916 


Hs!l2330 


NM..001247:Homo sapiens eclonuoieoside Ir 


3.69 


428330 


L22524 




NM_002423:Homo sapiens matrix metallopro 




447472 


AW207347 


HSJ11101 


Hs.211101:ESTs 


3*.67 




AF010258 


Hs.1 27428 


NM_002142:Homo sapiens homeo boxA9(HOX 


3.67 


422026 


U80736 


Hs.1 10826 


Hs.1 1 0826:trinuclectide repeat containin 


3.66 


419574 


AK001989 




Hs.91 165\hypothetical protein FLJ1 1 1 27 


3.66 




AA205625 


H&208067 


Hs.208067:ESTs 


3.66 


411257 
421515 


AA628967 
Y11339 


Hs.1 15274 


Hs.1 1 5274:lndian hedgehog homolog (Droso 




Hs.1 05352 


( islink)NM_01 4l4Hirr piei I NAc 


3^65 


433675 


AW977653 


Hs.75319 


Hs.75319:ribonucleotide reductase M2 pol 


3.65 
437009
445417 
412140 






AF1 27026 Hs.5394 

AK001058 Hs.12680 

AA219691 Hs.73625 

NM_000505 Hs.1321 

BE264974 ' 



414798 AI286323 



is.97411 



414052 
412056 
401519 
428011 



422765 
422106 
422532 



446372 
432378 
434171 
428479 
415099 
414918 
440340 
418384 



410237 
428407 
436213 
442923 
431548 



422158 
418256 
400157 
406709 
453751 
42152G 
415164 



Hs.2316 



NM_000346 H 

AW578849 Hs.283552 

T28160 Hs.778 

BE387514 Hs.181418 

NM_004572 Hs.25051 

AW079940 Hs.15951 

AF113676 Hs.297681 

M88700 Hs.150403 

AI027643 Hs.120912 

AF217515 Hs.283532 

AI650363 Hs.116462 

AW247529 Hs.6793 

AA515032 Hs.91109 

AI867118 Hs.279607 

H43540 Hs.25292 

AF207069 Hs.1 53357 

AW409701 Hs.1578 

D84239 Hs.111732 

AL008726 Hs.118126 



Hs.1964 
Hs.24968 
Hs.356766 
Hs.78919 



L29339 
R35343 
W52452 
AL120168 
AB020644 
AI493046 Hs.146133 
BE247688 Hs.347349 
Y00272 hb.334562 
AI492170 Hs.77917 
AI219207 Hs.72222 
AW895503 Hs.125276 
AW149266 Hs.25130 
X54942 Hs.83758 
NM_012079 Hs.288627 
M446644 Hs.692 
N51357 Hs.260855 
AI638516 Hs.347524 
AA377618 Hs.103834 
AI750589 Hs.61258 
NM_003963 Hs.184194 
AA325512 Hs,71472 
AW248322 Hs.95835 
AI834273 Hs.9711 
AA298484 Hs.61265 
AI471630 Hs.355952 
AI675419 
AI267592 
AK001927 
AF086401 Hs.293847 
L10343 
AW845318 

Hs.356473 

AI355761 
R36762 
AL080121 
AW084352 



S.101282 
s.105460 
S.157123 

S.204044 



AI086138 

M13149 

' AA339449 Hs.82285 

J03258 Hs.2062 

! AA305384 Hs.25740 

i AW891965 Hs.367942 

: BE387040 Hs.182476 

i AI940507 Hs.318526 

M90516 Hs.1674 

' AW250S51 Hs.26213 



429271 
437575 
432677 



NM_004482 Hs.278611 



NM_005379:Homo sapiens myosin !A (MY01A) 
Hs.12680;Homo sapiens cDNA FU10196 fis, 
NHL005733:Homo sapiens RAB6 interacting, 
NM_000505:Homo sa ' 



Hs.778:guanylate cyclase ac 



lor2A(g 



NM_014730:Homo sapiens KiAA0152gene pro 
NM_004572:Homo sapiens plakophilin 2 (PK 
ti J In l45202:Hoi j n r !,r 
NM_000295:Homo sapiens serine (orcystei 
NM_000790:Homo sapiens dopa decarboxylas 
Hs.120912:EST3 

NM_018455:Homo sapiens uncharacterized b 
Hs.116462:ESTs 

Hs.6793:platelel-aclivaling factor acety 
. H ^EST Weakly similar to putativ 
Hs.279607:Homo sapiens cDNA FU34399 fis 
Hs.25292:ribonuclease H2, large subunil 
NM_001084:Homo sapiens protoll, jen-lys'i 
NM_00116&Homo sapiens baculoviral IAP r 
NM_003890:Homo sapiens IgG Fc binding or 
(loouslink)NM_000308Homo sapiens protec 
NM_000343:Homo sapiens solute earner fa 
Hs.24968:hypotheticat protein BC016683 
Hs.356766:Homo sapiens mFSNA; cDNA DKFZp7 
NM_021083:Homo sapiens Kell blcod group 
Hs.14945:fatty-acid-Coenzyme A ligase, I 
Hs.146133:ESTs 

(locuslink)NM_004749:Homo sapiens cell c 
NM_001786:Homo sapiens cell divls on c <c 



Hs.25130:Homosa| 

NM J01827:Homo sapiens CDC28 protein kin 
fM 01 , ij 10 

NM_002354:Horno sapiens tumor-associated 
' ml Ut> i' Homosa r L ( 
Hs.347524:Homo sapiens, clone MGC:24665 
NM_024056:Homo sapiens hypothetical oral 
Hs.61258:argininosur,cii,i t K, a 
NM_003963:Homosa| 



NM_017515:Homo sapiens novel protein (HS 
NM_138805:Homo sapiens family witii seque 
I ,r j2 ESTs, Weakly similar to 090320 
Hs.164464:Homo sapiens, clone MGC:23656 
NM_003137:Homo sapiens SFRS protein kina 
(locuslink)NM_01B099:Hamo sapiens hypotti 
Hs.293847:ESTs 

NM_002638:Homo sapiens protease inhibito 
(tocuslink)NM..012162:Homo sapiens F-box 
NM_0067ia-Homo sapiens activated RNA pol 
Hs.242463:keratin 8 

Hs.101282:Homo sapiens mRNA; cDNA DKFZp4 
NM_015393:Homo sapiens DKFZP564O0823 pro 
Hs.157123:ESTs 

Hs.204044:ESTs 

NM_000412Homo sapiens histidine-rich gl 
NM_000819:Homo sapiens phosphoribosylgly 
I 2062:v mi D(1,2: iih jroxyvitam 
NM_014584:Homo sapiens EROI-like (S. car 
Hs.367942Homo sapiens, clone IMAGE;4701 
NM_031295:Homo sapiens Wiliams Beuren s 
NM_025138:Homo sapiens hypothetical prot 
NM_002056:Homo sapiens glutamine-fructos 
NM_052951:Homo sapiens chromosome 20 ope 
Hs.1 60417*ansmembrane protein 2 
NM_005224:Homo sapiens dead ringer-like 
NM_024320:Horno sapiens hypothetical prot 
NM_004482:Homo sapiens UDP-N-acetyl-alph 



435745 , 
403532 
413916 I 
424996 AF006005 Hs.154104
402944 

417165 R80137 Hs.302738 

427528 AU077143 Hs.1 79565 

426711 AA383471 Hs.343BO0 

439186 AI697274 Hs.1 05435 

444783 AK001468 Hs.62180 

426174 AA547959 Hs.115838 

421585 U95626 Hs.302043 

421605 BE440108 Hs.106127 

446921 AB012113 Hs.16530 

438746 AI885815 Hs.184727 
403219 

420981 L40904 Hs.100724 

456946 T29678 Hs.166068 

425580 L11144 Hs.1907 

412605 AW410734 Hs.74111 

441384 AA447849 Hs.288660 

416782 L35035 Hs.79886 

426761 AI015709 Hs.1 72089 

441633 AW958544 Hs.112242 



A1670056 HG.137274 

AF178532 Hs.271411 

M878200 Hs.1 18727 

BE269047 Hs.65234 

AW007152 Hs.63325 

AW362867 Hs.302738 

NMJ14791 Hs.184339 

BE246399 Hs.367646 



no sapiens pleiomorphic ad 



Hs.105435:GDP-mannose4,6-dehydratase 
NM_018685:Homo sapiens anillir,, actin bi 
Hs.115838:ESTs 

NM_003965:Homo sapiens chemokine (C-C mo 
NM_015972:Homo sapiens RNA polymerase I 
NM_002988:Homo sapiens small Inducib'e c 
Hs.1 84727:ESTs, Weakly similar to T45738 



NM_005037:Hc 



o sapiens peroxisome prolif 



Hs.741 1 1:RNA binding protein (autoanlige 
Hs.288660:Homo sapiens oDNA: FLJ22182 11 
(locuslink)NM_144563:Homo sapiens ribose 
Hs.172089'.pro-oncosis receptor inducing 
NM_032413:Homo sapiens norma! mucosa of 
NM_004131:Homo sapiens granzyme B (granz 
Hs.194366:transthyretjn (prealbumin, amy 
Hs.1 37274:ESTs, Weakly similar to hypoth 
NM_138992:Homo sapiens beta-site APP-cle 
- Hs.1 18727:Homo sapiens cDNA FU33803 (is 
(loouslink)NM_017895:Homo sapiens DEAD/H 
i ls.63325:lransmembrane protease, serine 
Hs.302738:Homo sapiens cONA: FLJ21425 fi 
NM_014791:Homo sapiens maternal embryorf 
NM_016617:Homo sapiens hypothetical prot 



428024 229067 Hs.2236 
412994 D32257 Hs.75113 
443162 T49951 Hs.9029 
435327 BE301871 Hs.4867 
424010 AL080188 Hs.137556 
428953 M306610 Hs.348183 
Hs.90073 
Hs.76913 



414695 BE439915 

444371 BE540274 

450221 AA328102 

449207 AL044222 

422609 Z46023 
403485 

441623 AA315805 

421943 BE616520 
406484 

435849 BE305242 

449139 BE258315 



Hs.24641 
Hs.23255 
Hs.1 18721 






457465 
426317 
414639 

444261 

424837 
427747 



400130 Hs.155560 
422293 X94453 Hs.114366 
400290 H18836 Hs.31608 



Hs.2236:NIMA (never in mitosis gene a)-r
Hs.75113:general transcription factor II
(locuslink)NM_015515:Homo sapiens type I
Hs.4867:mannosyl (alpha 1,3-)-glycoprote
NM_033100:Homo sapiens MT-protocadherin
NM_003823:Homo sapiens tumor necrosis fa
Hs.90073:CSE1 chromosome segregation 1-l
Hs.76913:proteasome (prosome, macropain)
Hs.239:forkhead box M1
NM_018204:Homo sapiens cytoskeleton asso
NM_004298:Homo sapiens nucleoporin 155kD
NM_000434:Homo sapiens sialidase 1 (lyso



447188 H65423 Hs.17631 

423226 AA323414 Hs.146109 

413254 U40272 Hs.75253 

424243 AI949359 Hs.143600

435014 BE560898 Hs.10026 

452281 T93500 Hs.28792 

416065 BE267931 Hs.78996 

427333 AF067797 Hs.176658 

443464 BE548446 Hs.321579 

432035 AA524725 Hs.162108

408868 AW292286 Hs.255058 

429504 X99133 Hs.204238 

441085 AW136551 Hs.181245 

426991 AK001536 Hs.214410 

408901 AK001330 Hs.48855 

439979 AW600291 Hs.6823 

453968 AA847843 Hs.62711 

AW301344 Hs.122908 

AA312350 Hs.169294 

X67055 Hs.76716 

AW328081 Hs.6817 

AA298958 Hs.10724

AF107457 Hs.37035 

BE276113 Hs.333034 

AW411425 Hs.180655 



NM_004461:Homo sapiens phenylalanine-tRN

NM_030804:Homo sapiens hypothetical prot
Hs.146109:ESTs, Weakly similar to T28937
NM_004135:Homo sapiens isocitrate dehydr
Hs.143600:golgi phosphoprotein 4
NM_022061:Homo sapiens ribosomal protein
II 28 ! Homosap nscDN FL 11041 fi
NM_002592:Homo sapiens proliferating cel
' saquaporir,8(AQP8 



NM.021095:Homosa| 
Hs.162108:ESTs



cogenc 24p3) 
Hs.181245:Homo sapiens cDNA FLJ12532 fis
Hs.214410:Homo sapiens cDNA FLJ31573 fis
(locuslink)NM_018101:Homo sapiens hypoth
NM_018092:Homo sapiens neuropilin (NRP)
Hs.62711:Homo sapiens, clone IMAGE:33512
NM_030928:Homo sapiens DNA replication I
NM_003202:Homo sapiens transcription fac
NM_002217:Homo sapiens pre-alpha (globul 
NM_033453:Homo sapiens inosine triphosph 



NM_005515:Homo sapiens homeo box HB9 (HL
NM_003491:Homo sapiens ARD1 homolog, N-a
U linl IJM i 1421 Homosapiei rii
Hs.5198:Down syndrome critical region ge
NM_001746:Homo sapiens calnexin (CANX),
Hs.114366:pyrroline-5-carboxylate synthe
(locuslink)NM_017636:Homo sapiens transi



419239 


M468183 


Hs.335798 


(locuslink)NM_033103:Homo sapiens rhophi


3.18 


426215 


AW963419 


Hs.155223 


NM_003714:Homo sapiens stanniocalcin 2 (


3.18 


425743 


BE396495 


Hs.159428


NM_138761:Homo sapiens BCL2-associated X


3.17 


413313 


NM_002047 Hs.293885 


NM_002047:Homo sapiens glycyl-tRNA synth


3.17 


422714 


AB018335 


Hs.119387


NM_014698:Homo sapiens KIAA0792 gene pro 


3.17 


408353 


BE439838 




NM_015969:Homo sapiens mitochondrial rib


3.17 


400203 






NM_002794:Homo sapiens proteasome (proso




412870 




Hs.82407 


NM_022059:Homo sapiens chemokine (C-X-C


3.16 


426088 




Hs.166196


NM_005603:Homo sapiens ATPase, Class I,


3.16 


416984 




Hs.80706 


NM_000903:Homo sapiens NAD(P)H dehydroge


3.16 


450635 


AW403954 


Hs.25237 


NM_016647:Homo sapiens mesenchymal stem 


3.16 


406708 


AI282759 




AI282759:qt84a01.x1 NCi CGAP Cc14 Homos 


3.16 


452888 


AW955454 




NM_004093:Homo sapiens ephrin-B2 (EFNB2) 


3.16 


430127 


H60720 98 


Hs.233952 


Hs.233952:proteasome (prosome, macropain 


3.16


417308 




Hs.81892 


NM_014736:Homo sapiens KIAA0101 gene pro 


3.15






Hs.289052 


NM_052842:Homo sapiens BCL2-like 12 (pro


3.15


402474 








3.15




AA316241 


Hs.90691 


NM_006993:Homo sapiens nucleophosmin/nuc 


3.15 


446595 






NM_017943:Homo sapiens hypothetical prot


3.15




AW247076 


Hs.12163


NM_003908:Homo sapiens eukaryotic transl


3.15 


434263 




Hs.79187 


Hs.79187:coxsackie virus and adenovirus 


3.15


411165 


NM_000169 


Hs.69089 


NM_000169:Homo sapiens galactosidase, al 


3.15 


430696 


AA531276 


Hs.59509 


Hs.59509:ESTs, Weakly similar to similar 


3.15 


436391 


AJ227892 


Hs.146274 


Hs.146274:ESTs 


3.15 


409142 


AL136877 




Hs.50758:SMC4 structural maintenance of 


3.14 


445873 


AA250970 


Hs.251946


Hs.251946:Homo sapiens cDNA FLJ11840 fis


3.14 


432370 


AA308334 


Hs.274424 


NM_018946:Homo sapiens N-acetylneuramini


3.14 


430514 


AA318501 


Hs.241587 


NM_021246:Homo sapiens lymphocyte antige 


3.14 


417791 


AW965339 


Hs.44269 


Hs.44269:Homo sapiens cDNA FLJ37972 fis,


3.14 


417115 


AW952792 


Hs.334612 


NM_003094:Homo sapiens small nuclear rib 


3.13 


411126 


NM_001202 Hs.68879


(locuslink)NM_001202:Homo sapiens bone m


3.13


456906 




Hs.156637 


NM_012116:Homo sapiens Cas-Br-M (murine) 


3.13 


425123 


AW205274 




NM_000303:Homo sapiens phosphomannomutas


3.13 


408056 


M312329 


Hs.42331


Hs.42331:ephrin-A4 


3.13 


446386 


A1032108 


Hs.54424 


Hs.54424:hepatocyte nuclear factor 4, al


3.12 


407804 


AF228603 


Hs.39957 


NM_016445:Homo sapiens pleckstrin 2 (mou


3.12 


412723 


AA648459 


Hs.335951


Hs.335951:hypothetical protein AF301222


3.12 


407233 




Hs.50964 


(locuslink)NM_001712:Homo sapiens carcin




407168 






Hs.117183:Homo sapiens mRNA; cDNA DKFZp5 


3.11 


452721 


AJ269529 


Hs.301871 


Hs.301871:solute carrier family 37 (glyc




426427 


M86699 


Hs.169840


Hs.169840:TTK protein kinase 


3.11 


409162 


H25530 


Hs.50868 


Hs.50868:solute carrier family 22 (organ 


3.10 




NM_000047


Hs.74131 






434861 


AA206153 


Hs.4209 


NM_016491:Homo sapiens mitochondrial rib


3.10


407944 


R34008 




NM_024422:Homo sapiens desmocollin 2 (DS


3.10 


414806 


01 4694 


Hs H 2! 


(locuslink)NM_014754:Homo sapiens phosph


3.09 


454464 


AW811606 


Hs.271819 


Hs.271819:mucin 17 


3.09 


415474 


NM_014252




NM_014252:Homo sapiens solute carrier fa 


3.09 


422616 


BE300330 


Hs.118725


NM_012248:Homo sapiens selenophosphate s 




421470 


R27496 


Hs.1378 


NM_005139:Homo sapiens annexin A3 (ANXA3


3.09 


419551 


AW582256 


Hs.91011


NM_006408:Homo sapiens anterior gradient


3.09 


418691 






Hs.87296:Homo sapiens cDNA FLJ20269 fis,


3.08 


422163 


AF027208 


Hs.112360


Hs.112360:prominin-like 1 (mouse)


3.08 


405506 


AI431328 


Hs.348605 


NM_052963:Homo sapiens mitochondrial top 


3.08 
3.08 


429957 


AW204530 


Hs.99500 


Hs.99500:ESTs 


3.08 


410166 


AK001376 


Hs.59346 


NM_018122:Homo sapiens hypothetical prot


3.08


422880 


AF228704 


Hs.193974 


Hs.193974:glutathione reductase


3.08 


442013 


AA506476 


Hs.375009 


Hs.375009:Homo sapiens mRNA; cDNA DKFZpS 


3.08 


431722 


AF161528 


Hs.268049 


(locuslink)NM_016101:Homo sapiens hypoth


3.08 


421506 


BE302796 


Hs.105097 


Hs.105097:thymidine kinase 1, soluble 


3.08




AK001301 


Hs.3487 


Hs-VKI^ 




439492 


AF086310 


Hs.103159




3.07


417866 


AW067903 


Hs.82772 


Hs.82772:collagen, type XI, alpha 1


3.07


412530 


AA766268 


Hs.266273 


(I i 1 I I • 1 i Urr npicns chrorr 


3.07




AA290712 








453132 


AW951952 


Hs.293724 


Hs.293724:Homo sapiens cDNA FLJ12683 fis


3.06


424971 


AA479005 


Hs.154036 


NM_003311:Homo sapiens tumor suppressing 


3.06 


427557 


NMJW265! 


3 Hs.179657 


NM_002659:Homo sapiens plasminogen activ


3.05 


439273 


AW139099


Hs.367692 


Hs.367692:Homo sapiens cDNA FLJ25668 fis


3.05 


431945 


AW000827 


Hs.11962 


NM_030766:Homo sapiens apoptosis regulat 


3.05 




AW630133 




(locuslink)NM_020192:Homo sapiens GK003 




407289 
403739 


AA135159 




Hs.203349:Homo sapiens cDNA FLJ12149 fis


3.04


444664 


N26362 


Hs.11615 


NM_016086:Homo sapiens map kinase phosph


3.04


409152 


AA176585 


Hs.194346 


Hs.194346:Spir-2 protein 


3.04 


409093 
406545 


BE243834 


Hs.50441 


NM_015936:Homo sapiens CGI-04 protein (L


3.04 
3.03 




AW850613 




Hs.8715:hypothetical protein MGC3232 


3.03 


418867 


D31771 


Hs.89404


NM_002449:Homo sapiens msh homeo box hom


3.03 


422976 


AU076657 


Hs.1600


Hs.1600:chaperonin containing TCP1, subu


3.03 


434523 


AA703709 


Hs.23410 


(locuslink)NM_016539:Homo sapiens sirtui


3.03 


440088 




Hs.183232


NM_024839:Homo sapiens hypothetical prot 


3.02 
414907 X90725

407103 M424881 

434203 BE262677 

422283 AW411307 

412133 U83460 



436561 BE560135 Hs.5232 

426031 AA295251 Hs.166066 

417678 X06560 Hs.82396 

417386 AL037228 Hs.301957 

429983 W92620 Hs.260855 

417526 AA568906 Hs.82240 

414732 AW410976 Hs.77152 

409614 BE297412 Hs.55189 

439053 BE244588 Hs.6456 

411096 U80034 Hs.68583 

433312 AI241331 Hs.131765 

420767 AF072711 Hs.99918 

429523 AK000788 Hs.205280 

423242 AL039402 Hs.125783 

420552 AK000492 Hs.98806 

413380 AI904232 Hs.75323 

421533 N71826 Hs.105465 

439352 BE614347 Hs.169615 

428023 AL038843 Hs.374530 

431193 AW749505 Hs.296770 

457211 AW972565 Hs.32399 

410467 AF102546 Hs.63931 

422066 AW249275 Hs.343521 

418526 BE019020 Hs.85838 

453012 T95804 Hs.31334 



413813 M96956 

418362 AL031714 

431350 AI192528 

417911 AA333387 

413597 AW302885 



445462 AA378776 

418245 AA088767

408194 M601038 

421959 AW751497 



Hs.164537 
Hs.82916 
Hs.117183 
Hs.727 



414416
434094
409012 



Hs.76084 
Hs.238205 
Hs.49725 
Hs.44241 



AI583187 Hs.9700 

H18835 Hs.31608 

BE176078 Hs.30819 

D87682 Hs.150275

NM_006420 Hs.118249

AA161071 Hs.71465 

U36787 Hs.211571 

NM_001034 Hs.75319 

AW163390 Hs.278554 
Hs.184693 

AB023420 Hs.90093 

H72187 Hs.356668 

AA622037 Hs.166468 

BE396290 Hs.5097 

J04088 Hs.156346 



NM_000998:Homo sapiens ribosomal protein
Hs.256301:hypothetical protein MGC13170
NM_018509:Homo sapiens hypothetical prot
NM_003504:Homo sapiens CDC45 cell divisi
NM_001859:Homo sapiens solute carrier fa
Hs.24879:phosphatidic acid phosphatase t
NM_024616:Homo sapiens hypothetical prot
NM_052984:Homo sapiens cyclin-dependent

NM_014165:Homo sapiens HSPC125 protein (
(locuslink)NM_006697:Homo sapiens cispla 

lo i k)NM_( )2534:H i rr ^ 
NM_018144:Homo sapiens Sec61 alpha form
(locuslink)NMJ45175:Homo sapiens NSE1 (
Hs.82240:syntaxin 3A

Hs.77152:MCM7 minichromosome maintenance
NM_016489:Homo sapiens 5'-nucleotidase,
Hs.6456:chaperonin containing TCP1, subu
NM_005932:Homo sapiens mitochondrial int
Hs.131765:ESTs, Moderately similar to 13
Hs.99918:carboxyl ester lipase (bile sal
Hs.205280:Homo sapiens cDNA FLJ20781 fis



Hs.98806:hypothetical protein FLJ20485 



NM_003095:Homo sapiens small nuclear rib
NM_023080:Homo sapiens hypothetical prot
Hs.374530:Homo sapiens cDNA: FLJ23602 fi
Hs.296770:KIAA1719 protein



NM_080759:Homo sapiens dachshund homolog
Hs.343521:malate dehydrogenase 2, NAD (m
NM_004207:Homo sapiens solute carrier fa
NM_012469:Homo sapiens chromosome 20 ope
Hs.75069:serine hydroxymethyltransferase

NMl00334&Homo sapiens ubTquTn^o^ 

,.16-'- i E )Ts 
Hs.82916:chaperonin containing TCP1, sub
Hs.117183:Homo sapiens mRNA; cDNA DKFZp5
NM_002192:Homo sapiens inhibin, beta A (
,l!.l)tlM_024051:Homo sapiens hypoth 

M_020 Home iens transmembrane, r. 
Hs.191797:ESTs

NM_030622:Homo sapiens cytochrome P450, 
AW578452:RC1-CT0252-030103 023-b07 CT025 
NM_001895:Homo sapiens casein kinase 2,
Hs.59483:leucine-rich repeat-containing 
NM_016614:Homo sapiens TRAF and TNF rece 
NM_013237:Homo sapiens px19-like protein 
(locuslink)NM_001712:Homo sapiens carcin 
NM_003751:Homo sapiens eukaryotic transl
'I IC3901 H. light mi riowinge I 
NM_014481:Homo sapiens APEX nuclease (ap 
Hs.100057:serine/threonine kinase 35
NM_021238:Homo sapiens TERA protein (TER
NM_022911:Homo sapiens solute carrier fa
NM_024841:Homo sapiens hypothetical prot 
(locuslink)NM_004911:Homo sapiens protei 
Hs.90078:nucleotide-sugar transporter si 
Hs.184011:pyrophosphatase (inorganic)
t * tin /If i Ml n sapi i hypoll 
Hs.238205:hypothetical protein PRO2013 
Hs.49725:DKFZP434I216 protein
Hs.44241:Homo sapiens cDNA: FLJ21447 fis
Hs.169600:KIAA0826 protein 



Hs.171917:hypothetical protein FU1106S 



NM_001238:Homo sapiens cyclin E1 (CCNE1)

3CU3link)NM_017636:llom f r n 
Hs.30819:hypothetical protein C40
Hs.150275:KIAA0241 protein
Hs.118249:ADP-ribosylation factor guanin
Hs.71465:squalene epoxidase 
NM_005333:Homo sapiens holocytochrome c 
r-M ) 10 4' om ip it i tide r 
NM_007276:Homo sapiens chromobox homolog 
NM_005648:Homo sapiens transcription elo
Hs.90093:heat shock 70kD protein 4
(locuslink)NM_005274:Homo sapiens guanin
NM_004708:Homo sapiens programmed cell d
Hs.5097:synaptogyrin 2
NM_001067:Homo sapiens topoisomerase (DN 
432754
437016 
446228 



427430 
417634 
435099 
425811 
446849 



428810 
427505 
418443 
447343 
409262 



424270 
414396 
426120 



440334 
449057 
432680 
446421 



435777 
424441 
407833 
415083 
421462 
443572 
443180 
413753 



410197 
413095 
417677 
425263 
437430 



U31814 Hs.3352 

BE561850 Hs.80506 

AW182459 Hs.125759 

R15486 Hs.285218 

AI420156 Hs.326733 

BE297802 Hs.69360 

BE241691 Hs.3100 

AU076916 Hs.5398 

NM_016046 Hs.14415 

AF281133 Hs.343589 

AF112213 Hs.184062 



NM_014586 Hs.109437 

AL117554 Hs.119908 

M971014 Hs.75432 

NM_001949 Hs.1189

Z95152 Hs.178695

W27202 Hs.82327 

AC004770 Hs.4756 

AL039104 Hs.159557

AU076617 Hs.16251

R18978 Hs.75105 

N99692 Hs.75227 

AW607831 Hs.38738 

AW290997 Hs.190153 

NM_016034 Hs.20776 

AF068236 Hs.193788 

AA361562 Hs.178761

NM_005239 Hs.85146

AA256641 Hs.236894 

AK000631 Hs.52256 

BE560621 Hs.9222 

AW249181 Hs.154796 

AI733848 Hs.71935 

AW963979 Hs.24723 

AW969587 Hs.86366 

BE612992 Hs.27931 

H53323 Hs.25717 

BE618609 Hs.279591 

AF141332 Hs.200333 

AA336878 Hs.9842 

AW973003 Hs.179909

AW797437 Hs.69771 

AK001818 Hs.144407 

BE548266 Hs.76057 

AA325243 Hs.166887 

BE614599 Hs.356501 

AW504924 Hs.9805 

R12234 Hs.13396 

AF155108 Hs.256150 

BE276112 Hs.7165 

AB037784 Hs.22941 

T47364 Hs.278613 

BE297434 Hs.15071 

BE270447 Hs.356512 

U30930 Hs.158540 

AI916662 Hs.211577 

AW419202 Hs.286192 

X14850 Hs.147097 

AW955632 Hs.66666 

AI632683 Hs.27179 

AF016495 Hs.104624 

AA025610 Hs.9605 

R15875 Hs.258576 

U17760 Hs.75517 

AB005532 Hs.31442

U34038 Hs.154299 

AW301003 Hs.51483 

NM_005518 Hs.59889 

AA494359 Hs.30715 

NM_016055 Hs.82389 

NM_001197 Hs.156419 

W44671 Hs.124 

M26301 Hs.2253 



NM_001527:Homo sapiens histone deacetyla
NM_003090:Homo sapiens small nuclear rib
Hs.125759:likely ortholog of mouse RING
I <i r S If" 02121 H fr l 

NM_052858:Homo sapiens similar to RIKEN 
NM_006845:Homo sapiens kinesir lil > (,, 
Hs.3100:lysyl-tRNA synthetase
Hs.5398:guanine monphosphate synthetase 
NM_016046:Homo sapiens exosomal core pro 
Hs.343589:exosome component Rrp41 
Hs.184062:chromosome 20 open reading fra
NM_052815:Homo sapiens immediate early r
NM_013338:Homo sapiens Alg5, S. cerevisi

NM_014799:Homo sapiens hephaestin (HEPH)
NM_006713:Homo sapiens activated RNA pol
NM_014586:Homo sapiens hormonally upregu
NM_015934:Homo sapiens nucleolar protein
NM_000884:Homo sapiens IMP (inosine mono
NM_001949:Homo sapiens E2F transcription
NM_002754:Homo sapiens mitogen-activated
NM_000178:Homo sapiens glutathione synth
NM_004111:Homo sapiens flap structure-sp
NM_002266:Homo sapiens karyopherin alpha
(locuslink)NM_016207:Homo sapiens cleava
NM_006579:Homo sapiens emopamil binding
NM_005002:Homo sapiens NADH dehydrogenas
NM_014343:Homo sapiens claudin 15 (CLDN1
Hs.190153:Homo sapiens cDNA FLJ33988 fis
li uslink) M U 1f mo mil h 
NM_000625:Homo sapiens nitric oxide synt 
Hs.178761:26S proteasome-associated pad1
Hs.85146:v-ets erythroblastosis virus E2
Hs.236894:ESTs, Highly similar to S02392
Hs.52256:hypothetical protein FLJ20624
Hs.9222:estrogen receptor binding site a
Hs.154796:Homo sapiens cDNA FLJ37976 fis
NM_021220:Homo sapiens zinc finger prote
Hs.24723:ESTs



Hs.27931:hypothetical protein FLJ10607 s
Hs.25717:Homo sapiens cDNA: FLJ23454 fis
Hs.279591:Homo sapiens, Similar to RNA p
NM_018690:Homo sapiens apolipoprotein D4 
Hs.9842:ESTs 

(locuslink)NM_024831:Homo sapiens nuclea
NM_001710:Homo sapiens B-factor, properd
NM_018283:Homo sapiens hypothetical prot
(locuslink)NM_000403:Homo sapiens galact
Hs.166887:copine I

(locuslink)NM_032335:Homo sapiens hypoth 
Hs.9805:exportin 5 

Hs.13396:Homo sapiens clone 25028 mRNA s 
Hs.256150:NY-REN-41 antigen 
NM_003904:Homo sapiens zinc finger prote
Hs.22941:KIAA1363 protein
(locuslink)NM_005532:Homo sapiens interf
Hs.15071:chaperonin containing TCP1, sub
Hs.356512:ESTs, Weakly similar to UBCA_A
(locuslink)NM_003360:Homo sapiens UDP-gl
( u nk)NM_ 14 86:Homo r is kin t 
NM_032192:Homo sapiens protein phosphata 
Hs.147097:H2A histone family, member X
Hs.66666:chromosome 7 open reading frame
Hs.27179:Homo sapiens cDNA FLJ12933 fis,
NM_020980:Homo sapiens aquaporin 9 (AQP9
Hs.9605:cleavage and polyadenylation spe
NM_012129:Homo sapiens claudin 12 (CLDN1
NM_000228:Homo sapiens laminin, beta 3 (
NM_004260:Homo sapiens RecQ protein-like
NM_005242:Homo sapiens coagulation facto
Hs.51483:Homo sapiens, Similar to RIKEN

locuslmk)NM_00551 mo sap hyd 
Hs.30715:potassium voltage-gated channel
f ' > 1 J .Ho 1 1 r mitochondrial rib 

« 1,001197:1 m sap 3 BCL2 nteraciing 
NM_014628:Homo sapiens gene predicted fr



400750 

428788 AF082283 Hs.193516 

432633 AI796390 Hs.210667 

432816 N38913 Hs.221575 



NM_003921:Homo sapiens B-cell CLL/lympho

Hs.210667:ESTs

Hs.221575:ESTs 

Hs.58189:eukaryotic translation initiati
NM_024026:Homo sapiens mitochondrial rib 
Hs.240770:nuclear cap binding protein su 
Hs.349092:ESTs, Weakly similar to A42442
Hs.2795:lactate dehydrogenase A 
Hs.104830:ESTs 

NM_012080:Homo sapiens DNAsei 
NM_004208:Homo sapiens prograrr 
Hs.110695:hypothetical protein MGC3133
NM 002011; 






436414 BE264633 Hs.143638 

436291 BE568452 Hs.344037 

427963 AI042582 Hs.181271

426459 AF151812 Hs.169992 

413880 AI660842 Hs.110915 

442468 N77737 Hs.8349 

413476 U25849 Hs.75393 

413278 BE563085 Hs.833 

430120 AW675298 Hs.233694 

452875 BE275760 Hs.30928 

417164 AA338283 Hs.81361 

407811 AW190902 Hs.40098 

409636 AA305729 Hs.18272 

447619 AI174800 Hs.19054 

434845 BE267057 Hs.325321 

414862 BE621310 Hs.923 

443639 BE269042 Hs.9661 

414045 NM_002951 Hs.75722 

430512 AF182294 Hs.241578 

432636 AA340864 Hs.278562 

414697 BE266134 Hs.76927 

420665 AW469240 Hs.371581 

428474 AB023182 Hs.184523 

448093 AW977382 Hs.15898 

443343 BE409809 Hs.301005 

418313 BE244231 Hs.84038 

424154 AF026004 Hs.141660 

456950 AF111170 Hs.306165 

432543 AA552690 Hs.152423
423271 



410094 
403817 
459125 
432705 



414767 



432320 AW411066 Hs.274351 

430450 R23553 Hs.241489 

433808 NM_014062 Hs.3566

431890 X17033 Hs.271986 

433369 Z49254 Hs.3254 

446946 AI878932 Hs.317 

432204 AI916132 Hs.121593 

424438 AA340724 Hs.271912 

433862 D86960 Hs.3610 

417080 BE392846 Hs.1063 

428242 H55709 Hs.2250 

416188 BE157260 Hs.79070 

436014 AF281134 Hs.283741 



NM_080748:Homo sapiens chromosome 20 ope 
NM_002157:Homo sapiens heat shock 10kD p 
Hs.279521:hypothetical protein FLJ20530
Hs.182018:interleukin-1 receptor-associa
Hs.374811:Homo sapiens, similar to 4-1BB
NM_023930:Homo sapiens hypothetical prot 
Hs.255052:ESTs 

(locuslink)NM_004494:Homo sapiens hepato



AW629223 Hs.64794 

AF146761 Hs.20450 

AA434109 Hs.12271 

NM_015925 Hs.95697

BE147897 Hs.58593 

AA811363 Hs.29464 

AI879473 Hs.157123

AI440137 Hs.164989

AA489023 Hs.99807

BE159028 Hs.279704

NM_001955 Hs.2271 

BE541381 Hs.178705 



NM_033661:Homo sapiens WD repeat domain
(locuslink)NM_003981:Homo sapiens protei
NM_016057:Homo sapiens CGI-120 protein (
NM_015966:Homo sapiens serologically def
NM_021258:Homo sapiens interleukin 22 re
NM_138933:Homo sapiens apobec-1 compleme
NM_004300:Homo sapiens acid phosphatase
H J l nteri n slimy fed irotei V,
(locuslink)NM_018396:Homo sapiens putati
NM_006114:Homo sapiens translocase of ou
Hs.81361:heterogeneous nuclear ribonucle
Hs.40098:cysteine knot superfamily 1, BM
(locuslink)NM_030674:Homo n
(locuslink)NM_018530:Homo sapiens hypoth
Hs.325321:WD repeat domain 18
Hs.923:single-stranded DNA binding prote
Hs.9661:proteasome (prosome, macropain)
NM_002951:Homo sapiens ribophorin II (RP
NM_016200:Homo sapiens U6 snRNA-associat 
NM_001307:Homo sapiens claudin 7 (CLDN7) 
Hs.76927:translocase of outer mitochondr 
Hs.371581:ESTs 

Hs.184523:serine/threonine kinase 38 lik
Hs.15898:2,4-dienoyl CoA reductase 2, pe
Hs.301005:histone H2A.F/Z variant
NM_015937:Homo sapiens CGI-05 protein (L
NM_004366:Homo sapiens chloride channel
Hs.306165:ESTs, Highly similar to unknow
Hs.152423:Homo sapiens cDNA: FLJ21274 fi
NM_000576:Homo sapiens interleukin 1, be
NM_006978:Homo sapiens zinc finger prote
NM_020125:Homo sapiens B lymphocyte acti 
NM_012162:Homo sapiens F-box and leucine 
I live D I! b Lr Zip! c 

NM_004128:Homo sapiens general transcrip 

Hs.29464:Homo sapiens cDNA: FLJ23460 fis



Hs.99807:Homo sapiens mRNA; cDNA DKFZp31 
Hs.279704:chromatin accessibility comple
NM_001955:Homo sapiens endothelin 1 (EDN
NM_033515:Homo sapiens Mac GAP protein (M 
Is '2 peptidyipi A neraseA(cy 
NM_016032:Homo sapiens zinc finger, DHHC
(lo .-hi itJM_015913;Homo sapiens hypoth 
Hs.3566:ART-4 protein 
NM_002203:Homo sapiens integrin, alpha 2
NM_021134:Homo sapiens mitochondrial rib
NM_003286:Homo sapiens topoisomerase (DN
Hs.121593:Homo sapiens cDNA FLJ13123 fis
Hs.271912:Homo sapiens cDNA FLJ38690 fis
NM_014873:Homo sapiens KIAA0205 gene pro 
Hs.1063:small nuclear ribonucleoprotein 
Hs 'ho pukemia inhibitory factor(chol 
NM_002467:Homo sapiens v-myc myelocytoma
NM_020158:Homo sapiens exosome component 
419489 AW411280 Hs.90693

407971 AI469117 Hs.62918 

432403 AA550815 Hs.124840 

410775 AB014460 Hs.66196 



433662 W07162 H: 

426235 A1631964 H: 

413186 AU077141 H: 

419713 AW968058 H: 

410174 AA306007 H 

430720 U85768 H: 

429345 R11141 H: 

452767 AW014195 H: 

414561 AI064813 H: 

423198 M81933 H: 

421038 AL080192 H: 

444706 AK000398 H 

423908 AJ006422 H 

433412 AV653729 H 

438485 W57578 H 

452461 N78223 H 

407699 AA825974 H 

412258 AA376768 H 

450256 AA286887 H 

443905 AI215948 H 

413274 NM_004893 H 

408885 C02741 H 

424685 W21223 H 

424692 AA429834 H 

413762 AW411479 H 

418054 NM_002318 H

450164 AI239923 H 

412115 AK001763 H 

450897 W16741 H 

447349 AI375546 

445413 M151342 H 

448826 AI580252 H 

406671 M129547 H 

433604 NM_013442 H

441020 W79283 H 

458933 AI638429 H 



439656 AW138241 Hs.160502 

425236 AW067800 Hs.155223 

420085 AI741909 Hs.44680 

448296 BE622756 Hs.10949 

430200 BE613337 Hs.234896 

424308 AW975531 Hs.154443 

423453 AW450737 Hs.128791 

421344 AW631030 Hs.103655 

446607 AI691065 Hs.155780 

418558 AW082266 Hs.86131 

443835 AF016371 Hs.9880 

413794 AF234532 Hs.61638 

451481 AA300228 Hs.295866 

458820 BE552151 Hs.108118 

425905 AB032959 Hs.318584 

408089 H59799 Hs.42644 

431201 AA678405 Hs.8854 

437897 M770561 Hs.146170 

441703 AW390054 Hs.192843

433916 AW732839 Hs.3631

422516 BE258862 Hs.117950

416084 L16991 Hs.79006

427464 BE262956 Hs.178292

453876 AW021748 Hs.110406

424373 AJ133798 Hs.146219 

411619 AI418609 Hs.71040 

413004 T35901 Hs.75117 

420062 AW411096 Hs.94785 

446077 BE251048 Hs.22579 

446269 AW263155 Hs.14559 

428728 NM_016625 Hs.191381



NM_013400:Homo sapiens replication initi
Hs.62918:CDC91 cell division cycle 91-li
(locuslink)NM_138456:Homo sapiens hypoth
NM_002528:Homo sapiens nth endonuclease
NM_018683:Homo sapiens zinc finger prote
NM_002707:Homo sapiens protein phosphata
Hs.350108:ribosomal protein, large, P0
NM_015421:Homo sapiens DKFZP564K2062 pro
NM_020387:Homo sapiens RAB25, member RAS
Hs.34447:Homo sapiens cDNA FLJ38512 fis,
Hs.374548:solute carrier family 16 (mono
NM_019094:Homo sapiens nudix (nucleoside
Hs.59461:DKFZP434C245 protein
NM_002991:Homo sapiens small inducible c
Hs.199695:hypothetical protein MAC30
Hs.61472:Homo sapiens, clone IMAGE:51341
Hs.195155:solute carrier family 38, memb
Hs.1634:cell division cycle 25A
Hs.101282:Homo sapiens mRNA; cDNA DKFZp4 
(locuslink)NM_017798:Homo sapiens chromo 
NM_006869:Homo sapiens centaurin, alpha 
NM_021199:Homo sapiens sulfide dehydroge
Hs.378718:Homo sapiens cDNA FLJ33433 fis
Hs.108106:ubiquitin-like, containing PHD
NM_024622:Homo sapiens hypothetical prot
(locuslink)NM_025151:Homo sapiens Rab co
Hs.24724:Homo sapiens cDNA FLJ39185 fis,
Hs.143969:ESTs 

NM_004893:Homo sapiens H2A histone famil
NM_017948:Homo sapiens hypothetical prot
Hs.151734:nuclear transport factor 2
NM_014679:Homo sapiens KIAA0092 gene pro
NM_002014:Homo sapiens FK506 binding pro
NM_002318:Homo sapiens lysyl oxidase-lik
NM_080759:Homo sapiens dachshund homolog
Hs.73239:hypothetical protein FLJ10901
NM_014017:Homo sapiens HSPC003 protein (
BE743847:601577765F1 NIH_MGC_9 Homo sapi
(locuslink)NM_016077:Homo sapiens CGI-14
Ho p r U PL ( f 

Hs.3439:stomatin (EPB72)-like 2 
Hs.35962:Homo sapiens mRNA; cDNA DKFZp68 
NM_002882:Homo sapiens RAN binding prote
Hs.236204:nuclearcore complex protein 
il protein BC007772 
2:Homo sapiens cDNA FU36008fis 
NM_003714:Homo sapiens stanniocalcin 2 (
Hs.44680:hypothetical protein FLJ20979
Hs.10949:Homo sapiens cDNA FLJ14162 fis,



Hs.154443:MCM4 mi 



NM_015939:Homo sapiens CGI-09 protein (L
(locuslink)NM_015873:Homo sapiens villin
Hs.155780:ESTs

Hs.86131:Fas (TNFRSF5)-associated via de 
NM_006347:Homo sapiens peptidyl prolyl i 
NM_012334:Homo sapiens myosin X (MYO10), 
(locuslink)NM_030974:Homo sapiens hypoth
Hs.108118:hypothetical protein FLJ22474
NM_032173:Homo sapiens hypothetical prot
Hs.42644:thioredoxin-like 2
Hs.8854:Pvt1 oncogene homolog, MYC activ
Hs.146170:hypothetical protein FLJ22969
NM_022145:Homo sapiens leucine zipper pr
NM_001551:Homo sapiens immunoglobulin (C



Hs,178292:proteinO-i 
Hs.110406:ESTs 
NM_014427:Homo sapiens copine VII (CPNE7
NM_017816:Homo sapiens hypothetical prot 
H ' 1 i erieukine hancerbi dingfa 
(locusl:nk)NM_021 I mo ]p V E 
Hs.22579-.Hom, sapi n lone CDABP0036 mF 
NM_018131:Homo sapiens hypothetical prot 
H lihypotheti Ip iteinLOC I l" 

ni iih u t i 

NM_006746:Homo sapiens sex comb on midle 
Hs.260523:neuroblastoma RAS viral (v-ras
Hs.211594:proteasome (prosome, macropain
Hs.107213:formin binding protein 3
NM_012083:Homo sapiens frequently rearra 






425274 
433159 
406673 



408113 
457313 
413142 
414998 



427477 
421839 
400448 
418416 
445304 
417601 
444700 
431021 



432435 
433020 
436106 
431127 



421743 
428072 
417957 
409119 
447200 
409214 
414883 



BE281191 Hs.155462 

AB035898 Hs.150587 

M34996 Hs.198253 

AB020643 Hs.183006 

R94038 Hs.374664 

AI393122 Hs.134726 

T82427 Hs.194101 

AF047002 Hs.241520 

M81740 Hs.75212 

NM_002543 Hs.77729 

AI732374 Hs.339827 

BE298665 Hs.14846 

AW438602 Hs.191179 



U11700 Hs.84999 

BE613206 Hs.279607 

NM_014735 Hs.82292 

NM_003645 Hs.11729 

AI869664 Hs.351863 

AF077036 Hs.31989 

AW052006 Hs.374973 



AW194426 Hs.20726 

AW368634 Hs.154331

AK001724 Hs.102950 

BE017985 Hs.102558 

AI816736 Hs.14896 

BE218886 Hs.282070 

AI375726 Hs.227152 

AI050715 Hs.2331 

U66618 Hs.250581 

AW963118 Hs.161784 

AI015435 Hs.104637

AU077228 Hs.77256 

N34731 Hs.74562 

AW247090 Hs.57101 



Hs.155462:MCM6 minichromosome maintenanc
NM_020242:Homo sapiens kinesin-like 7 (K
Hs.198253:major histocompatibility compl
Hs.183006:likely homolog of mouse hepari
NM_005538:Homo sapiens inhibin, beta C (
{ ' ( j ir;.k)NM t-i o Horn sa c r r.pon
Hs.194101:Homo sapiens cDNA: FLJ20369 fi
NM_005782:Homo sapiens transcriptional c 
(locuclir t_002539;Homosa en nith 
NM_002543:Homo sapiens oxidised low dens 
Hs.339827:ESTs, Weakly similar to protea
Hs.14846:Homo sapiens mRNA; cDNA DKFZp56
Hs.191179:ESTs 

Hs.6682:solute carrier family 7, (cation 



NM_000053:Homo sapiens ATPase, Cu++ tran 
Hs.279607:Homo sapiens cDNA FLJ34399 fis
NM_014735:Homo sapiens KIAA0215 gene pro
NM_003645:Homo sapiens fatty-acid-Coenzy
, i li Hi M 003312 ii i aplen thio j 
NM_015449:Homo sapiens NICE-3 protein (N
NM_004697:Homo sapiens PRP4 pre-mRNA pro
Hs.118638:non-metastatic cells 1, protei
NM_005717:Homo sapiens actin related pro
Hs.20726:ESTs, Moderately similar to hyp
Hs.154331:ESTs

NM_016128:Homo sapiens coat protein gamm
Hs.102558:Homo sapiens cDNA FLJ40369 fis
Hs.14896:zinc finger, DHHC domain contai 



BE258602 



417129 
410397
419420



442025 
437379
409703 
419170 
418216 
451926 
413781 
407236 
421405 



AW134519 Hs.96125 

J05272 Hs.850 

W79485 Hs.173980 

AA251944 Hs.104058

NM_012111 Hs.204041

AW857376 Hs.159238 

NM_000429 Hs.323715 

NM_007350 Hs.82101

BE274530 Hs.273333 

AF083208 Hs.15178 



182366 

H53497 Hs.83006 

AA531133 Hs.4253 

BE543146 Hs.281434 

AW405967 Hs.333388 

AA926960 Hs.348669 

AI580053 Hs.109007 

AW963372 Hs.222088 

AW873704 Hs.320831

AF118395 Hs.279865

BE409261 Hs.12084

BE269243 Hs.182625 

AI694131 Hs.29002 

NM_014448 Hs.87435

BE048061 Hs.37054 

AI808780 Hs.227730 



AW006352 Hs.159643

AW631437 Hs.5184 

AI381800 Hs.300684 

AF217517 Hs.63042 

AA355435 Hs.30724 

AA032279 Hs.61635 

412599 AU076782 Hs.248267 

436199 R38945 Hs.127951 

X74794 Hs.154443 

AW887434 Hs.11810
Hs.23765

NM_006187 Hs.56009 

BE002798 Hs.287850 



NM_016391:Homo sapiens hypothetical prot 
Hs.2331:E2F transcription factor 5, p130 
H 2505 fl/SNF related, mat i 
Hs.161784:ESTs 

Hs.104637:solute carrier family 1 (gluta
NM_004456:Homo sapiens enhancer of zeste
NM_078480:Homo sapiens fuse-binding prot
Hs.57101:MCM2 minichromosome maintenance
Hs.107614:DKFZP564I1171 protein
NM_016292:Homo sapiens heat shock protei
NM_016071:Homo sapiens mitochondrial rib
Hs.4253:hypothetical protein MGC2574
Hs.281434:Homo sapiens cDNA FLJ31373 fis
Hs.333388:similar to CG3714 gene product
Hs.348669:CDC28 protein kinase 1 
Hs.109007:Homo sapiens, Similar to LOC16 
NM_014109:Homo sapiens PRO2000 protein ( 
Hs.320831:chromosome 20 open reading fra 
NM_014317:Homo sapiens trans-prenyltrans
Hs.12084:Tu translation elongation facto
Hs.132625:VAMF 3 mt 

Hs.29002:KIAA1706 protein 
Hs.87435:Rho guanine exchange factor (GE 
Hs.37054:ephrin-A3 

NM_000210:Homo sapiens integrin, alpha 5 

Hs.159643:ESTs, Weakly similar to putati 
(locuslink)NM_016397:Homo sapiens TH1-l
Hs.300684:calcitonin gene-related peptid
NM_018457:Homo sapiens DKFZp564J157 prot
(locuslink)NM_001516:Homo sapiens genera
Hs.61635:six transmembrane epithelial an
(li r ii )NM_ '1 Homo , ensm r 
Hs.127951:Homo sapiens cDNA FLJ14503 fis
Hs.154443:MCM4 minichromosome maintenanc
NM_032026:Homo sapiens CDA11 protein (CD
Hs.23765:membrane metallo-endopeptidase-
M ii 2 b olraoadcnylatc synthetase
NM_002219:Homo sapiens integral membrane
Hs.283099:AF15q14 protein
(locuslink)NM_025151:Homo sapiens Rab co
( i i Ififi ) n o ei f'F(
Hs.173980:nuclear matrix protein NMP200
NM_015957:Homo sapiens CGI-29 protein (L
NM_012111:Homo sapiens chromosome 14 ope
NM_000149:Homo sapiens fucosyltransferas 
tou r'lNM_000429 Homo sapiens melhio 
(locuslink)NM_004875:Homo sapiens RNA po










NM_018256:Homo sapiens WD repeat domain 
















































































NM_032486:Homo sapiens dynactin 4 (MGC32










NM_024829:Homo sapiens hypothetical prot








Hs.30738 


NM_018087:Homo sapiens hypothetical prot 






AI733735 




NM_033266:Homo sapiens ER to nucleus sig 






AA311919 




NM_018983:Homo sapiens nucleolar protein 


2.60 






Hs.99736 


Hs.99736:hypothetical protein MGC39350 






AK000226 




NM_031265:Homo sapiens mucin and cadheri 








Hs.236522 


Hs.236522;DKFZP434P106 protein 






R52557 




NM_033416:Homo sapiens similar to HYPOTH 






AW997938 




NM_003786:Homo sapiens ATP-binding casse 






AW293134 


Hs.32597 


NM_005977:Homo sapiens ring finger prote






AW968360




Hs 1 4355:Homo sapiens cDNA FLJ13207 fis, 


2.60 






Hs.100293 


Hs.100293:O-linked N-acetylglucosaminet, 










NM_000077:Homo sapiens cyclin-dependent 








Hs.170673 


NM_138969:Homo sapiens retinal short cha










NM_021178:Homo sapiens enhancer of invas 










Hs.212296:ESTs 










\\ i k)NN 30877:Honn t i s i^i 










(locusfnk)NM_006644:Homo sapiens heats 








Hs, 265398 


Hs.265398:ESTs, Moderately similar to hy 




















Hs.68877:cytochrome b-245, alpha polypep




426858 


NM_004182 




NM_004182:Homo sapiens ubiquitously-expr










NM_032351:Homo sapiens mitochondrial rib










NM_015982:Homo sapiens germ cell specifi 










NM_006109:Homo sapiens SKB1 homolog (S. 








Hs.171352 


Hs.171952:occludin 










NM_006191:Homo sapiens proliferation-ass






BE562826 




BE562826:601336534F1 NIH MGC 44 Homo sap 




410570 


AI133096 


Hs.64593 


NM_006356:Homo sapiens ATP synthase, H+


2'sa 


430594 


AK000790 


Hs.246885 


NM_017958:Homo sapiens hypothetical prot 


2.58 






Hs.378965 


Hs.378965:Homo sapiens cDNA FLJ37658 fis










NM_033340:Homo sapiens caspase 7, apopto








Hs.159322 


( ianlml)NMJ)12243:Homo sapiens solute 










Hs.67776:Homo sapiens, clone IMAGE:54556








Hs.63788 


Hs.63788:propionyl Coenzyme A carboxylas










Hs.367942:Homo sapiens, clone IMAGE:4701 






AI244459 




Hs.110826:trinucleotide repeat containin






AU077101 


Hs.79222 


Hs.79222:galactosidase, beta 1 


2.58 








Hs.35844:neurotrophic tyrosine kinase, r










NM_012394:Homo sapiens prefoldin 2 (PFDN 








Hs.74335 


Hs.74335:heat shock 90kD protein 1, beta 






M1 57634 




r 1 i fi ' jt jrrier tarn ly 25 (n ic 






U50079 


Hs.88556


NM_004964:Homo sapiens histone deacetyla 








Hs.18791 


NMJIf 7899:Homo sapiens hypothetical prot 






AW043637 


Hs.21766 


Hs.21766:ESTs, Weakly similar to hypothe




447418 


AA063074 


Hs.18552 


Hs.18552:E2IG2 protein 


2.57 


424142 


AI678727 


Hs.378970 


Hs.378970:Homo sapiens cDNA FLJ35102 fis


2.57 




AI739168 


Hs.349283 


Hs.349283:Homo sapiens cDNA FLJ31753 fis


2.57 




AL121523 


Hs.97774 








AW451034 


Hs.326525 


NM_001669:Homo sapiens arylsulfatase D ( 






W16480 


Hs.24283 


Hs.24283:Homo sapiens cDNA FLJ25952 fis,






AI859390 


Hs.288940 


NM_021259:Homo sapiens transmembrane pro 






BE267756 


Hs.180312 


NM_016065:Homo sapiens mitochondrial rib






AI277924 


Hs.145199 


Hs.145199:ESTs, Weakly similar to hypoth






NM_002153 Hs.155109


NM_002153:Homo sapiens hydroxysteroid (1






NM_014742


Hs.79305


Hs.79305:KIAA0255 gene product 






AI376722 


Hs.180062


Hs.180062:proteasome (prosome, macropain






□29643 
AW368226 


Hs.34789 
Hs.57928 


NM_005216:Homo sapiens dolichyl-diphosph
Hs.67928:ESTs 


2.56 


431910 


AK000142 


Hs.101774 


Hs.101774:hypothetical protein FLJ23045


2.56 


413010 


M393273 


Hs.75133 


NM_003201:Homo sapiens transcription fac 


2.56 


452264 


AU077013 


Hs.28757 


Hs.28757:transmembrane 9 superfamily mem 


2.56 


419423 


D26488 


Hs.90315 


Hs.90315:KIAA0037 protein 


2.56 


425221 


AV649864 


Hs.155188 


NM_005642:Homo sapiens TAR RNA polymera 


2.56 


437623 


D63880 


Hs.5719 


NM_014355:Homo sapiens chromosome conden


2.56 


444184 


T87841 


Hs.282990 


(locuslink)NM_033550:Homo sapiens chromo 


2.56 


418650 


BE386750 


Hs.86978 


Hs.86978:prolylendopeptidase 


2.56 


425358 


AB014595 


Hs.155976 


to I tiM_ 3588:11011 si in 


2.56 


420614 


AL110291 


Hs.99354 


Hs.99364:abhydrolase domain containing 1 


2.56 




AI494291 


Hs.369171 


Hs.369171:ESTs


2.56 


418862 


BE550964 


Hs.89399 


NM_005176:Homo sapiens ATP synthase, H+


2.56 


416432 


BE391767 


Hs.79322 


(locuslink)NM_005051:Homo sapiens glutam


2.55 
458814 AI498957 Hs.351937

448153 Y10805 Hs.20521 

454003 AA058944 Hs.116602

411950 T28407 Hs.81564 

457400 AF032906 Hs.252549 

426410 BE298446 Hs.305890 
402829 

425843 BE313280 Hs.159627 
400995 

452945 AW978187 Hs.79103 

417144 AA382104 Hs.81337 

422192 AA305159 Hs.113019

424755 AB033094 Hs.152925 

410012 AW015832 Hs.57898 

431236 AV656840 Hs.285115 

449042 AW294985 Hs.30715 

420281 AI623693 Hs.323494 

418681 AA287786 Hs.23449 

406629 AW277078 Hs.181165 

421612 AF161254 Hs.106196 

446715 AI337735 Hs.173919 

431183 NM_006855 Hs.250696

407722 BE252241 Hs.38041 

427368 BE041451 Hs.177507 

426268 AF083420 Hs.168913 

409632 W74001 Hs.55279 

442875 BE623003 Hs.23625 

455031 AA335996 Hs.355907 

442432 BE093589 Hs.38173 

437741 BE561610 Hs.5809 

448775 AB025237 Hs.388 

414368 W70171 Hs.75939 

432876 AW248272 Hs.279652 

431731 BE266322 Hs.211374 

425994 AK000207 Hs.165803 

445982 BE410233 Hs.13501 

444232 W56010 Hs.347297 

435655 AW105663 Hs.6947 

417686 AA769155 Hs.235498 

417933 X02308 Hs.82962 

426812 AF105365 Hs.172613 

452313 Y00486 Hs.28914 

438317 AA826401 Hs.122393 

409299 AA045650 Hs.53125 

423599 AI805664 Hs.31731 

412525 AA581439 Hs.152328 

424291 AL120051 Hs.144700 

427581 NM_014788 Hs.179703

414987 AA524394 Hs.294022 

434274 AA628539 Hs.57783 

400282 Hs.289101 

425322 U63630 Hs.155637 

453344 BE349075 Hs.44571 

449915 NM_004529 Hs.404

417691 AU076610 Hs.82399 

439012 BE383814 Hs.6455 

434931 AW968941 Hs.166254 

411678 AI907114 Hs.71465 

442315 AA173992 Hs.7956 

447140 AF070537 Hs.17481 

422385 BE549407 Hs.115823

433517 AW022133 Hs.189838 

450230 AW016607 Hs.201582 

432866 BE395875 Hs.279609 

433001 AF217513 Hs.279905 

440773 AA352702 Hs.37747 

440587 AL138461 Hs.323084 

422813 AV656571 Hs.121068 

424259 AK001776 Hs.143954 

436075 BE090176 Hs.179902 

453204 R10799 Hs.191990 

453665 AA626250 Hs.326184 
432353 NM_016558 Hs.274411 

433271 BE621697 Hs.14317 

431770 BE221880 Hs.268555 

444019 BE173977 Hs.10098 

428839 AI767756 Hs.82302 
404826 

429669 BE185499 Hs.2471 

434474 AL042936 Hs.211571 

424482 BE25B621 Hs.149155 
450422 AA743525 Hs.50300 
440214 AA247118 Hs.7049 



Hs.351937:ribosomal protein, large P2 
NM_001536:Homo sapiens HMT1 hnRNP methyl 
Hs.116602:hypothetical protein BC009115
NM_002619:Homo sapiens platelet factor 4
NM_001336:Homo sapiens cathepsin Z (CTSZ
NM_138578:Homo sapiens BCL2-like 1 (BCL2

NM_004632:Homo sapiens death associated

NM_030579:Homo sapiens cytochrome b5 out
Hs.81337:lectin, galactoside-binding, so
NM_015931:Homo sapiens fls485 (LOC51066)
Hs.152925:KIAA1268 protein
(locuslink)NM_017819:Homo sapiens hypoth
NM_001560:Homo sapiens interleukin 13 re



(locuslink)NM_017964:Homo sapiens hypoth
Hg.23449 r ulin r cptortyi no kino 
Hs.181165:eukaryotic translation elongat



Hs.173919:ESTs, Weakly similar to net 
NM_006855:Homo sapiens KDEL (Lys-Asp-Glu 
NM_003681:Homo sapiens pyridoxal (pyrido 
Hs.177507:hypothetical protein HSPC155 
NM_003576:Homo sapiens serine/threonine 
NM_002639:Homo sapiens serine (or cystei
Hs.23625:Homo sapiens clone TCCCTA00142
Hs.355907:ESTs, Weakly similar to protei
NM_024629:Homo sapiens hypothetical prot 
NM_020470:Homo sapiens putative transmem 
NM_002452:Homo sapiens nudix (nucleoside 
NM_012474:Homo sapiens uridine monophosp
NM_015956:Homo sapiens mitochondrial rib
(locuslink)NM_145051:Homo sapiens hypoth
NM_017703:Homo sapiens hypothetical prot
(locuslink)NM_014303:Homo sapiens pescad
(locuslink)NM_013397:Homo sapiens over-s
(locuslink)NM_014159:Homo sapiens Huntin
Hs.235498:hypothetical protein FLJ14075 
NM_001071:Homo sapiens thymidylate synth 
NM I 6593 H m piens so Ir am a 
Hs.28914:adeninephosphoribosyltransfera 
Hs.122393:ESTs 

NM_004597:Homo sapiens small nuclear rib
(locuslink)NM_012094:Homo sapiens peroxi
Hs.152328:ESTs 

NM_004429:Homo sapiens ephrin-B1 (EFNB1)
NM 178 Homosapier p rtite motif 
NM_032865:Homo sapiens hypothetical prot
il c ' eukaryoti r n tioninitiati 
NM_005313:Homo sapiens glucose regulated 
NM_006904:Homo sapiens protein kinase, D
Hs.44571:ESTs 

NM_004529:Homo sapiens myeloid/lymphoid 
NM_007357:Homo sapiens component of olig 
NM_006666:Homo sapiens RuvB-like 2 (E. c
Hs.166254:likely ortholog of rat vacuole
NM_003129:Homo sapiens squalene epoxidas
Hs.7956:ESTs 

NM_138391:Homo sapiens hypothetical prot 
(locuslink)NM_006638:Homo sapiens ribonu 
Hs.189838:ESTs 
Hs.201582:ESTs 

NM_014342:Homo sapiens mitochondrial car 
NM_016359:Homo sapiens nucleolar protein 
NM_022767:Homo sapiens hypothetical prot 
(locuslink)NM_031209:Homo sapiens tRNA-g
(locuslink)NM_003270:Homo sapiens transm
(locuslink)NM_018270:Homo sapiens chromo 
NM_080546:Homo sapiens CDw92 antigen (CD 
Hs.191990:ESTs 

Hs.326184:Homo sapiens nuclear protein p 
NM_016558:Homo sapiens SCAN domain conta 
NM_018648:Homo sapiens nucleolar protein 
NM_012255:Homo sapiens 5'-3' exoribonucl 
NM_019082:Homo sapiens putative nucleola 
(locuslink)NM_147174:Homo sapiens hepara

NM_014878:Homo sapiens KIAA0020 gene pro 
<i^t< nk) M 00! 3:Ho < r troiooy 
lie slink)NM_003 74:Hom( apien olta 
NM_033414:Homo sapiens hypothetical prot
(locuslink)NM_018386:Homo sapiens hypoth
421168
435750 
452101 



435677 
406363 
452018 



410723 
435496 
425159 



AF182277 Hs.330780 

AB029012 Hs.4990 

T50298 HS.10B44 

AW963838 Hs.168830 

S78187 Hs.153752 

AA694142 Hs.6685 

AW102941 Hs.211265 

AA532963 Hs.9100 

AI878857 Hs.109706 

BE244454 Hs.79162 

AW138207 Hs.146170 

D57341 Hs.188361 

AW732918 Hs.182490 

U68105 Hs.172182 

BE297635 Hs.3069 

AI864142 Hs.29288 

NM_000312 Hs.2351

NM_002692 Hs.99185

Y08890 Hs.113503 

AA100683 Hs.372108 

AW840171 Hs.265398 

NM_004341 Hs.154868 

AF078859 Hs.86347 

H71025 Hs.21075 



Hs.168830:Homo sapiens cDNA FLJ12136 fis
rotu:lT,MMMJ043S8:Honio sapiens cePd 
Hs.6685:thyroid hormone receptor interac

Hs.211265:ESTs 

Hs.9100:hypothetical gene supported by A
NM_016185:Homo sapiens hematological and
Hs.79162:structure specific recognition
NM_022842:Homo sapiens hypothetical prot
Hs.188361:Homo sapiens cDNA FLJ12807 fis
Hs.182490:leucine-rich PPR-motif contain
NM_002568:Homo sapiens poly(A) binding p
NM_004134:Homo sapiens heat shock 70kD p
(locuslink)NM_022759:Homo sapiens endo-b
NM_000312:Homo sapiens protein C (inacti 

NM_002692:Homo sapiens polymerase (DNA d 
NM_002271:Homo sapiens karyopherin (impo
Hs.372108:ESTs 

Hs.265398:ESTs, Moderately similar to hy 
NM_004341:Homo sapiens carbamoyl-phospha 
NM_013341:Homo sapiens hypothetical prot
NM_016328:Homo sapiens GTF2I repeat doma 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 



M18728 

AF134164 BF809407 AA218557 BF842863 AI2671S8 BF876178 BG999253 AW861851 AW858362 AI8 17548 BF771300 AA113928 AA223422 
AA055556 BF773400 BF998869 BE081333 BE073424 BE142245 H59571 H59570 BF871558 BF871064 BE001132 BF826831 AW754298 
AA223267 BG997895 BG997897 AW991957 AA534354 BG319501 BF736309 AI694265 AA045564 BG950256 AI829309 BG987850 BE093175 
BF854337 

BG036675 BF772005 BF771866 BG950386 BG960381 NMJJ05712 AF110315 BE074534 BE182776 BE158000 BE157999 BE714315 AW818104 
AW847519AA099426 AW817981 AW856396 BG961122 AA224498 AA308542 AW821833 BF902155 AI732411 BG778834 BG283641 BE748279 
BE748870 BG319540 BE7488C4 BF739224 BG986155 AK057283 BI861466 AA663341 AA457591 BG949294AW392886 AA071122 AA227849 
AA584918 BG959570 BF773486 AL041698 BF959013 R87170 C16859 BF770411 BF771298 AI075321 L13823AA216700 BF771864 AW861859 
BE537068 C18935 AA155719 BF771172 BF769107 BF804954 AWB18172 AW818143 AW392930AW817057 AW858044 BF746211 AA179928 
AW861687 AW821826 BI055726 BF242643 AA207189 BF770412 BF771 1 57 BG430030 AA055592 



223120 -4 
1063443J 
264480 3 



BE743847 AW809603 BM469626 AI375546 



Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Indicates DNA strand from which exons were predicted.
Indicates nucleotide positions of predicted exons.
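The nucleotide-position entries in the rows that follow are written as comma-separated `start-end` spans (for example `33374-33675,33769-34008`). As an illustration only, the sketch below shows one way such an entry could be read into numeric exon intervals. The function names `parse_exon_ranges` and `total_exon_length` are hypothetical, and treating both endpoints as inclusive is an assumption that the table legend does not state.

```python
def parse_exon_ranges(text):
    """Parse an entry such as '33374-33675,33769-34008' into (start, end) tuples.

    Raises ValueError for spans that are not of the form 'start-end',
    which also flags OCR-damaged entries instead of guessing at them.
    """
    ranges = []
    for part in text.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in span: {part!r}")
        ranges.append((start, end))
    return ranges


def total_exon_length(ranges):
    """Sum span lengths, ASSUMING inclusive coordinates (not stated in the legend)."""
    return sum(end - start + 1 for start, end in ranges)


exons = parse_exon_ranges("33374-33675,33769-34008")
print(exons, total_exon_length(exons))
```

Rejecting malformed spans rather than repairing them keeps damaged entries visible for manual review.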



404519 8152000 Plus 



403218 
401866 
403221 



9368423 Plus 



12817-13000 
63448-63554 
64338-64517 

33374-33675,33769-34008 

109532-110225 
173043-173564 
58039-58149 
73125-73623 

66294-66438,66936-67124 
157315-157950 
145949-146227 
81750-81901 

110411-110716,111173-111640 
61858-61995 



2888-3001,3198-3532,36554117 

14-199579,19^ 

31-111020 
53526-53628,5575 
80014-80401,80593-81125' 
7711510
7768126 
8119067 
9438647 



400448 
406122 
406180 



145662-145781,147854-147984.148098-14824 



110297-111052 
177372-177674 
30940-31386 
38923-39107 

101532-101852,102006-102263 



Table 11A lists about 958 genes up-regulated in colon cancer compared to normal adult tissues, excluding non-malignant colon tissues (whole colon and colon epithelium), that are
likely to encode proteins amenable to modulation by small molecules, peptides, or antibodies. These were selected from the starting collection of 59680 probesets on the
Affymetrix/Eos-Hu03 GeneChip® array as follows: the ratio of 'average' colon to 'average' normal adult tissues was greater than or equal to 3.0; the 'average' colon level was set to
the 90th percentile value amongst colon primary cancer specimens and colon-derived liver metastases; the 'average' normal adult tissue level was set to the 70th percentile value
amongst non-malignant tissues minus the colon-derived samples; the 'average' colon value was greater than or equal to 50 units; and the predicted protein contained a structural
domain that is indicative of having an oncogenic function, of transducing an intracellular signal, or of being modulatable by small molecules, peptides, or antibodies (e.g., pkinase,
death-domain, 7tm, phosphatase, or ion transporter). In order to remove gene-specific background levels of non-specific hybridization, the [...] before the ratio was evaluated.
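The selection criteria described above can be sketched as a small filter. This is a minimal illustration under stated assumptions, not the procedure actually used: the function names, the linear-interpolation percentile, the `background` parameter (standing in for the background correction whose details are illegible in the text), and the handling of a non-positive denominator are all hypothetical.

```python
def percentile(values, pct):
    """Linear-interpolation percentile (pct in 0..100) of a non-empty list.

    The interpolation rule is an assumption; the text does not say how
    percentiles were computed.
    """
    ordered = sorted(values)
    if len(ordered) == 1:
        return ordered[0]
    rank = (pct / 100) * (len(ordered) - 1)
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


# Domain labels taken from the examples given in the text.
TARGET_DOMAINS = {"pkinase", "death-domain", "7tm", "phosphatase", "ion transporter"}


def passes_selection(tumor_levels, normal_levels, domains, background=0.0):
    """Return True if a probeset meets the thresholds quoted in the text.

    tumor_levels:  expression values for colon cancer specimens
    normal_levels: expression values for non-malignant, non-colon tissues
    domains:       predicted structural domains of the encoded protein
    background:    HYPOTHETICAL per-gene background subtracted before the ratio
    """
    tumor_avg = percentile(tumor_levels, 90)    # 'average' colon level
    normal_avg = percentile(normal_levels, 70)  # 'average' normal level
    if tumor_avg < 50:                          # at least 50 units
        return False
    if not set(domains) & TARGET_DOMAINS:       # needs a qualifying domain
        return False
    denominator = normal_avg - background
    if denominator <= 0:                        # assumption: ratio not evaluable
        return False
    return (tumor_avg - background) / denominator >= 3.0


print(passes_selection([100, 200, 300, 400, 500], [10, 20, 30, 40, 50], ["pkinase"]))
```

Using a high percentile for tumor samples and a lower one for normal tissues, as the text specifies, lets a gene qualify when it is strongly expressed in only a subset of tumors.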



TABLE 11. 



958 genes up-regulated in colon cancer compared to normal adult tissues excluding non-malignant colon tissues (whole colon and colon epithelium)



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal adult tissues



ExAccn UnigeneID



436749 AA584890 



s.5302 
Hs.220529 



441031 
421582 
407243 



421100 
409153 
452316 



421964 
407007 
423673 
447400 
450685 
427583 
418007 
406741 
422424 
423371 
452304 
422106 



422281 
413585 
422664 
417931 



M18728 
M18728 

AI660552 Hs.356183 

AF039401 Hs.194659 
M12523 

AW939691 Hs.5940 

U67167 Hs.315 

M296922 Hs.129778 

AJ243212 HS.3742B1 

X00442 Hs.75990 

AA363733 Hs.1032 

AF239666 Hs.1545 

AI110684 Hs.7645 

AI910275 Hs.350470 

AA058357 Hs.74466 

AA315993 Hs.105484 

AW083920 Hs.16098 

NM 005814 Hs.143131 

AU076801 Hs.89436 

X02544 Hs.572 

BE439580 Hs.75498 

T53925 Hs.107 

AW351839 Hs.124660 

W03754 Hs.50813 

AA298484 Hs.61265 

T69078 Hs.76177 

X73079 Hs.288579 

U22961 Hs.184411 

BE003054 Hs.1695 

AK000322 Hs.18457 



Hs.42 

M82962 Hs. 179704 

M13509 Hs.83169 

AA058357 Hs.74466 

AI186431 Hs.296638 

AU076819 Hs.1650 

AA025386 Hs.61311 

D84239 Hs.1 11732 

AF241254 Hs.178098 

M31126 Hs.352054 

BE256452 Hs.2257 

M36803 Hs.346935 

AI133452 Hs.75431 

AA315933 Hs.1 20879 

W95642 Hs.82961 



Unigene Title 

NM_006149:Homo sapiens lectin, galactosi
(locuslink)NM_004363:Homo sapiens carcin
(locuslink)NM_002483:Homo sapiens carcin
(locuslink)NM_002483:Homo sapiens carcin
Hs.356183:ESTs, Weakly similar to S3B4_H
NM_001285:Homo sapiens chloride channel,

NM_033049:Homo sapiens mucin 13, epithel 
n\ 024 7:ll rr n mu i nl i 
NM_014471:Homo sapiens serine protease i 
NM_0l 29:Honx I ■< dinm liqi 
NM_005143:Homo sapiens haptoglobin (HP),
NM_006507:Homo sapiens regenerating isle 
' 1:Homooo f I l I n 

NM_005141:Homo sapiens fibrinogen, B bet
NM_0( 5:H o pien retoil t 1 
(locuslink)NM_005890:Homo sapiens carcin 
NM_032044:Homo sapiens regenerating gene 
NM_020384:Homo sapiens claudin 2 (CLDN2) 
NM_005814:Homo sapiens glycoprotein A33
NM_004063:Homo sapiens cadherin 17, LI c 
Hs.572:orosomucoid 1 

NM_004591:Homo sapiens small inducible c 
NM_004467:Homo sapiens fibrinogen-like 1 
Hs.124660:ESTs, Moderately similar to21 
NM_017625:Homo sapiens intelectin (ITLN) 
NM_138805:Homo sapiens family with seque 

NM_001633:Homo sapiens alpha-1-microglob 
NM_002644:Homo sapiens polymeric immunog 
NM_000477:Homo sapiens albumin (ALB), mR 
NM_002426:Homo sapiens matrix metallopro 
NM_017763:Homo sapiens hypothetical prot
NM_138938:Homo sapiens pancreatitis-asso 
NM_005588:Homo sapiens meprin A, alpha ( 
NM_002421:Homo sapiens matrix metallopro
(!c ink)NM_006 l:ll , sapiens carcin 
Hs.296638:prostate differentiation facto
NMJW011 l:Homo sapiens solute carrier fa 
H .i 1 "1 A'eak imilartoSIO 30 
NM_003890:Homo sapiens IgG Fc binding pr 
NM_021804:Homo sapiens angiotensin I con
Hs.352054:pregnancy specific beta-1-glyc
NM_000638:Homc ipien; vitrone im Is. -J 
NM_000613:Homo sapiens hemopexin (HPX), 
NM_000509:Homo sapiens fibrinogen, gamma 
Hs.120879:Homo sapiens, clone MG032871 
Hs.82961:Homo sapiens, clone MGC:22588 1 
422487 AJ010901 Hs.198267

413936 AF1 13676 Hs.297681 

424687 J05070 Hs.151738 

420344 BE463721 Hs.97101 

446921 AB012113 Hs.16530 

428470 AC002301 Hs.184507 

452594 AU076405 Hs.29981 

422310 AA316622 Hs.98370 

421907 BE018556 Hs.109358 

424326 NM_014479 Hs.145296 

435538 AB011540 Hs.4930 

413881 L00190 Hs.75599 

443426 AF098158 Hs.9329 

: AA284679 Hs.25640 

226317 

: W74001 

NM_005709 Hs.132945 




411825 
451917 
410418 
418318 
414617 
447342 
452194 
414816 



414798 
415214 
411975 
422511 
408983 
431301 
428970 
424273 



U47732 Hs.84072 

A1339520 Hs.288817 

AI199268 Hs.19322 

AI694413 Hs.373599 

Y13709 Hs.77399 

AW376842 Hs.1085 

NMJ14324 Hs.128749 

C05837 Hs.145807 

BE280074 Hs.23960 

AI286323 Hs.97411 

AI445236 Hs.125124 

AI916068 Hs.144583 

AU076442 Hs.117938 

NMJM0492 Hs.663 

AA502384 Hs.1 61529 



W40460 



Is. 144 



407786 
414809 
425280 
432179 
443957 
426174 
430135 
420542 
409453 
408482 
444151 
444381 
421408 



428874 



AA056635 Hs.5366 

AW939252 Hs.192927 

U67058 Hs.154299 

M62839 Hs.1252 

M524394 Hs.294022 

BE336857 Hs.1 18787 

AA687538 Hs.38972 

AI4346SS Hs.77356 

U31519 Hs.1872 

X75208 Hs.2913 

AA521049 Hs.353013 

M547959 Hs.1 15838 

NM_000035 Hs.234234 

~"~ Hs.1321 
Hs.95612 

NMJW0676 Hs.45743 

AW972917 Hs.128749 

BE387335 Hs.283713 

AI688223 Hs.91096 

AA618335 Hs.356664 

NMJW2250 Hs.10082 

W32133 Hs.194366 



411142 NM.014256 Hs.69009 

452281 T93500 Hs.28792 

412115 AK001763 Hs.73239 

421379 Y15221 Hs.103982 

433083 AL042759 Hs.191762 

412104 AW205197 Hs.240951 

449027 AJ271216 Hs.22880 

429345 R11141 Hs.199695 

414753 AF158255 Hs.77225 

415000 AW025529 Hs.239812 

425206 NM.002153 Hs.155109 

432978 AF126743 Hs.279884 



NM_018406:Homo sapiens mucin 4, tracheob
NM_000295:Homo sapiens serine (or cystei
NM_004994:Homo sapiens matrix metallopro
NM_014373:Homo sapiens putative G protei 
NM_002988:Homo sapiens small inducible c 
Hs.184507:Homo sapiens, similar to Homol 
Hs.29981:solute carrier family 26 (sulfa 
(locuslink)NM_030622:Homo sapiens cytoch 
Hs.109358:ATPase, Class V, type 10B 
NM_014479:Homo sapiens ADAM-like, decysi
Hs.4930:low density lipoprotein receptor 
ib i Uhl 1_( Ci Hon pier rr 
(b:uslink)NM_0121 1 2:Homo sapiens chrome 
Hs.25640:claudin 3

NM_001943:Homo sapiens desmoglein 2 (DSG
NM_002639:Homo sapiens serine (or cystei
(locuslink)NM_005709:Homo sapiens PDZ-73
Hs.237658:apolipoprotein A-II
NM_002670:Homo sapiens plastin 1 (I isof
NM_001432:Homo sapiens epiregulin (EREG)
t,M 0252 7 Homo sapiens chromosome 5 open 
NM_017767:Homo sapiens solute carrier fa 
Hs.50820:hypothetical cardiac/skeletal m
NM_019894:Homo sapiens transmembrane pro 
NM_004616:Homo sapiens transmembrane 4 s 
(loci hnk)NM 'lit i ;Hc o apie, r, rot 
Hs.19322:Homo sapiens, Similar to F5IKEN 



NM_001265:Homo sapiens caudal type homeo
NM_004963:Homo sapiens guanylate cyclase
NM_014324:Homo sapiens alpha-methylacyl-

Hs.145807:hypothetical protein FLJ13593
Hs.23960:cyclin B1

Hs.97411:hypothetical protein MGC12335 
NM_004442:Homo sapiens EphB2 (EPHB2), tr
Hs.144583:Homo sapiens, clone IMAGE3462
NM_000494:Homo sapiens collagen, type XV 
NM 00492:Hon ,i ticfibro t 
Hs.161529:ESTs

NM_003979:Homo sapiens retinoic acid ind 
NM_003561:Homo sapiens phospholipase A2, 
Hs.105448:protein kinase, lysine deficie
NM_002215:Homo sapiens inter-alpha (glob 
NM_031265:Homo sapiens mucin and cadheri 
NM_000584:Homo sapiens interleukin 8 (IL
Hs.285529:G protein-coupled receptor 49 
NM_139053:Homo sapiens epidermal growth 
NM_017726:Homo sapiens protein phosphate 
(locuslink)NM_005242:Homo sapiens coagul 
NM_000042:Homo sapiens apolipoprotein H
NM_032865:Homo sapiens hypothetical prot 
Hs.118787:transforming growth factor, be 
NM_005727:Homo sapiens tetraspan 1 (TSPA
Hs.77356:transferrin receptor (p90, CD71 
NM_002591:Homo sapiens phosphoenolpyruva 
NM_004443:Homo sapiens EphB3 (EPHB3), mR 
Hs.353013:chromosome 20 open reading fra 



Hs.95612:ESTs 
NM_000676:Homo sapiens adenosine A2b rec 
(locuslink)NM_014324:Homo sapiens alpha-
NM_138455:Homo sapiens collagen triple h
NM_052816:Homo sapiens tripartite motif-
Hs.356664:hypothetical protein FLJ32334
NM_002250:Homo sapiens potassium interme 
Hs.194366:transthyretin (prealbumin, amy 

- ' INM 33158:1 lomo pien erine 
NM_014256:Homo sapiens UDP-GlcNAc:betaGa
Hs.28792:Homo sapiens cDNA FLJ11041 fis,
Hs.73239:hypothetical protein FLJ10901
NM_005409:Homo sapiens small inducible c
Hs.191762:hypothetical protein MGC20258 

(locuslink)NM_033120:Homo sapiens naked 
NM_005700:Homo sapiens dipeptidylpeptida 
Hs.199695:hypothetical protein WAC30 
NM_006437:Homo sapiens ADP-ribosyltransf
AW630041 Hs.56937

AW578849 Hs.283552 

N49813 Hs.75615 

AA284166 Hs.84113 

U20536 Hs.3280 

AL080188 Hs.137556 

A1885815 Hs.184727 

NM_000506 Hs.76530 

J03258 Hs.2062 

AK000232 Hs.124835 

NM_001898 Hs.123114 

J04088 Hs.156346 

L40904 Hs.100724 

R80137 Hs.302738 

AW997938 Hs.90786 

AW362867 Hs.302738 

BE515065 Hs.296585 

AA148164 Hs.247280 

NM_004572 Hs.25051 

AA311914 Hs.154578 

AA447849 Hs.288660 

NMJJ01639 Hs.1957 

AW138724 Hs.168974 

AI493046 Hs.146133 

AA133749 Hs.301350 

AF027208 Hs.1 12360 

AI675419 Hs.164464 

AL120168 Hs.78919 

X54942 Hs.83758 

R34008 Hs.239727 

M26301 Hs.2253 

AA446644 Hs.692 

BE048061 Hs.37054 

NM_002447 Hs.2942 

' NM_002659 Hs.179657 

I X67055 Hs.76716 

i AW409701 Hs.1 578 

' Y00272 Hs.334562 

; NMJ13390 Hs.160417 

i AA553722 Hs.194346 

: AL136877 Hs.50758 

' AW411425 Hs.180655 

I Z46023 Hs.1 18721 

AI086138 Hs.204044 

I AA029722 Hs.2173 

! BE305242 Hs.16098 

' AA628967 Hs.1 15274 

i BE267931 Hs.78996 

I M34996 Hs.1 98253 

BE279383 Hs.26557 

I NMJ12079 Hs.288627 

i AW797437 Hs.69771 

I AF039916 Hs.12330 

I AI834273 Hs.9711 

I AK001989 Hs.91165 

I NMJ14791 Hs.184339 

M86849 Hs.323733 

I AF127026 Hs.5394 

i BE264974 Hs.6566 

I M531276 Hs.59509 

I AW971178 Hs.268571 

I AK001455 Hs.5198 

I AI267592 Hs.75761 

I AJ009936 Hs.118138 

i Z29067 Hs.2236 

' NMJ03963 Hs.184194 

I AW190902 Hs.40098 

! H25530 Hs.50868 

I AF130988 Hs.58346 

I U17760 Hs.75517 



AF016495 Hs.1 04624 

NMJ14252 Hs.78457 

AI492170 Hs.77917 

AW952792 Hs.334612 

AI015709 Hs.172089 

R36762 Hs.101282 

AJ269529 Hs.301871 

NM_002497 Hs.1 53704 



NM_021978:Homo sapiens suppression of tu 
. s.2E \2 hypothetical protein BC016153 
NM_000483:Homo sapiens apolipoprotein C- 
NM_005192:Homo sapiens cyclin-dependent
NM_001226:Homo sapiens caspase 6, apopto 
NM_033100:Homo sapiens MT-protocadherin
Hs.184727:ESTs, Weakly similar to T45738 
1 M_0 r l ! n d| t, lagulatior, faclo 
h< 2062:vitan n L'(1 25 dihydroxyvitami 
NM_019062:Homo sapiens hypothetical prot
NM_001898:Homo sapiens cystatin SN (CST1
NM_001067:Homo sapiens topoisomerase (DN

NM_005037:Homo sapiens peroxisome prolif 
Hs.302738:Homo sapiens cDNA: FU21425A 
NM_003786:Homo sapiens ATP-binding casse 
Hs.302738:Homo sapiens cDNA: FLJ21425 fi 
(locusllnk)NM_006392:Homo sapiens nuclec 
He 24 7 2 8 0 chromosome 20 open reading fra 
NM_004572:Homo sapiens plakophilin 2 (PK 
Hs.154578:Homo sapiens mRNAfor FU00256 
Hs.288660:Homo sapiens cDNA: FU22182rl 
Hs.1957:amyloid P component, serum 
Hs.168974:ESTs 



Hs.30135C:FXYD domain-containing ion tra 
Hs.112360:praminin-like1 (mouse) 
Hs.164464:Homo sapiens, clone WGC:23656 
NM_021083:Homo sapiens Keil blood group 
NM_001827:Homo sapiens CDC28 protein kin 
NM_024422:Homo sapiens desmocollin 2 (DS 
Hs.2253:complement component 2 
N'1 102354 lomc api. is lui ) sj ciat J 
Hs.37054:ephrin-A3 

I 294 T.acrophr est nul ling 1 - opto 
NM_002659:Homo sapiens plasminogen activ
NM_002217:Homo sapiens pre-alpha (globul
NM_001168:Homo sapiens baculoviral IAP r
NM_001786:Homo sapiens cell division cyc



Hs.194346:Spir-2 protein 
Hs.50758:SMC4 structural maintenance of 
(locuslink)NM_004217:Hon o sapiens serine 
NM_000434:Homo sapiens eialidasc 1 (yso 
Hs.204044:ESTs 

NM_002033:Homo sapiens fucosyllransferas 
Hs.16098:olaudin 2 

lls.115274:lndian hedgehog homolog (Droso 
NM_002592:Homo sapiens proliferaling eel 
Hs.198253:major histocompatibility compl 
NM_007183:Homo sapiens plakop in 3 (PK 
NM_012079:Homo sapiens diacylglycerol 0- 
NM_001710:Homo sapiens B-factor, properd 
NM_001247:llomo sapiens ectonucleoside Ir 
NM_01 751 5:Homo sapiens novel protein (HS 
Hs.91 1 65:hypottietical protein FU11127 
NM_014791:Homo sapiens maternal embryoni 
1 1 - 523733 gap junction protein, beta 2, 
NM_005379:Homo sapiens myosin lA (MY01A) 
Hs.6566:thyroid hormone receptor interac 
Hs.59509: ESTs, Weakly similar to similar 
(locuslink)NM_001645:Horno sapiens opolip 
Hs.5198;Down syndrome critical region ge 
NM_003137:Homo sapiens SFRS protein Una 
NM_033013:Homo sapiens nuclear receptor 
Hs.2236:NIMA (never in mitosis gene a)-r 
NM_003963:Homo sapiens transmembrane 4 s 
Hs.40098:cysteine knot superfamily 1, BM 
Hs.50868:solute carrier family 22 (organ 
NM_022336:Homo sapiens ectodysplasin 1 , 
NM_000228:Homo sapiens laminin, beta 3 ( 

(locuslink)NM_017895:Homo sapiens DEAD/H 
NM_003823:Homo sapiens tumor necrosis fa 
Hs.236894:ESTs, Highly similar to S02392 
NM_020980:Homo sapiens aquaporin 9 (AQP9 
HM_014252:Homo sapiens solut carrierfa 
NM_006002:Homo sapiens ubiquitin carboxy 
NM_003094:Homo sapiens small nuclear rib 
Hs.1 72089:pro-oncosis receptor Inducing 
Hs.101282:Homo sapiens mRNA; cDNA DKF2p4 
Hs.301871 .-solute earner family 37 (glyc 
NMJ)02497:Homo sapiens NIMA (never In mi 
421943 BE616520 Hs.343912 NM_033504:Homo sapiens CAC-1 (CAC-1), mR
400529 

407233 X16354 Hs.50964 

447472 AW207347 Hs.211101 

447966 AA340605 Hs.105887 

439963 AW247529 Hs.6793 

407103 AA424881 Hs.256301 






(locuslink)NM_145252:Homo sapiens simila 
Hs.6793:platelet-adivating factor acely 
Hs.256301:hypothetical protein MGC13170 



421506 



BE302796 Hs.105097 
NMJW3645 Hs.11729 
BE548446 Hs.321579 



422714 AB018335 Hs.119387 



412723 AA648459 
Hs.335951
Hs.153357 
Hs.286192 
Hs.147097 
Hs.272789 
Hs.76913 
Hs.63325 



AA339449 Hs.82285 



NM_003645:Homo sapiens fatty-acid-Coenzy 
NM_021095:Homo sapiens solute earner fa 
Hs.348710:Homo sapiens, clone IMAGE4242 
NM_002257:Homo sapiens kallikrein 1, ren 
NMJ14698:Homo sapiens KIAA0792 gene pro 

NM_002754:Homo sapiens mitogen-aclivated 
NM_005603:Homo sapiens ATPa=e, C ass I, 
Hs.335951:hypothetical protein AF301222 
NM_001084:Homo sapiens pre 
NM_032192:Homos,-' - 



425123 AW205274 Hs.154695 

425743 BE396495 Hs.159428 

406C84 X16354 Hs.50964 

439580 AF086401 Hs.293847 



i AW000827 Hs. 11962 

i BE300330 Hs.1 18725 

l AW206330 Hs.355663 

I NMJW2543 Hs.77729 

I AW955454 Hs.30942 

I AA506476 Hs.375009 

I AF035969 Hs.24879 

' AK001058 Hs.12680 

! W07162 Hs.150826 

I Y07828 Hs.91096 

I L29339 Hs.1964 

' AW290997 Hs.190153 

i AF178532 Hs.271411 

' AW419202 Hs.286192 

' L32137 Hs.1584 

I AI192528 Hs.164537 

! AW301003 Hs.51483 

i AW961017 Hs.6459 

) N22788 Hs.82407 

I U83460 Hs.104557 

t X94453 Hs.1 14366 

i AU076629 Hs.1 65950 

! H18835 Hs.31608 

I AB014598 Hs.31720 

' NM-004482 Hs.278611 

I AF039850 Hs.198515 

1 AA622037 Hs.1 66468 

2 NM.000047 Hs.74131 

i AF011333 Hs.153563 

) BE270447 Hs.356512 

t U40272 Hs.75253 

) AW970780 Hs.59483 

5 AA568906 Hs.82240 

5 AU077141 Hs.374548 

I AJ227892 Hs.146274 

) AA878200 Hs.1 18727 

f AF109302 Hs.27495 

i T82427 Hs.194101 

5 A1916662 Hs.211577 



Hs.4867:mannosyl(alpha-1,3-) 



,3-)-glycoprote 
I . sir I- ( (m 



AF134164:Homo sapiens Human endogenous r 
NM_033453:Homo sapiens inosine triphosph 
NM_005975:Homo sapiens PTK6 protein tyro 

NMJ)06579:Homo sapiens emopami! binding 
Hs.191797:ESTs 

NM_000412:llorro sapiens hislidine-rich g! 
NM_0121 16:Homo sapiens Cas-Br-M (murine) 
NWL000303:Homo sapiens phosphomannomutas 
NM_138761:Homo sapiens BCL2-associated X 
(locuslink)NM_001712:Horno sapiens carcin 
Hs.293847:ESTs 

(locuslink)NM 001202:l iomo sapiens bone m 
Hs.184062:ctiromosome 20 open reading fra 
Hs.79187:coxsackie virus and adenovirus 
NM_030766:llomo sapiens apoplosis regulat 
NM_012248:Homo sapiens selenophosphate s 

NM_002543:Homo sapiens oxidised low dens 
NM_004093:Homo sapiens ephrin-B2 (EFNB2) 
Hs.375009:Homo sapiens mRNA; cDNA DKFZp6 
Hs.24879:phosphaMc acid phosphatase t 
Hs.12680:Homo sapiens cDNA FU10196 fls, 
NM_020387:Homo sapiens RAB25, member RAE 
NM_007028:Homo sapiens tripartite mo'if- 
NM_000343:Homo sapiens solute carrier fa 
Hs.190153:Homo sapiens cDNA FU33988fs 
NM_138992:I Iomo sapiens beta-site APP-cle 
NM_032192:Homo sapiens protein phosphata 
Hs.1584:cartilage oligomeric matrix prot 



Hs.51483:Homo sapiens, Similar to RIKEN 
(l s!ink)NM_0 L Horn; apiens h p -h 
NM_02 15 -lor spiens n mokine (C-X-i 
NM_001859:Homo sapiens solute carrier fa 
Hs.1 14366:pyrrolins-5-carboxylate synth e 
NMJ0201 1:Homo sapiens fibroblast growth 
f'j- 1 1 1 1 iNM_017636:Homo sapiens "transi 
NM_014799:Homo sapiens hephaestin (HEPH) 
NM_004482:Homo sapiens UDP-N-acetyl-alph 
NM_005224:Homo sapiens dead ringer-like 
NM_004708:Homo sapiens programmed cell d 
NM_000047:Homo sapiens arylsulfatase E ( 
HM_002349:I Iomo sapiens lymphocyte anlige 
Hs.356512:ESTs,Weakl nilartoUBt \ 
NM_004135:Homosapiens isocitrato dchyd: 
Hs.59483:leucine-rich repeat-containing 
Hs.82240:syntaxin3A 
Hs.374548:solute carrier family 1 6 (mono 



Hs.1 18727:Homo sapiens cDNA FLJ33803 fls 
Hs.27495:prostate cancer associated prot 
Hs.194101:Homo sapiens cDNA: FU20869 fi 
(locuslink)NM_004986:Homo sapiens kinect 
412869 AA290712 Hs.82407

443639 BE269042 Hs.9661 

418245 AA088767 Hs.83883 

409636 AA305729 Hs.18272 

408989 AW361666 Hs.49500 

428023 AL038843 Hs.374530 

410199 AW377424 Hs.205126 

431685 AW296135 Hs.267659 

409956 AW1 03364 Hs.727 

413278 BE563085 Hs.833 

431193 AW749505 Hs.296770 

436856 AI469355 Hs.127310 



NM.002318 Hs.83354 



NM_004605 Hs.375147 

M86699 Hs.169840 

NMJ01197 Hs.155419 

AA295251 Hs.166056 

AW136551 Hs.181245 

AW411491 Hs.75069 

M318501 Hs.241587 

NM_005764 Hs.271473 

AW372884 Hs.240770 

AW505308 Hs.75812 

BE262677 Hs.283558 

U80082 Hs.1 69600 

AI375726 Hs.227152 

AW406289 Hs.96593 

AF279265 Hs.298476 



446696 
400130 
431890 
425003 

417386

424837 BE276113 Hs.333034
424534



428471 
409012 
424154 
400290 
409152 
427333 
413835 
444664 

107777 
414806 
421190 



□87582 
AA378776 
X57348 
AL117435
AF026004 
H18636 
AA176585 



Hs.194346



418661 
419092 
414013 



AF067797 Hs.175658

AI272727 Hs.249153

N26362 Hs.11615

AW751497 Hs.98370 

M161071 Hs.71435 

D14594 Hs.77329 

U95031 Hs.102482 

R58665 Hs.46847 

AA316241 Hs.90691 

X90725 Hs.77597 

NM_005940 Hs.155324

AW504924 Hs.9805 

NM_000169 Hs.69089 

AF155108 Hs.256150 

AF119665 Hs.184011 

AA662240 Hs.283099 

BE396290 Hs.5097 

NM_014586 Hs.109437 

AW067903 Hs.82772 

AB037784 Hs.22941 

AI064813 Hs.195155 

X72755 Hs.77367 

AW973003 Hs.179909 

X63529 Hs.2877 

NM_001949 Hs.1189

J05581 Hs.89603 

AA766605 Hs.47099 

BE243834 Hs.50441 

AA250970 Hs.251946 

X59135 Hs.156110 

NM_014312 Hs.112377

AU076915 Hs.5398 

AI702038 Hs.100057 

AI878932 Hs.317 



NM_020182:Homo sapiens transmembrane, pr 
(locuslink)NM_030674:Homo sapiens solute
Hs.49500:KIAA0746 protein
Hs.374530:Homo sapiens cDNA: FLJ23602 fi
Hs.205126:Homo sapiens cDNA: FLJ22667 fi
NM_006113:Homo sapiens vav 3 oncogene (V 
NM_002192:Homo sapiens inhibin, beta A ( 
Hs.833:interferon-stimulated protein, 15 
Hs.296770:KIAA1719 protein 
(locuslink)NM_144624:Homo sapiens kinase
Hs.367942:Homo sapiens, clone IMAGE:4701
Hs.2056:UDP glycosyltransferase 1 family
NM_002318:Homo sapiens lysyl oxidase-lik
NM_004207:Homo sapiens solute carrier fa
Hs.146170:hypothetical protein FLJ22969
NM_004111:Homo sapiens flap structure-sp
Hs.90078:nucleotide-sugar transporter si
NM_003965:Homo sapiens chemokine (C-C mo
Hs.376147:Homo sapiens cDNA FLJ39099 fis
Hs.169840:TTK protein kinase
NM_001197:Homo sapiens BCL2-interacting
locus1ink)NM_00 7:Homo sapiei
Hs.181245:Homo sapiens cDNA FLJ12532 fis
Hs.75069:serine hydroxymethyltransferase
NM_021246:Homo sapiens lymphocyte antige 
H 27147 n 31 pr t ir .p egi ,1 
Hs.240770:nuclear cap binding protein su

NM_004563:Homo sapiens phosphoenolpyruva 
NM_018509:Homo sapiens hypothetical prot
Hs.169600:KIAA0826 protein
NM_016391:Homo sapiens hypothetical prot
NM_019034:Homo sapiens ras homolog gene
NM_022911:Homo sapiens solute carrier fa
NM_001746:Homo sapiens calnexin (CANX),
NM_002203:Homo sapiens integrin, alpha 2
NM_014481:Homo sapiens APEX nuclease (ap
NM_018144:Homo sapiens Sec61 alpha form
NM_003491:Homo sapiens ARD1 homolog, N-a
Hs.150275:KIAA0241 protein
(locuslink)NM_024051:Homo sapiens hypoth
Hs.184510:stratifin 
Hs.49725:DKFZP434l216 protein 
NM_004366:Homosa[ 
(locuslink)NM_017636:Homo sa
Hs.194346:Spir-2 protein 
NM_001169:Homo sapiens aquaporin 8 (AQP8 
NM_024306:Homo sapiens fatty acid hydrox 
NM_016086:Homo sapiens map kinase phosph
NM_030622:Homo sapiens cytochrome P450,
Hs.71465:squalene epoxidase
(locuslink)NM_014754:Homo sapiens phosph
Hs.102482:mucin 5, subtype B, tracheobro
NM_016614:Homo sapiens TRAF and TNF rece
NM_006993:Homo sapiens nucleophosmin/nuc
NM_000998:Homo sapiens ribosomal protein



Hs.256150:NY-REN-41 an 
Hs.184011:pyrophosphatase (inorg
Hs.283099:AF15q14 protein
Hs.5097:synaptogyrin 2
NM_014586:Homo sapiens hormon
Hs.82772:collagen, type XI, alpha 1
Hs.22941:KIAA1353 protein
Hs.195155:solute carrier family 38, tr.
NM_002416:Homo sapiens iron "" 
(locuslink)NM_024831:Homosa[ 
NM_001793:Homo sapiens cadherin 3, type 
NM_001949:Homo sapiens E2F transcription 
NM_002456:Homo sapiens mucin 1, transmem 
NM_024642:Homo sapiens hypothetical prot
NM_015936:Homo sapiens CGI-04 protein (L
Hs.251946:Homo sapiens cDNA FLJ11840 fis
Hs.1551 ^immunoglobulin kappa constant 
! l ll.'Tco-ticalthymocytereceptorfX 

Hs.100057:serine/threonine kinase 35
NM_003286:Homo sapiens topoisomerase (DN
BE378432




NM_052984:Homo sapiens cyclin-dependent








Hs.258575 


MO _J I n a| i lir (1 1 Di i 






AW388633 




i lS-G ! 382:so'Li1e earner family 7, (cation 






AI458165










AW401864 




NM_004208:Homo sapiens programmed cell d






AW977382 


Hs.15898 


Hs.15898:2,4-dienoyl CoA reductase 2, pe 






AA852773 


Hs.334838 


Hs.334838:KIAA1866 protein 








Hs.378718 


Hs.378718:Homo sapiens cDNA FLJ33433 fis










(locuslink)NM_005274:Homo sapiens guanin 


3.89 








H 8:CDC91 cycle 91-1 


3.89 


400750 


AF146761 










AI660842 




10 c I r.c r r ( ul 






AW293134 




NM_005977:Homo sapiens ring finger prote 






AF082283 




1M 103921:1 loxosapien n VC Ulympho 
















AA494359 




Hs.30715:potassium voltage-gated channel






AI381800 




Hs.300684:calcitonin gene-related peptid






AA316181 


Hs.61635 


NM_012449:Homo sapiens six transmembrane










NM_005242:Homo sapiens coagulation facto 










NM_012145:Homo sapiens deoxythymidylate






AB023227 


Hs.23860 


Hs.23860:KIAA1010 protein






AA032279


Hs.61635 


Hs.61635:six transmembrane epithelial an






AW607831 




NM_014343:Homo sapiens claudin 15 (CLDN1






BE244231 




NM_015937:Homo sapiens CGI-06 protein (L






AI904232




Hs.75323:prohibitin






BE158006 




Hs.212296:ESTs 






AA971014 




NM_000884:Homo sapiens IMP (inosine mono 
















AB023182 


Hs.184523


Hs 184523-se ne/threo ne Una £ 31* 






AI272144 


Hs.236522 


Hs.236522:DKFZP434P106 protein 






BE621310 




Hs.923:single-stranded DNA binding prote






BE277414 




NM_005370:Homo sapiens mel transforming 






AF186081 


Hs.175783 


AW578452:RC1-CT0252-030100-023-b07 CT025






AW578452 










BE298665 




Hs.14846:Homo sapiens mRNA; cDNA DKFZp56 










i 11 omaii; \ 






BE267057 


Hs.325321 








BE616633 










AI638871 




Hs.378965:Homo sapiens cDNA FLJ37658 fis 






AI660247 


Hs.32699 


I 699:Hon pi ' j k P h EN 






AW391972 




Hs.104696:KIAA1324 protein 






AW015211 


Hs.153799 


H V > H i )ic V *■ Fl 8333 f 
















AF102850


Hs.227933 


f M 01 H rrc pier ~ r S c visi 










NM_003212:Homo sapiens teratocarcinoma-d






AW630088 




Hs.171952:occludin








Hs.171952 






























Hs.278613 


(locuslink)NM_005532:Homo sapiens interf






AW293801 


Hs.255052


Hs.255052:ESTs 






BE276112 










NM_004987




(locuslink)NM_004987:Homo sapiens LIM an






BE258602 




NM_016292:Homo sapiens heat shock protei






BE252241 


Hs.38041 


NM_003681:Homo sapiens pyridoxal (pyrido 






AF151812 




NM_015966:Homo sapiens serologically def






BE560621 


Hs.9222 






















Hs.1634:cell division cycle 25A






AB020643 


Hs.183006






447200 


BE543146 


Hs.281434


Hs.281434:Homo sapiens cDNA FLJ31373 fis








Hs.155140 


NM_001895:Homo sapiens casein kinase 2, 






T28407 


Hs.81564 


NM_002619:Homo sapiens platelet factor 4






AA287786 


Hs.23449 








AW138207 


Hs.146170 


NM_022842:Homo sapiens hypothetical prot




446291 


BE397753 


Hs.14623 


Hs.14623:interferon, gamma-inducible pro 




435886 


BE265839 


Hs.12126 






417286 




Hs.81874 


NM_002413:Homo sapiens microsomal glutat








Hs.107614 


Hs.107614:DKFZP564I1171 protein




400419 AF084545 AF084545:Homo sapiens versican Vint isof 3.73


421357 AK000309 Hs.103808 NM_017896:Homo sapiens chromosome 20 ope 3.73




AW469240 Hs.371581 Hs.371581:ESTs 3.73


418703 NM_014448 Hs.87435 Hs.87435:Rho guanine exchange factor (GE 3.73


452679 Z42387 Hs.83883 (locuslink)NM_020182:Homo sapiens transm 3.72




AW408762 Hs.5957 Hs.5957:Homo sapiens clone 24416 mRNA se 3.72


435730 AB020635 Hs.4984 Hs.4984:KIAA0828 protein 3.72


431512 BE270734 Hs.2795 Hs.2795:lactate dehydrogenase A 3.72


444006 BE395085 Hs.334762 (locuslink)NM_032832:Homo sapiens hypoth 3.72


442875 BE623003 


Hs.23625 


I 2 r Muu api , lo TCC( rA00142 


3.71 


413431 AW24642E Hs.75355 NM_003348:Homo sapiens ubiquitin-conjuga 3.71


41395C AA249096 Hs.32793 Hs.32793:Homo sapiens cDNA FLJ31108 fis, 3.71


41112E AA151647 Hs.68877 Hs.68877:cytochrome b-245, alpha polypep
H27498 Hs.293441
U11700 Hs.84999 
AL080192 Hs.101282 



413476 U25849 Hs.75393 
400846 
415003 
408137 
418650
413179 N99692 Hs.75227 



M11437 
AI694131 
BE386750 



442315 
462875 
401179 
410174 
418558 
440086 
409402 
436014 



BE313280 
AU076609 
J05272 
R94038 



AA306007 Hs.59461 

AW082266 Hs.86131

NM_005402 Hs.6906

AF208234 Hs.695 

AF281134 Hs.283741 

AI796390 Hs.210667 

AU076782 Hs.248267 

AL080235 Hs.35861 



430720 U85768 Hs.247838 

432320 AW411066 Hs.274351 

420186 NM_015925 Hs.95697

441128 M570256 Hs.348504 

444184 T87841 Hs.282990 

411678 AI907114 Hs.71465 

423750 AF165883 Hs.298229 

412948 BE243313 Hs.334851 

452098 AI858183 

430024 AI808780 Hs.227730 

416412 NM_014742 Hs.79305 

437712 X04588 Hs.86844
400847 

406671 M1 29547 Hs.285754 

412641 M16660 Hs.74336 
404354 
400448 



441406 Z45957 

417389 BE260964 

419607 R52557 

447250 AI878909 

446356 AI816736 

431236 AV656840 

426722 U53823 

420531 AI652069 

416933 BE561850 

447698 AI420156 

434457 AF141332 

424241 AWgg5948 

452264 AU077013 

420614 AL110291

434224 AA360731 






Hs.129673 
Hs.8364
Hs.174139 



AV656571 Hs.121068 



421802 BE261458 



AC005954 Hs.25527 

AA301116 Hs.142838 

AW052006 Hs.374973 

AF161254 Hs.105196 

AB012193 Hs.183874 

F07783 Hs.1369 



Hs.293441:Homo sapiens SNC73 protein (SN
NM_000053:Homo sapiens ATPase, Cu++ tran
Hs.101282:Homo sapiens mRNA; cDNA DKFZp4 
NM_001853:Homo sapiens collagen, type IX 
NM_004300:Homo sapiens acid phosphatase 



Hs.7837 

Hs.82045 

Hs.91579 

Hs.17883 

Hs.14896 

Hs.285115 

Hs.171952

Hs.98614 

Hs.80506 

Hs.320733 

Hs.200333 

Hs.8364 



NM_005002:Homo sapiens NADH dehydrogenas
NM_004632:Homo sapiens death associated
NM_001033:Homo sapiens ribonucleotide re
(locuslink)NM_000883:Homo sapiens IMP (i
NM_005538:Homo sapiens inhibin, beta C ( 



NM_006114:Homo sapiens translocase of ou

Hs.59461:DKFZP434C245 protein
Hs.86131:Fas (TNFRSF6)-associated via de
NM_005402:Homo sapiens v-r
Hs.695:cystatin B (stefin B)
NM_020158:Homo sapiens ex



(locuslink)NM_021126:Homo sapiens mercap
Hs.35861:Ras-induced senescence 1
Hs.273333:hypothetical protein FLJ10966
NM_002991:Homo sapiens small inducible c
NM_016032:Homo sapiens zinc finger, DHHC
Hs.95697:liver-specific bHLH-Zip transcr
485I f:l polheti. I p'ol ir r 01407
(locuslink)NM_033550:Homo sapiens chromo
NM_003129:Homo sapiens squalene epoxidas
NM_012394:Homo sapiens prefoldin 2 (PFDN
Hs.334851:LIM and SH3 protein 1
BF755039:QVO-CT0583-181000-428-f07CT058
NM_000210:Homo sapiens integrin, alpha 6
Hs.79305:KIAA0255 gene product 
Hs.85844:neurotrophic tyrosine kinase, r 



Hs.82045:midkine (neurite growth-promoti
NM_033416:Homo sapiens similar to HYPOTH
NM_002707:Homo sapiens protein phosphata
Hs.14896:zinc finger, DHHC domain contai
NM_001560:Homo sapiens interleukin 13 re
NM_002538:Homo sapiens occludin (OCLN),
NM_004587:Homo sapiens ribosome binding 
NM_003090:Homo sapiens small nuclear rib 
NM_052858:Homo sapiens similar to RIKEN 
NM_018690:Homo sapiens apolipoprotein B4 
Hs.8364:pyruvate dehydrogenase kinase, i
Hs.28757:transmembrane 9 superfamily mem
Hs.99364:abhydrolase domain containing 1
NM_000206:Homo sapiens interleukin 2 rec
NM_006904:Homo sapiens protein kinase, D
Hs.129673:eukaryotic translation initiat
Hs.8364:pyruvate dehydrogenase kinase, i
NM_001829:Homo sapiens chloride channel
NM_080748:Homo sapiens chromosome 20 ope
(locuslink)NM_017988:Homo sapiens hypoth
NM_021259:Homo sapiens transmembrane pro
NM_004964:Homo sapiens histone deacetyla
NM_001532:Homo sapiens solute carrier fa
(locuslink)NM_003270:Homo sapiens transm
Hs.184389:ESTs, Moderately similar to S 1
Hs.274230:3'-phosphoadenosine 5'-phospho
NM_002266:Homo sapiens karyopherin alpha
NM_020470:Homo sapiens putative transmem
(locuslink)NM_016022:Homo sapiens CGI-78



(locuslink)NMJ45698:Homo sapiens endoze 
(locuslink)NM_033405:Homo sapiens peroxi 
(locuslink)NM_014428:Homo sapiens tight
NM_032390:Homo sapiens MKI67 (FHA domain
NM_004697:Homo sapiens PRP4 pre-mRNA pro
(locuslink)NM_016579:Homo sapiens 8D6 an
NM_003589:Homo sapiens cullin 4A (CUL4A)
NM_000574:Homo sapiens decay acceleratin
429023 NM_000312 Hs.2351
452862 AW378065 Hs.8687 
442993 BE018682 Hs.166196 



Hs.153752 
Hs.209119 
Hs.32123i 
429583 NM_006412

445937 AI452943 

424954 NM_000546

424142 AI678727 

428028 U52112 

456534 X91195 

424685 W21223 
419170 



457635 AV660976 
419705 AW368634 
454390 AB020713 



Hs.378970 

Hs.182018 

Hs.100623

Hs.151734 

Hs.287850 

Hs.6710 

Hs.350077 

Hs.246315 

Hs.250696 

Hs.118638 

Hs.3569 

Hs.154331 

Hs.56966 



451707 
433604 
420085 



AW051061 Hs.60973 

NM_013442 Hs.3439 

AI741909 Hs.44680 

AA766142 Hs.131810 

AW238299 Hs.250618 

X57522 Hs.352018 

AW296454 Hs.24743 

H59799 Hs.42644 

AA305800 Hs.5672

AL359575 Hs.23765

NM_014788 Hs.179703

AF078866 Hs.284296

Hs.75309

AI580063 Hs.109007

AA088177 Hs.172870

AF111170 Hs.306165

AI732374 Hs.339827

BE269243 Hs.182625

AA382104 Hs.81337

AW451034 Hs.326525

DE281191 Hs.155462

AA194422 Hs.22564

AI587180 Hs.110906

AW732276 Hs.95583

AL035668 Hs.73853

BE268621 Hs.149155

U33446 Hs.75799

W21872 Hs.7907

AW594506 Hs.104830

BE250944 Hs.183556

AF167572 Hs.12912

BE391929 Hs.8752

D15049 Hs.179770

AI376722 Hs.180062

BE303044 Hs.192023

BE502341 Hs.3402

AW953575 Hs.303125

NM_002951 Hs.75722

AI951628 Hs.127007

U67319 Hs.9216

AF083420 Hs.168913



451129 BE072881 



448440 AA173467 Hs.62402 

426858 NM_004182 Hs.172791 

417457 AA378907 Hs.349323 
406363 

444758 AL044878 Hs.11899



426125 X87241 

452835 AK001269 

419493 AF001212 

457670 AF119666 
400125 

429404 NM_005738

410293 AK000047 

434826 AF155661 

423599 AI805664 

427715 BE245274 



Hs.166994 
Hs.30738 
Hs.90744 
Hs.23449 
Hs.125078 
Hs.10706 
Hs.61960 



NM_000312:Homo sapiens protein C (inacti
Hs.8687:ESTs

Hs.166196:ATPase, Class I, type 8B, memb

(locuslink)NM_004358:Homo sapiens cell d
NM_006412:Homo sapiens 1-acylglycerol-3-
(loci link)NM J0377 Home 3[ 3n UDP-G 
NM_000546:Homo sapiens tumor protein p53 
Hs.378970:Homo sapiens cDNA FLJ35102 fis 
f 18201 inl I 1 r Icra so £ 
NM_138689:Homo sapiens protein phosphata 
Hs.151734:nuclear transport factor 2
NM_002219:Homo sapiens integral membrane
NM_004870:Homo sapiens mannose-P-dolicho
NM_000982:Homo sapiens ribosomal protein
NM_017423:Homo sapiens UDP-N-acetyl-alph
NM_006855:Homo sapiens KDEL (Lys-Asp-Glu
Hs.118638:non-metastatic cells 1, protei
Hs.3569:chromosome 20 open reading frame
Hs.154331:ESTs 

(locuslink)NM_024923:Homo sapiens hypoth 

Hs.60973:Homo sapiens cDNA FLJ40329 fis,
Hs.3439:stomatin (EPB72)-like 2
Hs 14( jO:hypotr.etic i [rot- 1 FU20979 
Hs.131810:Homo sapiens cDNA FLJ35976 fis 
NM_025217:Homo sapiens UL16 binding prot 
NM_000593:Homo sapiens transporter 1, AT
Hs.24743:hypothetical protein FLJ20171
l 12 J4 th redoxii iil 2 
(locuslink)NM_030799:Homo sapiens golgi
Hs.23765:membrane metallo-e
NM_014788:Homo sapiens trip
NM_033161:Homo sapiens surfeit 4 (SURF4)
NM_001961:Homo sapiens eukaryotic transl
Hs.109007:Homo sapiens, Similar to LOC16
Hs.172870:KIAA1913 protein
Hs.306165:ESTs, Highly similar to unknow
Hs.339827:ESTs, Weakly similar to protea
Hs.182625:VAMP (vesicle-associated membr
Hs.81337:lectin, galactoside-binding, so
NM_001669:Homo sapiens arylsulfatase D ( 

NM_004999:Homo sapiens myosin VI (MYO6),
Hs.110906:hypothetical protein DC004501
NM_012339:Homo sapiens transmembrane 4 s
NM_001200:Homo sapiens bone morphogeneti
(locuslink)NM_003374:Homo sapiens voltag
Hs.75799:protease, serine, 8 (prostasin)
(locuslink)NM_145059:Homo sapiens L-fuco
Hs.104830:ESTs

Hs.183556:solute carrier family 1 (neutr
NM_006109:Homo sapiens SKB1 homolog (S.
Hs.8752:transmembrane protein 4
NM_002842:Homo sapiens protein tyrosine
Hs.180062:proteasome (prosome, macropain
NM_003757:Homo sapiens eukaryotic transl
NM_139177:Homo sapiens chromosome 17 ope
Hs.303125:p53-induced protein PIGPC1
NM_002951:Homo sapiens ribophorin II (RP
NM_003740:Homo sapiens potassium channel
NM_033340:Homo sapiens caspase 7, apopto
NM_00357S:Homo sapiens serine/threonine 
NM_005526:Homo sapiens heat shock transc 
BE072881:RC2-BT0548-200300-012-e09 BT054 
NM_006265:Homo sapiens RAD21 homolog (S.
Hs.182490:leucine-rich PPR-motif contain
Hs.62402:p21/Cdc42/Rac1-activated kinase
NM_004182:Homo sapiens ubiquitously-expr
Hs.349326:Homo sapiens cDNA FLJ30377 fis

NM_000859:Homo sapiens 3-hydroxy-3-methy
IM_ M Hi iosaf hi-rer tp

Hs.166994:FAT tumor suppressor homolog 1
NM_018087:Homo sapiens hypothetical prot
Hs.90744:proteasome (prosome, macropain) 
NM_018842:Homo sapiens insulin receptor 
I ei link i*JM_304152:Homo sapiens omilh 
NM_005738:Homo sapiens ADP-ribosylation 
AA298741 Hs.102135

AA360954 Hs.27268 

AW294985 Hs.30715 

J03626 Hs.2057 

NM_012243 Hs.159322

NM_001360 Hs.11806

BE265254 Hs.343258 

AI634860 Hs.359682 

AA581439 Hs.152328 

AI878927 Hs.79284 

D29643 Hs.34789 

AA401039 Hs.2903 
Hs.75309 

BE247600 Hs.377968 

AL120051 Hs.144700

NM_004460 Hs.418

M350994 Hs.20281

BE266322 Hs.211374

NM_004341 Hs.154868

M360154 Hs.177415

AI498072 Hs.351474

AB025237 Hs.388

W79485 Hs.173980

AF052112 Hs.12540

AK000398 Hs.11747

BE268321 Hs.208912

W19744 Hs.180059

BE383507 Hs.78921

AK000275 Hs.75871

BE277024 Hs.146381

AA338305 Hs.377816

AI557486 Hs.119122

BE173977 Hs.10098

AW968941 Hs.166254

NM_001982 Hs.199067

BE185030 Hs.241305

AA128808 Hs.179902

AI056776 Hs.133397

AA926960 Hs.348669 

AL035737 Hs.75184

AU076698 Hs.132760

AW450485 Hs.4437

BE464341 Hs.21201

AW843633 Hs.343261

AA305159 Hs.113019

AI123118 Hs.15159

BE269352 Hs.949

AW960707 Hs.148324

AA158177 Hs.118722

D30857 Hs.82353

AI738719 Hs.198427

AW271217 Hs.281434

AF212223 Hs.25010

NM_003365 Hs.119251

AK000597 Hs.169549

BE613348 Hs.356392

AW504657 Hs.152931

AW503680 Hs.5957

AI878842 Hs.237924

Y10805 Hs.20521

NM_001756 Hs.1305

AU076610 Hs.82399

AI879076 Hs.75061

D26067 Hs.174905

AA524545 Hs.224630

AI990122 Hs.196988

NM_007274 Hs.8679

AW327546 Hs.111024

AW770994 Hs.30340

AA521246 Hs.210792

AK000002 Hs.55879

NM_012453 Hs.52515

AF054178 Hs.19561

AF105365 Hs.172613



,i '3931:maj lis c m|. I lityrompli 
Hs.99364:abhydrolase domain containing 1 
NM_005176:Homo sapiens ATP synthase, H+ 
NM_006280:Homo sapiens signal sequence r 
Hs.27268:Homo sapiens cDNA: FLJ21933 fis
Hs.30715:potassium voltage-gated channel
NM_000373:Homo sapiens uridine monophosp
(locuslink)NM_012243:Homo sapiens solute
NM_001360:Homo sapiens 7-dehydrocholeste
NM_006191:Homo sapiens proliferation-ass
i rH M OK 1 n i i 1 
NM_002765:Homo sapiens phosphoribosyl py 



NM_002402:Homo sapiens mesoderm specific
NM_005216:Homo sapiens dolichyl-diphosph
Hs.2903:protein phosphatase 4 (formerly
NM_001961:Homo sapiens eukaryotic transl
NM_020400:Homo sapiens G protein-coupled
NM_004429:Homo sapiens ephrin-B1 (EFNB1)
(locuslink)NM_004460:Homo sapiens fibrob
Hs.20281:MAPK phosphatase-7
(locuslink)NM_145051:Homo sapiens hypoth
NM_004341:Homo sapiens carbamoyl-phospha
(locuslink)NM_001997:Homo sapiens Finkel
Hs.351474:Homo sapiens cDNA FLJ30002 fis
NM_002452:Homo sapiens nudix (nucleoside
Hs.173980:nuclear matrix protein NMP200
NM_006330:Homo sapiens lysophospholipase
(locuslink)NM_017793:Homo sapiens chromo
Hs.208912:hypothetical protein MGC861
Hs.180059:Homo sapiens cDNA FLJ31360 fis
NM_003488:Homo sapiens A kinase (PRKA) a
(locuslink)NM_012408:Homo sapiens protei
Hs.146381:RNA binding motif protein, X c
Hs.377816:Homo sapiens cDNA FLJ36808 fis
Hs.119122:ribosomal protein L13a
NM_019082:Homo sapiens putative nucleola
Hs.166254:likely ortholog of rat vacuole
NM_001982:Homo sapiens v-erb-b2 erythrob 
(locuslink)NM_006470:Homo sapiens tripar 

(locuslink)NM_022109:Homo sapiens CDw92 



Hs.348669:CDC28 protein kinase 1 
Hs.75184:chitinase 3-like 1 (cartilage g
(locuslink)NM_001467:Homo sapiens glucos
NM_000991:Homo sapiens ribosomal protein
Hs.21201:nectin 3

Hs.343261:histocompatibility (minor) 13
NM_015931:Homo sapiens fls485 (LOC51065)
(locuslink)NM_016326:Homo sapiens chemok
NM_000433:Homo sapiens neutrophil cytoso 
Hs.148324:ESTs 

(locuslink)NM_004480:Homo sapiens fucosy 
NM_006404:Homo sapiens protein C recepto
NM_000189:Homo sapiens hexokinase 2 (HK2
Hs.281434:Homo sapiens cDNA FLJ31373 fis
NM_018698:Homo sapiens hypothetical prot
NM_003365:Homo sapiens ubiquinol-cytochr
NM_017893:Homo sapiens sema domain, immu
Hs.356392:ESTs, Highly similar to S-phas
(locuslink)NM_002296:Homo sapiens lamin
Hs.5957:Homo sapiens clone 24416 mRNA se
NM_016016:Homo sapiens CGI-69 protein (L
NM_001536:Homo sapiens HMT1 hnRNP methyl
NM_001756:Homo sapiens serine (or cystei
NM_007357:Homo sapiens component of olig
Hs.75061:macrophage myristoylated alanin
Hs.174905:KIAA0033 protein
Hs.224630:Homo sapiens cDNA FLJ33318 fis
Hs.196988:ESTs 

(locuslink)NM_007274:Homo sapiens cytoso
(locuslink)NMJ) 1 pie, u 

Hs.30340:hypothetical protein KIAA1105
Hs.210792:Homo sapiens cDNA FLJ35591 fis

(locuslink)NM_033450:Homo sapiens multid
NM_012453:Homo sapiens transducin (beta)
NM_005001:Homo sapiens NADH dehydrogenas 



NM_006598:Homo sapiens sol 
429671 BE379335 Hs.211594

434521 NM_002267 Hs.3886

454128 AL031259 Hs.367900 

445033 AV652402 Hs.72901 

443752 AA593867 Hs.300842 

425221 AV649864 Hs.155188 

440286 U29589 Hs.7138 

413745 AW247252 Hs.75514 

412338 AA151527 Hs.69485 



427640 



424381 AA285249 






AF058293 Hs.180015 

AW082298 Hs.146161 

NM_001761 Hs.1973

L13210 Hs.79339 

AF290512 Hs.58215 
Hs.146329 
Hs.172216 

AI816209 Hs.95867 

R52518 Hs.7967 

AU076718 Hs.164021 

AW583058 Hs.234726 

AB023179 Hs.9059 

AW575742 Hs.351676 

AI434780 Hs.4248 

AF220053 Hs.54960 

R26944 Hs.180777

L28824 Hs.74101

NM_016929 Hs.283021

NM_016142 Hs.279617

R31178 Hs.287820 

AW182833 Hs.112188 

AI167810 Hs.379753 

NM_003028 Hs.244542

AA365752 Hs.155965

BE090176 Hs.179902

NM_005754 Hs.220689
AI808527 Hs.192822
BE276013 Hs.343828
X51416 Hs.110849
AA532963 Hs.9100 
Hs.70604 
Hs.77422 
Hs.28803 
Hs.301804 
Hs.75790 



411531 AB014611 

414820 AA371931 

450770 M019924 

433233 AB040927 

414172 AW954324 

428781 AF164799 Hs.193384 

432078 BE314877 Hs.24553 

454358 AW792876 Hs.288936 

447140 AF070537 Hs.17481

409132 AJ224538 Hs.50732 
400836 

420281 AI623693 Hs.323494 

414343 AL036166 Hs.75914 

447096 BE539199 Hs.62112 

412276 BE262621 Hs.73798 

425261 BE385099 Hs.355814 

407736 N41744 Hs.349326 



Hs.211594:proteasome (prosome, macropain
Hs.3886:karyopherin alpha 3 (importin al
Hs.367900:programmed cell death 2
NM_078487:Homo sapiens cyclin-dependent
NM_024820:Homo sapiens KIAA1608 protein
NM_005642:Homo sapiens TAF7 RNA polymera
NM_000740:Homo sapiens cholinergic recep
NM_000270:Homo sapiens nucleoside phosph
(locuslink)NM_024661:Homo sapiens hypoth
Hs.343566:KIAA0251 protein
NM_001355:Homo sapiens D-dopachrome taut

M_C 1:Homos ens hypothetical prol 
NM_001761:Homo sapiens cyclin F (CCNF), 
NM_005567:Homo sapiens lectin, galactosi 
(locuslink)NM_033046:Homo sapiens rhotek
NM_007194:Homo sapiens CHK2 checkpoint h
NM_001275:Homo sapiens chromogranin A (p
(locuslink)NM_024112:Homo sapiens chromo
Hs.7967:ESTs, Weakly simil t ir
NM_002993:Homo sapiens small inducible c
NM_001085:Homo sapiens serine (or cy
Hs.9059:KIAA0962 protein
Hs.351676:Homo sapiens cDNA FLJ25921 fis
Hs.4248:Homo sapiens PP3781 mRNA, comple
NM_018468:Homo sapiens uncharacterized h 
Hs.180777:Homo sapiens mRNA; cDNA DKFZp5 
Hs.74101:spleen tyrosine kinase 
NM_016929:Homo sapiens chloride intracel
Hs.279617:hydroxysteroid (17-beta) dehyd
Hs.287820:fibronectin 1
(locuslink)NM_021826:Homo sapiens hypoth
Hs.379753:Homo sapiens cDNA FLJ33176 fis
Hs.244542:Homo sapiens cDNA FLJ38908 fis



NM_080546:Homo sapiens CDw92 antigen (CD



NM_030949:Homo sapiens protein phosphata 
Hs.343828:Homo sapiens mRNA; cDNA DKFZp7 
(locuslink)NM_004451:Homo sapiens estrog
Hs.9100:hypothetical gene supported by A
Hs.70604:ATPase, Class II, type 9A
Hs.77422:proteolipid protein 2 (colonic
Hs.28803:ESTs
Hs.301804:KIAA1494 protein
(locuslink)NM_002642:Homo sapiens phosph
Hs.193384:putative 28 kDa protein
(locuslink)NM_022369:Homo sapiens hypoth
t'1 120 i r
NM_138391:Homo sapiens hypothetical prot
NM_005399:Homo sapiens protein kinase, A 

(locuslink)NM_017964:Homo sapiens hypoth 
NM_006315;H rr sa[ nsco -dvr ok rm 
(locuslink)NM_003457:Homo sapiens zinc f
Hs.73798:macrophage migration inhibitory
1- l4:Homosap r II f- 2
Hs.349326:Homo sapiens cDNA FLJ30677 fis



414249 
431243 
417777 
457274 
445139 
430280 
422197 
422938 
435575 



414814 
405387 
444108 



U46455 
AI823763 Hs.70 



AW674193

AB037848 

AA361258 



Hs.227152
Hs.12365
Hs.237868
AW974265 Hs.111632
NM_001809 Hs.1594
AF213457 Hs.44234 
AK000733 Hs.23900 
NM_00119S Hs.172894 



R55784 Hs.140942 

AL036662 Hs.144949 

X78342 Hs.77313 

AW973666 Hs.153850

BE560065 Hs.26433 

BE222542 Hs.128782 

AK000310 Hs.17138 

AA194952 Hs.36093 



(locuslink)NM_012310:Homo sapiens kinesi 

(locuslink)NM_017510:Homo sapiens gp25L2 
NM_002999:Homo sapiens syndecan 4 (amphi 
Hs.7055:Homo sapiens cDNA FLJ33420 fis,
NM_016391:Homo sapiens hypothetical prot
Hs.12365:synaptotagmin XIII
Hs.237868:Homo sapiens mRNA; cDNA DKFZp6
Hs.111632:Lsm3 protein
NM_001809:Homo sapiens centromere prot
NM_018965:Homo sapiens triggering recept
Hs.23900:Rac GTPase activating protein 1
Hs.172894:BH3 interacting domain death a
(locuslink)NM_002004:Homo sapiens farnes

Hs.140942:Homo sapiens cDNA FLJ38396 fis
Hs.144949:ESTs

Hs.77313:cyclin-dependent kinase (CDC2-l
Hs.153850:hypothetical protein C321D24
NM_001382:Homo sapiens dolichyl-phosphat
Hs.128782:Homo sapiens cDNA FLJ31512 fis
(locuslink)NM_017755:Homo sapiens hypoth
Hs.36093:Homo sapiens cDNA FLJ12885 fis,






415402 AA164687 

451032 W03692 

412146 M92444 

417018 M16038 

425244 AK002127 

417878 U90916 

426675 AW084791 

432728 NM_006979 

442643 U82756 

418462 BE001596 



418127 BE243982 

416293 BE244454 

435968 AW161481 

414702 L22005 



426410 BE298446 

422282 AF019225 

450295 AI766732 

413900 AW409747 

452695 AW780199 

407797 AK000524 

412006 AW451618 

424247 X14008 

447627 AF090922 

422753 AI928995 

441321 H17182 

411358 R47479 

438444 AI064707 

424727 AW590378 






Hs.323079 
Hs.73722 
Hs.80887 
Hs.155313 



Hs.133122 
Hs.278721 
Hs.374973 



428699 AW578252 

422675 BE018517 

444301 AK000136 

409932 AI376750 

419152 L12711 

410240 AL157424

413073 AL038165 

406865 AI025931 



Hs.79162 

Hs.111577

Hs.76932

Hs.5741

Hs.4990 

Hs.278672 

Hs.41694 

Hs.7345 

Hs.305890 

Hs.114309

Hs.210628 

Hs.75612 

Hs.30327 

Hs.39850 

Hs.290216 

Hs.152738
Hs.1575
Hs.7771
Hs.94761
Hs.301226
Hs.378965
Hs.373554 
Hs.171811 



Hs.119140
Hs.10760 
Hs.57600 
Hs.89643 
Hs.61289 
Hs.75187 
Hs.181357 



Hs.177576:mannosyl (alpha-1,3-)-glycopro
Hs.323079:Homo sapiens mRNA; cDNA DKFZp5
Hs.73722:APEX nuclease (multifunctional
Hs.80887:v-yes-1 Yamaguchi sarcoma viral
NM_022105:Homo sapiens death associated
Hs.82845:Homo sapiens cDNA: FLJ21930 fis
Hs.133122:hypothetical protein FLJ14524
NM_006979:Homo sapiens HLA class II regi
(locuslink)NM_004697:Homo sapiens PRP4 p
Hs.85266:integrin, beta 4
NM_005318:Homo sapiens H1 histone family 
Is l4S07:hyp hetfcal pr te .MG 20 
(locuslink)NM_002389:Homo sa



408150 
431738 
449703 



BE620274 Hs.43112 

AW237726 Hs.288549 

H61001 Hs.171802 

NM_004428 Hs.1624
Hs.2280 

AW961400 Hs.333526 

AA236291 Hs.183583 

AI929659 Hs.237825 

AA093322 Hs.301404 

AI815206 Hs.100293 

M114050 Hs.211610 

AA302049 Hs.31181 

BE262989 Hs.12045 



441028 AI333660 Hs.17558

417211 T97617 Hs.269092

421684 BE281591 Hs.106768

429824 AA296363 Hs.121520

426234 BE314534 Hs.168159 

408805 H69912 Hs.48269 

417821 BE245149 Hs.82643 

416976 BE243985 Hs.80680 

410013 AF067173 Hs.57904 

429597 NM_003816 Hs.2442 

453518 AW503205 Hs.27268 

429238 NM_002849 Hs.198288 

433409 AI278802 Hs.25661 

426440 BE382756 Hs.169902 

435472 AW972330 Hs.283022 

427609 AK000436 Hs.179791 

408201 AK000568 Hs.43654 

418181 U37012 Hs.83727 

445176 AI878907 Hs.12379 

436906 H95990 Hs.181244 

427337 Z46223 



Hs.7345:MAD1 mitotic arrest deficient-l
NM_138578:Homo sapiens BCL2-like 1 (BCL2
(locuslink)NM_003661:Homo sapiens apolip
Hs.210628:ESTs

NM_006819:Homo sapiens stress-induced-ph
NM_003668:Homo sapiens mitogen-activated
Hs.39850:uridine kinase-like 1
Hs.290216:ESTs

NM_000239:Homo sapiens lysozyme (renal a
NM_016050:Homo sapiens mitochondrial rib
Hs.1575:small nuclear ribonucleoprotein
NM_007273:Homo sapiens repressor of estr
Hs.94761:KIAA1691 protein
Hs.301226:Homo sapiens, clone IMAGE:3456
Hs.378965:Homo sapiens cDNA FLJ37658 fis
(locuslink)NM_130786:Homo sapiens alpha-
NM_001625:Homo sapiens adenylate kinase
NM_002983:Homo sapiens small inducible c
(locuslink)NM_003842:Homo sapiens tumor
Hs.190161:LR8 protein
NM_001970:Homo sapiens eukaryotic transl
NM_017680:Homo sapiens asporin (LRR clas
NM_001283:Homo sapiens adaptor-related p
I js'inl;)NM_0 1064:H n p i transl
Hs.61289:synaptojanin 2
NM_014765:Homo sapiens translocase of ou
Hs.18 357:1 minii rece ( c 1 (67kC il > 



Hs.269092:ESTs 
NM_018120:Homo sapiens hypothetical prot
Hs.121520:Homo sapiens cDNA FLJ35792 fis
Hs.168159:bifunctional apoptosis regulat
NM_003384:Homo sapiens vaccinia related
NM_002822:Homo sapiens protein tyrosine
Hs.80680:major vault protein 



Hs.27268:Homo sapiens cDNA: FLJ21933 fis
NM_002849:Homo sapiens protein tyrosine
Hs.25661:ESTs, Moderately similar to hyp
NM_006516:Homo sapiens solute carrier fa
NM_018643:Homo sapiens triggering recept
NM_017817:Homo sapiens RAB20, member RAS
NM_017882:Homo sapiens ceroid-lipofuscin
NM_013291:Homo sapiens cleavage and poly
NM_001419:Homo sapiens ELAV (embryonic l
Hs.181244:major histocompatibility compl
NM_000569:Homo sapiens Fc fragment of Ig
NM_005726:Homo sapiens Ts translation el
Hs.43112:Homo sapiens mRNA; cDNA DKFZp43
NM_032828:Homo sapiens ubiquitin UBF-fl
Hs.171802:Homo sapiens, clone IMAGE:3S50

r r nsepf (EFt
NM_002950:Homo sapiens ribophorin I (RPN
NM_032339:Homo sapiens hypothetical prot
NM_030666:Homo sapiens serine (or cystei
Hs.237825:signal recognition particle 72
NM_006743:Homo sapiens RNA binding motif

Hs.100293:O-linked N-acetylglucosamine (
NM_001228:Homo sapiens caspase 8, apopto
Hs.31181:Homo sapiens cDNA: FLJ23230 fis
Hs.12045:C2f protein



410668 


BE379794 


Hs.159651


NM_014452:Homo sapiens tumor necrosis fa




415173 


AW501735 




Hs.180059:Homo sapiens cDNA FLJ31360 fis






AA773820 


Hs.63970 


Hs.63970:ESTs 




427725 


U66839 


Hs.180533 


NM_002756:Homo sapiens mitogen-activated




424408 


AI754813 


Hs.146428 


Hs.146428:collagen, type V, alpha 1 






NM_000507 




NM_000507:Homo sapiens fructose-1,6-bisp 




420187 


AK001714 


Hs.95744 


NM_019028:Homo sapiens hypothetical prot 




431498 




Hs.258551 


NM_012100:Homo sapiens aspartyl aminopep 




411423 


AW845987 


Hs.68864 


(locuslink)NM_139248:Homo sapiens membra 




449964 


AA641636 


Hs.37477 


74 ESI > lysimil T4622I 




423671 


AW860155 




Hs.234101:Homo sapiens, similar to choli 






AA033627 


Hs.21858 








AI937227 










AF189062 




(locusli k)N 1 013334 apiens 1 1 




432465 




Hs.275163 


NM_002512:Homo sapiens non-metastatic ce 






AK000157 


Hs.108502 










Hs.374303 


h lirk)NM 144591'Hcmo sapiens t t 
Hs.181125:immunoglobulin lambda lo 






AW407157 


Hs.181125 






AI498957 


Hs.351937 








AF123303 




NM_013386:Homo sapiens hypothetical prot 












418641 






H96850:yw03b12.s1 Soaresmelanocyte 2f/bH 




BE243136 




NM_001109:Homo sapiens a disintegrin and 






AA296696 


Hs.333418 




















Hs.324507 


! I (- ^ { L 






AI879159 


Hs.108219 


Hs.191379:ESTs 




440246 




Hs.191379 






446770 


AV660309 


Hs.154986 


Hs.154986:ESTs, Weakly similar to PLLP H 






AF038962 








425118 


AU076611 




Hs.154672:methylene tetrahydrofolate deh 






AA534296 


Hs.20953 


Hs.20953:hypothetical protein BC010003 




412867 


AU076861 


Hs.74637 


Hs.74637:testis enhanced gene transcript 


3.05 




AA476966 




NM_016310:Homo sapiens polymerase (RNA) 


3.05 


441238 




Hs.322456 


NM_032039:Homo sapiens hypothetical prot 




408524 


D87942 


Hs.46328 


Hs.46328:fucosyltransferase 2 (secretor 


















Hs.104637 




3M 


432281 


AK001239 


Hs.274263 


Hs.274263:hypothetical protein FLJ10377 














423570 


AW838306 


Hs.129819 


NM_018344:Homo sapiens hypothetical prot 






AA137043 




NM_013374:II piens progr n j I j 






AK000461 


















407687 


AK002011 
AA320525 


Hs.201076 


NM_018339:Homo sapiens hypothetical prot 
Hs.201076:ESTs 






AW797627 


Hs!?06137 


Hs.347459:Homo sapiens cDNA FLJ13900 fis 










Hs.105137:Homo sapiens mRNA for OK/SW-Ct 




448569 


BE382657 




Hs.21486:signal transducer and activator 






BE262943 




Mi a J i >:Hon ' r pi r, c iv^ 1( / in'it , 




439018 


AW300887 






412627 


BE391959 




HsJ4276:chlorideintracellularchannel P 9 




444309 






Hs.10303:calcium and integrin binding 1 




412969 


AI373162 


Hs.75103 


NM_003406:Homo sapiens tyrosine 3-monoox 
Hs.239784:scribble 




430364 


AA954810 


Hs.239784 














421179 


AL050373 




NM_015677:Homo sapiens hypothetical prot 




U72664 




NM_002810:Homo sapiens proteasome (proso 




451798 


BE297567 




Hs.27047:hypothetical protein FLJ20392 






AL037544 


Hs.184298 






447656 


NM_003726 




NM_003726:Homo sapiens src family associ 




436823 


AW749865 








417896 


AA379770 




I r I rl nstcclldeat 










Hs.10574:solute carrier family 2, (facil 






H83265 




Hs.8881:Homo sapiens cDNA FLJ32163 fis, 






AI521130 




(locuslink)NM_144686:Homo sapiens hypoth 






AI024860 


Hs.153591 






442504 


BE503373 


Hs.334335 


NM_022484:Homo sapiens hypothetical prot 9 


3.01 




BE560672 


Hs.13543 


(locuslink)NM_145214:Homo sapiens tripa 


3.01 


436540 


BE397032 




NM_020230:Homo sapiens peter pan homolog 






AI825880 


Hs.5985 


•i 5985:non-l na eO±4 ft ;torprotE 


3.00 


415697 


AI365603 


Hs.279696 


Hs.279696:DKFZP566l1024 protein 


3.00 


437469 


AW753112 


Hs.15514 


Hs.15514:hypothetical protein MGC3260 


3.00 


436319 


H90727 


Hs.5123 


Hs.5123:hypothetical protein BC008246 


3.00 


450126 


BE018138 


Hs.24447 


(locuslink)NM_005866:Homo sapiens type 1 
Hs.29444:putative small membrane protein 


3.00 


445985 


BE621800 


Hs.29444 


3.00 


406868 


AA505445 


Hs.300697 


Hs.300697:immunoglobulin heavy constant 


3.00 


424439 


AA579635 


Hs.1770 


Hs.1770:ligase 1, DNA, ATP-dependent 


3.00 


408452 


AA054683 


Hs.222728 


Hs.222728:Homo sapiens cDNA FLJ39004 fis 


3.00 


417831 


H16423 


Hs.82685 


Hs.82685:CD47 antigen (Rh-related antige 


3.00 


TABLE 11B 
Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

406685 0_0 M18728 

432407 MH1429J2 BG036675 BF772005 BF771866 BG960386 BG960381 NM_005712 AF110315 BE074534 BE182776 BE158000 BE157999 BE714315 AW818104 
AW847519 AA099426 AW817981 AW856396 BG961122 AA224498 AA308542 AW821833 BF902155 AI732411 BG778834 BG283641 BE748279 
BE748870 BG319540 BE748864 BF739224 BG986155 AK057283 BI861466 AA663341 AA457591 BG949294 AW392886 AA071122 AA227849 
AA584918 BG959570 BF773486 AL041698 BF959013 R87170 C16859 BF770411 BF771298 AI075321 L13823 AA216700 BF771864 AW861859 
BE537068 C18935 AA155719 BF771172 BF769107 BF804964 AW818172 AW818143 AW392930 AW817057 AW858044 BF746211 AA179928 
AW861687 AW821826 BI055726 BF242643 AA207189 BF770412 BF771157 BG430030 AA055592 



TABLE 11C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA 
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey 



403218 
403221 
405484 



403739 
403219 
404826 
400750 
400846 
401179 
400847 



5002624 
8918414 
9256114 
8119072 



64338-64517 
58039-58149 

66294-66438,66936-67124 
199214-199579,199672-199920,200262-20049 



163497-163623,164715-164968,155369-16550 



61858-61995 
47726-48046 
198991-199168,199316-199548 
39310-39474 
113477-113893 
44643-44835 
14260-14537 
177372-177674 

116132-116407,116653-116922 
101532-101852,102006-102263 
14403-14602,17000-17147,17241-17368 



Table 12A lists 
GeneChip array such that the ratio of ' 
93rd percentile 



ier compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 

'" ' — - . - ■ ■ cancer level was set I 

s. In order to remove gene- 



UnigeneID: 
Unigene Title: 



TABLE 12A: 1006 GENES UP-REGULATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT TISSUES 
Pkey: Unique Eos probeset identifier number 



Ratio of tumor to normal body tissue 

ExAccn UnigeneID Unigene Title 

U19557 Hs.123035 squamous cell carcinoma antigen 2 (SCCA2) 

NM_001944 Hs.1925 desmoglein 3 (pemphigus vulgaris antigen) 

M13509 Hs.83169 matrix metalloproteinase 1 (interstitial coll 

AI560129 Hs.329062 EST 

W79123 Hs.58561 G protein-coupled receptor 87 

AI591147 Hs.61232 ESTs 

AK001468 Hs.62180 anillin (Drosophila Scraps homolog), actin bi 

NM_006183 Hs.80962 neurotensin 

AF027866 Hs.138202 serine (or cysteine) proteinase inhibitor, cl 

BE545072 Hs.122579 hypothetical protein FLJ10461 
423017 
429432 
415817 
435243 
428664 

421373 
416209 
441459 
447164 
412719 
417366 
431753 
438817 
404996 
443211 
414764 



450221 
422168 
415989 



AF200492 Hs.211238 

X07820 Hs.2258 

AJ001696 Hs.241407 

N78223 Hs.108106 

AW178761 Hs.227948 






414422 
442660 
449003 



438915 
425710 
421308 



Hs.202676 
Hs.78867 
Hs.261373 



AW016610 Hs.129911 

BE185289 Hs.1076 

X76029 Hs.2841 

AI023799 Hs.163242 

AI128388 Hs.143655 

AW013887 Hs.72047 

AA885360 Hs.160199 



interleukin-1 homolog 1 
matrix metalloproteinase 10 (stromelysin 2) 
serine proteinase inhibitor 13 (PI13; serpin 
transcription factor 

serine (or cysteine) proteinase inhibitor, cl 

synaptonemal complex protein 2 

protein tyrosine phosphatase, receptor-type, 

hypothetical protein dJ43401 4.3 

Similar to SALL1 [sal (Drosophila)* 

small inducible cytokine subfamily B (Cys-X-C 



Hs.97258 
Hs.203963 
Hs.312716 



AF053306 Hs.36708 

AA741180 Hs.29879 

X59373 Hs.188023 

AW014875 Hs.137007 

BE440042 Hs.83326 

T11832 Hs.127797 

AA147224 Hs.337232 

AW138174 Hs.130651 

X76342 Hs.389 

AA055611 Hs.226568 



small proline-rich protein 1B (cornifin) 



Target CAT 

IGF-II mRNA-binding protein 3 
ESTs, Moderately similar to S29539 rib 
hypothetical protein FLJ10339 
ESTs, Weakly similar la i 
collagen, type X, alpha 1 
cytochrome P450, subfamily XXIV (vitamin D 24 
cytoskeleton associated protein 2 
S100 calcium-binding protein A7 (psoriasin 1) 
ESTs 

preferentially expressed antigen in melanoma 

hypothetical protein FLJ23537 

budding uninhibited by benzimidazoles 1 (yeas 



NM_007057 

W79117 Hs.58559 ESTs 
AI420183 Hs.171077 



lo HXDAJHUMANHOMEOB 



alcohol dehydrogenase 7 (class IV), mu or sig 
ESTs, Moderately similar to ALU4_HUMAN ALU SU 
Homo sapiens mRNA full length insert cDNA clo 
NM_007057*:Homo sapiens ZW10 interactor (ZWIN 



ESTs, Weakly similar to T21259 hypothetical p 
ESTs, Moderately similar to ALU2_HUMAN ALU SU 
SRY (sex determining region Y)-box 2 
ATP-binding cassette, sub-family C (CFTR/MRP) 
ESTs, Weakly similar to T17330 hypothetical p 
fragile X mental retardation 1 
Homo sapiens cDNA FLJ14236 fis, clone NT2RP40 
regulator of G-protein signalling 20 
KIAA0751 gene product 
TTK protein kinase 
small proline-rich protein 2A 



replication protein A3 (14kD) 



AI368680 Hs.816 

NM_004996 Hs.89433 

AI581344 Hs.127812 

S65791 Hs.89764 

AL079741 Hs.183114 

AF060877 Hs.99236 

AL157579 Hs.153610 

M86699 Hs.169840 

BE182592 Hs.11261 

AI281848 Hs.194691 

NM_016190 Hs.242057 

AL365406 Hs.10268 

NM_002497 Hs.153704 

AA383092 Hs.1608 

BE062109 Hs.241551 

AF015592 Hs.28853 

AA372264 Hs.273193 
AW178963 

U03398 Hs.1524 

L49054 Hs.85195 

AI351010 Hs.102267 

AA814043 Hs.88045 ESTs 
AI624049 gb:ts41a09.x1 NCI_CGAP_UI1 Homo sapiens cDNA 

AI541214 Hs.46320 Small proline-rich protein SPRK [human, odont 

U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (melanom 

AW900372 Hs.180793 ESTs, Weakly similar to S65657 alpha-1C-adren 

AA907027 Hs.128506 ESTs 

R84810 Hs.30464 cyclin E2 

AL157425 Hs.133315 Homo sapiens mRNA; cDNA DKFZp761J1324 (from c 

BE566742 Hs.58189 highly expressed in cancer, rich in leucine h 

AI553633 Hs.337139 ESTs 

AA280174 Hs.285681 Williams-Beuren syndrome chromosome region 14 

AF030880 Hs.159275 solute carrier family, member 4 

AA687322 Hs.192843 leucine zipper protein FKSG14 
CDC7 (cell division cycle 7, S. cerevisiae, h 
hypothetical protein FLJ10706 
gb:WR0-ST0032-200899-001-b11 ST0032 Homo sapi 



435159 
439232 
437616 



446364 
447254 
414148 
429548 
423725 
450192 
450149 
431941 
427821 



409719 
444342 
413573 



447342 
403471 
409041 
407339 
415652 
420900 
444271 
410153 
448693 
431622 
457405 
424345 
414812 
425734 
446435 
421948 
419335 
429259 
406747 
453004 
423735 
421773 



427217 
441820 
437958 



AA668879 Hs.116649 

N48590 Hs.46693 

AI797163 Hs.207954 

AB027249 Hs.104741 

AF077374 Hs.139322 

AA397769 Hs.86617 

AB006624 Hs.14912 

NM_004153 Hs.17908 

AW138872 Hs.135288 

AJ403108 Hs.132127 

AA263143 Hs.24596 

AW969781 Hs.132863 

AK000106 Hs.272227 
AA470158 



Target Exon 

PDZ-binding kinase; T-cell originated protein 

small proline-rich protein 3 

ESTs 

KIAA0286 protein 

origin recognition complex, subunit 1 (y 
gb:PM0-BT0651-270400-003-f02 BT0651 Homo sapi 
ESTs 

hypothetical protein LOC57822 

RAD51 -interacting protein 

Zic family member 2 (odd-paired Drosophila ho 

Homo sapiens cDNA FLJ20099 fis, clone COL0454 



Hs.334828 hypothetical protein FLJ10719; KIAA1794 prote 



AW664214 Hs.196729 

AI912012 Hs.200737 

AI769160 Hs.108681 

NM_014398 Hs.10887 



similar to lysosome-associated membrane glyco 



D30783 Hs.115263 epiregulin 



AB033025 Hs.50081 

AA045144 Hs.161566 

T79213 Hs.272073 

AL045633 Hs.44269 

AW452569 Hs.149804 

BE311926 Hs.15830 

AW004854 Hs.228320 

AW979271 Hs.293184 ESTs 
AA504860 

AK001380 Hs.145479 

X72755 Hs.77367 

AF056209 Hs.159396 

AW206737 Hs.253582 

L42583 Hs.334309 

AW960146 Hs.284137 

AA420450 Hs.292911 

AI925153 Hs.217493 

AA355925 Hs.36232 



gb:IL3-CT0219-261099-023-D11 CT0219 Homo sapi 
Homo sapiens, Similar to RIKEN cDNA 2010317E2 
Target Exon 
KIAA1199 protein 



ESTs 



W69233 Hs.112457 

AW972024 Hs.142653 

AK001675 Hs.24321 

AA397679 Hs.3991 

AW885757 Hs.257862 

AA481396 Hs.105167 

AL039379 Hs.209602 

AA399272 Hs.144341 

AA969119 Hs.143502 

BE139550 Hs.121668 

AA486293 Hs.272068 

H81213 Hs.14825 

M34455 Hs.840 

D90070 Hs.96 

AA142989 Hs.71730 

AA090421 Hs.5555 

U17760 Hs.75517 

AW451103 Hs.71371 

AI970536 Hs.16603 

AA159804 Hs.149305 

NM_005401 Hs.159238 

BE241831 Hs.172330 

243758 Hs.26037 ESTs 

W07318 Hs.240 



hypothetical protein FLJ12888 
ESTs, Highly similar to S60712 band-6-protein 
annexin A2 

KIAA0186 gene product 

gb:EST33963 Embryo, 12 week II Homo sapiens c 



ESTs, Weakly similar to ubiquitous TPR motif, 



ESTs, Weakly similar to en 
ESTs, Moderately similar to PC4259 ferritin a 
ESTs, Weakly similar to ALU3 HUMAN ALU SUBFAM 
l kly sir to KIAA1 503 prote'n [I 



phorbol-12-myristate-13-acetate-induced prote 



laminin, beta 3 (nicein (125kD), kalinin (140 
ESTs 

hypothetical protein FLJ13163 
hypothetical protein MGC2603 
protein tyrosine phosphatase, non-receptor ty 
hypothetical protein MGC2705 



428365 
407746 
452401 

451307 
441531 
418663 
410659 
432869 



AI278360 Hs.31409 

BE263782 Hs.77695 

BE379623 Hs.27693 

AA295331 Hs.183861 

AK001962 Hs.38114 

NM_007115 Hs.29352 

X81334 Hs.2936 

AW293207 Hs.211516 

AW291239 Hs.271111 

AK001100 Hs.41690 

AI080175 Hs.68826 
AW974094 



gb:EST388262 MAGE resequences, MAGN Homo sapi 
ESTs 

KIAA0008 gene product 

peptidylprolyl isomerase (cyclophilin)-like 1 

Homo sapiens cDNA FLJ20042 fis, clone COL0042 

hypothetical protein FLJ11100 

tumor necrosis factor, alpha-induced protein 

matrix metalloproteinase 13 (collagenase 3) 

ESTs 



MAGM Homo sapi 
s1 NCI_CGAP_Pr12 Homo sapiens cDNA 
liens full length insert cDNA clone 
lectin, galactoside-binding, soluble, 7 (gale 






Homo sapiens cDNA FLJ10570 fis, clone NT2F 
dickkopf (Xenopus laevis) homolog 1 
RAB6 interacting, kinesin-like (rabkinesin 6) 



AW381532 Hs.135188 

NM_003686 Hs.47504 

AA017492 Hs.135655 

AK001432 Hs.153408 

AA336797 Hs.40499 

AA219691 Hs.73625 

AI831190 Hs.166676 

AW023482 Hs.97849 

NM_002407 Hs.97644 

AW514083 Hs.190135 

AI798994 Hs.152923 

AI699933 Hs.192175 

AA628608 Hs.61656 

AI205540 Hs.281295 

BE536911 Hs.234545 

AW872527 Hs.59761 

AA991355 Hs.298312 

AI928402 Hs.6933 

AW182766 Hs.273558 



AL035864 Hs.69517 cDNA for differentially expressed C01 5 gene 

AB009303 Hs.90800 matrix metalloproteinase 16 (membrane-inserte 

AA481269 Hs.108660 ATP-binding cassette, sub-family C (CFTR/MRP) 
NM_018833:Homo sapiens transporter 2, ATP-bi 

AI432364 Hs.160594 ESTs 

AA576953 Hs.22972 

AW238299 Hs.250618 

AI932903 Hs.211636 

W47596 Hs.169300 



hypothetical protein NUF2R 
ESTs, Weakly similar to DAP1_HUMAN DEATH-ASSO 
hypothetical protein DKFZp434A1315 
hypothetical protein FLJ12684 



AW291168 Hs.41296 

AI202703 Hs.152414 

AW388633 Hs.6682 

AW959908 Hs.1690 

NM_003245 Hs.2022 

AW016936 Hs.233364 

D60730 Hs.57471 



NM_006306 Hs.211602 

AI337228 Hs.197083 

AA631492 Hs.23921 

AA299264 Hs.57776 

H87879 Hs.102267 

T89697 Hs.16645 

R53734 Hs.25978 

BE391804 Hs.62661 

BE003054 Hs.1695 

AI346487 Hs.28739 

AA857025 Hs.8878 

AA009647 Hs.8850 

AA204664 Hs.182437 

AB023021 Hs.226945 

AW630803 Hs.89497 

AA810278 Hs.24250 
AA099020 

N69243 Hs.192974 

AA166950 Hs.195870 

AA218940 Hs.137516 

AL359055 Hs.67709 

AA356792 Hs.334824 

AW301344 Hs.122908 

AB035898 Hs.150587 



AK000776 Hs.128753 

AJ243662 Hs.110196 

AL137379 Hs.47125 

AK002032 Hs.272245 

AA010056 Hs.242998 

BE568452 Hs.5101 

NM_003866 Hs.153687 



transforming growth factor, beta 2 
gb:PT2.1_6_G03.r tumor2 Homo sapiens cDNA 3', 
ESTs, Weakly similar to MUC2_HUMAN MUCIN 2 PR 



hypothetical protein DKFZp547A023 



ESTs, Weakly similar to 2109260A B cell growt 
guanylate binding protein 1, interferon-induc 
matrix metalloproteinase 12 (macrophage elast 



gb:zn45h01.s1 Stratagene HeLa cell s3 937216 
hypothetical protein FLJ12735 
hypothetical protein FLJ14991 
fidgetin-like 1 

Homo sapiens keratin 17 (KRT17) 

Homo sapiens mRNA full length insert cDNA clo 

hypothetical protein FLJ14825 

DNA replication factor 

kinesin-like protein 2 

gb:QV3-DT0044-221299-045-b09 DT0044 Homo sapi 
Target Exon 
Target Exon 

Homo sapiens cDNA FLJ20769 fis, clone COL0667 
NICE-1 protein 

hypothetical protein FLJ13912 

Homo sapiens cDNA FLJ11170 fis, clone PLACE10 

protein regulator of cytokinesis 1 
gb:yn72d12.r1 Soares adult brain N2b5HB55Y Ho 



444781 
429170 
414035 



425916 
405420 
416320 



414132 
424012 
411835 
433330 
428613 
425921 
447078 
434699 
428758 
405708 
433405 
456443 



U64198 Hs.73165 

AA125985 Hs.56145 

AK001379 Hs.121028 

AA489041 Hs.295448 

AI769400 Hs.189729 

NM_013989 Hs.154424 

H50621 Hs.134156 

AA033813 Hs.79018 

BE296227 Hs.250822 

NM_014400 Hs.11950 



GPI-anchored mt 
dual specificity phosphatase 4 
serine (or cysteine) proteinase inhibitor, cl 
AF15q14 protein 



laminin, gamma 2 (nicein (100kD), kalinin (10 



Hs.75716 

AA662240 Hs.283099 

AI283125 Hs.150009 

U46258 Hs.339665 

NM_006786 Hs.162200 

Z15008 Hs.54451 

H47867 Hs.34024 

M30703 Hs.270633 

BE660200 Hs.127197 ESTs 

AI801235 Hs.48480 ESTs 

AW368377 Hs.137569 tumor protein 63 kDa with strong homology to 

U29343 Hs.72550 hyaluronan-mediated motility receptor (RHAMM) 

AW207084 Hs.132810 hypothetical protein MGC14801 

AB037749 Hs.186928 KIAA1328 protein 

NM_007231 Hs.162211 solute carrier family 6 (neurotransmitter tra 

AW885727 Hs.301570 ESTs 

AA643687 Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HEMBB10 

AA433988 Hs.98502 hypothetical protein FLJ14303 
Target Exon 



AW96^500 

Y00272 

AI097144 

M31126 
AW948343 
AW195285 
BE380149 
AW449679 
AI791495 
AI174456 



;.133543 ESTs 



cell division cycle 2, G1 to S and G2 to M 
ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAM 
Homo sapiens clone TCCCTA00151 mRNA sequence 
pregnancy specific beta-1-glycoprotein 9 
gb:RCO-MT0015-130400-031-c01 MT0015 Homo sapi 
ESTs, Weakly similar to I38022 hypothetical p 
ESTs, Weakly similar to T33188 hypothetical p 
n pier XGmRt ' cl ne PEP11) 
calmodulin-like skin protein 
ESTs, Moderately similar to I38022 hypothetic 




458116 
444105 
426010 



402048 
427025 
423515 



455310 AW893961 

418696 AW959433 

426642 AW068223 

423738 AB002134 

448243 AW369771 

411559 BE144081 

423553 AA405635 

457030 AI301740 

424115 AA335497 



gb:QVO-CT0387-170200-121-c05 CT0387 Homo sapi 
Hs.270845 kinesin-like 5 (mitotic kinesin-like protein 
Hs.94210 eyes absent (Drosophila) homolog 1 
Hs.23920 hypothetical protein FLJ11105 
Hs.47367 KIAA1785 protein 
Hs.166597 ESTs 

Hs.1975 hypothetical protein FLJ21007 
Hs.55279 serine (or cysteine); 
Hs.83765 dihydrofolate reductase 
Hs.336753 EST 

Target Exon 
Hs.97523 ESTs 

Hs.134795 Homo sapiens cDNA FLJ11180 fis, clone PLACE10 

gb:RC4-NN0027-060400-011-d11 NN0027 Homo sapi 
Hs.326290 hypothetical protein FLJ12581 
Hs.171581 ubiquitin C-terminal hydrolase UCH37 
Hs.132195 ' 



gb:MR0-HT0165-030200-007-d06 HT0165 Homo sapi 
Hs.96854 ESTs, Weakly similar to DYLX_HUMAN CYTOPLASMI 
Hs.173381 dihydropyrimidinase-like 2 
Hs.293965 ESTs, Weakly similar to I38022 hypothetical p 
Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PLACE10 
Hs.152759 activator of S phase kinase 
Hs.124244 ESTs 

Hs.5662 guanine nucleotide binding protein (G protein 




Homo sapiens mRNA; cDNA DKFZp761G2123 (from c 



gb:EST383212 MAGE resequences, MAGL Homo sapi 



AW270404 Hs.193161 

H06382 Hs.21400 

AA866115 Hs.127797 
AI422017 

AA442153 Hs.104744 
AA460421 

H56037 Hs.108146 
AW975920 

AA335714 

BE561665 Hs.177677 

H03754 Hs.152213 

AA122393 Hs.70811 

AI143139 Hs.2288 

AA385751 Hs.196379 

AF113013 Hs.278919 

Z36830 Hs.87268 

AA769578 Hs.90488 

M62246 Hs.35406 

AA831850 Hs.58149 

AJ224172 Hs.204096 

AW016824 Hs.255527 

BE613126 Hs.47783 

N71183 Hs.121806 

AI791749 Hs.128896 
H85310 



.30875 ESTs 



Homo sapiens cDNA FLJ11381 fis, clone HEMBA10 
gb:tt45f12.x1 NCI_CGAP_Brn23 Homo sapiens cDN 
hypothetical protein DKFZp434J0617 



gb:CM2-HT0176-041099-017-c02 HT0176 Homo sapi 

i component Rrp40 
. -typeMMTVintegratio .. . 
NMJ25001':Homo sapiens hypothetical protein 
— hFU20516 



ESTs, Highly sir 
hypothetical protein MGC14136 
lipophilin B (uteroglobin family member), pro 
hypothetical protein MGC14128 
B aggressive lymphoma gene 
Homo sapiens cDNA FLJ11971 fis, clone HEMBB10 
ESTs 

ESTs, Weakly sin 
ENSP000G0016943-:CONA 
gb:Homo sapiens skin-specific protein (xp33) 
Target Exon 

keratin 1 (epidermolytic hyperkeratosis) 



gb:Homo sag 



iin alpha 3b chain mRNA, 




Rh type C glycoprotein 

U5 snRNP-specific 40 kDa protein (hPrp8-bindi 
(MOR/TAP)(TAP2) 

Homo sapiens clone TCCCTA00142 mRNA sequence 
ESTs 

gb:ze45b01.r1 Soares retina N2MHR Homo sapie 



Homo sapiens cDNA: FLJ21869 fis, clone HEP024 



423441 
449978 
410784 
409582 
428182 
458446 
433040 



BE244318 Hs.213194 

AW003582 Hs.226414 

BE465884 Hs.280728 

AI655494 Hs.195718 

AB006532 Hs.31442 

AA693537 Hs.321411 
R64719 

N48769 Hs.44609 

AI188613 Hs.41690 

NM_004833 Hs.105115 
AI702055 

W33191 Hs.28907 

AA31C693 Hs.87329 

BE379594 Hs.49136 

R68649 Hs.278359 

AI806335 Hs.200829 

R27430 Hs.271565 

BE386042 Hs.293317 

AW394104 Hs.43744 

H70423 Hs.300511 

AA987351 Hs.184993 



-11 Homo sapiens cDNA clone 22d 



ESTs, Moderately similar to ALU7_HUMAN ALU SU 
400441 
440138 
417830 
427131 
413278 



424639 
419196 
434321 
426514 
427335 
413472 
408000 
426749 



424927 
108591 
403867 



AB033023 Hs.318127 
AW504786 Hs.122579 
AA448460 Hs.112017 



AI631874 Hs.155140 

AI917494 Hs.9812 

AF110908 Hs.297660 
AA629368 

BE616633 Hs.170195 

AA448542 Hs.251677 

BE242870 Hs.75379 

L11690 Hs.620 

AW842283 Hs.79933 

AI253123 Hs.127356 

AW973666 Hs.153850 

AF015224 Hs.46452 

AA437199 Hs.656 

BE252383 Hs.184668 
AA492588 

AA296961 

AB011142 Hs.114293 

"■■ - Hs.12449 

N77221 Hs.187824 



potassium voltage-g 
hypothetical protein FLJ10483 
gb:QV1-BT0631-180200-078-c08 BT0631 Homo sapi 
B-cell growth factor 1 (12kD) 
hypothetical protein FLJ10201 
hypothetical protein FLJ10461 
GE36 gene 

interferon-stimulated protein, 15 kDa 

gb:EST380198 MAGE resequences, MAGJ Homo sapi 

SMC2 (structural maintenance of chromosomes 2 

casein kinase 2, alpha 1 polypeptide 

Homo sapiens cDNA FLJ14388 fis, clone HEMBA10 

TNF receptor-associated factor 3 

gb:zu78a11.s1 Soares_testis_NHT Homo sapiens 

bone morphogenetic protein 7 (osteogenic prot 



cell division cycle 25C 
SBBI31 protein 

gb:ng99c08.s1 NCI_CGAP_Thy1 Homo sapiens cDNA 
NM_005557*:Homo sapiens keratin 16 (focal non 
gb:EST112514 Adrenal gland tumor Homo sapiens 
KIAA0570 gene product 

" ' protein HTMP10(HT 




430704 
419807 
451778 



418203 
453775 
417009 



AW963372 Hs.46677 

C05766 Hs.181022 

AW290975 Hs.118923 

AW445166 Hs.170802 
T97307 

NM_002776 Hs.69423 

AA393351 Hs.132121 



M63193 Hs.73946 

AA829976 Hs.239114 

T65617 Hs.101257 

AI379772 Hs.99206 

AW813091 Hs.335799 
R77402 

AI826131 Hs.71243 

AI924533 Hs.105607 



gb:nc16e03.s1 NCI_CGAP_Pr1 Homo sapiens cDNA 
gb:RC2-CT0304-080100-011-h12 CT0304 Homo sapi 
PRO2000 protein 
CGI-07 protein 



gb:ye53h05.s1 Soares fetal liver spleen 1NFLS 
kallikrein 10 

NM_002425:Homo sapiens matrix metalloproteina 



gb:QV2-LT0038-270300-10&-d12 LT0038 Homo sapi 
endothelial cell growth factor 1 (platelet-de 
mannosidase, alpha, class 1A, member 2 



gb:yi75fl 1.s1 Soares placenta Nb2HP Homo sapi 
ESTs, Weakly similar to zinc finger protein [ 
bicarbonate transporter related protein 1 



BE280074 Hs.23960 cyclin B1 



X54942 

NM_002916 Hs.35120 

AA191719 Hs.314714 

AA045648 Hs.301957 

AW136973 Hs.288516 

AA878200 Hs.118727 

AJ010901 Hs.198267 
AA828995 



AF105366 Hs.4876 

Hs.170623 
AA557191 Hs.55028 
Hs.171176 

AI201849 



gb:CM0-HT0323-151299-126-b04 HT0323 Homo sapi 



nudix (nucleoside diphosphate linked moiety X 
ESTs, Weakly similar to S69890 mitogen induci 
Homo sapiens cDNA FLJ13692 fis, clone PLACE20 




Hs.61210 
AF217513 Hs.279905 



gb:od77bC8.s1 NCI_CGAP_Ov2 Homo sapiens cDNA 
Target Exon 

solute carrier family 12 (potassium/chloride 

hypothetical protein FLJ11183 

ESTs, Weakly similar to I54374 gene NF2 prote 

ESTs 

gb:qs76g04.x1 NCI_CGAP_Pr28 Homo sapiens cDNA 
Homo sapiens mRNA; cDNA DKFZp566A1046 (from c 
G protein-coupled receptor 
selectin E (endothelial adhesion molecule 1) 
Homo sapiens cDNA: FLJ23228 fis, clone CAE066 
KIAA0761 protein 

C7001692*:gi|6724096|gb|AAF26844.1|(AF195021 
gb:yg18h11.r1 Soares infant brain 1NIB Homo s 
E i \ Ml I380 hyp lot! £ p 
Target Exon 

clone HQ0310PRO0310p1 
443903 
432015 
447153 

421379 



Homo sapiens mRNA; cDNA DKFZp5B600724 (from c 






414774 
412246 
412903 
431958 



418462 
424687 
405196 



Hs.33106 ESTs 






437641 
414761 
434208 



408296 
447815 
441974 
446474 



X02419 
AI160873 Hs.69233 
BE007967 Hs.155795 



J05070 Hs.151738 
AJ003631 

NM_014125 Hs.279812 

BE280486 Hs.84045 

AW207091 Hs.72307 

AW958037 Hs.286 

AK001128 Hs.210297 

AI939461 Hs.161370 



ESTs 

gb:zm95b09.r1 Stratagene colon HT29 (937221) 
integrin, beta 4 

matrix metalloproteinase 9 (gelatinase B, 92k 
C2000662':gi|7512792|pir||T12482 hypothetical 
gb:AJ003631 Selected chromosome 21 cDNA libra 



hypothetical protein FLJ20288 
ESTs 

ribosomal protein L4 

Homo sapiens cDNA FLJ10266 fis, clone HEMBB10 
ESTs 

gb:IL2-BT0731-240400-069-C03 BT0731 Homo sapi 




BE326588 Hs.141454 
AW241510 Hs.252713 
BE145808 



AA573263 Hs.120860 

Z15005 Hs.75573 

AA804258 Hs.123229 

AW813731 Hs.159153 

AA938471 Hs.54431 

BE562088 Hs.108196 

T82427 Hs.194101 

AW020782 Hs.79881 



AK000883 Hs.301645 

AF098158 Hs.9329 

AA811452 Hs.291911 

AU077228 Hs.77256 

T92641 Hs.127648 

AW514472 Hs.238415 

AA581322 Hs.4213 

Y12812 Hs.24422 

Y09763 Hs.22785 

AL359053 Hs.57664 

AW411307 Hs.114311 

AI734009 Hs.127699 

AW972542 Hs.289008 

AW977534 Hs.151469 
AW970622 

U89995 Hs.159234 
BE178323 

AA291101 Hs.33020 

AL080207 Hs.134585 

AL117452 Hs.44155 

AI432199 Hs.247084 

AI683782 Hs.128245 

AI301227 Hs.150186 

AI948607 Hs.264680 

BE379761 Hs.14248 

AW600291 Hs.6823 



gb:PM4-SN0020-010400-008-b09 SN0020 Homo sapi 
gb30g7 Human retina cDNA randomly primed sub 
Target Exon 
ESTs 

ESTs, Weakly similar to SFR4_HUMAN SPLICING F 



gb:MR0-HT0208-10129S-10M11 HT0208 Homo sapi 
neurotrophin 3 

C4000647':gi|4758508|ref|NP_004253.1| airway 
ESTs 

centromere protein E (312kD) 
ESTs 

ESTs, Moderately similar to S65657 alpha-1C-a 
specific granule protein (28 kDa); cysteines 
HSPC037 protein 

Homo sapiens cDNA: FLJ20869 fis, clone ADKA02 
Homo sapiens cDNA: FLJ23006 fis, clone LNG004 
Target Exon 

gb:CM4-NT0007-130500-551-f06 NT0007 Homo sapi 
Homo sapiens cDNA FLJ10021 fis, clone HEMBA10 

ESTs 

enhancer of zeste (Drosophila) homolog 2 
hypothetical protein PRO2176 
ESTs, Moderately similar to ALU8_HUMAN ALU SU 
hypothetical protein MGC16207 
regulatory factor X-associated protein 
gamma-aminobutyric acid (GABA) A receptor, ep 
Homo sapiens mRNA full length insert cDNA clo 
CDC45 (cell division cycle 45, S.cerevisiae, 
KIAA1603 protein 

Homo sapiens cDNA: FLJ21814 fis, clone HEP010 
calcium/calmodulin-dependent serine protein k 
gb:EST382704 MAGE resequences, MAGK Homo sapi 
forkhead box E1 (thyroid transcription factor 
gb:RC3-HT0600-240400-023^05 HT0500 Homo sapi 
IMAGE:3S39163, mRNA, part 



447701 BE619526 Hs.255527 hypothetical protein MGC14128 

443648 AI085377 Hs.143610 ESTs 

414696 AF002020 Hs.76918 Niemann-Pick disease, type C1 

449441 AI656040 Hs.196532 ESTs 

458145 AI239457 Hs.130794 ESTs 

444588 AI221321 Hs.167559 ESTs 

450832 AW970602 Hs.105421 ESTs 

449425 AW103433 Hs.195684 ESTs 

440953 AI683036 Hs.124135 Homo sapiens cDNA FLJ13051 fis, clone NT2RP30 

422158 L10343 Hs.112341 protease inhibitor 3, skin-derived (SKALP) 

416120 H46739 gb:yo14h02.s1 Soares adult brain N2b5HB55Y Ho 

425081 X74794 Hs.154443 minichromosome maintenance deficient (S. cere 

404107 Target Exon 

419563 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HEMBB10 
417168 AL133117 Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1121 (from c 
AA625279 Hs.26892 uncharacterized bone marrow protein BM040 






BE178065 



ESTs 



AI651474 Hs.163944 ESTs 






KIAA0196 gene product 
sulfotransferase family, cytosolic, 2B, membe 
cofactor required for Sp1 transcriptional act 
Target Exon 
Target Exon 

heat shock protein (hsp110 family) 

gb:zk15e04.s1 Soares_pregnant_uterus_NbHPU Ho 

ESTs 



Hs.335018 
Hs.271691 
Hs.156457 



hypothetical protein FLJ22408 

gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDN 

transcription factor BMAL2 

Homo sapiens, Similar to RIKEN cDNA 281 002701 

ESTs 

gb:ln07g03.x1 NCI_CGAP_Bm25 Homo sapiens cDN 



424441 
417734 
449676 
445425 
427061 
433584 
444477 
446255 
400612 
450841 
410561 
433871 
401994 
449272 
409703 
400250 
408015 
436414 
432220 
420831 
433644 
447313 
428262 
452778 



AL040021 Hs.252674 
BE075114 

AA161071 Hs.71465 



:i_CGAP_Pr 



W02410 Hs.205555 



411098 
435399 
431070 
427986 



AW136771 Hs.244349 

BE264633 Hs.143638 

AI571306 Hs.232224 

AA280824 Hs.190035 
AW342028 

U92981 Hs.18081 

AI651324 Hs.7298 

R71338 Hs.5921 

H60720 Hs.81892 

AW411194 Hs.85195 
AI493076 

R72694 Hs.7720 

AA393167 Hs.41294 

AA709270 Hs.136672 

AK000517 Hs.6844 

AW817238 
AA679463 

AW408164 Hs.249184 

N45214 Hs.282387 

AW105231 Hs.192035 

AW748078 Hs.214410 



gb:nh30c04.s1 N 1 
ESTs, Weakly sir 
gb:PM1-BT0585-1 10200-003^:1 1 BT0585 He 
squalene epoxidase 
H2Ahistone family, mer 
ESTs 



C10001034:gi|7513113|pir||T13078 KIAA0992 pro 
ESTs 

Homo sapiens cDNA FLJ22044 fis, clone HEP091 
ESTs 

Target Exon 
ESTs 

2'-5'-oligoadenylate synthetase 3 (100 kD) 
Eos Control 

epidermal differentiation complex protein lik 

WD repeat domain 4 

ESTs 

gb:hb75d03.x1 NCI_CGAP_Ut2 Homo sapiens cDNA 
Homo sapiens clone DT1 P1B6 mRNA, CAG repeat r 
biphenyl hydrolase-like (serine hydrolase; br 
Homo sapiens cDNA: FLJ21592 fis, clone COL070 
KIAA0101 gene product 
myeloid leukemia factor 1 

■ " !family1,memberC2(dihy 
dynein, cytoplasmic, heavy polypeptide 1 



412279 BE245511 



EST 

hypothetical protein FLJ20510 

C1001383*:gi|538695|pir||A61183 hypothetical 

gb:QVO-ST0247-090200-105-b07 ST0247 Homo sapi 

gb:ac50c03.s1 Stratagene hNT neuron (937233) 

transcription factor 19 (SC1) 

Homo sapiens cDNA: FLJ21837 fis, clone HEP016 

ESTs 

ESTs, Weakly similar to MUC2_HUMAN MUCIN 2 PR 
gb:TCBAP1D3235 Pediatric pre-B cell acute lym 
ENSP00000211621-:Keratin, type II cytoskeleta 
gb:EST33985 Embryo, 12 week II Homo sapiens c 



WO 03/042661 PCT/US02/36810 



AW895387 gb:QV4-NN0038-300300-157-c10 NN0038 Homo sapi 

AJ404672 Hs.334483 hypothetical protein FLJ23571 

AW303544 Hs.118654 ESTs 

AU076609 Hs.2934 ribonucleotide reductase M1 polypeptide 

NM_006769 Hs.3844 LIM domain only 4 

AW263155 Hs.14559 hypothetical protein FLJ10540 

AW945964 gb:QV0-ET0001-050500-228-e09 ET0001 Homo sapi 

BE005252 Hs.321583 Homo sapiens cDNA FLJ20779 fis, clone COL0507 

U95031 Hs.102482 mucin 5, subtype B, tr 



NM_013409 Hs.9914 follistatin 



endothelial zinc linger protein induced by tu 
Homo sapiens, clone IMAG&3535294, mRNA, part 
Target Exon 






ESTs, Weakly similar to fatty acid omega-hydr 
gb:qn14d12.x1 NCI_CGAP_Lu5 Homo sapiens cDNA
syntrophin, gamma 2 

Homo sapiens mRNA; cDNA DKFZp761 C029 (from cl 

hypothetical protein FLJ23468 

ESI „ Klj imilarl 022 hypothetical p 



Ig superfamily receptor LNIR 



ESTs, Weakly similar to I38022 hypothetical p



AJ003029 Hs.65792 

BE393822 Hs.29645 

BE093589 Hs.38178 

W31489 Hs.95044 

AF124250 Hs.6564 

AF160477 Hs.61460 

AA782163 Hs.293502 

AF086453 Hs.58611 

AW152225 Hs.165909 

AA089474 Hs.272153 

NM_002462 Hs.76391

AI668594 Hs.176588 

AF027219 Hs.9443 

AI684969 Hs.46772 



AI436323 Hs.31141 Homo sapiens mRNA for KIAA1568 protein, parti

NM_006042 Hs.48384 heparan sulfate (glucosamine) 3-O-sulfotransf

AA256213 Hs.72010 ESTs

AK000512 Hs.69388 hypothetical protein FLJ20505

AA005129 gb:zhS0h03.r1 Soares fetal liver spleen 1NFLS

AH27076 Hs.334473 hypothetical protein DKFZp56401 278

AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-conjugat

AF173937 Hs.109494 secreted protein of unknown function

H20669 Hs.35406 ESTs, Highly similar to unnamed protein produ

BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothetical p

AF226667 Hs.58553 CTP synthase II

AI538323 Hs.52620 integrin, beta 8

AU077025 Hs.265827 interferon, alpha-inducible protein (clone IF

AI620463 Hs.293984 hypothetical protein MGC13102



AA584062 Hs.272798 



AA027952 Hs.165216 ESTs 



protein FU2041 3 
NM_002864:Homo sapiens pregnar 
ESTs, Weakly similar to S34159 trai 
XAGE-1 protein 

ESTs, Weakly similar to ALULHUMAN ALU SUBFAM 
minichromosome maintenance deficient (mis5, S 

poly(A)-binding protein, cytoplasmic 1-like



gb:RC4-BT0311-250200-014-h06 BT0311 Homo sapi 



BE567665 Hs.288550 Homo sapiens cDNA: 
AI160011 Hs.193341 ESTs



AL080098 

AW579207 Hs.304666 

AA323037 Hs.128645

AI826645 Hs.211534

NM_000246 Hs.3076

AA776564 Hs.41891

AW450826 Hs.127786

AW293165 Hs.143134 

AI225235 Hs.288300 

AW591528 Hs.202072 



Homo sapiens cDNA FLJ11469 fis, clone HEMBA10
gb:RC3-CT0297-290100-013-d03 CT0297 Homo sapi
coagulation factor III (thromboplastin, tissu
collagen, type V, alpha 1 
ESTs 

Target Exon 

gb:Homo sapiens mRNA; cDNA DKFZp564C1072 (fro 
51 ,V-,siT:lartol78885senne/Uireoni 



hypothetical protein FLJ23231
AW295647 Hs.71331

AI611378 Hs.192610 

AW025980 Hs.138965 

NM_006033 Hs.65370

BE250716 Hs.87614 

S73265 Hs.1473 

BE514982 Hs.38991 

AI267371 Hs.172636 

AW749511 Hs.301554 

AW294416 Hs.144687 

AA477609 Hs.89563 



413273 
439846 
429597 



444172 
406753 
415747 
419875 






435347 
457339 
417398



ESTs, Weakly similar to RS1A_HUMAN 40S RIBOSO 
gamma-aminobutyric acid (GABA) A receptor, pi
thyroid hormone receptor interactor 13 



ESTs, Weakly similar to I38022 hypothetical p 



ESTs, Weakly similar to AF133298 1 cytochrome 
Homo sapiens cDNA FLJ12981 fis, clone NT2RP20
nuclear cap binding protein subunit 1, 80kD
ESTs, Weakly similar to I38022 hyf
U5 snRNP-specific protein, 116 kD
stem-loop (histone) binding protein 
hypothetical protein FL 



Hs.188751



407287 
448275 
412977 
431721 



NM_014642 Hs.169387

U31116 Hs.77601 

AL136877 Hs.50758 

X99226 Hs.284153 

AA446869 Hs.119316

AW405731 Hs.18498 

AI681545 Hs.152982 

BE147740 Hs.104558 

AA505665 Hs.217493 
AA381209 

AA853410 Hs.93557 

AW084176 Hs.223296 

AF052573 Hs.241517 

L36463 Hs.1030 
H62953 
AW974175 
AW014873 

AW971949 Hs.291252

N78541 Hs.177366 

AF123050 Hs.44532 

AW444828 Hs.184323 

AK000093 Hs.46821 
AW178363 

BE614410 Hs.23044 

AF072873 Hs.114218 

AI422023 Hs.161338 

AI079496 Hs.134169 

AF075312 Hs.236760 

BE091926 Hs.16244 

AW904466 Hs.321197 

AA581386 Hs.73452 

N69546 Hs.44563 

AI678812 

BE514434 Hs.20830 

AA125910 Hs.191461 



dual specificity phosphatase 10 
gb:RC6-BT0627-140200-01 1-A04 BT0627 Homo sapi 
gb:RCO-DT0076-1 1 01 00-031 -<i 10 DTQQ76 Homo sapi 



gh-.RC2-BT0318-1101 
KIAA0036 gene product 
sarcoglycan, beta (43kD dystrophin-associated
SMC4 (structural maintenance of chromosomes 4
Fanconi anemia, complementation group A
ESTs

Homo sapiens cDNA FLJ12277 fis, clone MAMMA10
hypothetical protein FLJ13117
NM_001059*:Homo sapiens tachykinin receptor 3
ESTs, Moderately similar to I38022 hypothetic 

gb:EST94257 Activated T-cells I Homo sapiens



ESTs, Weakly similar to ZN91_HUMAN ZINC FINGE



gb:RC3-HT0105-010999-002-H06 HT0105 Homo sapi 
RAD51 (S. cerevisiae) homolog (E coli RecA ho
frizzled (Drosophila) homolog 6 
Target Exon 



Homo sapiens clone HQ0262 

mitotic spindle coiled-coil related protein 

Target Exon 

PDZ domain protein (Drosophila inaD-like) 
hypothetical protein MGC10791 
hypothetical protein 

Homo sapiens cDNA FLJ12149 fis, clone MAMMA10
gb:tu59d08.x1 NCI_CGAP_Gas4 Homo sapiens cDNA 
kinesin-like 2 
ESTs 



AF260257 Hs.131917



AF231022 Hs.15 

AA862231 Hs.334443 

BE153855 Hs.61460 

AB001325 Hs.234642 

BE395085 Hs.10086 

AV646388 Hs.93961 

BE207573 Hs.83321 

AW969626 Hs.31704 
AW516565 
AA513906 

AA366143 Hs.179669 



KIAA0227 protein 

ESTs, Moderately similar to A Chain A, Cyclop 
FAT tumor suppressor (Drosophila) homolog 2 
NM_014191*:Homo sapiens sodium channel, volta



Homo sapiens mRNA; cDNA DKFZp667D0S5 (from cl

ESTs, Weakly similar to KIAA0227 [H.sapiens]

q01dC NHCeC lumorHom 

gb:ng67c08.s1 NCI.CGAP_Up2 Homo sapiens cDNA 
hypothetical protein FLJ20637
hypothetical protein FLJ14627
ESTs 

phospholipase A2-activating protein 
410008
406671 



408625 
412530 
439245 



AI335361 Hs.226376 

AA251131 Hs.220697 
AW947884 

AV654382 Hs.17947 

AW168067 Hs.252956 

AI733249 Hs.126897 

AI864053 Hs.39972 

NM_005429 Hs.79141

R27496 Hs.1378 

AA420683 Hs.98321 

AW276866 Hs.192715 

AA394104 Hs.97489 

AI368873 Hs.271257

M402307 Hs.322844 

AI753709 Hs.152484
AW864542 

AA897461 Hs.328737 

R27299 Hs.10172 

AA100293 Hs.185043 

AA737415 Hs.152826 

AW205878 Hs.29643 

AU077005 Hs.92208 

AA988176 Hs.121553 
AA076118



451153 
407327 
440159 
404184 



449432 
425062 
452281 

421506 

427099 
431750 
451807 
453331 
447175 
451878 






444826 
413331 
414987 
405041 
413864 



ESTs, Weakly similar to I38588 re



annexin A3 
fibrillin 2 (congenital contractural arachnod
' • HJ14-03 



ESTs 
ESTs 

ESTs, Weakly similar to I38022 hypothetical p 

hypothetical protein DKFZp564A176 

ESI \\^H/ymildrtol38022 hypothetical p 

gb:PM4-SN0016-120500-003-h02 SN0016 Homo sa| 

ESTs, Weakly similar to envelope protein [H.s 

C100103t*;gi|7305041|ref|NP 038876.1 1 erytlro 

ESTs 



ne NT2RP30 



Hs.123058



AA079552 
D26528 
AA129547



AI765276 Hs.101257 

AF146074 Hs.108660 
AW847663 

AW243323 Hs.266785 

AA766268 Hs.266273 

NM_013381 Hs.6510

X17033 Hs.271986 

U30372 Hs.77204 



Homo sapiens cDNAFL 
a disintegrin an ' 
hypothetical protein FLJ20641
gb:zm18e06.s1 Stratagene pancreas (937208) Ho
gb:zm20h12.s1 Stratagene pancreas (937208) Ho
DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide
met proto-oncogene (hepatocyte growth factor
ERO1 (S. cerevisiae)-like
hypothetical protein MGC3295
ATP-binding cassette, sub-family C (CFTR/MRP)
gb:IL3-CT0213-280100-056-F02 CT0213 Homo sapi 



AW274560 Hs.129520 ESTs



FU13346 

I troj r i_ j one njingeclo 
integrin, alpha 2 (CD49B, alpha 2 subunit of 
centromere protein F (350/400kD, mitosin) 
NM_014155*:Homo sapiens HSPC063 protein (HSPC
Target Exon

gb:CM2-BT0742-10O400-147-hO4 BT0742 Homo sapi
ESTs, Weakly similar to Z195_HUMAN ZINC FINGE
ESTs

NM_030903*:Homo sapiens olfactory receptor, f



AI821027 



NM_014791 Hs.184339

NM_016112 Hs.159241

AA913909 Hs.153088

AW451361 Hs.196529 

H09748 Hs.57987 

T93500 Hs.28792 

U90304 Hs.25351 

BE302796 Hs.105097 
AW859276 

AB032953 Hs.173560 

AA514986 Hs.283705

W52854 Hs.27099 

AI240655 Hs.8895 

AI365208 Hs.293606 ESTs 
Hs.8429 
Hs.288968

AA215535 Hs.98133

AW855466 Hs.271866

U79293 Hs.159264

AA744597 Hs.88854 ESTs

AI023707 Hs.134273 ESTs

AW195634 Hs.170401 ESTs

AI127958 Hs.83393 cystatin E/M

BE542950 Hs.155548



NM_006413 Hs.139120
D55644 

AA490346 Hs.40530 

W55956 Hs.94030 

AA248998 Hs.173044

AL137279 Hs.130187

M13903 Hs.157091 

AI674482 Hs.148441 



KIAA0175 gene product 
polycystic kidney disease 2-like 1 
TATA box binding protein (TBP)-associated fac
ESTs

B-cell CLL/lymphoma 11B (zinc finger protein)
Homo sapiens cDNA FLJ11041 fis, clone PLACE10
iroquois homeobox protein 5 
thymidine kinase 1, soluble 
gb:MR1-CT0352-240200-105J02 CT0352 Homo sapi 
odd Oz/ten-m homolog 2 (Drosophila, mouse) 
ESTs 

hypothetical protein FLJ23293 similar to ARL-



RAB22A, member RAS oncogene family 



Target Exon 
ribonuclease P (30kD)

gb:Human spleen PABL (pseudoautosomal boundar
Homo sapiens, clone MGC:17624, mRNA, complete
Homo sapiens mRNA; cDNA DKFZp586E1624 (from c 
ESTs, Weakly similar to I38022 hypothetical p 
Homo sapiens mRNA; cDNA DKFZp43401214 (from c 



AA524394 Hs.294022 hypothetical protein



gb:PMO-BT0651-260200-001-b11 BT0651 Homo sapi 
hypothetical protein FLJ14950
C3001706*:gi|1345652|sp|P15989|CA36_CHICKCOL 
gb:RC5-HT0580-10050M22-C01 HT0580 Homo sapi 
AI885815 Hs.184727



409676 
448305 
429413 






AI870435 Hs.1569 
AW820260 

U72671 Hs.151250 

AA926960 Hs.334883 

NM_007019 Hs.93002

BE069288 Hs.34744 

AA885108 Hs.223806 
AW994222 

AA077118 Hs.197298 

AA625207 Hs.264915 

NM_014058 Hs.201877 

BE614743 Hs.146688 

Z47542 Hs.179312 



407103 
458175 
431130 

438533 
412313 
421733 
443757 
449300 






gb:EST391184MAGEi 

hypothetical protein AF301222
LIM homeobox protein 2 

gb:QV2-ST0296-150200-040-c 10 ST0295 Homo sapi 
gb:/ri88b01.s1 Soares placenta Nb2HP Homo sapi
intercellular adhesion molecule 5, telencepha



JA FLJ 1 2908 fis, clone NT2RP20 



small nuclear RNA activating complex, polypep 
Aiwoo^r ns.iuaooa KIAA1 688 protein 
NM_014314 Hs.145612 RNA helicase

gb:CM1-CT0278-031199-032-eOB CT0278 Homo sapi 



AA496799 Hs.36958 

U03493 Hs.138959

BE270447 Hs.174070

AA424881 Hs.256301

AW296024 Hs.150434

NM_006103 Hs.2719

-— Hs.61753 



gap junction protein, alpha 7, 45kD (connexin 
hypothetical protein MGC13170 
EST ? . h 



Hs.170673 ESTs, Weakly similar to T24832 hy| 

gb:PM2-DT002M50400 003-h03 DT0023 Homo sapi 
Hs.1420 fibroblast growth factor receptor 3 (achondro 



AW872860 Hs.11056 RALBP1 protein
AI652855 Hs.23363 hypothetical protein FLJ10983

Homo sapiens mRNA; cDNA DKFZp761L231 21 (from
Homo sapiens, Similar to zinc finger protein



AL353942 Hs.306504 
AW409872 Hs.184846 
Hs.191990



420170 U43374 



Hs.312691 ESTs



452220 
433675 
429163 
441703 



430072 
442966 

418327 
440381 
403983 

447888 
441794 
424153 
453633 
435647 
428780 
439108 



443362 
433183 
438214 
446745 
414416 



BE140845 Hs.57868 

AI672379 Hs.122970

BE158006 Hs.212296

AW977653 Hs.75319
AA884766

AW390054 Hs.192843

D31118 Hs.191735 

AA641836 Hs.30085 



AI014546 Hs.130912 

NMJH3371 Hs.71979 

AW375974 Hs.156704

AF284221 Hs.59506 

AW809201 Hs.314248 

AI334430 Hs.86984 

NM_004272 Hs.337737

NM_005879 Hs.21254

X13294 Hs.300592

AI394036 Hs.132237

AI799444 Hs.247095 

U70370 Hs.84136 

AA917808 Hs.190495 



hypothetical protein FLJ21579
ESTs 



ise M2 polypeptide 
gb:am20a10.s1 Soares_NFL_T_GBCS1 Homo sapien 
leucine zipper protein FKSG14 
hypothetical protein MGC10520
hypothetical protein FLJ23186
gb:v»i60b11.x1 NCI_CGAP_Co16 Homo sapiens cDNA 

in FU10933 



ESTs, Weakly similar to DUS8_HUMAN DUAL SPECI
ESTs, Moderately similar to ALU7_HUMAN ALU SU 
paired-like homeodomain transcription factor 



AA357001 
AI653240 
AI478578 
AW163034 



Hs.49823 ESTs 
Hs.50636 ESTs 
Hs.6467 synaptogyrin 3 



AW247090 Hs.57101 

AI053464 Hs.166505 

AF231338 Hs.222024 

H06076 Hs.26320 

AW118189 Hs.156400

AW409985 Hs.76084 

AW269088 Hs.118183 



ed (Drosophila)4ike (sea urchin fascin h 

hypothetical protein FLJ10545
minichromosome maintenance deficient (S. c
ESTs 

transcription factor BMAL2 
TRABID protein 

hypothetical protein MGC2721 
Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



407642 1007175_1

408391 1055687.1

408690 107490_1

409594 114249_1

409695 114876_1

409990 116644_1

410008 116812_1

410049 1172307_1

410784 1221005_1

411098 1232093_1

411171 1234393_1

411465 1246768_1

411559 1249417_1

411819 1259748_1

411958 126717_1

412279 1287332_1

412313 1288355.1 



412333 1289037.1 



412879 1334272_1

413083 1348639.1

413331 1361726_1

413864 1395783.1 

414147 142127.-1 

414148 142133.1 
415317 1533847.1 
415613 1540602.1 
416747 155189.1 
416120 1571266.1 
416168 1574545.1 
416648 1600181.1 
417742 1696282.1 
418347 174149.1 
418869 179863.1 
419807 188252.1 
420373 193194.1 
420637 195241.1 
420930 197736.1 
422689 219896.1 
423733 231476.1 
423735 231498.1 
423841 232507.1 
425005 245908.1 
429163 300543.1 
431120 328264.1 
431322 331543.1 
431688 336609.1 



433584 370400.1 

433644 371919.1 

434321 383473.1 

435399 405576.1 

437182 43421.1 

437938 44573.2 



438390 45662.1 
438966 467436.1 
438993 467651.1 



AW178963AW845721 BE141041AW8re977A\Am5ra0AWfM5691AW845724AW176564AW845705AW845603AWM5729AW845722AW178966 

AW845693AW845585AW845707BE141053 BE141046 AW845570 AW345575 RE1 11043 BE141039BE141070 AW062443 AW806980 AW845643 

AW806969 AW845686BE141054 BE141040 BE141044 BE141042 AW845571 AW845604 BE141047 BE141071 AW062442 AW845633 AW1 78963 

AW859276 AW859274 AW190959 T91463

AW864542 AA056567 AW882724 

AA076118 AA975618 AA076220

AA296961 AA296889 AA076945 AA077528 AA077497

AA079337 AA079338 AW272100



AW820260 AW820332 R94406
AW847663 AW847861 AW861080
BE144081 BE144190 AW851155

AW947884 AW947918 AW947888 AW947883 AW947897 AW947910 AW947905 AW864751 AW947878
AA099020 AW751275 AW751276 AW751289
BE245511 BE246133 AW935247

AW936832 AW936609 AW936657 AW93661 1 AW936739 AW93G734 AW936779 AW936688 AW936659 AW936738 AW936327 AW936737 AW936736 
AW936740 AW936833 AW936777 AW936830 AW936834 AW936829 AW936772 AW936638 AW936658 AW936636 AW936774 AW936778 AW936766 
AW936776AW936831 AW936760 AW936819 
AW937485 AW937589 AW937658 AW937G54 AW937492 

AW945964 AW946020 AW946034 AW946027 AW946041 AW946044 AW946033 AW946024 AW946021 AW946029 AW94601 5 AW94601 6 AW946039 
AW946045 AW946028 AW946036 



BE084049 AW292907 AA135984
Z43388 F05453 R19673 R20275 H06917
R20233 F12901 T74740
AA381209 AA381245 AA167683
H46739 H51513 H19779
H23687 H46460 H40239
H62953 N76608 N72413
R64719 Z44680 R12451
AA216419 F03238 AA229517
AW516565 AA229762 AA230035
R77402 AA262462 AA250988 R06794
AW968228 AA259146 W01465
AW976153 AA278945 AA747691

AW888650 AW888651 BE149946 BE149948 BE149951 BE149947 AW888649 AA281840 AA281822 AW888652

AW856665 AA315006 AW954733

AA330281 AA330232 AW962521

AA330259 AA661806 AA502431 AW974633 AA649496

AW753967 AA370795 AA331630 AW962550

AI565851 AA349656 R24798

AA884766 AW974271 AA592975 AA447312

AA492588 AA492498 AA492571

AW970622 AA503009 AA502998 AA502989 AA502805 T92188

AA513906 AA847734 AI357044

AW971125 AA527731 N52655 AI821508 AA532420

AA527941 AI810608 AI620190 AA635266

AW974094 AA569074 AA602574 

AF005258 

AW295399 AW207772 AW300641 AW070290 BE348854 AW170383 AA600966 AA778832 

AW342028 AA641080 AA603282 

AA629368 AW849574 AW849573 

AA679463 AW813779 AW813709 

AL080098 AL037472 AA432051

AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598 AA251875 
AI820501 AI820532 W87891 T8S904 U71456T82391 BE326571 T75102 R34725 AA884922 BE328517 AI219788 AA884444 N92578 F13493 AA927794 
AI560251 AW874068 AL1 34043 AW235353 M663345 AW008282 AA488964 M283144 AI890387 AI950344 AI741346 AI689062 AA28291 5 AW102898 
AI872193 AI763273 AW1 73586 AW150329 AI653332 AI762688 AA988777 AA488892 AI356394 AW10381 3 AI539B42 AA642789 AA856975 AW50551 2 
AI961530AW629970BE612881 AW276997AW 1 0 f A344209 AW856538 AA180009 AA337499AW961101 AA251669AA251874 

AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 AA969759 N75628 N22388 
H84729H60052T92487AI022058AA78041CAA', I006W8070 6134 I 2 A 564269 F00531 H83488 W37181 W7B802 R66056 AI002839 
R67840AA300207 AW959581 T63226 F04005 

AI422017 AI422945 AI363249 AI4231 13 A1925592 AI420795 AI2081B7 AI423279 AI423645 AI424090 AI359637 AL044732 D17003 
AW979074AAB34841AA828650 
439579 47404.1

440320 491930_1

444910 624951.1

446417 676384.1

447197 711623_1

448599 770766.1

449034 794817_1



451340 86640.1 

454202 1050507.1 

454241 1067807_1

454707 1230250_1

454891 1239217_1

454988 1248607_1



455203 1259973.1 

455310 1278158.1 

455319 1279172.1 

455365 1284681.1 



455987 1397735.1 
457405 333127.1
458829 773443.1 



AF086400 W79232 W73990
AA879294 N67538 AI474541
AI201849 BE069007 AW946544
AI299050 BE256910
R36075 AI366546 R36167
AW860912 AI540866
AI624049 AW117770 AI858360
AA005129 AA679084 AA694399
AI702055 R89204 R86260
AI761324 AW880941 AW880937
BE092900 AA015877 AA018521
AW936273 AW340350 AA017208

AW178363 AW846011 AW845964 AW845988 AW845977 AW846002
BE144666 BE184942 AW238414 BE184946
AW814989 AW814852 AW814808
AW837349 AW837355 AW882717
AW850140 AW850195 AW850192

BE079752 BE079868 BE148989 AW855532 BE148818 BE148815 BE148796
BE152428 AW855572 AW855607

AW855598 AW855608 BE148763 BE148764 AW855645 AW855615 AW855596 AW855610 AW855601 AW855605
AW865450 AW865119 AW865452 AW865461 AW865325 AW865114 AW865116 AW865321 AW865590 AW865390
AW893961 AW893998 AW894034 AW894019



AW948343 AW948341 AW902855 AW984737 

AW994222 AW994377

BE065813 BE066788 BE065889 BE065832

BE075114 BE076283 BE075118

BE145808 BE145807 BE181883

BE178323 BE177978

AA504860 AA504911

AI557388 BE158936

AJ003631 AJ003650 AJ003651



s. "Dunham I. et al." refers to the publication entitled The DNA sequence of 



400612 9929646 Minus 



401050 
401137 
401230 
401260 
401367 
401458 



8118496 
8117628 
2547238 



9187886 Plus 

7341763 Plus 

7229804 Minus 

9789672 Minus 



401760 
401780 
401781 






8072512 Plus 

6598824 Plus 

6957691 Plus 

9797406 Plus 

9863723 Plus 

9958129 Minus 

6010175 Plus 

8086844 Minus 

8072441 Minus 



403806 8140491 

403983 8576059 

404107 8099028 

404132 6981900 

404184 4581418 



17982-18115,20297-20456 
78449-79425 
598-1009 

33835-34006,34539-34592,36461-35745,48925-49098,52604-52758 
86008-86355 
145356-145807 
76485-77597 

32585-32756,36281-36540,40791-40933,44018-44179
76253-76364 

118596-118816,119119-119244,119609-119761,120422-120990,130161-130381, 
1319 114 11" \< I 1911 
83126-83250,85320-85540,94719-95287 

28397-28617,2B920-2^45 221 35-2:0:= T941 1-29567,29705-29787,30224-30573 



,130468-130593,131097-131258,131866- 



41164286,16811-16973,17107-17256,19715-20040,22029-22205 



13579-14026 
20167-22383 

43921-44049,46181-46273
194384-194645 

104069-104179,105683-105859 



43575-43887



7407964 
4432779 
6007890 
7547195 
405277
405281 
405336 
405454 



6094635 Plus 

7656675 Plus 

1054740 Plus 

1054740 Plus 

4827303 Minus 

4557087 Plus 

4156182 Plus 

6758795 Plus 
7767812 Minus 

6758796 Plus 
9123919 Minus 
9123919 Plus 



7711566 
7711569 
8248616 



34202-34351,: 
33267-33563 
133807-134053 

118677-118807,119091-119296,121626-121823 

124361-124520,124914-125050 

104132-104293 



251370-251797,252168-252882 



Plus 106956-107121 
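
The description of Table 13A that follows states three selection conditions: a tumor-to-normal ratio of at least 1.7, a stated percentile expression value among cervical cancers of at least 100 units, and a predicted protein domain indicative of extracellular localization. A minimal sketch of such a filter is given here for illustration only. The record type, the field names, the function names and the example rows are all hypothetical, and reading the PPDomains labels "SS" and "TM" as signal sequence and transmembrane domain is an assumption; none of this is taken from the method used to build the table.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple

# Labels treated as indicating extracellular localization. The description
# names ig, fn3, egf and 7tm domains, signal sequences and transmembrane
# domains; mapping "SS" and "TM" to the last two is an assumption.
EXTRACELLULAR_LABELS = {"ig", "fn3", "egf", "ss", "tm"}


@dataclass
class Probeset:
    pkey: int                 # hypothetical identifier, not a real Pkey
    ratio: float              # R1: tumor / normal adult tissue
    percentile_value: float   # expression units at the stated tumor percentile
    domains: Tuple[str, ...]  # predicted protein domain labels


def has_extracellular_domain(domains: Iterable[str]) -> bool:
    """Return True if any label matches the assumed extracellular set."""
    for label in domains:
        key = label.lower()
        # "7tm_1", "7tm_3" and similar are all counted as 7tm domains.
        if key in EXTRACELLULAR_LABELS or key.startswith("7tm"):
            return True
    return False


def select_candidates(rows: Iterable[Probeset],
                      min_ratio: float = 1.7,
                      min_units: float = 100.0) -> List[Probeset]:
    """Keep rows that satisfy all three stated conditions."""
    return [
        r for r in rows
        if r.ratio >= min_ratio
        and r.percentile_value >= min_units
        and has_extracellular_domain(r.domains)
    ]


# Hypothetical example rows (values invented for illustration).
EXAMPLE_ROWS = [
    Probeset(1, 4.8, 250.0, ("SS", "TM", "GBP")),
    Probeset(2, 1.6, 500.0, ("TM",)),        # ratio below 1.7
    Probeset(3, 3.0, 80.0, ("ig",)),         # below 100 units
    Probeset(4, 2.9, 120.0, ("p450",)),      # no qualifying domain label
    Probeset(5, 1.7, 100.0, ("7tm_1",)),     # meets both thresholds exactly
]

if __name__ == "__main__":
    for row in select_candidates(EXAMPLE_ROWS):
        print(row.pkey, row.ratio, ",".join(row.domains))
```

Both thresholds are inclusive, matching the "greater than or equal" wording of the description.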



Table 13A lists about 465 genes up-regulated in cervical cancer compared to normal adult tissues that are
likely to encode extracellular or cell-surface proteins. These were selected as for Table 12A, except that
the ratio was greater than or equal to 1.7, and the 96th percentile value amongst cervical cancers was
greater than or equal to 100 units, and the predicted protein contained a structural domain that is
indicative of extracellular localization (e.g. ig, fn3, egf, 7tm domains, signal sequences, transmembrane
domains). The predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number
UnigeneID: Unigene number
PPDomains: Predicted Protein Domains
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal adult tissues

Pkey ExAccn UnigeneID PPDomains



425650 NM_001944 Hs.1925 
418007 M13509 Hs.83169 
439606 W79123 Hs.58561 
452240 AI591147 Hs.61232 
424046 AF027866 Hs.138202 
400289 X07820 Hs.2258 
418345 AJ001696 Hs.241407 
423017 AW178761 Hs.227948 
428227 AA321649 Hs.2248 
447164 AF026941 Hs.17518 
414764 AW013887 Hs.72047 
416661 AA634543 Hs.79440 
427585 D31152 Hs.179729 
406467 

428368 BE440042 Hs.83326 
418882 NM_004996 Hs.89433 
419247 S65791 Hs.89764 
446232 AI281848 Hs.194691
424905 NM_002497 Hs.153704
422420 U03398 Hs.1524
427821 AA470158 Hs.98202
436211 AK001581 Hs.334828
444342 NM_014398 Hs.10887
422330 D30783 Hs.115263
447342 AI199268 Hs.19322 
407839 AA045144 Hs.161566 
410153 BE311926 Hs.15830 
414812 X72755 Hs.77367 
421773 W69233 Hs.112457 
413385 M34455 Hs.840 
413753 U17760 Hs.75517 
432239 X81334 Hs.2936 
418663 AK001100 Hs.41690 
407366 AF026942 
433091 Y12642 Hs.3185 
408536 AW381532 Hs.135188 
420440 NMJW2407 Hs.97644 
437044 AL035864 Hs.69517 
405547 

439223 AW238299 Hs.250618 
426320 W47595 Hs.169300 
423634 AW959908 Hs.1690 
426350 NMJM3245 Hs.2022 
409744 AW675258 Hs.56265 



TM,cadherin,Cadherin_C_term
C 3 hen i n r i lida c I,' ' 
TMJtmJ 
TM 

SS,TM,serpin 

hemopexin,Peptidase_M10 
SS,TM,serpin 



SS,Peptidase_M10,hemopexin 
TM,ABC_membrane,ABC_tran 
TM,KH-domain
TM,7tm_3,Ribosomal_L13



TM,cadherin,Cadherin_CJe 

Glycos_transf_2 

SS,TM,IL8 



Unigene Title 

desmoglein 3 (pemphigus vulgaris antigen) 



matrix metalloproteinase 10 (stromelysin
serine proteinase inhibitor 13(PI13;se 
serine (or cysteine) proteinase inhibitor 
small inducible cytokine subfamily B (Cys 
Homo sapiens cig5 mRNA, partial sequence 
ESTs 

IGF-II mRNA-binding protein 3 

collagen, type X, alpha 1 (Schmid metaphy

matrix metalloproteinase 3 (stromelysin 1
ATP-binding cassette, sub-family C (CFTR/ 
fragile X mental retardation 1 

NIMA (never in mitosis gene a)-related ki 



SS,Peptidase_M10,hemopexin 



TMJBR 

SS,TM,UPAR_LY6 

SS,TM,E1-E2_ATPase,Cation_ATFase_C, N 

SRCRUteroglobin 

TM 

SS,TM,ABC_membrane,ABC_tran,ig
SS,TM,TGF-beta,TGFb_propeptide 




cDNA for differentially expressed C016 ge 
NM_01 -i "Hill i en ransp r2 Al 
UL16 binding pn " - _ 



heparin-binding growth factor binding pro 
Iransglutamin ( i (E poly, ptid p ti 
Homo sapiens mRNA; cDNA DKFZp586P2321 (fr 
444461 R53734






ESTs, Weakly similar to 2109260A B cell g 


4.8 


410361 BE391804 


Hs.62661


SS,TM,GBP 


guanylate binding protein 1, interferon-i 


4.8 


423673 BE003054 




SS,TM,Peptidase_M10,hemopexin 


matrix metalloproteinase 12 (macrophage e 




450375 AA009647 




TM,disintegrin,Pep M12D propep.Reprolysin 


a disintegrin and metalloproteinase domai 


4.8 








Target Exon 


4.6 


428484 AF104032


Hs.184601 




solute carrier family 7 (cationic amino a


4.5 


425071 NM_013989 




SS,TM,T4 deiodinase 


deiodinase, iodothyronine, type II 


4.4 






SS,TM,EGF


amphiregulin (schwannoma-derived growth f 


4.3 


434699 AA643687


Hs.149425 


TM,Nucleoside_tra2 


Homo sapiens cDNA FLJ11980 fis, clone HEM


4.3 


406687 M31126 


Hs.272620 


SS,Peptidase_M10,hemopexin 


pregnancy specific beta-1-glycoprotein 9 


4.2 


404440 




TM,MAGE


NM_021048:Homo sapiens melanoma antigen, 


4.2 


449228 AJ403107 




TM,PAF-AH,p450 


protein related with psoriasis 




444105 AW189097






ESTs 




409632 W74001 




SS,TM,UPAR_LY6 


serine (or cysteine) proteinase inhibitor 




423738 AB002134 


Hs.132195


SS,TM,trypsin,SEA 


airway trypsin-like protease 




423553 AA405635 






ESTs, Weakly similar to DYLX_HUMAN CYTOPL




445537 AJ245671 






EGF-like-domain, multiple 6 




446989 AK001898 






hypothetical protein FLJ11036




428536 AI143139 






visinin-like 1


3.9


413801 M62246 






ESTs, Highly similar to unnamed protein p 


3.9 


429441 AJ224172 


Hs.204096 


Uteroglobin 


lipophilin B (uteroglobin family member), 


3.9 


409601 AF237621 


Hs.80828 


TM,fiiamentfilament,C2 


keratin 1 (epidermolytic hyperkeratosis) 


3.8 


439238 N47305 






ESTs 


3.8 


446292 AF081497 






Rh type C glycoprotein 


3.8 






SS,TM,proteasome,ig,ABCjnerrib,1ranABC Iran, 


(MDR/TAP) (TAP2) 


3.8 


422938 NM_001809




™',thic'lase 


centromere protein A (17kD) 


3.7 


423217 NM_000094




SS,TM,fn3,vwa,Collagen,Kunitz BPTI 


collagen, type VII, alpha 1 (epidermolysi 




430686 NM_001942


Hs.2633


SS,TM,cadherin,Cadherin_C_term 


desmoglein 1 


3.7 


444707 AI188613 




TM,cadherin 


desmocollin 3


3.7 


409582 R27430 








3.6 


408771 AW732573 




TM.KJetraJon trans 


potassium voltage-gated channel, delayed- 




400441 M15530 


Hs.99879 


TM.G- alpha 


B-cell growth factor 1 (12kD)


3.6


413278 BE563085 




TM]ubiquiu'nlaminln_G,laminin_EGF,kazal 


interferon-stimulated protein, 15 kDa


3.6 


426614 BE616633 


Hs.170195


SS,TGFb_propeptide,TGF-beta


bone morphogenetic protein 7 (osteogenic 


3.6 


424927 AW973666 


Hs.153850




hypothetical protein C321D2.4


3.6 


408591 AF015224 




SS,TM,Uteroglobin


mammaglobin 1 


3.5 


407756 AA116021


Hs.38260 


SS,UCH-1,UCH-2 


ubiquitin specific protease 18 




407137 T97307 




TM,GDA1_CD39


gb:yc53h05.s1 Soares fetal liver spleen 1 


3.5 


411274 NM_002776 






kallikrein 10 


3.5 








NM 002425 Homo sapiens matrix metalloprot 


3.5 


412471 M63193 




SS.TM.GIycos tran^f 3 Cam acyltransf 


idol ia SI growth) r1i lal 






Hs.101257




hypothetical protein MGC3295




451778 AI826131 






ESTs, Weakly similar to zinc finger prote




430397 AI924533 






bicarbonate transporter related protein 1 




449722 BE280074 


Hs.23960


TM,cyclin




3.4 


422487 AJ010901 


Hs.198267




mucin 4, tracheobronchial 


3.4 


449101 AA205847 




SS,TM,7tm_1






418994 AA296520 




SSJMjectiii c,sushi,EGF 


selectin E (endothelial adhesion molecule 


3.3 


421379 Y15221 


Hs.103982 




small inducible cytokine subfamily B (Cys 




414774 X02419 




Ss|krin'gle,trypsin 


plasminogen activator, urokinase 


3.3 


431958 X63629 






cadherin 3, type 1, P-cadherin (placental 


3.3 


418462 BE001596 


Hs.85266 


SSJM|integrinJJ,fn3 


Integrin, beta 4 


3.3 


424687 J05070 


Hs.151738




matrix metalloproteinase 9 (gelatinase B,


3.3 


401486 NA 






C4000647':gi|4758508|ref|NP 004253.1| air 


3.2


408113 T82427 


Hs.194101 


TM,7tm_3,Ribosomal_L13


Homo sapiens cDNA: FLJ20869 fis, clone AD


3.2 


427359 AW020782 


Hs.79881




Homo sapiens cDNA: FLJ23006 fis, clone LN


3.2 


452934 AA581322


Hs.4213 


SS,TM,ig 


hypothetical protein MGC16207 


3.1 


448988 Y09763 


Hs.22785 


gamma-aminobutyric acid (GABA) A receptor 




439750 AL359053 


Hs.57664




Homo sapiens mRNA full length insert cDNA 




414696 AF002020 


Hs.76918


SS,TM,Patched


Niemann-Pick disease, type C1 


3.1 


435604 AA625279


Hs.26892 




uncharacterized bone marrow protein BM040 


3.1 


453883 AI638516 


Hs.22630 


TM,Ets,SAM_PNT


cofactor required for Sp1 transcriptional 


3.0 


448733 NM 00562! 


) Hs.187958 


SS,TM,W,ABCJran,isodh,pl<inase,DSPc,P4bosomaL 


solute carrier family 6 (neurotransmitter




444946 AW139205


Hs.156457 




hypothetical protein FLJ22408


3.0


437938 AI950087 




TM,histone,ig,MHC_l 


gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens


3.0 


424441 X14850 


Hs.147097 




H2Ahistone family, member X 




427061 AB032971 


Hs.173392 




KIAA1145 protein


3.0 


409703 NM_006187 Hs.56009




2'-5'-oligoadenylate synthetase 3 (100 kD




447313 U92981 


Hs.18081 




Homo sapiens clone DT1P1B6 mRNA, CAG repe 


3.0 


431070 AW408164 


Hs.249184


ABC_tran 


transcription factor 19 (SC1) 


2.9 


446269 AW263155 






hypothetical protein FLJ10540


2.9 


421190 U95031 


Hs.102482


TM,vwd 


mucin 5, subtype B, tracheobronchial 


2.9 


452732 BE300078 


Hs.80449


TM 


Homo sapiens, clone IMAGE:3535294, mRNA,


2.9 


443859 NM_013409 Hs.9914






2.9


446733 AA863360 


Hs.26040 


TM,p450 
SS,p450 


ESTs, Weakly similar to fatty acid omega- 


2.9 


449746 AI668594


Hs.176588


ESTs, Weakly similar to CP4Y_HUMAN CYTOCH


2.9 


418844 M62982 


Hs.1200 


SS,TM,lipoxygenase,PLAT 




2.9 


414581 AA256213 


Hs.72010 


TM,Cam_acylfransf,Choline_kiri33e,SC01.SenC 


ESTs 


2.9 


431629 AU077025 


Hs.265827 


SS,tRNA_antiSH2,SH3,pkinase 


interferon, alpha-inducible protein (clon 


2.8 


445873 AA250970 


Hs.251946 


SS,rrm,PABPpkinase,14-3-3,rrm 


poly(A)-binding protein, cytoplasmic 1-li 


2.8 


438113 AI467908 


Hs.8882 


TM,7tm_1 


ESTs 


2.8 


410310 J02931 
coagulation factor III (thromboplastin, t
NM_014053 Hs.201877

BE614743 Hs.146688

BE270447 Hs.174070
AL119671 Hs.1420
AA641836 Hs.30085
TM,SDF

SS,TM,ig 

TM,ABC_tran,ABC_membrane,Rhomboid

TM,MAPEG
TM,UQ_con

TM,cNMP_binding,trypsin
SS,TM,wap 
FAN,! 



TM 
TM 
ESTs, Weakly similar to I38022 hypothetic
hypothetical protein MGC10791
hypothetical protein DKFZp564A176
a disintegrin and metalloproteinase domai
met proto-oncogene (hepatocyte growth fac
hypothetical protein MGC3295
ATP-binding cassette, sub-family C (CFTR/
integrin, alpha 2 (CD49B, alpha 2 subunit
thymidine kinase 1, soluble
ESTs
prostaglandin E synthase
ESTs
hypothetical protein FLJ20401
NMJ»2508:Homo sapiens
KIAA0125 gene product 
gb:nf37c09.s1 NCI_CGAP_Pr2 Homo sapiens c 
hypothetical protein MGC861 
gap junction protein, beta 2, 26kD (conne 
ESTs, Weakly similar to A47582 B-cell gro
gap junction protein, beta 3, 31kD (conne
hypothetical protein FLJ22969
esophagus cancer-related gene-2 
insulin-like growth factor 1 (somatomedin 
vanilloid receptor-related osmotically ac 
hypothetical protein FLJ14451 
ESTs ' 

hypothetical protein MGC2487 
integrin, alpha 3 (antigen CD49C, alpha 3 
cytochrome P450, subfamily IIS, polypepti 
Fanconi anemia, complementation group G 

matrix metalloproteinase 14 (mei 
cystatin B (stefin B)
ESTs
KIAA0683 gene product
ESTs

Homo sapiens, clone IMAGE:3535294, mRNA, 
transporter 1, ATP-binding cassette, sub- 
cysteine knot superfamily 1, BMP antagoni
hypothetical protein FLJ12541 similar to
prostaglandin-endoperoxide synthase 2 (pr
ESTs, Weakly similar to A34087 hypothetic 
Homo sapiens, Similar to DNA segment, Chr 
H1 histone family, member 0 
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TM,PMP22_Claudin 
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TM.TNF 

TM,RhoGEF,FYVE,PH 
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TM.pkinase.trra 
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TONDU 

hypothetical protein DKFZp564D0462 
mitogen-activated protein kinase kinase k
hypothetical protein FLJ20561



CDC20 (cell division cycle 20, S. cerevis 
serine protease inhibitor, Kazal type, 5 
Homo sapiens cDNA: FU2300Bfis, clone LN 
Ita (Dr so[ ii )-like I 



inhibin, beta A (activin A, activin AB al
ESTs 

NM_007196:Homo sapiens kallikrein 8 [neur 
Homo sapiens, Similar to RIKEN cDNA 57305 
guanine nucleotide binding protein (G pro 
stratifin 

metallocarboxypeptidase CPX-1 
potassium channel, subfamily K, member 1 
CGI-09 protein 
collagen, type V, alpha 2 
germ cell specific Y-t ' ' 
prolactin receptor 
SHC (Src homology 2 domain-containing) tr
small inducible cytokine A5 (RANTES)
ATPase, Ca transporting, type 2C, member 
gb:nv35f03.r1 NCI_CGAP_Br5 Homo sapiens c 
putative secreted ligand homologous to fj 
hypothetical protein MGC2991 
sema domain, immunoglobulin domain (Ig), 



pyrimidinergic receptor P2Y, G-protein co

bone morphogenetic protein 1

squamous cell carcinoma antigen recognize 

KIAA0450 gene product 

olfactory receptor, family 2, subfamily I 



RAD1 (S. pombe) homolog 
Homo sapiens, Similar to RIKEN cDNA 11100 
gb:MR0-HT0065-081199-002-b06 HT0065 Homo 
cadherin, EGF LAG seven-pass G-type recep 
gap junction protein, beta 5 (connexin 31 



zinc/iron regulated transporter-like 
jagged 2 

hypothetical protein from EUROIMAGE 17593 
mitochondrial ribosomal protein L17 
ESTs, Highly similar to unnamed protein p 
collagen, type III, alpha 1 (Ehlers-Danlo 
ESTs 

Homo sapiens, clone IMAGE:3544662, mRNA,
ESTs 

small inducible cytokine subfamily B (Cys 
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AU076657 Hs.1600 

NM_004341 Hs.154868 
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TM,Transglulamin_C,Transglutamin_N,TransgluLcore 
TM,Rhomboid,HMG_boxTPR 
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i9 
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SS,TM,zf-DHHC 
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SS,laminin_EGF,laniinin_Nlerm 

SS,TM,serpin,Marek_A 

SS,TM

TM,Ski_Sno 

SS,TM,IL8 

TM,Ribosomal_S17Ribosomal_L13
TM.GIyco.hydroJ

TM,ABC_tran,ABC_membraneion_trans
SS,sushi

SS,TM,Collagen,COLFI,TSPN 

TM,transmembrane4RasGEF,RA 

TM,SDF



SS,TM,Peptidase_M10,hemopexin
TM,cpn60_TCP1,Sema 

SS,TM,GATase,OTCace,CPSase_L_chain,Dihydrooro
Ricin_B_lectin

SS,TM,fusion_gly,Myosin_tailadh_short

TM,Na_Ca_ExCarn_acyltransf

Uteroglobin 

SS,TM,UNG

TM,ubiquitin,laminin_G,laminin_EGF,kazal
TM,p450 
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FGF 
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AA887391 Hs.202229 
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M311426 Hs.21635 
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AI224456 Hs.4934 
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AI498072 Hs.77783 
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SS,Y_phosphataseTIG 

TM,EGF,laminin_G,fibrinogen_C,F5_F8_type_C,tubulin

TM,Ephrin,Hist_deacetyl

SS,TM,TSPN,Collagen

TM,LRR,LRRCT

TM,LRRCT

SS,TM,REJ,PLAT,PKD,WSC,LRRCT,GPSPMP22_Cla
SS,TM,trypsin
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ryanodine receptor 1 (skeletal) 
cathepsin D (lysosomal aspartyl protease) 
gap junction protein, beta 6 (connexin 30 
WNT1 inducible signaling pathway protein 
gb:nc32g04.s1 NCI_CGAP_Pr2 Homo sapiens c
NM_003105*:Homo sapiens sortilin-related
solute carrier family 15 (H??? transporte 



rallAP 
syndecan 1 
KIAA1355 protein
hypothetical protein MGi 
no sapiens cDNA FL 



Homer, neuronal immediate early gene, 3 

small nuclear ribonucleoprotein D3 polype 
'-complex 2, sigma 



2'-5'-oligoadenylate synthetase 2 (69-71 

epithelial V-like antigen 1 

poliovirus receptor-related 2 (herpesviru

sema domain, immunoglobulin domain (Ig), 

ESTs 

DnaJ (Hsp40) homolog, subfamily B, member 
Homo sapiens, clone MGC:2908, mRNA, compl 
Homo sapiens clone TUA8 Cri-du-chat regio



ESTs 

small inducible cytokine B subfamily (Cys 
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ESTs, Weakly similar to Z195_HUMAN ZINC F 
ESTs, Weakly similar to ALU1_HUMAN ALU SU 
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RalGEF-like protein 3, mouse homolog 




UDP-N-acetyl-alpha-D-galactosamine:polype
arginase, type II

Homo sapiens, Similar to RIKEN cDNA 54304
hypothetical protein FLJ22316
uracil-DNA glycosylase 
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solute carrier family 2 (facilitated glue 



C-type (calcium dependent, carbohydrate-r 

neurotrophin 5 (neurotrophin 4/5) 

hypothetical protein FLJ10402

gb:hn41d 1.X1 NCI_CGAP_RDF2 Homo sapiens 

Homo sapiens mRNA; cDNA DKFZp564B1264 (ft 

lysyl oxidase-like 2 

ESTs 

ESTs 

ESTs, Weakly similar to A46010 X-linked r 
DNA segment on chromosome 12 (unique) 248 
p53-induced protein PIGPC1 
olfactory receptor, family 2, subfamily I 
nucleolar protein (KKE/D repeat) 



membrane-associated tyrosine- and threoni 
leukemia-associated phosphoprotein p18 (s
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Hs.30922 TM 
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AL044056 Hs.158047 
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TM 

TM,Ribosomal_S17Ribosomal_L13
hormone_rec,Prog_receptor,zf-C4
SS,TM,Na_K-ATPaseE2F_TDP
TM,Collagen

SS,TM,LIM,PDZsugar_tr,PDZ,LIM 
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TM.IRF 
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TM 

TM,SPRY
SS,TGF-beta 

TM,Galactosyl_T_2ATP-synt_C 
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Hs.13453 
Hs.21420 
Hs.10247 



TM,Sema,Plexin_repeatTIG,Plexin_repeat

SS,thyroglobulin_1,IGFBP

TM,DIX,RGS,thiored

GST_C,tRNA-synt_1,WHEP-TRS,TGF-beta

TM,thiored,Y_phosphatase,MAM,ig,fn3MSF._domain

TM,ion_trans

SS,TM,ig 

Sema,Plexin_repeat,TIG 
TM 

TM,Ammonium_transp 

SS,TM,adh_short 

TM,ArfGap 

TM 

TM,ig,ITAM

TM,Ricin_B_lectin

TM,ion_trans,LIM,Synaptophysin

TM,connexin

SS,MIF

SS,Peptidase_M10,fn2,hemopexin

SS,TNF

TM,PX

TM,kinesin

TM,SURF4,SURF1,DEADlipocalin
TM,death,TNFR_c6

TM,bcl-2,BH4

TM,Cytidylyltransf,SIR27tm_2
TM,Armadillo_seg

TM,TFIIS,RNA_POL_M_15KDserpin,hormone_rec,zf-C4
SS,TNFR_c6

TM,Transglutamin_C,Transglutamin_N,Transglut_core
laminin_B,laminin_EGF 



TM 



hypothetical protein MGC2603 
PRP4/STK/WD splicing factor 
Cip1-interacting zinc finger protein 
ESTs, Moderately similar to R27328 2 [H.s 
DNA segment on chromosome 19 (unique) 117
hypothetical protein FLJ13154
gb:oy92e04.x1 Soares_fetal_liver_spleen_1 
glycyl-tRNA synthetase 
r " em FU10199 



DKFZP586E1621 protein
ESTs

hypothetical protein FLJ10385
wingless-type MMTV integration site famil
hypothetical protein FLJ10407
pregnancy specific beta-1-glycoprotein 11 



ENSP00000202587*:Bicarbonate transporter-



KIAA1268 protein 

hypothetical protein FLJ20979

Homo sapiens clone TCCCIA00164 mRNA seque 

ESTs 

ATPase, Na+/K+ transporting, beta 3 polypept
CSR1 protein 
LIM domain protein 
DKFZP434I216 protein 
interferon-stimulated transcription facto 

ESTs, Weakly similar to T17227 hypothetic 
ESTs 

regulator of nonsense transcripts 1 
Homo sapiens cDNA FLJ14227 fis, clone NT2
proteasome (prosome, macropain) subunit,
hypothetical protein FU1243G
insulin-like growth factor binding protei
five-span transmembrane protein M83
glioma-associated oncogene homolog (zinc
hy heti I prolein DC Zp ' H 11
Homo sapiens cDNA FLJ11341 fis, clone PLA
CD83 antigen (activated B lymphocytes, im 



ESTs 

hydroxysteroid (17-beta) dehydrogenase 1 
hypothetical protein FLJ10767
gb:nc15d10.s1 NCI_CGAP_Pr1 Homo sapiens c
fibroblast growth factor 11
UDP-N-acetyl-alpha-D-galactosamine:polype
proteolipid protein 2 (colonic epithelium 
ESTs 

macrophage migration inhibitory factor (g 
matrix metalloproteinase 2 (gelatinase A, 
ESTs, Moderately similar to unknown [H.sa 
ESTs 

kinesin family member 4A
Homo sapiens cDNA: FLJ22993 fis, clone KA
tumor necrosis factor receptor superfamil
hypothetical protein FLJ10337
BCL2-like 1

ESTs, Moderately similar to ALU2_HUMAN AL
ESTs, Weakly similar to ALU8_HUMAN ALU
polymerase (RNA) III (DNA directed) polyp 
tumor necrosis factor receptor superfamil 
proteasome (prosome, macropain) activator 



SU 



iyr^Nti' 1 protein Ft M 17' 1 






WO 03/042661 



PCT/US02/36810 



Unique Eos probeset identifier number
Gene cluster number
Genbank accession numbers 



105240.1 
176677.1 
179717.1 
179863.1 
44573.2 



AI950087 N70208 R97040 N36809 AI308119 AW957677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598 AA251875
AI820501 AI820532 W87391 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 AI219788 AA884444 N92578 F13493
AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 AA283144 AI890387 AI950344 AI741346 AI689062
AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642
AA642789 AA856975 AW505512 AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499
AW961101 AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513995
AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 F00531
H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005



BE140966 BE140961 BE140967 BE141006 BE140985 BE140970 BE141669 BE141653 BE141664 BE141B55 BE141651 BE141660 BE140969
BE141673 BE141650 BE141674 BE141550 BE141688 AW178241 BE140994 BE141666 BE140998 BE141008 BE140988 BE141011 BE140975
BE141667 BE141675 BE141657 BE141681 BE141656 BE141672 BE141680 AW178237 BE141012 BE140990 BE141658 BE141648 BE141013
BE141663 BE140973 BE141004 BE140963 BE140984 BE141009 AW178232 BE141007 BE141649 AW178293 BE140993 AW178233 BE141546
BE141005 BE141691 BE141000 BE141652 BE140965 BE141562 BE140960 BE140962 BE141001 BE140978 AW178229 AW178239 BE141671
AW178230 BE141547 AW178235 BE141663 BE141549 BE140996 BE141003 AW178236 BE141002 BE141556
AW897851 AW897852
AA579426 AA579436 AA573736



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

400666 8118496 Plus 17982-18115,20297-20456 



401103 
401486 
401575 
402745 
402915 



98330-98449 

32585-32756,36281-36540,40791-40933,44018-44179
76253-76364 



9212537 
1054740 
1054740 



140-276 

80430-81581 

72019-72509 

118677-118807,119091-119296,121626-121823 

124361-124520,124914-125050 

1553 1 '.., 1378-2140,4252-4385,5922-6077



TABLE 14A: 209 GENES UP-REGULATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT TISSUES THAT ARE LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS [...] MODULATION BY SMALL MOLECULES

Table 14A lists about 209 genes up-regulated in cervical cancer compared to normal adult tissues that are likely to encode either enzymes or proteins [...] modulation by small molecules. These were selected as for Table 12A, except that the ratio was greater than or equal to 2.0, the 96th percentile value amongst cervical cancer [...] was greater than or equal to 40 units, and the predicted protein contained a structural domain that is indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, peptidase, [...]



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Predicted Protein Domains 

Unigene gene title 

Ratio of tumor to normal adult tissues 
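
The selection rule described for Table 14A can be sketched in code. This is only an illustrative sketch: the record layout, the field names, and the numeric values in the usage note are assumptions, the underlying Table 12A criteria are not reproduced here, and only the two domain families named in the text (pkinase, peptidase) are used as examples.

```python
from dataclasses import dataclass
from typing import Tuple

# Assumption: the text names only "pkinase" and "peptidase" as example
# domains; the full list of qualifying domains is not given in this section.
EXAMPLE_DOMAIN_PREFIXES = ("pkinase", "peptidase")


@dataclass
class ProbesetRecord:
    """Hypothetical container for one table row."""
    pkey: int                      # Eos probeset identifier
    ratio: float                   # ratio of tumor to normal adult tissues
    tumor_percentile_units: float  # percentile expression value among tumors
    domains: Tuple[str, ...]       # predicted protein domains


def has_example_domain(domains):
    """True if any predicted domain begins with one of the example prefixes."""
    return any(d.lower().startswith(EXAMPLE_DOMAIN_PREFIXES) for d in domains)


def passes_filter(rec, min_ratio=2.0, min_units=40.0):
    """Apply the three thresholds stated in the Table 14A description."""
    return (
        rec.ratio >= min_ratio
        and rec.tumor_percentile_units >= min_units
        and has_example_domain(rec.domains)
    )
```

For instance, a record such as `ProbesetRecord(418007, 3.5, 120.0, ("SS", "hemopexin", "Peptidase_M10"))`, where the ratio and unit values are invented for illustration, would pass, while the same record with a ratio of 1.5 would not.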



Pkey ExAccn UnigeneID

418007 M13509 Hs.83159 

439606 W79123 Hs.58561 

400289 X07820 Hs.2258 

415817 U88967 Hs.78867 

416209 AA236776 Hs.79078 

404996 NM_001333 Hs.87417



SS,hemopexin,Peptidase_M10
TM,7tm_1

hemopexin,Peptidase_M10

SS,TM,Y_phosphatase,carb_anhyd

TM,HORMA

Peptidase_C1






420759 T11832 

458194 AW383618 Hs.265459 

446232 AI281848 Hs.194691 

424905 NM_002497 Hs.153704 

452291 AF015592 Hs.28853 

424086 AI351010 Hs.102267 

425710 AF030880 Hs.159275 






SS,Peptidase_M10,hemopexin 



447254 
431941 
427821 
436211 
403471
410153
457405
421948



NM_004153 Hs.17908 

AK000106 Hs.272227 

AA470158 Hs.98202 

AK001581 Hs.334828 



Hs.8850 

Hs.137516 

Hs.122908 



: AA090421 Hs.5555 

413625 AW451103 Hs.71371 

425695 NM_005401 Hs.159238

438394 BE379623 Hs.27693 

432239 X81334 Hs.2936 

408536 AW381532 Hs.135188 

432226 AW182766 Hs.273558 

419520 AB009303 Hs.90800 

426350 NM_003245 Hs.2022 

421155 H87879 Hs.102267 

423673 BE003054 

450375 AA009647 

418379 AA218940 

457465 AW301344 

450510 AA010056 Hs.242998 

436291 BE568452 Hs.5101 

446353 AI290919 Hs.153661 

435435 T89473 Hs.192328 

425071 NM_013989 Hs.154424 

433322 H50621 Hs.134156 

408908 BE296227 Hs.250822 

444781 NM_014400 Hs.11950 

428479 Y00272 Hs.184572 

406687 M31126 Hs.272620 

423035 AW449679 Hs.156739 

449228 AJ403107 Hs.148590 

423738 AB002134 Hs.132195 

457030 AI301740 Hs.173381 

448995 AI613276 Hs.5662 

415857 AA866115 Hs.127797 

438390 AI422017 



445292 AF081497 

422938 NM_001809 

408771 AW732573 

424296 AI631874 



411274 NM_002776 Hs.69423 



NM_002916
AA828995
AB037783
AA205847



Hs.170623
Hs.23016
Hs.114121



443426 AF098158 



TM,pkinase
Lysyl_oxidase
TM,Sulfate_transp,STAS
TM,Collagen,pkinase
SS,AAA,BAH

pkinase,Furin-like,Recep_L_dom 

TM,7tm_1

Ammonium_transp

J. rM.lrypsir,

filament,HCO3_cotranspfilament

TM,AAA,Ferric_reduct

TM,E1-E2_ATPase,Hydrolase

TM,Band_41,Y_phosphatase 

SS,pro_isomerase

SS,Peptidase_M10,hemopexin

SS,TM,E1-E2_ATPase,Cation_ATPa

Cytidylyltransf

TM,hemopexin,Peptidase_M10

TM,Transglutamin_C,Transglutam

SS,Lysyl_oxidase,Aldose_epim,E

SS,TM,Peptidase_M10,hemopexin

TM,disintegrin,Pep_M12B_propep

Pribosyltran,Sulfatase
TM,7tm_1

DNA_topoisoII,DNA_topoisoIVIGF



Target CAT 

hypothetical protein FLJ10339
cytochrome P450, subfamily XXIV (vi
matrix metalloproteinase 3 (stromel
Homo sapiens cDNA FLJ11381 fis, clo
ESTs, Moderately similar to ALU2_HU



CDC7 (cell division cycle 7, S. cer
lysyl oxidase

solute carrier family, member 4
PDZ-binding kinase; T-cell originat



Hs.30875 
Hs.279682 
Hs.1594 
Hs.47584 
Hs.155140



Homo sapiens cDNA FLJ20099 fis, clo
ESTs

hypothetical protein FLJ10719; KIAA
Target Exon

hypothetical protein FLJ12691
gb:ab03a10.s1 Stratagene fetal reti



HECTpkinase 

lipase,PLAT 

SS,TM,T4_deiodinase

TM,ion_transNB-ARC,CARD4HilO_c

TM,pkinase

PH,lactamase_B

SS,Peptidase_M10,hemopexin
TM,Glyco_transf_8
TM,PAF-AH,p450
SS,TM,trypsin,SEA



connexinhormone_rec,zf-C4 

SS,hemopexin,Peptidase_M10
PDEase

SS,TM,Glycos_transf_3,Carn_acy
Sulfatase 

AAA,PI3_PI4_kinase,PI3Ka,PI3K_ 

integrin_B 

hormone_rec,zf-C4

SS,TM,7tm_1

hormone_rec,zf-C4

TM,GDI,7tm_1

SS,kringle,trypsin

SulfotransferACOX

SS,TM,integrin_B,fn3

SS,Peptidase_M10,fn2,hemopexin

SS,TM,trypsin

TM,7tm_3Ribosomal_L13 

TM,7tm_1 



Hs.9329 
Hs.57664 
420039 NM_004505 Hs.94581 
448733 NM_005629 Hs.187958 



TM,integrin_B,Ricin_B_lectinrr

CARD,SulfotransferDAGKc

SS,TM,SNF,ABC_tran,isodh,pkina 




protein regulator of cytokinesis 1 

ESTs 

ESTs 

deiodinase, iodothyronine, type II
ESTs, Weakly similar to I38022 hypo
serine/threonine kinase 15
GPI-anchored metastasis-associated
cell division cycle 2, G1 to S and
pregnancy specific beta-1-glycoprot
H.sapiens XG mRNA (clone PEP11)
protein related with psoriasis 
airway trypsin-like protease 



NM_002425:Homo sapiens matrix metal 
ESTs 

endothelial cell growth factor 1 (p 



gb:CM0-HT0323-151299-126-b04 HT0323 



Homo sapiens cDNA: FLJ23228 fis, cl
NM_001821*:Homo sapiens choroiderem 
plasminogen activator, urokinase 
zinc finger protein 
integrin, beta 4 

matrix metalloproteinase 9 (gelatin 
C4000647*:gi|4758508|ref|NP_004253. 
Homo sapiens cDNA: FLJ20869 fis, cl
Homo sapiens cDNA: FLJ23006 fis, cl
Target Exon

gb:CM4-NT0007-130500-551-f06 NT0007
chromosome 20 open reading frame 1 
Homo sapiens mRNA full length inser 



ier family 6 (m 






444946 AW139205 

450841 AI741465 

428262 AI651324 

435399 AA679463 

446733 AA863360 

449746 AI668594 

418844 M62982 

414581 AA256213 

431629 AU077025 

445873 AA250970 

438113 AI467908 

422689 AW856665 

439453 BE264974 

413582 AW295647 

410664 NM_006033

456456 AA477609 

413273 U75679 

426343 NM_014642
403763 

408380 AF123050 
401230 

418030 BE207573 

445640 AW969626 

432865 AI753709 

419667 AU077005 

406671 AA129547

412530 AA766268 

431890 X17033 

404184 NA 

428450 NM_014791 

425698 NM_016112 

453331 AI240665 

444826 AI674482 

414987 AA524394 



Hs.156457 SS,TM,abhydrolase
Hs.270515 pro_isomerase
Hs.7298 death,pkinase



Hs.26040 TM,p450 
Hs.176588 SS,p450 



Hs.6566 
Hs.71331 
Hs.65370 
Hs.89563 
Hs.75257 
Hs.169387

Hs.44532 

Hs.83321 

Hs.152484 
Hs.92208 
Hs.285754 
Hs.256273 
Hs.271986 



SS,TM,lipoxygenase,PLAT
TM,Carn_acyltransf,Choline_kin
SS,tRNA_antiSH2,SH3,pkinase
SS,rrm,PABPpkinase,14-3-3,rrm
TM,7tm_1 

helicase_C,SNF2_Nhelicase_C 

SS,AAA

carb_anhydrase

TM,lipase,PLAT

TM,ig,pkinase
TM,SCAN7tm_1
TM,7tm_1

TM,ubiquitin7tm_3,ANF_receptor
SS,TM,ion_trans,IQ
SS,TM,Peptidase_S26,Bombesin
TM,alpha-amylase

TM,ion_transNB-ARC,CARD,WD40,m 

SS,TM,disintegrin,Reprolysin,P 

TM,pkinase,Plexin_repeat,Sema,

abhydrolase

vwa,FG-GAP,integrin_A

SS,TM,7tm_1

pkinase,KA1

TM,pkinase,ion_trans

TM,disintegrin,Pep_M12B_propep

pkinase,SAM



hypothetical protein FLJ22408
ESTs

biphenyl hydrolase-like (serine hyd
gb:ac50c03.s1 Stratagene hNT neuron
ESTs, Weakly similar to fatty acid
ESTs, Weakly similar to CP4Y_HUMA



gb:RC3-CT0297-29010D-013J03 CT0297 
thyroid hormone receptor interactor 
hypothetical protein MGC5350 
lipase, endothelial 
nuclear cap binding protein subunit 



NM_014191*:Homo sapiens sodium chan
neuromedin B

ESTs, Weakly similar to KIAA0227 [H
ESTs, Weakly similar to I38022 hypo
a disintegrin and metalloproteinase
met proto-oncogene (hepatocyte grow
hypothetical protein FLJ13346



NM_030903*:Homo sapiens olfactory r
KIAA0175 gene product
polycystic kidney disease 2-like 1



hypothetical protein FLJ14950 






421733 AL119671 

452220 BE158006

417975 AA641836 

400301 X03635 

408938 AA059013

411643 AI924519 

446638 AL133063 

430129 BE301708 

417655 AA780791 



Hs.256301 

Hs.61753 

Hs.1420 

Hs.212296 

Hs.30085 

Hs.1657 

Hs.22607 

Hs.192570

Hs.15783 

Hs.233955 

Hs.14014 

Hs.170378 

Hs.145170



432777 AA564991 

421247 BE391727 

425465 L18964 

419281 H96452 

434205 AF119861 

453406 AI192987 

435542 AA687376 

443151 AI827193 

431630 NM_002204 

422310 AA316622 

441954 AI744935 

414907 X90725 

439810 AL109710 

429359 W00482 

432284 AA532807 

452947 AW130413 

423229 AC003965 

453941 U39817 



416763 AI908127 

414733 BE514535 

443171 BE281128 

430637 BE160081 

452367 U71207 

408308 AL033377 

417900 BE250127 



TM,cNMP_bindingtrypsin 

PAN,kringle,trypsin

SS,TM,ig,pkinase

TM,integrin_A,FG-GAP

SS,trypsin

TM,Oest_recep,zf-C4,hormone_re



Hs.77597 
Hs.85568 
Hs.2399 
Hs.105822

Hs.125532
Hs.36820 
Hs.6793 

Hs.213436 

Hs.226117 

Hs.19574 

Hs.82285 

Hs.57735 

Hs.79748 

Hs.77171 

Hs.9030 

Hs.256290 

Hs.29279 

Hs.44197 

Hs.82906 



TM,lipoxygenase,PLAT

SS,pro_isomerase

DNA_topoisoII,DNA_topoisoIVIGF



SS,TM,Peptidase_M10,hemopexin 

alpha-amylase 
SS,trypsin

DEAD,HRDC,helicase_C

TM,p450Ets

DNA_ligase

Glyco_transf_29

TM,linker_histone7tm_1

TM,EPH_lbd,fn3,pkinase,SAM

TM,AIRS,formyl_transf,GARS

IP_transSH2,SH3

TM,alpha-amylase7tm_1

TM,MCMHeme_oxygenase

SS,TM,7tm_1,rrm

S_100Peptidase_M16



hypothetical protein FLJ22028
Homo sapiens mRNA; cDNA DKFZp434P11
hypothetical protein FLJ20401
hypothetical protein FLJ14813



TM,tRNA-synt_1,SPRYF5_F8_type_

TM,pkinase,DAG_PE-bind,OPR,pM

TM,E1-E2_ATPase,HMA,Hydrolase

SH3,efhand,C2,PH,RhoGEFAAA,PG

pkinase,Furin-like,Recep_L_dom

pkinase,RhoGEF,ig,PH,SH3

DNA_mis_repair,HATPase_cAcylph

SS,TM,FG-GAP,integrin_A

SS,TM,fn3,ig,pkinase,Ribosomal

TM,Band_7,AAA,cdc48_N

SS,TM,pkinase,POLO_box 



integrin, alpha 3 (antigen CD49C, a 
cytochrome P450, subfamily IIS, pol 
Fanconi anemia, complementation gro 
polo (Drosophila)-like kinase
EST 

matrix metalloproteinase 14 (membra 
ESTs 

gb:xf50f04.x1 NCI_CGAP_Gas4 Homo sa
protease, serine, 26
Bloom syndrome
platelet-activating factor acetylhy
ligase I, DNA, ATP-dependent
ESTs, Weakly similar to A34087 hypo
H1 histone family, member 0
hypothetical protein MGC5459 



TONDU 

S100 calcium-binding protein A11 (c
eyes absent (Drosophila) homolog 2 
hypothetical protein DKFZp564D0462 
CDC20 (cell division cycle 20, S. c 




424490 AJ278016 

412834 R77123 

410855 X97795 

418804 AA809632 

447674 BE270640 

450663 H43540 

408805 H69912 

429415 NM_002593

447827 U73727 

428273 AI867228 
404274 
403133 

440249 AI246590

438580 AA811262 
405400 

427375 AL035460 

423453 AW450737 

433716 AA608808 



SNF2_N,helicase_C 

HATPase_c,HSP90,PHD,zf-C2H2 

TM,pkinaseras,arf

SS,TM,RNase_HII 

TM,pkinase 

SS,CUB,NTR,MAM,TIL,TILa,vwd,EP 
SS,TM,Y_phosphatase,fn3,ig,MAM 
Glycos_transf_2 
SS,TM,pkinase,fn3 
pkinase,K_tetra,Band_41,RhoGEF
VHL,TatD_DNase



Hs.99915 
Hs.154196 
Hs.106778 
Hs.192738 



456327 H68741 

422429 AA310527 

402974 NM_001501

458016 AW188099 

452194 AI694413 

428028 U52112 

427747 AW411425 

452841 T17431 

449539 W80363 

418140 BE613836 

430076 AA465115 

425749 AW328587 

425855 AF135025 



SS,TM,trypsin



SS,Granin,CDP-OH_P_t
Acylphosphatase
TM,hormone_rec,Androgen_recep,
DNase_I,K_tetra

TM,E1-E2_ATPase,HydrolaseE1-E2
pro_isomerase

TM,pkinase,Sema,Plexin_repeat,

TM,Glyco_transf_8

pkinase,RGS,PHpkinase,PH,RGS

GnRHhormone5,hormone4

pkinase

TM,7tm_3,ANF_receptor,sushi

TM,pkinase,MBD

pkinase,lipoxygenase,PLATlipox



ankyrin repeat domain 3 

Homo sapiens cDNA: FLJ23006 fis, cl

RAD54 (S. cerevisiae)-like

gb:nz17h04.s1 NCI_CGAP_GCB1 Homo sa

cyclin-dependent kinase 2

ribonuclease HI, large subunit 

vaccinia related kinase 1 



protein tyrosine 



NM_007196:Homo sapiens kallikrei
metallocarboxypeptidase CPX-1 
CGI-09 protein 



ATPase, Ca transporting, type 2C, m 
ESTs 

gb:nv35f03.r1 NCI_CGAP_Br5 Homo sap



Hs.159448 
Hs.159679
Hs.118890 



DEAD/H (Asp-Glu-Ala-Asp/His) box po 
ESTs 

microfibrillar-associated protein 2
KIAA1836 protein
surfeit 2
kallikrein 12

glycogen synthase kinase 3 alpha



179138.1 AA809632 AI917245 AI701732 AA228406

197736.1 AW888650 AW888651 BE149946 BE149948 BE149951 BE149947 AW888549 AA281840 AA281822 AW888652

216469.1 AA310527 AW962295 Z44865 H06641

219896.1 AW856665 AA315006 AW954733

405576.1 AA679463 AW813779 AW813709

45662.1 AI422017 AI422945 AI363249 AI423113 AI925592 AI420795 AI208187 AI423279 AI423645 AI424090 AI359637 AL044732 D17003

467651.1 AA828995 AA834879 AI926351

939810.1 AW130413 AI932362

1252971.1 BE152428 AW855572 AW855607

333127.1 AA504860 AA504911



TABLE 14C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

400666 8118496 Plus 17982-18115,20297-20456 

401230 9929527 Minus 33835-34006,34539-34592,36461-36745,43925-49393,52604-52758

401486 7341763 Plus 32535-32756,36281-36540,40791-40933,44018-44179 

402337 6957691 Plus 4116-4286,16811-16973,17107-17256,19715-20040,22029-22205 

402481 9797406 Plus 87891-88991 

403133 7331427 Plus 

403471 9930659 Minus 

403763 7229888 Minus 

404184 4581418 Minus 

404274 9885189 Plus 

408400 9256298 Plus 
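
The Nt_position column packs each predicted exon as a start-end pair, with pairs separated by commas. A minimal sketch of reading one row of this table is given below. The function names are hypothetical, and treating both coordinates as inclusive is an assumption that the legend does not state.

```python
def parse_nt_position(field):
    """Split an Nt_position field such as '17982-18115,20297-20456'
    into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def parse_row(line):
    """Parse 'Pkey Ref Strand Nt_position'; rows whose position field
    is missing yield an empty exon list."""
    fields = line.split()
    pkey, ref, strand = int(fields[0]), fields[1], fields[2]
    positions = fields[3] if len(fields) > 3 else ""
    return {"pkey": pkey, "ref": ref, "strand": strand,
            "exons": parse_nt_position(positions)}


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)
```

Applied to the first row of the table, `parse_row("400666 8118496 Plus 17982-18115,20297-20456")` gives two exons, (17982, 18115) and (20297, 20456), covering 294 nucleotides under the inclusive-coordinate assumption.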
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TABLE 15A: 752 GENES UP-REGULATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT CERVIX 

Table 15A lists about 752 genes up-regulated in cervical cancer compared to normal adult cervix. These were selected as for Table 12A, except that the ratio was greater than or equal to 7.0, the denominator was the median value for three non-malignant cervical specimens, and the 96th percentile value amongst cervical cancer [...]




Pkey ExAccn UnigeneID Unigene Title



414915 NM_002462 Hs.76391

411248 AA551538 Hs.334605 

428227 AA321649 Hs.2248 

421508 NM_004833 Hs.105115 

443639 BE269042 Hs.9661 

454390 AB020713 Hs.56966 



413503 
411669 
414132 



449722 
414812 
408405 
432917 
412530 



424878 
412140 
444371 



BE612676 Hs.303116

AI801235 Hs.48480 

AK001379 Hs.121028 

BE382657 Hs.21486 

BE265489 Hs.3123 

BE304571 Hs.89529 

AW247529 Hs.6793 

BE280074 Hs.23960 

X72755 Hs.77367 

AK001332 Hs.44672 

NM_014125 Hs.279812 

AA766268 Hs.266273 

AW301344 Hs.122908 

AW847814 Hs.289005 

Y09397 Hs.227817 

AW450671 Hs.189284 

H57111 Hs.221132 

AA219691 Hs.73625 

BE540274 Hs.239 

BE207573 Hs.83321 



myxovirus (influenza) resistance 1, homolog of murine 
Homo sapiens cDNA FLJ14408 fis, clone HEMBA1004341
small inducible cytokine subfamily B (Cys-X-Cys), mem
absent in melanoma 2

proteasome (prosome, macropain) subunit, beta type, 1
KIAA0906 protein
proliferating cell nuclear antigen
KIAA1488 protein

heat shock 70kD protein 5 (glucose-regulated protein,

stromal cell-derived factor 2-like 1 

ESTs 

hypothetical protein FLJ10549

signal transducer and activator of transcription 1, 9

lethal giant larvae (Drosophila) homolog 2

aldo-keto reductase family 1, member A1 (aldehyde red 

platelet-activating factor acetylhydrolase, isoform I 






PRO0327 protein 
hypothetical protein FLJ13346 
DNA replication factor 

Homo sapiens cDNA: FLJ21532 fis, clone COL06049
BCL2-related protein A1 
Target Exon 



RAB6 interacting, kinesin-like (rabkinesin 6) 



400196 

416795 AI497778 Hs.20509 

424365 AF011333 Hs.153563

438011 BE466173 Hs.145696 

428368 BE440042 Hs.83326 

436923 AW293704 Hs.122658 

415791 H09366 Hs.78853 

448775 AB025237 Hs.388 

435647 AI653240 Hs.49823 

431049 AA846576 Hs.103267 

429486 AF155827 Hs.203963 

428433 AA521410 Hs.41371 

418322 AA284166 Hs.84113 

417308 H60720 Hs.81892 

429574 BE268321 Hs.208912 

407204 R41933 Hs.140237 

408901 AK001330 Hs.48855 

438899 AF085833 Hs.135624 

456362 AW973003 Hs.179909 

438598 AI805943 Hs.326067 

408908 BE296227 Hs.250822 

427488 M91401 Hs.178658 

U30872 Hs.77204 

AW292539 Hs.30177 

AB006532 Hs.31442 

AI538438 Hs.159087 

AA551196 Hs.188952 

AF182294 Hs.241578 
R34356

AA004879 Hs.187820

AB002375 Hs.155814

AI420611 Hs.127832

AK000568 Hs.43654

AA704899 Hs.291651

AW975570 Hs.97387

AI420183 Hs.171077



CDC7 (cell division cycle 7, S. cerevisiae, homolog)-
Eos Control

HBV pX associated protein-8
lymphocyte antigen 75 
splicing factor (CC1.3) 
matrix mi 
ESTs 



cyclin-dependent kinase inhibitor 3 (CDK2-associated
KIAA0101 gene product
hypothetical protein MGC861

ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE
hypothetical protein FLJ10468



RAD23 (S. cerevisiae) homolog B

NM_007057*:Homo sapiens ZW10 interactor (ZWINT), tran

centromere protein F (350/400kD, mitosin)



KIAA0377 gene product 



ESTs, Weakly similar to T21259 hypothetical protein F 
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413943 
407720 
425316 
423673 
419777 






AB035898 Hs.150587 

AF052573 Hs.241517 

AI638516 Hs.22630 

U39817 Hs.36820 

AW075485 Hs.286049 

AI126271 Hs.49433 

AW294416 Hs.144687 

AB037776 Hs.38002 

AA354977 Hs.191565 

BE003054 Hs.1695 

D60134 Hs.270975 

R66282 Hs.20247 

AI583187 Hs.9700 

AA687538 Hs.38972 

AK000142 Hs.101774 

W27202 Hs.82327 

AW974944 Hs.200577 

AB012113 Hs.16530 

AI435128 Hs.181369

NM_000878 Hs.75596 

AU077025 Hs.265827 

AA678267 Hs.117115 

AW514601 Hs.156110 
AI420311 Hs.126550 
X02308 Hs.82962 
AA837782 Hs.321058 
W31092 Hs.55847 
NM_013330 Hs.274479
J04088 Hs.156346
AW972872 Hs.293736
AI378817 Hs.191847
AA806113 Hs.189025
AW960673 Hs.177530
AI474533 Hs.170528
AW043637 Hs.21766 
BE081342 Hs.283037 
Hs.62 



Hs.34487



432721 
416975 
413314 



412059 
407756 
437056 



418205 
433301 



kinesin-like protein 2 

polymerase (DNA directed), theta 

cofactor required for Sp1 transcriptional activation, 



ESTs, Weakly similar to YZ28_HUMAN HYPOTHETICAL PROTE
Homo sapiens cDNA FLJ12981 fis, clone MT2RP2006454 
KIAA1355 protein 

ESTs, Moderately similar to T14342 NSD1 protein - mou 
matrix metalloproteinase 12 (macrophage elastase) 



57 alpha-1C-adrenergic re 



hypothetical protein FLJ23045



small inducible cytokine subfamily A (Cys-Cys), membe

ubiquitin fusion degradation 1-like

interleukin 2 receptor, beta

interferon, alpha-inducible protein (clone IFI-6-16)



suppressor of K transport defect 1 



443957 AA521049 

418803 U50079 Hs.88556

434094 AA305599 Hs.238205

420139 NM_005357 Hs.95351

444783 AK001468 Hs.62180 

433255 AI274270 Hs.96840 

431838 AI097229 Hs.217484 

449801 AA477355 Hs.288300 

447078 AW885727 Hs.301570 

441240 AA923749 Hs.132442 

439398 AA284267 Hs.221504 
404630 

408321 AW405882 Hs.44205 

426427 M86699 Hs.169840 

413278 BE563085 Hs.833 
403055 

456614 AV653110 Hs.106650

425261 BE385099 Hs.334727 

439926 AW014875 Hs.137007 

411263 BE297802 Hs.69360 

451141 AW772713 Hs.247186 

447390 X95384 Hs.18426 

419828 T81422 Hs.14922 

428147 AW629965 Hs.234983 



BE350012 Hs.248355 

AL121478 Hs.180532 

NM_004131 Hs.1051 
BE081585 

AA489166 Hs.156933 

AW016812 Hs.200266 

AA032279 Hs.61635
Hs.81226 
Hs.44648 

AA317962 Hs.249721 

AA116021 Hs.38260 
AI147061 

AI307416 Hs.184675 

W07318 Hs.240 

AA010907 Hs.184456 

L21715 Hs.83760 

AI026718 Hs.16954 

AW296280 Hs.152016 

AA757900 Hs.270823 

BE301871 Hs.4867 

AW409701 Hs.1578 



ATP synthase, HI . 
ESTs, Moderately similar to ALUC_HUMAN !!!! ALU CLASS
ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC S
HSPC039 protein

guanylate binding protein 1, interferon-inducible, 57
hypothetical protein FLJ23412
histone deacetylase 1
hypothetical protein PRO2013



anillin (Drosophila Scraps homolog), actin binding pr
KIAA1527 protein

ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE
hypothetical protein FLJ23231



Target Exon 

TTK protein kinase 
interferon-stimulated protein, 15kDa
C2002219*:gi|12737280|ref|XP_006682.2| keratin 18 [Ho
hypothetical protein FLJ20533



translational inhibitor protein p14.5 



glucose phosphate isomerase 

granzyme B (granzyme 2, cytotoxic T-lymphocyte-associ

gb:QV2-BT0635-210400-156-b07 BT0635 Homo sapiens cDNA 



six transmembrane epithelial antigen of the prostate 



ESTs, Moderately similar to PC4259 ferritin associate 



gb:ok33a11.s1 Soares_NSF_F8_9W_OT_PA_P_S1 Ho
ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY
M-phase phosphoprotein 1



Homo sapiens cDNA: FLJ22140 fis, clone HEP20977
ESTs, Weakly similar to S65657 alpha-1C-adrenergic re
mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylg
baculoviral IAP repeat-containing 5 (survivin)
baculoviral IAP repeat-containing 5 (survivin) 
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410245 C17908 Hs.194125 ESTs 
424927 AW973666 Hs.153850 hypothetical protein C321D2.4 
~ Hs.155218 



419945 
442159 
436169 
407804 



AU077228 Hs.77256 

U66097 Hs.86724 

AL137257 Hs.23458 

AA866115 Hs.127797 

AW291212 Hs.293943 

T07353 Hs.7948 

I AI765047 Hs.99736 

i AA579297 Hs.26937 

I N77221 Hs.187824 

I AA863400 Hs.23054 

l NM_006769 Hs.3844 

I AW392550 Hs.9280 

AW608906 Hs.334767 

AW290975 Hs.118923

AW163390 Hs.278554 

AA888311 Hs.17602 

AF228603 Hs.39957 

AI031771 Hs.132586 



enhancer of zeste (Drosophila) homolog 2
GTP cyclohydrolase 1 (dopa-responsive dystonia) 
Homo sapiens cDNA: FLJ23015 fis, clone LNG00818
Homo sapiens cDNA FLJ11381 fis, clone HEMBA1000501
hypothetical protein MGC11266 
ESTs 
ESTs 

brain and nasopharyngeal carcinoma susceptibility pro 



proteasome (prosome, macropain) subunit, beta type, 9
hypothetical protein MGC5629 



409231 AA446644 



429112 
445655 
419138 



AA873830 
U48508 
AI809057 
AK001057 



Hs.274268



443491 
42433S 
419741 



AA055449 Hs.63187 

NM_006187 Hs.56009

T85314 Hs.42644 

BE293233 Hs.129771 

AA371307 Hs.125056 

BE409813 Hs.152337 

AW015644 Hs.155005 

BE465884 Hs.280728 

AF129534 Hs.49210 

AA737415 Hs.152826 

AI905780 Hs.198272 

AA902780 Hs.119326 

AW499665 Hs.9456 

BE257148 Hs.145416 

NM_007019 Hs.93002 

AI686945 Hs.272062 

AU075617 Hs.16251 



446099 
451066 
409235 
451730 
428054 
441636 



442737 AB002319 Hs.8663 

409113 AA074397 

415782 AA169345 Hs.123177 

417958 AA767382 Hs.193417 

402539 AW502761 Hs.30909 

413677 AW503116 Hs.301819 

414706 AW340125 Hs.76989 

421632 AA825426 Hs.238832 

438995 AI277986 Hs.164875 

400289 X07820 Hs.2258 

432363 AA534489 

451655 H85689 Hs.225560 

429237 AA448417 Hs.104990 

427719 AI393122 Hs.134726 

444665 BE613126 Hs.47783 

400080 

424517 AI539443 Hs.137447 



Homo sapiens cDNA FLJ12381 fis, clone MAMMA1002566 
pleckstrin 2 (mouse) homolog 
Target Exon 



GA733-2 antigen; epithelial glycoprotein (EGP) (KSA) 

POU domain, class 2, associating factor 1 

B cell linker protein

ryanodine receptor 1 (skeletal) 

immunoglobulin heavy constant mu 

Homo sapiens cDNA FLJ10195 fis, clone HEMBA1004771 

ESTs, Weakly similar to ALUC_HUMAN !!!! ALU CLASS C W

2-5-oligoadenylate synthetase 3 (100 kD) 

thioredoxin-like

ESTs 



Target CAT 

Huntingtin-interacting protein A 

SWI/SNF related, matrix associated, actin dependent r 

endoglycan 

ubiquitin carrier protein E2-C
ESTs 

cleavage and polyadenylation specific factor 3, 73kD
Homo sapiens. Similar to RIKEN cDNA 5730578N08 gene, 
KIAA0321 protein 

gb:zm85a05.r1 Stratagene ovarian cancer (937219) Homo



ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE 
ESTs 

matrix metalloproteinase 10 (stromelysin 2) 

gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens cDNAclone3' 

ESTs 

ESTs 

ESTs 



T93096 Hs.17126 

AI758560 Hs.206132 

AA188827 Hs.7988 

AF095687 Hs.26937 

AI948688 Hs.266619 

AA081846 Hs.7921 

AI005270 Hs.123543

W74048 Hs.1765

AI815728 Hs.19980

AI056769 Hs.133512 

AB000450 Hs.82771 

AW328081 Hs.6817 

AW136973 Hs.288516 
AW812535 

AA687376 Hs.269533 

AA504359 Hs.110067



ESTs, 'Ai.aH' similar to KIAA0970 protein [H.sapiens] 
Eos Control 

Homo sapiens cDNA FLJ12169 fis, clone MAMMA1000B43
NM_002675:Homo sapiens promyelocytic leukemia (PML),
hypothetical protein MGC15912 



Homo sapiens mRNA; cDNA DKFZp566E183 (from clone DKFZ 



inosine triphosphatase (nucleoside triphosphate pyrop

g^CM^TO^^^ 
ESTs 

ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC S
AW795793 Hs.179827
AW873704 Hs.320831 
AF155510 Hs.44227 



410146 AW592655 



Hs.179774 

M13509 Hs.83169 
M34516 

A165U74 Hs.163944

AW451693 Hs.220826 
AA425488 

AW207002 Hs.134342

AW274673 Hs.279706 

AA306007 Hs.59461 

U92638 Hs.163138

AF025441 Hs.116206

J05070 Hs.151738 

L26851 Hs.851 

BE145360 Hs.190064 

H38587 Hs.82295 

AA521458 Hs.192738 



440572 AW183778 Hs.249584 



433160 
408809 
410174 
424792 
422406 
424687 
413809 



444314 
426125 
430848 



NM_012068 Hs.9754

AA310527

AI140497



gb:Human omega light chain pre 



ESTs 

gb:zw46d02.r1 Soares_total_fetus_Nb2HF8_9w Homo sapie
TASP for testis-specific adriamycin sensitivity prote
ESTs, Weakly similar to A47582 B-cell growth factor p
DKFZP434C245 protein 

origin recognition complex, subunit 5 (yeast homolog) 
Opa-interacting protein 5 

matrix metalloproteinase 9 (gelatinase B, 92kD gelati
integrin, alpha E (antigen CD103, human mucosal lymph
ESTs, Weakly similar to I38022 hypothetical protein [
dedicator of cyto-kinesis 1
ESTs 

NM_021624:Homo sapiens histamine H4 receptor (HRH4), 

E T >a M y i„fl rt M HUMAN M OSIN HEW IAII 

solute carrier family 29 (nucleoside transporters), m

activating transcription factor 5 

gb:EST181333 Jurkat T-cells V Homo sapiens cDNA 5' en

gb:ow76b09.s1 Soares_fetal_liver_spleen_1NFLS_S1 Homo

FAT tumor suppressor (Drosophila) homolog 

:ochlea Homo sapiens cDNA 



AA730407 Hs.159156
AF185611 

AI916685 Hs.194601 

AA372264 Hs.273193 



439685 AW956781 Hs.293937 



Hs.44566 






Hs.124244
Hs.301005 
Hs.258110 
Hs.173524 
Hs.191933 
Hs.28555 
Hs.192843 
Hs.103812 
Hs.174308 
Hs.105039 



437267 AW511443 

455978 AI310151 

435851 AA700946 

452243 AL355715 

441703 AW390054 

414001 AI610347 

436669 AA535975 

421502 AF111856 

417087 AA193193 

455855 BE147440 

410390 AA876905 

418526 BE019020 

442660 AW138174 

436186 BE390717 Hs.5074 

425773 NM_015556 Hs.172180

413476 U25849 Hs.75393 

418347 AA216419 

448752 AA593867 Hs.300842 

440349 AA884196 Hs.31475 

431363 M86528 Hs.266902 

430752 AA485330 Hs.303278 

436523 BE612990 Hs.5212 

415740 N80486 Hs.39911 

411930 F06485 Hs.7740 

430832 AI073913 Hs.100686 

452234 AW084176 



Hs.125286 
Hs.85838 
Hs.130651 



Hs.223296 
Hs.57749 
Hs.35380 



440463 AI733087 
420851 AA281062 
445326 AI220072 



Target Exon 
ESTs 

ESTs, Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTE

WM1641 C prote!n 
hypothetical protein FLJ20411
ESTs 

purine-rich element binding protein B 



leucine zipper protein FKSG14 

ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY 
clone IMAGE:3453347, mRNA, partial cds



solute carrier family 16 (monocarboxylic acid transpo



434957 AF283775 

407292 AA876638 

459109 AW292447 Hs.140821 

457892 AA744389 



Hs.149723 
Hs.129994 
Hs.29493 
Hs.165893 
Hs.121573 



si NCI_CGAP_Pr1 Homo sapiens cDNA clone si 
KIAA1608 protein 

Homo sapiens cDNA FLJ13872 fis, clone THYRO1001322

neurotrophin 5 (neurotrophin 4/5) 

ESTs 

single-strand selective monofunctional uracil DNA gly
Homo sapiens mRNA for FLJ00089 protein, partial cds
oxysterol binding protein-like 1
ESTs, Weakly similar to JE0350 Anterior gradient-2 [H
ESTs, Weakly similar to I38022 hypothetical protein [ 
synaptic nuclei expressed gene 2; KIAA1011 protein 
x 001 protein 

gb:nz45e05.s1 NCI_CGAP_Pr12 Homo sapiens cDNA clone s
ESTs

gb:ny51e10.s1 NCI_CGAP_Pr18 Homo sapiens cDNA clones

ESTs 

ESTs 

hypothetical protein FLJ20142 
ESTs 

ESTs, Weakly similar to TRHY_HUMAN TRICHOHYALI [H.sap
415853 H06016



Hs.194718 
Hs.100855 
Hs.123664 



417037 BE083936 

449317 AW293413 Hs.132906 

436588 AA759233 Hs.126506

409261 BE315042 Hs.19210 
401069 

414065 AW515373 Hs.271249 

409902 AI337658 Hs.156351 

432258 AW973078 Hs.293039 

438581 AW977766 Hs.292133 



455410 AW936678 



419474 AW968619 1 
406464 

407881 AW072003 I 

427258 AA400091 I 

433840 AA129782 I

423642 AW452650 I 

457008 AA410446 I 

435099 AC004770 I 

451846 T65840 

419988 W39388 



il protein MGC11308 

1 f 1 1 1 ) 4 !| 1 |AF302150_1 (AF30 

Homo sapiens cDNA FLJ13580 fis, clone PLACE1008851



ESTs, Moderately sim' 
NM_005805:Homosa| 
AF15q14 protein 
ESTs 

gb:Homo sapiens full length insert cDNA clone ZB94A08 
gb:PM2-DT0023-08030u-004-a04 DT0023 Homo sapiens cDNA 
Target Exon 



flap structure-specific endonuclease 1 



455601 AI368680 Hs.816 

441075 AA915991 Hs.179214 

451107 AA235108 Hs.17639 
404649 

420897 AW139261 Hs.232280 

418867 D31771 Hs.89404 

420298 AI199510 Hs.267912 

449893 T97999 Hs.18214 

420101 AW500529 Hs.95180 

428166 AA423849 Hs.79530 

420022 AA256253 Hs.120817 

444020 R92962 Hs.35052 
454765 AW819629 

415021 R54409 Hs.301693 

418506 AA084248 Hs.85339 

415009 C75253 Hs.220950 

428845 AL157579 Hs.153610

433348 AA877996 Hs.125376 
417881 AI879117 

446354 AW449650 Hs.202249 

427018 AA397538 Hs.136280 
43441C AA632644 

448072 AI459306 Hs.24908 

457322 AI815486 Hs.243901 

424317 AI865032 Hs.26017 

433001 AF217513 Hs.279905 

404112 BE302729 Hs.173162 

433334 AI927208 Hs.231958 

434960 AW374941 Hs.72545 

431658 BE409917 Hs.266935 

443081 H86858 Hs.132909

429432 AI678059 Hs.202676 

452706 AW449390 Hs.257150 

437044 AL035864 Hs.69517 

430780 N95102 Hs.334858 

426793 X89887 Hs.172350 

418379 AA218940 Hs.137516 

431405 AI470895 Hs.252574 
405454 

438362 AA805678 Hs.12325 
401940 

424834 AK001432 Hs.153408 
442432 BE093589 Hs.38178 
459086 AA021163 Hs.22287 
418653 AI734064 Hs.136212 
444152 AI125694 Hs.149305 
437534 AA814471 Hs.291800 
435074 AI760944 Hs.116937
406722 H27498 Hs.293441
442829 AW263123 Hs.127554
431675 AA699965 Hs.202375 
447164 AF026941 Hs.17518 



Homo sapiens, clone MGC:17 
Target Exon 

SRY (sex determining region 1 
els variant gene 3 
Homo sapiens ubiquitin protei 
Target Exon 



1, mRNA, complete cds



ESTs 

msh (Drosophila) homeo box homolog 2 

ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAMILY SQ S

ESTs, Weakly similar to B34087 hypothetical protein [ 

KIAA0767 protein 

M5-14 protein 

ESTs 

ESTs 

gb:RC5-ST0293-140200-014-H05 ST0293 Homo sapiens cDNA 
Homo sapiens, clone IMAGE3638994, mRNA, partial cds 
G protein-coupled receptor 39 
ESTs 

KIAA0751 gene product 

ESTs, Weakly similar to JC5314 CDC28/cdc2-like kinase 
gb:au54g09.y1 Schneider fetal brain 00004 Homo sapien



Homo sapiens cDNA FLJ20738 fis, clone HEP08257



tRNA selenocysteine associated protein 

ESTs 

ESTs 

synaptonemal complex protein 2 

ESTs, Moderately similar to SUR1_HUMAN SURFEIT LOCUS

cDNA for differentially expressed C016 gene 

hypothetical protein MGC12250 

HIR (histone cell cycle regulation defective, S. cere 



Target Exon 

Homo sapiens cDNA FLJ10570 fis, clone NT2RP2003117
hypothetical protein FLJ23468



hypothetical protein MGC2603 



Homo sapiens SNC73 protein (SNC73) mRNA, complete cds 



Homo sapiens cig5 mRNA, partial sequence



421133
407605 
441370 
437966 
426360 






W92885 Hs.143408 

AA814971 Hs.257634 

W03512 Hs.6479 

AI242433 Hs.270085 

AW891130 Hs.38173 

AW290981 Hs.211296 

AA053486 Hs.20315

AW137133 Hs.245867 

AI553633 Hs.337139 

AF108138 Hs.112160

AW970116 Hs.310616 

AF207881 Hs.155185 

AL360257 Hs.213493 

AA553612 Hs.324696

W80462 Hs.270521 

AI146771 Hs.158008

R10799 Hs.191990 

AW016610 Hs.129911 

H69912 Hs.48269 

AA194554 Hs.183434

AA410506 Hs.27973 

NM_016578 Hs.20509 

AW055308 Hs.31803 

AW131759 Hs.248286 

NM_001197 Hs.155419

AA236776 Hs.79078 

AW241867 Hs.127728 
AI279186 

AI034357 Hs.211194 

AI192657 Hs.143897 



443117 AI248826 
457434 AW628192 
442505 AW003775 



417739 Z43995 

415961 H10983 

424042 Y10601 

451035 AU076785 

447155 AA100605

412668 AA456195

458042 AW058464 

456530 AL049437 



W91903 



430413 AW842182 
423494 AW504365 
415018 U49395 



Hs.18851 
Hs.198248 
Hs.126711
Hs.250618 



Hs.121557
Hs.10056 
Hs.6430 
Hs.100292 



Hs.154443 
Hs.241392 
Hs.24143 
Hs.77807 



434932 BE613162 

420991 AW504814 

457854 BE547674 

455994 BE179190 

423426 AW389579 

429568 AI088S91 
404110 

424441 X14850 

433155 AL037035 

414839 X63692 

406867 AA157857



456248 AL035786 

403152 AA576664 

407649 BE066724 

448387 AI874402 

433671 AW138797 

425891 AI041717 

447347 AA570056 

439079 AF085937 



Homo sapiens DNA helicase homolog (PIF1) mRNA, partia
ESTs 

cytosolic ovarian carcinoma antigen 1 

Homo sapiens mRNA full length insert cDNA clone EUROI

KIAA1594 protein 

E T ' nilyimilar to ALU2_HUMAN ALU SUBFAMILY SB S 



ESTs 

vaccinia related kinase 1 

ATPase, H+ transporting, lysosomal (vacuolar proton pu

KIAA0874 protein 

HBVpX associated protein-8 

ESTs, Weakly similar to N-WASP [H.sapiens] 



gb:qm24a04.x1 NCI_CGAP_Lu5 Homo sapiens cDNA clone 3' 
ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX S 
dysferlin, limb girdle muscular dystrophy 2B (autosom
gb:EST22383 Adipose tissue, white II Homo sapiens cDN 
ESTs 

hypothetical protein FLJ10875
UDP-Gal:betaGlcNAc beta 1,4- galactosyltransferase, p 
ESTs, Weakly similar to I38588 reverse transcriptase 
UL 16 binding protein 2 

gb:HSC1QB121 normalized infant brain cDNA Homo sapien 
ESTs 

plastin 1 (I isoform)

ESTs, Weakly similar to DP1_HUMAN POLYPOSIS LOCUS PRO 
hypothetical protein FLJ14621
protein with polyglutamine repeat; calcium (ca2+) home
Homo sapiens mRNA; cDNA DKFZp586E1120 (from clone DKF
hypothetical protein FLJ13117
ESTs 

budding uninhibited by benzimidazoles 1 (yeast homolo
gb:MRO-HT0208-221299-204-b12 HT0208 Homo sapiens cDNA
minichromosome maintenance deficient (S. cerevisiae)
small inducible cytokine A5 (RANTES) 
Wiskott-Aldrich syndrome protein interacting protein 
purinergic receptor P2X, ligand-gated ion channel, 5
C11001758*:gi|12621132|ref|NP_075243.1| MEGF1 [Rattus
Hs.220251 ESTs 
Hs.189413 ESTs 

Hs.151973 hypothetical protein FLJ23511

Hs.9029 DKFZP434G032 protein 

Hs.272458 protein phosphatase 3 (formerly 2B), catalytic subuni 

Target Exon 
Hs.284135 hypothetical protein MGC3036 
Hs.121004 Homo sapiens mRNA for FLJ00111 protein, partial cds
Hs.204169 ESTs, Weakly similar to S65657 alpha-1C-adrenergic re

gb:RC0-HT0613-210300-032-f07 HT0613 Homo sapiens cDNA 
Target Exon 

Homo sapiens ELISC-1 mRNA, partial cds 
Homo sapiens mRNA; cDNA DKFZp564[X]472 (from clone DKF 
NM_020245*:Homo sapiens tubb ipt amily protein (T 
H2A histone family, member X 
Homo sapiens cervical cancer me 
DNA(cytosine-5-)-methyltransferase 1 



AI088489 Hs.83937 



Hs.82425 
Hs.37078 
Hs.37427 
Hs.170810 
Hs.132906 
Hs.132141 
Hs.122730 
Hs.38348 



hypothetical protein ' 
ESTs 

actin related protein 2/3 complex, subunit 5 (16 kD)
v-crk avian sarcoma virus CT10 oncogene homolog-like
erythrocyte membrane protein band 4.1 (elliptocytosis



ESTs, Moderately similar to KIAA1215 protein [H.sapie





AK00177S 


Hs.143954 


hypothetical protein FLJ10914




AI457996


Hs.132578 








Hs.15043 




407989 


AW135208


Hs.256092 


gb:yv27d04.s1 Soares fetal liver spleen 1NFLS Homo sa




AI870685


Hs.231022 












453718 


AL119317 


Hs.120360 


phospholipase A2, group VI (cytosolic, calcium-indepe








NM_007242:Homo sapiens DEAD/H (Asp-Glu-Ala-Asp/His) b 




AF161483 




HSPC134 protein 


419923 


AW081455 


Hs.120219 






AF078866 




Homo sapiens cDNA: FLJ22993 fis, clone KAT11914




AA831004


Hs.124874 










hypothetical protein from clones 23549 and 23762 


440065 




Hs.266331 


hypothetical protein MGC4595 










447983 


AW612726 


Hs.282113 


ESTs, Weakly similar to I38022 hypothetical protein [ 








hypothetical protein FLJ12876 


408182 


Yl'8418 5 '' 




gb:zf49g04.r1 Soares retina N2b4HR Homo sapiens cDNA 


432180 




Hs.272822 


RuvB (E coli homolog)-like 1


436005 




Hs.158126 


Homo sapiens cDNA FLJ13350 fis, clone OVARC1002143


414962 


AF273304 


Hs.235376 


XPMC2 protein 


444476 


AF020038 


Hs.11223 


isocitrate dehydrogenase 1 (NADP), soluble 


408175 


W29089 


Hs.19066 


hypothetical protein DKFZp66702416 






Hs.159914 


ESTs, Weakly similar to I78885 serine/threonine-speci


437277 


AA748016 


Hs.123370 


ESTs 


431445 


AA505135 


Hs.44037 


ESTs 


418927 


BE349635 


Hs.190284 


ESTs 


452446 


AA086123 


Hs.297856 


ESTs 


445380 


AI222019 


Hs.144838 


ESTs 



421174 
417247 



AI498421 Hs.243168 

AI564022 Hs.138207 

422585 NM_016186 Hs.118620

442522 AI087038 Hs.146592 

430684 AI808979 Hs.293193 

446442 BE221533 Hs.257858 

441410 AA932689 Hs.233304 

419485 AA489023 Hs.99807

449539 W80363 Hs.58446 

406663 U24683 Hs.302063 

423767 H18283 Hs.132753 

450937 R49131 Hs.26267 

430977 AA490069 Hs.306676 

455677 BE066061 Hs.8867 

436706 AA725808 Hs.194609 

459407 N92114 

444132 AK000452 Hs.10340 

437149 AI686651 Hs.202234 

418499 AI627392 Hs.302023 

411298 AW835858 

432571 AF151054 Hs.278429 

416295 AI064824 Hs.193385 

427485 AF039652 Hs.178655 

409857 AW501908 

433854 AA610649 Hs.333239 



443135 AI376331 



ESTs, Moderately similar to A46010 X-linked retinopat 
hypothetical protein MGC2722 
gb:yv63c01.s1 Soares fetal liver spleen 1NFLS Homo sa



ESTs 

immunoglobulin heavy constant 
F-box only protein 2 
ATP-dependant interferon response protein 
Homo sapiens cDNA FLJ14302 fis,

; inducer. 61 



gb:za22h11.r1 Soares fetal liver spleen 1NFLS Homo sa
hypothetical protein FLJ20445

E I A Hy Sim lar to ALU4_HUMAN ALU SUBFAMILY SB2 
hypothetical protein FKSG25 

gb:PMO-LT0017-031299-001-h07 LT0017 Homo sapiens cDNA
hepatocellular carcinoma-associated antigen 59 



436882 AW016722 

434502 AW974915 

435507 AI143579 

444896 AI201480 



445209 AW294230 

449193 AI637997 

447397 BE247676 

455037 BE144549 

453367 AW732847 

439317 AF086127 

424006 AF054815 



Hs.194976 

Hs.116550

Hs.26510 

Hs.144856 

Hs.6137 

Hs.14559 

Hs.301701 

Hs.80988 

Hs.195653



Hs.70573 
Hs.50600 
Hs.137548



gb:UI-HF-BR0p-ajp-c-12-0-U ! .r1 NIH_MGC_52 Homo sapien 
ESTs 

gb:MRO-HT0157-021299-004-d08 HT0157 Homo sapiens cDNA 
gb:EST31993 Embryo, 12 week I Homo sapiens cDNA 5' en 
C8001441*:gi|8923061|ref|NP_060114.1| hypothetical pr
ESTs 

hypothetical protein FLJ10637

hypothetical protein similar to swine acylneuraminate

seven transmembrane protein TM7SF3 

gb:nq63b04.s1 NCI_CGAP_Ov6 Homo sapiens cDNA clone si



vacuolar protein sorting 33B (yeast homolog)



hypothetical protein FLJ10540
Homo sapiens cDNA FLJ12073 fis, clone
collagen, type VI, alpha 3 
ESTs 

gb:MRWT0167-081199-001-a02 HT0167 Homo sapiens cDNA 
PKC-1-related HIT protein 

ESTs, Weakly similar to T47156 hypothetical protein D 
CD84 antigen (leukocyte antigen) 
NM_004520*:Homo sapiens kinesin heavy chain member 2
eukaryotic translation initiation factor 3, subunit 8 
gb:CM3-HT0183-181099-023-b05 HT0183 Homo sapiens cDNA 






Hs.101813 



436216 AA380887 Hs.5085 

418623 AW194757 Hs.266804 

447197 R36075 

430146 AW815330 

441841 AA971819 Hs.176083 

457677 AA628890 

421090 BE301870 

434407 AW815333 

453579 AI204463 Hs.61857 

427584 BE410293 Hs.179718 

452139 AA099969 Hs.16331
405510

440777 AA994020 Hs.128553

446424 AW134529 Hs.244647 

448004 AW451477 Hs.257456 

430610 AI821465 Hs.188810 

427080 AW068287 Hs.173466 

451693 BE220445 Hs.279635 

417558 AF045229 Hs.82280 

420344 BE463721 Hs.97101 

427735 AA916735 Hs.130610 
425423 NM_005897 Hs.157180

450S63 H43540 Hs.25292 



400247 

421116 T19132 Hs.101850 

426761 AI015709 Hs.172089 
405514 

412406 AW948172 

440226 AA873387 Hs.207330 

435625 H50654 Hs.113999 

AW005695 Hs.250897

D50915 Hs.38365 

AL049795 Hs.17987 

AW970612 Hs.172635 

X98411 Hs.121555 

AF151879 Hs.26706 

BE242274 Hs.1741 

U46258 Hs.339665 

NM_005982 Hs.54416 

AW900372 Hs.180793

S68616 Hs.170222 

AB018284 Hs.158688

D86980 Hs.79170 
AW364844 

AA194446 Hs.73451 

U37012 Hs.83727 
AA926944



dolichyl-phosphate mannosyltransferase polypeptide 1,



solute carrier family 9 (sodium/hydrogen exchanger), 
HSPC150 protein similar to ubiquitin-conjugating enzy 
gb:QVO-ST0215-060100-083-g01 ST0215 Homo sapiens cDNA 
C5000010':gi|10440464|dbi|BAB15765.1|(AK024475)FUO 
ESTs 

v-myb avian myeloblastosis viral oncogene homolog-lik 
Homo sapiens cDNA: FLJ21482 fis, clone COL05135



407758 
447276 
449938 



424148 
447519 
409361 



417086 
418181 
436910 
401008 
413245 
446820 
439279 
426116 
410098 



452461 
405417 



BE244334 Hs.75249 

AW295037 Hs.254986 

AI039473 Hs.130636

AA868729 Hs.144694 

BE326839 Hs.17433 

AI114875 Hs.78592 

AW404075 Hs.42785 

N80121 Hs.4983 



AK001407 Hs.88663 



435014 BE560898 Hs.10026 
403397 

436873 N23874 Hs.50477 

451386 AB029006 Hs.26334 
404914 

419839 U24577 



418978 T85295 



regulator of G-protein signalling 10 

putative G protein-coupled receptor 

splicing factor proline/glutamine rich (polypyrimidin

intracisternal A particle-promoted polypeptide 



Hs.93304 
Hs.152477 
Hs.268606 
446636 AC002563 Hs.15767 
454639 AW811633 
434522 AF189259 Hs.283081 
458236 AW297043 Hs.255604 
441043 AA913422 Hs.192104 
422838 AA524065 Hs.93670 



ESTs 
Target Exon 
Eos Control 

retinol-binding protein 1, cellular 

Homo sapiens mRNA; cDNA DKFZp586l2022 (from clone DKF 
ENSP00000241075:TRRAP PROTEIN. 
gb:RC0-MT0013-280300-021-b06 MT0013 Homo sapiens cDNA 
ESTs 

TRK-fused gene 

KIAA0125 gene product 

hypothetical protein MGC1203 

Homo sapiens cDNA: FLJ21367 fis, clone COL03051



ESTs 



ESTs, Weakly similar to S65657 alpha-1C-adrenergic re 
solute carrier family 9 (sodium/hydrogen exchanger), 
KIAA0741 gene product 
KIAA0227 protein 

gb:QV3-DT0044-221299-04^c03 DT0044 Homo sapiens cDNA 
ESTs, Weakly similar to S55024 nebulin, skeletal musc
cleavage and polyadenylation specific factor 1, 160kD
gb:om68g01.s1 NCI_CGAP_GC4 Homo sapiens cDNA clone 3'
Target Exon 

ADP-ribosylation factor-like 6 interacting protein 

ESTs 

ESTs 



transcription factor 
hypothetical protein FLJ10545
CX001144*:gi|7242973|dbj|BAA92547.1| (AB037730) KIAA1
gb:nc74e05.s1 NCI_CGAP_Pr2 Homo sapiens cDNA clone, m 
mitochondrial ribosomal protein L17 
ESTs 

Target Exon 

RAB27A, member RAS oncogene family 
spastic paraplegia 4 (autosomal dominant; spastin)
NM_004046*:Homo sapiens ATP synthase, H+ transporting 
phospholipase A2, group VII (platelet-activating fact 



gb:RC2-ST0158-091099-011-d05 ST0158 Homo sapiens cDNA
gamma-aminobutyric acid (GABA) receptor, theta 
ESTs, Weakly similar to A47234 homeobox protein H6 [H 



Homo sapiens chromosome 19, cosmid R2



455327 AW896238

420982 AW5761S0 

424563 AA446932

417125 AW181998 

453902 BE502341 

446842 AI343510 

454128 AL031259 

427011 BE302729 

450872 AI742594 

451512 AI80C236 



Hs.334805 
Hs.100729 
Hs.151428
422742 AA316117 Hs.337128

440031 BE045970 

429389 AA454779 

449656 AA002008 

444310 AI140432 



431150 
443217 
413405 



Hs.201441 
HS.18S633 
Hs.175936 
Hs.170980 
Hs.133494 



452061 
446416 
415023 

432566 
420252 
435403 



NM_001545 Hs.9078

AW022253 Hs.215976 

BE327277 Hs.161145 

NM_014757 Hs.76986 

AF027866 Hs.138202 

AW363284 Hs.32553 

AL039478 Hs.304447 

AK000150 Hs.78185 

AI127076 Hs.334473 

AB006624 Hs.14912 

AU076509 Hs.2934 

AA773838 Hs.5353 

AW815036 Hs.151251 

U34879 Hs.85279 

X90725 Hs.77597 

AI753247 Hs.29843 

AF005418 Hs.150595 
AA180467 

AB033015 Hs.23941 
AW947507 

AI023133 Hs.10739 
AW378445 Hs.123080 
J05581 Hs.89603 
AA431615 Hs.130722 
243011 Hs.21169 
AI862604 Hs.211884 
AW195515 Hs.253177 



Homo sapiens cDNA FLJ14604 fis, clone NT2RP1000363, m
KIAA0S92 protein 
ret finger protein 2 

CUG triplet repeat, RNA-binding protein 1



programmed cell death 2 
neighbor of COX4 

gb:wg55h05.x1 Soares_NSF_F8_9W_OT_PA_P_S1 Homo sapien
ESTs 

gb:qt84a01.x1 NCI_CGAP_Co14 Homo sapiens cDNA clone I 
ESTs, Weakly similar to I38022 hypothetical protein [
ESTs, Weakly similar to CHD4_HUMAN CHROMODOMAIN HELIC
NM_020999*:Homo sapiens neurogenin 3 (NEUROG3), mRNA.
gb:PM1-BT0585-110200-003-c11 BT0585 Homo sapiens cDNA
Target Exon 

thymidine kinase 1, soluble 

ESTs 

ESTs 

Homo sapiens cDNA FLJ11079 fis, clone PLACE1005111



KIAA0948 protein 

Homo sapiens clone TCCCIA00164 mRNA sequence 
gb:yc16e01.s1 Stratagene lung (937210) Homo sapiens 
immature colon carcinoma transcript 1 



412647 AW975090 

437234 AI472213 

453366 AW958751 

425803 AI825204 

447383 N24231 

423864 BE275607 I 

450799 AW407504 

409592 BE280951 ! 

453945 NM_005171

425196 AL037915 

439778 AL109729 

417662 R07478 

438087 AI863770 

452724 R84810 

448633 AA311426 

433154 AA578526 

440094 AI651558 

409253 H91200 

431270 BE046609 

407629 AA649242 



Hs.247711 
Hs.28921 
Hs.211408 



ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE 
MAX-like bHLHZIP protein 
hypothetical protein DKFZp56401278 



caspase 10, apoptosis-related cysteine protease
ESTs 

hydroxysteroid (17-beta) dehydrogenase 1 
polo (Drosophila)-like kinase

Homo sapiens cDNA FLJ13103 fis, clone NT2RP3002304
cytochrome P450, subfamily XXVIA, polypeptide 1 
gb:zp14g08.s1 Stratagene fetal retina 937202 Homo sap 
KIAA1189 protein 

gb:RCO-MT0002-140300-011-a12 MT0002 Homo sapiens cDNA 
ESTs 

Homo sapiens unknown protein mRNA, partial cds 



AV658299 Hs.163959

AA932146 Hs.133494 

AA742222 Hs.120634

AW439330 Hs.256889 

AW270404 Hs.193161 

AA779987 Hs.269658 



ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE 
ESTs 

succinate dehydrogenase complex, subunit A, flavoprot 
ESTs 

Homo sapiens clone TCCCIA00164 mRNA sequence
ESTs 

ESTs, Weakly similar to 2109260A B cell growth factor 



gb:EST380398 MAGE resequences, MAGJ Homo sapiens cDNA 
ESTs 

solute carrier family 6 (neurotransmitter transporter 
Eos Control 

gb:EST387196 MAGE resequences, MAGN Homo sapiens cDNA 
hypothetical protein FLJ20557
zinc finger protein 
ESTs 

gb:yx22a11.r1 Soares melanocyte 2NbHM Homo sapiens cD 
chaperonin containing TCP1, subunit 3 (gamma)
b I II HF-I MO-ad i 1 : U 1 NIH_MGC_38 Homo sapiens 



Hs.268845 
Hs.190422 
Hs.30464 
Hs.21635 
Hs.160994 
Hs.270372
Hs.52002 

Hs.62632 



ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE 
CD5 antigen-like (scavenger receptor cysteine rich fa
gb:hn41e11.x1 NCI_CGAP_RDF2 Homo sapiens cDNA clone 3
408492
428894 
419102 
429067 
422684 
424701 
412513 
443599 
400715 
446514 
413992 
402442 
419497 
439575 
407027 



BE243084 Hs.12719 

AA398193 Hs.97584

AW811978 Hs.254037
AA381437

AA555217 Hs.183684 

AA437066 Hs.271736

AA234098 Hs.42424 

AA446019 Hs.104967

BE561617 Hs.119192 

NM_005923 Hs.151988 

AA322599 Hs.5163

AI079559 Hs.134125 



NM_006410 Hs.90753

W79259 

U63312 



H2A histone family, member Z
mitogen-activated protein kinase kinase kinase 5
ESTs, Weakly similar to AF151840 1 CGI-82 protein [H. 
ESTs 

ENSP00000237081*:KIAA1217 PROTEIN (FRAGMENT). 



Target Exon 

Tat-interacting protein (30kD) 

gb:zd75c06.r1 Soares_fetal_heart_NbHH19W Homo sapiens 
gb:Human cosmid LL12NC01-242E1, ETV6 gene, exons 1B a



412647 
413258 
413314 
413500 
414076 
416450 
417247 

417881 
418347 
422429 
423573 



AA047854 AA057506 AA053841 

AA074897 AA113914 AA064871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA078803 AA148628 AA122204
AA074159 AA126185 AA079117 AA127089 AA070912 AA079280 AA131372 AA078833 AA071087 AA076131 AA071047 AA079401 AA083070
AA102076 AA115163 AA074198 AA134725 AA113889 AA121103 AA075041 AA065148 AA071310 AA101144 AA079659 AA078931 AA079209 
AA070928 AA068994 AA069917 AA076187 AA069053 AA131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070053 
AA126283 AA126078 AA075895 AA07S208 AA074583 A4071086 AA079623 AA070627 AA078802 AA07GG22 AA0G5051 AA079143 AA071110 
AA079434 AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA085118 AA079450 AA085044 AA120938 AA079200
AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027 AA115929 
AW364844 AW364847 AW937534 AW937593 AW937659 
AW501908 AW502959 AW502540



410146 1178974J AW592655 R05927 R0G916 



1317604.1 
1355998J 
1360034 1 
1373933J 
141490.1 



174149.1 
216469 1 
229714.1 



430151 
430709 
430848 
431150 
431270 



439575 
444314 
447197 
447383 



AW835858AW835836 AW835823 AW835834 AW835831 AW835832 AV 



385744.1 
385798 1 
429182.1 



711623 1 
71990 T 
847242.1 
849959 1 
1130264 1 
1227728 1 
1233905.1 



AW948172 AW948178 AW948169 AW948176 AW948191 AVV948192 AW948186 AW948184 AW948187 AW948188 AW948189 AW948181 
AW948177 AW948171 AW948183 AW948173 
AW975090 N44182 



AA467736 AA135210 AW968166 AA467804 
AA180467 AA449184 AA464831 AA505048
N58024 T58194 T11693 N64222 T05848
Z43995 R12357 R34740 

AI879117 AW161351 Z45755 BE003661 AA206949 AA476541 

AA216419 F03238 AA229517

AA310527 AW962295 Z44865 H06641 

AA328504 AA327783 AW962370 

AA381437 AA628833 AW407275

AA425488 AA496895 F23221

AW815330 AW968170 AI732687 AI732725 AA468343 AA467817 AW063961 

AW968203 AI732757 AA470353 AA468025 AA468479 AI734151 

R34356 AW969880 AA484613 

AW021726 AA487752 AA488085

T63857 AW971220 AA493469 T63699 

BE046609 BE045118 AA501504 

AA534489 AW970240 AW970323 

AA527223 AA643443 AA650619 AA643463 AA643453 AA643439 AA643438 AW802954 AW821595 AW821594 AA643431 M
AW827513 



AA632644 AA635376 AA664188

AA92S944 AA767974 AA737237 

A1147061 AA743380 AA765223 AW976398 A 803927 

AF086174 W31796 W04694 

W79259 AF086396 W73927

AI140497 AW749625 AW749626 AW749644 

R36075 AI366546 R36167

N24231 BE617964 N36313 

AW407504 W31274 AI738877

AI742594 AI761397 R31198 AI819332 R31257 

AW812535 AW812536 AW390307

AW811633 AW811652 AW811893

AW819629 AW854320



455037 1249783J BE144549 AW851677 AW851643 AW851711 AW851719 
455096 1253078J AW855718 AW855740 AW855748 

455161 1256167J BE145900 AW859906 BE145895 BE145831 BE145914 BE145820 BE145817 BE145890 BE145908 

455410 1288380J AW936678 AW936637 AW936682 AW936685 AW936817 AW93681 1 AW936762 AW936653 AW936815 AW936812 AW936683 AW936822 

AW935823 AW936821 AW936732 AW936730 AW93S781 
455447 1292444 J AW947507 AW947509 AW947791 BE008335 
455855 1375834J BE147440 BE147708 BE147563BE147456 
455994 1398737_1 BE17919C BE179206 BE179182 BE179185 BE179186 BE179194 
457892 432926.1 M744389 M744270 AA744284 AA744299 AA745380 AA744337 AA846905AA847698 
458080 471050J BE142728 M834047 AW937124 

4581 15 47705_29 BE091587 BE091730 BE091577 BE091655 BE091729 BE091640 BE091578 BE091727 BE091803 BE091660 BE091721 BE088255 BE076582 

AW992312 BE008791 BE082385 BE083504 BE08345G AW9979G7 AW997991 BE166595 AW843686 AW844334 BE079091 AW603391 BE081427 
BE079514 BE184580BE009962BE008722AW579912AW860561AW890184AW795276AW860410AW860411AW610330AW860564AW860578 
AW862519 BE073924 BE008687 BE073857 BE073921 AW2741 06 BE011060 AW268120 AI335067 AW793748 AW997736 BE0801 17 AW867987 
AI547161 AW844767AW393596AW579444BE083334AI547158AW799863M585179AW992792AW882215BE011913AW997894AI547159 
AW992772 AW581 178 AA092247 AW843916 BE0791 90 AW878478 BE083648 BE066454 AI469937 AW393594 AW579899 AW939276 BE173265 
AW878631 AW878638 AW992802 BE07991 3 AA633638 AW369008 BE076590 AW843456 AW992791 BE173247 AW843921 AW843333 AW878334 
BE090236 BE078240 BE066325 AW503276 BE16931 0 AW817299 BE091641 BE0001SO AW898164 AW994624 AW999391 AW862797 AW899438 
AW750667AW939685AW940017A1200402 BE167391 AW939318AW799837AW939953AW939681 BE078188AA449059BE076595AW297451 
BE076544AW803372BE081223AW939237BE084239AI174202BE077804BE078028 M512912AI124808AI147524AI884882AW939962 
AW939254 BE171687 AW998400 AW998348 AW998353 AW998288 AW998303 AW998302 AW998299 M502748 BE077882 AW998295 BE090238 
AW604665 AI423051 BE093093 AW579913 AW606384 BE006143 BE170415 AW998054 BE084608 AW992779 BE088111 AW610555 AW844153 
AW939423 BE085404 AW579905 BE080994 AW468482 AW876865 BE091581 BE080940 AI811189 AW868088 AW893127 BE080064 BE184254 
AW998350AW884228AW992315AW992364BE091569AW750680BE066386AA578227 BE091735AW938830BE078710BE087253BE084182 
AW800859AW801017AW581371 BE0883MAW995341BE090233AW663786BE091739BE0B0113AW578162AW799799AW992366AW994673 
BE185170 AW792778 AW663225 BE075590 BE0801 1 1 M582934 BE090227 AI475441 BE085684 BE090223 AW581366 BE010705 AW898740 
BE088170 AW992375 BE077833 BE083557 BE010683 AW998450 AW803434 BE083280 AW892655 M506666 BE088288 BE005359 BE173856 
BE001319 AA610814 BE011965 BE005855 BE005869 M973929 BE185729 AW884298 BE185743 BE001342 BE005876 AI002988 AW799056 
BE085411 AW841264 AW6O3110 BE006134 BE006139 BE006148 BE0OS147 BE006155 AA578273 BE008706 BE185440 AW946428 AA501940 
AW603114 BE085757 AI460195 AA491145 AA772914 AA632730 AA508388 BE080196 BE185442 BE093446 AW946433 BE080119 BE001352 
"""™"03 BE006145 BE085405 BE008580 BE081423 AW581373 AW607246 BE094328 BE001336 AW868170 BE0741 19 AW884149 BE091734 

7BE185806 BE080193 BE185858AA47G398 BE081040 BE074724 BE085426BE074725AW998297 AW867606 BE185798 

39 BE081672 BE088178 AA.610264 BE0881 18 AA284217 AW578085 BE074518 BE001359 BE001328 AW820227 AW868196 
26 BE012037 BE079061 BE005870 AW867804 AW878433 BE008751 BE005875 BE008748 BE093440 BE183050 
AA506676 BE001329 BE0088D3 BE080123 BE003041 AW994G88 AW994G75 AW994760 AW994691 AW994681 BE080189BE080112AW868173 
AI768300 AW833094 AW868179 BE080201 AW665449 BE067473 BE008746 BE184053 BE076437 BE076376 BE076402 AA480395 BE082436 
BE1841 34 BE185224 BE085428 BE008682 AW868181 AW99B358 AW866102 BE083507 BE077974 BE008835 BE093439 BE076108 AI416987 
BE008788 BE069909 BE093441 BE185502 BE183053 AW750669 BE01 1812 BE008672 BE081684 BE093445 AW868184 BE081839 BE008797 
AW842067 BE008678 BE008670 BE551820 AW838974 BE081637 BE046696 BE008673 BE010328 BE083250 BE089614 BE082052 BE081424 
BE001678 AW581368 AA503194 AW883721 AW883522 BE085564 AW868717 BE171078 BE078249 BE078194 AA565255 BE083486 AW842081 
AW842080 AW868204 BE008717 AA4843G9 A1831719 AW997365 BE079327 AA503956 BE091999 AW793852 BE080251 BE078086 BE092515 
BE170364 AW866193 AW067958 BE008042 BE00B761 BE081681 BE081671 AW857400 BE082003 BE082253 BE081439 BE081486 BE081692 
AW606020 AA501778 AW996417 BE045756 BE338394 AA491 068 AW893099 AW578695 BE150440 BE006150 BE0B4684 AW992796 BE086891 
AW866792 AW753605 BE082045 BE081 106 BE008373 BE075399 AW996628 AW578707 BE084309 AW753G04 BE185916 AW842220 BE185222 
BE006152 BE008795 AW578706 BE080256 BE183984 AI934532 AA449548 AW578699 BE150514 AW883580 AA493568 BE085748 AW753601 
BE150562AW882677 BE091797AW899123 BE081679BE080121 AW606787 AW603410 BE001317 AW905799 BE150513 BE092206 AW996343 
BE086922 BE008806 AW844759 AW606009 BE150487 AW750728 BE150491 BE150515 AW60601O BE150508 BE008718 AW578702 BE150509 
AA436751 AW883918 BE183883 AW753607 BE038669 BE150446 AA533458 BE079219 AW838884 AW063806 AW063837 AW878479 BE078815 
BE008802 AW992789 BE007925 AW8022O4 BE011825 BE092130 BE184059 BE079087 BE150568 BE185497 BE078808 AW883761 AW8422g6 
BE161523 AA484796 AA480390 AW994667 B E073205 AVV60731 6 BE083201 AWB02265 AW578700 BE078715 AW8604O3 AW897456 AW996558 
AW890602AW860413AA425412BE0083G4BE150438AW602606AI435236M574285AI823745AA501773AI002987AW832749BE185491 
AW9964S9 BE001442 AW946425 BE001 586 AI524864 BE085558 AW867549 AW504038 BE079832 AI752160 AW999398 AW883904 AW882376 
AW467098 BE0801 16 AW883984 AW883995 AA424095 BE074091 AW996348 AW860625 AW860633 AW946513 BE083485 AW860412 AW602207 
BE075407 AW838972 AW607023 AW602201 AW799772 AW862452 AW862451 BE505041 BE161537 AW602206 AW860404 AW860555 BE007843 
AW860632AW862457AW998019AW860405 BE092062 AW603S21 BE183386 AW868194 BE075664 BE078184 AI541202 AI204949 BE092451 
AW60311 1 M484587 AA484402 AW998675 AW896064 BE069923 AW867965 BE069919 BE092069 AA8Q7842 AW605500 AW605501 BE085409 
AA506738 BE080080 AW749523 AA4931 34 AW3701 37 AA491 844 AA504425 AW605473 BE092455 BE010682 AW602608 DE079093 AA48491 1 
BE010942 AI205087 AW794933 BE081848 BE01 1792 AW799897 BE174618 AW83884B AW821741 AW842724 BE008764 BE183962 AA501765 
BE092513 BE183342 AW799806 AA442935 BE092268 AA580022 AW843219 BE093308 BE092275 BE087111 BE183392 AW842678 BE185597 
BE183395 BE185278BE082343AW946219BE079199BE092272AA586687AW946109AW946175AW946184BE008365BE078172BE085673 
BE076240 BE083194 BE010604 BE079196 AW878636 AW799803 BE001348 BE077683 BE081835 AW992309 BE081012 BE078106 AW881899 
BE008407 BE008410 AW842670 AW603738 BE088661 AA484571 A1799184 BE174545 BE001405 AA436967 AW995884 AW995785 AW896598 
AW883999 BE075967 AA503938 BE092281 BE092279 BE087460 BE087569 BE081542 AI375386 AW843886 BE080115 BE171517 BE079898 
AW882382 BE001450 BE07S430 AW579377 BE008412 BE008790 BE182296 BE182297 BE078805 AW899132 BE078810 BE185867 BE087790 
AA484928 AW578985 BE008400 BE074080 AW605101 BE0761 10 AW799904 A1205094 BE008370 BE182345 BE182373 BE008401 AA984441 
BE182362 BE182372 BE008414 BE078186 BE009165BE010266 BE009162 BE009167 BE011006 BE073335 BE182370 AW750556 BE182347 
BE011000 AA484576 BE092982 BE183897 BE092973 AA573037 AW882317 BE081832 AA478471 AA551613 BE182366 AW838886 AW026827 
BE008413AW896605AA503558AA776622BE084825 M502971 BE081842BE010B28AW802218 AI888924AW867986AW881775BEO79220 
AI241060 AW802041 AW802005 BE011244 BE087051 AA984758 AA452997 AW992786 AW797500 BE077829 BE008402 AW881760 BE093516 
AW802084 AW369007 BE165123 BE937775 AWB01018 BE093443 AW867978 AW843271 BE173850 V997359 BE01 20 AW992516 AW843908 
BE083200 BE164675 BE074340 AWSfO. 9 BE TV 2 bU03456 AW946438 BE066570 BE093547 AA508107 AW867992 BE076239 BE183881 
AW617422 BE087717 AW899147 BE010608 AW992295 AA436737 BE075412 BE093011 AW581656 BE089529 BE008756 BE1 50494 AW903020 
AW883102 BE076370 BE000625 BE1 66095 AW867979 BE182159 AW577501 AW577488 AW577491 BE010637 BE069910 BE093295 BE005243 
AI620783 AW992550 AW890590 AW577496 AW577504 AW842725 AW842666 AW864691 AW997722 AW842662 BE009233 BE087809 BE083196 
AA287768 AW939691 AW815631 BE001453 AW841903 BE077613 AW577500 BE081479 AW992558 BE01 1065 AW843187 AW867990 AW898296 
BE074339 AA501697 AW749997 BE076249 AW867991 BE085718 AW994607 BE010678 BE075436 AW386825 AA484467 BE081 144 AW577492 
AW997932 AW899089 AW842705 AW890727 AW843175 BE075428 AW843155 AW842679 AW842708 BE069915 AW842721 AW438792 AI251478 
BE069911 BE067054 BE079889 BE075453 BE069927 AA491920 BE170606 BE182305 BE080052 AW843406 BE011075 BE075969 AA525261 
AW391518BED79202AA658195BE076138AW799901 AA493859 AW992510BE011810 AA508724 BE075488 BE075661 AA258982BE078726 
BE069914AA213698BE075376AA484 )0 !09 PFC'77 72/ <\503571 AW884724 AW880124 AW868454AW577502 AW577489 AW799829 
AW992545 BE075806 AW994606 BE350368 AA557836 EE077682 AW844660 AW883431 BE085872 AW838887 AW843890 AW868404 AA578417 
BE074115AW842680AW277193 AW8^ 728 / A'60! 111 Frn93940AW890710BE085560AW868180AW896778BE069925BE011054BE075965 
AW842668 AW868310 BE01 1071 BE075429 AW843152 AW905848 BE075397 AW842762 BE07E402 BE077950 AW837810 BE079998 BE183965 
BE075431 AW815917 AW998359 AW799883 AW603782 AA557480 AW841444 BE075915 AAS48034 AW843393 AW391559 BE083265 AW939721 
AW80C857 AW079109 AW3E 01 AA<*5526 AW799848 BE182463 AA77611 1 AW799915 BE008399 BE075377 AW577809 BE010272 

BE182443 BE010295 AW577806 BE008415 BE184035 BE076597 A1817413 AW795053 AW896761 AW841433 BE182458 AW603796 AW842676 
BE085455 AW884879 BE075414 AW836636 AW878273 AW998088 AW799778 AW899125 BE082247 M774870 BE001401 BE001485 AW817297 
A'JV7Q6670 AW394053 BE001396 AW394070 AW603797 BE182447 AW582483 AW843283 AW749520 AW867449 AW899274 AW578232 
AW603765 AW843919 AW578235 BE184139 AW997742 BE1 83923 BE084210 AW802033 AW748724 AW939018 AW997459 AW842742 AA213697 
BE182303 BE011078 AWG07702 AW882623 BE080016 AW580994 BE076531 AA443462 AW607407 AW883382 AW939399 AW605627 AW844615 
AW939724 AW815931 AW883765 AA287421 BE075626 AW946171 AW841445 AW797994 AW815957 AA683300 AW369004 BE075368 BE081560 
AW605626 AW939398 M507280 AA506317 AW84 1230 AW99251 9 M465332 AA425246 BE090234 BE090236 AA483259 AA451961 AA535566 
AA506406 AA888571 AA503568 AA507130 AA532944 AA501672 BE168634 AA492022 AA507662 AW842286 AA494226 AA776038 AA442419 
AW579900 BE171816 AA863055 AA491915 AA447490 AA461423 AA434543 M243279 AW997466 AW603740 BE000295 AA658571 



TABLE 15C 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 
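The nucleotide-position entries in this table are comma-separated start-end ranges, one range per predicted exon. As an illustration only, the following minimal Python sketch shows one way such an entry could be read. The function names `parse_nt_position` and `total_exon_length` are hypothetical, and the sketch assumes that both coordinates of a range are inclusive, which the legend does not state.

```python
def parse_nt_position(field):
    """Split an entry such as '6126-6265,6416-6689' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma, as seen in some wrapped rows
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, ASSUMING inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("81421-81551,82364-82512,82862-82938")
    print(exons)
    print(total_exon_length(exons))
```

Keeping the ranges as integer pairs leaves the strand column untouched, so a consumer can decide separately how to orient exons predicted on the minus strand.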









Nt_position 




8118885 




80151-80297 


400736 




Plus 


143447-143851 


401008 






81421-81551,82364-82512,82862-82938 


401069 


3927852 


Minus 


45682-45831 


401375 






6121-6766 


401405 




Minus 


69276-69452,69548-69958 
62028-62608 


401539 
401557 


8072433 
8099866 




112785-112924 


401654 


9097132 


Minus 


64695-64797 


401940 


3738108 


Plus 


153460-153592 


402025 


7547159 




173835-173998 


402442 


Plus 


141714-141842,142010-142122 


402682 


8138477 




147522-147795 


402796 
402967 


3646083 
5360987 


Minus 


6126-6265,6416-6689 
33518-34546 


403038 


7717439 




290021-290284 


403055 


8743904 




109532-110225 


403310 


8139936 




183883-184026 


403397 


9438368 




84481-84655 


403839 


4176355 


Plus 


21201-22223 


404110 


9212839 




18344-18510 


404495 


8151634 




59449-60477 


404534 


8247909 


Minus 


147853-148086 


404630 


9796665 




74495-74715 


404649 
404680 


9796926 
9797204 


Minus 


100027-100399 

159810-159979,160213-160321,161023-161304,162862-163140,164490-164644,166404-166530, 


404914 


7341760 




92603-92827 


405417 


4753290 




50704-51499 


405454 


7656675 




133807-134053 


405510 


7630909 




101028-101174 


405514 


9454624 


Plus 


35953-36151 


405536 


9795661 




164091-164162,164397-164516,166720-166790,167785-167935 


406016 


8272661 




41341-41940 


406410 


9256394 




115806-116104 


406464 


9789674 




72161-72562 


406562 


7711584 


Plus 


37316-37426 



TABLE 16A: 200 GENES DOWN-REGULATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT CERVIX 

Table 16A shows 200 genes down-regulated in cervical i 
were switched, the median value amc 
normal cervix). 

Pkey: Unique Eos probeset identifier number 



ExAccn 



UnigeneID Unigene Title 



453596 AA441838 Hs.62905 hypothetical protein FLJ14834 

443912 R37257 Hs.184780 ESTs 

420923 AF097021 Hs.273321 differentially expressed in hematopoietic lineages 

414422 AA147224 Hs.337232 Homeobox A13 

420058 AK001423 Hs.94694 Homo sapiens cDNA FLJ10561 fis, clone NT2RP2002672 

412639 AW961284 Hs.296235 ESTs 
AA296520 Hs.89546 
AA905097 Hs.85050 
AI446543 Hs.95511 
AW964490 Hs.32241 
Hs.23804 



424765 
452106 
428780 
431706 



443790 
448944 
401486 
417511 



NM_000055 
AW294092 
AA428211 
AI141031 

AI816086 
AW973708 
NM_004673 
NM_003500 



Hs.21594 



AL049176 
AA460421 
L27943 



442748 
415672 
414175 



Hs.28 

Hs.21342 

Hs.50636 

Hs.296341 

Hs.201925 

Hs.241519 

Hs.9795 

Hs.22599 

Hs.82223 

Hs.30875 

Hs.72924 

Hs.42945 

Hs.21380 

Hs.244569 

Hs.1408 

Hs.30098 

Hs.809 

Hs.92448 

Hs.135787 

Hs.193579 

Hs.103849 

Hs.80828 

Hs.151407 



ESTs 

ESTs, Weakly similar to S6 
ESTs, Weakly similar to PN0099 son3 protein [H.sapiens] 
butyrylcholinesterase 
hypothetical protein MGC15754 
hypothetical protein FLJ14033 similar to hypoxia induci 
ESTs 



acyl-Coenzyme A oxidase 2, branched chain 
atrophin-1 interacting protein 1; activin receptor 
C4000647*:gi|4758508|ref|NP_004253.1| airway trypsin-li 
chordin-like 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586P1124 (from clone DKFZp 




Hs.12369 ESTs 



hypothetical protein DKFZp761D112 

keratin 1 (epidermolytic hyperkeratosis) 

cartilage intermediate layer protein, nucleotide pyroph 

glycoprotein M6A 

ESTs 

chromosome 11 open reading frame 8 
EphA3 

Homo sapiens clone 1 thrombospondin mRNA, complete cds 
MAGE-like2 



titin immunoglobulin domain protein (myotilin) 
general transcription factor IIIA 
Homo sapiens cDNA FLJ11550 fis, clone HEMBA1002970 
serum/glucocorticoid regulated kinase 

gene product 

assembly protein Mike 2 







404097 
422546 
445872 



452073 
430032 
408767 



AB007969 
AI681573 
AI761902 



Hs.99597 ESTs 
Hs.234863 
Hs.7413 
Hs.82098 
Hs.99610 



ESTs 

multiple PDZ domain protein 
KIAA1474 protein 

37 kDa leucine-rich repeat (LRR) protein 
C5000242*:gi|9369379|gb|AAF8712ai|AC005434_24(AC00643 
KIAA05C0 protein 

Homo sapiens cDNA FLJ11997 fis, clone HEMBB100145B 



cDNAFU12082fis,clc 



AW450866 

AW392022 

U94333 

D13168 

AK000776 

AA489732 

X04430 



Hs.31412 Homo sapiens cDNA FLJ11422 fis, clone HEMBA1001008 

— ESTs 

KIAA0582 protein 

complement component C1q receptor 
endothelin receptor type B 

Homo sapiens cDNA FLJ20769 fis, clone COL06574 



Hs.154918 
Hs.93913 
Hs.18676 

Hs.210297 
Hs.50724 
Hs.59297 
Hs.243121 



ESTs 

interleukin 6 (interferon, beta 2) 
sprouty (Drosophila) homolog 2 
Target Exon 

Homo sapiens cDNA FLJ10266 fis, clone HEMBB1001024 
Homo sapiens cDNA FLJ10934 fis, clone OVARC1000640 
ESTs 

G protein-coupled receptor 22 
ESTs 

C8001409*:gi|7441226|pirl|S31212 collagen alpha 1(XW) 
AW469413 

AW780192 

R23324 

AI220122 

AW898161 



Hs.18912 ESTs 

Hs.122540 tetraspan 2 

Hs.110950 Rag C protein 

Hs.151145 ESTs 



430376 
411037 
442803 
414831 



401270 
419447 
414807 
427019 
434469 
444618 
413947 
415434 






M31158 

AA345824 
AL049240 
W79408 



AA001732 

AA634806 

AV653785 

W52990 

AW163045 

BE184348 



Hs.53112 
Hs.202095 
Hs.12532 



Hs.173334 
Hs.22860 
Hs.79334 



DnaJ (Hsp40) homolog, subfamily B, member 4 
hypothetical protein MGC2780 

ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX 



407945 X6S208 

447499 AW262580 

430686 NM_001942 

409882 AJ243191 

419047 AW952771 

414272 AI651603 

443808 AW377736 

426883 H21620 

410659 AI080175 

431292 AA370141 

432181 AA527650 

422890 Z43784 
AA034413 



chromosome 1 open reading frame 21 



se 15-(NAD) 

hypothetical protein FLJ10970 
gb:ab28c02.r1 Stratagene lung (937210) Homo sapiens cDN 
ELL-RELATED RNA POLYMERASE II, ELONGATION FACTOR 
ESTs 

nuclear factor, interleukin 3 regulated 
gb:CM0-HT0675-010500-3S5-e11 HT0676 Homo sapiens cDNA, 
ATPase, Cu transporting, alpha polypeptide (Menkes synd 
protocadherin beta 16 
desmoglein 1 

heat shock 27kD protein family, member 7 (cardiovascula 



400878 
401103 
436670 



437140 
418421 
443476 

443567 
451879 
421013 



AI804054 
AF157116 
NM_007250 



Hs.201536 
Hs.232165 
Hs.254660 
Hs.112885 
Hs.22350 
Hs.320861 
Hs.283689 
Hs.85050 



spinal cord-derived growth factor-B 
hypothetical protein LOC56757 
Kruppel-like factor 8 
activator of CREM in testis 



AI468574 
AA453734 
AI288430 
AL079440 



414681 
400802 
430015 
451978 
449088 

425113 AI936992 



Hs.1345 

Hs.27197 

Hs.171965 

Hs.10198 

Hs.64004 

Hs.74002 

Hs.112157 
Hs.27371 
Hs.196556 



gb:yb52f1 1.y5 Stratagene ovary (937217) i 
mutated in colorectal cancers 
SUMO-1-specific protease 






407118 
423587 
443178 
430694 
423073 
437950 
419368 
447335 



Homo sapiens mRNA; cDNA DKFZp566J123 (from clone DKFZp5 
ESTs 

pleckstrin and Sec7 domain protein 
AI124553 Hs.48965 Homo sapiens cDNA: FLJ21693 fis, clone COL09609 
BE262895 Hs.276916 nuclear receptor subfamily 1, group D, member 1 
NA Target Exon 

AW904361 Hs.131191 ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQ 
AW962712 Hs.126712 ESTs, Weakly similar to AF191020 1 E2IG5 [H. sapiens] 
AA156790 Hs.262036 ESTs, Weakly similar to Z223_HUMAN ZINC FINGER PROTEIN 
Hs.284256 hypothetical protein FLJ14033 similar to hypoxia induci 
Hs.47312 ESTs 

similar to H2BH_HUMAN HISTONE H2B H [H.sap 



AA328074 
AI631241 
AA310624 
BE252922 
U79244 



Hs.30936 
Hs.123119 

Hs.112542 ESTs 

Hs.209464 KIAA1604 protein 

Hs.286192 hypothetical protein FLJ20940 

AI793124 Hs.144479 ESTs 

AI092790 Hs.334703 hypothetical protein FLJ14529 

Z19002 Hs.37096 zinc finger protein 145 (Kruppel-like, expressed in pro 

AW088826 Hs.117176 poly(A)-binding protein, nuclear 1 

NM_006691 Hs.17917 extracellular link domain-containing 1 
404195 
428819 
425198 
420833 
413156 
413607 
443960 
428790 



R47948 
AA127133 
T64741 
AI093577 
AF023456 
AA205273 
AA531287 
429303 AW137635 
439734 AC005013 
433546 AI075877 
430317 AB020645 
425130 AA448208 
444195 AB002351 
409007 AL122107 
453773 AL133761 
442974 AI025670 
446936 H10207 



Hs.222 
Hs.25329 
Hs.105964 

Hs.193914 
Hs.128003 
Hs.188732 

Hs.255416 

Hs.193558 

Hs.177011 

Hs.105805 

Hs.44238 

Hs.149 

Hs.125461 

Hs.239189 

Hs.99163 



Target Exon 

NM_015718*:Homo sapiens NADPH oxidase 3 (NOX3), mRNA. V 
KIAA0575 gene product 
hypothetical protein FLJ21213 
ESTs 

gb:zl87e03.r1 Stratagene colon (937204) Homo sapiens cD 

gb:yc48f11.r1 Stratagene liver (937224) Hi r « sap ens r 

hypothetical protein FLJ21986 

protein phosphatase, EF hand calcium-binding domain 2 

hypothetical protein 

ESTs 

ESTs, Weakly similar to S66657 alpha-1C-adrenergic rece 

cAMP response element-binding protein CRE-BPa 

hypothetical protein FLJ11539 

glutaminase 

ESTs 

KIAA0353 protein 

Homo sapiens mRNA; cDNA DKFZp434G0827 (from clone DKFZp 
gb:DKFZp761C1413_r1 761 (synonym: hamy2) Homo saj ' 
ESTs, Weakly similar t ' ' 



Pkey CAT Number Accessions 

413156 135116_1 AA127133 AA384396 AW953912 T72119 

413607 1379911.1 T64741 BE158393 BE152805 

417194 1657323.1 N53793 N53716 N53739 

434469 387447.1 AA634806 C18732 AA729161 AA729860 

451879 888642.1 AI821030 T47126 AI821318 

453773 980699.1 AL133761 AL133767 

454736 1232235.1 BE184348 AW817453 BE011068 

456304 176820.1 AI820973 AI734077 AI820984 AA225796 AA225060 AA225101 



Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 



400628 
400802 
400878 
401103 
401159 
401270 
401486 



3180-3953 
141659-141813 

32585-32756,36281-36540,40791-40933,44018-44179 



114550-114688,117265-117407,119490-119599,123237-123395,131140-131217 



TABLE 17A: 605 genes upregulated in testicular cancer relative to normal body tissues 

Table 17A lists about 605 genes upregulated in cervical cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules, peptides, 
or antibodies. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis 
was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains 
indicative of having enzymatic function or of … (… pkinase …, phosphatase, or ion transporter). Certain predicted protein domains are noted. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Genbank accession number 
UnigeneID: UniGene number 
Pred.Prot.Domains: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280). 
Unigene Title: UniGene gene title 
R1: 95th percentile of cervical cancer AIs divided by the 90th percentile of normal tissue AIs, where the 10th percentile of all normal tissue AIs was subtracted from 
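The ratio in the last column can be illustrated with a short calculation. The Python sketch below is not the analysis code used to build the table; it is a minimal example under stated assumptions. It assumes that the 10th percentile of the normal tissue values is subtracted from both the numerator and the denominator (the legend text is cut off at that point), that percentiles are taken with linear interpolation, and that the inputs are plain lists of average-intensity values. The function names `percentile` and `r1_ratio` are hypothetical.

```python
def percentile(values, q):
    """Percentile (0-100) of a non-empty sequence, using linear interpolation."""
    data = sorted(values)
    if not data:
        raise ValueError("percentile() needs at least one value")
    rank = (len(data) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(data) - 1)
    return data[lower] + (data[upper] - data[lower]) * (rank - lower)


def r1_ratio(tumor_ai, normal_ai):
    """95th percentile of tumor AIs over 90th percentile of normal AIs.

    ASSUMPTION: the 10th percentile of the normal AIs is treated as
    background and subtracted from both numerator and denominator.
    """
    background = percentile(normal_ai, 10)
    numerator = percentile(tumor_ai, 95) - background
    denominator = percentile(normal_ai, 90) - background
    if denominator <= 0:
        raise ValueError("normal-tissue range is too narrow to form a ratio")
    return numerator / denominator


if __name__ == "__main__":
    normal = list(range(0, 101))    # made-up normal tissue AIs, 0..100
    tumor = list(range(0, 201, 2))  # made-up tumor AIs, 0..200
    print(r1_ratio(tumor, normal))
```

The input values in the usage lines are invented for the example and do not come from the table. Using high percentiles rather than means keeps a few extreme normal samples from hiding a gene that is elevated in most tumor samples.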



Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1 

408522; AI541214; Hs.46320; Small proline-rich protein SPRK [human, ; none.Cornifin; 33.942 

422168; AA586894; Hs.112408; S100 calcium-binding protein A7 (psorias; efhand,S_100;TM=M;SS=N; 33.05 

424098; AF077374; Hs.139322; small proline-rich protein 3; Cornifin;TM=M;SS=N; 32.656 

1221 110343; Hs.112341; protease hhSUtar 3, skin-derive* (SKAl; wq ,fc fi [ !=Y r < 1 

433091; Y12642; Hs.3135; lymphocyte antigen 6 complex, locus D; UPAR_LY6,toxin,Activin_recp;TM=M;SS=Y; 27.56064945 

421948; L42583; Hs.334309; keratin 6A; filament,RhoGAP,DUF236,bZIP,Tropomyosin,tubulin,DUF164,TBCA,Collagen;TM=M;SS=N; 26.778 

446292; AF081497; Hs.279582; Rh type C glycoprotein; Ammonium_transp,FecCD;TM=Y;SS=M; 26.1133829 

407242; M18728; ; gb:Human nonspecific crossreacting antig; ig;TM=M;SS=M; 23.382 

424687; J05070; Hs.151738; matrix metalloproteinase 9 (gelatinase B; fn2,hemopexin,Peptidase_M10;; 22.522 

412719; AW016610; Hs.816; ESTs; none,none; 21.198 

406690; M29540; Hs.220529; carcinoembryonic antigen-related cell ad; ig;TM=M;SS=M; 20.028 
402075; ; ; ENSP00000251056*:Plasma membrane calcium; none;; 19.038 

431958; X63629; Hs.2877; cadhenn 3 type 1 P- ddh-m placenl > iin,Cadherin_C_term;TM SS=M; 17.92061281 
412471; M63193; Hs.73946; endothelial cell growth factor 1 (platel; Glycos_transf_3,Glycos_trans_3N;TM=M;SS=M; 17.8978979 
417308; H60720; Hs.81892; KIAA0101 gene product; none;TM=M;SS=N; 17.08333333 
429259; AA420450; Hs.380088; Plakophilin; none,none; 17.08235294 
417079; U65590; Hs.81134; interteiihi, , receptor antagonist 1L1„ 16 91568628 
439926; AW01487E; H 13700 ESTs ,u,e,none; 16.69 

419693; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 16.365 
413753; U17760; Hs.75517; laminin, beta 3 (nicein (125kD), kalinin; laminin_EGF,laminin_Nterm;; 15.75294118 
413278; BE563085; Hs.833; interferon-stimulated protein, 15kDa; ubiquitin;; 15.48600509 
401781; ; ; Target Exon; filament;TM=M;SS=N; 15.43668831 
420440; NM_002407; Hs.97644; mammaglobin 2; Uteroglobin;; 15.394 

441633; AW958544; Hs.112242; normal mucosa of esophagus specific 1; none;TM=M;SS=M; 15.12264151 
452240, AI591 14 / k E-T n< rnonc;14.63 

428957; NM_003881; Hs.194679; WNT1 inducible signaling pathway protein; tsp_1,vwc,IGFBP;TM=M;SS=M; 14.49772727 

414987; AA524394; Hs.294022; hypothetical protein FLJ14950; SH2;TM=M;SS=N; 14.4389313 

432374; W68815; Hs.301885; Homo sapiens cDNA FLJ11346 fis, clone PL; none,none; 14.00909091 

400289; X07820; Hs.2258; matrix metalloproteinase 10 (stromelysin; hemopexin,Peptidase_M10,Astacin;; 13.824 

414812; X72755; Hs.77367; monokine induced by gamma interferon; IL8;TM=M;SS=Y; 13.7754386 

421552; AF026692; Hs.105700; secreted frizzled-related protein 4; Fz,NTR;; 13.74595843 

400284; ; ; NM_000125*:Homo sapiens estrogen recepto; hormone_rec,zf-C4,Oest_recep;TM=M;SS=M; 13.31578947 

428227; AA321049 H 2z48 al Ik Lit/ Y 13 f "=294118 

411274; NM_002776; Hs.69423; kallikrein 10; trypsin;TM=M;SS=N; 13.038 

406687; M31126; Hs.352054; matrix metalloproteinase 11 (stromelysin; hemopexin,Peptidase_M10;; 13.00311527 

427666; AI791495; Hs.180142; calmodulin-like skin protein (CLSP); efhand;TM=M;SS=N; 12.79 

400301; X03635; Hs.1657; est/ogen receptor 1; F-box,hormonejec,zf-C4,Oestjecep,&dh_2inc l kctoocylsynl,pp-bin 

synLC,AAA,E7,RFX_DNA_binding;TM=M;SS=N; 12.472 
410001 AB041C36 H Ika't . 11; try n;TM=M;SS=M; 12.47 

422310; AA316622; Hs.98370; cytochrome P450, subfamily IIS, polypept none,pkinase,fn3,ig; 12.28597122 
130630; AW269920r Hs.2621; . alin A (stefin A); cysta0n;TM=M;SS=N; 12.13379205 
437044; AL035864; Hs.69517; differentially expressed in Fanconi's an; none;TM=M;SS=M; 12.04945055 
418462, BE001596.HS 85266 intej,, fn3,integr B,Calx-be I F;TM=M VI 11 95538462 

443859; NMJM3409 H 3914; follistatin ' ^al 11 95467422 

426350; NM_003245; Hs.2022; transglutaminase 3 (E polypeptide, prote; Transglutamin_N,Transglutamin_C,Transglut_core;TM=M;SS=N; 11.61 
408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;SS=N; 11.564 

444781; NM_01 110' Hs.11950 i in lorHm i ta ociat ^rote UPARJ.Y6,lactamaseJ3; 11.55285714 
428484; AF104032; Hs.184601; solute earner family 7 (cationic amino ; 

aaj>eiTneases,pyridoxatJeC,bromodomain,PHD,MBD,ATj™ 
418663;AK001100;Hs.41690;desmocollin3;cadherin,Cadherin C term.none; 11.456 
433001 ;AF217513;Hs.279905; clone HQ0310 PRO0310- " none; 1 45352113 

423217; NM_000094; Hs.1540; collagen, type VII, alpha 1 (epidermolys; Kunitz_BPTI,fn3,vwa,Collagen,beta-lactamase;TM=M;SS=M; 11.32234432 
428970 BE27b8 Q 1 tt c L duccd 3 (RA'G1); rr r TM M 11 28686327 

424834; AK001432; Hs.153408; Homo sapiens cDNA FLJ10570 fis, clone NT; none,none; 11.076 
451541; BE279383; Hs.26557; plakophilin 3; Armadillo_seg;TM=M;SS=N; 11.0381579 
41B478; U38945 Hs 11 - yclin-dependent kinas r h c . r2A (mo, ank;; 11 

423673; BE003054; Hs.1695; matrix metalloproteinase 12 (macrophage ; hemopexin,Peptidase_M10;TM=M;SS=M; 11 

425071; NM_013989; Hs.154424; deiodinase, iodothyronine, type II; T4_deiodinase;TM=M;SS=Y; 10.93859349 

437938; AI950087; Hs.369628; gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien; none,none; 10.78064516 

425357; BE271188 Hs I5S975 t , t ntyr0 me phosphatase, receptor t none;TM=M;SS=Y; 10.74825175 

4 1 2527; I 61;ESTs,Weakl i t I HIM N DEATH; n none; 1 

437897; AA770561; Hs. 146170; hypothetical protein FL I 2 69 zf-DHI L .one, 10 49538462 

431629; AU077025; Hs.265827; interferon, alpha-inducible protein (do; none;TM=M;SS=Y; 10.48210735 

411558; AA102670; Hs.70725; gamma-aminobutyric acid (GABA) A recepto; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 10.26714286 
409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromoso;ABC_tran,M,SMC_N,SMC_C,DUF164,none; 10.142 
42150 NM_001833;Hs.105 t rr om 2; F L r I! r- M "' c 1 1 

418641; BE243136; Hs.86947; a disintegrin and metalloproteinase doma; disintegrin,Reprolysin,Pep_M12B_propep,EGF;TM=Y;SS=M; 10.072 
425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c;; 9.996363636 
414035; Y00630; Hs.75716; serine (or cysteine) proteinase inhibito; serpin;; 9.895325397 
1215 3; BE302796; H 10509 thymid le kinase 1 oluble Tl TM M;SS=N )8 1 88E 
407786; AA637538; Hs.38972; tetraspan 1; transmembrane4;TM=Y;SS=M; 9.876056338 
424441; X14850; Hs.147097; H2A histone family, member X; histone,CBFD_NFYB_HMF;; 9.851635514 
438091; AW373062; Hs.351546; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 9.840720222 
413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 9.823170732 

' ' '"" " ' tus pemphigoid anUgen 1 (230/240kD); efhand,spectrin l GAS2,SH3,PlecUn,RA,Xylose_isom,FliD,bZIP,Tropomyosin,fvtyc- 

i.ce-M. o mi 
409893; AW247090; Hs.57101;minichromosome maintenance defi it( CM,a!do_kol IM » SS-N 9787878788 
1'. ! AF 7 037 H 324051 Re'.f a sclaleci ihibil H ik;TM M;SS=N 9.6370 71 i 
425650; NM 001944; Hs.1925;desmoglein 3 (pemphigu vugan « t in;TM M SS~M 9 596 

417900, BE2501 27, Hs 82906 CDl t [ oncycle20 cere , 01 ~M M SS-N 9,558 
444946; AW139205; Hs.156457; hypothetical protein FLJ22408; abhydrolase,abhydrolase_2;TM=Y;SS=M; 9.55 
421481; AW391972; Hs.104696; KIAA1324 protein; none;TM=M;SS=M; 9.529085873 
408591; AF015224; Hs.46452; mammaglobin 1; Uteroglobin;TM=M;SS=M; 9.505 
444381; BE387335; Hs.283713; hypothetical protein BC014245; Collagen;TM=M;SS=M; 9.477961433 
444006; BE395085; Hc.334762; type : transmembrane protein Fn14; ldljWBpLs,PKD,MHCJ;TM=M;SS=Y; 9.415151515 
413719; BE439580; Hs.75498; small inducible cytokine subfamily A (Cy; IL8;; 9.408 
424364; AW383226; Hs.163834; ESTs, Weakly similar to G01763 r-tiLj I 1 i r 
429002; AW248439; Hs.2340; junction plakoglobin; Armadillo_seg;TM=M;SS=N; 9.315693431 
421379; Y15221; Hs.103982; small inducible cytokine subfamily B (Cy; IL8;TM=M;SS=Y; 9.31 
418004; U37519; Hs.87539; aldehyde dehydrogenase 3 family, member ; aldedh;TM=M;SS=M; 9.29 
454034; NM_000691; Hs.575; aldehyde dehydrogenase 3 family, member ; aldedh;; 9.264 
417389 BE260964 II 0 r n ii r-unte growth-promoting factor; PTN_MK;TM=M;SS-Y; 9.241561181 
445033; AV652402; Hs.72901; cyclin-dependent kinase inhibitor 2B (p1; ank;; 9.207272727 
443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=M;SS=N; 9.195167286 
23;AW238299;H 061 L16 bin ng pr i r a,PKD,M MM . i 

42875=1 AA433988 H 350 A125anl n 16; SE >WS~-N 9 028 

421777; BE562088; Hs.108196; HSPC037 protein; none;TM=M;SS=N; 9.004 

448988; Y09763; Hs.22785; gamma-aminobutyric acid (GABA) A recepto; Neur_chan.LBD,Neur_chan_memb;TM=Y;SS=M; 9.001096491 
418969 W33191 ri i tical piotein Fl I r 1 00 N 8 942 

455601; AI368680; Hs.816; SRY (sex determining region Y)-box 2; HMG_box;; 8.87 
429211; AF052693; Hs.198249; gap junction protein, beta 5 (connexin 3; connexin;TM=Y;SS=M; 8.77131783 
456906; AF117646; Hs.156637; Cas-Br-M (murine) ectopic retroviral tr; zf-C3HC4,Cbl_N,Cbl_N2,Cbl_N3;TM=M;SS=N; 8.738 
430397; AI924533; Hs.105607; bicarbonate transporter related protein ; HCO3_cotransp;TM=Y;SS=N; 8.736 
IS3 ?1 'IM " ' .1 ■-■:■•; I ' V .:n. .-..I n ,. . 
429083; Y09397; Hs.227817; BCL2-related protein A1; Bcl-2;TM=M;SS=N; 8.536 
408113; T82427; Hs.194101; Homo sapiens cDNA: FLJ20869 fis, clone A; 7tm_1,none; 8.49 
439285; AL133916; Hs.47860; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 8.460655738 
409420 Z15008 H 1451; lam r ( n I I I n EbF 8 414 

438746; AI885815; Hs.184727; Human melanoma-associated antigen p97(m; transferrin,Guanylate_kin,PDZ,SH3; 8.376205788 
439606; W79123; Hs.58561; G protein-coupled receptor 87; 7tm_1;TM=Y;SS=M; 8.37 
430486; BE062109; Hs.241551; chloride channel, calcium activated, lam; none;TM=Y;SS=M; 8.364 

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; Rhodanese,DSPc,Y_phosphatase,Ribosomal_S3_N;TM=M;SS=N; 8.266 

417771, AA804698 Hi' 8i r 1 retinoic a < orresponder(tazaio in none 8 248314607 

431620; AA126109; Hs.264981; 2'-5'-oligoadenylate synthetase 2 (69-71; NTP_transf_2;TM=M;SS=N; 8.156 

412270; AC005262; Hs.73797; guanine nucleotide binding protein {C r G ' I n TV 'iC c f 0 1 . °'fi 

448733; NM_005629; Hs. 187958; solute carrier family 6 (neurotrdiibii lie; St F;#l SS=N; 8.137559809 

427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;; 8.043478261 

424439; AA579635; Hs.1770; ligase I, DNA, ATP-dependent; DNA_ligase;; 8.038194444 

418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK; Y_phosphatase,DSPc;TM=M;SS=N; 8.024752475 

453857; AL080235; Hs.35861; Ras-induced senescence 1 (RIS1); none;TM=Y;SS=M; 8 

424046; AF027866; Hs.138202; serine (or cysteine) proteinase inhibito; serpin;TM=M;SS=N; 7.982 

418526;BE019020;Hs.85838;solutecarrierfamily16(monocaib «! mi.eTf/ ';SS M 7 973684211 

413219; AA878200; Hs.118727; Homo sapiens cDNA FLJ13692 fis, clone PL; HLH,death,TNFR_c6,Acyl-CoA_hydro; 7.892 

12280 AK0013 );ll 121021 hypothetical otein FU10549; IQ;TM=M;S£ N 7.821 

452203; X57522; Hs.352018; transporter 1, ATP-binding cassette, sub; ABCJran AEC membrane SRPM Thy m / TM < M 7 823874755 
431630; NM_002204; Hs.265829; integrin, alpha 3 (antigen CD49C, alpha ; FG-GAP,Rhabd_glycop,integrin_A;TM=Y;SS=M; 7.758985201 

V94322 1 n ma inhiblory activity; SH3;TM=M 

439453; BE264974; Hs.6556; thyroid hormonei-teptoi intei 1 i 13 AW «iDC tran,CoaE;TM=M;SS=N; 7.757751938 
452747; BE153855; Hs.61460; Ig superfamily receptor LNIR; ig,Rhabd_glycop;TM=Y;SS=M; 7.624 
438089; W05391; Hs.351546; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 7.605660377 
427747; AW411425; Hs.180655; serine/threonine kinase 12; pkinase;TM=M;SS=N; 7.578 
430280; AA361258; Hs.237868; interleukin 7 receptor; fn3,none; 7.476 
429299; AI620463; Hs.347408; hypothetical protein MGC13102; none;TM=Y;SS=N; 7.442528736 
441384; AA447849; Hs.288660; retinoic acid induced 3; 7tm_3,none; 7.442495127 



414774; X02419; Hs.77274; plasminogen activator, ui kin e; kringle rip u pi ntjhionins 7 435897436 

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 Qxellgr; Furin-like,pkinase,Recep_L_domain,YLP,none; 7.398360656 

432636; AA340864; Hs.278562; claudin 7; PMP22_Claudin;TM=Y;SS=M; 7.394039735 

431890; X17033; Hs.271986; integrin, alpha 2 (CD49B, alpha 2 subuni; vwa,integrin_A,FG-GAP;TM=Y;SS=M; 7.383419689
416084; L16991; Hs.79006; deoxythymidylate kinase (thymidylate kin; none,none; 7.382
436972; AA284679; Hs.25640; claudin 3; PMP22_Claudin;TM=Y;SS=M; 7.327160494 
409799; D11928; Hs.76845; phosphoserine phosphatase-like; Hydrolase;TM=M;SS=N; 7.316 

448569; BE382657; Hs.21486; signal transducer and activator of trans; SH2,STAT,STAT_bind,STAT_prot;TM=M;SS=N; 7.315412186
428450;NM_014791;Hs.184339;KIAA0175geneproduct;KA1,pkina 3 eTM MC N; 7.2984 75 
422283; AW411307; Hs.114311; CDC45 (cell divi c :nc c 4 ' erevis; CDC4 T" -M SS=N; 7.28 
451253; H48299; Hs.26126; claudin 10; PMP22_Claudin,Peptidase_M1,K_tetra;TM=Y;SS=M; 7.256802721
413819; U77735; Hs.80205; pim-2 oncogene; pkinase;; 7.234455959 

421817; AF146074; Hs.108660; ATP-binding cassette, sub-family C (CFTR; Fasciclin,ABC_tran,ABC_membrane,GTP_EFTU;TM=M;SS=M; 7.162534435 

451035; AU076785; Hs.430; plaslin 1 (1 feofotro); efhond.CH Adapting;; 7.145454546 

424008; R02740; Hs.137555; putative chemokine receptor; GTP-binding; 7tm_1;TM=Y;SS=M; 7.126

414482; S57498; Hs.76252; endothelin receptor type A; 7tm_1;TM=Y;SS=M; 7.122413793 

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease (APEX; Troponin,Exo_endo_phos,IQ;TM=M;SS=N; 7.106719368

4X893 <o4'32 II 2 1 J an I n;TM=M M 7.088937093 

407792; AI077715; Hs.39384; putative secreted ligand homologous to t; none;TM=M;SS=Y; 7.052

426514; BE616633; Hs.170195; bone morphogenetic protein 7 (osteogenic; TGF-beta,TGFb_propeptide;; 7.042

431241; AA495799 H* ESI S a GEF none" 703 

437139; W73685; Hs.1 18513; ESTs, Weakly similar to RTA RAT PROBABLE; 7tm J ;TV=Y;SG-V; 7.03 
420311; AW445044; Hs.38207; Human DNA sequence from clone RP4-530I15; none,none; 7.026
439979; AW600291; Hs.6823; hypothetical protein FLJ10430; none;TM=M;SS=N; 7.008
422846; BE513934; Hs.1583; neutrophil cytosolic factor 1 (47kD, chr; SH3,PX;TM=M;SS=N; 6.991626794
416250; AA581386; Hs.73452; Kremen 2; kringle,CUB,WSC;; 5.972
430770; AA7S5594, !-'- i;>; ? „- ESTs; .onc.none; 6.S5 

418869; AW516565; ; gb:xq01d05.x1 Soares_NHCeC_cervical_tumo; none,RasGAP,WW,IQ; 6.948 
428953; AA306610; Hs.348183; tumor necrosis factor receptor superfami; 60s_ribosomal,Ribosomal_L10,TNFR_c6,DEAD;; 6.914
418283; S79895; Hs.83942; cathepsin K (pycnodysostosis); Peptidase_C1;; 6.876190476

419667 AU077005 H i n ri I dom i ri ly n Pep_M12B_propep;TM=M;SS=M; 6.852970711 

421143; AB024536; Hs.102171; immunoglobulin superfamily containing le; ig,LRR,LRRNT,LRRCT;TM=M;SS=M; 6.849056604 
456181 L3 463 H 101) r htt r t Pi »M^N 6762 
436856; AI469355; Hs.127310; ESTs; pkinase,rrm;TM=M;SS=N; 6.721428571

411125; M151647; Hs.68877; cytochrome b-245, alpha polypeptide; none;TM=Y;SS=M; 6.720348837

439750; AL359053; Hs.57664; Homo sapiens mRNA full length insert cDN; IMPDH.C MF DH_N r r> S 1 !i,logrn_B 1 Ricin_BJectin; 6.717307692 

450334; AF035959; Hs.24879; phosphatidic acid phosphatase type 2C; PAP2;TM=Y;SS=M; 6.715240642 

426437. EEO'^ 2 I 69895; ubiquitir i i,i ,. a tmg pn?/meE2L6; ArmadilIo_seg,UQ_con,none; 6.G88194444 
439738; BE246502; Hs.9598; sema domain, immunoglobulin domain (Ig),; Sema,PSI,integrin_B;TM=Y;SS=N; 6.570553936

428385; AF112213; Hs.184062; putative Rab5-interacting protein; SH2,SH3;; 6.552021348

456534; X91195; Hs.100623; phospholipase C, beta 3, neiglitn, pseudo LIM.PDZ , hn a 6 'f. 1713299 

425289; AW139342; Hs.155530; interferon, gamma-inducible protein 16; PAAD_DAPIN,HIN;; 6.652671756

426500 Nf1 <; 1 KIAA0450 gen I 1 W < SS N 639655172 

438113; AI467908; Hs.8882; ESTs; 7tm_1,none; 6.6

444783; AK001468; Hs.62180; anillin (Drosophila Scraps homolog), act; PH,none; 6.6

408482; NMJ00676 h I I 't M M 6 548148148 

410290; M402307; Hs.322844; hypothetical protein DKFZp564A176; Sema,PSI,TIG,integrin_B;TM=Y;SS=M; 6.532763533 
414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); PA;TM=Y;SS=N; 6.526951673 
426440; BE382756; Hs.169902; solute carrier family 2 (facilitated glu; sugar_tr;TM=Y;SS=M; 6.512704174
420039; NM_004605; Hs.376147; sulfotransferase family, cytosolic, 2B, ; Sulfotransfer;; 6.406 
423031, Al27d^ H i ESTs; none.none, 6 447658402 

421445; M913059; Hs.104433; Homo sapiens, clone IMAGE:4054868, mRNA; ion_trans,K_tetra,asp; 6.426666667
433933; AI754389; Hs.355397; Homo sapiens clone TCCCTA00164 mRNA sequ; none;NA;NA; 5.4
435094; AI560129; Hs.289008; EST; none,none; 6.312

432106; H58323; Hs.269098; ESTs, Weakly similar to RETRCMRUS-RELAT; SH3,PDZ,Guanylate_kin,none; 5 276555777 

427540; AF058293; Hs.180015; D-dopachrome tautomerase; COX8 l SHMT,MIF,GST_aEF1G_domain,GST_N,S1,F2,Friz2led,calreticulin,7tm_2,rrm,PAP_asscc;TM=Y;SS=f^ 
6.272727273 

435232; NM_001262; Hs.4854; cyclin-dependent kinase inhibitor 2C (p1; ank;TM=M;SS=N; 6.269720102
3 5 418203 4042 Hs.83 1 f DC £ rot in kinas I <S 6219081272 

411263; BE297802; Hs.69360; kinesin-like 6 (mitotic centromere-assoc; kinesin;TM=M;SS=N; 6.19 

409512; AW979187; Hs.293591; melanoma differentiation associated prot; DEAD,helicase_C,CARD;TM=M;SS=N; 6.188888889

449230; BE613348; Hs.356392; melanoma cell adhesion molecule; ig,isodh,Ribosomal_L5,F-box;TM=Y;SS=M; 6.188046647

440006; AK000517; Hs.6844; NALP2 protein; PYRIN-Containing APAF1-li; AAA,NB-ARC,PAAD_DAPIN;NA;NA; 6.15503876
450581; AF081513; Hs.25195; TGF-beta 4; TGF-beta,TGFb_propeptide;; 6.152

432314; AA533447; Hs.285173; ESTs; Xlink,none; 6.123040752

418844; M62982; Hs.1200; arachidonate 12-lipoxygenase; lipoxygenase,PLAT;TM=M;SS=N; 6.12
421733; AL119671; Hs.1420; fibroblast growth factor recpploi 3, <\, j.pkii e;TM i 5S M i.095758355 
t; 1 W327546; H 1110 4 olutei rri m (i cl * i mito_carr;TM=M =N; 6.C ) 786 
452683; AI089575; Hs.374574; progesterone membrane binding protein; homeobox,none; 5.06284153
1455 1245671; Hs.12844 F n lip EGF.MAM;; 6.05 1330 

444309; U83236; Hs.10803; calcium and integrin binding protein (DN; efhand;; 6.04015544

414166; AW888941; lls.75739; N-myc downstream regulated; DEAD,hclicasc_C.rrm 1 Ndr,Cys_knot,TIUvwa,vwc,vwd,IQ,Rlla,abhydrolase,TGF- 
, . beta,DUF139,TPR,DSPc,tsp 1,Ribcsomal_S2f,rvp;TM=M;SS=N; 6.009562842 

5 0 438108;AI471795; Hs.287776; vanilloid receptor-related osmotically a: ank.ionJrsrs.TM =Y;SS=N; 6.t>34 

413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; none;TM=Y;SS=M; 5.984536083 

405484; ; ; C3002124*:gi|12737280|ref|XP_006682.2l k; none;; 5.978964401 

414907; X90725; Hs.77597; polo (Drosophila)-like kinase; Ribosomal_L37ae,pkinase,POLO_box,tRNA-synt_1b,dynamin,dynamin_2,GED,bZIP,M;; 5.978431373
419216; AU076718; Hs.164021; small inducible cytokine subfamily B (Cy; IL8;; 5.976 
414135; NM_004419; Hs.2128; dual specificity phosphatase 5; Rhodanese,DSPc,Y_phosphatase;TM=M;SS=N; 5.969387755
411756; BE294350; Hs.71891; discoidin domain receptor family, member; pkinase,F5_F8_type_C;TM=Y;SS=M; 5.95184136
424291; AL120051; Hs.144700; ephrin-B1; Ephrin;TM=Y;SS=M; 5.951550388
453459; BE047032; Hs.257789; ESTs; none,none; 5.95

456373; BE247706; Hs.86693; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=N; 5.938 
429359; W00482; Hs.2399; matrix metalloproteinase 14 (membrane-in; hemopexin,Peptidase_M10;TM=M;SS=M; 5.917857143

414703; BE243877; Hs.380063; ATPase. Na? transporting, beta 3 poiypep Ha ATI M=> ' 5910455487 

448775; AB025237; Hs.388; nudix (nucleoside diphosphate linked moi; NUDIX;TM=M;SS=M; 5.901386793

452239; AW379378; Hs.356289; protein tyrosine phosphatase, receptor t; none,none; 5.868362832

418345; AJ001696; Hs.241407; serine (or cysteine) proteinase inhibito; serpin;TM=Y;SS=M; 5.842 
452875; BE275760; Hs.30928; DNA segment on chromosome 19 (unique) 11; Euk_porin;TM=M;SS=M; 5.816363636

439625; AF086453; Hs.58611; ESTs; Fork_head,glycolytic_enzy,Na_sulph_symp; 5.811594203

447343; AA256641; Hs.236894; ESTs, Highly smila u n J p 3-2 r i none none 5 31 

422765; AW409701; Hs.1578; baculoviral IAP repeat-containing 5 (sur; BIR;TM=M;SS=N; 5.806 

415198; AW009480; Hs.943; natural killer cell transcnp' non 1 1 -M S^-H 5 804137931 
431941; AK000106; Hs.272227; Homo sapiens cDNA FU200S9 fis, clone CO; pkinase,Furin-like,Recep_L_domain,none; 5.8

457001; J03258; Hs.2062; vitamin D (1,25- dihydroxyvitamin D3) re; hormone_rec,zf-C4,Metallothio_5;TM=M;SS=N; 5.794

439335; AA742697; Hs.62492; NM_052863:Homo sapiens secretoglobin, fa; none;; 5.778588808 

439246; AI498072; Hs.351474; membrane-associated tyrosine- and threon; ank,pkinase,UPF0073;; 5.763492064 

452461 N78223 Hs.108106 transcription factor ;f-C3 IC4,ubiquiUn,Pt ID.YDG SRA;TM M;SS=N; 5.72£ 
414883; AA926960; Hs.348669; CDC28 protein kinase 1; CKS;; 5.714634146

424517; AI539443; Hs.137447; Homo sapiens cDNA FLJ12159 fis, clone MA; SH2,STAT,STAT_bind,STAT_prot,none; 5.701666667

419056; M89957; h CD79B antig i(immun bulin-associat j,ITAM;TM=<i S=M 1 3 2 

432269; NM_002447; Hs.2942; macrophage stimulating 1 receptor (c-met; pkinase,Sema,PSI,TIG,A4_EXTRA;TM=M;SS=M; 5.686

452696;AI826645;Hs.211534;ESTs;ArfGap l PH,ank l Guan V McJ:n t 'u;iH3; 5.683673469 
411030; BE387193; Hs.67896; 7-60 protein; none;TM=M;SS=N; 5.676757677

447131; NM_004585; Hs.17466; retinoic acid receptor responder (tazaro; none;TM=Y;SS=N; 5.672977625 

426227; U67058; H ' Hin rr [ tv | nR; 7tm_1 .'i YS^=M 5 555 

407722; BE252241; Hs.38041; pyridoxal (pyridoxine, vitamin B6) kinas; pfkB;TM=M;SS=N; 5.655616943
... „ s . - -se 13; pkinase;TM=M;SS=N; 5.6465623

13; membrane-associated tyrosine- and threon; ank,pkinase,UPF0073;; 5.648 
452690; AI535070; Hs.15085; ESTs; pou,homeobox,lig_chan,ANF_receptor; 6.646

tiicp i i i t h f t 

418703; NM_014448; Hs.87435; Rho guanine exchange factor (GEF) 16; SH3,PH,RhoGEF,Birna_VP3;TM=M;SS=N; 5.636

426108; M622037; Hs.166468; programmed cell death 5; DUF122;TM=M;SS=N; 5.635087719 

424490; AJ278016; Hs.65565; ankyrin repeat domain 3; ank,pkinase;TM=M;SS=N; 5.620930233

432065; AA401039; Hs.2903; protein phosphatase 4 (formerly X), cata; Metallophos;TM=M;SS=N; 5.608352145

417018; M16038; Hs.80887; v-yes 1 Yam chi sarcorr I H I 1M 11 bS N 5.596052632 

430696; AA531276; Hs.59509; ESTs; pkinase,PP2C,none; 5.575112108 

435017; AA336522; Hs.12854; angiotensin II, type I receptor-associat; none;TM=Y;SS=M; 5.55691056S

139963 TO47529; Hs.67E tin,, ctiv ng factor acetylhydrola;PAF-AH_lb,L:pose_GDSL;TM=M N ,56195 5 

415012; NM 004383 Hs 77793 c f i Hi T <M J r rr 5421687 

424909; S73187; Hs.153752; cell division cycle 25B; Rhodanese;; 5.549751244

413969; X14034; Hs.75643; phospholipase C. gamma (f'-o h tu>l ' H3,C2,P! I PLOY PI PLC-X.PDGF;; 5.541366907 
406621; X57309; Hs.181125; immunoglobulin lambda locus; ig,HSP70,Ppx-GppA;TM=M;SS=N; 5.54076037
417700; M36542; Hs.1 101; POU domain, class 2, transcription facto, homeobo^ poci;TM=M;S r 36 
456362; AW973003; Hs.179909; hypothetical protein FLJ22995; none;TM=M;SS=N; 5.52661597
436576, AI458213 H=" ( <1 I "i _1 DnaJ;5.526381S1 

425465; L18964; Hs.1904; protein kinase C, iota; pkinase,DAG_PE br pkii K i c TM-M SS=N; 5.519672131 

412276; DE262621; I ls.73798; macrophage migrator rl ni\"l ,,r 13 5 c 16453382 

417433; BE270266; Hs.82128; 5T4 oncofetal trophoblast glycoprotein; LRR,LRRNT,LRRCT;TM=Y;SS=M; 5.514964789 

447827 U73727 H 19718; pr phosph It ! MAM;TM=Y;SS=M 494202899 

419508; AW997938; Hs.90786; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y;SS=M; 5.471947195

410608; AI538438; Hs.159087; ESTs; ubiquitin,integrin_B,UBA,none; 5.465384615

448633; M311426; Hs.21635; tubulin, gamma 1; tubulin;TM=M;SS=N; 5.460076046

408716; AI567839; Hs.151714; Homo sapiens mRNA for KIAA1769 protein, ; UvrD-helicase,RNB,Runt;TM=M;SS=N; 5.450413223

423410; BE29844C, I Is 3t'GC°C BCL2-lik. 1, Bcl-2,BH4,none; 5.444805196 

457819; M057484; Hs.35406; FLJ20522 Hypothetical protein FLJ20522; none,none; 5.444281525

422597; BE245909; Hs.118634; ATP-binding cassette, sub-family B (MDR/; ABC_tran,ABC_membrane,PRK;TM=Y;SS=N; 5.437931035

429191; AF065215; Hs.198161; phospholipase A2, group IVB (cytosolic); C2,PLA2_B,jmjC;TM=M;SS=N; 5.4375

449951; AW265634; Hs.133100; ESTs; pkinase,Furin-like,Recep_L_domain,none; 5.435211268 

409012; AL117436; Hs.49725; DKFZP434I216 protein; PH,RhoGEF;TM=M;SS=M; 5.433333333

443466; BE243123; Hs.321045; IKK-related kinase epsilon; inducible Ik; pkinase,PJ01;TM=M;SS=N; 5.429657795

434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 5.423322684

423189; M59371; Hs.171596; EphA2; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 5.421621622

452291; AF015592; Hs.23353; CDC7 (cell division cycle 7, S. cerevisi; pkinase;TM=M;SS=N; 5.412 

432527; AW975028; Hs.102754; ESTs; none,none; 5.40625 

422278- AF072873; Hs.114218; frizzled (Drosopl ildl h noiog Frizzled /I n 2 TM Y CS-M 5.405504587 

410024; AW191024; Hs.55016; hypothetical protein FLJ21935; SH3;TM=M;SS=N; 5.396 

434407; BE552368; Hs.231853; Homo sapiens cDNA FLJ13445 fis, clone PL; 7tm_1,none; 5.391472868

438974; AF089816; Hs.6454; chromosome 19 open reading frame 3; PDZ;; 5.3892S0814 

439670; AF088076; Hs.59507; ESTs, Weakly similar to AC004858 3 U1 sm; none,none; 5.382 

437016^AU0769m ^^^^^^/gMP ! '" c!oc kidinivEATS metalthio,EB,hemeJ,RCC1,ZZ,FeThFtedJ\,ENTH,Band_41,HECT;TM=M;SS=N; 5.373937677 
424848, AI2632o1 H' 327090 L>l H3,PDZ,Guanylat tin none; 5.36 

452721; AJ269529 II, J1 71 o arrior family 37 (glycerol-3-phc Pt ar_t TM Y SS-M; 5.35971 223 
40593; i 15000305:. if 1V122kjblAAC69198.il (AF0; ras;TM=M;SS=N; 5.349226804 

i W0 CD79A antigen (immunoglobr I r J y r n I TM-YSS M; 5. 3461 53846 

453143; AA382234; Hs.356289; protein tyrosine phosphatase, receptor t; serpin;; 5.333667335
423973; AF038461; Hs.136574; arachidonate 12-lipoxygenase, 12R type; lipoxygenase,PLAT;TM=M;SS=N; 5.33
408308; AL033377; Hs.44197; hypothetical protein DKFZp564D0462; none,none; 5.328
414821; M63835; Hs.77424; Fc fragment of IgG, high affinity Ia, re; ig;TM=Y;SS=M; 5.316
427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 5.309638554
410165; BE560228, H" 71869; api ,.1 >si -a ssTlated speck-like pratein ; PAAD_DAPIN,CARD;TM=M;SS=N; 5.293560606 
415817; U88967; Hs.78867; protein tyrosine phosphatase, receptor-t; fn3,Y_phosphatase,carb_anhydrase;TM=Y;SS=M; 5.28
427315; AA179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp564N0763 (f; none,spectrin,SH3,PH,CH; 5.278947368
431441; U81961; Hs.2794; sodium channel, nonvoltage-gated 1 alpha; ASC;TM=Y;SS=N; 5.274746193 
416207; NM_014745; Hs.79077; Homo sapiens, clone MGC2908, mRNA, comp; none;TM=Y;SS=M; 5.272222222
41G117 AF120499: Hs.73016; polynucleotide kinase 3'-phosph £ tas' Vir I helica i i V-M,SS=N;5.27 
435905; AW997484; Hs.5003; KIAA0456 protein; SH3,RhoGAP,FCH;TM=M;SS=N; 5.251865672
409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; DSPc,Rhodanese,none; 5.248
431846; BE019924; Hs.271580; uroplakin 1B; transmembrane4;TM=Y;SS=M; 5.232 
422017; NM_003877; Hs.110776; STAT induced STAT inhibitor-2; SH2,; 5.212418301
436469; AK001455; Hs.5198; Down syndrome critical region gene 2; none;; 5.209259259

421502; AF111856; Hs.105039; solute carrier family 34 (sodium phospha; Ribosomal_L20,Na_Pi_cotrans;TM=Y;SS=N; 5.202

425356; BE244879; Hs.155939; inositol polyphosphate-5-phosphatase, 14; Exo_endo_phos,SH2;TM=M;SS=N; 5.19979716

437412; BE069288; Hs.34744; Homo sapiens mRNA; cDNA DKFZp547C135 (fr; ABC_tran,GTP_EFTU,ABC_membrane,none; 5.199074074

416602; NM_00615° F ' i r ol ir l ding proteii II I. EG *c,T F^l c 1 8224852 

429556; AW139399; Hs.314807; ESTs; none;TM=M;SS=N; 5.192439863

427857; AL133017; Hs.288679; hypothetical protein FLJ22865; myosin_head,IQ,zf-MYND;TM=M;SS=M; 5.190251572
400517; ; ; lengsin; none;TM=M;SS=N; 5.18

413436; AF238083; Hs.68061; sphingosine kinase 1; DAGKc;TM=M;SS=N; 5.172881356
423527; AI206965; Hs.105861; hypothetical protein FLJ13824; none;TM=M;SS=N; 5.165060241

41 9138; U48508; Hs.89631; ryanodine receptor 1 ( If } trans.SF * h(OF TF \ "ir T> Y SS=N; 5.156976744 
437809; AL137723, 1 Is 5E E 5 Homo sapiens mRNA; cDNA DKFZp434D081 8 (f; none.none; 5.154676259 
452069; AB028949; Hs.1 83994; KIAA1 026 protein M tall )l Ti< M;SS=t 5152360515 
409340; BE174629; Hs.321130; hypothetical protein MGC2771; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;SS=N; 5.144859813
442875; BE623003; Hs.23625; Homo sapiens clone TCCCTA00142 mRNA sequ; K_tetra,DUF51,none; 5.142
434883; AW381538; Hs.19807, hyp II I I protein WGC1 2959 rl PH WWRtioGAP;; 5.141534392 
434808; AF155108; Hs.256150; NY-REN41 antigen; none;TM=M;SS=N; 5.14

431341; AA307211; Hs.251531; proteasome (prosome, macropain) subunit; proteasome;TM=M;SS=N; 5.13968254
431685; AW296135; Hs.267659; vav 3 oncogene; CH,DAG_PE-bind,PH,RhoGEF,SH2,SH3,DC1;TM=M;SS=N; 5.129476584
41 1 1 90; AA306342; Hs.691 71 ; protein K nase C-lih e ^ p nase.pkinas ( HR1;TM-M;SS-N 5 121527778 

433573; AF234887; Hs.57652; cadherin, EGF LAG seven-pass G-type rece; 7tm_2,EGF,cadherin,laminin_EGF,laminin_G,Trypan_glycop,GPS,HRM;TM=Y;SS=M; 5.107438017

433662; W07162; Hs.150826; RAB25 RAB25, member RAS oncogene family; ras,ABC_tran,arf;TM=M;SS=M; 5.10251046
419493; AF001212; Hs.90744; proteasome (prosome, macropain) 26S subu; CDK5_activator,PCI,none; 5.095194085

411027; AF072099; Hs.67846; leukocyte immunoglobulin-like receptor, ; inositol_P,ig;TM=M;SS=N; 5.092

435243; AW292886; Hs.348932; hypothetical protein dJ434014.3; IRF,none; 5.092

434417; AL110157; Hs.3843; Homo sapiens mRNA; cDNA DKFZp586F2224 (f; DSPc,none; 5.091922006

415323; BE269352; Hs.949; neutrophil cytosolic factor 2 (65kD, chr; SH3,TPR;TM=M;SS=N; 5.088932806 
418629; BE247550; Hs.86859; growth factor receptor-bound protein 7; SH2,PH,RAr. 5.082840237

420030, BE503' Hs.14 AA041 ne product; SH3,non ' 080645161 

444065; AW449415; Hs.10260; Homo sapiens cDNA FLJ11341 fis, clone PL; SH3;; 5.063953488

421677; H64092; Hs.38282; ESTs; A1pp,Armadillo_seg,IBB; 5.056

411165; NM_000169; Hs.69089; galactosidase, alpha; Melibiase;; 5.054133668 
428479; Y00272; Hs.334562; cell division cycle 2, G1 to S and G2 to; pkinase,ICE_p10,ICE_p20;TM=M;SS=M; 5.054

423883; AF250238; Hs.134514; ATP-binding cassette, sub-family A (ABC1; ABC_tran,photoRC,SRP54,Ca_channel_B,Pterin_4a;TM=Y;SS=M; 5.051724138

421917; AB028943; Hs.109445; KIAA1020 protein; BTB,zf-C2H2,PI3_PI4_kinase,PI3Ka;TM=M;SS=N; 5.051282051

422241 ; Y00062 H 170 i > M N 5 047311828 

410026; AI912061; Hs.55016; hypothetical protein FLJ21935; none,none; 5.04674221
426395; BE151985; Hs.355669; hypothetical protein FLJ23316; pkinase,none; 5.040298508

418054; NM_002318; lis 83351 I, I da - RCR,LysyLoxidase;TM=M;SS=M; 5.039039039 

444895; AI674383; Hs.22891; solute carrier family 7 (cationic amino ; ASC,death,TNFR_c6; 5.037151703

413472 BE24?870 H 753 o! I 1 (glial higl D IM 1 

446272; BE268912; Hs.14601; hematopoietic cell-specific Lyn substrat; SH3,HS1_rep;TM=M;SS=M; 5.03030303
410772; BE275297; Hs.194685; Homo sapiens clone 24675 mRNA sequence; Topoisom_bac,Toprim;; 5.027985075

418613; AA744529; Hs.86575; mitogen-activated protein kinase kinase : pkinase,CNH;TM=M;SS=N; 5.014652015 

415166; NM_003652; Hs.78068; carboxypeptidase Z; Zn_carbOpept,Dioxygenase,Fz;; 5.012209939

415498; U33632; Hs.79351; potassium channel, subfamily K, member 1; ion_trans;TM=Y;SS=M; 5.001811594

410687; U24389; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;; 4.997983871
430024; AI808780; Hs.227730; integrin, alpha 6; integrin_A,FG-GAP;TM=Y;SS=M; 4.994871795

409220; BE243323; Hs.51233; tumor necrosis factor receptor superfami; TNFR_c6,death,Lipoprotein_5,TIL;TM=Y;SS=M; 4.987135506

423804; AW403448; Hs.1706; interferon-stimulated transcription fact; IRF,zf-C3HC4,IBR,zf-RanBP;TM=M;SS=N; 4.985185185

423011; NM_000683; Hs.123022; adrenergic, alpha-2C-, receptor; 7tm_1;TM=Y;SS=M; 4.984

419577; L36531; Hs.91296; integrin. alpha8;integnn AFG-j P Tri S '4 
5 5 402328 Target Ex > M la IM M N 4 96728972 

421242; AW161386; Hs.13561; hypothetical protein MGC4592; none;NA;NA; 4.965334165 

435523; T62849; Hs.11090; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 4.964491363

414203; BE262170; Hs.78629; ATPase, Na? transporting, beta 1 polypep; none,none; 4.961956522

409582; R27430; Hs.271565; ESTs; none,Neur_chan_LBD,Neur_chan_memb; 4.946
4U 451292; AB037716;Hs 26204 KIAA12"prr It H3;TM=I N 4943181818 

453449; W1G752; Hs.32981; sema domd i murium glnbulinilc main (Ig) ,; ig.S Psl 4.930508475 

425233; Z17861; Hs.155218; E1B-55kDa-associated protein 5; SPRY,SAP,pkinase,fn3,ig; 4.925347222

425247; NM_005940; Hs.155324; matrix metalloproteinase 11 (stromelysin; hemopexin,Peptidase_M10;; 4.92

422282; AF019225; Hs 114 19 ipolipi note n L; MotA_ExbB;TM=Y;SS=M; 4.912181303 
442572; AI001922; Hs.135121; hypothetical protein FLJ22415; none,HSP70; 4.910224439

425743; BE396495; Hs.159428; BCL2-associated X protein; Bcl-2;TM=Y;SS=N; 4.909972299 

449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; 7tm_1;TM=Y;SS=M; 4.904 

409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; SH2,SH3,pkinase;TM=M;SS=N; 4.897338403

411770; NM_014278; Hs.71992; heat shock protein (hsp110 family); HSP70;TM=M;SS=N; 4.894
5U 1*-7. II 1. . >|..i. ..F.i.'Hri i.i .7hi I TM r SS=M;4 886 

423198; M81933; Hs.1634; cell division cycle 25A; Rhodanese,none; 4.884

445462; AA378776; Hs.288649; hypothetical protein MGC3077; none;; 4.S7637S691 

c 71 20521; HMT1 (hnRNP metl i! 6117497 

427792; M63928; Hs.180841; tumor necrosis factor receptor superfami; SRP14,TNFR_c6;; 4.873684211 
449027; AJ271216; Hs.22880; dipeptidylpeptidase III; Peptidase_M49,EGF,ig,Neuregulin;TM=M;SS=N; 4.872641509

421541; NM_003942;Hs.105584;ribosomal protein S6 kinase, 90kD poly ( pt i. h mii e_ TM M M 4%9318182 

429619; AL120751; Hs.211568; eukaryotic translation initiation factor; none,none; 4.868073879

458873; AW150717; Hs.345728; STAT induced STAT inhibitor 3; none,none; 4.861538452

437669; AI358105; Hs.123164; ESTs, Weakly similar to match to ESTs AA; none,pkinase,pkinase_C; 4.854651163
405545; ; ; Target Exon; ABC_tran,SRP54,ABC_membrane;TM=Y;SS=M; 4.85

424779; AL046851; Hs.153053; CD37 antigen; transmembrane4;TM=Y;SS=M; 4.848387097

424263; M77640; Hs.1757; L1 cell adhesion molecule (hydroceM fi g,IF i '1 Y SS-M 4.B46153846 

425421; L11669; Hs.157145; tetracycline transporter-like protein; sugar_tr;TM=Y;SS=M; 4.843694494 

421267; BE314724; Hs.103081; ribosomal protein S6 kinase, 70kD, polyp; pkinase,pkinase_C;TM=M;SS=N; 4.842532468
05 1 8736; T1 8979; H ted CBP activator pr T_hook,SNF2 N;TM-M;SS 

444143; AW747996; Hs.160999; ESTs, Moderately similar to A56194throm; Bcl-2,none; 4.841071429 

417331; AW41 1 297; Hs.81 972; $HC(Src homology 2 domainxontainin t SH2,PiD.2l-C2h2,SCAN^P-bind^,KW^TM-M-^S=N: 4.839464883 

422010; AA302049; Hs.31181; Homo sapiens cDNA: FLJ23230 fis, clone C; none,SDF,sugar_tr; 4.837837838

434521; NM_002267; Hs.3886; karyopherin alpha 3 (importin alpha 4); Armadillo_seg,IBB;TM=M;SS=N; 4.833333333
450447; AF212223; Hs.25010; hypothetical protein P15-2; NTF2;TM=M;SS=N; 4.821666667

•I', --.II l; lv ■ ■-. [..!-■ I.I. .:,<.>..:„ FU>: " J ' . i:r. f;r;.7M=, : :>l.'. 4 VS.XiiX 

417949; AL049795 II f "M t I n41 4 31443299 • 

60 ri 9 n f F j r JE n_E IlLTtt 'I M; 4.81 

432581; AU076465; Hs.278441; KIAA0015 gene product; PP2C;TM=M;SS=N; 4.805063291
432194; AL040801; Hs.273219; breast cancer anti-estrogen resistance 1; SH3;; 4.803191489

431472; AK001023; Hs.256549; nucleotide binding protein 2 (E.coli Min; fer4_NifH,ParA,APS_kinase,ArsA_ATPase;TM=M;SS=N; 4.800990099

450690; AA296696; Hs.333418; FXYD domain-containing ion transport reg; ATP1G1 PLM_MAT8;TM=Y;SS=U; 4.795480881 

448950; AF288687; Hs.9275; CGI-152 protein; E1-E2_ATPase,Hydrolase,TM-> £ =N I.77G923077 

427681;AB018263;Hs.284232;tumornecrosisfaclorreceptorsuperfami death Ifirf, c6,PH xj, y Rh GEF.Metallothio ! 1M=M;SS=M; 4.772196262 
432827; Z68128; Hs.3109; Rho GTPase activating protein 4; FCH,RhoGAP,SH3;TM=M;SS=N; 4.760115607
433376; AI249361; Hs.741 22; caspase 4, apoptosis-related cysteine pr; CARD,ICE_p10,ICE_p20;; 4.7" ' " 
lor receptor,; ig,pkinase;TM=Y;



419250;AW770185;Hs.356066; U5 snRNP-specific protein, 116 kD; 7trn_1,BAH,zf-CXXC,DNA_methylase; 4.' 
417903; NM_002342; Hs.1116; lymphotoxin beta receptor (TNFR superfam; TNFR_c6;TM=M;SS=M; 4.718858132
414368; W70171; Hs.75939; uridine monophosphate kinase; PRK,CoaE;; 4.718836565
426059; BE292842; Hs.166120; interferon regulatory factor 7; IRF;; 4.718543046 
414788; X78342; Hs.77313; cyclin-dependent kinase (CDC2-like) 10; pkinase;TM=M;SS=N; 4.708 
407601; AC002300; Hs.37129; sodium channel, nonvoltage-gated 1, beta; ASC;TM=Y;SS=M; 4.707920792
448520; AB002367; Hs.21355; doublecortin and CaM kinase-like 1; pkinase,DCX;TM=M;SS=N; 4.707671958
407143; C14076; Hs.332329; EST; none;TM=Y;SS=M; 4.682676815
428582; BE336699; Hs.185055; BENE protein; none;TM=Y;SS=M; 4.681818182 

408806; AW847814; Hs.75608; Homo sapiens cDNA: FLJ21532 fis, clone C; SH3,PDZ,Guanylate_kin,none; 4.680440771
448133; M723157; Hs.73769; folate receptor 1 (adult); Folate_rec,MIP;TM=M;SS=M; 4.679841897 
418836; AI655499; Hs.161712; ESTs; pkinase,Activin_recp,PDZ,ZU5,death; 4.679180887
425308; M97639; Hs.155585; receptor tyrosine kinase-like orphan rec; ig,kringle,pkinase,Fz;TM=Y;SS=M; 4.675342466 
414665; AA160873; Hs.356307; serum amyloid A1; zf-C2H2,BTB,K_tetra,none; 4.67447496
449843; R85337; Hs.24030; solute carrier family 31 (copper transpo; none;TM=Y;SS=M; 4.5737012S9
428245; AF151048; Hs.183180; anaphase promoting complex subunit 11 (y; none;; 4.656756757
417088; M54915; Hs.81170; pim-1 oncogene; pkinase;TM=M;SS=N; 4.655190476
420340; NM_000734; Hs.97087; CD3Z antigen, zeta polypeptide (TiT3 com; ITAM;TM=M;SS=M; 4.65 
4 NM_001761; Hs.1973; cyctin lb Tf 1 f 4 "4J 

417929; R27219; Hs.74647; Human T-cell receptor active alpha-chain; ig,abhydrolase; 4.640384615
430603; AA148164; Hs.247280; HBV associated factor; zf-C3HC4,zf-RanBP,pkinase; 4.630653266
419273; BE271180; Hs.293490; ESTs, Weakly similar to I38022 hypotheti; none,none; 4.628
453880; AI803166; Hs.135121; ESTs, Weakly similar to I38022 hypotheti; HSP70,none; 4.619047619
459399; BE407712; Hs.153998; creatine kinase, mitochondrial 1 (ubiqui; none,none; 4.613577075
412970; AB026436; Hs.177534; dual specificity phosphatase 10; Rhodanese,DSPc;; 4.616
433577; AW007080; Hs.284192; ESTs; none,none; 4.614
444838; AV651680; Hs.208558; ESTs; integrin_A,FG-GAP,none; 4.612149633
408369; R38438; Hs.118747; SLC15A2 Solute carrier family 15 (H+/pep; PTR2;TM=Y;SS=N; 4.602
450825; AC005954; Hs.25527; tight junction protein 3 (zona occludens; PDZ,Guanylate_kin;; 4.596875
443951; F13272; Hs.356835; ferritin, light polypeptide; PMP22_Claudin,none; 4.587931035 
433083, ALC4275 1 H a 1 4 1 r ?; EST P i M M SS"N; 4.586 

407239; AA076350; Hs.67846; leukocyte immunoglobulin-like receptor, ; ig;TM=Y;SS=M; 4.58557047
412926; AI879076; Hs.75061; macrophage myristoylated alanine-rich C ; MARCKS;; 4.579087049
422009; AI742845; Hs.110713; DEK oncogene (DNA binding); SAP;; 4.576347305
412584; X54870;Hs.740B5;DNA segment on rhwn omi 15 ui |u ) 24 none,lectin_c; 4.57312253 
41 4561; AI064813; Hs.195155; Homo sapiens amino acid transport system; Aa_trans;TM=Y;S?- M - " c 
422627;DE336857:Hs.118787; transforming growthfa y, beta-indue i ras din ABCJran/ 
459053; AI807052; Hs.97792; ESTs; none,7tm_2,GPS; 4.569230769

12424 11 H I v i i I ) I tl J f 115 

139975 \ 8081; H 1 7 nosine t ip - phal ; 'rid do triph; HamlpJ HHtj i 1.56056338 
416178; AI808527; Hs.192822; serologically defined breast cancer anti; none;TM=M;SS=N; 4.558
408051; AI623351; Hs.172148; ESTs; PH,RhoGAP,none; 4.552307692
421846; M017707; Hs.1432; protein kinase C substrate 80K-H; efhand,ldl_recept_a;; 4.547761194

fl H Mel I gG protein; 7tm_1,LRR;TM-Y;SS-N; 4.54 1 

426201; AW182614; Hs.128499; ESTs; SH3,none; 4.541666667 

424905 NW 002497 1 153704 »Mn erinmito gei a)-rcl £ ,1 r i >'-M EE _ M 4 a36 
445229; BE276013; Hs.343828; Homo sapiens mRNA for FU00086 protein, ; αTM=M;SS=N; 4.530588235 
413109; AW389845; Hs.110855; ESTs; PH04,none; 4.529761905 

426125; X87241; Hs.166994; FAT tumor suppressor (Drosophila) homolo; EGF,cadherin,laminin_G;TM=Y;SS=M; 4.529710145
402330; ; ; Target Exon; pkinase,none; 4.528070175

439238; N47305; Hs.302161; EDG-8 (endothelial differentiation, sph; 7tm_1;TM=Y;SS=M; 4.524 
433437; U20536; Hs.3280; caspase 6, apoptosis-related cysteine pr; ICE_p10,ICE_p20;; 4.523715415
413781; J05272; Hs.850; IMP (inosine monophosphate) dehydrogenas; CBS,IMPDH_C,IMPDH_N,NPD;TM=M;SS=N; 4.522900763
431429; AF072813; Hs.252831 ; reticulo,. 3; Ret cu on,Fz,ig,kringlt ,p'<in , 3 TM=Y;SS=N; 4.51 2 
, 424078; AB00662 H 13 1 p i r / fi ' zf C.H2 Kh/B n n 1512 
42C602; AF000877; Hs.99230; regulator of G-protein signalling 20; RGS;TM=M;SS=N; 4.51 
449101; AA205847; Hs.23015; G protein-coupled receptor; 7tm_1;TM=Y;SS=M; 4.506
408157; AA047685; Hs.62946; ESTs; none,pkinase; 4.504 

44C291; BE397753; i is.14523; interferon, gamma-induclhle protein 30; GILT;TM=M;SS=Y; 4.502152O8 
431326; AW970580; Hs.198689; KIAA0728 protein; none,none; 4.501

428072; BE258602; Hs.182366; heat shock protein 75; HATPase_c,HSP90;TM=M;SS=N; 4.48828125
415149; X12451; Hs.78056; cathepsin L; Peptidase_C1;; 4.484375 

421959; AW751497; Hs.98370; cytochrome P450, subfamily IIS, polypept; p450;TM=Y;SS=M; 4.48

445143; U29171; Hs.378918; casein kinase 1, delta; zf-C3HC4,filamin,zf-B_box,NHL,pkinase,zf-MIZ;TM=M;SS=N; 4.478092784

421071; AI311238; Hs.104476; ESTs, Weakly similar to CGHU1E collagen ; none;TM=Y;SS=M; 4.477337111

410590, BE615216, Hs 64746, cliui hi » ula annel3;nor I f 1-M SS=N. 4.476 

438774; AA431620; Hs.379034; hypothetical protein MGC2745; none,none; 4.474874372

410726.AI623859 Hs. 1 5936; ESI i nascpro isomerase.none; 4.47 

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, regulatory su; CDK5_activator,none; 4.468

426485; NM_006207; Hs.170040; platelet-derived growth factor receptor-; ig;; 4.464944649

433646; AA603319; Hs.155195; ESTs; pou,homeobox,lig_chan,ANF_receptor; 4.458

410293; AK000047; Hs.61960; hypothetical protein; K_tetra;TM=M;SS=N; 4.453020134

453464;AI884911;Hs.32989;receptor(caIcitonin) mtivity modify ig m ne;TM=Y;SS=N; 4.448198198 

410583; AW770280; Hs.36258; ESTs, Moderately similar to JC5236 gala:; SH3,PDZ,Guanylate_kin,none; 4.446927374

441455; AJ271671; Hs.7854; zinc/iron regulated transporter-like; Zip;TM=Y;SS=M; 4.445010183

453064; R40334; Hs.89463; potassium large conductance calcium-acti; none,none; 4.436480187 

143303; U 319;H 6;casf 7, apo late-dc) ine pr; pk se,ICE_p10,ICE_p20;TM=M H 'MM 

411825; AK000334; Hs.352415; solute carrier family 39 (zinc transport; SNF,Zip;TM=Y;SS=N; 4.432765152

4-> 1 pyrophosphatase) [ phetase;TM=M;SS=N; 4.42! 
427138; N77624; Hs.173717; phosphatidic acid phosphatase type 2B; PAP2,none; 4.4234375

414496; W73353; Hs.355424; ESTs; pkinase,F5_F8_type_C,adh_short,none; 4.42114094

429432; AI678059; Hs.202676; synaptonemal complex protein 2; none;TM=M;SS=N; 4.42 

429922; Z97630; Hs.226117; H1 histone family, member 0; linker_histone;TM=M;SS=N; 4.419207317

446700; AW206257; Hs.156326; Human DNA sequence from clone RP11-145L2; none;TM=M;SS=N; 4.418181818

435411; AW444619; Hs.138211; ESTs; none,pkinase; 4.414

414531 AA256213;H 72010; ESTs; none.Cam Ksy'transf,Choline_kinase,SCO Z 3,Glyco transf_3,GlycosJrans_3N; 4.41 

418558; AW082266; Hs.86131; Fas (TNFRSF6)-associated via death domai; death,DED;; 4.408523909

442259; AI690269; Hs.201345; ESTs; Acetyltransf,RhoGAP,FCH,SH3,Kelch,fn3; 4.406 

415860; D56051; Hs.78888; diazepam binding inhibitor (GABA recepto; ACBP;TM=M;SS=N; 4.404678363 

434419; AL040606; Hs.296938; dual specificity phosphatase 7; DSPc;TM=M;SS=N; 4.404 

404440; ; ; NM_021048:Homo sapiens melanoma antigen,; MAGE;TM=M;SS=N; 4.4

435542 AA687370 h )5122G E I3ig pknase,PH,spectrin,RhoGEF,none; 4.394 

413367; NM_006517; Hs.75317; solute carrier family 16 (monocarboxylic; sugar_tr;TM=Y;SS=N; 4.39028777

435732; AF229178; Hs.123136; leucine rich repeat and death domain con; none,none; 4.38490566

427359; AW020782; Hs.79881; Homo sapiens cDNA: FLJ23006 fis, clone L; 7tm_1,none; 4.382129278

425749; AW328587; Hs.159448; surfeit 2; none;; 4.382

417874; BE616160; Hs.82829; protein tyrosine phosphatase, non-recept; Y_phosphatase;TM=Y;SS=N; 4.381422925 

414806; D14694 H 77 ' pi tidy crn ynthase1;F IV , ,-M 1380681818 

431837; T79326; Hs.331967; olfactory receptor, family 2, subfamily ; none,7tm_3,sushi,ANF_receptor; 4.376

417115; AW952792; Hs.334612; small nuclear ribonucleoprotein polypept; Sm,pkinase;; 4.370247934

434876; AF160477; Hs.61460; Ig superfamily receptor LNIR; ig,Rhabd_glycop;TM=Y;SS=M; 4.37

430379; AF134149; Hs.240395; potassium channel, subfan ily K, r lernber - ion Ir ins;TM=Y;SS=M; 4.367777778 

403912; ; ; C5000394-;g;|12737280|reflXPJ06632.2i k; none;TM=M;SS=N; 4.337684478 

426268; AF083420; Hs.168913; serine/threonine kinase 24 (Ste20, yeast; pkinase;; 4.366348449 

434263, N34895, H- 1£ ESTs; ig.no. 4 358527132 

404760; ; ; Target Exon; cadherin;TM=M;SS=M; 4.356 

413076; U10664; Hs.75188; wee1 (S. pombe) homolog; pkinase;TM=M;SS=N; 4.35472973

420757; X78592; Hs.99915; androgen receptor (dihydrotestosterone r; hormone_rec,zf-C4,Androgen_recep;TM=M;SS=N; 4.354

426812; AF105365; Hs.172613; solute carrier family 12 (potassium/chlo; none;TM=Y;SS=N; 4.353244838

431674; AA098901; Hs.301642; G-protein coupled receptor; none,GCV_H; 4.35

431886; L77964; Hs.271980; mitogen-activated protein kinase 6; pkinase;TM=M;SS=N; 4.347893916 

447719; BE387402 H 19 /potlietlcal protein FLJ10349; adenyl f 1 M M C S N 4346007605 

424837; BE276113; Hs.333034; N-acetyltransferase, homolog of S. cerev; Acetyltransf;TM=M;SS=N; 4.344

449437, AI702038,Hs 100057 Hon. r. :DNA U22902 ti ~'~-ie K none.none; 4.334722222 

411768; NM_013371; Hs.71979; interleukin 19; IL10;; 4.322

445350; AF052112; Hs.12540; lysophospholipase I; abhydrolase_2;TM=M;SS=N; 4.320359281

425964; AW889928; Hs.9071; progesterone membrane binding protein; homeobox,none; 4.318867925

450998; BE387614; Hs.25797; splicing factor 3b, subunit 4, 49kD; rrm;TM=M;SS=N; 4.316573557

408908; BE296227; Hs.250822; serine/threonine kinase 15; pkinase;; 4.310

400290; H18836; Hs.31608; hypothetical protein FLJ20041; none,Cys_knot; 4.314728682

438899; AF085833; Hs.1 35624; ESTs; none,PI3_PI4J "I Ka,PI3t C 2 FI3F rod PI3K_p85B; 4,314084507 

4188E E )3 Is. 1211; s ! r i tart Metallophos;TM=M ft 12121212 

11960 R h I 7 3 Homo sapiens clone 23783 mRNA sequence; IMP4TM ' )044 11 

421532; AW138207; Hs.140170; hypothetical protein FLJ22969; Armadillo_seg,HEW;TM=M;SS=N; 4.304

430017; AA263172; Hs.35; protein tyrosine phosphatase, non-recept; Y_phosphatase;TM=M;SS=M; 4.302

447224 BE61 71 25 1207 gb;601441664f Ih f o sapiens c; n e NA NA 4 302 

1; NM 004954; I ls.1571S9; ELKL r pkinase I V lM M N; 4.301639 1 1 
454042; H22570; Hs.47860; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 4.30141844
446143; BE245342; Hs,306079; sec61 homolog; NUDIX,secY,E1 _dehydrog,transkPt_Fyr;TM=Y;f F=M; 4 30037209/ 
i 1 [ I ( I ESTs, Weakl toALU2_HUM I r rrr 2 

■i vr'-;\MH n r.K-f.,...-r:: . j- ?c».i ■ ■ " i:>>;t:»4-> 
432945; AUJ43683; Hs.8173; hypothetical protein FLM0803; none;TM=M;SS=M; 4.288405797 

421921; H83363; Hs.355993; translocase of inner mitochondrial membr; zf-Tim10_DDP,efhand,CH,spectrin,serpin;TM=M;SS=N; 4.284
448564, AL044962 f 2145 I U5-tnsphosphate3-kinaseC;IPK;; 4.28057554 
453941; U39817; Hs.36820; Bloom syndrome; DEAD,helicase_C,HRDC;TM=M;SS=N; 4.28 

437712; X04588;Hs 85844, neuroirpl tyrosine kin c, receptor omy i p nas ,LRR,LRRCT,Hydantomase_B,Hydantoinase_A;TM=M;SS=N; 4.277477478 

417426; NM_002291; Hs.82124; laminin, beta 1; laminin_EGF,laminin_Nterm,integrin_B;; 4.276162791

450296; AL041949; Hs.24756; hepatocyte growth factor-regulated tyros; none,none; 4.273927393

438012; AA393264; Hs.43619; ESTs; Armadillo_seg,none; 4.273134328 

409619; AK001015; Hs.55220; BCL2-associated athanogene 2; BAG;TM=M;SS=N; 4.273109244 

41 8529; AW005695; Hs.250897; TRK-fused gene; Band_41 ,ERM l pki'nase,LRR l LRRCTA1AM,Nycleoplasmin,Tropomyosin,OPR,lilament,bZIP,&gamma,M,DIJF1 64;TM=M;SS=N; 
4.272123894 

415214; AI445236; Hs.125124; EphB2; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 4.268

438233; W52448; Hs.56 I47; ESTs; Neur.chai i_LBD,Neu _chan_memlj,MAGE; 4.20284585 

429019;AA443280,H- U I myosin IIIA; rrr in_ r ra r PI 10 1M M SS=N; 4.262 

424959 NM_0057 1,113.1539 activated p irdc t kinase I* ldh_C,SH3,pkinase,UBA;TM=M;SS-N; 4.258695652 

453655; AW960427; Hs.342874; transforming growth factor, beta recepto; zor.g_pellucida.none; 4.257208766 

417414; AA434589; Hs.367676; dUTP pyrophosphatase; dUTPase,KRAB;; 4.251785714

453905;NMJ02314;Hs.36566;UMdomainkinase1;pkinsseLlM^ f-P RF*TM M;SS=N 4249116608 

424232; AB01 5982 H 1 o i- ra. u; pkinase, DAG_PE-b!n FltlM (' N 4,247692308 

404883; ;; ENSP00000216009;Sodium-glucose cctrar > F;TM M 4242424242 

412507; L36645; Hs.73964; EphA4; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 4.239285714 

411089; AA456454; Hs.355702; cell division cycle 2-like 1 (PITSLRE pr; none,none; 4.237313433

436957; AA902488; Hs.122952; ESTs; none,DAGKc,DAGKa,RA,DAG_PE-bind; 4.236

452568; AA805634 Hs.3008/ ) Hi r, o s ipiens mRNA; cDNA DKFZp547M072 (tr PI3_PI4_k la V M SS=M; 4.23537415 

433535; AF111106. lis <'<t: prol n phosphatase 4, regula . subuni; I F Ai IM S! -N, 4.234793187 

432728 NM 006°7 C H 3 1 s II reg on expr ed gene KE4 Zip g I Mi r< 4.234545455 

416350; AF188625; Hs.189507; phospholipase A2, group IID; phoslip;TM=M;SS=Y; 4.234

409533; AW969543; Hs.144609; mitogen-activated protein kinase kinase ; Peptidase_C48,none; 4.230666667

427127; AW802282; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 4.228009259 

403362; ; ; NM_001615*:Homo sapiens actin, gamma 2, ; actin;; 4.22688478 

417866; AW067903; Hs.82772; collagen, type XI, alpha 1; Collagen,COLFI,TSPN,laminin_G,CorA;; 4.226388889


428897; AJ245719; Hs.194385; hypothetical protein FLJ20234; SH2;TM=M;SS=N; 4.224731183

425771; BE561776; Hs.159494; Bruton agammaglobulinemia tyrosine kinase; SH2,SH3,pkinase,PH,BTK;TM=M;SS=N; 4.223684211
413566; C21220; Hs.321717; hypothetical protein FLJ10875; zf-C2H2,BTB,K_tetra,7tm_1; 4.222807018
454098; W27953; Hs.217433; Plakophilin; none,none; 4.22

424381; AA28524M 141 f I , ki la hk; HEK2); | n p FHA,DnaJ;TM=M;SS=N; 4.21875 

419223; X60111; Hs.1244; CD9 antigen (p24); transmembrane4;TM=Y;SS=M; 4.217130215

436756; Z18364; Hs.198298; v-src avian sarcoma (Schmidt-Ruppin A-2); none,none; 4.216

450167; AA446404; Hs.24563; NTF2-related export protein 1; NTF2;TM=M;SS=N; 4.215163934

■11 ;_1 l!V ..' ( ; .>; h- Kl>:jc-xt:r. 1 iF-h:r.--:;i-:.u-:-, :■ nln. efhand;; 4212041885 

432539; AL138169; Hs.278378; karyopherin beta 2b, transportin; none,DS,UPF0139,Glyco_hydro_38; 4.207407407

416661; AA634543; Hs.79440; IGF-II mRNA-binding protein 3; KH-domain,rrm;TM=M;SS=N; 4.206

432284; AA532807; Hs.287740; ESTs; pkinase,none; 4.205454546

418758; AW959311; Hs.172012; hypothetical protein DKFZp434J037; pkinase,RIO1;TM=M;SS=N; 4.204142012

450056; BE047394; Hs.502; ESTs, Weakly similar to S71512 hypotheti; ABC_tran,ABC_membrane,ig,MHC_II_beta,SRP54,proteasome,ABC_membrane,ABC_tran; 4.202572347

412817; AL037159; Hs.74619; proteasome (prosome, macropain) 26S subu; PC_rep;TM=M;SS=N; 4.202061856

425394; AA356730; Hs.323949; kangai 1 (suppression of tumorigenicity ; transmembrane4,none; 4.195014663

449335; AW1 50717; Hs.345728; STAT nd 11 I 1 M S N 4 192248062 

415023; AA932146; Hs.355397; Homo sapiens clone TCCCIA00164 mRNA sequ; none;NA;NA; 4.192

443907; AU076484; Hs.9963; TYRO protein tyrosine kinase binding pro; none;TM=M;SS=Y; 4.191878981

445330; R52656; Hs.21691; ESTs; 7tm_1,none; 4.189922481 

430016; NM 004736; Hs.227656, xenotropic and polytropic retroviru i r > L 1 i i SS=N; 4.188333333 
434633; AI189587; Hs.120915; ESTs; SH3,PH,RhoGAP,none; 4.187106918 

452908; AB001451; Hs.30965; neuronal Shc adaptor homolog; SH2,PID,Zn_carbOpept;TM=M;SS=N; 4.186885246
439318; AW837046; Hs.6527; G protein-coupled receptor 56; 7tm_2,CytC_asm,GPS;TM=Y;SS=M; 3.930957684
432201; AI538613; Hs.298241; Transmembrane protease, serine 3; ldl_recept_a,trypsin;TM=Y;SS=M; 3.893103448
428969; AF120274; Hs.194689; artemin; TGF-beta;; 3.884030418
444633; AF111713; Hs.12284; junctional adhesion molecule 1; ig;TM=Y;SS=M; 3.831669044 

432305. M62402 H- 2 ^ . growth ort ling pro thyrog bulin.1,1 FBP.A2M 1 ' V SS N 3 742996346 

405547; ; ; NM_018833*:Homo sapiens transporter 2, A; ABC_tran,SRP54,ABC_membrane;TM=Y;SS=M; 3.676

407853; AA336797; Hs.40499; dickkopf (Xenopus laevis) homolog 1; none;TM=M;SS=Y; 3.634

426427; M86699; Hs.169840; TTK protein kinase; pkinase;; 3.562 

1 7585 D ii rr 1 1- 1 ""'729,coBagen, type X, alpha 1 (Sohmid metaph; C1q,Collagen;; 3.49 

405546 NM.01P "H ^ Iran porter 2 A AEC trar SRP c 4ABC_membrane;TM=Y;SS=M; 3.422661871 

439820; AL360204; Hs.283853; Homo sapiens mRNA full length insert cDN; none,none; 3.402

404210, , , NM_0359 f. Homo p jC lo d lymphoid , FHA,PDZ,RA,DIL;TM=M;SS=N; 3.368807339 

L134847; Hs.1 ro r ' rase 90kD, polyp; pkinase,pkinase_C;; 3.213402062 

418678; NM_001327; Hs.87225; cancer/testis antigen (NY-ESO-1); none;TM=M;SS=N; 3.084
451106; BE382701; Hs.25960; N-MYC oncogene; HLH,Myc_N_term;TM=M;SS=N; 1.55
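
The semicolon-delimited rows above can be read programmatically. The following is a minimal sketch only, assuming that each row carries, in order, the Pkey, the exemplar accession, the Unigene ID, the Unigene title, one or more comma-separated predicted domain lists, optional `TM=` and `SS=` flags, and finally the tumor-to-normal ratio. The class name `ProbesetRow` and the function `parse_row` are hypothetical and are not part of the original disclosure; titles that themselves contain a semicolon are not handled.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ProbesetRow:
    pkey: str
    accession: str
    unigene_id: str
    title: str
    domains: List[str]      # predicted domains, with "none" entries dropped
    tm: Optional[str]       # value of the TM= flag, if present
    ss: Optional[str]       # value of the SS= flag, if present
    ratio: float            # last field of the row


def parse_row(line: str) -> ProbesetRow:
    """Parse one 'Pkey; ExAccn; UnigeneID; Title; domains;TM=..;SS=..; ratio' row."""
    fields = [part.strip() for part in line.split(";")]
    if len(fields) < 6:
        raise ValueError(f"expected at least 6 fields, got {len(fields)}: {line!r}")
    pkey, accession, unigene_id, title = fields[:4]
    domains: List[str] = []
    tm = None
    ss = None
    # Everything between the title and the final ratio is domain or flag data.
    for part in fields[4:-1]:
        if part.startswith("TM="):
            tm = part[3:]
        elif part.startswith("SS="):
            ss = part[3:]
        else:
            for name in part.split(","):
                name = name.strip()
                if name and name != "none":
                    domains.append(name)
    return ProbesetRow(pkey, accession, unigene_id, title,
                       domains, tm, ss, float(fields[-1]))


if __name__ == "__main__":
    row = parse_row("444633; AF111713; Hs.12284; junctional adhesion molecule 1; "
                    "ig;TM=Y;SS=M; 3.831669044")
    print(row.pkey, row.domains, row.tm, row.ss, row.ratio)
```

Rows whose fields were damaged in reproduction will raise an error or yield an incomplete record, which may help in locating them.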



Pkey CAT Number Accession 

418869 12789.14 AA229762 AA230035 



■5. "Dunham I. et al." refers to the pul 

Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402075 8117407 Plus 121907-122035,122804-122921,124019-12416 

401781 7249190 Minus 83215-83435,83531-83656,83740-83901,8423

405484 5922025 Plus 199214-199579,199672-199920,200262-20049 

405932 7767812 Minus 123525-123713 

400517 9796686 Minus 49996-50346

402328 4464283 Minus 13758-13922,14558-14752 

405545 1054740 Plus 118677-118807,119091-119296,121626-12182 
402330 4464283 Minus 15325-15380,15484-15588,15642-15915 
404440 7528051 Plus 80430-81581 

403912 7710730 Minus 72000-72290,72431-72700,72929-73199 

404760 7767724 Plus 223266-223352,224472-224585 

404883 5101762 Minus 94626-94730,96998-97069 

403362 8571772 Plus 64099-64260 

405547 1054740 Plus 124361-124520,124914-125050 

405546 1054740 Plus 124010-124183 
404210 5006246 Plus 169926-170121 
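
The exon coordinates listed above are comma-separated `start-end` pairs. As an illustrative sketch only, and assuming that both coordinates are inclusive nucleotide positions on the referenced sequence (an assumption not stated in this table), the strings can be converted to integer intervals as follows. The function names are hypothetical.

```python
from typing import List, Tuple


def parse_nt_positions(text: str) -> List[Tuple[int, int]]:
    """Convert '13758-13922,14558-14752' into [(13758, 13922), (14558, 14752)]."""
    exons: List[Tuple[int, int]] = []
    for chunk in text.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue
        start, sep, end = chunk.partition("-")
        if not sep or not start.isdigit() or not end.isdigit():
            raise ValueError(f"malformed exon range: {chunk!r}")
        lo, hi = int(start), int(end)
        if hi < lo:
            # Typically a range truncated or damaged in reproduction.
            raise ValueError(f"end precedes start: {chunk!r}")
        exons.append((lo, hi))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum of exon lengths, treating both coordinates as inclusive."""
    return sum(hi - lo + 1 for lo, hi in exons)


if __name__ == "__main__":
    exons = parse_nt_positions("13758-13922,14558-14752")
    print(exons, total_exon_length(exons))
```

Entries that were truncated at the column edge, such as a final range whose end coordinate is cut short, are rejected rather than silently accepted.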
levels of non-specific hybridization, the 15th percentile value an



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor vs. normal tissue



Pkey ExAccn 

449034 AI624049 

435094 AI560129 

438817 , 

421478 , 

452838 ' 
450451 



UnigenelD Unigene Title 



gb:ts41a09.x1 NCI_CGAP_Ut1 Homo sapiens cDNA 



AW513143 Hs.98367 hypothetical protein FLJ22252
AI687303 Hs.285529 G protein-coupled receptor 49 (GPR49) (HG38)
AA828995 Hs.52620 integrin; beta 8
AA731111 Hs.291891 ESTs

AL109791 Hs.241559 Homo sapiens mRNA full length insert cDNA clo
BE218239 Hs.202656 ESTs

Matrix Metalloproteinase 10 (Stromelysin 2)
BMPR-Ib; bone morphogenetic protein receptor Ib
predicted exon

gb:os26b03.s1 NCI_CGAP_Kid5 Homo sapiens cDNA
HER4 (c-erb-B4) 
ESTs 



447350 AI375572 

453964 AI961486 

443830 AM42095 

459325 AW088369 

415245 NS96S0 

446608 N75217 

426635 BE395109 

433426 H69125 



413335 AI613318 

449611 AI970394 

449260 AA741180 

412140 AA219691 

443715 AI583187 

432113 AA935065

424834 AK001432 

41C658 AW1 05231 

426465 AI758948 

446704 AI337228 

419503 AA243642 

444342 NM_014398

436076 AI193277 

406687 M31126 

445258 AI635931 

440901 AA909358 

434636 AA083764 

429334 D63078 

418852 BE537037 



Hs.12744 
Hs.143273
Hs.282184
Hs.27252
Hs.257846
Hs.129327
Hs.133525
Hs.222194
Hs.169006
Hs.158469 



Hs.31141 
Hs.110667
Hs.109057 

Hs.99348 

Hs.225106 

Hs.108106

Hs.48442 

Hs.197075 

Hs.29879 

Hs.73625 

Hs.9700 

Hs.152385

Hs.153408



ESTs 



Homo sapiens cDNA FLJ13793 fis, clone THYRO10
ESTs

Roundabout homolog 2 transmembrane receptor (robo2)
ESTs

ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAM
ESTs; Weakly similar to Gag-Pol polyprotein
distal-less homeo box 5 



Hs.197083 ESTs 
Hs.137422
Hs.10887
Hs.120954
Hs.272620
Hs.147613
Hs.128612
Hs.241334
Hs.186180
Hs.273294 



Homo sapiens cDNA FLJ10570 fis, clone NT2RP20
ESTs

gb:ty16f07.x1 NCI_CGAP_Ut3 Homo sapiens cDNA



pregnancy specific beta-1-glycoprotein 9



436787 AA908554 Hs.192756

400301 X03635 Hs.1657 

428771 AB028992 Hs.193143 

444929 AI685841 Hs.161354 

453922 AF053306 Hs.36708 



Homo sapiens cDNA: FLJ23038 fis, clone LNG020
hypothetical protein FLJ20069
gb:IL-BT152-060393-004 BT152 Homo sapiens cDN



410102 AW248508 

433283 BE041135 

443270 NM_004272

410247 AF181721 

422589 AA312735 

452771 T05477 

407275 AI364186 

420440 NM_002407

451105 AI761324 



Hs.279727 
Hs.175622 
Hs.9192 
Hs.61345 
Hs.179725



Homer, neuronal immediate early gene, 1B 



gb:wi60b11.x1 NCI_CGAP_Co3 Homo sapiens cDNA
453616 NM_003462

424115 AA335497 

414245 BE148072 

423244 AL039379 

441031 AM 10684 

447048 AW393080 

458861 AI630223 

428758 AA433988

420149 AA255920

433479 AW511459 

449416 AI651016 

457551 AW821319 

450109 AI539295 

436954 AA740151 

415511 AI732617 

410153 BE311926 
406411 

416209 AA236776 

416456 H57052 

454692 AW813350 

452249 BE394412 

436211 AK001581 

449765 N92293 

434988 AI418055

423515 AA327017 

435407 AM 49774 

440886 AW511032 

444783 AK001468 

452039 AI922988 

407300 M102616 

425176 AW015644 

449433 AI672096 

419335 AW960146 

422711 D60641 

453096 AW294631 

441962 AW972542 

445034 AW293376 

418677 S83308 



Hs.33846 
Hs.293965 
Hs.75850 
Hs.209602 



dynein, axonemal, light intermediate polypept 
ESTs 

WAS protein family, member 1 

ESTs, Weakly similar to ubiquitous TFR motif, 

fibrinogen, B beta polypeptide 

Homo sapiens cDNA: FLJ23537 fis, clone LNG076



Homo sapiens cDNA FLJ14303 fis, clone PLACE20



Hs.288928 Homo sapiens cDNA: FLJ23296 fis, clone HEP106



Homo sapiens cDNA FLJ12691 fis, clone NT2RM40
predicted exon 

MAD2 (mitotic arrest deficient, yeast, homolo 
hypothetical protein EDAG-1 
gb:MR3-ST0192-100100-024-g07 



Hs.61252 
Hs.80961 
Hs.206832 
Hs.161160 
Hs.162204

Hs.190516
Hs.62180 
Hs.172510 



Hs.289008 
Hs.160323 
Hs.87224 



polymerase (DNA directed), 



ALU8_HUMAN ALU SUB 



Homo sapiens cDNA: FLJ21814 fis, clone HEP010



428227 AA321649 

420092 AA814043 

415138 C18356 

437212 AI765021 

409867 AW502161 

421477 AI904743 

427119 AW880562 

458154 AW816379 



447342 AI199268 

413472 BE242870 

446619 AU076643 

443613 AI079356 

441285 NM_002374 

409731 AA125985 

417847 AI521558 

441484 AA935481 

415802 AA169515 

448112 AW245919 



424917 AI636208 



Hs.214410
Hs.152213 
Hs.73149 
Hs.19322 
Hs.75379 

Hs.36353 

Hs.167 

Hs.56145

Hs.288312 

Hs.58972 

Hs.6006 

Hs.301018 

Hs.2256 



INTERFERON-GAMMA INDUCED PROTEIN 
ESTs 

tissue factor pathway inhibitor 2 TFPI2 
ESTs 

gb:UI-HF-BR0p-ajr-g-12-0-UI.r1 NIH_MGC_52

hypothetical protein FLJ10292

ESTs

gb:QV4-ST0234-181199-035-g01 ST0234
ESTs

wingless-type MMTV integration site family
paired box gene 8 (PAX-8)
ESTs; Weakly similar to !!!! ALU SUBFAMILY J
it rfamil glial h affinity 



Hs.96901 
Hs.5378 
Hs.267087 



Homo sapiens mRNA full length insert cDNA do 
gb:oz39b09.s1 Soares_NhHMPu_S1 Homo sapiens c 
microtubule-associated protein 2 
thymosin, beta, identified in neuroblastoma c 
Homo sapiens cDNA: FLJ22316 fis, clone HRC052
ESTs 



451842 AI820539 

455666 BE065813 

431731 BE266322 Hs.211374

443695 AW204099 Hs.112759 

410358 AW975168 Hs.13337
406030 

409745 AA077391 

430481 AA479678 

437641 AA811452 Hs.291911 

415211 R64730.comp Hs.155986

443450 N66045 Hs.133529

457438 NM_014053 Hs.270594

451254 AI571016 Hs.172967 



ESTs 

Homo sapiens cDNA: FLJ23049 fis, clone LNG025
spondin 1, (f-spondin) extracellular matrix p
ESTs, Moderately similar to ALU4_HUMAN ALU SU
gb:RC2-BT0318-110100-012-a08 BT0318 Homo sapi
ESTs, Weakly similar to SP49_HUMAN SPLICEOSOM
ESTs, Weakly similar to AF125780 1 retinal sh
ESTs, Weakly similar to unnamed protein produ



ESTs; Highly similar to SPERM SURFACE PROTEIN 
419563 AA526235 Hs.193162
427778 AA412323 Hs.105323 ESTs 
435031 AI632091 Hs.116877



Homo sapiens cDNA FLJ11983 fis, clone HEMBB10



431548 AI834273 

423126 AA322245 

433420 AI674093 

435174 AA687378 

444743 AA045648 

452588 AA889120 

427304 AA761526 

419917 AA320068

417728 AW138437 

419356 AI656166 



416623 N74925 

405174 

403776 

426274 D38122 
431255 AA497043 



406400 

439949 AW979197 

430704 AW813091 
401517 

417830 AW504786 

435267 N23797 

426384 AI472078 

422797 AB033064 

428832 AA578229

449722 BE280074 

418478 U38945 



Hs.289014 

Hs.290165 ESTs 

Hs.293961 ESTs 

Hs.194624 ESTs 



gb:Homo sapiens cig33 mRNA, partial sequence.
Human DNA sequence from clone RP1-187J11 on c
Homo sapiens cDNA FLJ13018 fis, clone NT2RP30



Hs.93701 

Hs.24790 

Hs.7331 

Hs.97984 

Hs.289109 

Hs.38761 



Hs.115685 
Hs.49136 
Hs.1494 



E We y simil ] WASP-fai protein [ 
Homo sapiens cDNA: FLJ21564 fis, clone COL064



tumor necrosis factor (ligand) superfamily, m



msh (Drosophila) homeo box homolog 1 (formerl 
kallikrein 6 (neurosin, zyme) 
gb:601150677F1 NIH_MGC_19 Homo sapiens cDNA c
kallikrein 8 (neuropsin/ovasin)
Hs.292073 ESTs

gb:RC3-ST0186-240400-111-d07 ST0186 Homo sapi
predicted exon

Hs.132808 epithelial cell transforming sequence 2 oncog
Hs.110114 ESTs

ESTs

Hs.120908 KIAA1238 protein

gb:nl22b12.s1 NCI_CGAP_HSC1 Homo sapiens cDNA
Hs.23960 cyclin B1

Hs.1174 cyclin-dependent kinase inhibitor 2A

gb:RC3-CT0297-290100-013-d03 CT0297 Homo sapi



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number
Accession: Genbank accession numbers



451105 
452771 
454392 
454692 



115237.1 AA077391 AI347618 AI361453 AI088754 AW207491 AW960912 AA921874 AA286833 AA150722 BE152353 AW188822 BE152450

1156530.1 AW502151 AW502587 AW502345

213547.1 AW978073 AW978072 AA807550 AA306567

219896_1 AW856665 AA315006 AW954733

26621 fl AI472078 AA377209 AA865807

267664_1 AI758948 AA379527 AA379948 AA379262 AW963933

296144_1 AA578229 AA436432 AA481375 AA481363

322217_1 AW813091 AW206655 AA484440

331543_1 AW970622 AA503009 AA502998 AA502989 AA502805 T92188

44573.2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598

AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 AI219788 AA884444 N92578 F13493
AA927794 AI560251 AW874068 AL134043 AW235363 AA653345 AW008282 AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915
AW102898 AI872193 AI763273 AW173588 AW150329 AI653832 AI762688 AA988777 AM88892 AI356394 AW103813 AI639642 AA642789 AA856975
AW505512 AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669
AA251874 AI819225 AW205862 AI663338 AI858509 AW275905 AI533006 AA972584 AA908741 AW072629 AW513996 AA293273 AA969759 N75628 
N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 
*67840 AA300207 AW959581 T63226 F04005 



438993 467651.1 



794817.1 
859083.1 
930983.1 
115882.1 
1229118.1 
1349545 1 
491768.1 
798085.1 



I AI916584 R61781 T77332 F07756 F08149 F07647 
AI079356 W23287 
AI624049 AW117770 AI858360
AI761324 AW880941 AW880937
T05477 T07855 AI917711



AW813350 AW816082 AW813476 AW813383 



i" refers to the publication entitled The DNA sequence of 
7381715
7108030 



29278-29770 

81747-82094 

1414-1513,1624-1756 

9144-9350 

102814-103063 

42814-43010,43583-43783,44863-45033,46429-46554,47815-48018,49961-50153,51624-51727,51823-51959,52702-52918,55469-55601,57111-57307,58169-58296,60215-60332,61482-61727



1553-1712,1878-2140,4252-4385,5922-6077 



Table 19A: 225 Up-Regulated Genes Encoding Extracellular/Cell Surface Proteins, Uterine Cancer Versus Normal Adult Tissues

Table 19A lists about 225 genes up-regulated in uterine cancer compared to normal adult tissues that are likely to encode extracellular or cell-surface proteins. These were selected 
as for Table 18A, except that the ratio was greater than or equal to 2.0, and the predicted protein contained a structural domain that is indicative of extracellular localization (e.g. ig, 
fn3, egf, 7tm domains, signal sequences, transmembrane domains). The predicted protein domains are noted.
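
The selection rule described above (a tumor-to-normal ratio of at least 2.0 together with a predicted domain indicative of extracellular localization) can be sketched in code. This is an illustration under stated assumptions, not the procedure actually used to compile the table: the set `EXTRACELLULAR_DOMAINS` below is a hypothetical example list built from domain names that appear in these tables, and the function name `is_candidate` is likewise hypothetical.

```python
from typing import Iterable, Set

# Assumed, illustrative marker set; the actual list used for the table is not given here.
EXTRACELLULAR_DOMAINS: Set[str] = {"ig", "fn3", "7tm_1", "7tm_2", "7tm_3"}


def is_candidate(ratio: float,
                 domains: Iterable[str],
                 min_ratio: float = 2.0,
                 marker_domains: Set[str] = EXTRACELLULAR_DOMAINS) -> bool:
    """True when the ratio meets the cutoff and at least one marker domain is predicted."""
    return ratio >= min_ratio and any(d in marker_domains for d in domains)


if __name__ == "__main__":
    rows = [
        ("444633", 3.831669044, ["ig"]),
        ("426427", 3.562, ["pkinase"]),
        ("451106", 1.55, ["HLH", "Myc_N_term"]),
    ]
    selected = [pkey for pkey, ratio, doms in rows if is_candidate(ratio, doms)]
    print(selected)
```

Passing a different `marker_domains` set, for example names associated with enzymatic function, would adapt the same test to other selection criteria.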



Pkey 




















Hs.52620 


integrin; beta 8 




























AI687284 






408562; AI435323; Hs.31141; Roundabout homolog 2 transmembrane
420610; AI683183; Hs.99348; distal-less homeo box 5
412140; AA219691; Hs.73525; RAB6 interacting, kinesin-like (rab
443715; AI583187; Hs.9700; cyclin E1
432113; AA935065; Hs.152385; ESTs
419503; AA243642; Hs.137422; ESTs
444342; NM_014398; Hs.10887; similar to lysosome-associated memb
436076; AI193277; Hs.120954; ESTs
406687; M31125; Hs.272620; pregnancy specific beta-1-glycoprot
418852; BE537037; Hs.273294; hypothetical protein FLJ20069
459583; AI907673; ; gb:IL-BT152-080399-004 BT152 Homo s
400301; X03635; Hs.1657; Estrogen receptor 1
; ; ; predicted exon
453922; AF053306; Hs.36708; budding uninhibited by benzimidazol
410102; AW248508; Hs.279727; ESTs;
433283; BE041135; Hs.175622; ESTs
443270; NM_004272; Hs.9192; Homer, neuronal immediate early gen
410247; AF181721; Hs.61345; RU2S
422589; ; Hs.179725; ESTs
407275; AI364186; ; gb:qw34h07.x1 NCI_CGAP_Ut4 Homo sap
420440; NM_002407; Hs.97644; mammaglobin 2
453616; NM_003462; Hs.33846; dynein, axonemal, light intermediat
424115; AA335497; Hs.293965; ESTs
414245; BE148072; Hs.75850; WAS protein family, member 1
458861; AI630223; ; PHD finger DNA binding protein isof
449416; AI551016; Hs.246311; ESTs
420149; AA255920; Hs.88095;
433479; AW511459; Hs.249972; ESTs
457551; ; Hs.288928; Homo sapiens cDNA: FLJ23296 fis, cl
406411; ; ; predicted exon
416456; H57052; Hs.176626; hypothetical protein EDAG-1
454692; AW813350; ; gb:MR3-ST0192-100100-024-g07 ST0192
436211; AK001581; Hs.80961; polymerase (DNA directed), gamma
434988; AI418055; Hs.161160; ESTs
444783; AK001468; Hs.62180; ESTs
440886; AW511032; Hs.190516; ESTs
425176; AW015644; Hs.301430; ESTs, Moderately similar to TEF1_HU
445034; AW293376; Hs.160323; ESTs
418677; S83308; Hs.87224; SRY (sex determining region Y)-box
400250; ; ; predicted exon
428227; AA321649; Hs.2248; interferon-gamma induced protein
415138; C18355; Hs.78045; tissue factor pathway inhibitor 2 T
458154; AW816379; ; gb:QV4-ST0234-181199-035-g01 ST0234
421477; AI904743; Hs.104650; hypothetical protein FLJ10292



TM,PAX

SS,TM,ig,fn3

TM,homeobox



TM,WH2
TM,PHD
SS,TM,



TM,PH
TM,FG-GAP

TM,Glyco_transf_29,TEA



TM,Kunitz_BPTI,G-gamma
447342 AI199268

446619 AU075643 

453891 AB037751 

441285 NM_002374 

409731 AA125985 



437641 AA811452 

415211 R64730.comp 

443450 H66045 

457438 NM_014053 

435031 AI632091 

417411 AW500008 

435174 AA687378 

444743 AA045648 

433420 AI674093

419917 AA320068 

417728 AW138437 
403776 

426274 D38122 

416623 N74925 
405174 

431255 AA497043 

456662 NM_002448 

416530 U62801 

454392 BE260893 
406400 
401517 

417830 AW504786 



441794 AW197794 



Hs.75379 
Hs.19322 
Hs.313 



solute carrier family 1 (glial high 
ESTs; Weakly similar to !!!! ALU SU
secreted phosphoprotein 1 (osteopon
Homo sapiens mRNA full length inser
microtubule-associated protein 2




Hs.115685 ESTs 



msh (Drosophila) homeo box homolog 
kallikrein 6 (neurosin, zyme) 
gb:601150677F1 NIH_MGC_19 Homo sapi
kallikrein 8 (neuropsin/ovasin)
predicted exon 

epithelial cell transforming sequen 
ESTs 
cyclinB 



438578 AA811244 

423513 AF035960 

448966 AW372914 

431870 AW449902 

409457 AW818081 

438777 AA825487 

451807 W52854 

433326 AI379486 

448221 BE622615 

448141 AI471598 

456311 AA225632 
405454 

459287 AL079369 

438935 H40665 

421312 AA824627 

418882 NM_004996

424345 AK001380 

417956 AA210704 

445537 AJ245671 

448089 AI467945 

446643 AA194417 

456671 AB011142 

457256 AA459443 

438986 AF085888 

435313 AI769400 

417351 T90278 
412198 AA937111 
413278 BE563085 
421502 AF111856 
418092 R45154 
410008 AA079552 
420362 U79734 
431974 AW972689 
438209 AL120659 
447578 AA912347 
414812 X72755 
421515 Y11339 
416402 NM_000715



Hs.164168 
Hs.129719 
Hs.287462 
Hs.105500 

Hs.142179 
Hs.27099 
Hs.159430



Hs.31564 
Hs.291670 
Hs.89433 



EGF-like-domain; multiple 6 



Hs.282060 ESTs 
Hs.114293 KIAA0570 gene product



Hs.231816 ESTs 
Hs.269307 ESTs 



Hs.106604 ESTs 
gb:zm20

Hs.97206 
Hs.200934 
Hs.6111 
Hs.136585 
Hs.77367 
Hs.105352 
Hs.1012 



KIAA0307 gene product 
ESTs 

monokine induced by gamma interfero 
GalNAc alpha-2, 6-sialyltransferase
complement component 4-binding prot 



SS,TM,IL8 
TM,TNF
TM,Ets



TM,trypsin,pro_isomerase



TM,trypsin
TM,HMG14.1



TM,SNF2_N

TM,ank

TM,EGFJB

TM,HMG_box

TM,TEA

SS 



Homo sapiens cDNA FLJ11875 fis, clo

gb:CM4-ST0276-101299-059-b09 ST0276
ESTs, Weakly similar to ORF2 [Mmus
DKFZP564J0863 protein
ESTs

gb:601440775T1 NIH_MGC_72 Homo sapi



TM,Spin-Ssty 

TM,MBD

TM,CH



TM,FG-GAP
TM,EMTH,I_LWEQ
TM,bZIP
TM,HLH,PAS
Hs.187561 ESTs, Moderately similar to ALU1_HU
Hs.2250 leukemia inhibitory factor (choline 
Hs.40368 adaptor-related protein complex 1,

gb:zw80c03.s1 Soares_testis_NHT Hom
Hs.168732 ESTs 



Hs.9329 Homo sapiens mRNA for fls353, compl 
Hs.288036 tRNA isopentenylpyrophosphate trans 
Hs.256150 ESTs, Highly similar to NY-REN-41 a 



428679 AA431765 



426698 AA394104 

443426 AF098158 

406815 AA833930 

434808 AF155108

432441 AW292425 

435615 Y15065 
402298 

435542 AA687376 

442952 AI743261 

418203 X54942 

429228 AI553633 

418969 W33191 

447570 AI868315 



416566 NM_003914 Hs.79378 cyclin A1

420900 AL045633 Hs.44269 ESTs 

430563 AA481269 Hs.173381 ESTs 

417372 T99755 Hs.290814 ESTs 

449083 AI948808 Hs.191144 ESTs 

Hs.62661 guanylate binding protein 1, interf 

Hs.143659 ESTs 

Hs.271580 Uroplakin 1B 



Hs.269533 ESTs 

Hs.131860 ESTs 

Hs.83758 CDC28 protein kinase 2 

Hs.104985 r" 
Hs.28907 



hypothetical protein FLJ20258



410361 BE391804 

434131 AI858275 

431846 BE019924 

425638 NM_012337

440006 AK000517 

445870 AW410053 

430639 AW025427 

439018 AW300887 

422095 AI868872 

411558 AA102670 

408380 AF123050 
403721 

440711 AA904389 






Hs.6844 hypothetical protein FLJ20510

Hs.13406 syntaxin 18 

Hs.233552 ESTs 

Hs.26638 membrane-spanning 4-domains, subfam 



Hs.70725 "Human GABA-A rec 

Hs.44532 diubiquitin 

predicted exon 

Hs.143611 ESTs 

Hs.228780 ESTs, Highly similar to AF199597 1 

HS.122S79 ESTs 

Hs.152617 ESTs 

Hs.222695 Homo sapiens cDNA: FLJ20986 fis, cl

Hs.130865 ESTs 

Hs.226376 ESTs 
Hs.61152 
Hs.114309

Hs.14658 ESTs 

Hs.190089 ESTs 



436679 AI127483 

451061 AW291487 

410664 NM_006033

449378 AW664026 

433345 AI681545 

425851 NM_001490

431832 AW276866 

448275 BE514434 

423049 X59373 

427510 Z47542 

418076 R61388 

413670 AB000115 

429183 AB014604 

439031 AF075079 

451494 AI799444 

419978 NM_001454

404535 Z25884 

445181 AW338972 

452367 U71207 



Hs.120451 

Hs.213659 

Hs.65370 

Hs.59892 

Hs.152982 

Hs.159642

Hs.192715 

Hs.20830 

Hs.188023

Hs.179312 

Hs.6724 

Hs.75470 



411828 AW161449 
440052 AI633744 
410718 AI920783 



Hs.249171 
Hs.247095 
Hs.93974 
Hs.121483
Hs.147471
Hs.29279 
Hs.179240 
Hs.170675 
Hs.50334 
Hs.224623 
Hs.120925 
Hs.153746 
Hs.220697 
Hs.14041 
Hs, 72290 
Hs.195648 
Hs.191435 



synaptic Ras GTPase activating prot 
ESTs 

small nuclear RNA activating comple 
ESTs 

hypothetical protein, expressed in
gb:Homo sapiens full length insert 



eyes absent (Drosophila) homolog 2 
ESTs 

ESTs, Weakly similar to TMS2_HUMAN 



Homo sapiens cDNA: FLJ22490 fis, cl 
ESTs 

wingless-type MMTV integration site 



SS,LIF_OSM 

TM,Clat_adaptor_s 

TM,HECT



TM,CKS
TM

TM,SH3
TM,PHD
TM,FMO-like
TM,cyclin

TM,FAD_binding_5 
TM,ABC_nv ' 



TM 



SS,TM,Cu-oxidase 

TM,neur_chan 

TM,7tm_3,ANF_receptor 



SS,TM,Ribosomal_L22,lipase 



SS,TM,Branch
TM,Ets,SAM_PNT
TM,kinesin,abhydrolase_2



447462 AW337214 Hs.158973 ESTs 
442255 AI701857 Hs.202388 ESTs

410292 AA843087 Hs.124194 ESTs 

442748 AI015713 Hs.135787 ESTs 

458760 AI498631 Hs.111334 ferritin, light polypeptide

409799 D11928 Hs.78845 ' 
401324 

432140 AK000404 

447541 AK000288 

421379 Y15221 

TABLE 19B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

409457 1132521_1 AW818081 AW392887 AW514700 AW392881

410008 116812_1 AA079552 BE14252S BE142527

422689 219896_1 AW856665 AA315006 AW954733

428679 294049_1 AA431765 AA432015

438993 467651.1 AA828995 AA834879 AI926361

439031 46798.1 AF075079 H48601 H48795

448221 75534_-1 BE622615

454392 115882.1 BE260893 AA078319 R85057 AW803024 H85811

454692 1229118_1 AW813350 AW816082 AW813476 AW813383

458154 491768.1 AW816379 AA888282 AA879046 AA879195

458861 798085.1 AI630223 AI630470 

459287 977129.1 AL079369 D81804 

TABLE 19C 



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.



401324 9863791 

401517 7677912 

401933 3810668 

402298 6598824 

402606 9909429 

403721 7528046 

403776 7770611 

404767 7882827 

405032 7107731 

405174 7108030 

405454 7656675 

405609 5757553 



234057-234174 
29278-29770 
48725-49057,- 
36758-37953 



131945-132224 
102814-103063 
133807-134053 



51727,51823-51959,52702-52918,55469-55601,57111-57307,58159-58296,60215- 



1553-1712,1878-2140,4252-4385,5922-6077 



Table 20A: 56 Up-Regulated Genes Encoding Extracellular/Cell Surface Proteins, Uterine Cancer Versus Normal Adult Tissues

Table 20A lists about 56 genes up-regulated in uterine cancer compared to normal adult tissues that are likely to encode either enzymes or proteins amenable to modulation by small 
molecules. These were selected as for Table 18A, except that the ratio was greater than or equal to 2.0, and the predicted protein contained a structural domain that is indicative of 
enzymatic function or of being modulated by small molecules (e.g. pkinase, peptidase, isomerase, transporters). The predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
PSDomain: Protein Structural Domain
R1: Ratio of tumor vs. normal tissue

Pkey    ExAccn     UnigeneID  Unigene Title                          PSDomain                 R1

428187  AI687303   Hs.285529  G protein-coupled receptor 49 (GPR49)  7tm_1                    24.2
400289  X07820     Hs.2258    Matrix Metalloproteinase 10 (Stamoly   hemopexin                12.3
447350  AI375572   Hs.172634  HER4 (c-erb-B4)                        kinase                   9.8
420510  AI683183   Hs.99348   distal-less homeo box 5                                         6.2
405609                        predicted exon                         Myosin_tail,myosin_head  5.0
458861  NM_007358  Hs.31016   PHD finger DNA binding protein         PHD                      4.4
410153  BE311926   Hs.15830   Homo sapiens cDNA FLJ12691 fis, clone  NA                       4.3
436211 AK001581

444783 AK001468 

418677 S83308 

413472 BE242870 



435031 AI632091 

417411 AW500008 

435185 AA669490 

416530 U62801 

406400 AA343629 

418478 U38945 

441794 AW197794 

402373 AL135225 

423513 AF035960 

448141 AI471598 

415076 NM_000857 

418882 NM_004996



430372 AI205173 

431974 AW972589 

428046 AW812795 

421515 Y11339 
403095 

435615 Y15065 
402298 

418203 X54942 

447570 AI868315 

439018 AW300887 

415539 AI733881 



418506 AA084248 

410664 NM_006033

425851 NM_001490

448275 BE514434 

429782 NM_005764 

404535 Z25884 

448105 AW591433 



Hs.80961 
Hs.62180 
Hs.87224 
Hs.75379 



polymerase (DNA directed), gamma
ESTs 

SRY (sex determining region Y)-box 5 
solute carrier family 1 

gb:oz39b09.s1 Soares_NhHMPu_S1 Homos 
matrix metalloproteinase 7 (matrilysin)
ESTs, Weakly similar to AF126780 1 re 




Hs.159642 
Hs.20830 
Hs.220689 
Hs.121483 
Hs.170675 
Hs.14846 
Hs.111334 
Hs.76845



membrane-spanning 4-domains, subfamil
BMPR-lb; 

ceruloplasmin (ferroxidase) 

diubiquitin 

ESTs 

ESTs, Highly similar to AF199597 1 A- 
G protein-coupled receptor 39 
lipase, endothelial 

glucosaminyl (N-acetyl) transferase 1 
synaptic Ras GTPase activating protein 
Ras-GTPase-activating protein SH3-dom



trypsin,pro_isomerase



ABC_membrane,p450 



Ribosomal_L22,lipase,PLAT



HCO3_cotransp,zf-C3HC4



TABLE 20B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

458861 798085_1 AI630223 AI630470
TABLE 20C 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

401324 9863791 Plus 234057-234174 

402298 6598824 Plus 36758-37953 

403095 8954339 Plus 150025-150240,151564-151690 



51727,51823-51959,52702-52918,55469-55601,57111-57307,58159-58296,60215-60332,61482-61727

406400 9256298 Plus 1553-1712,1878-2140,4252-4385,5922-6077 
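
Each Nt_position entry in the rows above is a comma-separated list of start-end ranges. The following is a minimal sketch, not part of the original disclosure, of how such an entry might be parsed into exon intervals. The function names `parse_nt_position` and `total_exon_length` are hypothetical, and treating the coordinates as inclusive is an assumption.

```python
def parse_nt_position(field):
    """Parse an Nt_position string such as '1553-1712,1878-2140' into (start, end) tuples."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, sep, end_text = part.partition("-")
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            # Skip fragments that are not a complete start-end range
            # (for example, a range cut off at the edge of the table).
            continue
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons):
    """Sum the lengths of the intervals, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in exons)


# Nt_position value copied from the row for Pkey 406400.
exons = parse_nt_position("1553-1712,1878-2140,4252-4385,5922-6077")
print(exons)
print(total_exon_length(exons))
```

Incomplete fragments are skipped rather than guessed, so a truncated entry yields only the ranges that can be read with certainty.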
Table 21A: 270 Up-Regulated Genes, Uterine Cancer Versus Normal Uterus

Table 21A lists about 270 genes up-regulated in uterine cancer compared to normal uterus. These were selected as for Table 18A, except that the ratio was greater than or equal to 5.0, and the denominator was the median value for six non-malignant uterine specimens.
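
The ratio described above can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the procedure actually used to build the table: the numerator is taken to be a single tumor expression value per probeset (the Table 18A criteria are defined elsewhere), the probeset names and expression values are made up, and the function names are hypothetical.

```python
from statistics import median


def tumor_to_normal_ratio(tumor_value, normal_values):
    """Return tumor expression divided by the median of the normal specimens."""
    denominator = median(normal_values)
    if denominator <= 0:
        raise ValueError("median of normal specimens must be positive")
    return tumor_value / denominator


def select_up_regulated(tumor_by_probe, normals_by_probe, threshold=5.0):
    """Keep probesets whose tumor/normal ratio meets the threshold."""
    selected = {}
    for probe, tumor_value in tumor_by_probe.items():
        ratio = tumor_to_normal_ratio(tumor_value, normals_by_probe[probe])
        if ratio >= threshold:
            selected[probe] = ratio
    return selected


# Made-up expression values for two hypothetical probesets,
# each with six non-malignant specimens.
tumor = {"probe_a": 60.0, "probe_b": 12.0}
normals = {
    "probe_a": [8.0, 10.0, 12.0, 9.0, 11.0, 10.0],
    "probe_b": [5.0, 6.0, 4.0, 5.0, 6.0, 5.0],
}
selected = select_up_regulated(tumor, normals)
print(selected)
```

Using the median rather than the mean keeps a single outlying normal specimen from shifting the denominator.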

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor vs. normal tissue

Pkey ExAccn UnigeneID Unigene Title R1






434779 AF153815 

441633 AW958544 

429183 AB014604 

436775 AA731111 

441031 AI110584 

446921 AB012113 

413753 U17760 

421515 Y11339 

414646 AA353776 

453891 AB037751 

425196 AL037915 

444863 AW384082 

449785 AI225235 

446839 BE091926 

449801 AA477355 

411773 NM_006799

414812 X72766 

410361 BE391804 

423645 AI215632 

442438 AA995998 

416786 AW419196 

458017 AA813426 

435525 AI831297 

413335 AI613318 

420297 AI628272 

452799 AI948829 

434311 BE543469 

408243 Y00787 

430713 AA351647 

452092 BE245374 

444342 NM_014398

443830 AI142095 

442547 AA306997 

421633 AF121860 



424966 AU077312 

425495 AA358454 

428862 NM_000346

438986 AF085888 

422731 AL138411 

441081 AI584019 

415992 C05837 

431211 M86849 

409865 AW502208 

448158 AI627292 
401519 

441730 AI243276 

432441 AW292425 

448275 BE514434 

438424 AI912498 

447342 AI199268 

408369 R38438 

423081 AF262992 

414484 BE314385 

420931 AF044197 



436679 AI127483 

452607 AI160029 

443171 BE281128 

459081 W07808 

431195 AA503083 

444459 AI680624 



Hs.291891 ESTs 

Hs.7645 fibrinogen, B beta polypeptide 
Hs.16530 CC chemokine SCYA18 (MIP-4) (PARC)
Hs.301103 Laminin, beta 3 (nicein (125kD), kalinin
Hs.105352 GalNAc alpha-2, 6-sialyltransferase I, I
CD48 antigen (B-cell membrane protein)
Homo sapiens mRNA full length insert cDN
carbonic anhydrase II 



ESTs 

Homo sapiens cDNA: FLJ23231 fis, clone C
mitotic spindle coiled-coil related prot
Homo sapiens cDNA: FLJ23231 fis, clone C
protease, serine, 21 (testisin)
monokine induced by gamma interferon 
guanylate binding protein 1, interferon- 
ESTs 

gb:os26b03.s1 NCI_CGAP_Kid5 Homo sapiens
ESTs 

ESTs, Weakly similar to KIAA0705 protein 



Hs.901 
Hs.36353 
Hs.155097 
Hs.301323 
Hs.288300 
Hs.16244 
Hs.288300 
Hs.72026 
Hs.77367 
Hs.62661 
Hs.147487 

Hs.257924 
Hs.192034 
Hs.123310 
Hs.48442 



Hs.213786 ESTs 

Hs.266263 Homo sapiens cDNA FLJ14115 fis, clone MA

Hs.624 interleukin 8

Hs.2642 eukaryotic translation elongation factor

Hs.27842 hypothetical protein FLJ11210

Hs.10887 similar to lysosome-associated membrane



Hs.129327 ESTs 
Hs.150308 ESTs 

Hs.5101 ESTs; Highly similar to protein regulati 



Hs.153985 solute carrier family 7 (cationic amino
H 8 E ESI Wcakl nil - lilartoanky 
Hs.2316 SRY (sex-determining region Y)-box 9 
Hs.269307 ESTs 

gb:DKFZp434A1229_r1 434 (synonym: htes3)
Hs.169006 ESTs, Moderately similar to plakophilin
Hs.145807 Homo sapiens cDNA FLJ13593 fis, clone PL
Hs.5566 Homo sapiens connexin 26 (GJB2) mRNA, co

gb:UI-HF-BR0p-aju-e-09-0-UI.r1 NIH_MGC_5
Hs.190877 ESTs

Hs.149017 ESTs 



Hs.19322
Hs.182575
Hs.123159 



Hs.120451 
Hs.61438 
Hs.9030 



synaptic Ras GTPase activating protein 1 
ESTs, Weakly similar to PI-3 kinase [Hs 
ESTs; Weakly similar to !!!! ALU SUBFAMI
solute carrier family 15 (H+/peptide tra
sperm associated antigen 4
gb:601154649F1 NIH_MGC_19 Homo sapiens c
small inducible cytokine B subfamily (Cy
gb:RC-BT029-120199-219_1 BT029 Homo sapi
gb:CM4-BT0320-221199-047-g10 BT0320 Homo
ESTs, Weakly similar to unnamed protein 
422765 AW409701

414918 AI219207 

429334 D63078 

448865 R35027 

409219 AA393383 

400491 H25530 
403485 

408350 AW183350 

445873 AA250970 
400995 



431842 NM_005764 

429732 U20158 

427494 AI628365 

429272 W25140 

427258 AA400091 

449309 AW589823 
400104 

415402 NM_000715



403127 AI904493 

418203 X54942 

425858 AA364923 

421712 AK000140 

455903 D49441 

414564 AA164803 

457942 AW665665 

410442 X73424 

424596 AB020639 

445537 AJ245671 

429597 NM_003816

413472 BE242870 

410564 NM_006033

428575 M19684 
405400 



412570 AA033517 

424349 AF141289

448681 AL109781

445258 AI635931 

456032 AW957446 
404727 

422810 AA317400 

440044 AW665167

415498 U33632 

426600 NM_003378 

422170 AI791949 

449611 AI970394 

402539 AW502761

456983 AI081687 

407910 AA650274 

457887 AI240007 

431765 AF124249 

420344 BE463721

443494 T99719 

456844 AI264155 



435313 AI769400 

441666 AI188346 

416111 AA033813 

427308 D26067 

423069 W15613 

416655 AW968613 

417079 U65590 

449409 AI650935 
400855 

454692 AW813350

414869 AA157291 

439662 H97552 

445181 AW338972 

437129 AL049327 

440128 AA962623 

443715 AI583187 



baculoviral IAP repeat-containing 5 (sur
Hypothetical protein FLJ13459
Homo sapiens cDNA: FLJ23038 fis, clone L
gb:yg60g02.r1 Soares infant brain 1NIB H




Hs.130412 
Hs.110667 
Hs.39421 



epithelial protein up-regulated in carci 
lymphocyte cytosolic protein 2 (SH2 doma
ESTs, Weakly similar to sre-2 [C.elegans



complement component 4-binding protein, 



Hs.57664 
Hs.99890 
Hs.83758 



Hs.171014 
Hs.112432
Hs.197075
Hs.30909
Hs.170225



Hs.97101 
Hs.270404 
Hs.152981
Hs.38761 
Hs.279193 



Hs.107139 
Hs.155981 
Hs.71994 
Hs.153034
Hs.63788
Hs.151017
Hs.12844
Hs.2442
Hs.75379
Hs.65370
Hs.184929

Hs.169294 
Hs.226804 
Hs.74047 
Hs.145550
Hs.21754 
Hs.147613 
Hs.301711 



polymerase (DNA directed), delta 1, cata

CDC28 protein kinase 2 

gb:EST75602 Pineal gland II Homo sapiens



propionyl Coenzyme A carboxylase, beta p 



disintegrin and metalloproteinase doma
solute carrier family 1 (glial high affi
lipase, endothelial

serine (or cysteine) proteinase inhibito

transcription factor 7 (T-cell specific, 
ESTs, Moderately similar to ALUC_HUMAN ! 
electron-transfer-flavoprotein, beta pol
solute carrier family 7 (cationic amino 
Homo sapiens mRNA full length insert cDN 
ESTs 
ESTs 

gb:EST19374 Retina II Homo sapiens cDNA 
EST 

potassium channel, subfamily K, member 1 
VGF nerve growth factor inducible 
anti-Mullerian hormone 
ESTs 

KIAA0430 gene product 



novel SH2-containing protein 1 
Putative G protein-coupled receptor GPCR 
Homo sapiens cDNA: FLJ22389 fis, clone H
CDP-diacylglycerol synthase (phosphatida
Homo sapiens cDNA: FLJ21564 fis, clone C



gb:AF150286 Human mRNA from cd34+ stem c



Hs.193143 KIAA1069 protein

Hs.189729 ESTs

Hs.301776 ESTs

Hs.79018 chromatin assembly factor 1, subunit A (

Hs.174905 KIAA0033 protein

Hs.1613 adenosine A2a receptor

Hs.79428 BCL2/adenovirus E1B 19kD-interacting pro
interleukin 1 receptor antagonist



Hs.301694 ESTs 



gb:MR3-ST0192-100100-024-g07 ST0192 Homo 
422355 AW403724
405291 

432113 AA935065 

441236 AA923489 

424418 BE503432 

453028 AB006532 

407137 T97307 

443482 AI064690 



446501 AI302616 
433921 AA618174 
409615 AW444861 



Hs.140 immunoglobulin heavy constant gamma 3 

Hs.152385 ESTs 

Hs.130432 ESTs 

Hs.66170 HSKM-B protein 

Hs.31442 RecQ protein-like 4

Hs.199067 EST 

Hs.171176 ESTs 

gb:601150677F1 NIH_MGC_19 Homo sapiens c 

Hs.190016 ESTs 

Hs.150819 ESTs 

gb:nq14f01.s1 NCI_CGAP_Thy1 Homo sapiens
gb:UI-H-BI3-ajz-a-04-0-UI.s1 NCI_CGAP_Su
gb:601277913F1 NIH_MGC_20 Homo sapiens c 



426465 AI758948 
443695 AW204099 
437372 AA323968



437100 AI761073 



gb:ty16f07.x1 NCI_CGAP_UI3 Homo sapiens 
Hs.112759 ESTs Weakly similar to AF126780 1 relin 
Hs.283631



Homo sapiens cDNA: FLJ22314 fis, clone H



420973 AA743415 

430491 AL109791 

442549 AI751601 

409867 AW502161 

451110 AI955040 

418216 AA662240 

411897 AW875066 



432540 AI821517 

446315 NM_016293

443270 NM_004272 

451035 AU076785 

406686 M18728 

454590 AW809762 
402430 

446704 AI337228 

435282 AA677428 

426062 N57014 

415451 H19415 

456002 AI628729 

409613 AW444816 



430250 NM_016929 

418327 U70370 

400379 NM_018432

436076 AI193277 

432119 T80289 

445774 AI254165 

455604 BE011183 

411425 BE141714 



454417 AI244459 

439949 AW979197 

414995 C18200 

428071 AF212848 

412323 AW937143 

434283 AW235341 

447798 AI425049 
401723 
406270 

452194 AI694413 

415757 AA830854 



435615 



Hs.291368 ESTs 

Hs.241559 Homo sapiens mRNA full length insert cDN
Hs.8375 TNF receptor-associated factor 4 

gb:UI-HF-BR0p-ajr-g-12-0-UI.r1 NIH_MGC_5
Hs.301584 ESTs 
Hs.283099 AF15q14 protein 

gb:RC5-PT0001-180100-021-F04 PT0001 Homo
Hs.282093 Homo sapiens cDNA: FLJ21918 fis, clone H

0 

Hs.105866 ESTs 
Hs.14770 bridging integrator 2 
Hs.9192 V 
Hs.430 f 



Hs.222056 Homo sapiens cDNA FLJ11572 fis, clone HE
0 

Hs.197083 ESTs 
Hs.189731 ESTs 
Hs.44013 ESTs 

Hs.268720 ESTs, Moderately similar to ALU1_HUMAN A
Hs.191450 ESTs, Weakly similar to type II membrane
Hs.171537 Homo sapiens cDNA: FLJ21596 fis, clone C
Hs.127826 RalGEF-like protein 3, mouse homolog

gb:yi60c11.r1 Soares placenta Nb2HP Homo 
Hs.283021 chloride intracellular channel 5 
Hs.84136 paired-like homeodomain transcription fa 
Hs.283076 Homo sapiens ovarian cancer related prot 
Hs.120954 ESTs 

gb:yd03h04.r1 Soares infant brain 1NIB H 

Hs.145504 ESTs 

gb:PM3-BN0218-100500-003-d09 BN0218 Homo 
gb:QV0-HT0101-061099-032-c04 HT0101 Homo

Hs.253503 ESTs 

gb:601141065F1 NIH MGCJ Homo sapiens cD 

Hs.259307 ESTs 

Hs.110826 trinucleotide repeat containing 9 
Hs.292073 ESTs 

gb:C18200 Human placenta cDNA (TFujiwara
Hs.182339 transcription factor ESE-3B 

gb:PM1-DT0041-281299-001-f01 DT0041 Homo 
Hs.58715 mouse thiamin pyrophosphokinase homolog
Hs.119629 ESTs, Moderately similar to ALU1_HUMAN A



Hs.298262 ESTs, Weakly similar to dJ88J8.1 [H.sapi 

Hs.187810 ESTs 

Hs.52515 transducin (beta)-like 2

Hs.4975 potassium voltage-gated channel, KQT-lik 

gb:IL-BT152-080399-004 BT152 Homo sapien

Hs.224812 ESTs 

Hs.137476 KIAA1051 protein 

Hs.136912 ESTs 

Hs.256897 ESTs, Weakly similar to dJ355012.1 [H.sa 

Hs.183961 ESTs 

Hs.18442 E-1 enzyme 
449183 AW445022

410146 AW592655 

458164 AI208666

410153 BE311926 



425188 AK002052 

417173 U61397 

433279 AW971745 

454112 NM_000885

423261 Z43509 

434084 AI061640 

446115 AI733075 

416719 H79731 

421462 AF016495 

424517 AI539443 

430832 AI073913 

436070 AK000073 

416969 AI815443 

444929 AI685841 

453922 AF053306 

439031 AF075079 

414539 BE379046 

425349 AA425234 

449986 AW864502 

418717 AI334430 

438769 AA830684 

441859 AW194364 



Hs.15830 
Hs.58314 
Hs.118442 



Hs.301806 
Hs.192788



Homo sapiens cDNA FLJ12691 fis, clone NT
cyclin C

gb:EST381912 MAGE resequences, MAGK Homo
gb:QV4-TT0008-091199-02Se09 TT0008 Homo
Homo sapiens cDNA FLJ13881 fis, clone TH
hypothetical protein FLJ11190
ubiquitin-like 1 (sentrin)

gb:EST383834 MAGE resequences, MAGL Homo 
ESTs 

hypothetical protein PRO1905 

ESTs, Weakly similar to S69913 hypertens 

gb:yu81f12.r1 Soares fetal liver spleen 

Homo sapiens cDNA FLJ12169 fis, clone MA



Hs.100686 ESTs, Weakly similar to : 

gb:Homo sapiens cDNA FLJ20066 fis, clone
Hs.283404 organic cation transporter 
Hs.161354 ESTs 

Hs.36708 budding uninhibited by benzimidazoles 1 
gb:Homo sapiens full length insert cDNA
gb:601236645F1 NIH_MGC_44 Homo sapiens c

Hs.79886 ribose 5-phosphate isomerase A (ribose 5 

gb:PM4-SN0016-120400-004-b12 SN0016 Homo

Hs.86984 ESTs 

Hs.163426 ESTs 

Hs.128022 ESTs, Weakly similar to FIG1 MOUSE FIG-1 
Hs.15113 homogentisate 1,2-dioxygenase 



TABLE 21B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



AW444861 BE074994 BE074965 BE074S92 
AW502208 AW502366 AW502148 
AW502161 AW502587 AW502345 
AW592655 R05927 R06916



BE141714 AW845993 AW845989

AW875066 AW875079 AW875075 AW875062 AW875061 AW875074

AW937143 AW937150 AW937141 AW937151 AW937132 AW937160 AW937191 AW937174 AW937195 AW937173
AW937159 AW937139 AW937171 AVW M W937145AV 71 V37163 AW937154 AW937137 AW937179
AW937155 AW937140 AW937135 AW937170
BE271584M112511 
1 BE314385 



377350_1
38950_1
41426_1
43343_1
46798_1
542469_1



C18200 D78581 T82025 
H79731 H79732 
AL138411 AL138412 AA315860

Z43509 H09001 AA375202 AW954383
AA364923 AW963483 BE182774 C21461
AI758948 AA379527 AA
AW969834 AA528493 AA483165 AW969842
T80289 AF052168
AW971745 AA581359 AA581358
AA618174 AI114549 R36464 R36465
R76593 AF147390 R76594
AK000073 AA380183 AA380181 AW963533
AL049327 AA847105
AF075079 H48601 H48795



R35027 R12034 BE407120
AI624049 AW117770 AI858360 
AW864502 AW864369 AI678780 



32 F07756 F08149 F07647 



AI903396 AI903361 AI903360 
TABLE 21C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



401723 
402104 
402430 



403824 
404220 
404727 
404767 
405291 
405392 



1931571 
8099094 
6649315 



17801-18228 

141186-141601 

157315-157950 

147273-147503 

122409-122600 

62382-62552 

44264-44443 

26009-26178 

119837-121197 

2888-3001,3198-3532,3655-4117

473-887 

46107-46439 

115534-115747 

23244-23759 

19999-20473,20672-21036,21147-21285,21378-21667 

116167-116289,118879-119030 

9418-9573 



Plus 1553-1712,1878-2140,4252-4385,5922-6077 



TABLE 22A: 430 SIGNIFICANTLY DOWN-REGULATED GENES, UTERINE CANCER VERSUS NORMAL UTERUS 
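Table 22A lists about 430 genes significantly down-regulated in uterine cancer compared to normal uterus.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title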



Pkey ExAccn 

414063 H26904 

447990 BE048821 

407815 AW373860 

452547 AA335295 

415165 AW887504 

453655 AW960427 

429350 AI754634 

407228 M25079 

425869 AA524547 

416585 X54162 

408614 AL137698 

417542 J04129 

412295 AW088826 

421998 R74441 

452093 AA447453 

429707 W76631 

416950 AL049798 

408221 AA912183 

406791 AI220684 

446500 U78093 

407938 AA905097 

410677 NM_003278 

412524 AA417813 

452426 AI904823 

414290 AI568801 

439627 BE621702 
400258 

414807 AI738616 

410023 AB017169 

407663 NM_016429 

410285 AI739159 

418986 AI123555 

409050 AI815867 



UnigeneID Unigene Title

Hs.75736 apolipoprotein D 

Hs.20144 small inducible cytokine subfamily A, member 14 

Hs.301716 ESTs 

Hs.74120 adipose specific 2 

Hs.78055 complement component 7 

Hs.300878 ESTs, Moderately similar to TGR3_HUMAN TGF-BE

Hs.131987 ESTs 



Hs.160318 FXYDdo 

Hs.79386 leiomodin 1 (smooth muscle) 

Hs.46531 Homo sapiens mRNA; cDNA DKFZp434C1915 (frame 

Hs.82269 progestagen-associated endometrial protein (p 

Hs.22971 ESTs 

Hs.117176 poly(A)-binding protein, nuclear 1 
Hs.27860 Homo sapiens mRNA; cDNA DKFZp586M0723 (from c 
' liti 



Hs.47447 ESTs 

Hs.272572 hemoglobin, alpha 2 

Hs.15154 sushi-repeat-containing , 

Hs.85050 phospholamban 

Hs.65424 tetranectin (plasminogen-binding protein) 

Hs.11177 ESTs 

Hs.31297 Homo sapiens cDNA: FLJ23001 fis, clone LNG002

Hs.71721 ESTs

Hs.29076 Homo sapiens cDNA: FLJ21841 fis, clone HEP018

Hs.79064 deoxyhypusine synthase 

Hs.77348 hydroxyprostaglandin dehydrogenase 15-(NAD) 

Hs.57929 slit (Drosophila) homolog 3

Hs.37482 COP22 for nonclathrin coat protein zeta-COP 

Hs.61898 DKFZP586N2124 protein 

Hs.81796 ESTs 

Hs.50130 necdin (mouse) homolog 

Hs.279837 glutathione S-transferase M2 (muscle) 

Hs.1327 butyrylcholinesterase 

Hs.1119 nuclear receptor subfamily 4, group A, member

Hs.25511 transforming growth factor beta 1 induced tra

Hs.300878 ESTs, Moderately similar to TGR3_HUMAN TGF-BE 
MJ04673



430468 
453060 

424206 NM_003734

422126 AW973734 

406082 S47833 

421639 NM_012082
402520 

418043 AW377752 

443906 AA348031 

450958 AL137669 

418828 AF020774 

412828 AL133396 

429507 NM_003102 
400545 

425078 NM_002599 

429942 AI338993 

438303 AB028998 

419971 AA400027 

443060 D78874 

452877 AI250789 

412442 AI983730 



amine oxidase, copper containing 3 (vascular
Misshapen/NIK-related kinase 
adenosine monophosphate deaminase 2 (isoform 
Homo sapiens mRNA full length insert cDNA do 

H.sapiens mRNA for tyrosine kinase receptor 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434M0435 (from c



447786 BE620810 
400023 

453874 AW591783 

414134 XB0188 

428451 AW970451 

435520 AA297990 

437179 AA393508 

441481 AA935303 

450227 BE388192 
403731 

452814 AI092790 

410036 RS7171 

416854 H40164 

418421 R58620 

407000 U12139 

421803 NMJ312205 

445613 BE550889 

432302 AA345857 

420796 L34355 

423720 AL044191 

417302 BE245812 

421913 AI934365 

440130 AI083899 

431967 AJ243653 

424580 AA446539 

406907 Z25427 

443745 AB039670 

429101 AW452174 

410691 AW239225 

408853 AW291484 

407979 M046306 

448619 AI867182 

424585 M464840 

407891 AA485620 

407196 D11747 

426990 AL044315 

450493 M93718 

420120 AL049610 

423690 AA329648 
402865 

417387 AW021102 

456898 NM_001928 
459722 

422927 AW247388 
402195 

418213 AW978753 

440274 R24595 

455818 AI733747 

420851 AI039044 
405228 

441292 AF131218 

432553 AA553334 

417098 AB017365 



413186 AU077141 



Hs.130699 
Hs.21594 
Hs.198241 
Hs.112028 
Hs.82927 
Hs.297921 

Hs.83341 
Hs.7913 
Hs.25700 
Hs.88844 
Hs.74621 
Hs.2420 



Hs.134535 ESTs 

Hs.6147 KIAA1075 protein 

Hs.296234 ESTs, Highly similar to mitogen-activated pro 

Hs.8944 procollagen C-endopeptidase enhancer 2 

Hs.32478 ESTs 

Hs.26530 serum deprivation response (phosphatidylserin

Hs.184367 GTPase activating protein-like 

Hs.28625 ESTs 

Hs.39619 hypothetical protein LOC57333 

AFFX control: 18S ribosomal RNA 

Hs.36131 collagen, type XIV, alpha 1 (undulin) 

Hs.861 mitogen-activated protein kinase 3 

Hs.98570 ESTs 

Hs.9315 HNOEL-iso protein 

Hs.171409 serologically defined colon cancer antigen 8 



ESTs

Hs.78521 Homo sapiens cDNA: FLJ21193 fis, clone COL001

Hs.55016 hypothetical protein FLJ21935 

Hs.57975 calsequestrin 2, cardiac muscle 

Hs.80296 Purkinje cell protein 4

Hs.85050 phospholamban

gb:Human alpha1(XI) collagen [COL11A1] gene,

Hs.108441 3-hydroxyanthranilate 3,4-dioxygenase

Hs.158491 ESTs 

Hs.274307 KIAA1442 protein 

Hs.99931 sarcoglycan, alpha (50kD dystrophin-associate

Hs.23388 Homo sapiens cDNA: FLJ21310 fis, clone COL021

Hs.8941 ESTs 
Hs.109439 

Hs.157527 ESTs
Hs.283404 
Hs.35092 



Hs.254967 ESTs 

Hs.62927 ESTs 

Hs.202255 ESTs 

gb:zx43h11.r1 Soares_total_fetus_Nb2HF8_9w Ho

Hs.41135 Endomucin2 

Hs.177415 Finkel-Biskis-Reilly murine sarcoma virus (FB 

Hs.173094 Homo sapiens mRNA; cDNA DKFZp564H142 (from cl

Hs.166373 nitric oxide synthase 3 (endothelial cell)

Hs.95243 transcription elongation factor A (SII)-like

Hs.23804 ESTs 

Hs.21509 ESTs 

Hs.155597 D component of complement (adipsin) 

Homo sapiens cDNA: FLJ23449 fis, clone HSI058

Hs.301423 calcium binding protein 1 (calbrain) 



Hs.7122 scrapie responsive protein 1 

gb:zn86d04.y5 Stratagene lung carcinoma 93721
Hs.88827 Homo sapiens mRNA for FLJ00033 protein, parti

Hs.7765 chromosome 16 open reading frame 5
Hs.211095 ESTs 

frizzled (Drosophila) homolog 7 



Hs.34074 dipeptidylpeptidase VI



Hs.289006 
Hs.75231 
Hs.154476 
Hs.108124 



ESTs, Weakly similar to 
solute carrier family 16 (monocarboxylic acid
hypothetical protein FLJ20626
430036 AL050284

430233 AW367902 

436130 AA341497 

434843 R43707 

429303 AW137535 

442422 AI344415 

410399 BE068889 

435869 AF255910 

447384 AI377221 

440610 AI733098 

445806 AL137516 

433657 AI244368 

436467 AW450278 

440191 AI990417 

417511 AL049176 

406976 M50299 

443547 AW271273 

417998 AW967420 

419313 AA843387 

408322 AW181985 

448422 BE263813 
403121 

424198 AB029010 

459060 H89244 

457829 AI742291 

445029 AF196481 

424362 AL137646 

417067 AJ001417 

413972 BE279548 

435891 AW249394 

447551 BE066634 
400637 

409882 AJ243191 

430310 U60115 
402741 
401703 



430342 NM_005938
404033 

411939 AI365585 

431227 X63755 

452669 AA216363 

439698 AW779654 

416253 BE250659 

418556 T02850 

408877 M479033 

415994 NM_002923 

417054 AF017060 
404654 

420174 AI824144 



Hs.227782 DKFZP586M1019 protein 

Hs.236443 Homo sapiens mRNA; cDNA DKFZp564N1063 (from c

Hs.31408 ESTs

Hs.133159 ESTs, Weakly similar to PIHUSO salivary pro

Hs.44238 ESTs

Hs.156082 ESTs

Hs.63236 synuclein, gamma (breast cancer-specific prot

Hs.54650 ESTs, Weakly similar to (defline not availabl

Hs.40528 ESTs

Hs.130800 ESTs
Hs.13323 



Hs.237B7 ESTs 



gb:Human alpha-1 collagen type II gene, ex 



gb:EST379495 MAGE resequences, MAGJ Homo sapi 
ESTs 
ESTs 

gb:601194177F1 NIH_MGC_7 Homo sapiens cDNA cl
KIAA1087 protein 

heterogeneous nuclear ribonucleoprotein D (AU 
ESTs, Weakly similar to dJ1039K5,2 [H.sapiens 

Homo sapiens mRNA; cDNA DKFZp586F0824 (from c
solute carrier family 22 (extraneuronal monoa
ESTs, Weakly similar to HPPD_HUMAN 4-HYDROXYP
copper chaperone for superoxide dismutase



Hs.143026 

Hs.79625 

Hs.210843 

Hs.12256 

Hs.146001

Hs.81086

Hs.162717

Hs.5002 

Hs.929 



Hs.251928 

Hs.19447 Homo sapiens mRNA for FUOOii 

Hs.239663 myeloid/lymphoid, ' 

Hs.146246 ESTs 

Hs.2743 keratin, cuticle, ultrahigh sulphur 1 

Hs.262958 ESTs, Weakly similar to alternatively spliced 



Hs.15463 ESTs 



gb:FB12A9 Fetal brain, Stratagene Homosapien 



Hs.23912 ESTs 






420105 AW015571 

404619 BE514535 

423282 AL137563 

424097 M13981 

448543 AW897741 

427605 NM_000997 
406535 

418947 W52990 

414323 NM_014759 

457111 AA482027 

418373 AW750770 

424461 D83542 
451565 NM_000897 
407751 BE276096 
432031 AF039196 
404608 H58589 
451962 AW078832 
424100 AI793080 
451509 AI969529 
453512 AL040160 
429924 W39693 
423780 AA352013 
427030 AA397600 
439872 T81058 
407836 T79340 
451427 AI091441 

424462 AU0766S6 
451533 NM_004657
422319 AW403342 



Hs.192375 

Hs.32244 

Hs.77171 

Hs.126378 

Hs.1734 

Hs.21380 

Hs.179779 



Hs.239500 KIAA0273 gene product 



rtodJ127B20.3 [H.sapiens 
3 deficient (S. cere 



Hs.142569
Hs.84344 
Hs.148090 
Hs.456 



Hs.123525 
Hs.171637 
Hs.209542 
Hs.226138 



Hs.97531 ESTs 



ESTs 
CGI-135 protein 

cadherin 15, M-cadherin (myotubule) 
leukotriene C4 synthase 
from HeLa cyclin-dependent kinase 2 interacti
hairless (mouse) homolog

Homo sapiens cDNA FLJ11027 fis, clone PLACE10
ESTs

ESTs, Weakly similar to NGAL_RAT NEUTROPHIL G
Homo sapiens cDNA: FLJ21937 fis, clone HEP044
ESTs, Weakly similar to B cell linker protein 

sapiens mRNA; cDNA DKFZp566H2446 (from o 
cDNAff 



Hs.22575 
Hs.26401 
Hs.148101 



gb:yd26c08.r1 Soares fetal liver spleen 1NFLS
Homo sapiens cDNA: FLJ21042 fis, clone CAE112
tumor necrosis factor (ligand) superfamily, m
429790 AK001352

422796 AW897255 

427980 M418305 

409543 AW410200 

440206 AI762232 

455904 BE156173

427707 NM_005578 

437140 M312799 

417637 AA204969 

419171 NM_002846

417808 AF177909 

426232 Z70024 

440747 AW297226 

415307 F05232 

407049 X72632 

454054 AI336329 

411085 AF022991 

443104 AA088470 

424106 M412442 

446716 M436575 

448677 AI560769 

434919 AI821740

401171 M360954 

456804 AI421645 






Hs.59563 
Hs.23437 
Hs.98314 
Hs.221737 



Hs.46794 
Hs.180398 




Homo sapiens mRNA for FLJ00007 protein, parti
Homo sapiens cDNA FLJ13555 fis, clone PLACE10
Homo sapiens mRNA; cDNA DKFZp585L0120 (from c
hypothetical protein FLJ10490
gb:CMO-NN0057-150400-335-a04 NN0057 Homo sapi
gb:zv96g05.s1 Soares_NhHMPu_S1 Homo sapiens c
gb:fh05b12.x1 NIH_MGCJ7 Homo sapiens cDNA cl



gb:QV0-HT0367-201299-079-a02 HT0367 Homo sapi
LIM domain-containing preferred translocation
tis.ztttdtiy activator of CREM in testis
Hs.234863 Homo sapiens cDNA FLJ12082 fis, clone HEMBB10
Hs.89655 protein tyrosine phosphatase, receptor type,
Hs.12828 tweety (Drosophila) homolog 1
Hs.168157 nuclear transcription factor Y, gamma
Hs.137840 ESTs, Moderately similar to SIX1_HUMAN HOMEOB
Hs.27495 prostate cancer associated protein 7
(NONE)

Hs.301519 Homo sapiens cDNA FLJ12536 fis, clone NT2RM40
Hs.68398 period (Drosophila) homolog 1
Hs.83135 p53-responsive gene 6
Hs.98132 ESTs 
Hs.16602 ESTs 
Hs.227051 
Hs.116531 
Hs.27268 
Hs.139851 



420037 

428818 AI131291 

426485 NM_006207
404947 

412577 AW0296C8 
401551 

408053 AW139474 

425016 AA376049 

418179 X51530 

418994 AA296520 

457514 AA775208 

426275 BE151551 

457924 AL390142 

430712 AW044647 

455144 AW875942 

407524 X64985 

426712 AW173177 



436950 L05779 

415196 AK000150 

442197 AW837912 

433457 AA830194 
402316 

409736 AA078628 

407964 AW130334 

433677 AI791912 

425507 AI684745 

413724 AA131466 

408922 R87388 

413055 AV655701 

435977 AL138079 

442208 AW296984 



Hs.48938 
Hs.231444 
Hs.56729 
Hs.150557 
Hs.11690 
Hs.181641 
Hs.157425
Hs.156336 



Hs.122540 
Hs.108675 
Hs.135569 



Homo sapiens mRNA; cDNA DKFZp564N196 (from cl 

gb:QV3-BT0537-280100-07te04 BT0537 Homo sapi 
Homo sapiens cDNA: FLJ21802 fis, clone HEP007
ESTs 

lymphocyte-specific protein 1 



Weakly similar to NEUROD [H.sapiens] 
platelet-derived growth factor receptor-like 



Hs.154162 ADP-ribosylation factor-like 2 
Hs.1145 Wilms tumor 1 

Hs.89546 Selectin E (endothelial adhesion molecule 1) 



Hs.196284 ESTs 



Hs.197755 
Hs.21374 
Hs.225795 
Hs.131194 
Hs.288835 
Hs.192491 
Hs.113 
Hs.78185 



ESTs, Highly similar to OTX1_HUMAN HOMEOBOX P
ESTs 

cell death-inducing DFFA-like effector b 
ESTs 

epoxide hydrolase 2, cytoplasmic 
MAX-like bHLHZIP protein

gb:QV3-LT0048-260100-068-c02 LT0048 Homo sapi
ns. iswi / Homo sapiens mRNA for FU00027 protein, parti 

gb:7P07H07 Chromosome 7 Placental cDNA Librar
Hs.281111 ESTs

Hs.190885 ESTs, Moderately similar to ALU1_HUMAN ALU SU
Hs.165983 hypothetical C2H2 zinc finger protein FLJ2250
Hs.23767 Homo sapiens cDNA FLJ12666 fis, clone NT2RM40
gb:ym88g04.r1 Soares adult brain N2b4HB55Y Ho
Hs.75183 cytochrome P450, subfamily IIE (ethanol-induc
Hs.5012 brain-specific membrane-anchored protein
Hs.255595 ESTs, Weakly similar to PSF_HUMAN PTB-ASSOCIA

Hs.14394 hypothetical protein FLJ20157
416193 T25400

407065 Y10141 

441785 AW138139 

413784 BE155819 

429092 AI190864 

408499 AW205323 

453754 AW972580 

450826 U43030 

428486 AW583497 



423334 AK000906 

422948 AW810824 

447852 AW504781 

419084 AA496539 

455771 AW016739 

438564 AA381553 

448705 H05072 

454460 X65945 

458893 BE161733 



434179 AI743448 



gb:CMO-HT0485-220333-30 1-d1 2 HT0486 Homo sapi 
Hs.178225 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAM 
Hs.253475 ESTs 
Hs.172753 ESTs 



Hs.184604 pancreatic polypeptide 

Hs.48793 ESTs 

Hs.127273 hypothetical protein FLJ10044

Hs.21351 ESTs 

gb:UI-HF-BN0-aln-c-04-0-UI.r1 NIH_MGC_50 Homo

Hs.179902 transporter-like protein

Hs.232201 ESTs

Hs.198253 major histocompatibility complex, class II, D

Hs.124984 ESTs, Moderately similar to unnamed protein p

Hs.748 fibroblast growth factor receptor 1 (fms-rela 

Hs.97283 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAM 

Hs.21213 ESTs 

Hs.24947 ESTs 

Hs.116177 ESTs



443918 AA305475 



441704 AI458766 

414272 AI651603 

448224 R48700 

404611 H58589 

448381 D61580 

464719 BE006547 

446973 H95724 

457760 AA668123 

440144 AW082297 



Hs.276489 ESTs 

Hs.126519 ESTs

Hs.97044 Kv channel-interacting protein 2 

Hs.22660 Homo sapiens cDNA FLJ11658 fis, clone HEMBA10

Hs.300459 ESTs 

Hs.128942 ESTs 

Hs.32699 ESTs, Weakly similar to LIV-1 protein [H.sapi 

Hs.201988 ESTs 

Hs.46988 ESTs 

Hs.20733 EH-domain containing 2 

Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PLACE10

Hs.21036 Homo sapiens mRNA; cDNA DKFZp434A1010 (from c
gb:RC2-BN0130-040400-011-b03 BN0130 Homo sapi

Hs.4283 ESTs

Hs.134170 ESTs



l AA416756 



435487 W07343 

414213 BE297765 

455916 BE156710 

448943 AI608810 

418026 BE379727 

454082 AF28350a 

453308 AW959731 

458823 AW207574 

452532 AI905811 

418464 R87580 

409473 AL137716 

449779 AA004258 

457546 AA568484 



443528 AK001778 



Hs.161061 ESTs, Moderately similar to ALU6_HUMAN ALU SI 



gb:601176246F1 NIH_MGC_17 Homo sapiens cDNA c
gb:QVO-HT0366-310300-181-d01 HT0368 Homo sapi 
Hs.193288 ESTs 

Hs.83213 fatty acid binding protein 4, adipocyte 
Hs.63168 cell death regulator aven 
Hs.32538 ESTs 
Hs.179501 ESTs 

Hs.110757 DNA segment on chromosome 21 (unique) 2056 ex 
gb.yn89h07n Scares adult bran N2b4HB55Y Ho 
Hs.296557 Homo sapiens mRNA; cDNA DKFZp434D2030 (from c 
Hs.25218 ESTs, Weakly similar to AUJB_HUMAN !!!! ALU C 
Hs.153632 ESTs 



;al protein FU10916 
11 receptor, alpha 



406929 U04690 

436365 AW444548 
402550 

441782 AW140126 



401082 

454316 AW366144 
421572 AA531607 



Hs.163118 ESTs 



n olfactory receptor (OR17-210) gene, 



Hs.132357 ESTs 

Hs.193579 ESTs 

Hs.143964 ESTs 

Hs.159492 spastic ataxia of Charlevoix-Saguenay (sacsin 

Hs.10475 ESTs 

Hs.153450 ESTs, Weakly similar to 1909123A Na glucose c 

Hs.18686 Mouse Mammary Tumor Virus Receptor homolog 



gb:QVO-HT0101-061099-032-b12 HT0101 Homo sapi 
Hs.125143 ESTs, Weakly similar to POL2 MOUSE RETROVIRUS 
Hs.150968 hypocretin (orexin) receptor 1 
Hs.185202 ESTs 



454012 M76424 
402457 

454613 AWB10814 

429821 AL096749 

431073 BE254470 

421143 AB024536 
401223 

407124 R08160 

437217 AW779241 

427627 R87582 



405479 

418432 M14156 

426315 NM_002430 

412171 AW897452 

447241 BE382838 
402100 

438285 AW139266 

407947 AI500332 



Hs.97203 
Hs.172944 
Hs.82120 



gb:yq12a08.r1 Soares fetal liver spleen 1NFLS 
small inducible cytokine subfamily A (Cys-Cys 
chorionic gonadotropin, beta polypeptide 
nuclear receptor subfamily 4, group A, member 



Hs.225433 Homo sa 



Hs.102171 immunoglobulin superfamily containing leucine 



Hs.123473 

Hs.268857 

Hs.155316 

Hs.179915 

Hs.739 

Hs.171965 

Hs.75106 

Hs.4 

Hs.79457 



lar to ALU1_HUMAN ALU SUBFAM 



u 1 t r r 4j 
alcohol dehydrogenase 2 (class I), beta polyp 
■ 1 FLJ20519 



Hs.19322 
Hs.134807 



insulin-like growth factor 1 (somatomedin C) 
Human DNA sequence from clone 437G10 on chrom 
gb:CMO-NN0058-150400-337-b08 NN0358 Homo sapi 



439624 AA838771 

444455 A11 49879 

455314 Y17114 

427872 AA835058 

409826 AW501112 

414002 NM_006732 

442682 AI014545 

457033 AF029674 

410480 R97457 
401007 

458274 AF149297 

454106 D1S687 

432928 AA570454 

425352 NM_000939 

433887 AW204232 

434927 H46612 
404282 

422581 NM_016339 

424823 NM_006226 

408107 AA806754 
401577 

4338B3 AI925688 

408104 AW972927 
404642 
400675 



Hs.124407 ESTs 

Hs.175024 Homo sapiens cDNA: FLJ23447 fis, clone HSI033 
Hs.73393 eyes absent (Drosophila) homolog 4 



Hs.2111 



ESTs 



hypothetical protein FLJ23412 
Hs.75678 FBJ murine osteosarcoma viral oncogene ho 
Hs.231027 EST 
Hs.173422 KIAA1605 protein 

" cadherin 13, H-cadherin (heart) 



Hs.63984 



Hs.8087 NAG-5 
Hs.245146 ESTs 
Hs.186467 
Hs.1897 
Hs.279522 
Hs.293815 



Weakly similar to PLM_HUMAN PHOSPHOLEMM 
i nudeofide exchange factor II 



[s, Weakly similar to B24264 pn 



!Ts, Weakly similar to ALU4_HUMAN ALU SUBFAM 



Unique Eos probeset id< 
;n Gene cluster number 
Genbank accession nun 



1 3891 50_ 
1426375 1 
1577102J 
1577561.1 
171375 1 
1759038_-2 
1767866 -1 
221500.1 

231952.1 



AW41 0200 AW409705 AW41 1 433 BE 
AAO78628R09O51AAO78197AAO77334AW748808 AW748807 
AW897452 Z20302 D55805 D52877 D60432 



AW967420 AA210915 AA236991 M210916 



AW897265 AW897274 AL1 1 9504 AW897275 AW897270 AW89731 2 AW89731 8 AW89731 7 M317240 AW961 361 
T05241 AA326794AL138130AW407975AW999277 
AA35201 3 AA330878 AA339379 AW966303 
427980 
439872 
442197 

448422 
453621 
454316 
454613 

454719 
455144 
455818 



241151.1 
263712 1 
2B5225J 
47B23J 



1226904 J 

1230646.1 
1254914.1 
137219.1 
1382290.1 
1382748.1 
1397288.1 



AA41 8305 Al 264351 

T81058AL357200 T70270 

AW83791 2 AW837934 AA984475 AW997490 

AW504781 BE620394 

BE263813 BE253504 AI500202 BE251145 

AW749983AL045823 

AW366144AW366154AW366142AW366151 AW366140AW365155 BE141715 BE141718 BE141698 

AW810814 AW810787 AW81 0854 AW81 0773 AW810735 AW81 0785 AW81 0660 AW810834 AW81 0874 AW810723 

AW610881 AW810791 AW81 0644 AW81 0659 AW810676 

BE006547 AW81 5578 AW81 631 1 AW856304 

AW875942 AW858234 AW875938 AW87594 1 AW858235 AW875958 

AI733747M129802 

BE156173 BE156305 BE1S6196 

BE156710 BE156726 BE156712 

BE177442 BE177439 BE177445 BE177440 BE177448 BE177444 BE177433 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 

Ref Strand Nt_position 

8954013 Plus 131475-131652 



400632 
400637 
400675 



8118750 
8117333 
3242744 



9280797 
4B26475 
8084234 
8117697 
7689778 
2935596 
7527774 



402457 

102550 
402741 



3342751 
6453391 
7577903 



11223-11816 

140821-141050 

22937-23494,27677-27966 

148940-150214 

189824-190728 

139377-139674,141195-141281,142217-142340 

135-1229 

207002-207288 

133649-133792 

147901-148884 

31065-31233,33680-33771,34345-34411,38890-39125,39779-39943 

107 1 1091 10817 r > 223 

131788-132729 

24502-24666,24986-25102 

73590-73824 

16513-16577,16838-16926 

171761-171995 

80413-80673 

18603-18760,19719-19890 

3197-3429,3722-3914,5795-5987,6802-6961,8653-8815,9292-9660 

36729-37084 

4059-4258 

70286-70429,75165-75258 

117504-117662 

144000-144618 

7976-8156 

161506-161781 



102999-103145 
6275-6527 

05,30176-30412 
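
The nucleotide position entries above give predicted exons as comma-separated start-end pairs. Purely as an illustration, a minimal Python sketch for reading one such entry could look like the following. The function names are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the table itself does not state.

```python
def parse_exon_ranges(field):
    """Parse 'start-end,start-end,...' into a list of (start, end) tuples."""
    ranges = []
    for chunk in field.split(","):
        start_text, end_text = chunk.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError("exon start exceeds end: %r" % chunk)
        ranges.append((start, end))
    return ranges


def total_exon_length(ranges):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in ranges)


# Example using one entry from the position column above.
exons = parse_exon_ranges("22937-23494,27677-27966")
print(exons, total_exon_length(exons))
```

Entries damaged during text extraction (for example, a range missing its start coordinate) would raise an error here rather than being silently repaired.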



TABLE 23A: 626 genes upregulated in uterine cancer relative to normal body tissues 

Table 23A lists about 626 genes upregulated in uterine cancer ... or antibodies. These genes were selected from 59680 probesets on the ... was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains indicative of having oncogenic function or of transducing ... being modulatable by small molecules, peptides, or antibodies (e.g. pkinase, death-domain, 7tm, phosphatase, or ion_transporter). Certain predicted protein domains are noted. 
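
As an illustration of how the domain classes named above might be looked up in a row of Table 23A, the following Python sketch splits one semicolon-delimited row and reports which of the example classes appear among its predicted domains. The keyword list, the function names, and the substring matching are all assumptions made for this sketch; they are not the selection procedure used to build the table.

```python
# Hypothetical keywords taken from the examples named in the text
# (pkinase, death-domain, 7tm, phosphatase, ion transporter).
DOMAIN_KEYWORDS = ("pkinase", "death", "7tm", "phosphatase", "ion_trans")


def parse_row(line):
    """Split a Table 23A row into its columns.

    Assumed column order: Pkey; ExAccn; UniGene ID; title; domain field; R1.
    The domain field can itself contain ';' (domains;TM=..;SS=..), so every
    part between the title and the last column is kept as the domain field.
    """
    parts = [part.strip() for part in line.split(";")]
    if len(parts) < 6:
        raise ValueError("row has too few fields: %r" % line)
    return {
        "pkey": parts[0],
        "accession": parts[1],
        "unigene_id": parts[2],
        "title": parts[3],
        "domain_field": parts[4:-1],
        "r1": float(parts[-1]),
    }


def matching_keywords(row):
    """Return the keywords found anywhere in the row's domain field."""
    text = ",".join(row["domain_field"]).lower()
    return [keyword for keyword in DOMAIN_KEYWORDS if keyword in text]


row = parse_row("425889; M57414; Hs.161305; tachykinin receptor 2; 7tm_1;TM=Y;SS=M; 5.64")
print(row["title"], row["r1"], matching_keywords(row))
```

A title that itself contains a semicolon would shift the columns, so rows of that kind need manual handling in this sketch.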

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar accession number, GenBank accession number 

UniGeneID: UniGene number 

Pred.Prot.Domains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280). 

UniGene Title: UniGene gene title 

R1: 95th percentile of uterine cancer AIs divided by the 50th percentile of normal tissue AIs, where the 10th percentile of all normal tissue AIs was subtracted from ... 
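
The R1 definition above can be sketched numerically. The definition is cut off in this text, so the sketch below assumes that the 10th percentile of the normal tissue AIs is subtracted from both the numerator and the denominator, and that a single list of normal tissue AIs serves for both the median and the background. The linear-interpolation percentile and all function names are likewise assumptions chosen for illustration.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between closest ranks (assumed method)."""
    if not values:
        raise ValueError("no values given")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def r1_ratio(cancer_ai, normal_ai):
    """Background-corrected ratio of tumor to normal average intensities.

    Assumption: the 10th percentile of the normal AIs is treated as
    background and removed from both the numerator and the denominator.
    """
    background = percentile(normal_ai, 10)
    numerator = percentile(cancer_ai, 95) - background
    denominator = percentile(normal_ai, 50) - background
    if denominator <= 0:
        raise ValueError("normal median does not exceed background")
    return numerator / denominator


# Made-up intensities, for illustration only.
print(r1_ratio([84, 84, 84], [0, 10, 20, 30, 40]))
```

With these made-up values the background is 4, the corrected numerator is 80, and the corrected denominator is 16.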

Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1 

428330; L22524; Hs.2256; matrix metalloproteinase 7 (matrilysin, ; Peptidase_M10;; 35.11 

420440; NM_002407; Hs.97644; mammaglobin 2; Uteroglobin;; 22.80 

439335; AA742697; Hs.62492; NM_052863:Homo sapiens secretoglobin, fa; none;; 21.66 

NMJ ); Hs.15931 kkopf ( lae > 111 

421481; AW391972; Hs.104696; KIAA1324 protein; none;TM=M;SS=M; 20.20 
437938; AI950087; Hs.369628; gb:wq05c02.x1 NCI CGAP Kid12 Homo sapien; none,none; 19.83 
406687; M31 126; Hs.352054; matrix metalloproteinase 11 (stront n I m in.Peptid e_M10 17.68 
446619; AU076643; Hs.313; secreted phosphoprotein 1 (osteopontin, ; Osteopontin;; 17.60 
418281; U09550; Hs.1154; oviductal glycoprotein 1, 120kD (mucin 9; Glyco hydro_18;TM=M;SS=M; 17.-18 
lit I ill. NM 00610J. II:- 27 19 IC-1. . ,t*j tl io :: ,■: .7. ;.; i - ,■..:, MJ--V 0S=Y; 16.59 

400301; X03635; Hs.1657; estrogen receptor 1; F-box,hormone_rec,zf-C4,Oest_recep,adh_zinc,ketoacyl-synt,pp-binding,Acyl_transf,Thioesterase,ketoacyl-synt_C,AAA,E7,RFX_DNA_binding;TM=M;SS=N; 16.11 
419356; AI656166; Hs.7331; hypothetical protein FLJ22316; Asparaginase_2,none; 15.90 
433222; AW514472; Hs.238415; dickkopf (Xenopus laevis) homolog 4; none,PH04; 15.39 
417931; W95642; Hs.82961; trefoil factor 3 (intestinal); trefoil;; 15.39 

400234; ; ; NM_000125*:Homo sapiens estrogen recepto; hormone_rec,zf-C4,Oest_recep;TM=M;SS=M; 15.23 
456652; NM_002448; Hs.1494; msh (Drosophila) homeo box homolog 1 (fo; homeobox,none; 15.04 
438817; AI023799; Hs.163242; ESTs; none,none; 13.72 

453857; AL080235; Hs.35861; Ras-induced senescence 1 (RIS1); none;TM=Y;SS=M; 13.67 
424687; J05070; Hs.151738; matrix metalloproteinase 9 (gelatinase B; fn2,hemopexin,Peptidase_M10;; 13.51 
458627; AW088642; Hs.97984; SRY (sex determining region Y)-box 17 (S; HMG_box;TM=M;SS=N; 13.44 
410001; AB041036; Hs.57771; kallikrein 11; trypsin;TM=M;SS=M; 13.11 

421445; AA913059; Hs.104433; Homo sapiens, clone IMAGE:4054868, mRNA; ion_trans,K_tetra,asp; 13.27 
449048; Z45051; Hs.22920; similar to S68401 (cattle) glucose indue; Lamp;TM=M;SS=M; 12.76 
436972; AA284679; Hs.25640; claudin 3; PMP22_Claudin;TM=Y;SS=M; 12.59 
450693; AW450461; Hs.203965; ESTs; Sema,ig,none; 12.52 

415457; AW081710; Hs.7369; Homo sapiens testes specific A2 homolog ; MORN,sugar_tr;TM=Y;SS=M; 12.46 
413719; BE439580; Hs.75498; small inducible cytokine subfamily A (Cy; IL8;; 12.23 
431629; AU077025; Hs.265827; interferon, alpha-inducible protein (do; none;TM=M;SS=Y; 12.09 
417389; BE260954; Hs.82045; midkine (neurite growth-promoting facto; PTN_MK;TM=M;SS=Y; 12.08 
407786; AA687538; Hs.38972; tetraspan 1; transmembrane4;TM=Y;SS=M; 11.91 
444381; BE387335; Hs.283713; hypothetical protein BC014245; Collagen;TM=M;SS=M; 11.86 
446608; N75217; Hs.175622; ESTs; Armadillo_seg,HEAT PBS;TM=M;SS=M; 11.72 
447835; AW591623; Hs.164129; ESTs, Weakly similar to 138022 hypotheti; none,UQ_con; 11.59 
420181; AI380089, Hs 158951, ESTc non ig,p in I fURPCT 11 19 
451253; H48299; Hs.26126; claudin 10; PMP22_Claudin,Peptidase_M1,K_tetra;TM=Y;SS=M; 11.45 
r 5 « Hs.62711; Higl jilit 3 rou f nonhistone chromoso; HMGjox.i i 

448133; AA723157; Hs.73769; folate receptor 1 (adult); Folate_rec,MIP;TM=M;SS=M; 11.37 
421552; AF026692; Hs.105700; secreted frizzled-related protein 4; Fz,NTR;; 11.08 
47 r-7 i 71207 Hs,29279 eyes absent (Drosophila) homolog 2; Hydrolase;; 11.01 

409745; AA077391; ; gb:7B14E12 Chromosome 7 Fetal Brain cDNA; 7tm_1,zf-C3HC4,fn3,SPRY,KRAB,zf-C2H^rve,zf-B_box;TM=Y;SS=M; 10.95 

415138; C18356; Hs.295944; tissue factor pathway inhibitor 2; Kunitz_BPTI,none; 10.91 

416658; U03272; Hs.79432; fibrillin 2 (congenital contractural ara; EGF,TB,granulin,PSI,EB,TIL;TM=M;SS=M; 10.81 

411558; AA102670; Hs.70725; gamma-aminobutyric acid (GABA) A recepto; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 10.72 

438091; AW373062; Hs.351546; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 10.66 

425071; NM_013989; Hs.154424; deiodinase, iodothyronine, type 1 1 T4 deiodina ■ TM=M;SS= 7, 10 65 

430832; AI073913; Hs.100686; ESTs, Weakly similar to JE0350 Anterior ; none,none; 10.52 

451497; H8329 1 Hs 28 122; V. ihibit factor^; EG v'.ir 10 53 

421478; AI683243; Hs.97258; ESTs, Moderately similar to S29539 ribos; none,none; 10.50 

409231, AA446644 H' } ) thelial glycoprotein; thy bill 1;TI Y;SS-W; 10.3E 

443785, AW44935 Hs.1 901 25; basic-he 'ix-loop-hel nil Ff ' S N 10.34 

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromoso; ABC tran,M,SMC N.SMC CDUF1 64,none; 10.34 

431846; BE019924; Hs.271580; uroplakin 1B; transmembrane4;TM=Y;SS=M; 10.34 

415539; AI733881; Hs 72472, NAME OMITTED re i Ir ir j I e.AcI . PDZ,ZU5,death; 10.31 

411274; NM_002776; Hs.69423; kallikrein 10; trypsin;TM=M;SS=N; 10.24 

423673; BE003054; Hs.1695; matrix metalloproteinase 12 (macrophage ; hemopexin,Peptidase_M10;TM=M;SS=M; 10.24 
441377 BE2182o" H 026! t 1 " r nenerc 10,17 

400292; AA250737; Hs.72472; NAME OMITTED ... receptor kinase; pkinase,Activin_recp,PDZ,ZU5,death; 10.17 

452594; AU076405; Hs.29981; solute carrier family 26 1 u Iran <an_ur_ i ase.Sulf I an i bTAS,HMG_box; 10.12 

429663; M68874; Hs.211587; phospholipase A2, group IVA (cytosolic, ; C2,PLA2_B;TM=M;SS=N; 9.87 

413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 9.87 

408562; AI436323; Hs.31141; roundabout (axon guidance receptor, Pros' i 0l fn3;TM=M;SS=N; 9 36 

428970; BE276891; Hs.194691; retinoic acid induced 3 (RAIG1); metabo; 7tm_3;TM=Y;SS=M; 9.72 

438089; W05391; Hs.351545; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 9.58 

411 9 / r f Hs.355702; cell d ion c:e Mike 1 (PITSI _-f hoik i jii 5,5 

450451; AW591528; Hs.202072; ESTs; none,none; 9.53 

456062; AI866285; Hs.71962; ESTs, Weakly similar to B36298 proline-r; none,none; 9.50 
418113; AI272141; Hs.83484; SRY (sex determining region Y)-box 4; HMG_box,homeobox;TM=M;SS=N; 9.38 

412791; AI131192; Hs.143199; ESTs, Weakly similar to S72481 probable ; pkinase,PBD,none; 9.36 

432435; BE218886; Hs.282070; ESTs; none,none; 9.35 

416530; U62801; Hs.79361; kallikrein 5 (neurosin, zyme); trypsin;TM=M;SS=M; 9.32 

439018; AW300687; Hs.26638; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 9.23 

445537; AJ245671; Hs.12844; EGF-like-domain, multiple 6; EGF,MAM;; 9.19 

410407; X66839; Hs.63287; carbonic anhydrase IX; carb_anhydrase;TM=M;SS=M; 9.19 

417165; R80137; Hs.302738; Homo sapiens cDNA: FLJ21425 fis, clone C; Sulfate_transp,STAS,HMG_box; 9.17 

453459; BE047032; Hs.257789; ESTs; none,none; 9.14 

431674; AA098901; Hs.301642; G-protein coupled receptor; none,GCVH; 9.05 

418004; U37519; Hs.87539; aldehyde dehydrogenase 3 family, member ; aldedh;TM=M;SS=M; 9.00 

413278; BE563085; Hs.833; interferon-stimulated protein, 15kDa; ubiquitin;; 8.93 

436954; AA740151; Hs.130425; ESTs; none,none; 8.91 

420344; BE463721; Hs.97101; putative G protein-coupled receptor; Methyltransf_5;TM=Y;SS=M; 8.89 

425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c;; 8.85 

407792; AI077715; Hs.39384; putative secreted ligand homologous to f; none;TM=M;SS=Y; 8.80 

451027; AW519204; Hs.40808; Homo sapiens, Similar to RIKEN cDNA 2810; none,none; 8.79 

422809; AK001379; Hs.121028; hypothetical protein FLJ10549; IQ;TM=M;SS=N; 8.72 

413385; M34455; Hs.840; indoleamine-pyrrole 2,3 dioxygenase; IDO;TM=M;SS=N; 8.70 

444784; D12485; Hs.11951; ectonucleotide pyrophosphatase/phosphodi; Somatomedin_B,Endonuclease,Phosphodiest;TM=Y;SS=M; 8.69 

421502; AF11185S; Hs 1Q5Q39; solute earner family 34 (sodium pliospha; Ribo=Lmal I I i TM=YSS=N,867 

437935; AW939591; Hs.5940; mucin 13, epithelial transmembrane; EGF,SEA;TM=Y;SS=M; 8.56 

408692; AL040127; Hs.34074; dipeptidylpeptidase VI; DPPIV_N_term,Peptidase_S9,none; 8.55 

414812; X72755; Hs.77367; monokine induced by gamma interferon; IL8;TM=M;SS=Y; 8.53 

428187; AI687303; Hs.285529; G protein-coupled receptor 49; 7tm_1,none; 8.49 

148672; AI95551 1 H « « r cjchon,ANF_receptor,SBP_bac_3;TM-Y;SS=M; 3,44 

425776; U25128; Hs.159499; parathyroid hormone receptor 2; 7tm_2,HRM;TM=Y;SS=M; 8.40 

443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=M;SS=N; 8.40 

452093; AA447453; Hs.27860; Homo sapiens mRNA; cDNA DKFZp586M0723 (f; 7tm_1,none; 8.33 

407894; AJ278313; Hs.41143; phosphoinositide-specific phospholipase ; C2,PI-PLC-Y,PI-PLC-X;TM=M;SS=N; 8.23 

409799; D11928; Hs.76845; phasphoserine phobphal like; Hyd T" S I I 

419508; AW997938; Hs.90786; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y;SS=M; 8.20 

424441; X14850; Hs.147097; H2A histone family, member X; histone,CBFD_NFYB_HMF;; 8.20 

408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;SS=N; 8.00 

415752; BE314524; Hs.78776; putative transmembrane protein; none;TM=Y;SS=N; 7.99 

422608; AW160644; Hs.118695; potassium voltage-gated channc -urt ionJrans.KJ ™ s=n )9 

433001; AF217513; Hs.279905; clone HQ0310 PRO0310p1 ; none;; 7.95 

409649; AA159216; Hs.55505; hypothetical protein FLJ20442; Y_phosphatase,DSPc;TM=M;SS=N; 7.95 
424078; AB00 He. 1390 paternally c . zf-C2H2,KRAB,nonc; 7.86 

432179; X75208; Hs.2913; EphB3; EPH_lbd,fn3,pkinase,SAM;TM=Y;SS=M; 7.85 

424531 M62052 Hs15 ^ cate in (cadherin-as a led protein) a Moulin DHAJigase_ZBD;TM=M;SS=N; 7.84 

420510; AI683183; Hs.99348; distal less homeo box 5; homeobox;TM=M;SS=N; 7.81 

436B A469J lls.127310 r e r M S tl 81 

411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; none;TM=Y;SS=M; 7.80 

407811; AW190902; Hs.40098; cysteine knot superfamily 1, BMP antagon; TGF-beta,DAN;; 7.78 

424399; AI905687; Hs.348419; AI905687:IL-BT095-190199-019 BT095 Homo ; none;TM=M;SS=M; 7.65 

418835; AI655499; Hs.161712; ESTs; pkinase,Activin_recp,PDZ,ZU5,death; 7.64 

435793; AB037734; Hs.4993; KIAA1313 protein; none;TM=M;SS=N; 7.61 

426201; AW182614; Hs.128499; ESTs; SH3,none; 7.59 

447400; AK000322; Hs.18457; hypothetical protein FLJ20315; zf-C3HC4;TM=Y;SS=M; 7.55 

410850; AW362867; Hs.302738; Homo sapiens cDNA FLJ21425 fis, clone C; Sulfate_transp,STAS,HMG_box; 7.55 

453464; AI884911; Hs.32989; receptor (calcitonin) activity modifying; none;TM=Y;SS=N; 7.54 

417115; AW952792; Hs.334612; small nuclear ribonucleoprotein polypept; Sm,pkinase;; 7.52 

437897; AA770561; Hs.146170; hypothetical protein FLJ22969; zf-DHHC,none; 7.38 

443991; NM_002250; Hs.10082; potassium intermediate/small conductance; CaMBD,SK_channel,ion_trans;TM=Y;SS=M; 7.36 

414617; AI339520; Hs.288817; ESTs, Moderately similar to N Chain N, M; hexokinase,hexokinase2;TM=Y;SS=N; 7.35 

422017; NM_003877; Hs.110776; STAT induced STAT inhibitor-2; SH2;; 7.33 

424834; AK001432; Hs.153408; Homo sapiens cDNA FLJ10570 fis, clone NT; none,none; 7.30 

409041; AB033025; Hs.50081; Hypothetical protein, XP_051860 (KIAA119; none;TM=M;SS=M; 7.28 

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; IL1;; 7.28 

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; Rhodanese,DSPc,Y_phosphatase,Ribosomal_S3_N;TM=M;SS=N; 7.28 
418506; AA084248; Hs.372651; Unknown protein for MGC29643 (formerly ; none,none; 7.27 

448913; AA194422; Hs.22564; myosin VI; rrm,zf-RanBP,pkinase,GST_C,Ets,SAM_PNT,ABC2_membrane,myosin_head,IQ,Myosin_N,bZIP,zf-C2H2,PHD,BTB,TFIIS,AT_hook,SAM;TM=M;SS=N; 7.26 
409340; BE174629; Hs.321130; hypothetical protein MGC2771; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;SS=N; 7.26 
424317; AI865032; Hs.26017; ESTs; none,pkinase; 7.21 

410361; BE391804; Hs.62661; guanylate binding protein 1, interferon-; GBP,GBP_C;TM=Y;SS=M; 7.21 
428450; NM_014791; Hs.184339; KIAA0175 gene product; KA1,pkinase;TM=M;SS=N;7.14 
438707; L08239; Hs.5326; amino acid system N transporter 2; porcu; ACAT,MBOAT;TM=Y;SS=M; 7.05 
423011; NM_000683; Hs.123022; adrenergic, alpha-2C-, receptor; 7tm_1;TM=Y;SS=M; 7.03 
435021; AA922192; Hs.73962; ESTs; EPH_lbd,pkinase,fn3,SAM,none; 7.02 
446163; AA026880; Hs.25252; prolactin receptor; none;NA;NA; 7.01 

447768; X86400; Hs.19520; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=M;SS=N; 7.00 

439453; BE264974; Hs.6566; thyroid hormone receptor interactor 13; AAA,ABC_tran,CcaE;TM=M;SS=N; 6.99 

451035; AU076785; Hs.430; plastin 1 (I isoform); efhand,CH,Adaptin_N;; 6.99 

450581; AF081513; Hs.25195; TGF-beta4; TGF-beta,TGFb_propeptide;; 6.95 

424054; AA334511; Hs.26638; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 6.93 

432519; AI221311; Hs.130704; ESTs, Weakly similar to BCHUIA S-100 pro; none,none; 6.93 

436481; AA379597; Hs.5199; HSPC150 protein similar to ubiquitin-con; UQ_con;TM=M;SS=N; 6.92 

419693; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 6.92 

437139; W73685; Hs.118513; ESTs, Weakly similar to RTA RAT PROBABLE; 7tm_1;TM=Y;SS=M; 5.87 

418054; NM_002318; Hs.83354; lysyl oxidase-like 2; SRCR,Lysyl_oxidase;TM=M;SS=M; 6.87 
418203; X54942; Hs.33758; CDC28 protein kinase 2; CKS;; 6.86 

410467; AF1 02546; Hs.53931, dachshun l(C c phi homoog;» r TVMSS=M;6.35 

425247; NM_005940; Hs.155324; matrix metalloproleinase 1 1 (sirci n /sin -pita H xin Peptidase_M10;; 6.B5 

453064; R40334; Hs.89463; potassium large conductance calcium-acti; none,none; 6.83 

452046; AB018346; Hs.27657; KIAA0802 protein; none;TM=M;SS=N; 6.79 

417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 6.79 

422293; X94453; Hs.114366; pyrroline-5-cartoxy c yn!helase(g ., i r -TM-M SS=N' 677 

431470; AA832417; Hs.139650; ESTs; none,ig,pkinase,LRR,LRRCT; 6.76 

418738;AW388633; Hs.6682; solute carrier family 7 (, Horn wine r„ne rone; 6.75 

418751; BE389014;Hs.372548;phosphoinositide 3 kirase, regul t H2,none 4 

417886; AA214684; ; ESTs; SPRY,7tm_3,ANF_receptor,none; 6.72 

412926; AI879076; Hs.75061; macrophage myrisicylated alarine-rich C : VARCKS" 6 70 

437960; AI669586; Hs.369312; ESTs; none,none; 6.68 

428953; AA306610; Hs.348183; tumor necrosis factor receptor superfami; 60s_ribosomal,Ribosomal_L10,TNFR_c6,DEAD;; 6.66 
444006; BE395085; Hs.334762; type I transmembrane protein Fn14; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 6.65 
413040;M193338;Hs.12321;sodiumcalciumexchanger;Na_Ca_L 3S=M 664 
449656; AA002008; Hs.188633; ESTs; PIP5K,none; 6.64 

447495; AW401864; Hs.18720; programmed cell death 8 (apoptosis-induc; pyr_redox;TM=M;SS=N; 6.62 

446063 AI72014D HS16II-4 " r is r unnel none bfcl 

424762; AL119442; Hs.183684; eukaryotic translation initiation facto; none,none; 6.60 

421554 AW137C7C , 75; EST nenone;5.59 

418478; U38945; Hs.1174; cyclin-dependent kinase inhibitor 2A (me; ank;; 6.55 

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a)-related k; pkinase;TM=M;SS=N; 6.54 

448730; AB032983; Hs.21 394; KIAA11 r 7,m in PP2C;TM=M;SS=N 654 

433577; AW007080; Hs.284192; ESTs; none,none; 6.53 

422627; BE336857; H r 1 1 37; transforr ing growth factor, beta-induced; Fasdclin,ABC_lran,ABCmembrane,GTP EFTU;TM=M;SS=M; 6 53 
442133; AW874138; Hs.129017; ESTs; type la transmembrane protein; LRR,LRRNT,LRRCT;TM=Y;SS=M; 6.52 
430259; BE550182; Hs.375142; RalGEF-IIke protein 3, mouse homolc 
434263; N34895; Hs.79187; ESTs; ig,none; 6.49 

418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK; Y_phosphatase,DSPc;TM=M;SS=N; 6.48 
419942; U25138; Hs.93841; potassium large conductance calcium-acti; CaKB;TM=Y;SS=M; 6.47 
421064; AI245432; Hs.101382; tumor necrosis factor, alpha-induced pro; none;TM=M;SS=N; 6 4? 
432636; AA340864; Hs.278562; claudin 7; PMP22_Claudin;TM=Y;SS=M; 6.45 

431685; AW296135; Hs.267659; vav 3 oncogene; CH,DAG_PE-bind,PH,RhoGEF,SH2,SH3,DC1;TM=M;SS=N; 6.44 
428832; AA578229; Hs.324239; ESTs, Moderately similar to ZN91_HUMAN Z; Osteopontin,none; 6.39 
436775; AA731111; Hs.372225; ESTs; none,none; 6.39 

424343; AW956360; Hs.4748; adenylate cyclase activating polypeptide; 7tm_2,HRM,none; 6.37 

1 11 104476; ESI HU1 | r non = TM YSS=M;6.37 

438209;AL120659;Hs.6111;aryl-hydrocarbonreceptornucieartransl;HLH,PA c :iL rM=t S 16 
438993; AA828995; ; gb:od77b08.s1 NCI_CGAP_Ov2 Homo sapiens ; EGF,metalthio,integrin_B,PSI rone- 6 27 
406400 I- =llikrein 8 (neuropsin/ovasin) (KLK8); lrypsir.;TV=M;SS-M; 6 27 
429556; AW1 39399; H- 314807 EET nons;TM=M;SS=N 

409259; AA576953; Hs.22972; steroid 5 alpha-reductase 2-like; H5AR g; Steroid_dh;TM=Y;SS=M; 6.25 
435732; AF229178; Hs.123136; leucine rich repeat and death domain con; none,none; 6.24 
439668; AIQ91 277; Hs.3Q2634; frizzled (Drosophila) homolog 8 Fi I.Fz.7tm_2,lcxii M r 
412276; DE262 4 7 7 3 n opl gemig r inl ory I tor ( MIF.suga-J-,noi 6.23 
436961 AW- 75974 H 156704 ESTs non none 5.23 
434808, AF1 55108, Hs.2 150; NY-REN-41 anti r; r no TM M SS=N; 6.22 

440006; AK000517; Hs.6844; NALP2 protein PYRItl Corl Pi AAJME M F ^D_DAPIN;NA;NA; 6 20 

16224; NM_002902; H 38; reticule n 2, EF-hand calcium birdin; e 
449027; AJ271216; Hs.22880; dipepSdylpeptidase III; Peptidase J'49 EGr.ig.N regu ir TM-M,SS=N; 6.19 
418318; U47732; Hs.84072; transmembrane 4 superfamily membe- 3: lransrnembrane4;TM=Y-SS=M- 6 11 
452551; L27071; Hs.29877; TXK tyrosine kinase; Beach,WD40,SH2,SH3,pkinase;TM=M;SS=N; 6.14 
418969; W33191; Hs.28907; hypothetical protein FLJ20258; SH3;TM=M;SS=N; 6.12 
445452; AA378776;H- 233640 hypcthcn e protein MGC3077; n 6.1 
456534; X91195; Hs.100623; phospholipase C, beta 3, neighbor pseudo; LIM,PDZ,pkinase;; 6.11 
449700; L02867; Hs.78358; paraneoplastic antigen; none;TM=M;SS=N; 6.10 

428479; Y00272 Hs.334562 )i e 2, G1 to S and G2 r irE P 10,ICE_p20;TM=U;SS=M- 6.10 

41 6209; AA236776; Hs.79078; MAD2 (mitotic arrest deficient je^t t l-P IA;Tf M tl 6 39 
433159; AB035898; Hs.150587; kinesin-like protein !; bZIP.kinesh l £ 

432;/ 1 Hs.11 1; ESTs; ig i F I r n 7 
409619; AK001015; Hs.55220; BCL2-associated athanogene 2 BAG;TM=M;SS=N; 6.04 
412723; AA648459; Hs.335951 ; hypothetical protein AF301222; none;TM=M;SS=N; 6.03 
424273; W40460; Hs.144442; phospholipase A2, group X; phoslip;TM=M;SS=Y; 6.03 
409430; R21945;Hs34o pi oh r DSPc.Rhodanese.none; 6.00 

421143; AB024536; Hs.102171;immunoglobulin superfamily containing le; ig,LRR,LRRNT,LRRCT;TM=M;SS=M;6.00 
428677; AI657119; Hs.351582; troponin I, cardiac; none;TM=M;SS=N; 6.00 
450098; W27249;Hs.81 09, hypothetic pre n TL J 2 1 0 00 El [» ND;TM=M;SS=N; 6.00 
410422; AL042014; Hs.63348; Homo sapiens, clone MGC15203, mRNA, com; C1q,Collagen;; 5.99 
419972; AL041465; Hs.182982; golgin-67; none,none; 5.99 
421251; Z28913; Hs.102948; enigma (LIM domain protein); LIM,PDZ;; 5.97 
403352; ; ; NM_001615*:Homo sapiens actin, gamma 2, ; actin;; 5.95 
451541; BE279383; Hs.26557; plakophi r r , eTUMSS-N,5 95 
420253; AI656O55;Hs.9620O; neighbor of A-kinas nchorlr.q protein 9 t I 5.93 
421506; BE302796; Hs.105097; thymidine kinase 1, soluble; TK;TM=M;SS=N; 5.93 
450447; AF2 12221 -I 10; hypothetical teinPI 2; NTF '1 1 SS-N; 5.92 

J 1 III 111 E 1 n L r 

^^tS 

429910; NM_000867; Hs.2507; 5-hydroxytryptamine (serotonin) receptor; 7tm_1;TM=Y;SS=N; 5.90 
447131; NM_004585; Hs.1746C c J ponds- (I none;Tt N 

437952; D53209; H n 1 r led; none;T f 

446291; BE397753; Hs.14623; interferon, gamma-mc I c c n r;TM=M;E 
453102; NM_007197; Hs.31664; frizzled (Drosophila) homolog 10; Fz,Frizzled,7tm_2;TM=Y;SS=M; 5.87 
426761; AI015709; Hs.172089; PORIMIN Pro-oncosis receptor indjcinq, ie r ne;TJ y 1 5 
425367; BE271188; Hs.155975; protein tyrosine phosphatase, receptor t; none;TM=M;SS=Y; 5.85 
426108; AA622037; Hs.166468; programmed cell death 5; DUF122;TM=M;SS=N; 5.84 
450502; T08065; Hs.118262; ESTs; ion_trans,ion_trans; 5.84 

442652; AI005163; Hs.201378; Homo sapiens cDNA FLJ40427 fis; none;TM=M;SS=N; 5.83 
424917; AI636208; Hs.96901; hypothetical protein FLJ23049; none;TM=M;SS=N; 5.83 

448569; BE382657; Hs.21486; signal transducer and activator of trans; SH2,STAT,STAT_bind,STAT_prot;TM=M;SS=N; 5.82 
422616; BE300330; Hs.118725; selenophosphate synthetase 2; AIRS,AIRS_C;TM=M;SS=N; 5.82 

445133; AW157646; Hs.198689; ESTs; efhand,spectrin,GAS2,SH3,Plectin,RA,Xylose_isom,FliD,bZIP,Tropomyosin,Myc-LZ,M,ldh_C,CH,AIP3;TM=M;SS=N; 5.79 

426215; AW963419; Hs.155223; stanniocalcin 2; Stanniocalcin;; 5.78 

414482; S57498; Hs.76252; endothelin receptor type A; 7tm_1;TM=Y;SS=M; 5.75 

414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); PA;TM=Y;SS=N; 5.74 

452683; AI089575; Hs.374574; progesterone membrane binding protein; homeobox,none; 5.72 

432201; AI538613; Hs.298241; Transmembrane protease, serine 3; ldl_recept_a,trypsin;TM=Y;SS=M; 5.72 

429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAM; 5.72 

449458; AI805078; Hs.208261; ESTs; Frizzled,Fz,none; 5.72 

418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxylic; none;TM=Y;SS=M; 5.71 
418848; AI820961; Hs.193465; ESTs; PDZ,pkinase,none; 5.70 

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mR; 7tm_1;TM=Y;SS=M; 5.69 

411190; AA306342; Hs.69171; protein kinase C-like 2; pkinase,pkinase_C,HR1;TM=M;SS=N; 5.69 

411263; BE297802; Hs.69360; kinesin-like 6 (mitotic centromere-assoc; kinesin;TM=M;SS=N; 5.69 

445136; AI348014; Hs.143949; ESTs, Weakly similar to Achaete-Scute ho; ion_trans,ion_trans; 5.69 

409223; AA312572; Hs.362B52; phosphoinontid- inase, requleton SH2.SH ! iGAF none 5 6 7 

430016; NM_004736; Hs.227656; xenotropic and polytropic retrovirus rec; SPX,EXS;TM=Y;SS=N; 5.66 

429638; AI916ob2 Hs211> ir clir ! ( inesin raptor); bZIP,Traponyosin,spectrin,LBP_BPLCETP,B5B,M;TM=Y;SS=M; 5.65 

450334; AF035959; Hs.24879; phosphatidic acid phosphatase type 2C; PAP2;TM=Y;SS=M; 5.64 

453950; AA156998; Hs.348037; eukaryotictranslatcn in liati i factor; n 'I 

425889; M57414; Hs.161305; tachykinin receptor 2; 7tm_1;TM=Y;SS=M; 5.64 

432527; AW975028; Hs.102754; ESTs; none,none; 5.64 

441384; AA447849; Hs.288660; retinoic acid induced 3; 7tm_3,none; 5.63 

419080; AW150835; Hs.18878; hypothetical protein FLJ21620; 2OG-FeII_Oxy;TM=M;SS=N; 5.63 

447217; BE465754; Hs.17778; neuropilin 2; CUB,MAM,F5 F8_type_l TW=M -'1 5.61 

440422; AW4-2696 H 1'C7bO phosph FBI non 558 

431341; AA30721 1; Hs.251531; proteasome (prosome, macropain) ,ll nil r, e;-:some ~M-M,SS=N; 5.58 

432805; X94630; Hs.3107; CD97 antigen; 7tm_2,EGF,GPS,FecCD;TM=Y;SS=M; 5.55 

449230; BE613348; Hs.356392; melanoma cell adhesion molecule; ig,isodh,RibosomaU6,F-box;TM=Y;SS=M; 5.55 

441607; NM_005010; Hs.7912; neuronal cell adhesion molecule; WD40,fn3,ig;TM=M;SS=N; 5.54 

400303; AA242758; Hs.79136; LIV-1 protein, estrogen regulated; none,none; 5.54 

434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 5.53 

404210; ; ; NM_005936:Homo sapiens myeloid/lymphoid ; FHA,PDZ,RA,DIL;TM=M;SS=N; 5.53 

408051; AI623351; Hs.172148; ESTs; PH,RhoGAP,none; 5.53 

436726; AA324975; Hs.198639; ESTs, Weakly similar to T00079 hypotheti; efhand,spectrin,GAS2,SH3,Plectin,RA,Xylose_isom,FliD,bZIP,Tropomyosin,Myc-LZ,M,ldh_C,CH,AIP3;TM=M;SS=N; 5.53 
416084; L16991; Hs.79006; deoxythymidylate kinase (thymidylate kin; none,none; 5.52 
428667; AI375550; Hs.346868; nucleolar protein p40; homolog of yeast ; none,none; 5.51 
433907; AW296107; Hs.152686; ESTs; Armadillo_seg,none; 5.50 
442821; BE391929; Hs.8752; transmembrane protein 4; none;; 5.50 
422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 5.49 
439820; AL360204; Hs.283853; Homo sapiens mRNA full length insert cDN; none,none; 5.49 
428771; AB028992; Hs.193143; KIAA1069 protein; C2,PI-PLC-Y,PI-PLC-X;TM=M;SS=N; 5.48 
452256; AK000933; Hs.28661; Homo sapiens cDNA FLJ10071 fis, clone HE; GDI,7tm_1,none; 5.48 
442013; AA506476; Hs.375009; Human DNA sequence from clone RP11-353C1; none,none; 5.48 
408056; AA312329; Hs.42331; ephrin-A4; Ephrin;TM=M;SS=M; 5.47 
422765; AW409701; Hs.1578; baculoviral IAP repeat-containing 5 (sur; BIR;TM=M;SS=N; 5.47 
420297; AI628272; Hs.123757; ESTs, Weakly similar to ALU1_HUMAN ALU S; pkinase,TUDOR,none; 5.47 
428385; AF112213; Hs.184062; putative Rab5-interacting protein; SH2,SH3;; 5.46 

424517; AI539443; Hs.137447; Homo sapiens cDNA FLJ12169 fis, clone MA; SH2,STAT,STAT_bind,STAT_prot,none; 5.45 

441560; F13386; Hs.7888; v-erb-a avian erythroblastic leukemia vi; pkinase,Recep_L_domain,Furin-like,YLP,none; 5.44 

414883; AA926960; Hs.348669; CDC28 protein kinase 1; CKS;; 5.43 

450402; BE218027; Hs.89969; ESTs; SH3,none; 5.42 

428484; AF104032; Hs.184601; solute carrier family 7 (cationic amino ; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;SS=N; 5.42 
430696; AA531276; Hs.59509; ESTs; pkinase,PP2C,none; 5.42 

412350; AI659306; Hs.73826; protein tyrosine phosphatase, non-recept; Y_phosphatase,Band_41,PDZ;TM=M;SS=N; 5.42 

444783 AK001468 HS 62180 anilli [ a homolog) IPHrone,541 

448379; AI097463; Hs.21035; KIAA1130 protein; none,Zip; 5.41 

' - ■.■ 1':l > ■':.11.'.!. i*i [■•■:■ :■ i ■ ■:<:,,■ : 1. r,vr, TM=M SS=N; 5.41 

1 Ir , )2S00;Hs.7 i r n-activated teinkir kina ki T" " o M r 1 

445413; AA151342; Hs.12677; CGI-147 protein; UPF0099;TM=M;SS=M; 5.39 

451863; AU20634; Hs.331803; ATPase, Ca transporting, plasma membrane; cpn60_TCP1,E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase;; 5.38

442875; BE623003; Hs.23625; Homo sapiens clone TCCCTA00142 mRNA sequ; K_tetra,DUF51,none; 5.38

439963; AW247529; Hs.6793; platelet-activating factor acetylhydrola: PAF-AH Jb,Lipase_GDSL;TM=M;SS=N; 5.36 

450825; AC005954; Hs.25527; tight junction protein 3 (zona occludens; PDZ,Guanylate_kin;; 5.34

441031; AI110684; Hs.7645; fibrinogen, B beta polypeptide; fibrinogen_C,G-alpha,arf;TM=M;SS=M; 5.33

408369; R38438; Hs.118747; SLC15A2 Solute carrier family 15 (H+/pep; PTR2;TM=Y;SS=N; 5.33

435391; AA704588; Hs.58934; ESTs; PIP5K,none; 5.33 

411779; AA292811; Hs.72050; non-metastatic cells 5, protein expresse; NDK;; 5.33

422170; AI791949; Hs.112432; anti-Mullerian hormone; TGF-beta;; 5.32 

447350; AI375572; Hs.172634; v-erb-a avian erythroblastic leukemia vi; pkinase,Recep_L_domain,Furin-like,YLP,none; 5.32

19964; AW001741; h i I Ir 110 OE pi r T'1 N ! 1 

426427; M86699; Hs.169840; TTK protein kinase; pkinase;; 5.30
430407; H23551; Hs.30974; ESTs; pkinase,PBD,none; 5.29
416847; L43821; Hs.80261; enhancer of filamentation 1 (cas-like dc; SH3;TM=M;SS=N; 5.27

425308; M97639; Hs.1 55585; receptor tyrosine kinase-like orphan r j rq!< ki e FzTM=Y:SS=M; 5.27 

424596; AB020639; Hs.151017; estrogen-related receptor qemrna r - rc CA-JW 

423013; AF151020; Hs.181444; hypothetical protein; none;TM=Y;SS=M; 5.26 

447384, AI377221 Hs.405 Er -.rii ,orb ncne, 5 26 

441824; AB007871; Hs.7977; KIAA0411 gene product; SH3,RhoGAP;TM=M;SS=N; 5.26 

438493; AM 30740, H 6241 phospkoinos Mnasi «n I i. n i SH2,SH3,RhoGAP;TM=M;SS=N; 5.26 

428579; NM_005756;Hs.184942;G protein-coupled receptor 64 7 tm 2 F 1 -t 3S M ! 2£ 

414359; M62194; Hs.75929; cadherin 11, type 2, OB-cadherin (osteob; cadherin,Cadherin_C_term;TM=Y;SS=M; 5.25

426440; BE382756; Hs.169902; solute carrier family 2 (facilitated glu; sugar_tr;TM=Y;SS=M; 5.24

427157; U51166; Hs.173824; thymine-DNA glycosylase; UDG;TM=M;SS=N; 5.24

423685; BE350494; Hs.49753; uveal autoantigen with coiled coil domai; ank,bZIP,Vi,DUF164,AIP3;; 5.23

452721; AJ269529; Hs.301871; solute carrier family 37 (glycerol-3-pho; MORN,sugar_tr;TM=Y;SS=M; 5.23

427747;AW411425;Hs.180655;serine/threoninekirase12,pkinrs- rM=f .,-'1 ' 2 

417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofilin_ADF;; 5.23 

412507; L36645; Hs.73964; EphA4;fn3,pkinase,SAM EFH lbd;TM= ';SS=M 5.2 

426770; AI948618; Hs.150178; ESTs; Sulfate_transp,STAS;TM=Y;SS=N; 5.23

422583; AA410506; Hs.27973; KIAA0874 protein; ank,G-alpha;TM=M;SS=N; 5.22 

414368; W70171; Hs.75939; uridine monophosphate kinase; PRK,CoaE;; 5.22

448093; AW977382; Hs.15898; 2,4-dienoyl CoA reductase 2, peroxisomal; adh_short;; 5.21

443646; AI085198; Hs.164226; Thrombospondin 1; EGF,tsp_1,vwc,TSPN,tsp_3,none; 5.18

457916; BE085271; Hs.8834; ring finger protein 3; pkinase,none; 5.18

433933; AI754339; Hs.355397; Homo sapiens clone TCCCIA00164 mRNA sequ; none;NA;NA; 5.18
436469; AK001455; Hs.5198; Down syndrome critical region gene 2; none;; 5.17 

433662; W07162; Hs.150826; RAB25 RAB25, member RAS oncogene family; ras,ABC_tran,arf;TM=M;SS=M; 5.17

11 r i tr r l j l in r I i ; Myosir rl r I 

409132; AJ224538; Hs.50732; protein kinase, AMP activated beta I r n fM M;SS-N; 5.1 
454438; AA224053; Hs.172405; cell division cycle 27; SPRY,7tm_3,ANF_receptor 5.14 
413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; none;TM=Y;SS=M; 5.14 

2051 r F I 1 r i rier family 25 (mitocl I r 1" 1 5.1 

415474; NM_014252; Hs.78457; solute carrier family 25 (mitochondrial ; mito_carr;TM=M;SS=N; 5.14
411704; AI499220; Hs.71S73; hypothetical protein FU10074 p-ina I ' " II 13 
422278; AF072873; Hs.114218; frizzled (Drosophila) homolog 6; Fz,Frizzled,7tm_2;TM=Y;SS=M; 5.13
4541 28; AL031259;Hs.367900; programmed cell death 2, zf MrTir I M;SS 1 
431322; AW97G622; Hs.376626; gb:EST382704 MAGE resequences, MAGK Homo; none,none; 5.13
444754; T83911; Hs.11881; transmembrane 4 superfamily member 4; none;TM=Y;SS=M; 5.12
422867; L32137; Hs.1584; cartilage oligomeric matrix protein (pse; tsp_3,EGF;; 5.12
416498; U33632; Hs.79351; potassium channel, subfamily K, member 1; ion_trans;TM=Y;SS=M; 5.11
436494; AA720997; Hs.128295; ESTs; none,CAP_GLY,HCO3_cotransp,Glyco_hydro_3,PH; 5.11
419833; AA251131; Hs.220697; Homo sapiens tryptophanyl-tRNA synthetas; WHEP-TRS,tRNA-synt_1b,none; 5.10
453387; AI990741; Hs.252809; ESTs; Na_Ca_Ex,none; 5.07
413076; U10564; Hs.75188; wee1 (S. pombe) homolog; pkinase;TM=M;SS=N; 5.07

456906; AF117646; Hs.156637; Cas-Br-M (murine) ectopic retroviral ir; zf-C3HC4,Cbl_N,Cbl_N2,Cbl_N3;TM=M;SS=N; 5.07
438746; AI885815; Hs.184727; Human melanoma a-ociateJ anti .>n , 7(m;t nsfemn.Gr nylal hn PDZSH3,507 

18520; AB00 I 1 r d t It n i Mt I in i T "I r E e 
413745; AW247252; Hs.75514; nucleoside phosphorylase; Mtap_PNP;; 5.06 

407235; D20569; Hs.169407; SAC2 (suppressor of actio m nations 2. y none,Ribosomal..S1 3,GalactosyLT,Zip,adh_short,zf-C3HC4; 5.06 

421369;NMJ)05089;Hs.171909;U2smallnuclearribonudeoprolsnar r m f-rCTH nj ntegrir C TM=M,SS=N, 5 06 

412170; D16532; Hs.73729; very low density lipoprotein receptor; ldl_recept_a,ldl_recept_b,EGF;TM=M;SS=M; 5.06

442599; AF078037; Hs.324051; RelA-assocrated rnhit H r iMf^S=N;5.05 

421109; L32832; Hs.101842; AT-binding transcription factor 1; HMG14_17,homeobox,zf-C2H2;TM=M;SS=M; 5.05 

453BB0; AI803166 Hs.135121 t t .' r t i Vimilar to 138022 hypotheti; HSP70,none; 5.05 

431512; BE270734; Hs.2795; lactate dehydrogenase A; Idh ldh_ r H r e.UO Tt'l'G C 

43541 1; AW444619; Hs.13B211 ESI , pi a r 04 

419088; AI53S323; Hs.367688; integrin, beta 8; integrin_B,none; 5.04

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease(APEX ; Troponin,Exo_endo_phos,IQ;TM=M;SS=N; 5.04

428376; AF119665; Hs.184011; pyrophosphatase (inorganic); Pyrophosphatase;TM=M;SS=N; 5.03

413073; AL038165; Hs.75187; translocase of outer mitochondrial membr; MAS20,zf-A20,VPS9;TM=M;SS=M; 5.03

43641 5; BE265254; Hs.343258; proliferation-associated 2G4 q o F a r I c > ep L domain efhand; 5 01 

449674; AW444937; Hs.233482; ESTs; C2,PI-PLC-Y,PI-PLC-X,none; 5.01 

445333; BE537641 ; Hs.44278; hypothetical protein f I 112 3 similar lo as arfTK;; 5.01 

412133; U83460; Hs.104557; solute carrier family 31 (copper transpo; none;TM=Y;SS=N; 5.01

446488; AB037782; Hs.15119; KIAA1361 protein; pkinase;; 5.00

449474; AA019344; Hs.2055; ubiquitin-activating enzyme E1 (A1S9T an; ThiF,UBACT,pkinase,UCH-2,UCH-1,rrm,zf-C2H2,zf-RanBP,G-patch; 5.00

416365; U15131; Hs.79265; suppression of tumorigenicity 5; DENN,dDENN,uDENN;TM=M;SS=N; 5.00

421351; AU0766S7; Hs.103755; reoeptor-interactmq i ^ L r h fM V SS-N 4 99 

413219; AA878200; Hs.113727; Homo sapiens cDNA FLJ13692 fis, clone PL; HLH,death,TNFR_c6,Acyl-CoA_hydro; 4.98

442007; AA301116; Hs.142838; nucleolar phosphoprotein Nopp34; rrm,IRK;; 4.95

441085; AW136551; Hs.181245; Homo sapiens cDNA FLJ12532 fis, clone NT; none,none; 4.95

426310; NM_000909; Hs.169266; neuropeptide Y receptor Y1; 7tm_1;TM=Y;SS=M; 4.95

457718; F18572; Hs.22978; ESTs, Weakly similar to ALU4_HUMAN ALU S; pkinase,pkinase; 4.94

408805; H69912; Hs.48269; vaccinia related kinase 1; pkinase;TM=M;SS=N; 4.94 

427541; AI798983; Hs.375835; solute carrier family 35 (CMP-sialic aci; none,none; 4.94

452792; AB037765; Hs.30652; KIAA1344 protein; thiored;TM=M;SS=M; 4.93 

430713; AA351647; Hs.2642; eukaryotic translation elongation factor; GTP_EFTU,GTP_EFTU_D3,GTP_EFTU_D2;; 4.93
444838; AV651680; Hs.208558; ESTs; integrin_A,FG-GAP,none; 4.93 

440516; S42303; Hs.161; cadherin 2, type 1, N-cadherin (neuronal; HNH,cadherin,Cadherin_C_term;TM=M;SS=N; 4.92
421302; T34462; Hs.103291; neuritin; none;TM=M;SS=Y; 4.91 

452291; AF015592; Hs.28853; CDC7 (cell division cycle 7, S. cerevisi; pkinase;TM=M;SS=N; 4.91
408657; AA782601; Hs.378549; ESTs; B56,none; 4.91 

439750; AL359053; Hs.57664; Homo sapiens mRNA full length insert cDN; IMPDH_C,IMPDH_N,CBS,integrin_B,Ricin_B_lectin; 4.91
438564; AA381553; Hs.198253; major histocompatibility complex, class ; ig,MHC_II_alpha,none; 4.89

427640;AF058293;Hs.180015;D-dopachrometautomerase;COX3SH-.-J»1li , SSI r „:no_domain,GST N,S1,Fz,Frizzled calreticulin 7tm 2mtiPAP assocTM=YSS=M- 4 RR 

434521; NM_C02267; Hs.3886; karyopherin alpha 3 (importin alpha 4); Armadillo seg IBB TM=M;SS=N; 4.88 m,nm.Armn,hAK_assoc,TM Y.SS M.4.88 

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity Ia, re; ig;TM=Y;SS=M; 4.88

424118; BE2B9041; Hs.140452; cargo selection prolpir ri phosp penlpm„488 

410639; BE269047; Hs.65234; hypothetical prots n F L I205S6; DEAD helic i : C.PRI ,AIP3;TM=M;SS=N; 4.B7 

417089; H52280; Hs.18612; Homo sapiens cDNA: FLJ21909 fis, clone H; voltage_CLC,CBS,none; 4.86

429303; AW137635; Hs.44238; ESTs, Weakly similar to S65657 alpha-1C-; Phosphodiest,Somatomedin_B,Endonuclease,none; 4.86

417666; AI345001; Hs.82380; menage a trois 1 (CAK assembly factor); zf-C3HC4;TM=M;SS=N; 4.86

453854 AW0214C Hs.21( hypo i protein r jnC none; 4.35 

453082; H18835; Hs.31608; hypothetical protein FLJ20041; ion_trans;TM=Y;SS=M; 4.85

413407; AI356293; Hs.75339; inositol polyphosphate phosphatase-like ; SH2,SAM,Exo_endo_phos;; 4.85

417866; AW067903; Hs.82772; collagen, type XI, alpha 1; Collagen,COLFI,TSPN,laminin_G,CorA; 4.85

435652; N32388; Hs.334370; uncharaclerized hypothalamus prot= nh r F n Dne R i f b=M 184 

419355; AA428520; Hs.90061; progesterone binding protein; heme_1;TM=Y;SS=M; 4.83

431441; U81961; Hs.2794; sodium channel, nonvoltage-gated 1 alpha; ASC;TM=Y;SS=N; 4.83 

408983 NMJ00492 r IU lrdnsmembraneconductanc;ABC_tran,ABC_membrane,PRK,Bac_export_3;TM=Y;SS=N;4.83 

425465; L18964; Hs.1904; protein kinase C, iota; pkinase,DA&_PE nj^ r r p R;T M=M;S N; 4.32 

435232 NM.1-U1262 H- 4854 -/dndeperdcntlr^-nir I i2"fp1 i TV < r '-N4B1 

424490; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pkinase;TM=M;SS=N; 4.81 

425743; BE396495; Hs.159428; BCL2-associ , ;s=N 4 81 

453354; W55946; Hs.234863; Homo sapiens cDNA FLJ12082 fis, clone HE; transmembrane4,none; 4.81 

450883; NU_001348;Hs.2S619; death-associated protein kma e 3 ^ 5; TM=M;S 1:4,79 

" " 1 ■ h , 1 i i ., | ........ ■ ... | - 

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endoplasmic retic; ER_lumen_recept;TM=M;SS=M; 4.79
409960; BE251944; Hs.355264; hexokinase 1; none,none; 4.78

422795; AB033109; Hs.375610; KIAA1283 protein; 7tm_1,kazal,A2M,A2M_N;TM=Y;SS=M; 4.78
423619; T48691; Hs.249159; adrenergic, alpha-2A-, receptor, 7tmJ 7tr 2;TW=\ =M 4.78 
429335; AF095727; Hs.237832; myelin protein zero-like 1; ig,transmembrane4;TM=Y;SS=M; 4.73
427700; M262294;Hs.180383; dual specificity phosphatase 6; Rh U F r 1 

7 v, 11; H 3 ! EST High'y similar to S023 r rc one - 

^";Z95636;H s .i 1669| jm a 

407722; BE252241; Hs 38041 pyre pynd in tamin B kin pi T'* \ II 4 75 

13 330 AW450572 I 5,257 16 ESTs pkinase.zf-i 4.ERM NH.no e 4.75 

410637; U24339; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;; 4.75

411165; NM_000169; Hs.69089; galactosidase, alpha; Melibiase;; 4.75

425548;AA890023,Hs19;r. picfadinre iplor; fn3;TM=Y;SS=M; 4.73 

434158; T86534; Hs.14372; ESTs; adenylatekinase,none; 4.73

459035 AW291 109- H- 563; ESTs, Weakly similar to T3161 1 hypothe r H v 

409012;AL117435;Hs.49725.DKFZP434l216pralein;PH,RhoGEF Jf M;SS=M 4 , 

434503; T96231; Hs.17762; ESTs; SH3,Sorb,none; 4.73

446342; BE298665; Hs.14846; solute carrier family 7 (cationic amino ; none;TM=M;SS=N; 4.72
427413; AA402587; Hs.355667; LAT1-3TM protein; none,none; 4.71

449433 AI672096 H <H 2 E 1 akl OS26650DN h 'I j Fz.Frizzlcd.Fz; 4.71 

418910; Z25821; Hs.39466; Homo sapiens, Similar to dodecenoyl-Coen; ECH;; 4.70
I''. 7 Hs-77597; polo (Drosophia)-lik Ribosonv.LL37as.pW. Pi OJox,tRM-synL1b,dynamin,dynamin_2,GED,bZIP,M;;4.70 

442199; BE277633; Hs.372542; etoposide-induced mRNA; none;TM=Y;SS=M; 4.69
418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 fu 7 n 1 7tm 1 1 SS=M; 4.69 

1 Hs.36708; t j uninhibited mid n 3 4 M 

418558; AW082266; Hs.86131; Fas (TNFRSF6)-associated via death domai; death,DED;; 4.68
434164; AW207019; Hs.148135; serine/threonine kinase 33; pkins 

443323; BE550621; Hs.9222; estrogen receptor binding site associaie; none;TM=M;SS=M; 4.68 
400283; X06256; Hs.149609; integrin, alpha 5 (fibronectin receptor integrir ^.FG-GAP TM=Y,SS=N: 4.68 
' 1 198elcnu ph r , , r r h r | J | h , TM ^ 1 ;4 6 

417425; NM_002291; Hs.82124; laminin, beta 1; laminin_EGF,laminin_Nterm,integrin_B; 4.67
425976; C75094; Hs 334514 NG22 prDlem voltage L i tbb 
407844; AW073716; Hs.8037; ESTs; transmembrane4,none; 4.66 
450656; AA010539; Hs.18912; unnamed protein product; zf-C2H2;; 4.66 
420311; AW445044; Hs.38207; Human DNA sequence from clone RP4-530I15; none,none; 4.65
404287; ; ; FGENESH predicted novel CUB-domain conta; none,none; 4.64
452747; BE153855; Hs.61460; Ig superfamily receptor LNIR; ig,Rhabd_glycop;TM=Y;SS=M; 4.63
426680; AA320160; Hs.171811; adenylate kinase 2; adenylatekinase;TM=M;SS=N; 4.63
430397; AI924533; Hs.105607; bicarbonate transporter related protein ; HCO3_cotransp;TM=Y;SS=N; 4.63
447655; NM_003726; Hs.19126; src kinase-associated phosphoprotein of ; SH3,PH;TM=M;SS=N; 4.63
""Va Kwmlri H u 7 anlai Sx" as f ° h ' nd j n 9P ralein 1 ■ bromodomain,PHD,PWWP,zf-MYND;TM=M;SS=N; 4.62 
)9;COX8,SHMT,MIF t GST_C.fEF1G_don^ 

409220; BE243323; Hs.51233; tumor necrosis factor receptor superfami; TNFR_c6,death,Lipoprotein_5,TIL;TM=Y;SS=M; 4.60

h h 1 d of inner mitochondrial mem zf-Tirr Mr land.CH.spedrin in;TM=' t r 

^ H 7817; 01 tei TM=t SS=N 159 

400290; H18836; Hs.31608; hypothetical protein FLJ20041; none,Cys_knot; 4.59
430379; AF134149; Hs.240395; potassium channel, subfamily K, member 6; ion_trans;TM=Y;SS=M; 4.59
409645; AI142265; Hs.55498; geranylgeranyl diphosphate syntha e 1; pi n >, n\l r i TM •' E - n 4 59 
427373; AB007972; Hs.130760; myosin phosphatase, target subunit 2; ank;TM=M;SS=N; 4.58
437212; AI765021; Hs.210775; ESTs; UDPGT,none; 4.58

43039b D4974' H 2 1 h lur inding [ n th,ZU5, EGF.k ingl psin,Nebulin,LIM;;4.57 

452069; AB028949; Hs.133994; KIAA1026 protein; Metallophos;TM=M;SS=N; 4.56
416041; AA345547; Hs.53263; hypothetical protein FLJ13287; WD40;; 4.55 
434511 R23932 Hs.18106; ESTs; p e I r r ^9 4 55 
410174 AA33D00? H 1;DKFZ p.olein; i ne DSPc 4" 

418758; AW959311; Hs.172012; hypothetical protein DKFZp434J037; pkinase,RIO1;TM=M;SS=N; 4.55
451367; AA923729; Hs.26322, cell cycle ri I led kina pkinase;TM V,SS=N; 4.54
417433; BE270266; Hs.82128; 5T4 oncofetal trophoblast glycoprotein; LRR,LRRNT,LRRCT;TM=Y;SS=M; 4.54

411296; BE207307; Hs.10114; growth suppressor 1; 2OG-FeII_Oxy;TM=M;SS=M; 4.53

439975; AW328081; Hs.681 7; inosine triphosphatase (nucleo-ide I i Harr,1p_li Til V N =1 

431992; NMJ302742; Hs.2891; protein kinase C, mu; pkinase.DA: r :-lir J ,P 1;TM=M;SS=M; 4.53 

443303; U67319; Hs.9216; caspase7, apoptosis-relaled cysteine pr pkir ise.lCE plOJCE p20,TV=M;SS=M; 4.53 

428005; AW302245; Hs.181390; casein kinase 1, gamma 2; pkinase;TM=M;SS=N; 4.52

407775; NM.004914; Hs.38772; RAB36, member RAS oncogene fam ly.ras irf;TM=M;S 'I 1.52 

435523; TS2849; Hs.1 1090; membrane-spanning 4-domains, subf^mii A nr rv , ;=M I 

447321; AW271217; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HE; none,none; 4.51

405484; ; ; C30021 24":gi|12737280|rel]XP 006682.2| k; none;; 4.50 

443605; H06865; Hs.134131; ESTs; efhand,ion_trans,none; 4.50

1nlI?oo^ 2 fJ, 26 '' HS ' 288549 h yP° thetiral P rolel ' l '"™ 'trr_1,zf-C3HC4,fn3,SPRY,KRAB,zf-C2H2,rve,zf-B_box;TM=Y;SS=M;4.50 
422112; BEM0240; He.1 1 1733; Lcm1 prolein; Sm,BAG;;4.49 

418869; AW516565; ; gb:xq01d05.x1 SoaresJIH' C_cei M none RasGAPWWIQ;448 

44789B; AW969638; Hs. 1 1 23 16; 6 2 kd protein; none.none; 4.48 

450607; AL050373; Hs.25213; hypothetical protein; SH3;TM=M;SS=N; 4.48

418918; X07871; Hs.89476; CD2 antigen (p50), sheep red blood cell ; ig;TM=Y;SS=M; 4.48

424823; NMJW6226; Hs.153322; phosphollpasp C, epsilon >,PH,PI-PLC-Y,PI-PLC-> TM"M,SS=N; 4.48 

426812 AF1D5365 H 1 61 ier famFy 1 2 (| im/chlc r T II 447 

H 318893 EET " " > ml lFr n^Rec>p_L_domain,YLP,none;4.47 
435615; Y15065; Hs.4975; potassium voltage-gated channel, KQT-lik; ion_trans,KCNQ1_channel;TM=Y;SS=N; 4.47
427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;; 4.47
428727; AF078847; Hs.78452; general transcription factor IIH, polype; PH04,LIM;TM=M;SS=N; 4.45 
412760; AW379030; Hs. 41324 EST r L I Cbl N3.IJE i HC4 none 446 

409093; BE243834; Hs.50441; CGI-04 protein; Ribosomal.LSTae.pkinase.POLOJox.tRNA-synLlb.dynamin.dynaminJ.GED.bZIP.M;^ 46 
™ JS 10 '' 1 ?' 3 ? 33 ^phosphoadenosmeS-phosphosulfalesy APS ki Tp || , lase.FRi n i J late ^n 446 
447434 R16890 Hs. 137135; ESTs; pkir i h l fn3 4 45 

422010; AA302049; Hs.31181; Homo sapiens cDNA: FLJ23230 fis, clone C; none,SDF,sugar_tr; 4.45

414108; AI267592; Hs.75761; SFRS protein kinase 1; ank,PH,Oxysterol_BP,pkinase;TM=M;SS=N; 4.44

457001; J03258; Hs.2062; vitamin D (1,25- dihydroxyvitamin D3) re; hormonp ret 7 ( li i " 1MSS=N . 444 

409686; AK000002; Hs.55879; Homo sapiens mRNA; cDNA DKFZp434L0827 (f; ABCJran ABCmembra e TM=M SS=M 1 44 

4O8113;T82427;Hs.194101;HomosapienscDNA;FU20869fr clor i non 14 

436823; AW749865; Hs.1 17077; ESTs, Wealiv s milai 1 1 30022 hypotheti; aa~L mease 

™5? B | il,Iil ..deC.bromodomain.PHD.MBDAT hcok.DDT.PI Pi4 inase FAT,FATC,BolA,RUN,TFIIS;TM=M;SS=N; 4.44 

450505; NM_004572; Hs.25051; plakophilin 2; Armadillo_seg;TM=M;SS=N; 4.43

437915; AI637993; Hs.202312; Homo sapiens clone N11 NTera2D1 teratoca; none,none; 4.43

11741 1 r H 112; interleukin 1 rec P loi type ! ig.TIRTN M:SS (' 4 

445033; AV652402; Hs.72901 ; cyclin-dependr-nt ir e inhibitor 2E ,1 ank,443 

11 I TW;\ yte ,r mo , J ptor, ; inosi P,ig;TM=f I 4, 

452124; AA45422u H: 117. EST pkinase none; 4.43 

422599; BE387202; Hs.118638; non-metastatic cells 1, protein (NM23A) ; NDK,PH,Oxysterol_BP;; 4.42
416202; AW964492; Hs.169624; ESTs; none;TM=M;SS=N; 4.42
441518; AW161697; Hs.2S4150; ESTs; Y_phosphatase,DSPc,none; 4.42

441680; AW444598; Hs.7940; RAP1, GTP-GDPdi t i r 1 Arm <jillo_seg;TM=M-SS=ri 4 42 

131429: AF072813; I )1 ; reticulon jlon,Fz, g i 71 || 

453870; AW385001; Hs.8042; Homo sapiens cDNA: FU23173 r ne L; FG-GAP integrii NIF'4 41 

1 /V161386: Hs 13561; hypothetical pi 4 r r H 

456362; AW973003; Hs.1 79909; hypothetical protein Fl '.27- onf fM=M 3S I 4.41 
'1 1 Is, 16391 om [i DMA FLJ12797 fi n ,T I 115 - 4 4 

2211 C 4 j J 333; hypothetical protein FU 1i FGGY_C;TM-M;S I; 4.41 
4133 NVL00651 H .,1 ite carrier family 16 (monocarboxy suq t f 3 I; 4.41 
458097; AW341 136 Hs 5611 1 ESTs i ono SH3.PID; 4.40 

458248; BE407379; Hs.108082; ESTs, Weakly similar to T31636 hypotheti; C1q,Collagen;TM=M;SS=Y; 4.40

427681;AB018263;Hs.284232;tumornecrosisfactorreceplor--jp I r i d I T FI PH.XIir li GEF Melallothio 5;TM=M;SS=M- 4.40 

443693; AI344782; Hs.9683; DnaJ(Hsp40) homolog, subfamily C, mu I iii.DnsJ,TPR;TM=M i 4 40 

437162; AW005505; Hs.5464; thyroid hormone receptor coactivating pr; bromodomain;TM=M;SS=N; 4.39

453891; AB037751; Hs.301242; Homo sapiens mRNA full length insert cDN; none,none; 4.39

442572; AI001922; Hs.135121; hypothetical protein FLJ22415; none,HSP70; 4.39 

427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIb, r; ig;TM=Y;SS=M; 4.37

445817; NM_003642; Hs.13340; hlstone aceryltran-i i ; 1 rr r ~M SS-N, 4 37 

444895; AI674383; Hs.22891; solute carrier family 7 (cationic amino ; ASC,death,TNFR_c6; 4.37

408912; AB011084; Hs.48924; KIAA0512 gene product; ALEX2; Armadillo_seg;TM=M;SS=M; 4.37

432106; N58323; Hs.269098; ESTs, Weakly similar to RETROVIRUS-RELAT; SH3,PDZ,Guanylate_kin,none; 4.37

418283; S79895; Hs.83942; cathepsin K (pyenodyc^tf is) 3 eptid 4 67 

445826; BE313754; Hs.13350; Homo sapiens mRNA; cDNA DKFZp586D0918 (f; ig,tsp_1,ZU5,Nucleoside_tran; 4.37
417674; BE616160; Hs.82829; protein tyrosine phosphatase, non-recept; Y_phosphatase;TM=Y;SS=N; 4.36
400257;;Hs.76366;ENSP00000000452:BADprotein.LCL bindi; nor TM=M;SS=N 36 
431476; BE612705; Hs.256697; histidine triad nucleotide-binding prote; HIT;; 4.36
416173; AI808527; Hs.192822; serologically defined breast cancer anti; none;TM=M;SS=N; 4.36
456629; AW891965; Hs.367942; histone deacetylase 3; HSP90,HATPase_c,zf-C2H2,PHD,none; 4.36
- 427716; L38951; Hs.180446; karyopherin (importin) beta 1; Amid, I II j.HEAT •< <;SS=N; 4 c 

t^ , ^Si^ A ^! 2 J: i lf [SSSaMsimm " " ,b DN/ Llr r 1 WH .BAH,bromodomain,PHD,SET;TM=M;SS=N;4.35 

420251 ;AW206093;Hs.748; fibroblast growth factor receptor 1 (fm p pkina i I r \\ P 4 3 r 

445926; AF054284 H 334826 p ng f r Ljr1 r n T N; 4.35 

410726; AI623859; Hs.15936; ESTs; pkinase,pro_isomerase,none; 4.35

433996; AW135357; Hs.192374; ESTs; HSP90,HATPase_c,UDG; 4.34

S S 642 ' H " >h i ! " ' r ° rd cancer (™" homolog) ; non*TftH«S»4t 4.34 

437103 OT1.9408 H I5294 T r ir r <, 34 

410058; All H r Pbindir 1 1 i H3;T M;SS-N; 4 34 

> ^ 4 I * 1 is-Rcilly murine ssrc u; ub I r Tl 3-N; 4.3 

39 0 IM376417; Hs.374608; hypothetical protein MGC bhydml TM=f 

453308; AW959731; Hs.323099; ESTs; none,pkinase,Activin_recp; 4.33
424893; AW295112; Hs.153648; Homo sapiens cDNA FLJ13303 fis, clone OV; SAM;; 4.33
444664; N26362; Hs.11615; map kinase phosphatase-like protein MK-S; DSPc;TM=M;SS=N; 4.33
429655 H 1 C i r in r » n fr 3 g none; 4.32 

409121; M902256; Hs.78979; Golgi apparatus protein 1; cys_rich_FGFR,none; 4.32
430280; AA361258; Hs.237868; interleukin 7 receptor; fn3,none; 4.32

423798; AF047033; Hs.132904; solute carrier family 4, sodium bicarbon; HCO3_cotransp;TM=Y;SS=M; 4.29

425654; AB033022; Hs.158654; KIAA1196 protein; zf-C2H2;TM=M;SS=N; 4.29

457500; NMJ02759; Hs.274382; protein kinase, ir.terferon-inducibe dou; dsrm.pkina I M;Sf M 29 

427127; AW802282; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 4.29

447191; NM_014521; Hs.17667; SH3-domain binding protein 4; SH3;TM=M;SS=N; 4.29

408331; NM_007240; Hs.44229; dual specificity phosphatase 12; DSPc;TM=M;SS=N; 4.29

441 1 30; AI1 60734; Hs.267604; Homo sapiens PNAS-1 29 m RNA r pi! ds FTF I Ich K_tefa,DSPc;TM=M;SS=N; 4.28 

430057; AW450303; Hs.2534; bone morphogenetic protein receptor, typ; Activin_recp,pkinase;TM=Y;SS=M; 4.28

430250; NM_016929; Hs.283021; chloride intracellular channel 5; none;TM=M;SS=N; 4.28

406774; AW518383; Hs.177592; ribosomal protein, large, P1; 60s_ribosomal;; 4.28

413809; L25851; Hs.851; integrin, alpha E (antigen CD103, human ; vwa,integrin_A,FG-GAP;TM=M;SS=Y; 4.27

443960; AI093577; Hs.255416; hypothetical protein FLJ21986; TTL;TM=M;SS=N; 4.27 

427378; BE515037; Hs.177556; melanoma antigen, family D, 1 VAGE;TM=M =N 1 

412204; AM 25507, Hs 24 3 ESTs ig.rrm.nor.e; 4.26 

439506; AI361238; Hs.41136; ESTs: MAM lP kinese,Nucleoplasmin.none;4.2S 

451295; AI55721 2; Hs. 171 11 J la. lo 154374 ge , i E-bind.pWnase C OPR,none; 4.26 

452488; N74921; Hs.1843 E ne;TM=M N 2( 

450973; AF012072; Hs.25732; eukaryotic translation initiation facto; W2,MA3,MIF4G;TM=M;SS=N; 4.26
452437; AA026237; Hs.181272; ESTs; efhand,ion_trans,none; 4.26
433204; AI589645; Hs.123590; ESTs; none,7tm_1; 4.25

424756; AW504657; Hs.152931; lamin B receptor; ERG4_ERG24,FKBP;TM=Y;SS=N; 4.25
430570; AI417881; Hs.292464; ESTs; 7tm_2,Fz,Frizzled,none; 4.25
445709; H02592; Hs.74280; ESTs; PDZ,none; 4.25
428134; AA421773; Hs.161008; ESTs; Armadillo_seg,none; 4.24
434149;Z43829; Hs 2446^4 hyp Ifetc tit 1; 4.24 

t H 1 r I I I r T 1 

binding,THF_DHG_CYH,THF_DHG_CYH_C,CAP_GLY,AAA,LON,Peptidase_C9,bZIP,M,xar._ur_permease,HC03 cotransp;TM=M;SS=N;4.24 
t 1 t v^t I; Hs.7412 cas set popto r I o , teins r RD.ICE [I ICE p2C 4.24 
447818; W79940; Hs.355279; Homo sapiens clone246'70 mRNA sequence; none pkinase I 24 
450684; AAa72605;Hs.25333;inlerleukln1 receptor, lypp II jTM=Y:SS M 1.2: 
4 H T SH J n I I n r " ' FF ion 4.23 

426224; BE085860; Hs.374468; karyophenn (importin) beta 2 Am lillo_ < HEAT;TM=M;SS=N; 4.23 
413284; AU077055; Hs.289107; baculoviral IAP repeat-containing 2; zf-C3HC4,CARD,BIR,death,ig;TM=M;SS=N; 4.22
421917;AB028943;Hs.109445;KIAA1020protein,BTB,zfC2H^PI C|) kin PI3Ka;TW=M N 2 
431239; AL039971; Hs.251216; hypothetical protein DKFZp434A1 r C SH2,a V H 4.: 
419685; W76083; Hs.13413S EST one TM=M;SS=N; 4.22 

431630; NM_002204; Hs.265829; integrin, alpha 3 (antigen CD49C, alpha ; FG-GAP,Rhabd_glycop,integrin_A;TM=Y;SS=M; 4.22
425177; AF127577; Hs.155017; nuclear receptor interacting protein 1; none;; 4.21
422219; AW978073; Hs.1010; regulator of mitotic spindle assembly 1; pkinase,none; 4.21
450746; D82673; Hs.278589; general transcription factor II, i; none,SH3,PX; 4.21
428023; AL038843; Hs.374530; Homo sapiens cDNA: FLJ23602 fis, clone L; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;SS=N; 4.21
416907; W60909; ; gb:zd29g10.s1 Soares_fetal_heart_NbHH19W; ion_trans,none; 4.21
411763; NM_013371; Hs.71979; interleukin 19; IL10;; 4.21
425262; D87119; Hs.155418; GS3955 protein; pkinase;; 4.21

430035; NM_003463; Hs.227777; protein tyrosine phosphatase type IVA, m; Y_phosphatase,DSPc;TM=M;SS=N; 4.21

1 5505 A I R,LRRNT,LF M 

'13 f 1 4 r olule earner family 16 (monocarboxvlic; none,r.one; 4.14 

419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; ig,pkinase;TM=Y;SS=M; 4.13

431745; AW972448; Hs.163425; Novel FGENESH predicted cadherin repeat ; none,none; 4.10

416965; N26223; Hs.150436; MDAC1; none;NA;NA; 3.94 

426830; AA39.3 1 6'; Hs.41294; ESTs; none,none; 3.88

442438; AA995998; Hs.370007; gb:os26b03.s1 NCI_CGAP_Kid5 Homo sapiens; none,DNA_pol_B,DNA_pol_B_exo; 3.86
412314; AA825247; Hs.356084; downstream of: G protein-coupled recepto; 7tm_1;TM=Y;SS=M; 3.84
448243; AW369771; Hs.367688; Integrin, beta 8; integrin_B,none; 3.64

439318; AW837046; Hs.6527; G protein-coupled receptor 56; 7tm_2,CytC_asm,GPS;TM=Y;SS=M; 3.61
415999; AA172179; Hs.294029; ESTs; none,none; 3.60



407853; AA336797; Hs.40499; dickkopf (Xenopus laevis) homolog 1; none;TM=M;SS=Y; 3.34
(1 u i " I 

439180; AI393742; Hs.199067; v-erb-b2 avian erythroblastic |e iher a « n r li pkinase.R p_L main. I urn i pkinase,Recep_L_domam Peptidase M24; 2,88 
426158; MI«L001982; Hs.199067; v-erb-b2 avian erythroblastic hjl n Furr ri p_L n r a e.pkii r il i eptda e_M24 2.84 

414521; D28124; Hs.76307; neuroblastoma, suppression of tumorigeni; DAN;TM=M;SS=M; 2.81
424522; AL134847; Hs.149957; ribosomal protein S6 kinase, 90kD, polyp; pkinase,pkinase_C;; 2.70
438167; R28363; Hs.24286; chemokine binding protein 2 (CCBP2) rrFd nore;TM=Y - f 2.6E 
418888; AU076B01; Hs.89436; cadherin 17, LI cadherin (liver-intestin; cadherin;TM=Y;SS=M; 2.17 

TABLE 23B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

409745 MH1944.5 BI030997AA921674A - ' i > ' 54 W207491 AA077391 BG012775 BG997382 AA286333 AA150722 B 

BI027E64BI333 130 [1006275 '■ XC273 D'331 300 DI0298C4 3C06277 EI007 i V7h3 EI00 e2 BG997377 AA150780 B 

BI027818 BG01 5789 BI033807 AA341445 

417886 1031334_1 AA210987 D57294 AA214584 AA207006 D56572

438993 2580163_1 AI926361 AA834879 AA828995
Unique number corresponding to an Eos probeset
Sequence source. The 7 digit numbers in this column
sequence of human chromosome 22." 



406400 
403362 
404210 
404287 
405484 
400517 



9256298 
8571772 
5006246 



9796686 Minus 



Nt_position

1553-1712,1878-2140,4252-4385,5922-6077 
64099-64260 



TABLE 24A: 571 GENES UP-REGULATED IN HEAD AND NECK TUMORS COMPARED WITH NORMAL BODY TISSUES 

Table 24A lists a 
Hu03 Genechip i 
level of mRNA expression 



samples vs. the 80th percentile of the Al for no 
UnigenelD Unigene Title 



420602 
445019 
452281 
449722 
423575 
424086 



419948 
451807 
442875 
452795 
420931 
416283 
450221 



AL035461 

AF060877 

AI205540 

T93500 

BE280074 

C18863 

AI351010 



M22440 
AL039079 
AB041035 



449318 
418345 
415076 
414142 



Hs.334822 

Hs.134585

Hs.91143 

Hs.2281 

Hs.99236 

Hs.281295 

Hs.28792 

Hs.23960 

Hs.163443

Hs.102267 

Hs.301570 



Hs.170009
Hs.75256 
Hs.93847 
Hs.27099 
Hs.23625 
Hs.18878 
Hs.100431



438533 AI440266 



414972 
452943 
416661 



AW236021 
AJ001696 
NMJ00857 



hypothetical protein MGC4485 
DKFZP434G232 protein 
jagged 1 (Alagille syndrome) 
chromogranin B (secretogranin 1)



nscDNAFLJ1 1041 fis. clone PL 
scDNA FLJ1 1576 fis, clone HE 



NCK adaptor protein 1 
transforming growth factor, alpha 
regulator of G-protein signalling i 
NADPH oxidase 4 



Homo sapiens clone TCCCTA00142 mRNA sequ 



AF044197 

NM_005429 Hs.79141
AA328102 Hs.24641 
AA205847 Hs.23016 
Hs.177537
Hs.170673 
Hs.48480 
Hs.17518 

Hs.77695 
Hs.31082 
Hs.79440 
Hs.173560 
Hs.78531 
Hs.241407 
Hs.77890 
Hs.150042
AI753709 Hs.152484
M30703 Hs.270833 
BE562298 Hs.71827 
AB037788 Hs.224961 
BE379623 Hs.27693 
AI097560 Hs.61210
AA383092 Hs.1608 



small inducible cytokine B subfamily (Cy
vascular endothelial growth factor C 
cytoskeleton associated protein 2 



ESTs 

Homo sapiens cig5 mRNA, partial sequence 
hypothetical protein MGC2550 
KIAA0008 gene product 
hypothelii r r "Lllu _ 
IGF-ll mRNA-binding protein 3 
odd Oz/ten-m homolog 2 (Drosophila, mous 
J r . S i I E I DUt c j 
serine (or cysteine) proteinase inhibito 
guanylate cyclase 1, soluble, beta 3
Homo sapiens cDNA FLJ14436 fis, clone HE 
ESTs, Weakly similar to I38022 hypotheti
amph ju ii w nnon m,^ m h 
KIAA0112 protein; homolog of yeast ribos
cleavage and polyadenylation specific fa
peptidylprolyl isomerase (cyclophilin)-l
ESTs, Weakly similar lo 38022 hypott . 
M14kD) 



WO 03/042661 



PCT/US02/36810 



422426 


W79117 


Hs.58559 


ESTs 


49.00 






Hs.217493




46.00 


445828 


F05802 




ESTs 




431806 


AF186114 


Hs.270737 




44.00 


452909 


NM_015368 








432226 






phosphate cytidylyltransferase 1, cholin 






L49054 


Hs.85195


myeloid leukemia factor 1 




443354 


AW970672 


Hs.9247 






416049 


AI970536


Hs.16603


hypothetical protein FLJ13163 




431494 


AA991355 


Hs.298312 


hypothetical protein DKFZp434A1315 




433859 


AW896758 


Hs.273789 


ESTs 




426753 


T89832 


Hs.170278


ESTs 




400792 
402034 






Homo sapiens mRNA; cDNA DKFZp434CC515 (f 




424073 


U03493 








458424 


AI084049 








435159 


AA668879 


Hs.116649


ESTs 


409269 


AA576953 


Hs.22972 


hypothetical protein FLJ13352






W76027 




hypothetical protein FLJ11105




439128 


AI949371


Hs.153089 






420795 


AA323037 


Hs.128645 


sorting nexin 16 




422505 


AL120862 


Hs.124165 


ESTs 




434828 


D90070 


Hs.96 


phorbol-12-myristate-13-acetate-induced




410561 


BE540255 
AW449679 


Hs.6994 


Homo sapiens cDNA: FLJ22044 fis, clone H 




423035 


Hs.156739 


H.sapiens XG mRNA (clone PEP11) 




417655 


AA780791 


Hs.14014 


h rati li i orDlem FLJ14813 




414869 


AA157291 


Hs.21479 




453049 


BE537217 


Hs.30343 


ESTs 




417801 


AA417383 


Hs.82582 


integrin, beta-like 1 (with EGF-like rep




435243 
431211 


AW292B86 


Hs.261373 


hypothetical protein dJ434014.3 




M86849 


Hs.323733 


gap junction protein, beta 2, 26kD (conn




407746 


AK001962 




hypothetical protein FLJ11100




416498 


U33632 








414231 


AI468004 


1K278956 


hypothetical protein FLJ12929




426227 


U67058 


Hs.168102 






439452 


AA918317 






a'07 


439999 


AA115811








417791 


AW965339 


Hs.111471


ESTs 




436486 


AA742221 




ESTs 




432731 


R31178 


Hs.287820


fibronectin 1 




429903 


AL134197








435039 


AW043921 


Hs.130526




419743 


AW408762 


Hs.5957 


Homo sapiens clone 24416 mRNA sequence




457001 


J03258 


Hs.2062 


vitamin D (1,25- dihydroxyvitamin D3) re
interleukin 1 receptor, type II 




450684 


AA872605 


Hs.25333 




458531 


NM_004812


Hs.116724


aldo-keto reductase family 1, member B10


319 


AA367718 


Hs.159083


ESTs 






BE267931 




proliferating cell nuclear antigen




411388 


X72925 


Hs.69752 


desmocollin 1 


419750 


AL079741 


Hs.183114 


Homo sapiens cDNA FLJ14236 fis, clone NT 


1.00 


429370 


C19097 


Hs.89709 


glutamate-cysteine ligase, modifier subu 




429921 


AA526911 


Hs.82772 


collagen, type XI, alpha 1 




449467 


AW205006 


Hs.197042 






453102 


NM_007197


Hs.31664 


frizzled (Drosophila) homolog 10 




453637 


NM_002589 


Hs.34073 


BH-protocadherin (brain-heart) 




400289 




Hs.2258 


matrix metalloproteinase 10 (stromelysin 


517.00 


418007 


M13509 


Hs.83169 


matrix metalloproteinase 1 (interstitial




428368 


BE440042 


Hs.83326 






428227 


AA321649 


Hs.2248 




450375 


AA009647 


Hs.8850 




56.11 


427585 


D31152 


Hs.179729


collagen, type X, alpha 1 (Schmid metaph




424717 


H03754 


Hs.152213 


wingless-type MMTV integration site fami




424735 


U31875 


Hs.272499 


short-chain alcohol dehydrogenase family 


420159 


AI572490




pi J FLJ21245 f t 




415511 


AI732617 


Hs.182362 


ESTs 




406467 












D30783 


Hs.115263 


epiregulin 




452461 


N78223 


Hs.108106 


transcription factor 




415542 


R13474 


Hs.290263 


r 1 ly ill f h t 


413324 








431571 


AW500486 


Hs.180610


splicing factor proline/glutamine rich ( 


7.6D 






Hs.143655


ESTs 


99.00 
1.00 


451844 


T61430 




gb:yc06a03.s1 Stratagene lung (937210) H 


441877 


AW273802 


Hs.11340


hypothetical protein FLJ23047 


3.00 


439926 


AW014875 


Hs.137007 


ESTs 


2.79 


432015 


AL157504 


Hs.159115


Homo sapiens mRNA; cDNA DKFZp586O0724 (f 


94.00 


421103 


AI625835


Hs.27104 


Homo sapiens mRNA; cDNA DKFZp667D226 (fr


1.22 


448062 


AW295923 


Hs.255472 


KIAA1843 protein 


1.00 


432222 


AI204995 




gb:an03c03.x1 Stratagene schizo brain S1 


1.27 


421577 


BE465451 


Hs.105925 


single-minded (Drosophila) homolog 1


1.00 


421187 


NM_014721 


Hs.102471


KIAA0680 gene product 


5.00 


408908 
437214 


BE296227 


Hs.250822 


serine/threonine kinase 15 


39.00 


BE092336 




gb:IL2-BT0734-240400-072-A12 BT0734 Homo 


1.00 
443054
453216 



400844 
409402 
412420 



X81334 
AW207091 
W02414 
BE281128 



Hs.2905 

Hs.32405 

Hs.234790 

Hs.2936 

Hs.72307 

Hs.38383 

Hs.9030 

Hs.153408



AF208234 Hs.695 
AL035668 Hs.73853 
AF210317 Hs.95497 



ESTs 

yes-associated protein 65kDa 
progesterone receptor 

Homo sapiens mRNA; cDNA DKFZp586G0321 (f 
contactin 5 

matrix metalloproteinase 13 (collagenase 

ESTs 

ESTs 



purinergic receptor P2Y, G-protein coupl






415327 
418624 
452850 



420923 
418216 
444649 



418686 
421508 
4105S3 
419183 



402075 
423017 
406687 



414221 
412296 
405494 
407189 
403085 
408633 
435257 
410044 
445182 
417275 
418406 
421110 
406081 
449448 



440304 
402778 
406117 



AU076643 Hs.313 
U03272 Hs.79432 
AI026718



gb:ym54c02.r1 Soares infant brain 1NIB H 



BE093589 Hs.38178
AA215404 Hs.137289 
W68815 Hs.301885 



irtoA46010X-lin 
i17(KRT17), 
hypothetical protein FLJ23468 
ESTs 

Homo sapiens cDNA FLJ11346 fis, clone PL 



AF097021 Hs.273321 differentially expressed in hematopoieti



AW190902 Hs.40098 

AA766268 Hs.266273 

M515032 Hs.91109 

W60377 Hs.57772 

Z36830 Hs.87268 

NM_004833 Hs.105115

AW016824 Hs.255527 
Hs.89663 



AF15q14 protein 
ESTs 

cysteine knot superfamily 1, BMP antagon

hypothetical protein FLJ13346
ESTs 



AC002115 

Y13645 

AA205625 



AK001666 
AA972965 
AW450979 
AW450979 
AW936233 



AW963372 

AA677026 

BE566742 

AW189787 

X63578 

X73501 

AJ250717 



ESTs, Weakly similar to dJ365O12.1 [H.sa



similar to SALL1 (sal (Drosophila)-like

gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su
gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su
gb:QV0-DT0020-090200-107-a06 DT0020 Homo

gb:ae37e03.s1 Gessler Wilms tumor Homo s



parvalbumin 
cytokeratin 20 
cathepsin E 



BE536911 Hs.234545 



ring finger protein 
hypothetical protein FLJ10339 
hypothetical protein NUF2R 
417715
442577 
418867 
426088 



AA292993 

D31771 

AF038007 



Hs.72222 hypothetical protein FLJ13459 

Hs.86366 ESTs 

Hs.163900 ESTs 

Hs.89404 msh (Drosophila) homeo box homolog 2 

Hs.166196 ATPase, Class I, type 8B, member 1

Hs.74126 fatty acid binding protein 6, ileal (gas

Hs.76888 hypothetical protein MGC12702 

gb:EST382704 MAGE resequences, MAGK Homo 



425852 
414987 
430168 



439738 
437181 
404875 






AF019225 Hs.114309 apolipoprotein L 



417151 
411248 
405034 



AA524394 Hs.294022 



death receptor 6, TNF superfamily mt 
hypothetical protein FLJ14950 
DKFZF434I1735 protein 



Hs.156452 ESTs 



ESTs, Weakly similar to KIAA0758 protein 



BE272506 Hs.82109 
BE246502 Hs.9598 
AI306615 Hs.125343 



AI601188 Hs.120910 ESTs 

AK001379 Hs.121028 hypothetical protein FLJ10549

AL137708 Hs.161031 Homo sapiens mRNA; cDNA DKFZp434K0322 (f 

Insulin-like growth factor 2 (somatomedi 

AI133461 Hs.251664 insulin-like growth factor 2 (somatomedi

T64682 gb:yc48b02.r1 Stratagene liver (937224) 

Hs.334822 hypothetical protein MGC4485 



AL035754 
AA129547 
AW408164 
AA032211 



;d receptor, L(DLR class) 
Homo sapiens cDNA: FLJ21763 fis, clone C 
aquaporin 3 



Hs.293858 

Hs.334605 

Hs.2474 

Hs.285754

Hs.249184 

Hs.118493 



hypothetical protein FLJ20510 
ESTs, Weakly similar to ISHUSS protein d
ESTs, Weakly similar to G01763 atrophin-
gb:Homo sapiens mRNA full length insert



407581 
433078 
417003 



426847 
446009 
457292 
415949 



450983 
414792 
437553 
421218 



445911 
436608 
423916 



AL038170 
AB040946 
AW411425 
NM_007019
AI638516 
S78723 
AI989885 
AI921270 
H10562
AI623693 
NM_016361
AA305384 
BE314949 



Hs.173508 
Hs.121575 
Hs.80756 
Hs.284227 
Hs.180655 
Hs.93002 
Hs.22630 
Hs.298623 
Hs.231926 
Hs.334882 
Hs.21691 
Hs.191533 



Hs.25740
Hs.87128
Hs.130497
NM_000499 Hs.72912
AW163564 Hs.142375 
AA641726 Hs.289015 

AL137967 



P3ECSL 

Homo sapiens cDNA FLJ12231 fis, clone MA
betaine-homocysteine methyltransferase
KIAA1513 protein



5-hydroxytryptamine (serotonin) receptor 
ESTs 

hypothetical protein FLJ14251

ESTs 

ESTs 

LPAP for lysophosphatidic acid phosphata

ER01 (S.cerevisiae)-like 

hypothetical protein FLJ23309 

ESTs, Weakly similar to MAT8_HUMAN CHL

cytochrome P450, subfamily I (aromatic c 

ESTs 



gb:DKFZp761D2315_r1 76
Hs.145645 ESTs, Moderately similar to ALU1_HUMAN A
Hs.192371 down syndrome critical region protein DS
Hs.17235 Homo sapiens clone TCCCIA00176 mRNA sequ



421064 
421373 
427239 



413278 
453389 
454789 



AA808229 Hs,16 



Hs.116206 Opa-interacting protein 5
Hs.833 interferon-stimulated protein, 15 kDa
Hs.32963 cadherin 6, type 2, K-cadherin (fetal ki

gb:QV0-HT0367-150200-114-d02 HT0367 He 



434487 
424008 
418067 



453204 
437240 
405531 
440249 
426783 
434192 
407881 
402001 



444008 
421307 
423853 
407846 
410348 
419078 



441495 
417933 
412661 
411880 



AF143867 

R02740 

AI127958 

AW242799 

L22075 

BE311926 

AF153341 

AA298484 

BE348244 

AA211329 

R10799 



Z19084 

AW072003 

AF113018 
AI805416
AI791949 



AB011537 
AA426202 
AW182663 



Hs.61265 
Hs.202628 
Hs.26006 
Hs.191990 



cystatin E/M 
ESTs 

guanine nucleotide binding protein (G
hypothetical protein FLJ12691
Homo sapiens winged helix/forkhead
ESTs, Moderately similar to G736_Hl
ESTs, Weakly similar to I78885 serine
hypothetical protein FLJ10559
ESTs 

gb:nx85c05.s1 NCI_CGAP_GCB1 Ho



Hs.337275 
Hs.34371 



Hs.284302 
Hs.213897 
Hs.112432
Hs.49117 
Hs.80449 
Hs.128152 
Hs.164502
Hs.220756 
Hs.103305 
Hs.133466 
Hs.40403 
Hs.95469 
Hs.89584 
Hs.77597 



heparan sulfate (glucosamine) 3-O-sulfot

PRO1621 protein
ESTs 

anti-Mullerian hormone 

Homo sapiens mRNA; cDNA DKFZp564N1662 (f 
Homo sapiens, clone IMAGE:3535294, mRNA, 
ESTs 
ESTs 

ESTs, Weakly similar to SFR4_HUMAN SPLIC 
Homo sapiens mRNA; cDNA DKFZp434B0425 (f 
slit (Drosophila) homolog 1
Cbp/p300-interacting transactivator, wit



X90725 
N58115 

AA604379 Hs.86211 

NM_014398 Hs.10887

M34455 Hs.840 

AW294603 Hs.127039

X02308 Hs.82962 

N32860 Hs.24611



polo (Drosophila)-like ki
AD024 protein
hypothetical protein 
simfer lo lysoDome-as: 
indoleamine-pyrrole 2,3 dioxygenase 

thymidylate synthetase 

ESTs, Weakly similar to I54374 gene NF2 

gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens






429257 


AW163799


Hs.198365 




2.27 


417599 


AA204688 


Hs.136201 


ESTs 


1.01 


438366 


AA805760 


Hs.303567 


ESTs 


1.00 


438746 


AI8B5815 


Hs.184727


ESTs 


1.47 


409691 




Hs.246042 


Homo sapiens, clone MGC:5437, mRNA, comp 


1.00 


408827 


AW275730 


Hs.254825 


ESTs 


1.00 


414735 


BE468016 


Hs.281904 


ESTs 


1.00 


422278 


AF072873 


Hs.114218 


frizzled (Drosophila) homolog 6
ESTs 


5.21 


412719 


AW016610 


Hs.129911 


494.00 


417034 


NM_006183


Hs.80962 




1.00 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


23.36 


407788 


BE514982 


Hs.38991 


S100 calcium-binding protein A2 


6.98 


413753 


U17760 


Hs.75517 


laminin, beta 3 (nicein (125kD), kalinin


7.50 


424012 


AW368377 


Hs.137569


tumor protein 63 kDa with strong homolog


9.77 


425650 


NM_001944


Hs.1925 


desmoglein 3 (pemphigus vulgaris antigen 


445.00 




AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines


13.93 


423673 


BE003054 


Hs.1695


matrix metalloproteinase 12 (macrophage 


12.77 


452838




Hs.30743 


preferentially expressed antigen in mela


45.00 


418663 


AK001100 


Hs.41690 


desmocollin 3 


10.CS 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhibito 


6.2S 


429610 


AB024937 


Hs.211092 


LUNX protein; PLUNC (palate lung and nas


0.88 




M29540 




carcinoembryonic antigen-related cell ad 


33.31 


421948 


L42583


Hs.334309 


keratin 6A 


36.81 


431846 


BE019924 




uroplakin 1B


1.37 




AF077374 


Hs.139322


small proline-rich protein 3


8.85 


453964 


AI961486 


Hs.12744 


ESTs 


0.40 


446856 


AI814373 


Hs.164175 


ESTs 


1.16 


443648 


AI085377 


Hs.143610 


ESTs 


2.15 


408522 




Hs.46320 


Small proline-rich protein SPRK [human, 


4.39 


431384 


BE158000 




gb:MR2-HT0377-150200-202-e03 HT0377 Homo


1.18 


422158






protease inhibitor 3, skin-derived (SKAL 


4.22 


435505 


AF200492 


Hs.211238


interleukin-1 homolog 1 


164.00 


417366 


BE185289


Hs.1076 


small proline-rich protein 1B (cornifin)


9.85 


431958
441020 


X63629


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


5.59 


W79283 


Hs.35962 


ESTs 


5.76 


423217 


NM_000094


Hs.1640


collagen, type VII, alpha 1 (epidermolys


1.97 


448733 


NM_005629


Hs.187958 


solute carrier family 6 (neurotransmitte


1.09 


444371 


BE540274 


Hs.239 


forkhead box M1


2.44 


422168 


AA586894


Hs.112408 


S100 calcium-binding protein A7 (psorias 


8.39 
AA420450 Hs.292911

BE332756 Hs.169902

AL035864 Hs.69517 

AK001035 Hs.130881 

AF104032 Hs.184601 



ESTs, Highly similar to S60712 band-6-pr 
solute carrier family 2 (facilitated glu
cDNA for differentially expressed C016 g
B-cell CLL/lymphoma 11A (zinc finger pro
solute carrier family 7 (cationic amino



cerevisiae) homolog (E coii Re 
insulin-like growth factor binding prote
DNA-activated, catalytic
SRB7 (suppressor of RNA polymerase B, ye
alcohol dehydrogenase 7 (class IV), mu o
gap junction protein, beta 6 (connexin 3
XAGE-1 protein 



enhancer of zeste (Drosophila) homolog 2




pituitary tumor-transforming 1
melanoma cell adhesion molecule
hypothetical protein FLJ11036
ESTs, Highly similar to unnamed protein
a disintegrin and metalloproteinase doma



hypoxia-inducible protein 2 
mitochondrial ribosomal protein S17
RAN binding protein 1 
ESTs 

cyclin-dependent kinase inhibitor 2A (mo 
hypothetical protein FLJ20764
hypothetical protein FLJ10540 

baculoviral IAP repeat-containing 5 (sur
HSPC150 protein similar to ubi
idlsintegrinam' ' " 



G protein-coupled receptor 87 
KIAA0186 gene product
hypothetical protein MGC16207 



R27430 Hs.271565 

NM_005025 Hs.78589 

Y12642 Hs.3185 

AI432652 Hs.42824



transcription factor BMAL2 
GPI-anchored metastasis-associated prote 
pim-2 oncogene 

diacylglycerol kinase, zeta (104kD) 
hyaluronan synthase 3 
aldehyde dehydrogenase 3 family, member
topoisomerase (DNA) II alpha (170kD)
interleukin enhancer binding factor 2, 4
UDP-GlcNAc:betaGal beta-1,3-N-acetylgluc
protein tyrosine phosphatase, receptor-t 
NADH:ubiquinone oxidoreductase MLRQ subu 
calmodulin-like 3 

KIAA0144 gene product
hypothetical protein FLJ10201 
ATP-binding cassette, sub-family C (CFTR 
gene product 



ESTs, Weakly similar to GGC1_HUMAN G ANT 
G antigen 7B 

laminin, gamma 2 (nicein (100kD), kalini 
Human DNA sequence from clone RP5-850E9 
KIAA1020 protein 



serine (or cysteine) proteinase inhibito



hypothetical protein FLJ10718 
428479
422487 
423761 
436291 
423725 



446292 
429538 
447289 
428004 
415091 
416209 
408572 
404996 



438702 
444378 
433485 
407839 
439223 
409041 
429228 
409757 
411089 






AF123050 Hs.44532
BE069288 Hs.34744
H06350 Hs.135056
AW168067 Hs.252956
Y00272 Hs.184572
AJ010901 Hs.198267
NM_006194 Hs.132576
BE568452 Hs.5101 
AJ403108 Hs.132127 
AF134160 Hs.7327 
H5.1277B0 
Hs.202312 
Hs.106243 
Hs.121296 
Hs.137516
Hs.152213 



AI637993 
AI819863 
AA281219 
AA218940 
AI683487 

NM_014058 Hs.201877 

AW411307 Hs.114311 

F07953 Hs.16085 

AL157425 Hs.133315 

AF081497 Hs.279682 

BE182592 Hs.11261 

AW247017 Hs.36978 

AA449563 Hs.151393 

AL044872 Hs.77910

AA236776 Hs.79078 

AA055611 Hs.226568 



445443 
407137 
401785 
412723 
450701 



414774 
424629 
437789 
454098 
453968 

403478 



cell division cycle 2, G1 to S and G2 to 



paired box gene 9 

protein regulator of cytokinesis 1

hypothetical protein LOC57822 



Homo sapiens clone N11 NTera2D1 ter 



wingless-type MMTV integration site fami 
DESC1 protein 

CDC45 (cell division cycle 45, S.cerevis
putative G-protein coupled receptor
Homo sapiens mRNA; cDNA DKFZp761J1324 (f
Rh type C glycoprotein
small proline-rich protein 2A
melanoma antigen, family A, 3 



AA280174 
AF053306 
AF027866 
AA481269 
AI879064 
R41339 
AI493076 
AA045144 
AW238299 
AB033025 
AI553633 

NM.001398 Hs.123114 cystatin! 
AA456454 Hs.183418 cell divisi 
AA721252 Hs.291502 ESTs 
AL044222 Hs.23255 nucleopo 
AI240665 Hs.8895 ESTs 
AW511413 Hs.278025 ESTs 
AF120274 Hs.194689 artemin
AV653838 Hs.322971 ESTs 



3-hydroxy-3-methylglutaryl-Coenzyme A sy 
MAD2 (mitotic arrest deficient, yeast, h 
ESTs, Moderately similar to ALU4_HUMAN A

Williams-Beuren syndrome chromosome regi 
budding uninhibited by benzimidazoles 1 
serine (or cysteine) proteinase inhibito 
ATP-binding cassette, sub-family C (CFTR



aldo-keto reductase family 1, member C2



Hs.250618 
Hs.50081 
Hs.337139 



T97307 



BE264974 



X02419 Hs.77274 

M90656 Hs.151393 

AI581344 Hs.127812 

W27953 Hs.292911 

AA847843 Hs.62711 

AA412605 Hs.178053 



gb:ye53h05.s1 Soares fetal liver spl 



glutamate-cysteine ligase, catalytic sub 
ESTs, Weakly similar to T17330 hypotheti 
ESTs, Highly similar to S60712 band-6-pr 
Homo sapiens, clone IMAGE:3351295, mRNA
SPANX family, member C 



427260 

401994 

404298 
404927 
434105 
436961 
429563 
426067 
428810 



AA504777 Hs.105293 ESTs 



AW952124 
AW375974 
BE619413 



AI732182 
AB023173 
U89995 



Hs.13094 



presenilins associated rhomboid-like pro
ESTs 

eukaryotic translation initiation factor



Hs.2437 
Hs.97053 

Hs.193788 nitric oxide synthase 2A (inducible, hep 

Hs.97387 ESTs 

Hs.134791 ESTs 

Hs.191866 ESTs 

Hs.75478 ATPase, Class VI, type 11B 
403715
428645 
425734 



Hs.133034 
Hs.30837 
Hs.114256
Hs.22010 



413573 AI733859 Hs.149089 ESTs 
430686 NM_001942 Hs.2633 desmoglein 1



hypothetical protein MGC5350 



gb:od77b08.s1 NCI_CGAP_Ov2 Homo sapiens



444783 
426427 
422956 
445537 



408000 
453160 
409361 



Hs.52620 integrin, beta 8



anillin (Drosophila Scraps homolog), act 
TTK protein kinase 

■ ;inFU10461 



AA032279 
AI267700 
L11690 



Hs.153704 
Hs.61635 
Hs.317584 
Hs.620 



106.00 
15S.00 
196.00 



Unique Eos probeset identifier number
Gene cluster number 
Genbank accession numbers 




AW872477BE0B8101T05990 
AW936233 AW936272
T64682 BE168190 BE168256 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367
BE011368 BE011362 BE011215 BE011355 BE011363
H22769 R35182 Z43545 F05783 N92089 H71928 
AA663848 AA400100 AA401424

AW970622 AA503009 AA502998 AA502989 AA502805 T92188

BE15E0OO BE157999 H75671 H70965 C188S5 BE383512 BE385315 BE3S029B AI341995 BE074534AA055592AA1 32265 AI733757 

AA134504 BE145037 AA055887 BE070191 R66492 AW858018 AW858058 AW817057 AW862031 AW861688 AWB62029 AW858805 AV\ 

AW862028 AW85801 7 AW819164 AW853B9B A15221 61 AW854789 AW817403 BE152005 AI73241 1 AA1330B4 

AI204995 AW827539 AW969908 AW440776 AA528756 

BE092336 BE092259 BE092497 BE092051 AA746882 AI336378

AA747537 BE089068 BE089070 

AA828995 AA834879 AI926361

AL109688 R23665 R26578 

AI126098 AI184746 AI148521

T61430 AI820546 AI821336

AL137967 BE064160 BE064186 

BE156314 BE156316 AW820750

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367
BE011368 BE011362 BE011215 BE011365 BE011363



TABLE 24C 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
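The Nt_position field described above is a comma-separated list of start-end pairs. The following is a minimal Python sketch of how such a field could be read into numeric ranges. The function names `parse_nt_position` and `total_exon_length` are hypothetical and do not appear in the source, and the sketch assumes that the coordinates are inclusive, which the legend does not state.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Parse a field such as '24746-24872,25035-25204' into (start, end) pairs.

    Entries that are not two integers joined by a hyphen raise ValueError,
    which also flags entries damaged during text extraction.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, separator, end_text = part.partition("-")
        if not separator:
            raise ValueError(f"missing '-' in exon range: {part!r}")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum the lengths of all ranges, assuming inclusive coordinates."""
    return sum(abs(end - start) + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("24746-24872,25035-25204")
    print(exons)
    print(total_exon_length(exons))
```

Raising an error on malformed ranges, rather than skipping them, keeps damaged table entries visible instead of silently shortening the predicted exon set.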



Pkey 


Ref 






400751 


7331445 


Minus 


35395-35533 


400842 


1927148 


Plus 


90462-90673 


400843 


9188605 


Plus 


5863-5970,7653-7784,8892-9023,9673-9807,10634-10789,15254-15^03,23827-23958


400844 


9188605 




24746-24872,25035-25204 


400846 


9188605 


Plus 


39310-39474 


401486 


7341763 


Plus 


32585-32756,36281-36540,40791-40933,44018-44179


401747 






118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866-131932,132451-132575,133580-134011


401760 


9929699 




83126-83250,85320-85540,94719-95287 


401780 


7249190 




28397-28617,28920-23 1 9135 2 3 2 1411 23567,23705-23787,30224-30573 


401781 


7249190 


M 


83215-83435,33531- 55 53740-f 901,84 ? h ( 3 84355-85037,86290-86814 


401785 


7249190 




165776-165996,166189-166314,166408-166569,167112-167258,167387-167469,168634-168942


401994 


4153858 




42904-43124,43211 43333.4460744763,451 £ IE ? r 1 46337-43732 


402001 


9501818 


Plus


68052-68223 
402075
402230 
402305 



403715 
403903 
404148 



7684482 
8117407 
9966312 
7328724 
9796339 
9588235 



9944263 
7528051 
9801324 



404996 
405033 
405064 
405494 



7107731 
7658416 
8050952 
9665194 
4914350 
2735037 
7767812 



86227-86451 

121907-122035,122304-122921,124319-124161,124455-124610,125672-126076

29782-29932 

40832-41362

129750-129919 

126786-126948 

128560-128702 

175426-175667 



101165-102597 

78218-78418,79571-79709 

73591-73723 



96588-96732,97722-97831 



43081-43229 

37999-38145,38652-38998,39727-39872,40557-40674,42351-42450 

142358-142546 

81207-81416 

70284-70518 

35602-35803 

741-969 

61057-62075 

123525-123713 

38115-38691 

54304-54584 

7513-7673 

182212-182958 



TABLE 25A: 691 genes upregulated in head and neck cancer relative to normal body tissues 

Table 25A lists about 691 genes upregulated in head and neck cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules,
peptides, or antibodies. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this
analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more
domains indicative of having oncogenic function, of transducing intracellular signals, or of being modulatable by small molecules, peptides, or antibodies (e.g., pkinase, death-domain,
7tm, phosphatase, or ion_transporter). Certain predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar accession number, GenBank accession number 

UniGeneID: UniGene number

PredProtDomains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280).
UniGene Title: UniGene gene title 

R1: 85th percentile of head and neck cancer AIs divided by the 50th percentile of normal tissue AIs, where the 10th percentile of all normal tissue AIs was
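The R1 column is a ratio of percentiles of average intensity (AI) values. The following is a minimal Python sketch of such a ratio. The sentence defining the role of the 10th percentile of normal tissue AIs is cut off in the text, so the sketch assumes that value is subtracted as a background term from both the numerator and the denominator; that assumption, the linear-interpolation percentile, and the names `percentile` and `r1_ratio` are illustrative and are not taken from the source.

```python
from typing import Sequence


def percentile(values: Sequence[float], q: float) -> float:
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sequence is undefined")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)  # rank is non-negative, so int() acts as floor
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def r1_ratio(tumor_ai: Sequence[float], normal_ai: Sequence[float]) -> float:
    """85th percentile of tumor AIs over the 50th percentile of normal AIs.

    ASSUMPTION: the 10th percentile of the normal AIs is subtracted from both
    terms as a background estimate; the source definition is truncated.
    """
    background = percentile(normal_ai, 10)
    numerator = percentile(tumor_ai, 85) - background
    denominator = percentile(normal_ai, 50) - background
    if denominator <= 0:
        raise ValueError("background-corrected normal median is not positive")
    return numerator / denominator


if __name__ == "__main__":
    tumor = [100.0, 200.0, 300.0, 400.0, 500.0]
    normal = [10.0, 20.0, 30.0, 40.0, 50.0]
    print(round(r1_ratio(tumor, normal), 3))
```

Using an upper percentile for the tumor samples and the median for normal tissues favors genes that are strongly expressed in at least a subset of tumors while remaining low in typical normal tissue.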

Pkey; ExAccn; UniGeneID; UniGene Title; PredProtDomains; R1

422168; AA586894; Hs.112408; S100 calcium-binding protein A7 (psorias; efhand,S_100;TM=M;SS=N; 46.25
408522; AI541214; Hs.46320; Small proline-rich protein SPRK [human, ; none,Cornifin; 40.37
417366; BE185289; Hs.1076; small proline-rich protein 1B (cornifin); Cornifin;TM=M;SS=N; 36.94
401781; ; ; Target Exon; filament;TM=M;SS=N; 29.74 

422158; L10343; Hs.112341; protease inhibitor 3, skin-derived (SKAL; wap;TM=M;SS=Y; 29.54 
401780; ; ; NM_005557*:Homo sapiens keratin 16 (foca; filament,filament; 28.58
424098; AF077374; Hs.139322; small proline-rich protein 3; Cornifin;TM=M;SS=N; 28.55

421948; L42583; Hs.334309; keratin 6A; filament,RhoGAP,DUF285,bZIP,Tropomyosin,tubulin,DUF164,TBCA,Collagen;TM=M;SS=N; 25.74

428471; X57343 H 164510 ra n; 14-3 l;TM=M;SS=N; 23.65 

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; IL1;; 21.02

421574; AJ000152; Hs.105924; defensin, beta 2; Defensin_beta;TM=M;SS=M; 20.83

409601; AF237621; Hs.80828; keratin 1 (epidermolytic hyperkeratosis); filament,bZIP,UvrD-helicase,TBCA;TM=M;SS=N; 20.72
433091;Y12642;Hs.3185;lymphocyteantigen6complex,locusD;UPAR_LYbln n,A i it'MSS=, 1961 
446292; AF081497; Hs.279682; Rh type C glycoprotein; Ammonium_transp,FecCD;TM=Y;SS=M; 19.53 
I ! In t t nt e bindjeel N; 19.1 

407788; BE514982; Hs.38991; S100 calcium-binding protein A2; efhand,S_100,S_100,efhand; 17.93
416091; AF295370; Hs.283082; defensin, beta 3; Defensin_beta;TM=M;SS=M; 17.63 
431211; M86849; Hs.323733; gap junction protein, beta 2, 26kD (conn; connexin;TM=Y;SS=M; 16.94
429259; AA420450; Hs.380088; Plakophilin; none.none; 14.92 

417515; L24203; Hs.82237; ataxia-telangiectasia group D-associated; zf-B_box,zf-UBR1;TM=M;SS=N; 14.75

423634; AW959908; Hs.1690; heparin-binding growth factor binding pr; none;TM=M;SS=M; 14.45

418007; M13509; Hs.83169; matrix metalloproteinase 1 (interstitial; hemopexin,Peptidase_M10,Astacin,PG_binding_1;; 13.02

409632; W74001; Hs.55279; serine (or cysteine) proteinase inhibito; serpin;; 12.82 

406621; X57809; Hs.181125; immunoglobulin lambda locus; ig,HSP70,Ppx-GppA;TM=M;SS=N; 12.31

431958; X63629; Hs.2877; cadherin 3, type 1, P-cadhe in Cadhen'n_C_term;TM=Y;SS=M; 12.45 

446921; AB01 21 13; Hs.16530 si i nducit tokine subfamily A yiL8 1171 

401760; ; ; Target Exon; none,bromodomain; 11.68

407839; AA045144; Hs.161566; ESTs; cadherin,cadherin; 11.65

454034; NM_000691; Hs.575; aldehyde dehydrogenase 3 family, member ; aldedh;; 11.56

444781; NM_014400; Hs.11950; GPI-anchored metastasis-associated prote; UPAR_LY6,lactamase_B; 11.31
453857; AL080235; Hs.35861; Ras-induced senescence 1 (RIS1); none;TM=Y;SS=M; 11.03

I 111 1 L q I- Tf M I 1 

430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin;TM=M;SS=N; 10.58

419693; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 10.30

411274; NM_002776; Hs.69423; kallikrein 10; trypsin;TM=M;SS=N; 10.25

441633; AW958544; Hs.1 12242; normal mucosa of esophagus specifr I e;TM=M;! =M;9 i 

446989; AK001898; Hs.16740; hypothetical protein FLJ11036; none;TM=Y;SS=N; 9.74 

40207 ENSP00000; 1051 Pla i memb it :a!i urn rcne;; 9.50 

444381; BE387335; Hs.283713; hypothetical protein BC014245; Collagen;TM=M;SS=M; 9.50

431009; BE149762; Hs.48956; gap junction protein, beta 6 (connexin 3: connexin;l M=Y;SS=V; 9. 18 

439310; AF086120; Hs.102793; ESTs; casein_kappa,pkinase,ig,none; 9.43 

414987; AA524394; Hs.294022; hypothetical protein FLJ14950; SH2;TM=M;SS=N; 9.33

418004; U37519; Hs.87539; aldehyde dehydrogenase 3 familv nerr dedh;TM ' SS~V 9 14 

408000; L11690; Hs.198689; bullous pemphigoid antigen 1 (230/240kD); efhand,spectrin,GAS2,SH3,Plectin,RA,Xylose_isom,FliD,bZIP,Tropomyosin,Myc-LZ,M,ldh_C,CH,AIP3;TM=M;SS=N; 9.12
451541; BE279383; Hs.26557; plakophilin 3; Armadillo_seg;TM=M;SS=N; 9.11
425650; NM_001944; Hs.1925; desmoglein 3 (pemphigus vulgaris antigen; cadherin;TM=M;SS=M; 8.66
452240; AI591147; Hs.61232; ESTs; none,none; 8.57
429228; AI553633; Hs.356828; ESTs; none,none; 8.46

400289; X07820; Hs.2258; matrix metalloproteinase 10 (stromelysin; hemopexin,Peptidase_M10,Astacin;; 8.44

425071; NM_013989; Hs.154424; deiodinase, iodothyronine, type II; T4_deiodinase;TM=M;SS=Y; 8.15

407242; M18728; ; gb:Human nonspecific crossreacting antig; ig;TM=M;SS=M; 8.05 

407944; R34008; Hs.239727; desmocollin 2; cadherin.Cadherin.CJ m F in Ti M; 7.9I 

413278; BE563085; Hs.833; interferon-stimulated protein, 15 kDa; ubiquitin;; 7.82 

428330; L22524; Hs.2256; matrix metalloproteinase 7 (matrilysin, ; Peptidase_M10;; 7.82

417308; H60720; Hs.81892; KIAA0101 gene product; none;TM=M;SS=N; 7.77 

13753; U1 ); lls.7551 m n. t in (12 kD), kalinin; laminin 31 rnininjl m i 
423217; NMJ00094; Hs.1 640; collagen, type VII, alpha 1 (epidermolys; Kunilz_BPTI,fn3,™a,Collagen,bela-lac1amase;TM=M;SS i | i !; 7.71 
430686; NMJ01942; Hs.2633; desmoglein 1; cadherin,Cadherin_CJerm;TM=Y;SS=W, 7.6S 
412270; AC005262; Hs.73797; guanine nucleotide binding protein (G pr; G-alpha,arf;TM=M;SS=N; 7.54 
428484; AF104032; Hs.184601; solule carrie family 7 i I nic ji.il 

aa_permeases l pyridoxaLdeC,bromodcmain 1 PHD,MBD,AT_hook,DDT.PI3„P;4_kinase,FAT,FATC,BDlA,RUN;TU=M;SS=N;7.53 
418663; AK001100; Hs.41690; desmocollin 3; cadherin,Cadherin_C_term,none; 7.30
452281; T93500; Hs.28792; Homo sapiens cDNA FLJ11041 fis, clone PL; TGFb_propeptide,TGF-beta,none; 7.28
429211; AFG52693;Hs.198249; gap junction protein, beta 5 (-nnn. -n 3 , in r,Tf'=> SS=M; 7.26 
412719; AW016610; Hs.816; ESTs; none.none; 7.17 

446619; AU076643; Hs.313; secreted phosphoprotein 1 (osteopontin, ; Osteopontin;; 7.10

423961;D13666;Hs 1363-1 periostil (OSF-2os); Fasciclin;TO=M;SS=M; 7.09 i 
427666; AI791495; Hs.180142; calmodulin-like skin protein (CLSP); efhand;TM=M;SS=N; 7.08
4 LE 1 24; H 1 J-of 1D;tran bran 1:TM (' 7 c 

423673; BE003054; Hs.1695; matrix metalloproteinase 12 (macrophage ; hemopexin,Peptidase_M10;TM=M;SS=M; 7.03

401747; ; ; Homo sapiens keratin 17 (KRT17); none,bromodomain; 7.01

413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 6.98 

429002; AW248439; Hs.2340' iun rr 1 n , rm t r i i 1 _ il"=M SS=N,696 

432239; X81334; Hs.2936; matrix metalloproteinase 13 (collagenase; hemopexin,Peptidase_M10;; 6.87

417715; AW969587; Hs.86366; ESTs; none.none; 6.72 

422440; NM_004812; Hs.116724; aldo-keto reductase family 1, member B10; aldo_ket_red,ROK;TM=M;SS=N; 6.50

429359; W00482; Hs.2399; matrix metalloproteinase 14 (membrane-in; hemopexin,Peptidase_M10;TM=M;SS=M; 6.39

418844; M62982; Hs.1200; arachidonate 12-lipoxygenase; lipoxygenase,PLAT;TM=M;SS=N; 6.38

420039; NM_004605; Hs.376147; sulfotransferase family, cytosolic, 2B, ; Sulfotransfer;; 6.38

425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c;; 6.35

442599; AF078037;Hs.324051;RelA-assocated nl t I i T " 3=N; 6.30 

409420; Z15008; Hs.54451; laminin, gamma 2 (nicein (100kD), kalini; lamin'n_E,laminin_EGF;; 6.28 

424364; AW383226; Hs.163834; ESTs, Weakly similar to G01763 atrophin-; ras;TM=M;SS=N; 6.27

414812; X72755; Hs.77367; monokine induced by gamma interferon; IL8;TM=M;SS=Y; 6.23

424687; J05070; Hs.151738; matrix metalloproteinase 9 (gelatinase B; fn2,hemopexin,Peptidase_M10;; 6.22 

443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=M;SS=N; 6.21

428970; BE276891; Hs .1 94691 ; retinoic acid induced 3 (RAIG1); mctabo; 7tm_3;TIV=Y;SS=W; 6.12 

423017; AW178761; Hs.227948; serine (or cysteine) proteinase inhibito; serpin;; 6.08

424834; AK001432; Hs.153408; Homo sapiens cDNA FLJ10570 fis, clone NT; none,none; 6.08

426440; BE382756; Hs.169902; solute earner family 2 (facilitated glu; sugar_tnTM= i ;S 3= V; 6 3 4 

439335; AA742697; Hs.62492; NM_052863:Homo sapiens secretoglobin, fa; none;; 5.81

439223; AW238299; Hs.250618; UL16 binding protein 2; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 5.77

418054; NM_002318; Hs.83354; lysyl oxidase-like 2; SRCR,Lysyl_oxidase;TM=M;SS=M; 5.72

45409B; W27°53 lis 17493; Plakophili none none 5 71 

417900; BE250127; Hs.82906; CDC20 (cell division cycle 20, S. cerevi; WD40;TM=M;SS=N; 5.70

435505; AF200492; Hs.211238; interleukin-1 homolog 1; IL1;TM=M;SS=N; 5.69

406685; M18728; ; gb'.Human nonspecific crassrea. tinq q;TM=f 1 

430280; AA361258; Hs.237868; interleukin 7 receptor; fn3,none; 5.63 

430486; BE062109; Hs.241551; chloride channel, calcium activated, fam; none;TM=Y;SS=M; 5.61 

449722; BE280074; Hs.23960; cyclin B1; cyclin,cyclin_C;TM=M;SS=N; 5.51

439606; W79123; Hs.58561; G protein-coupled receptor 87;7tmJ;TM=Y;SS=l* 5.60 

452862; AW378065 H 8687 ADA VTS2 (ad n It F n M r i prol/ n none 5 58 

433662; W071 62; Hs.1 50826; RAB25 RAB25, member RASoncogene family, -a ,/ BC I an,arf;TM=M;SS=M; 5.57 
411296; BE207307; Hs.10114; growth suppressor 1; 2OG-FeII_Oxy;TM=M;SS=M; 5.55
433848; AF095719; Hs.93764; carboxypeptidase A4; Zn_carbOpept,Propep_M14;; 5.54 

6819; U777 1 r 05 pn.oncoqere pknas 5 48 
428368; BE440042; Hs.83326; matrix metalloproteinase 3 (stromelysin ; hemopexin,Peptidase_M10,Astacin;; 5.47
452747; BE153855; Hs.61460; Ig superfamily receptor LNIR; ig,Rhabd_glycop;TM=Y;SS=M; 5.46
444946; AW139205; Hs.156457; hypothetical protein FLJ22403; abhydrolase_2;TM=Y;SS=M; 5.42
413719; BE439580; Hs.75498; small inducible cytokine subfamily A (Cy; IL8;; 5.35
445033; AV652402; Hs.72901; cyclin-dependent kinase inhibitor 2B (p1; ank;; 5.28 
418462; BE001596; Hs.85266; integrin, beta 4; fn3,integrin_B,Calx-beta,EGF;TM=Y;SS=Y; 5.26
429554; NM_012275; Hs.207224; interleukin 1, delta; IL1;TM=M;SS=N; 5.14

421508; NM_004833; Hs.105115; absent in melanoma 2; PAAD_DAPIN,HIN;TM=M;SS=N; 5.13

439979; AW600291; Hs.6823; hypothetical protein FLJ10430; none;TM=M;SS=N; 5.11

427099; AB032953; Hs.173660; odd Oz/ten-m homolog 2 (Drosophila, mous; NHL;TM=M;SS=N; 5.! i

428227; AA321649; Hs.2248 mollmji r ly B iloTMVSS-i 08 

436396; AI683487; Hs.152213; wingless-type MMTV integration site fami; wnt,none; 5.07

406690; M29540; Hs.220529; carcinoembryonic antigen-related cell ad; ig;TM=M;SS=M; 5.05

453905; NM_002314; Hs.36566; LIM domain kinase 1; pkinase,LIM,PDZ,zf-PARP;TM=M;SS=N; 5.04

414035; Y00630; Hs.75716; serine (or cysteine) proteinase inhibito; serpin;; 5.00

413219; AA878200; Hs.118727; Homo sapiens cDNA FLJ13692 fis, clone PL; HLH,death,TNFR_c6,Acyl-CoA_hydro; 4.96
421506; BE302796; Hs.105097; thymidine kinase 1, soluble; TK;TM=M;SS=N; 4.93 

412140; AA219691; Hs.73625; RAB6 interacting kme ike (rat esin.Tropon ir TV=M;SS=N;4.92 

445537; AJ245671; Hs.12844; EGF-like-domain, multiple 6; EGF,MAM;; 4.91

428953; AA306610; Hs.348183; tumor necrosis factor receptor superfami; 50s_ribosomal,Ribosomal_L10,TNFR_c6,DEAD;; 4.90

436553; AW407157; Hs.181125; immunoglobulin lambda locus; ig,HSP70,Ppx-GppA;TM=M;SS=N; 4.89

447343; AA256641; Hs.236894; ESTs, Highly similar to S02392 alpha-2-m; none,none; 4.84

430024; AI808780; Hs.227730; integrin, alpha 6; integrin_A,FG-GAP;TM=Y;SS=M; 4.81

439706; AW872527; Hs.59761; ESTs, Weakly similar to DAP1_HUMAN DEATH; none,none; 4.80

444371; BE540274; Hs.239; forkhead box M1; Fork_head;TM=M;SS=N; 4.75 

428582; BE336699; Hs.185055; BENE protein; none;TM=Y;SS=M; 4.74 

419596; BE379320; Hs.91448; MKP-1 like protein tyrosine phosphatase; DSPc;; 4.69 

431630; NM_002204; Hs.265829; integrin, alpha 3 (antigen CD49C, alpha ; FG-GAP,Rhabd_glycop,integrin_A;TM=Y;SS=M; 4.69
422310; AA316622; Hs.98370; cytochrome P450, subfamily IIS, polypept; none,pkinase,fn3,ig; 4.68
418067; AI127958; Hs.83393; cystatin E/M; cystatin;; 4.66

414774; X02419; Hs.77274; plasminogen activator, urokinase; kringle,trypsin,plant thionins;; 4.64
456534; X91195; Hs.100623; phospholipase C, beta 3, neighbor pseudo; LIM,PDZ,pkinase;; 4.62
410418; D31382; Hs.63325; transmembrane protease, serine 4; ldl_recept_a,trypsin;TM=Y;SS=M; 4.60
417866; AW067903; Hs.82772; collagen, type XI, alpha 1; Collagen,COLFI,TSPN,laminin_G,CorA;; 4.60
438113; AI467908; Hs.8882; ESTs; 7tm_1,none; 4.60

418140; BE613836 II I i il r dr 1 2; non Tift SS=M;4.57 
408380; AF123050; Hs.44532; diubiquitin; ubiquitin;TM=V;SS=N; 4.55

422627; BE336857; Hs.118787; transforming growth factor, beta-induced; Fasciclin,ABC_tran,ABC_membrane,GTP_EFTU;TM=M;SS=M; 4.50
425247; NM_005940; Hs.155324; matrix metalloproteinase 11 (stromelysin; hemopexin,Peptidase_M10;; 4.50
418558; AW082266; Hs.86131; Fas (TNFRSF6)-associated via death domai; death,DED;; 4.48
408482; NM_000676; Hs.45743; adenosine A2b receptor; 7tm_1;TM=Y;SS=M; 4.48

414166; AW888941; Hs.75789; N-myc downstream regulated; DEAD,helicase_C,rrm,Ndr,Cys_knot,TIL,vwa,vwc,vwd,IQ,RIIa,abhydrolase,TGF-beta,DUF139,TPR,DSPc,tsp_1,Ribosomal_S21,ivp;TM=M;SS=N; 4.47
416178; AI808527; Hs.192822; serclnqiMi , I tn^ breast cane an no T"=M;SS=N; 4.47 
411789; AF245505; Hs.72157; Adlican; ig,LRR,LRRNT,LRRCT;TM=M;SS=M; 4.47
414561; AI064813; Hs.195155; Homo sapiens amino acid transport system; Aa_trans;TM=Y;SS=N; 4.47
422765; AW409701; Hs.1 578; baculoviral IAP repeal I n i r ( BIR;TM=f i 
427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;; 4.43 
418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK; Y_phosphatase,DSPc;TM=V;SS=N; 4.42 
409041; AB033025; Hs.50081; Hypothetical protein, XP_051860 (KIAA119; none;TM=M;SS=M; 4.41
406908; Z25437; ; gb:H.sapiens protein-tyrosine kinase gen; none,none; 4.40
450701; H39960; Hs.288467; hypothetical protein XP_098151 (leucine-; none,LRRCT,LRR; 4.40
409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; SH2,SH3,pkinase;TM=M;SS=N; 4.38

429500; X78565; Hs.289114; hexabrachion (tenascin C, cytotactin); EGF,fn3,fibrinogen_C,toxin_2,Keratin_B2;TM=M;SS=Y; 4.38
448569; BE382657; Hs.21486; signal transducer and activator of trans; SH2,STAT,STAT_bind,STAT_prot;TM=M;SS=N; 4.32 

1403108; Hs.132127; hypclhel lei 22; r,cne;TM=M N;4 

411573; AB029000; Hs.70823; KIAA1077 protein; Sulfatase;TM=M;SS=N; 4.31 
408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;SS=N; 4.31
418738; AW388633; Hs.6682; solute carrier family 7, (cationic amino; none,none; 4.30
437897; AA770561; Hs.146170; hypothetical protein FLJ22969; zf-DHHC,none; 4.29
424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lys,ig,FAD_Synth,ldh,ldh_C,pkinase;; 4.29 
414021; M63835; Hs.77424; Fc fragment of IgG hiyn i la it nTM-r SS-M 429 
404996; ; ; Target Exon; Peptidase_C1;TM=M;SS=M; 4.29

416539; Y07909; Hs.79368; epithelial membrane protein 1; PMP22_Claudin,oxidored_q5_N;TM=Y;SS=M; 4.23

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromoso; ABC_tran,M,SMC_N,SMC_C,DUF164,none; 4.25

421532; AW138207; Hs.146170; hypothetical protein FLJ22969; Armadillo_seg,HEAT;TM=M;SS=N; 4.25

424503; NM_002205; Hs.149609; integrin, alpha 5 (fibronectin receptor,; integrin_A,FG-GAP;TM=Y;SS=N; 4.24

414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); PA;TM=Y;SS=N; 4.24

439720; AI935202; Hs.31181; Homo sapiens cDNA: FLJ23230 fis, clone C; none,SDF,sugar_tr; 4.23

437044; AL035864; Hs.69517; differentially expressed in Fanconi's an; none;TM=M;SS=M; 4.23

409956; AW103364; Hs.727; inhibin, beta A (activin A, activin AB a; TGF-beta,TGFb_propeptide,Tub;; 4.20 

439453; BE264974; Hs.6566; thyroid hormone receptor interactor 13; AAA,ABC_tran,CoaE;TM=M;SS=N; 4.20

417389; BE260964; Hs.82045; midkine (neurite growth-promoting factor; PTN_MK;TM=M;SS=Y; 4.19

407137; T97307; ; gb:ye53h05.s1 Soares fetal liver spleen ; GDA1_CD39,none; 4.18 

419235; AW470411; Hs.288433; neurotrimin; none,none; 4.18

410290; AA402307; Hs.322844; hypothetical protein DKFZp564A176; Sema,PSI,TIG,integrin_B;TM=Y;SS=M; 4.18

456906; AF117646; Hs.156637; Cas-Br-M (murine) ectropic retroviral tr; zf-C3HC4,Cbl_N,Cbl_N2,Cbl_N3;TM=M;SS=N; 4.17

448775; AB025237, Hs 388; nudix (nucleoside diphosphate linked moi NUDI Tf V 3=W;4.I 

400288; X06256; Hs.149609; integrin, alpha 5 (fibronectin receptor,; integrin_A,FG-GAP;TM=Y;SS=N; 4.14

409799; D1 1928; H 76345 pho ph r i ft TMMSSN413 

445417; AK001058; Hs.12680; a disintegrin-like and metalloprotease w; tsp_1,Reprolysin,Pep_M12B_propep,none; 4.12

433895; AI287912; Hs.3628; mitogen-activated protein kinase ku e iir i ' M I'' MSS=N;4.12 

424490; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pkinase;TM=M;SS=N; 4.09 

419121; AA374372; Hs.89626; parathyroid hormone-like hormone; none,none; 4.08

415602; NM_006159; Hs.367895; Protein kinase C-binding protein NELL2; EGF,vwc,TSPN;; 4.07

424008; R02740; Hs.137555; putative chemokine receptor; GTP-binding; 7tm_1;TM=Y;SS=M; 4.07 

127747 \' i 1 [ r n ,Hn e;TM=M;SS4 ( 

427490; Z95152; Hs.178695; mitogen-activated protein kinase 13; pkinase;TM=M;SS=N; 4.03

439738; BE246502; Hs.9598; sema domain, immunoglobulin domain (Ig),; Sema,PSI,integrin_B;TM=Y;SS=N; 4.02
414883; AA926960; Hs.348669; CDC28 protein kinase 1; CKS;; 4.02

413186; AU077141; Hs.374548; solute carrier family 16 (monocarboxylic; sugar_tr;TM=Y;SS=M; 4.01

418203; X54942; Hs.83758; CDC28 protein kinase 2; CKS;; 4.01

406906; Z25424; ; gb:H.sapiens protein-serine/threonine ki; none,none; 3.98

450375; AA009647, Hs 3:::Z~> a disintegrin and metalloproteinase doma; Reprolysin,Pep_M12B_propep,disintegrin,Reprolysin,Pep_M12B_propep,disintegrin; 3.98

410687; U24389; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;; 3.96

410342; R31350; Hs.743; Fc fragment of IgE, high affinity I, rec; ITAM;TM=Y;SS=M; 3.95

425849; AJ000512; Hs.296323; serum/glucocorticoid regulated kir e; pkina ( \ se_ M=M;S M ' r 

417433; BE270266; Hs.82128; 5T4 oncofetal trophoblast glycoprotein; LRR,LRRNT,LRRCT;TM=Y;SS=M; 3.95

427792; M63928; Hs.180841; tumor necrosis factor receptor superfami; SRP14,TNFR_c6;; 3.93

407792; AI077715; Hs.39384; putative secreted ligand homologous to f; none;TM=M;SS=Y; 3.91 

424441; X14850; Hs.147097; H2A histone family, member X; histone,CBFD_NFYB_HMF;; 3.91

415989; AI267700; Hs.351201; ESTs; none,none; 3.90

423189; M59371; Hs.171596; EphA2; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 3.90 

443859; NM_013409; Hs.9914; follistatin; kazal;; 3.89

429612; AF062649; Hs.252587; pituitary tumor-transforming 1; none;; 3.89

419073; AW372170; Hs.183918; Homo sapiens cDNA FLJ12797 fis, clone NT; death,ZU5;; 3.88

450684; AA872605; Hs.25333; interleukin 1 receptor, type II; ig;TM=Y;SS=M; 3.88

428450; NM_014791; Hs.184339; KIAA0175 gene product; KA1,pkinase;TM=M;SS=N; 3.86

413441; AI929374; Hs.75367; Src-like-adapter; SH2,SH3;TM=M;SS=N; 3.84

437763; AA469369; Hs.5831; tissue inhibitor of metalloproteinase 1; TIMP,pkinase,DAG_PE-bind,RBD; 3.83
436291; BE568452; Hs.344037; protein regulatrrrfr.U esi; none;TM MSS-N382 
417512; X76534; Hs.82226; glycoprotein (transmembrane) nmb; PKD;TM=Y;SS=M; 3.81 
427647; W19744; Hs.180059; Homo sapiens cDNA FLJ20653 fis, clone KA; none,pkinase; 3.80
431629; AU077025; Hs.265827; interferon, alpha-inducible protein (do; none;TM=M;SS=Y; 3.80
434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis, clone HE; Nucleoside_tra2,none; 3.80
430378; Z29572; Hs.2555; tumor necrosis factor receptor superfami; IL2;; 3.79
428157; AI738719; Hs.198427; hexokinase 2; hexokina e.hexokina none; 3.78 

409512; AW979187; Hs.293591; melanoma differentiation associated prot; DEAD,helicase_C,CARD;TM=M;SS=N; 3.78 
417720; AA205625; Hs.208067; ESTs; none,none; 3.77

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 B-cell gr; Furin-like,pkinase,Recep_L_domain,YLP,none; 3.77 
449029; N28989; Hs.22891; solute carrier family 7 (cationic amino ; aa_permeases;TM=Y;SS=M; 3.76
413436; AF238083; Hs.68061; sphingosine kinase 1; DAGKc;TM=M;SS=N; 3.75 

416714; AF283770; Hs.79630; CD79A antigen (immnr lot lin sscciated q IT M,Zi clus TM=Y;SS=M; 3.74 

413281; AA861271; Hs.222024; transcription factor BMAL2; HLH,PAS;; 3.74

436481; AA379597; Hs.5199; HSPC150 protein similar to ubiquitin con; UQ_con;TM=M;SS=N; 3.74

431890; X17033; Hs.271986; integrin, alpha 2 (CMC alpha icj ™a,ir 3 rin_A,FG-GAP;TM=Y;SS=M; 3.74 . 

424118; BE269041; Hs.140452; cargo selection protein (mannose 6 phosp; perilipin;; 3.73 

426471; M22440; Hs.170009; transforming growth factor, alpha; EGF;TM=M;SS=M; 3.72

422487; AJ010901; Hs.198267; mucin 4, tracheobronchial; EGF,vwd,AMOP;; 3.72

450125; AA005418; Hs.158186; ESTs; CIDE-N,7tm_1,none; 3.71

427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 3.70

444006; BE395085; Hs.334762; type I transmembrane protein Fn14; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 3.70

422010; AA302049; Hs.31181; Homo sapiens cDNA: FLJ23230 fis, clone C; none,SDF,sugar_tr; 3.70

418969; W33191; Hs.28907; hypothetical protein FLJ20258; SH3;TM=M;SS=N; 3.69

415817; U88967; Hs.78867; protein tyrosine phosphatase, receplor-t; fr hosphct t n di - Tf !> 3 1 

BE Hit FXYt ii ( ing ion transport i TP1G1_PLV M TjT»' 

452341; ,V264357; Hs.55405; hypothetical protein MGC16212; Sulfate_transp,STAS;; 3.68
439625 Ar086453 H 1 1 1 J i • rk_head,glycolytic.enzy,Na_sulph_synp; 3.55 
426227; U67058; Hs.154299; Human proteinase activated receptor-2 mR; 7tm_1;TM=Y;SS=M; 3.66
452363; AI582743; Hs.94953; Homo sapiens, Similar to complement comp; C1q,Collagen;; 3.65
447365; BE383676; Hs.334; Rho guanine nucleotide exchange factor (; SH3,PH,RhoGEF;TM=M;SS=N; 3.64
418883; BE387036; Hs.1211; acid phosphatase 5, tartrate resistant; Metallophos; i M=tJ 3S=M; 3.64
438707; L08239; Hs.5326; amino acid system N transporter 2; porcu; ACAT,MBOAT;TM=Y;SS=M; 3.64 
422596; AF063611; Hs.118633; 2'-5'-oligoadenylate synthetase-like; ubiquitin;; 3.63
449318; AW236021; Hs.78531; Homo sapiens, Similar to RIKEN cDNA 5730; none;TM=M;SS=N; 3.62 
438746; AI885815; Hs.184727; Human melanoma-associated antigen p97 (m; transferrin,Guanylate_kin,PDZ,SH3; 3.62
446291; BE397753; Hs.14623; interferon, gamma-inducible protein 30; GILT;TM=M;SS=Y; 3.62 
452696; AI826645; Hs.211534; ESTs; ArfGap,PH,ank,Guanylate_kin,PDZ,SH3; 3.60
407634; AW016569; Hs.136414; UDP-GlcNAc:betaGal beta-1 3-N ,i In i il tr /I T TM=M;SS=Y; 3.59 
423575; C18863; Hs.163443; intron of periostin (OSF-2os); Fasciclin,none; 3.59

421391; AW304350; Hs.191958; immunoglobulin superfamily receptor tran; ig,none; 3.58
419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; ig,pkinase;TM=Y;SS=M; 3.58 
419912; AF249745; Hs.6066; Rho guanine nucleotide exchange factor (; SH3,PH,RhoGEF;TM=M;SS=N; 3.58 
431457; NM_012211; Hs.256297; integrin, alpha 11; FG-GAP,vwa;TM=Y;SS=M; 3.57
430379; AF134149; Hs.240395; potassium channel, subfamily K, member 5 ion. Tt M; 3 

418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxylic; none;TM=Y;SS=M; 3.55
441362; BE614410; Hs.23044; RAD51 (S. cerevisiae) homolog (E coli Re; none;; 3.53 
426500; NM_014638; Hs.170156; KIAA0450 gene product; C2,PI-PLC-Y;TM=M;SS=N; 3.53
429556; AW139399; Hs.314807; ESTs; none;TM=M;SS=N; 3.52 
449101; AA205847; Hs.23016; G protein-coupled receptor; 7tm_1;TM=Y;SS=M; 3.52
432636; AA340864; Hs.278562; claudin 7; PMP22_Claudin;TM=Y;SS=M; 3.51
433470; AW960564; Hs.351316; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 3.51

452203; X57522; Hs.352018; transporter 1, ATP-binding cassette, sub; ABC_tran,ABC_membrane,SRP54,Thymidylate_kin;TM=Y;SS=M; 3.49
425566; AW162943; Hs.250618; UL16 binding protein 2; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 3.48
402447; ; ; C1000201:gi|204416|gb|AAA02627.1| (L0519; none;TM=Y;SS=M; 3.48 

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endoplasmic retic; ER_lumen_recept;TM=M;SS=M; 3.48

448988; Y09763; Hs.22785; gamma-aminobutyric acid (GABA) A recepto; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 3.48

415323; BE269352; Hs.949; neutrophil cytosolic factor 2 (65kD, du; SH3,TPR;TM=M;SS=N; 3.48

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease(APEX ; Troponin,Exo_endo_phos,IQ;TM=M;SS=N; 3.47

424909; S78187, H' 1 , 75; cell division cycle 25B; Rhodanese;; 3.44

446051; BE048061; Hs.37054; ephrin-A3; Ephrin,A_deamin,dsrm,z-alpha; 3.43 

418641; BE243136; Hs.86947; a disintegrin and metalloproteinase doma; disintegrin,Reprolysin,Pep_M12B_propep,EGF;TM=Y;SS=M; 3.42
417426; NM_002291; Hs.82124; laminin, beta 1; laminin_EGF,laminin_Nterm,integrin_B;; 3.42
411263; BE2978C2 H-f i I kinesin-like (milct entrc r ine-inTM=M;SS=N; 3.42 

430044; AA464510; Hs.152812; ESTs; none,none; 3.42

425289; AW139342; Hs.155530; interferon, gamma-inducible protein 16; PAAD_DAPIN,HIN;; 3.39
425354; U62027; Hs.155935; complement component 3a receptor 1; 7tm_1;TM=Y;SS=M; 3.39
428293; BE250944; Hs.183556; solute carrier family 1 (neutral amino a; eIF6,SDF;TM=M;SS=N; 3.39
443648; AI085377; Hs.143610; ESTs; Fork_head,none; 3.39 

418869; AW516565; ; gb:xq01d05.x1 Soares_NHCeC_cervical_tumo; none,RasGAP,WW,IQ; 3.38
432179; X75208; Hs.2913; EphB3; EPH_lbd,fn3,pkinase,SAM;TM=Y;SS=M; 3.38
418918; X07871; Hs.89476; CD2 antigen (p50), sheep red blood cell ; ig;TM=Y;SS=M; 3.38
414368; W70171; Hs.75939; uridine monophosphate kinase; PRK,CoaE;; 3.37

408716; AI567839; Hs. 151714; Homo sapiens mRNA for KIAA1769 protein, ; UvrD helicase RNE unt;TM-M;! N 1 
457001; J03258; Hs.2062; vitamin D (1,25-dihydroxyvitamin D3) re; hormone_rec,zf-C4,Metallothio_5;TM=M;SS=N; 3.37
422283; AW411307; Hs.114311; CDC45 (cell division cycle 45, S.cerevis; CDC45;TM=M;SS=N; 3.37 

421317 aF1 460^4 H 03660; ATP-l et b-f r -i r r t an ABC membrane,GTP_ERU;TM=M;SS=M; 3.36 

400298; AA032279; Hs.61635; six transmembrane epithelial antigen of; none;TM=Y;SS=N; 3.35 

428385; AF112213; Hs.184062; putative Rab5-interacting protein; SH2,SH3;; 3.33

400261; ; Hs.1802; Eos Control; ig,MHC_II_beta;TM=Y;SS=M; 3.33

410024; AW191024; Hs.55016; hypothetical protein FLJ21935; SH3;TM=M;SS=N; 3.32

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 24; none,lectin_c; 3.32 

416065; BE267931; Hs.78996; proliferating cell nuclear antigen; PCNA,PCNA_C;TM=M;SS=N; 3.31 

426437; BE076537; Hs.169895; ubiquitin-conjugating enzyme E2L6; Armadillo_seg,UQ_con,none; 3.31

426840; BE244217; Hs.172690; diacylglycerol kinase, alpha (80kD); efhand,DAG_PE-bind,DAGKa,DAGKc,DC1;TM=M;SS=N; 3.31 

4 9; ALO 1 I 3 1 specif phospl e 7; DSPc;TM=M;SS=t 
418758; AW959311; Hs.172012; hypothetical protein DKFZp434J037; pkinase,RIO1;TM=M;SS=N; 3.31
424779; AL046851; Hs.153053; CD37 antigen; transmembrane4;TM=Y;SS=M; 3.31 
423973; AF038461; Hs.136574; arachidonate 12-lipoxygenase, 12R type; lipoxygenase,PLAT;TM=M;SS=N; 3.30

21 L1 19571; Hs,1420; fibrol Hit ptor 3 (ach p'lr 7 M 

449027; AJ271216; Hs.22880; dipeptldylpeptldase III; Peptldase_M49.EGF,in Neuregulin;TM=M;! S I] 28 
423778 Y09267 Hs. 132821 l n n ! e i FMO-I 1 I 1 > SS M 3 28 

426457; AW894667; Hs.330138; chimerin (chimaerin) 1; DAG_PE-bind,RhoGAP,SH2;TM=M;SS=N; 3.28
435523; T62849; Hs. 11090; membrane-spanning 4-domains, subfamily A nil" } i 
431886; L77964; Hs.271980; mitogen-activated protein kinase 6; pkinase;TM=M;SS=N; 3.27
430397; AI924533; Hs.105607; bicarbonate transporter related protein ; HCO3_cotransp;TM=Y;SS=N; 3.27
425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic; PI3_PI4_kinase,FAT,FATC;TM=M;SS=N; 3.26
446006; NM_004403; Hs.13530; deafness, autosomal dominant 5; none;TM=M;SS=M; 3.26
444783; AK001468; Hs.62180; anillin (Drosophila Scraps homolog), act; PH,none; 3.25
422278; AF072873; Hs.114218; frizzled (Drosophila) homolog 6; Fz,Frizzled,7tm_2;TM=Y;SS=M; 3.25
405932; ; ; C15000305:gi|3806122|gb|AAC69198.1| (AF0; ras;TM=M;SS=N; 3.25 
400205; ; Hs.81848; NM_006265':Homo sapiens RAD21 (S. pombe); DUF173;; 3.25 
432874; W94322; Hs.279651; melanoma inhibitory activity; SH3;TM=M;SS=Y; 3.24
14 1;t CI y o t e f eT ' t 1 

435472; AW972330; Hs.283022; triggering receptor expressed on myeloid; ig;TM=M;SS=M; 3.22 
439285; AL133916; Hs.47860; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 3.22 
410434; AF051152; Hs.63668; toll-like receptor 2; LRR,LRRCT,TIR;TM=M;SS=M; 3.22 
427318; AF186081, H- 175783 zinc r F . e p TA~\ SS-W 3 22 
436075; BE090176; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 3.22 
428698; AA852773; Hs.334838; KIAA1866 protein; none;NA;NA; 3.22 
448888; AW196663; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;SS=N; 3.22
415149; X12451; Hs.78056; cathepsin L; Peptidase_C1;; 3.21 

423393; R37772; Hs.21420; p21-activated protein kinase 6; pkinase,PBD;TM=M;SS=N; 3.21 

1 L29472; Hg.130 n rh nr it ity I MHC I bet TM t ) 

438564; AA381553; Hs.198253; major histocompatibility complex, class ; ig,MHC_II_alpha,none; 3.20
456181; L36463; Hs.1030; ras inhibitor; RA,SH2,VPS9;TM=M;SS=N; 3.20 

418613; AA744529; Hs.86575; mitogen-activated protein kinase kinase ; pkinase,CNH;TM=M;SS=N; 3.19

440682; AW362152; Hs.27161; nuclear receptor binding factor-2; cyclin,bZIP;TM=M;SS=N; 3.18

415010; NM_004203; Hs.77783; membrane-associated tyrosine- and threon; ank,pkinase,UPF0073;; 3.16

419216; AU076718; Hs.164021; small inducible cytokine subfamily B (Cy; IL8;; 3.16

450737; AW007152; Hs.63325; transmembrane protease, serine 4; trypsin,ldl_recept_a,none; 3.16

426395; BE151985; Hs.355669; hypothetical protein FLJ23316; pkinase,none; 3.15

412939; AW411491; Hs.75069; eukaryotic translation elongation factor; none,none; 3.15

433376; AI249361; Hs.74122; caspase 4, apoptosis-related cysteine pr; CARD,ICE_p10,ICE_p20;; 3.15

410668; BE379794; Hs.159651; hypothetical protein; death,TNFR_c6;TM=Y;SS-rY; 3.15 

431441; U81961; Hs.2794; sodium channel, nonvoltage-gated 1 alpha; ASC;TM=Y;SS=N; 3.15 

432251; AW972983; Hs.232165; polycythemia rubra vera 1; cell surface ; none;TM=M;SS=M; 3.15

407844; AW073^1 b H 3037; ESTs; I i ibrane4,none; 3.14 

408634; AW407254; Hs.356216; calmodulin 2 (phosphorylase kinase, delt; none,none; 3.14

423061; AI290473, H 1480 EST Negri Ser I CI TIG, none; 3.14 

438974; AF089816; Hs.6454; chromosome 19 open reading frame 3; PDZ;; 3.13

431236; AV656840; Hs.285115; interleukin 13 receptor, alpha 1; fn3;TM=Y;SS=M; 3.13

425394; AA356730; Hs.323949; kangai 1 (suppression of tumorigenicity ; transmembrane4,none; 3.13

429336; AB005038; Hs.199270; cytochrome P450, subfamily XXVIIB (25-hy; p450;; 3.13 

449230; BE613348; Hs.356392; melanoma cell adhesion molecule, ig i h.Ribosoma I F box TM=Y;SS=M; 3.13 

429305; AF095727; Hs.287832; myelin protein zero-like 1; ig,transmembrane4;TM=Y;SS=M; 3.12

419034; NM_002110; Hs.89555; hemopoietic cell kinase; SH2,SH3,pkinase;TM=M;SS=N; 3.12

417386; AL037228; Hs.301957; D123 gene product; NUDIX,secY,E1_dehydrog,transket_pyr;TM=Y;SS=M; 3.11

419138; U48508; Hs.89631; ryanodine receptor 1 (skeletal); ion_trans,SPRY,RYDR_ITPR,RyR,MIR;TM=Y;SS=N; 3.11

440006; AK000517; Hs.6844; NALP2 protein; PYRIN-Containing APAF1-H; AAA,NB-ARC,PAAD_DAPIN;NA;NA; 3.10 

406467; ; ; Target Exon; efhand,Acyltransferase,none; 3.10

422956; BE545072; Hs.122579; ECT2 protein (Epithelial cell transformi; BRCT,RhoGEF;TM=M;SS=N; 3.10
417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 3.09
437016; AU076916; Hs.5398; guanine monphosphate synthetase; PHD,SET,zf-CXXC,EGF,ank,notch,WW,FCH,GATase,GMP_synt_C,Occludin,YEATS,metalthio,EB,heme_1,RCC1,ZZ,FeThRed_A,ENTH,Ba^
441384; AA447849; Hs.288660; retinoic acid induced 3; 7tm_3,none; 3.09

416636; N32536; Hs.42645; solute carrier family 16 (monocarboxylic; none,none; 3.09 

416498; U33632; Hs.79351; potassium channel, subfamily K, member 1; ion_trans;TM=Y;SS=M; 3.09 

426108; AA622037; Hs.166468; programmed cell death 5; DUF122;TM=M;SS=N; 3.08

414806; D14694; Hs.77329; phosphatidylserine synthase 1; PSS;TM=Y;SS=V; 3.08 

402233; ; ; NM_030760":Homo sapiens endothelial diff; 7tm_1;TM=Y;SS=M; 3.07

430066; AI929659; Hs.237825; signal recognition particle 72kD; TPR,AIRC,SAICAR_synt; 3.07

425367; BE271188; Hs.155975; protein tyrosine phosphatase, receptor t; none;TM-W rt 0 

434253; N34895; Hs.79187; ESTs; ig,none; 3.06

443907; AU075484; Hs.9963; TYRO protein tyrosine kinase binding pro; none;TM=M;SS=Y; 3.05 

409378; U42387; Hs.54426; pancreatic polypeptide receptor 1; 7tm_1;TM=Y;SS=M; 3.05

410165; BE550228; Hs.71869; apoptosis-associated speck-like protein ; PAAD_DAPIN,CARD;TM=M;SS=N; 3.05 

440270; NM_015985; Hs.7120; cytokine receptor-like molecule 9; fn3;; 3.05

449003; X76342; Hs.389; alcohol dehydrogenase 7 (class IV), mu o; adh_zinc;TM=M;SS=N; 3.05

420189; AW296380; Hs.95821; osteoclast stimulating factor 1; SH3,ank;; 3.05 

429732; U20158; Hs.2488; lymphocyte cytosolic protein 2 (SH2 doma; SH2;; 3.05 

421541; NM_003942; Hs.105584; ribosomal protein S6 kinase, 90kD, polyp; pkinase,pkinase_C;TM=M;SS=N; 3.04
415444; BE247295; Hs.78452; solute carrier family 20 (phosphate tran; PHO4,LIM;TM=M;SS=N; 3.03
425118; AU076611; Hs.154672; methylene tetrahydrofolate dehydrogenase; myb_DNA-binding,THF_DHG_CYH,THF_DHG_CYH_C,CAP_GLY,AAA,LON,Peptidase_C9 1 bZIP,M 1 >;an_ur_permease,HCO3_cotransp;TM=M;SS=N; 3.03
449048; Z45051; Hs.22920; similar to S68401 (cattle) glucose induc; Lamp;TM=M;SS=M; 3.03
413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; none;TM=Y;SS=M; 3.02
442875; BE623003; Hs.23625; Homo sapiens clone TCCCTA00142 mRNA sequ; K_tetra,DUF51,none; 3.02 
436576; AI458213; Hs.77542; ESTs; 7tm_1,DnaJ; 3.02 

446269; AW263155; Hs.14559; hypothetical protein FLJ10540 nonr- r>' M , -V r 

418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fus; 7tm_1,7tm_2;TM=Y;SS=M; 3.01

421379; Y15221; Hs.103982; small inducible cyotmc su fa I L Cj H 3;TM=M;SS=Y 3 00 

421267; BE314724; Hs.103031; ribosomal protein S6 kinase, 70kD, polyp; pkinase,pkinase_C;TM=M;SS=N; 3.00 

409705; M37762; Hs.56023; brain-derived neurotrophic factor; NGF;; 2.99 

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, regulatory su; CDK5_activator,none; 2.99

430696; AA531276; Hs.59509; ESTs; pkinase,PP2C,none; 2.98 

418299; AA279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), ly; integrin_B,EGF,PSI;TM=Y;SS=M; 2.97

410026; AI912061; Hs.55016; hypothetical protein FLJ21935; none,none; 2.97

448733; NM_005629; Hs.187958; solute carrier family 6 (neurotransmitte; SNF;TM=Y;SS=N; 2.97

432562; BE531048; Hs.278422; DKFZP586G1122 protein; zf-C2H2;TM=M;SS=N; 2.97

453035; AW581943; Hs.334; Rho guanine nucleotide exchange factor (; none,none; 2.97

(2 i , 179949 H 175563 Homo sapiens mRNA; cDNA DKFZp554N0753 (f; none,spectrin,SH3,PH,CH; 2.97

431941; AK000106; Hs.272227; Homo sapiens cDNA FLJ20099 fis, clone CO; pkinase,Furin-like,Recep_L_domain,none; 2.96 

441389; AF134838; Hs.7835; endocytic receptor (macrophage mannose r; fn2,lectin_c;TM=Y;SS=M; 2.95

412276; BE262621; Hs.73798; macrophage migration inhibitory factor (; MIF,sugar_tr,none; 2.94 

427359; AW020782; Hs.7988t; Homo sapiens cDNA: FLJ23006 fis, clone L; 7tm_1,none; 2.94

402558; ; ; C1000201:gi|204416|gb|AAA02627.1| (L0519; none;TM=Y;SS=M; 2.94 

425852; AK001504; Hs.159651; death receptor 6, TNF superfamily member; death,TNFR_c6;TM=Y;SS=M; 2.94
442080; AW444761; Hs.7290t; ESTs; ank;; 2.94 

I HI F21222 H 5010; hypothetical protein P1 2; NTF2;TM=M;SS=nN 93 

444809; BE207568; Hs.208219; oculospanin; transmembrane4;TM=Y;SS=N; 2.93

449843; R85337; Hs ?4u i c lei rier f ily 31 (copper I ir T MrSS-M293 

416110; Z42262; Hs.322844; hypothetical protein DKFZp564A176; Sema,PSI,TIG,integrin_B;TM=Y;SS=M; 2.93

453768; BE382670; Hs.198511; Homo sapiens mRNA; cDNA DKFZp761H77 (fr; arf,G-alpha,none; 2.92 

414825; X06370; Hs.77432; epidermal growth factor receptor (avian ; Furin-like,pkinase,Recep_L_domain;TM=M;SS=M; 2.92

421429; NM_014922; Hs.104305; death effector filament-forming Ced-4-li; LRR,PAAD_DAPIN,AAA,CARD,NB-ARC;NA;NA; 2.92 

434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 2.91

451292; AB037716; Hs.26204; KIAA1295 protein; SH3;TM=M;SS=N; 2.91 

422127; AW504286; Hs.112049; SET binding factor 1; dDENN,DENN,GRAM,PH;; 2.91 

419508; AW997938; Hs.90786; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y;SS=M; 2.90

430451; AA835472; Hs.297939; cathepsin B; Peptidase_C1,pro_isomerase;; 2.90

424046; AF027866; Hs.138202; serine (or cysteine) proteinase inhibito; serpin;TM=M;SS=N; 2.89 

414907; X90725; Hs.77597; polo (Drosophila)-like kinase; Ribosomal_L37ae,pkinase,POLO_box,tRNA-synt_1b,dynamin,dynamin_2,GED,bZIP,M;; 2.89

429619; AL120751; Hs.211568; eukaryotic translation initiation facto; none,none; 2.89

413879; AA132951; Hs.212533; Homo sapiens cDNA: FLJ22572 fis, clone H; none,none; 2.89

417018; M16038; Hs.80887; v-yes-1 Yamaguchi sarcoma viral related ; SH2,SH3,pkinase;TM=M;SS=N; 2.89

422610; AF153820; Hs.1547; potassium inwardly-rectifying channel, s; IRK;TM=Y;SS=N; 2.89 

405556; ; ; homeodomain-interacting protein kinase 3; trypsin;TM=M;SS=N; 2.89 

423804; AW403448; Hs.1706; interferon-stimulated transcription fact; IRF,zf-C3HC4,IBR,zf-RanBP;TM=M;SS=N; 2.89
425262; D87119; Hs.155418; GS3955 protein; pkinase;; 2.88 

422599; BE387202; Hs.118638; non-metastatic cells 1, protein (NM23A) ; NDK,PH,Oxysterol_BP;; 2.88
452888; AW955454; Hs.30942; ephrin-B2; Ephrin,fn2;TM=Y;SS=M; 2.88 

414703; BE243877; Hs.380063; ATPase, Na? transporting, beta 3 polypep; Na_K-ATPase;TM=Y;SS=M; 2.87

444143; AW747996; Hs.160999; ESTs, Moderately similar to A56194 throm; Bcl-2,none; 2.86

413472; BE242870; Hs.75379; solute carrier family 1 (glial high affi; SDF;TM=Y;SS=M; 2.86

458039; AA835884; Hs.130685; leukotriene b4 receptor (chemokine recep; CIDE-N,none; 2.86

434417; AL110157; Hs.3843; Homo sapiens mRNA; cDNA DKFZp536F2224 (f; DSPc,none; 2.86

425802; Y14838; ; chemokine-like receptor 1; 7tm_1,none; 2.86

403112; ; ; Target Exon; efhand,C2,PH,PI-PLC-Y,PI-PLC-X;; 2.86

435563; AF210317; Hs.95497; solute carrier famlv 2 1, lital i gai t Tl .11 3 

442117; AW664964; Hs.128899; ESTs; hypothetical protein for IMAGE:447; none,none; 2.84

457819; AA057484; Hs.35406; FLJ20522 Hypothetical protein FLJ20522; none,none; 2.84

456629; AW891965; Hs.367942; histone deacetylase 3; HSP90,HATPase_c,zf-C2H2,PHD,none; 2.83

408873; AL046017; Hs.356216; calmodulin 2 (phosphorylase kinase, delt; none,none; 2.83

446947; AF146747; Hs.232165; polycythemia rubra vera 1; cell surface ; none;TM=M;SS=M; 2.83

448386, AB037750,Hs 21 11 132 t r r r [J P TM ^ h> 

427857; AL133017; Hs.288679; hypothetical protein FLJ22865; myosin_head,IQ,zf-MYND;TM=M;SS=M; 2.82
407601; AC002300; Hs.37129; sodium channel, nonvoltage-gated 1, beta; ASC;TM=Y;SS=M; 2.82 
459707; AA631362; Hs.120866; gb:np86b01.s1 NCI_CGAP_Thy1 Homo sapiens; 7tm_1,none; 2.82

422699; BE410590; Hs.119257; ems1 sequence (mammary tumor and squamou; SH3,HS1_rep;TM=M;SS=N; 2.82

438108; AI471795; Hs.287776; vanilloid receptor-related osmoticall; ank,ion_trans;TM=Y;SS=N; 2.82

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; kinesin,fn3,Y_phosphatase;TM=M;SS=N; 2.82

448595; AB014544; Hs.21572; KIAA0644 gene product; LRR,LRRCT;TM=Y;SS=M; 2.81 

423598; BE247600; Hs.377968; ESTs; 7tm_1;TM=Y;SS=M; 2.81

412970; AB026436; Hs.177534; dual specificity phosphatase 10; Rhodanese,DSPc;; 2.81

414193; AW5G5303;Hs.75812;phosphoenolpyruvate carboxylase 2 mil PEPCi 2 1 

436729; BE521807; Hs.351316; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 2.81

432314; AA533447; Hs.285173; ESTs; Xlink,none; 2.81 

416207; NM_014745; Hs.79077; Homo sapiens, clone MGC2908, mRNA comp; none;TM=Y;SS=M; 2.80
446985; AL038704; Hs.156827; ESTs, Weakly similar to ALU1_HUMAN ALU S; SAM,SH3,HS1_rep; 2.80
428023; AL038843; Hs.374530; Homo sapiens cDNA: FLJ23602 fis, clone L; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;SS=N; 2.80
432886; BE159028; Hs.279704; chromatin accessibility complex 1; none;TM=M;SS=N; 2.80 
426006; R49031; Hs.22527; ESTs; pkinase,TBC; 2.79

414217; AI309298; Hs.279898; Homo sapiens cDNA: FLJ23155 fis, clone L; none;NA;NA; 2.79
411165; NM_000169; Hs.69089; galactosidase, alpha; Melibiase;; 2.79

450056; BE047394; Hs.502; ESTs, Weakly similar to S71512 hypotheti; ABC_tran,ABC_membrane,ig,MHC_II_beta,SRP54,proteasome,ABC_membrane,ABC_tran; 2.78

424291; AL120051; Hs.144700; ephrin-B1; Ephrin;TM=Y;SS=M; 2.78

421448, AFD33850 UN pin pholil e D2; PH,PLDc,PX;TM=M;SS=N; 2.78 

410226; AI831958; Hs.61053; hypothetical protein; SH3,TPR;TM=M;SS=N; 2.78 

433535; AF111106; Hs.3382; protein phosphatase 4, regulatory subuni; HEAT;TM=M;SS=N; 2.78

442503; AF147078; Hs.375031; p53-responsive gene 5; K_tetra,ion_trans,none; 2.77

413900; AW409747; Hs.75612; stress-induced-phosphoprotein 1 (Hsp70/H; TPR,PDZ,WW,Guanylate_kin;TM=M;SS=N; 2.77
454294; AB000734; Hs.50640; JAK binding protein; SH2;TM=M;SS=N; 2.77
440188; AK001812; Hs.7036; N-Acetylglucosamine kinase; ROK;TM=M;SS=N; 2.77
449539; W80363; Hs.58446; ESTs; pkinase,Furin-like,Recep_L_domain,none; 2.76 
422667; H25642; Hs.132821; ESTs; FMO-like,FMO-like; 2.76 

415012; NM_004383; Hs.77793; c-src tyrosine kinase; SH2,SH3,pkinase;TM=M;SS=N; 2.75

402316; ; ; NM_013447:Homo sapiens egf-like module c; 7tm_2,GPS;TM=M;SS=M; 2.75

425465; L18964; Hs.1904; protein kinase C, iota; pkinase,DAG_PE-bind,pkinase_C,OPR;TM=M;SS=N; 2.75

447250; AI878909; Hs.17883; protein phosphatase 1G (formerly 2C), ma; PP2C;TM=M;SS=N; 2.75 

138629; AM H 2 7 i PS \ ly similar to T1 251 potheti TN' I c6,non 2 

451144; AW956103; Hs.61712; pyruvate dehydrogenase kinase, isoenzyme; HATPase_c,none; 2.74

408543; N78098; Hs.44289; ESTs; none;TM=M;SS=N; 2.74 

429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAM; 2.74

407722; BE252241; Hs.38041; pyridoxal (pyridoxine, vitamin B6) kinas; pfkB;TM=M;SS=N; 2.73

420602; AF060877; Hs.99236; regulator of G-protein signaling 20; RGS;TM=M;SS=N; 2.73

407217; AA477136; Hs.105584; ribosomal protein S6 kinase, 90kD, polyp; pkinase,pkinase_C;TM=M;SS=N; 2.73 

414135; NM_004419; Hs.2128; dual specificity phosphatase 5; Rhodanese,DSPc,Y_phosphatase;TM=M;SS=N; 2.73

410590; BE615216; Hs.64746; chloride intracellular channel 3; none;TM=M;SS=N; 2.73 

411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; none;TM=Y;SS=M; 2.73

438022; AW517524; Hs.135201; NOD2 protein; LRR.CARD GTPJ DC VI J hel se TM-M IS N 2 72 

420929; AI694143; Hs.326248; programmed cell death 4; MA3;TM=M;SS=N; 2.72

421155; H87879; Hs.102267; iysyl oxidase; Lysyl oxidase.Aldose 3pim,Epinrera 2 2 

448564; AL044962; Hs.21453; inositol 1,4,5-trisphosphate 3-kinase C; IPK;; 2.71 

449961; AW265634; Hs.133100; ESTs; pkinase,Furin-like,Recep_L_domain,none; 2.71 

444633; AF111713; Hs.12284; junctional adhesion molecule 1; ig;TM=Y;SS=M; 2.71

412259; AI5602^2 H 3 j pi i hrsphatase 2 (formerly 2A), reg; WD40;TM=M;SS=N; 2.71 

419569; AI971651; Hs. 91143; jagged 1 (Alagille syndrome), DSL E FL- inir i I io f «M z i 

452401; NM_007115; Hs.29352; tumor necrosis factor, alpha-induced pro; Xlink,CUB;; 2.71

458190; BE561793; Hs.21446; KIAA1716 protein; ASC,Galactosyl_T,none; 2.70

432126; AA865239; Hs.37196; ESTs; 7tm_1;TM=Y;SS=M; 2.70

422616; BE300330; Hs.118725; selenophosphate synthetase 2; AIRS,AIRS_C;TM=M;SS=N; 2.70
424717; H03754; Hs.152213; wingless-type MMTV integration site fami; wnt,none; 2.70
414108;AI267592;Hs.75761;SFRSproteinkinase1;ank,PH,Oxysterol_BPFln e;TM M;SS=N 2.7I 
446272; BE268912; Hs.14601; hematopoietic eel n I I i H3.HS1 TM M SS=N; 2.70 

116084; L1699 H 79006 df vln te kinase (Ihymidyla in; none ne;2 

427157; U51166; Hs.173824; thymine-DNA glycosylase; UDG;TM=M;SS=N; 2.69
404891; ; ; Target Exon; none,none; 2.69

432581; AU076465; Hs.278441; KIAA0015 gene product; PP2C;TM=M;SS=N; 2.69 
424321; W74048; Hs.1765; lymphocyte-specific protein tyrosine kin; SH2,SH3,pkinase;TM=M;SS=N; 2.68 
425308; M97639; Hs.155585; receptor tyrosine kinase-like orphan rec; ig,kringle,pkinase,Fz;TM=Y;SS=M; 2.68
414443; AU077268; Hs.76144; platelet-derived growth factor receptor,; ig,pkinase;TM=Y;SS=N; 2.68

427274; NM_005211; Hs.174142; colony stimulating factor 1 receptor, fo; ig,pkinase;TM=Y;SS=M; 2.68

436856; AI469355; Hs.127310; ESTs; pkinase,rrm;TM=M;SS=N; 2.68 

437429; H79981; Hs.5613; Homo sapiens mRNA; cDNA DKFZp564E2222 (f; SH2,SH3,BTB; 2.67

450690; AA296696; Hs.333418; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 2.67

452069; AB028949; Hs.183994; KIAA1026 protein; Metallophos;TM=M;SS=N; 2.67 

445330; R52656; Hs.21691; ESTs; 7tm_1,none; 2.67 

452698; NM_001295; Hs.301921; chemokine (C-C motif) receptor 1; 7tm_1;TM=Y;SS=M; 2.67
419754; H52299;Hs.308467; Homo sapiens mRNA; cDNA OFFZcV , 1523 (f i ne;TM=M;SS=N; 2.67 
434237; AF119908; Hs.235516; hypothetical protein PRO2955; none;; 2.67

445826; BE31 3754; Hs. 13350; Homo sapiens mRNA; cDNA CFFZpL" L r 'i n d ml | 1,ZU5,Nucleoside_tran; 2.66 

445696; AF279265; Hs.298476; solute carrier family 26, member 6; Sulf; te Ir nsp STAS,> , n jrr_permease;TM=Y;SS=N; 2.66 

439750; AL359053; Hs.57664; Homo sapiens mRNA full length insert cDN; IMPDH_C,IMPDH_N,CBS,integrin_B,Ricin_B_Iectin; 2.66 

413745; AW247252; Hs.75514; nucleoside phosphorylase; Mtap_PNP;; 2.65 

429083; Y09397, Hs 227 17; ECI elaled proleii i 1 TM=M,SS=N; 2.66 

449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; 7tm_1;TM=Y;SS=M; 2.66 

1 J A I 1 r RNAforFLJ00016 prol r t TM * >/ 

434979; AI953054; Hs.89543; Iransketolase (Wernicke Korsak ofr m r , srskekl se ransld pyr transketolaseJXpkinase; 2.66 
406137; ;;NM.000179-;Homo sapiens mutS(E K/ py> MutS_N;l f1SS=N 2 56 
412935; BE267045; Hs.75064; tubulin-specific chaperone c; none;; 2.66

408633; AW963372; Hs.222088; PRO2000 protein; bromodomain,AAA,Sigma54_activat;; 2.66

412817; AL037159; Hs.74619; proteasome (prosome, macropain) 26S subu; PC_rep;TM=M;SS=N; 2.65

452682; AA456193; Hs.374574; progesterone membrane binding protein; homeobox,none; 2.65

401752; ; ; RAN binding protein 3; SH2,STAT,STAT_bind,STAT_prot,ion_trans,PAC,PAS,Orexin; 2.65

450747; AI064821; Hs.129953; ESTs, Highly similar to 1318357A EWS gen; rrm,zf-RanBP,GAS2; 2.65 

425776; U25128; Hs.159499; parathyroid hormone receptor 2; 7tm_2,HRM;TM=Y;SS=M; 2.64

452701; NM_005110; Hs.30332; glutamine-fructose-6-phosphate transamin; GATase_2,SIS;TM=M;SS=N; 2.64

133933; AI754389; Hs.355397; Homo sapiens clone TCCCIA00164 mRNA sequ; none;NA;NA; 2.64 

421677; H64092; Hs.38282; ESTs; A1pp,Armadillo_seg,IBB; 2.64 

436469; AK001455; Hs.5198; Down syndrome critical region gene 2; none;; 2.64 

423198; M81933; Hs.1634; cell division cycle 25A; Rhodanese,none; 2.64

435905; AW997484; Hs.5003; KIAA0456 protein; SH3,RhoGAP,FCH;TM=M;SS=N; 2.64

437712; X04588; Hs.85844; neurotrophic tyrosine kinase, receptor, ; Tropomyosin,pkinase,LRR,LRRCT,Hydantoinase_B,Hydantoinase_A;TM=M;SS=N; 2.63

458946; AA009716; Hs.42311; ESTs; none,DSPc,Y_phosphatase; 2.63

447217; BE465754; Hs.17778; neuropilin 2; CUB,MAM,F5_F8_type_C;TM=M;SS=M; 2.63

445462; AA378776; Hs.288649; hypothetical protein MGC3077; none;; 2.63

425075; AA506324; Hs.1852; acid phosphatase, prostate; acid_phosphat;TM=Y;SS=M; 2.63

405588; ; ; NM_000299*:Homo sapiens plakophilin i / adi eg;TM=M;SS=N; 2.63 

4 1V Hs.25 NH.none; 2.63 

448243; AW369771; Hs.367688; integrin, beta 8; integrin_B,none; 2.63

45201 2; M307703;Hs 2^9766 Hno in' i i nber 4A; kinesin,DNA_topoisolV,K-box;Tr/=M;SS=N; 2.53 
412132.AA205588 h ) i lor, argininefterine-rich, 4; rrm,hormone_rec,zf-C4,sugEr_tr;; 2.63 

423887; AL080207; Hs.134685; DKFZP434G232 protein; ABC_tran;TM=Y;SS=N; 2.63

,1 7 4 2482 43.82212; CC ntiqen; ti ml ane4;TM •• 32 

413407; AI35623 3 II e dc plo 2 62 

414998; NM_002543; Hs.77729; oxidised lowdensity lipoprctp.n ,t tin; le r W= 33=M2b2 

417880; BE241595; Hs.82848; selectin L (lymphocyte adhesion rrcl le; EGFJectir T f< I 

429922; Z97630; Hs.226117; H1 histone family, member 0; linker_histone;TM=M;SS=N; 2.62

401812; ; ; sorting nexin 14; AAA,NB-ARC,APS_kinase,cdc48_N,cdc48_2,none; 2.61

417886; AA214584; ; ESTs; SPRY,7tm_3,ANF_receptor,none; 2.61 

457670; AF1 19666; Hs.23449; insulin receptor tyrosine kinase substra I TI f < 1 

428512; AI018187; Hs.375624; Human DNA sequence from clone RP11-243J1; none;; 2.61

126745 <0' t Hs.20 J; uridine monophosphate synthetase (orolat; Pribosyltran,OMPdecase;TM=M;SS=N; 2.61 
454042; H22570; Hs.47860; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 2.61

421077; AK000061; Hs.101590; hypothetical proieir art pkins e.death SPRY,SAP,Ribosomal_L24e,SRP54,dDENN,DENN,uDENN;TM=M;SS=N; 2.60 

420162; BE378432; Hs.95577; cyclin-dependent kinase 4; pkinase;TM=M;SS=N; 2.60 

416661; AA634543; Hs.79440; IGF-II mRNA-binding protein 3; KH-domain,rrm;TM=M;SS=N; 2.60

417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofilin_ADF;; 2.60

411133; AW819204; ; gb:CM1-ST0283-071299-061-h03 ST0283 Homo; ANF_receptor,none; 2.60

405602; ; ; Target Exon; pkinase;; 2.60 

400440; X83957; Hs.83870; nebulin; SH3,Nebulin;; 2.60 

Hs.327090;EST;SH3,PDZ 1 i 

432268; BE311856; Hs.274230; 3'-phosphoadenosine 5'-phosphosulfate sy; APS_kinase,ATP-sulfurylase;TM=M;SS=N; 2.59
452690; AI536070; Hs.15085; ESTs; pou,homeobox,lig_chan,ANF_receptor; 2.59
422753; AI928995; Hs.1575; small nuclear ribonucleoprotein D3 polyp; Sm;; 2.59
428028; U52112; Hs.182018; interleukin-1 receptor-associated kinase; death,pkinase;TM=M;SS=N; 2.58

433573; AF234887; Hs.57652; cadherin, EGF LAG seven-pass G-type rece, 7tm_2,EGF, dl> rin.l iumiin_EGF,laminin_G,Trypanjlycop,GPS,HRM;TM=Y;SS=M; 2.58 

422785; AI824114; Hs.289088; heat shock 90kD protein 1, alpha; zf-C2H2,none; 2.58

418685; U76376; Hs.87247; harakiri, BCL2-interacting protein (cont; none;TM=M;SS=M; 2.58

452329; N36626; Hs.29106; mitogen-activated protein kinase phospha; DSPc;TM=M;SS=N; 2.58

428405; Y00762; Hs.2266; cholinergic receptor, nicotinic, alpha p; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 2.58

421251; Z28913; Hs.102948; enigma (LIM domain protein); LIM,PDZ;; 2.57

407245. X90563 H 172004; titii ' ig C SG,pkinase;TM=M;SS=N; 2.57 

422309 II79745 4s Il49zl le carrier family 16 (rr r ar_tr;TM=Y;SS=M; 2.57 

401751; ; ; RAN binding protein 3; Orexin,SH2,STAT,STAT_bind,STAT_prot,ion_trans,PAC,PAS,none; 2.57

447887; AA114050; Hs.211610; caspase 8, apoptosis-related cysteine pr; ICE_p10,ICE_p20,DED;TM=M;SS=N; 2.57

422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 2.57

439863; BE547830; Hs.375208; paired immunoglobulin-like receptor beta; lipoxygenase,PLAT,lipoxygenase,PLAT; 2.57

425743; BE396495; Hs.159428; BCL2-associated X protein; Bcl-2;TM=Y;SS=N; 2.57

401218; ; ; eukaryotic translation elongation factor; ion_trans;TM=Y;SS=N; 2.57

412773; H15785; Hs.74573; similar to vaccinia virus HindIII K4L OR; PLDc;TM=M;SS=N; 2.57

444743; AA045648; Hs.301957; nudix (nucleoside diphosphate linked moi; NUDIX,secY,E1_dehydrog,transket_pyr;TM=Y;SS=M; 2.56

429782; NM_005754; Hs.220689; Ras-GTPase-activating protein SH3-domain; rrm,NTF2;TM=M;SS=N; 2.56

442994; AI026718; Hs.16954; ESTs; ank,pkinase,death,Ribosomal_S14; 2.56

456602; AA41 1607; Hs.118964; E£Tc, Weal h sir i lai to I I \A1150 prote r, ncne.pkinase; 2.56 

422846; BE513934; Hs.1583; neutrophil cytosolic factor 1 (47kD, chr; SH3,PX;TM=M;SS=N; 2.56

441699; AW511126; Hs.127572; ESTs; none,Aa_trans; 2.56

447912; AW575549; Hs.155728; ESTs, Weakly similar to I38022 hypotheti; none,GSHPx,ABC_tran; 2.56
442945; AI024849; Hs.131853; ESTs; pkinase,none; 2.55

453199; AI335266; Hs.32353; mitogen-activated protein kinase kinase ; pkinase;TM=M;SS=N; 2.56
451477; AI798425; Hs.42710; EST; SH3,none; 2.56

415091; AL044872; Hs 1 in 1, 3 h) droxy-3 me-hytgUaryl-Coenzyme A sy; HMG_CoA_synt;; 2.55 
413529; U11874; Hs.846; interleukin 8 receptor, beta; 7tm_1;TM=Y;SS=N; 2.55

425345; AU077297; Hs.155894; protein tyrosine phosphatase, non-recept; Y_phosphatase,DSPc;TM=M;SS=M; 2.55
401321; ;; receptor tyrosine kinase-like orphan rec n r l"M=M I 2 ' 
446999; AA151520; Hs.351416; hypothetical protein MGC4485; none,none; 2.55
4 1,3'" eukaryot translat n elongation factor; ion_trans,IQ;TM=Y;SS=N; 2,55 
414509; AW161311; Hs.76294; CD63 antigen (melanoma 1 antigen); transmembrane4;TM=Y;SS=M; 2.55
408204; AA454501; Hs.43666; protein tyrosine phosphatase type IVA, m; Y_phosphatase;TM=M;SS=N; 2.54
424539; L02911; Hs.150402; Activin A receptor, type I (ACVR1) (ALK; pkinase,Activin_recp;TM=M;SS=M; 2.54
459060; H89244; Hs.303627; heterogeneous nuclear ribonucleoprotein ; rrm,pkinase;TM=M;SS=N; 2.54
450167; AA446404; Hs.24563; NTF2-related export protein 1; NTF2;TM=M;SS=N; 2.54






425966; NM_001761; Hs.1973; cyclin F; cyclin,F-box,cyclin_C;TM=M;SS=N; 2.54 
446566; H95741; Hs.17914; membrane-spanning 4-d i ail , sjbfar i, non r>,< v =M;2.54 
412834; R77123; Hs.79881; Homo sapiens cDNA: FLJ23006 fis, clone L; 7tm_1,none; 2.54
457255; AL133011; Hs.253920; Homo sapiens mRNA; cDNA DKFZp434P201 (fr; none,none; 2.54
431341; AA307211; Hs.251531; proteasome (prosome, macropain) subunit,; proteasome;TM=M;SS=N; 2.53

417331;AW41129 1 )l c lt IC 1 C J AMP bmding,KRAB;TM=M;SS=N; 2.53 

414570; Y00285; Hs.76473; insulin-like growth factor 2 receptor; fn2,CIMR;TM=M;SS=M; 2.53 
444838; AV651680; Hs.208558; ESTs; integrin_A,FG-GAP,none; 2.53

422609 Z46023,Hs 118721 s i i t 1 cmalsial e FUR 0H2,SH3,pkinase;TM=Y;SS=M; 2.53 
450296; AL041949; Hs.24756; hepatocyte growth factor-regulated tyros; none,none; 2.53
400702; ; ; Target Exon; lig_chan,SBP_bac_3,ANF_receptor;TM=Y;SS=M; 2.53
432336; NM_002759; Hs.274382; protein kinase, interferon-inducible dou; dsrm,pkinase;TM=M;SS=N; 2.53
442b43 J3275b Hs.3/49 it D splicing fact r VvD40 2.52 
452060; W26980; Hs.349089; ATP-binding cassette, sub-family F (GCN2; ABC_tran,IRK,SWIB; 2.52
443951 F132.il' I t n light polypeptide; PMP22_Claudin,none; 2.52

428975; NM_004672; Hs.194694; mitogen-activated protein kinase kinase ; pkinase;; 2.52
407608; AI928218; Hs.380063; ATPase, Na? transporting, beta 3 polypep; none,none; 2.51
414482; 357498; Hs.76252; endothelin receptor type A; 7tm_1;TM=Y;SS=M; 2.51
410293;AK000047;Hs.61960;hypothPlin |.r i ir TM=MSS=N 251 
429663; M68874; Hs.211587; phospholipase A2, group IV (cyt otic 2.PLA B;TM=M S=N; 2 i
425424; NM_004954; Hs.157199; ELKL motif kinase; pkinase,UBA,KA1;TM=M;SS=N; 2.51
457013; AA037145; Hs.172865; cleavage stimulation factor, 3' pre-RNA,; WD40;TM=M;SS=N; 2.51
439221; AA737105; Hs.32250; ESTs, Moderately similar to I78885 serin; adh_short,Bcl-2,BH4,none; 2.51 
405429; ;; Tarqet Exon i phos phatase.none; 2.51 
443466 BE24 1 2 4 1 1U1 r li I ill r n; inducible I pkinase,RI01;TM=M;SS=N; 2.51
418478; U38945; Hs.1174; cyclin-dependent kinase inhibitor 2A (me; srtc 2.50 
408056; AA312329; Hs.42331; ephrin-A4; Ephrin;TM=M;SS=M; 2.50 
414419; F06829; Hs.76090; tumor necrosis factor, alpha-induced pro; K_tetra;TM=M;SS=N; 2.50
405369; ; ; NM_005569*:Homo sapiens LIM domain kinas; pkinase,LIM,PDZ;; 2.50
418216; AA662240; Hs.283099; AF15q14 protein; Hemagglutinin,squash;TM=Y;SS=N; 2.50
404321; ; ; C7001741*:gi|2499629|sp|Q63932|MPK2_MOUS; none,none; 2.50
430900; U91939; Hs.248123; G protein-coupled receptor 25; 7tm_1;TM=Y;SS=M; 2.49
440861; BE244115; Hs.7482; KIAA0682 gene product; rrm,Guanylate_kin;TM=M;SS=N; 2.49

415801; R24219; Hs.278443; Fc fragrnen' of IgG, lo\. /lit e r,T» =, 0 r =N 249 

418741; H83265; Hs.a881; ESTs, Weakly similar to S41044 chromosom; pkinase,Activin_recp,pkinase,Activin_recp; 2.49
417034; NM_006183; Hs.80962; neurotensin; none;; 2.49
400303; AA242758; Hs.79136; LIV-1 protein, estrogen regulated; none,none; 2.49
408805; H69912; Hs.48269; vaccinia related kinase 1; pkinase;TM=M;SS=N; 2.49 
418255; AW135405; Hs.37251; ESTs; pkinase,none; 2.49 
424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a) related k; pkinase;TM=M;SS=N; 2.44

417791; AW965339; Hs.44269; ESTs; none,fer2,FAD_binding_5,Ald_Xan_dh_C,fer2_2,Ald_Xan_dh_C2,CO_deh_flav_C; 2.44
453941; U39817; Hs.36820; Bloom syndrome; DEAD,helicase_C,HRDC;TM=M;SS=N; 2.41
417849; AW291587; Hs.82733; nidogen 2; EGF,ldl_recept_b,thyroglobulin_1;TM=M;SS=M; 2.39
408908; BE296227; Hs.250822; serine/threonine kinase 15; pkinase;; 2.32 
428513; BE220806; Hs.184697; plexin C1; PSI,none; 2.31

426761; AI015709; Hs.172089; PORIMIN Pro-oncosis receptor inducing me; none;TM=Y;SS=M; 2.31
427585; D31152; Hs.179729; collagen, type X, alpha 1 (Schmid metaph; C1q,Collagen;; 2.28
412723; AA648459; Hs.335951; hypothetical protein AF301222; none;TM=M;SS=N; 2.28
452461; N78223; Hs.108106; transcription factor; zf-C3HC4,ubiquitin,PHD,YDG_SRA;TM=M;SS=N; 2.26
429547; AW009166; Hs.99376; FGENESH predicted novel secreted protein; none,none; 2.15

429486; AF155827; Hs.203963; hypothetical protein FLJ10339; SNF2_N,helicase_C;TM=M;SS=N; 2.15
401486; ; ; C4000647*:gi|4758508|ref|NP_004253.1| ai; none;TM=Y;SS=M; 2.15
416209; AA236776; Hs.79078; MAD2 (mitotic arrest deficient, yeast, h; HORMA;TM=M;SS=N; 2.14
424399; AI905687; Hs.348419; AI905687:IL-BT095-190199-019 BT095 Homo; none;TM=M;SS=M; 2.14
423761; NM_006194; Hs.132576; paired box gene 9; PAX;TM=M;SS=N; 2.13

439670; AF088076; Hs.59507; ESTs, Weakly similar to AC004858 3 U1 sm; none,none; 2.13
439318; AW837046; Hs.6527; G protein-coupled receptor 56; 7tm_2,CytC_asm,GPS;TM=Y;SS=M; 2.03
445019; AI205540; Hs.281295; ESTs; none,none; 2.00
443211; AI128388; Hs.143655; ESTs; none,none; 1.98
449448; D60730; Hs.57471; ESTs; none,none; 1.92

435243; AW292886; Hs.348932; hypothetical protein dJ434014.3; IRF,none; 1.85

406360; ; ; Target Exon; WD40;TM=M;SS=N; 1.84

411368; X72925; Hs.69752; desmocollin 1; cadherin;TM=Y;SS=N; 1.84

453102; NM_007197; Hs.31664; frizzled (Drosophila) homolog 10; Fz,Frizzled,7tm_2;TM=Y;SS=M; 1.79
419183; U60669; Hs.89663; cytochrome P450, subfamily XXIV (vitamin; p450;; 1.78

420344; BE463721; Hs.97101; putative G protein-coupled receptor; Methyltransf_5;TM=Y;SS=M; 1.77

432842; AW574093; Hs.334822; hypothetical protein MGC4485; Ribosomal_L4;TM=M;SS=N; 1.76

419743; AW408762; Hs.5957; Homo sapiens clone 24416 mRNA sequence; none,none; 1.73

426427; M86699; Hs.169840; TTK protein kinase; pkinase;; 1.62
437915; AI637993; Hs.202312; Homo sapiens clone N11 NTera2D1 teratoca; none,none; 1.58

433336; AF017986; Hs.31386; secreted frizzled-related protein 2 (str; Fz,NTR;; 1.50

434377; AW137148; Hs.306593; intron of periostin (OSF-2os); Fasciclin,none; 1.47

451592; AI805416; Hs.213897; ESTs; none,none; 1.47

404927; ; ; Target Exon; Galactosyl_T;TM=M;SS=Y; 1.28
421552; AF026692; Hs.105700; secreted frizzled-related protein 4; Fz,NTR;; 1.27

427335; AA448542; Hs.278444; G antigen 7B; none;; 1.25 

431808; M30703; Hs.270833; amphiregulin (schwannoma-derived growth ; EGF;TM=Y;SS=M; 1.24 
447993; AW139525; Hs.170362; ESTs; none,none; 1.21

428182 BE386042 H 29 E y I I MAN G AN1 one;TM=M;SS=N; 1.13 

453537; NM_002589; Hs.34073; BH-protocadherin (brain-heart); cadherin;TM=Y;SS=M; 1.14
438274; AI918906; Hs.55080; ESTs; PAX,none; 1.14 

453966; BE148734; Hs.63325; transmembrane protease, serine 4; trypsin,ldl_recept_a,none; 1.10
413268; AL039079; Hs.75256; regulator of G-protein signalling 1; RGS;TM=M;SS=N; 1.07
TABLE 25B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession



418869 0_0 AA229762 AA230035
425802 88B4_3 AA122298 AA360788
417886 1Q31334J AA210987 D57294 AA214584 AA207006 D56572
411133 1070995J AW819203 AW819204 AW819197 AW819202 AW819211 BE158469 AW819221 BE158473 AW819235 AW819207 AW819220 AW819208 AW819238 AW819198 AW819234

TABLE 25C 



Pkey: Unique number corresponding to an Eos probeset

Dunham I. et al., Nature (1999) 402:489-495.



401781 
401780 
401760 
402075 
401747 



404891 
406137 
401752 



401751 
401218 
401321 
401057 
400702 




"Dunham I. et al." refers to the publication entitled "The DNA

121907-122035,122804-122921,124019-12416
118596-118816,119119-119244,119609-11975 
37999-38145,38652-38998,39727-39872,4055 



182212-182958 
90281-91477 

19047-19145,21133-21293,33968-34069



113051-113195 
10751-10919,18817-19052,22131-22328 
84974-85125 
30487-31058 
144600-144794 
46180-46366 



44647-44778 
139165-139322 
40793-41031 
104278-104748 



11457-11585,26311-26536,27902-28067,3204
51577-51723 

34183-34357,35686-35751 
76594-77805 

32585-32756,35281-36540,40791-40933,4401 
7513-7673 



TABLE 26A: 834 GENES UP-REGULATED IN EWING'S SARCOMA COMPARED TO NORMAL ADULT TISSUES 

Table 26A lists about 834 genes up-regulated in Ewing's sarcoma compared to normal adult tissues. These were selected from 35403 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" Ewing sarcoma to "average" normal adult tissues was greater than or equal to 1.5. The "average" Ewing sarcoma level was set to the
75th percentile amongst Ewing sarcomas. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific
background levels of non-specific hybridization, the 7.5th percentile value amongst non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
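
The selection rule described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the procedure actually used to build Table 26A: the function names (`percentile`, `expression_ratio`, `is_selected`), the linear-interpolation percentile, and the handling of a non-positive denominator are all assumptions introduced here. The percentile defaults (75, 85 and 7.5) and the 1.5 cutoff are taken from the text as printed.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (an assumed method)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def expression_ratio(
    tumor: Sequence[float],
    normal: Sequence[float],
    tumor_pct: float = 75.0,      # "average" tumor level, per the text
    normal_pct: float = 85.0,     # "average" normal tissue level, per the text
    background_pct: float = 7.5,  # background percentile as printed in the text
) -> float:
    """Background-subtracted ratio of tumor to normal expression for one probeset."""
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        # The text does not say how this case is handled; raising is an assumption.
        raise ValueError("normal level does not exceed background")
    return numerator / denominator


def is_selected(tumor: Sequence[float], normal: Sequence[float], cutoff: float = 1.5) -> bool:
    """True if the probeset meets the ratio threshold stated for the table."""
    return expression_ratio(tumor, normal) >= cutoff
```

Because the background value is subtracted from both the numerator and the denominator, a probeset whose normal-tissue signal is mostly non-specific hybridization ends up with a small denominator, and so a larger ratio, than a plain quotient of the two percentiles would give.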

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number
UniGeneID: UniGene number
UniGene Title: UniGene gene title
R1: Ratio of Ewing sarcoma to normal tissue

Pkey ExAccn UniGeneID UniGene Title R1

101447 M21305 

115881 NM_005756 Hs.184942

110278 AF061573 Hs.19492 

121362 AF050147 Hs.97932 

101104 AW862258 Hs.169266 neuropeptide Y receptor Y1 
AL134708
124006 AI147155
110728 AA737106 
105782 H09748 
102836 U94320 
104691 U29690 
121231 AA814948 
129526 S696B1 
119791 AA554907 
116301 AW969706 
123308 C14187 
127742 AW293496 
131601 NM_007115
127489 AA650250 
AW872527 
101063 D54745 
134570 U66615 
100299 D49493 
127987 AI022103 
131313 R96290 
126799 AW753865 
125847 AW161885 
100380 D82343 
114837 BE244930 
123049 BE047680 
129977 NM_000399 
127695 AA714731 
125186 AA510620 
118644 AA443241 
119717 AA918317 
101879 AA176374 
113003 AW292315 
126545 AA316181 
101050 AU077324 
116790 AW161357 
119082 AF252297 
i AF091085 
126098 M79088 
M78772 
126426 AA125984 
131307 NM_000025



Hs.37744 
Hs.96343 
Hs.177582 
Hs.58291 ESTs 
Hs.293332 ESTs 
Hs.103538 ESTs 



ESTs, Moderately similar to I78885 serin 
B-cell CLL/lymphoma 11B (zinc finger pro



Hs.272076 ESTs 



ESTs, Weakly similar to DAPLHUMAN DEATH 
cholecystokinin 

SWI/SNF related, matrix associated, acti 
growth differentiation factor 10 



Hs.74376 

Hs.249034 

Hs.18551 

Hs.166895 

Hs.211869 

Hs.1395 

Hs.291457 

Hs.181244 

Hs.336629 

Hs.57987 

Hs.243886 

Hs.61635 
Hs.1832 
Hs.101174 
Hs.91546 



AB033064 
AI671245 

112268 W39609 

106516 AL137311 

128132 AA225632 

129012 R81936 

125447 AI582222 

134676 W28051 

119040 R02394 

128391 AW188326 

123829 AF251237 

I AA521665 

126872 AW450979 

101266 L36645 

121309 AA293834 

130637 AA356764 

125464 N71807 

135175 M91463 

107599 AW664072 

102681 Y08890 

131688 AI935413 

120147 AI917116 

110343 AW136703 

127664 AA806164 

103075 NM_001034 
126127 N95428 



127262 AA828125 



128686 ESTs 



Hs.234074 

Hs.336629 
Hs.128686 
Hs.87819 
Hs.269435 
Hs.170652 
Hs.112208 
Hs.208957 



ESTs 



re tissue) protein 



dickkopf (Xenopus laevis) homolog 2
early growth response 2 (Krox-20 (Drosop
ESTs, Weakly similar to heterogeneous r
major histocompatibility complex, class
ribosomal protein L44
B-cell CLL/lymphoma 11B (zinc finger pro



hypothetical protein 



Hs.210836 ESTs 



Stra 



gb:zn27h06.r1 Stratagene ne 
adrenergic, beta-3-, receptor 
gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens
ESTs

ATPase, Class V, type 10D

KIAA1238 protein
hypothetical protein FLJ14594

I DN [ 61 1 

gb:nc08a07.r1 NCI_CGAP_Pr1 Homo sapiens



gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su



integral membrane protein 2A
gb:yz29d09.r1 Soares_multiple_s
• ' iily2((acilitatedglu 




(Drosophila) homolog 2 



Human DNA sequence from clone RP4-530I15 
trinucleotide repeat containing 1
gb:od71a09.s1 NCI_CGAP_Ov2 Homo sapiens
106472 AI207162

125032 T74884 

127315 AF116622 

126600 AA699949 

120325 AA195651 

127256 AI738610 

117357 N24829 

126735 M69113 

102745 AW753865 

128040 AW500486 

129706 AA443241 

107731 AA016086 

128283 AI076570 

125165 W45350 

111148 AB020690 

105577 AW852257 

128301 U90552 
130262 D63216 
132967 AA316181 
102479 NM_001991
128531 H03721 
126165 AI741816 
126086 H75681 
118967 AI668670 
120830 AI568170 
127229 AA316181 
129428 AA256906 
110151 H18835 
131381 M92642 
133761 AF041430 
125590 R23858 



Hs.191385 
Hs.104106 
Hs.257967 



Hs.153684 
Hs.61635 
Hs.194669
Hs.2953
Hs.125897

Hs.216756 



126021 AA775894 Hs.187516 ESTs 



splicing factor proline/glutamine rich (

ribosomal protein L44

ESTs, Weakly similar to I33022 hypotheti

ESTs

gb:zc81h08.s1 Pancreatic islet Homo sapi
paraneoplastic antigen MA2
C-terminal binding protein 2
butyrophilin, subfamily 3, member A1
frizzled-related protein
six transmembrane epithelial antigen of
enhancer of zeste (Drosophila) homolog 1 
ribosomal protein S15a 
ESTs 

gb:yr77g01.r1 Soares fetal liver spleen

ESTs 

ESTs 

six transmembrane epithelial antigen of 
ESTs, Weakly similar to ubiquitous TPR m 
hypothetical protein FLJ20041 
collagen, type XVI, alpha 1 
brain protein 13 

Homo sapiens, clone IMAGE 3340937, mRNA,
gb:C05723 Human pancreatic Islet Homo sa 



113119 T47910 

123110 AA486256 

113283 T66813 

107711 W96141 

128992 H04150 

106111 AW875398 

129948 AI537162 

125728 AW954565 

116728 F13687 

103100 NM_005574 

124971 T23800 

131019 W28614 

128671 AI885045 

111795 AI435437 

119127 AA70B035 

117602 N35020 

111898 R38944 

131916 AA025976 

130850 AB040922 

100571 L14561 

126722 N66148 

123720 AA609734 

113609 T93263 

131136 AB033099 

129001 AA443323 

133529 W45623 

107593 AI093688 

123910 AA621262 

128817 BE395776 

103080 AU077231 

128367 AW611791 

123729 AL039779 

112342 AW410273 

114721 D61939 

127768 AW085002 

127706 AI174238 

126029 AA704253 

124250 AA350256 

117265 AA451966 

112501 AA972447 

129079 AK000157 

127252 AI049545 
129228 U40714 
114092 H81213 
109252 BE440157 
127889 AI147408 



Hs.193510
Hs.12947 
Hs.220687 



chaperonin containing TCP1, subunit 2 (b
potassium inwardly-rectifying channel, s
gb:ym37a05.r1 Soares infant brain 1MB H
uncharacterized hypothalamus protein HBE
gb:yb18b11.s1 Stratagene fetal spleen (9
EST 



Hs.151001 
Hs.306155 
Hs.211586 
Hs.24567 



Hs.1 83475 Homosai 



ls.16875 



B-cell CLL/lymphoma 11B (zinc finger pro
EST

LIM domain only 2 (rhombotin-like 1)

hypothetical protein FLJ14728

chorionic somatomammotropin hormone 1 (p 

ECTs'fwe^l'^Tm^ar r to S KB F3 JHU^IAN N U CL 
ESTs 

C3HC4-like zinc finger protein



Hs.107812
Hs.74571
Hs.60051
Hs.179923
Hs.158640



ATPase, Ca++ transporting, plasma membra 

HSPC033 protein 

EST 

ESTs, Weakly similar to S23650 retrovir

KIAA1273 protein

BPOZ protein

ADP-ribosylation factor 1

ESTs

ESTs, Weakly similar to S65657 alpha-1C-
ankylosis, progressive (mouse) homolog
cyclin D1 (PRAD1: p; ' 




Hs.14825 
Hs.85944 
Hs.144941



Homo sapiens mRNA; cDNA DKFZp434K087 (fr 
hypothetical protein FLJ20150 
DnaJ(Hsp40) homolog, suf " 
tyrosyl-tRNA synthetase 
ESTs, Weakly similar to KIAA1503 protein
ESTs, Weakly similar to RMS1_HUMAN REGUL
















AA121123 










AA004798 




ESTs, Weakly similar to T00351 hypotheti
















BE262438 




beta-1,4 mannosyltransferase




































BE275031 




hypothetical protein Wi ;i 2555 






W04606 






























AI338993 








103319 














Hs.237955 
















































AI803516


















































AB015427


















































BE293520 


Hs.18910 


prostate cancer overexpressed gene 1 
























































AW162758




















AA447453 




Homo sapiens mRNA; cDNA DKFZp586M0723 (f 










ATP synthase, H+ transporting, mitochond






AW293689 














cytochrome P450, subfamily IIA (phenobar
















AW966243 




hypothetical protein FLJ12650






BE391579 












Hs.26038 


ESTs, Weakly similar to I38022 hypomet 






AI580150




















AU077002 


Hs.24950 


















AW069534 








































BE390125 


Hs.143187 




































AI866286










AL050391 


Hs.321247 
















129794 AF161399 Hs.23259 hypothetical protein FLJ13433 2.2
100253 D38024 Hs.157425 2.2
130743 AL049266 Hs.18724 Homo sapiens mRNA; cDNA DKFZp564F093 (fr 2.2
125466 R08234 Hs.180461 ESTs 2.2
122682 AA984531 Hs.159293 ESTs 2.2
133347 BE257758 Hs.71475 acid cluster protein 33 2.2
104455 AL110261 Hs.157211 DKFZP586B0621 protein 2.2
116332 AA491208 Hs.62620 chromosome 6 open reading frame 1 2.2
131163 AA099524 Hs.23754 ESTs 2.2
109592 AI198059 Hs.26370 ESTs 2.2
128721 AW403911 Hs.266175
114046 BE018658 Hs.141003 Homo sapiens cDNA: FLJ21691 fis, clone C 2A
AI190914 Hs.143880 ESTs 2.1
103163 AU077018
112379 AK001713 
127507 AA249573 
133097 W03512 
126153 H85692 
122110 AI123000
100554 M95923 
104799 AA029703 
132664 AI740461 
114620 AA642974 
115348 AA281562 
133231 AK000517 
133160 N54968 
124656 AW297702 
133576 M19650 
132676 N92589 
126505 AA282881 
118865 AA736405 
134267 AI174596 
134104 L35253 
133493 AW998046 
112853 T02843 
117457 N29682 
112246 R51321 
134869 AL157518 
128869 AA768242 
129179 AW969025 
104857 AI920902 
101651 AL037111 
129726 H15474 
117186 H98988 
125271 AI250773 
116925 H73110 
128458 T23625 
116031 AA452239
130724 AK001507
121897 AA427419
123808 AA620552
122333 AA625872
127841 AW136558
100023 

113002 BE243513 
111567 F12628 
113697 T97183 
128033 AI248705 
105225 AA211777 
112370 AF052095 
132766 BE083422 
113226 AI821008 
117997 N52090 
116996 H83935 
127002 AL353940 
122591 AI188219
107279 S57296 
103898 AA248884 
110312 BE256986 
127447 AA386192 
128352 AW137413 
113649 N94768 
128275 AI218235 
125976 AA436760 
120820 AA347417 
134937 AI251449
129602 AI282193
129535 AA397972 



128148 AA918175

125982 R98091 

125746 AL137506 

127835 AA748762

100661 BE623001 

124282 AA018408 

126926 AA179472

100221 D28383 

126053 H64450 

100944 L07518 

125581 AI272848 

128604 AI879099 

114612 AI124557 

130453 U80735 



Hs.54542 
Hs.292100 



hypothetical protein FLJ10851
ESTs, Moderately similar to ZN91_HUMAN Z
hypothetical protein MGC13272
ESTs 

melanocortin 1 receptor (alpha melanocyt 
gb:Human 12-lipoxygenase mRNA. partial c 
gb:ze95h08.s1 Soares fetal_heart_NbHH19W 
ESTs 

gb:nr60h01.s1 NCI_CGAP_Lym3 Homo sapiens 



Hs.102915 
Hs.150741 
Hs.261038 
Hs.190057
Hs.54530 



Hs.44071 
Hs.25780 
Hs.90421 
Hs.80618 
Hs.109154 
Hs.19058 
Hs.75641 
Hs.132898
Hs.42612
Hs.270012
Hs.260603
Hs.150580



Hs.7212 
Hs.334786 
Hs.17992 
Hs.149321 



ESTs 

RAE1 (RNA export 1, S.pombe) homolog 
mitogen-activated protein kinase 14 
arginine-glutamic acid dipeptide (RE) re
gb:FB11H5 Fetal brain, Stratagene Homo s
ESTs, Weakly similar to ALU5_HUMAN ALU
H n sapiens DNATU12 52 fis Her W
PRO2463 protein
hypothetical protein 
ESTs 



ESTs, Weakly similar to 



ns clone FLB6914 PR01821 mRNA 
EST, Weakly similar to ZN91_HUMAN ZINC 
gb:ae58g11.s1 Stratagene lung carcinoma 
ESTs, Moderately similar to T34551 hypot 



in PP1044 
hypothetical protein MGC16040 
Homo sapiens mRNA; cDNA DKFZp434J1726 (f 
ESTs 

gb:zn57d02.s1 Stratagene muscle 937239 H

Homo sapiens clone 23911 mRNA sequence

hypothetical protein MGC2668 

ESTs 

EST 

ESTs 

hypothetical protein DKFZp761P1010 
ESTs, Weakly similar to HSJ2_HUMAN DNAJ 
v-erb-b2avi 

gb:k3517.seq.F Human fetal he 
hypothetical protein FLJ12089
Homo sapiens cDNA FLJ11903 fis, clone HE
ESTs 

ESTs, Weakly similar to KIAA1435 protein 



rl Soares Jota!_fetus_Nb2HFB_ 



s.171939 ESTs 



Hs.169965 
Hs.11713 
Hs.101189 
Hs.174151 
Hs.23965 
Hs.126637

Hs.274256
Hs.163113
Hs.132748
Hs.110287
Hs.832 



in sarcoma (Schmidt-Ruppin A-2) 
emmenn (chimaerin) 1 
E74-like factor 5 (ets domain transcript



solute carrier family 22 (organic anion



Homo sapiens ribosomal protein L39 mRNA, 
ESTs 

ESTs, Highly similar to A41029 integrin
gb:Human mRNA for ATP synthase 6 chain,
gb:yu62d01.r1 Weizmann Olfactory Epitheli



GlOT-3 for gonadotropin ir 
ESTs 

PAX transcription activation domain inte
135060 AK001887
114419 AI248013 
126283 N40359 



106798 BE252749 

103760 AA642973 

118922 AW206193 

133196 AI434760 

133424 AA350994 

133765 M62194 

132347 BE271016 

125599 H13295 

114459 AW445217 

128478 AA708205 

127271 H96820 

111122 N63753 



Hs.259842 protein kinase, AMP-activated, gamma 2 n 



ESTs 

hypothetical protein PR01331 

hypothetical protein MGC2722 

CK2 interacting protein 1; HQ0024c prote 

TRABID protein 

hypothetical protein FLJ20345 

ubiquitin B 

hypothetical protein DKFZp761B2423 

KIAA1007 protein 

KIAA1700 

cadherin 11, type 2, OB-cadherin (osteob 
ESTs, Weakly similar to T21554 hypotheti 



Hs.173380
Hs.26320 
Hs.20558 
Hs.183842
Hs.91065 
Hs.279949 
Hs.20281 
Hs.75929 

Hs.106135 
Hs.103362 
Hs.100343 

Hs.16492



119244 AW407564 

127503 AI016798 

113626 T94318 

128115 AI435590 

117639 AA377165 

127033 AF169301 

112411 R43090 

114601 AA075566 

127573 AA594196 

125500 AW952654 

119416 T97186 

115467 AI366784 

128902 AA036637 

127684 AA668631 

126288 AW449560 

122059 AA431737 



Hs.130168 ESTs 



ESTs 

gb:yv99b03.r1 Soares melanocyte 2NbHU Ho 
DKFZP564G2022 protein 
ESTs, Weakly similar to 2109260A B cell 
melanoma antigen, family D, 1 
ribosomal protein S18 
hypothetical protein FLJ20772 
ESTs, Moderately similar to RL44_HUMAN 6 



Hs.44833 
Hs.9098 
Hs.271510 



ESTs 

sulfate transporter 1 

ESTs, Moderately similar to ALU1_HUMAN A 
gb:zm88f06.s1 Stratagene ovarian cancer 
ESTs, Weakly similar to S65657 alpha-1C- 



Hs.107052
Hs.32556 
Hs.89576 
Hs.98749 



128895 AW467000 Hs.10698 

105301 AW352357 Hs.7457 

125536 F08266 Hs.77948 

121387 AA405854 

134126 NMJH3747 Hs.13181 

126860 BE242814 Hs.32349 

102907 BE409861 Hs.20283 

127804 AA740634 Hs.29208 

130566 R85474 Hs.16073 

113782 AK001567 Hs.3110C 

124119 AAO40123 Hs.2489£ 

132490 NMJ01290 Hs.4980 

125494 AU077029 " 

100237 D30715 

127687 AW772383 



EST, Moderately similar to T42671 hypoth 

ESTs 

ESTs 

MAGE1 protein 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

gb:zn66g08.s1 Soares_testis_NHT Homo sap 

tankyrase, TRF1-interacting ankyrin-rela 

ESTs, Weakly similar to T27544 zinc resi 

heme oxygenase (decycling) 1 

ESTs 

ESTs 



125704 R55094 

126208 N22588 

131902 AA180145 

128660 AA011597 

118049 N53145 

134624 AF035119 

127432 AW067708 

126414 AI363157 

120861 AA350394 

124669 AI571594 

126096 F08208 

103891 NMJ07212 

128727 AI223335 

126831 AI929107 

125360 AW898892 

124276 H83465 

126524 Z45455 

126647 AK000283 

125957 H41694 

121782 AW452957 

124059 BE387335 

130945 U20582 

126348 T16243 

103558 BE616547 

126982 AA211419 

125613 AA765957 

129601 AB032964 

126007 H51097 



Hs.306333 
Hs.300635 
Hs.247936 
Hs.26239 
Hs.288548 
Hs.34348 
Hs.177398 

Hs.8700 
Hs.170311 
Hs.24756 

Hs.96952 ESTs 
Hs.102943
Hs.283844 
Hs.124186 
Hs.50651 
Hs.79933 
Hs.189741 

Hs.182447 



olfactory receptor, family 1 
Human DNA sequence fro 
Homo sapiens cDNA FLJ12368 fis, clone MA 
Homo sapiens mRNA; 



gb:yv55f09.s1 Soares fetal liver spleen 



hepatocyte growth factor-regulated tyros 



al protein MGC12916 
similar to rat tricarboxylate carrier-li 
ring finger protein 2 

Janus kinase 1 (a protein tyrosine kinas 
ESTs 

gb:ys91a11.s1 Soares retina N2b5HR Homo 



Hs.115726
Hs.143261 
Hs.112668



hypothetical protein FLJ20276 
gb:yo06b06.r1 Soares adult brain N2D3Hbo 
Homo sapiens, clone MGC:15203, mRNA, com 
ESTs, Weakly similar to S64054 hypotheti 
actin like protein 

Homo sapiens cDNA FLJ13992 fis, clone Y7 
keratin 17 

gb:zn55g05.s1 Stratagene muscle 937209 H 
KIAA0532 protein 
KIAA1138 protein 
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135231 BE613615 






128897 AW979134 


Hs.10700


hypothetical protein 


109891 H04757 






127704 AA679609 




gb:ag72c02.s1 Gessler Wilms tumor Homo s 


129340 H75334 






126502 T10077 






129619 AA209534 


Hs.284243 




127136 R36277 




Homo sapiens ubiquitin conjugating enzym 


110636 H72868 






128862 BE250742 






104689 AA420450 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


130829 BE252530 




glutathione S-transferase M3 (brain) 


125768 AI557486 


Hs.119122


ribosomal protein L13a 


123613 AA609158 


Hs.291166 




127506 T51039 




ribosomal protein L10a 


123546 AA608817 


Hs.112597




126516 R95872 


Hs.117572


chemokine binding protein 2 


103973 AA305729 




amino acid transporter system A1 


127426 AA854755 


Hs.124076




112339 R56570 






129101 NMJ313403 


Hs.108665 




109442 AW296134 


Hs.86999 


ESTs, Weakly similar to S65657 alpha-1C- 


118103 AA401733 






125752 AW136622 






102926 W28363 


Hs.239752 




133975 C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


134470 X54942 






127329 AW160551 


Hs.124021 


soggy-1 gene 


126659 T16245 






127297 AW629485 


Hs.140720 


GSK-3 binding protein FRAT2 


127640 AI557486 






103409 NMJ304454 






127964 F06298 






122365 AA813546 


Hs.99034 




128193 AJ224442 




putative methyltransferase 








125532 AI734146 


Hs.271800 


ESTs, Weakly similar to alternatively sp 


126541 AJ271671 




zinc/iron regulated transporter-like 


127309 AI669765 






129062 AA452970 


Hs.155218 


E1B-55kDa-associated protein 5 


126770 AI292320 


Hs.81361 




127775 AA128808 


Hs.179902




126994 AA455265 


Hs.86686 


ESTs, Moderately similar to I54374 gene 


130734 AW137091 




KIAA1052 protein 


114461 AA531187 






100842 U05597 




gb:Human anion exchanger 3 cardiac isofo 


127389 T65126 




carnitine O-octanoyltransferase 


125394 BE178502 




ESTs, Weakly similar to I78885 serine/th 


107736 AA016239 






125669 R51308 




ESTs, Weakly similar to ALU8_HUMAN ALU 


100370 D79989 




KIAA0167 gene product 


113479 AI023133 


Hs.10739




105165 BE280787 




hypothetical protein FLJ10233 


120602 AA808018 






112399 R60920 


Hs.296770 




123474 AA599209 




gb:ag34b11.s1 Jia bone marrow stroma Hom 


134212 AA654353 




EBP50-PDZ interactor of 64 kD 


104204 AK001691 


Hs.57655 


hypothetical protein FLJ10829 


127464 AW971875 


Hs.292071 




116715 AL117440 


Hs.170263




115041 AA252457 


Hs.86543 




132380 AW373665 












116356 AI371223 


Hs.288671 










128846 AA730767 


Hs.285753 


SCG10-like-protein 


123869 AA620924 






108889 AA135722 


Hs.61481 




126528 Z24895 




gb:HSB67F122 STRATAGENE Human skeletal 


127629 AA293279 


Hs.29173 


hypothetical protein FLJ20515 


130004 AA703684 


Hs.245474 


ESTs, Moderately similar to ALU5_HUMAN A 


130847 AI672483 


Hs.20220 


lipase protein 


111620 R14853 


Hs.307478 


EST, Weakly similar to I39058 hypotheti 


131971 BE567100 


Hs.154938


hypothetical protein MDS025 


121360 AA405635 


Hs.96854 


ESTs, Weakly similar to DYLX_HUMAN CYTO 


127705 AJ003322 




gb:AJ003322 Selected chromosome 21 cDNA 


124687 AA833902 


Hs.270745 


ESTs 


126698 AI221147 


Hs.145088


E T Weakl .mi ulo 15936 hypothe 


126730 AA442429 




gb v7 302.r1 5o in Jot tu Nb2H 


127916 AI239950 


Hs.294111 


ESTs, Moderately similar to B34C67 hypo 


128408 AI183407 


Hs.143704


EST 
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123783 AA610112 

109152 AW380723 

107242 AB020672 

132804 AI805943 

125387 AJ243669 

121578 AA398791 

132944 T96641 

126295 AI281459 

133335 BE251012 

129879 AK001696 

125175 W52355 

126919 AA577730 

" AA725863 

126495 AB029021 



Hs.73451 

Hs.175411 

Hs.326067 

Hs.8127 

Hs.178185 

Hs.6127 

Hs.270114 

Hs.263812 

Hs.13109 



gb:af19g05.s1 Soares_total_fetus_Nb2HF8_ 
ESTs, Weakly similar to S55024 nebulin, 
KIAA0865 protein 
hypothetical protein MGC5178 



Homo sapiens cDNA: FLJ23020 fis, clo 



M284112 
128232 AI830319 
126082 H81188 
120467 AW292562 
124041 AW590171 
AF098158 
123951 AB012922 
AF223944 
N65961 
133651 AB0 1740 

R12421 
125661 AA491830 

H18298 
127245 AA323958 
111223 AA852773 

R44789 
124846 R59977 
100397 D84424 
127180 T27097 
102598 BE250742 
134076 AF086215 



AL110282 
I R93325 
129703 BE388665 
133531 BE276738 
119726 AF086289 
125198 W69474 

AW291477 
112542 AI458867 
101368 M13058 
AA730136 
AA056483 



113701 T97301 

116180 AA463902 

127133 AA280740 

113316 T70318 

123316 AI290561 

122638 AL137476 

105053 AI884911 

103305 X82279 

110384 H45282 

115625 AW630870 

126905 AW504027 

130820 AL353934 

112394 AK000373 

129589 AW504292 

126446 NM_015670 



AL120051 
121046 AB033083 
128403 AI908006 



Hs.269571 ESTs 
Hs.187528
Hs.101413 
Hs.9329 
Hs.173043
Hs.325443 



hypothetical protein DKFZp434I1916 



Hs.75551 
Hs.301463 
Hs.53250 
Hs.122556



Hs.155361
Hs.123509
Hs.32989 

Hs.268798 

Hs.86574 

Hs.15301 

Hs.288798 

Hs.8358 

Hs.11517 

Hs.118926

Hs.84072 

Hs.102237

Hs.179925



Hs.144700
Hs.97377 
Hs.295362 
Hs.70604 
Hs.268912 



Hs.33191 
Hs.158196
Hs.57697 
Hs.22790 
Hs.106673 

Hs.283709 
Hs.117869
Hs.143243
Hs.191901 
Hs.7709 



Hs.53590 ESTs 



metastasis-associated 1-like 1 
breast cell glutaminase 
gb:za27d03.s1 Soares fetal liver spleen 
dihydropyrimidinase-like 2 
chromosome 21 open reading frame 59 
ESTs 

gb:yn48b09.r1 Soares adult brain N2b5HB5 
gb:EST26810 Cerebellum II Homo sapiens a 
KIAA1866 protein 

Homo sapiens, Similar to transmembrane r 
transcriptional adaptor 3 (ADA3, yeast 



g Horm sap iens full length insert cDNA 



Homo sapiens, clone IMAGE:3457003, mRNA 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
skin-specific protein 
ESTs 

testis expressed sequence 13A 
ESTs 

proline-rich protein HaeIII subfamily 2 

teratocarcinoma-derived growth factor 1 

Human Chromosome 16 BAC clone CIT987SK-A 

bone morphogenetic protein receptor, typ 

r 1 H , , form! , 

ESTs 

ESTs 



Homo sapiens mRNA; cDNA DKFZp434IOB23 (f 
receptor (calcitonin) activity modifying 
gb:H.sapiens Fas, Apo-1 gene (promoter a 
ESTs 

ESTs, Weakly similar to hypothetical pro 
Homo sapiens cDNA FLJ12595 fis, clone NT 
hypothetical protein FLJ21012 
hypothetical protein FLJ20366 
ESTs 

sentrin/SUMO-specific protease 3 



ephrin-B1 
KIAA1257 protein 

Homo sapiens cDNA FLJ14459 fis, clo 
ATPase, Class II, type 9A 



lar to ALU1_HUMAN ALU S 
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gb:yz39a12 s1 Morton Fetal Cochlea Homo 




















































127392 AI816736 




DHHC1 protein 




106879 AI190785 








128303 AI096444 
















125756 BE174587 


Hs.289721 


growth arrest specific transcript 5 




132332 AW978906 




hypothetical protein FLJ12960 












128416 F13165 




























































































Homo sapiens mRNA; cDNA DKFZp434I0812 (f 




























131330 D13969 
























106124 H93366 




Homo sapiens cDNA: FLJ21962 fis, clone H 




















125636 H12332 
















111570 AF059203 
































105930 AF016371 




peptidyl prolyl isomerase H (cyclophilin 








hypothetical protein FLJ14299 




















































































































































132430 AW973652 


Hs.283105 


ESTs 


1.6 


133541 H75334 


Hs.11050 


F-box only protein 9 




102612 U65402 


Hs.248124 


G protein-coupled receptor 31 


1.6 


120228 AI192528 


Hs.164537 


ESTs 


1.6 


122652 AA454641 




gb:zx99d05.s1 Soares_NhHMPu_S1 Homosapi 


1.6 


103456 AA496425 


Hs.9629 


papillary renal cell carcinoma (transloc 




105355 AL031447 


Hs.26938 


Homo sapiens, clone IMAGE:4053044, mRNA, 


1.6 


108043 AA042873 


Hs.160412 


ESTs 


1.6 


128695 NM_003478 


Hs.101299 




1.5 


127984 AA846377 


Hs.193706 


ESTs, Weakly similar to ALU8_HUMAN ALU S 




124405 AA228137 


Hs.25005 


hypothetical protein MGC3329 


1.5 


103934 BE278111 


Hs.134200


DKFZP564C186 protein 


1.5 


124195 H83034 




gb:yq48e07.r1 Soares fetal liver spleen 


1.6 
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102687 NM.007019 
121226 AA364109 
120415 AA235810 
123864 AA620882 
125045 AI114630 
133425 AA444390 
126578 AF151861 
102406 U43177 
114126 BE566962 
125233 W85713 
109635 F04295 
125675 BE294972 
129707 AW572317 
127569 AI765107 
113302 T66919 
119705 AI984203 
127226 AL036559 
123489 AA599708 
107468 AA740979 
115916 AI052731 
127815 AA743490 
100364 NM_004341 
125568 AW615396 
105260 N81201 
125659 T57693 
111275 N70970 
106542 AA339541 
133423 T84084 
124770 AA984414 
117936 AI382904 
134385 M14660 
108367 AW410478 
131143 NM_000312 
105441 N28522 
128215 AA973310 
127344 AI003929 
126478 BE541249 
122053 AI637498 
111760 BE651929 
112401 R61279 
103023 AW500470 
125575 H14983 
128765 AF073310 
108935 AA147848 
121221 AI140708 
120091 AW024672 
107375 BE011845 
125803 AW876115 
115132 AA811762 
113346 AF143876 
107357 U63973 
125443 BE251057 
133803 M24461 
113378 T80738 
105540 BE391690 
127446 F13008 
134075 NM_012201 
127585 AA604144 
125824 Z45258 
127606 AA621135 
125585 AW298113 
107757 BE621721 
109978 H09356 
132297 BE272446 
115784 AW402151 



Hs.38034 
Hs.93002 
Hs.177990 



100025 
127063 
127945 
111557 

119858 
106509 
124124 
126713 
126475 



AI276526 

AA815031 

R09510 

AW137635 

W01370 

AI042309 

AW294404 

AW249181 



gb:zs41a03.s1 Soares_NhHMPu_S1 Homo sap: 
gb:af95g01.s1 Soares_testis_NHT Homo sap 
Homo sapiens cDNA: FLJ21874 fis, clone H 
hydroxyacyl glutathione hydrolase 
androgen induced protein 
(NONE) 

Homo sapiens cDNA: FLJ20913 fis, clone A 
ESTs 

ESTs, Highly similar to MAON_HUMAN NADP- 
Homo sapiens cDNA FLJ13549 fis, clone PL 
Homo sapiens mRNA; cDNA DKFZp566L203 (fr 
hypothetical protein FLJ20550 



Hs.7063 
Hs.110092
Hs.169161 
Hs.56406 
Hs.12082 
Hs.274422 
Hs.268575 
Hs.57874 
Hs.3463 

Hs.91389 ESTs 
Hs.91910 ESTs 
Hs.255015 ESTs 

Hs.154868 carbamoyl-phosphate synthetase 2, aspart 
Hs.105613 ESTs 
Hs.31755 
Hs.87929 
Hs.35006 
Hs.24956 
Hs.196008
Hs.120429 
Hs.47213 
Hs.169274 
Hs.104019
Hs.2351 



Hs.80624 
Hs.109697 
Hs.98745 
Hs.268754 
Hs.237536 
Hs.117950

Hs.143648 
Hs.67991 
Hs.97461 
Hs.59558 

Hs.29852
Hs.71433 
Hs.14318 
Hs.103501 
Hs.177592 
Hs.76305 
Hs.14757 



Homo sapiens cDNA FLJ13707 fis, cl. 



ESTs 
ESTs 

ESTs, Highly similar to IFT2_HUMAN INTER 
transforming, acidic coiled-coil contain 
protein C (inactivator of coagulation fa 
quinolinate phosphoribosyltransferase (n 
gb:op91e06.s1 Soares_NFL_T_GBC_S1 Homo 
hypothetical protein MGC2560 



Homo sapiens cDNA FLJ11949 fis, clone HE 
ESTs, Weakly similar to AF151067_1 HSPC2 
multifunctional polypeptide similar to S 
gb:ym19h09.r1Sc ' ' ,! 



Homo sapiens clone IMAGE:113399 mRNA seq 
rhodopsin kinase 
ribosomal protein, large, P1 



Hs.136552 ESTs 



Hs.280792 
Hs.22528 
Hs.265317 
Hs.54673 
Hs.73818 
Hs.90073 
Hs.77274 



hypothetical protein MGC2562 
tumor necrosis factor (ligand) superfami 
ubiquinol-cytochrome c reductase hinge p 
" 1 (yeast homolog) 



gb:zt79e03.s1 Sc t NhTFon p 
Rho GDP dissociation inhibitor (GDI) alp 

■ ■ 1P11-110H4 



Hs.331564 Homo sapiens mRNA; cDNA DKFZp434H1215 (f 

Hs.123598 ESTs 

Hs.20373 EST 

Hs.44238 ESTs, Weakly sin* tc c 5i 7 a i it 

Hs.46824 ESTs 

Hs.64552 hypothetical protein MGC15563 

Hs.144515 Homo sapiens cDNA FLJ11672 fis, clone hit 

Hs.19954 ESTs, Weakly similar to T19873 hypotheti 

Hs.238797 ESTs, Moderately similar to I38022 hypot 



WO 03/042661 PCT/US02/36810 



126851 R40611 

104820 AW162768 

127235 AI817309 

126552 AF168711 

127523 AA617637 

131692 BE559681 

112974 AL353955 

118921 N91914 

100676 X02761 

127721 T59578 

115254 AA279024 

128173 AI457242 



134191 W26632 

107394 AA864798 

131562 NM_003512 

127310 AW450571 

122359 AA523486 

100524 M80902 

128422 T77794 

129902 AA076278 

126784 T81887 

123343 AI761902 

105458 AW954377 

112266 AI652534 

127622 AA628222 

113659 R06545 

116892 AI573283 
126995 NM_014351 
111657 R07364 

100243 AB028125 
115153 AF107203 
108892 AK000002 
113294 AI037922 
126691 W03046 
106979 AW015227 



112558 AK001621 

122046 AI560456 

122472 AA448509 

130753 AA205223 

131714 AA642831 

101233 AL135173 

105501 AF047437 

126984 AA213820 

125765 BE243877 

127693 AA676727 

128453 X02761 

119416 T97590 



109710 D20044 

105704 AI282341 

112712 R91060 

100098 AF003743 

114122 R46128 

132397 AA021160 

107881 AI568350 

106302 AA398859 

125898 AK001823 

104957 AI359009 

102909 NMJM5269 

125559 BE297488 

109634 H17063 

116607 W05238 

127175 R11937 

110617 W93231 

125988 W27648 

115093 AI241932 

121207 AA705799 

112652 BE269699 

125213 AB014554 

125912 AW867467 

133046 R96881 

122791 AL122055 



Hs.137565
Hs.22620 
Hs.225583 
Hs.159397 

Hs.30736 

Hs.101174 

Hs.54751 

Hs.287820 

Hs.188440 

Hs.269316 

Hs.127024

Hs.115910

Hs.106551 

Hs.291956 

Hs.7979 

Hs.186180 

Hs.28777 



ESTs, Weakly similar to 2004399A chromos 
x010 protein 

gb:np34h12.s1 NCI_CGAP_Lu1 Homo sapiens 
KIAA0124 protein 
microtubule-associated protein tau 



ESTs, Weakly similar to ALU8_HUMAN ALU S 
KIAA0736 gene product 
Homo sapiens cDNA FLJ23038 fis, clone L 
H2A histone family, member L 



Hs.183704 
Hs.13277 



gb:ni67f11.s1 NCI_CGAP_Pr12 He 
ubiquitin C 

gb:yd20d09.r1 Soares fetal liver sp 
hypothetical protein FLJ22054 
HSPC163 protein 



ESTs, Weakly similar to ALU1_HUMAN AL 



Hs.189810 
Hs.268667 
Hs.77854 
Hs.57937 
Hs.55879 



ESTs, Weakly similar to ALU1_HUMAN ALU 



ataxin 2-binding protein 1 
Homo sapiens mRNA; cDNA DKFZp434L0827 (f 
leptin receptor ovi 




Hs.330761 ESTs 



ESTs, Weakly similar to S11998 finger pr 
ATPase, Na+/K+ transporting, beta 3 poly 
gb:zj68b11.s1 Soares_fetal_liver_spleen_ 
fibronectin 1 

ESTs, Weakly similar to ALU1_HUMAN ALU 
guanine nucleotide binding protein (G pr 
ESTs, Moderately similar to JC6169 mid 
gb:zw65f11.s1 Soares_total_fetus_Nb2HF8_ 
calcium/calmodulin-dependent protein kin 
hypothetical protein FLJ20721 
fibrinogen, gamma polypeptide 



Hs.279877 
Hs.183646 
Hs.94315 



Hs 35 
Hs.1 8: 
Hs.235782 
Hs.109299 
Hs.278712 
Hs.63609 
Hs.129836 



gb:Homo sapiens delayed rectifier potass 
ESTs 

hypothetical protein DKFZp564K0822 

hypothetical protein MGC2650 

hypothetical protein FLJ23221 

Homo sapiens mRNA; cDNA DKFZp564C2478 (f 

mitochondrial ribosomal protein L17 

glioma-associated oncogene homolog (zinc 

cell division protein FtsJ 

ESTs 

ESTs, Weakly similar to T31613 hypotheti 
gb:yf54b08.r1 Soares infant brain 1NIB H 
Homo sapiens, clone IMAGE:3948563, mRNA, 
gb:37e1 0 Human retina cDNA randomly prim 
hypothetical protein FLJ11273 
ESTs 

solute carrier family 21 (organic anion 
protein tyrosine phosphatase, receptor t 
eukaryotic translation initiation factor 
HpaII tiny fragments locus 9C 
KIAA1028 protein 
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Pkiy CAT number Accession " 

108451 13766_27 AA079195 AA084955 AA126308 AA084956 

124195 2606_3 H83034 H52379 

123619 371681_1 AA602964 AA609200 

125165 1852047.1 W45350 W45406 

125324 1692163_1 R07786 T85948 T86972 

126053 1601238_1 H64450 H64464 

126086 1606216_1 H75681 H70975 

126098 1629789_1 M79088 N88221 

125464 168460_1 N71807 AA203399 

125499 1562851_1 H10543 R11878 

126127 1205826_1 N95428 W24040 AW751366 H81987 

125546 356478_1 H09950 R18413 AA570553 AW973425 

125549 1702179_1 R20215 R18767 

125558 1703083_1 R59305 R19748 

125575 1566885.1 H14983 R21554 

125743 5025_5 H17151 H11956 

125761 1744008_1 R68351 R68364 

126426 110637_1 AA125934 AA127189 AA065075 AA070377 AA100017 

AA079891 AA113255 AA075168 AA082764 AA083380 N84829 AA084752 AA076512 AA085119 AA085208 AA085045 

127155 200358_1 AA284993 AA478122 AA477923 

127175 1695805.1 R11937 Z45532 

126528 1276201_1 Z24895 AW891336 R01294 

125957 1583542_1 H41694 H45213 

125976 296453_1 AA436760 AW237453 BE327496 N47347 N56967 

125982 1766315_1 R98091 W92898 

125988 1365728_1 W27648 R99193 BE090398 

127245 226662_1 AA323958 AA370268 

127248 227560_1 AA364195 AA325029 AW962C50 

127262 231725_1 AA828125 AA834883 AA330555 

126659 1541209_1 T16245 R19694 F13545 H10299 T66048 T65279 H18006 

126593 87363_1 C05723 AA018342 

127315 37938.1 AF116622 AI114507 AA640834 AA377999 

126730 297653_1 AA442429 T19477 

103898 187213_-3 AA248884 

127446 16001.2 F13008 T75435 

126826 127356.1 AA099764 AA112950 

126872 142696.1 AW450979 AA136653 AA136656 AW419381 AA934358 AA492073 BE168945 AA809054 AW23G038 BE011212 BE011359 BE011367 

BE011368 BE011362 BE011215 BE011365 BE011363 

128132 177108.1 AA225632 AI820970 AI820952 AA226472 AI732140 AI732059 AA225307 

AA225500 

127523 351071.1 AA617637 AA554963 

126982 171753.1 AA211419 AA211566 

128215 530345.-1 AA973310 

127704 405690.1 AA679609 AA694592 

127705 966283.2 AJ003322 AJ003324 
128422 1811283.1 T77794 T85581 
127897 446527.1 AA773681 AA773857 
120734 208882.1 AA299948 AA299949 
100098 25117 -13 AF003743 

114620 32062.8 AA642974 AA084223 

122652 26401.-30 AA454641 

100842 tigr_HT4398 U05597 

123783 genbank_AA610112 

125032 genbankJ74884 

123808 genbank_AA620552 

123864 genbank_AA620882 

118049 genbank_N53145 

102406 entrez_U43177 

116962 genbank_H79677 

134076 40321.1 AF086215 W02702 AA284288 W25655 

125888 266863.1 H18298 H46830 

127271 321389.1 H96820 H79463 

113119 genbank_T47910 T47910 

104799 genbank_AA029703 AA029703 

127693 790317 1 AA676727 AA704704 

120415 genbank_AA235810 AA235810 

127964 135151.1 F06298 R18057 

122359 681003.1 AA523486 AW026780 AI821660 AA443898 

122420 genbank_AA446971 

124276 genbank_H83465 

101447 entrez_M21305 

124540 genbank_N63232 

124554 genbank_N65961 

117357 genbank_N24829 

103305 entrez_X82279 
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103392 entrez_X94563 

119416 genbank_T97186 

105225 genbank_AA211777 

121292 genbank_AA401807 

112853 genbank_T02843 

121387 genbank_AA405854 

114601 genbank_AA075566 

100221 entrez_D28383 

130339 genbank_AA435746 

100554 tigr_HT2241 

123423 genbank_AA598484 

123474 genbank_AA599209 

123489 genbank_AA599708 



TABLE 27A: ABOUT 895 GENES UP-REGULATED IN COMBINED LUNG FIBROSIS COMPARED TO NORMAL BODY 

Table 27A lists about 895 genes that are upregulated in lung fibrosis samples (a collection of IPF, HP, and NSIP) as compared with the normal "body map" samples. These were 
selected from about 59680 probesets on an Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" fibrosis sample expression level to "average" normal adult tissue 
expression level was greater than or equal to about 2.0. The "average" fibrosis sample expression level was set to the 90th percentile amongst fibrosis samples. The "average" normal 
adult tissue level was set to the 95th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 15th percentile 
value amongst non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 
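
The selection rule described above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the procedure actually used to build the table: the function names, the linear-interpolation percentile rule, and the handling of a non-positive denominator are all hypothetical choices, since the text does not specify them.

```python
from typing import Sequence


def percentile(values: Sequence[float], q: float) -> float:
    """Return the q-th percentile (0 <= q <= 100) of values.

    Linear interpolation between order statistics is an assumption;
    the source text does not say which percentile rule was used.
    """
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def background_corrected_ratio(
    fibrosis: Sequence[float],
    normal: Sequence[float],
    fibrosis_pct: float = 90.0,    # "average" fibrosis level
    normal_pct: float = 95.0,      # "average" normal adult tissue level
    background_pct: float = 15.0,  # gene-specific background estimate
) -> float:
    """Ratio of fibrosis to normal expression for one probeset, after
    subtracting the background from numerator and denominator."""
    background = percentile(normal, background_pct)
    numerator = percentile(fibrosis, fibrosis_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        # Assumption: a flat normal profile gives an unbounded ratio
        # when the fibrosis signal exceeds the background, else zero.
        return float("inf") if numerator > 0 else 0.0
    return numerator / denominator


def is_upregulated(fibrosis: Sequence[float], normal: Sequence[float],
                   threshold: float = 2.0) -> bool:
    """Apply the 'greater than or equal to about 2.0' cutoff."""
    return background_corrected_ratio(fibrosis, normal) >= threshold
```

A probeset would be kept for the table when is_upregulated returns True for its fibrosis and non-malignant expression values; the returned ratio corresponds to the R1 column.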

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of fibrosis to normal body tissue 



















Homo sapiens cDNA: FLJ23494 fis, clone L 




424917 


AI636208 


Hs.96901 


Homo sapiens cDNA: FLJ23049 fis, clone L 


26.5 


453310 


X70697 


Hs.553 


solute carrier family 6 (neurotransmitte 


25.5 






Hs.197764














































445885 


AI734009 


Hs.127699


KIAA1603 protein 


18.0 


440452 


AI92513G 


Hs.55150 


Z T W H/simlartoCAi'P HUMAN CALCY 


17.8 


422426 


W79117 


Hs.58559 


ESTs, Weakly similar to rhotekin [M.musc 




444929 


AI685841 


Hs.161354 


ESTs 


16.5 


440807 


AW269421 


Hs.128093 


ESTs 


16.3 


408826 


AF216077 


Hs.48376 


Homo sapiens clone HB-2 mRNA sequence 


14.2 


446967 


AI699629 


Hs.156781


ESTs 


13.3 


417801 


AA417383 


Hs.82582 


integrin, beta-like 1 (with EGF-like rep 


12.3 


437119 


AI379921 


Hs.177043 


ESTs 


12.3 


451103 


R52804 


Hs.25956 


DKFZP564D206 protein 


11.5 


443450 


N66045 


Hs.133529 


ESTs 




411880 


AW872477 




gb:hm30f03.x1 NCI CGAP Thy4 Homo sapiens 


11.3 


432519 


AI221311 


Hs.130704 


ESTs 


11.3 


414142 


AW368397 


Hs.150042 


ESTs 


11.0 


433283 


BE041135 


Hs.175622 


ESTs 


10.1 


441082 


AW444804 


Hs.202655 


ESTs 


10.1 


452039 


AI922988 


Hs.172510 


ESTs 


10.0 


417204 


N81037 


Hs.1074


surfactant, pulmonary-associated protein 


9.9 




AA300900 


Hs.98849 


ESTs, Moderately similar to AF161511_1 H 


9.8 


412372 


R65998 


Hs.118615


ESTs 


9.8 


426274 


D38122 


Hs.2007 


tumor necrosis factor (ligand) superfami 


9.7 


431007 


AF039564 


Hs.248211 


retinoblastoma-binding protein 9 


9.4 


443709 




Hs.134662


ESTs 


9.3 


446232 


AI281848 


Hs.165547


ESTs 


9.2 


448253 


H25899 


Hs.201591 


ESTs 


9.2 


432133 


AB033088 


Hs.272567 


KIAA1262 protein 


9.1 


409238 


AL049990 


Hs.51515 


Homo sapiens mRNA; cDNA DKFZp564G112 (fr 


9.0 


431353 


AA828032 


Hs.189076 


ESTs 


8.8 


450050 


AI681268 


Hs.257883 


ESTs 


8.8 


458194 


AW383618 


Hs.265459 


ESTs, Moderately similar to ALU2_HUMAN A 


8.8 


414968 


C16096 


Hs.297777 


ESTs 


8.7 


425664 


AJ006276 


Hs.159003


transient receptor potential channel 5 


8.7 


408562 


AI436323 




Homo sapiens mRNA for KIAA1568 protein. 


8.6 


453672 


U73531 


Hs.34526


G protein-coupled receptor 


8.5 


429420 


AK001679 


Hs.202289 


hypothetical protein FLJ10376 


8.5 


421478 


AI683243 


Hs.97258 


ESTs 


8.4 
8.4 


404916 
444396 


T65213 


Hs.4257 


ESTs 


8.3 


442275 


AW449467 


Hs.54795 


ESTs 


8.3 


437479 


R61866 


H; 1012/7 


ESTs 


8.2 


432203 


AA305746 


Hs.49 


macrophage scavenger receptor 1 


8.2 


431433 


X65018 


Hs.253495 


surfactant, pulmonary-associated protein 


7.9 


406747 


AI925153 


Hs.217493 


annexin A2 


7.8 
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445537 AJ245671 

450025 AK001875 

421798 N74880 

421155 H87879 

446917 AI347863 

422798 R92347 

426830 AA385751 

437157 BE048860 

433231 AB040926 

451561 N52812 



427383 NM_005411 

409718 D86640 

443324 R44013 

431924 AK000850 

427356 AW023482 

418735 N48769 

429945 NM_006729 

407510 U96191 

430099 AW194988 

441835 AB036432 



433384 AI021992 
427043 AA397679 
425921 NM_007231 



40 
45 



Hs.12844 
Hs.24321 
Hs.264330 
Hs.102267
Hs.156672
Hs.34574 
Hs.160392 
Hs.120655 
Hs.143552 
Hs.177403
Hs.162080 
Hs.3731 
Hs.32295 
Hs.179747 
Hs.177582 
Hs.56045
Hs.164225
HS.2722C3 



EGF-like-domain, multiple 6 
Homo sapiens cDNA FLJ12028 fis, clone HE 
N-acylsphingosine amidohydrolase (acid c 



Homo sapiens cDNA FLJ12604 fis, clone NT 



Homo sapiens cDNA FLJ20843 fis, clone AD 



Hs.226483 
Hs.20537 



diaphanous (Drosophila, homolog) 2 
gb:Human trophoblast hypoxia-regulated f 
Homo sapiens cDNA FLJ13942 fis, clone Y7 
advanced glycosylation end product-speci 



445186 AW614544 
431337 N48107 
434819 AA650099 
458219 H2219G 
434377 AW137148 
435933 AA805520 
436954 AA740151 
445424 AB028945 
449108 AI140683 
410334 AW979261 
447112 H17800 
447700 AI420183 
449208 AW263635 
445657 AW612141 
421554 AW137676 
435299 AI745458 
415769 AI339257 
433527 AW235613 
452771 T05477 
427585 D31152 
411514 AW850178 
424034 AI940675 
444527 NM_005408 
429710 AI337113 
432113 AA935065 
447997 H00656 
449328 AI962493 
416575 W02414 
432009 AL137424 
434088 AF116677 
444342 NM_014398 
414299 AA142989 
431041 AA490967 
448104 AI674818 
445279 R41900 
408978 AL133617 
415094 D59513 
428244 AI564123 
452784 BE463857 
455431 AW938484 
449416 AI651016 
421659 NM_014459 
407638 AJ404672 
446164 AW273539 
413048 M93221 
446608 N75217 
419807 R77402 
447164 AF026941 
442652 AI005163 



Hs.22588 
Hs.135568 
Hs.124244
Hs.298460 
Hs.162211 

Hs.293797 
Hs.128355
Hs.123641
Hs.292593 
Hs.291541 
Hs.31874 
Hs.136348
Hs.192075
Hs.130425



ast specific factor 2 (fasciclin 



Hs.12696 cortactin SH3 domain-binding protein 



Hs.98328 

Hs.291993 

Hs.7154 

Hs.171077 

Hs.48643 

Hs.279575 

Hs.97775 

Hs.122614

Hs.115436 

Hs.133020

Hs.179729
Hs.18995
Hs.20914 



Hs.152385
Hs.29792 
Hs.197647
Hs.38383 

Hs.249270 
Hs.10887
Hs.71730 
Hs.105276
Hs.178391 
Hs.25245
Hs.49421 

Hs.42500 
Hs.151258 
Hs.80738 
Hs.246311 
Hs.106511
Hs.288693 
Hs.199329
Hs.75182 
Hs.257846 



ESTs, Weakly similar to similar to se 



ESTs 

gb:EST03366 Fetal brain, Stratagene (cat 
collagen, type X, alpha 1 (Schmid metaph 
KIAA1304 protein 

Homo sapiens cDNA: FLJ23058 fis, clone L 

II inducible cylol \ r 

Homo sapiens cDNA: FLJ23594 fis, clone L 
ESTs 
ESTs 



ribosomal protein L44 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434M0723 (f 
gb:HUM042H10B Clontech human fetal brain 
ADP-ribosylation factor-like 5 
Homo sapiens cDNA: FLJ21062 fis, clone C 
sialophorin (gpL115, leukosialin, CD43) 



Homo sapiens cDNA FLJ11667 fis, clone HE 
Homo sapiens cDNA: FLJ23577 fis, clone L 
mannose receptor, C type 1 

gb:yi75f11.s1 Soares placenta Nb2HP Homo 
Homo sapiens cig5 mRNA, partial sequence 
ESTs, Weakly similar to KIAA0944 protein 
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429859 NM 007050 

432824 AK001783 

425509 AF079363 

424717 H03754 

436061 AI248584 

444218 AF070641 

453382 AA709285 

447033 AI357412 

417235 AA810278 

418200 AW629751 

427652 AI673025 

431255 AA497043 

441143 AI027604 

452293 AI871833 

443903 AI220547 

422352 AA766296 

424105 AI142336 

439759 AL359055 

428227 AA321649 

430510 AW162916 

425804 BE501698 

435347 AW014873 

446002 AI346468 
452883 X80031 

442176 AA983764 
443253 AIM 121 2 
419556 U29615 
439920 H05430 
421502 AF111856 
434424 AI811202 
408625 AW243323 



433815 AI696602 

416879 H98899 

432182 AW607789 

445386 AI422005 

450478 AW451709 

453080 AI423055 

435496 AW840171 



Hs.225952 
Hs.279012 
Hs.158213 
Hs.152213 
Hs.190745 
Hs.10684 



njlc MMTV r° n 

Homo sapiens cDNA: FLJ21326 fis, clone C 
Homo sapiens clone 24421 mRNA sequence 
Homo sapiens cDNA FLJ13078 fis, clone NT 



Hs.43874 ESTs 
Hs.115685 ESTs 
Hs.159650 ESTs 



ESTs, Weakly similar to 



Hs.135223

Hs.99200 

Hs.43977 

Hs.67709 

Hs.2248 

Hs.241576 

Hs.258189 



gb:wm51h09.x1 NCI_CGAP_Ut2 Homo sap 



sapiens mRNA full length insert cDN 



hypothetical protein PRO2577 



Hs.150318
Hs.128910 
Hs.132117 
Hs.91093 
Hs.144455
Hs.105039
Hs.125365 
Hs.266785 



gb:EST12592 Uterus tumor I Ho 



Hs.293119 
Hs.16038C 
HS.27120C 
Hs.23921 
Hs.265398 
Hs.11614



ESTs, Weakly similar to ALU7_HUMAN ALU S 



Homo sapiens cDNA FLJ12482 fis, clone NT 
ESTs, Weakly similar to transformation-r 
Homo sapiens cDNA: FLJ23555 fis, clone L 



435202 AI971313 

440320 AA879294 

438796 W67821 
400269 

447724 AW298375 

446509 AF169693 

451620 AW449888 

451963 AI825440 

456408 AI288348 

425895 AI269484 

447048 AW393080 

454024 AA993527 

415929 AA724373 

426625 T78300 

434334 AA912476 

437138 AI935622 

455024 AW851309 

436246 AW450963 

416030 H15261 

459267 AJ003631 

445122 AW241632 

414812 X72755 

421160 AL080215 

425734 AF056209 

429208 AA447990 

442957 AI949952 

444050 AW138295 

444078 BE246919 

451024 AA442176 

442832 AW206560 

423377 AL049377 

451895 T93573 

442353 BE379594 

421464 AA291553 
404043 

407055 X89211 

410008 AA079552 

410247 AF181721 

417461 R38403 



Hs.293697 ESTs 

Hs.170204 KIAA0551 protein 

gb:nw86e09.s1 NCI_CGAP_Pr12 Homo sapiens 



Hs.224952
Hs.23450 
Hs.161427



mRNA for FLJ00023 protein 



in 215 



Hs.271245 ESTs 



Homo sapiens cDNA: FLJ23537 fis, clone L 
hypothetical protein FLJ23403 
ESTs, Highly similar to i 
serologically defined col 
Homo sapiens cDNA FU13221 fis, ck 



gb:IL3-CT0220-17G200-067-C1 1 CT0220 Homo 
ESTs 
ESTs 



ls.147377 
Hs.77367 
Hs.102301



gb:AJ003631 Selected ch 
Homo sapiens cDNA: FLJ23593 fis, clone L 
monokine induced by gamma interferon 
Homo sapiens mRNA; cDNA DKFZp586J0323 (f 
peptidylglycine alpha-amidating monooxyg
ESTs 



Hs.10290 U5 snRNP-specific 40 kDa protein (hPrp8-
gb:zw63b08.r1 Soares_total_fetus_Nb2HF8_

Hs.253569 ESTs 

gb:Homo sapiens mRNA; cDNA DKFZp586H0718 

Hs.16970 ESTs

Hs.49136 ESTs 

Hs.190086 ESTs 



Hs.61345 RU2S 
446921 AB012113

444271 AW452569 

434217 AW014795 

452571 W31518 

423575 C18863 

408771 AW732573 



420313 AB023230 
423448 AK000776 
433492 AW605849 
434636 AA083764 
435747 AI079519 
458158 AW296778 
419261 X07876 
410060 NM_001448
426116 AA868729 
409203 AA780473 
414259 W44633 
406671 AA129547 
431889 AA521277 
430414 AW355665 
433426 H69125 
421754 AI681535 
410785 AW803341 
455235 AW875951 
408399 NM_005426 
429784 M89796 
436982 AB018305 
432231 AA339977 
432837 AA310693 
452166 AI948607 
458154 AW816379 
420362 U79734 
424202 BE350295 
410658 AW105231 
415467 AW081710 
419503 AA243642 
439479 AI734258 
448404 BE089973 
424268 AA397653 
420637 AW976153 
450716 AI266484 
428927 AA441837 
422544 AB018259 
431207 AA495925 
424508 AL080103
441484 AA935481
425916 NM_006786 
401793 

431169 AW971240 
438038 AI732629
439619 AW975998 
446577 AB040933 
450445 AW974636 
459482 AA625339 
445495 BE622641 
428743 AL080060 
426320 W47595 
432869 AW974094 
419235 AW470411 
429703 T93154 
413499 BE144884
406182 

417307 N99673 
430140 AW296771 
436111 AI803082 
449729 R72032 
457620 AA602711 
428434 AW36359C 
406554 

451381 BE241831 
443113 AI040686 
421470 R27496 
446428 AW082270 



Hs.156984 ESTs 

Hs.179852 Homo sapiens cDNA FLJ12832 fis, clone NT

Hs.18724 Homo sapiens mRNA; cDNA DKFZp564F093 (fr

Hs.209502 ESTs, Weakly similar to ubiquitous TPR m

Hs.16530 small inducible cytokine subfamily A (Cy 

Hs.149804 ESTs 

Hs.23349 ESTs 

Hs.34665 ESTs 

Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 

Hs.47584 potassium voltage-gated channel, delayed

gb:EST382704 MAGE resequences, MAGK Homo

Hs.160323 ESTs

Hs.124316 ESTs 

Hs.153716 Homo sapiens mRNA for Hmob33 protein, 3' 

Hs.72307 ESTs 

Hs.96427 KIAA1013 protein 

Hs.128753 Homo sapiens cDNA FLJ20769 fis, clone CO

gb:MR0-HT0241-200100-006-g02 HT0241 Homo 

Hs.241334 ESTs 

Hs.134398 ESTs

Hs.300357 ESTs, Highly similar to dJ416F21.2 [H.sa

Hs.89791 wingless-type MMTV integration site fami 

Hs.58367 glypican 4 

Hs.144694 ESTs

Hs.687 cytochrome P450, subfamily IVB, polypept 

Hs.25044 Homo sapiens cDNA: FLJ23131 fis, clone L

Hs.285754 met proto-oncogene (hepatocyte growth fa

Hs.124946 ESTs

Hs.120388 ESTs

Hs.133525 ESTs

Hs.99342 ESTs, Weakly similar to KCC1_HUMAN CALCI 
gb:IL2-UM0079-090330-050-D03 UM0079 Homo 
gb:CM1-PT0013-131299-067-f09 PT0013 Homo

Hs.44585 tumor protein p53-binding protein, 2 
Hs.30 membrane-spanning 4-domains, subfamily A
Hs.5378 spondin 1, (f-spondin) extracellular mat
Hs.274127 CLST 11240 protein 
— HSPC072 protein 



Hs.264680 ESTs 



gb:QV4-ST0234-181 199-035-g01 ST0234 Homo 



Hs.192035 ESTs 



ESTs, Weakly similar to RAN binding prot 



Hs.7369 

Hs.137422 

Hs.245367 



ESTs, Weakly similar to ALU1_HUMAN ALU S
ESTs

ESTs, Weakly similar to ALU1_HUMAN ALU S
gb:RC6-BT0709-310300-021-G07 BT0709 Homo
Human DNA sequence from clone 495010 on
gb:EST388262 MAGE resequences, MAGN Homo
ESTs, Weakly similar to KIAA1324 protein
ESTs 



Hs.9394 
Hs.149770 
Hs.58972 
Hs.162200 



KIAA0716 gene product 
ESTs 

Homo sapiens cDNA FLJ13658 fis, clone PL 



Hs.15420 KIAA1500 protein

Hs.194563 ESTs 

Hs.237052 EST, Weakly similar to ALU1_HUMAN ALU SU

Hs.38489 ESTs 

Hs.301549 Homo sapiens mRNA; cDNA DKFZp564H172 (fr 

Hs.169300 transforming growth factor, beta 2

gb:EST386197 MAGE resequences, MAGM Homo

Hs.288433 neurotrimin 

Hs.28705 ESTs 

gb:CM0-HT0182-041099-065-e11 HT0182 Homo

Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ

Hs.221999 ESTs 

Hs.157212 ESTs 

Hs.29235 ESTs 



Hs.132908
Hs.1378 
Hs.210617 



ESTs, Weakly similar to ALU4_HUMAN ALU S 
413136 BE066941

429228 AI553633 HS.1U49B5 ESTs 

420252 AW270404 Hs.193161 ESTs 

423629 AW021173 Hs.18612 Homoss 

444339 T96555 Hs.31562 ESTs 

434164 AW207019 Hs.148135 ESTs 



ns cDNA: FLJ2190S (is, clone H 



426920 AA393351 

453736 AL118674 

408923 H73881 

430919 AA489041 

431622 AW979271 

433584 AW295399 

437073 AI885608 

438394 BE379623 

446242 N66336 

452542 AW812256 

454009 AW015927 

449765 N92293 

415652 T79213 

453931 AL121278 

439382 BE247684 

420077 AW512260 

430437 AI768801 

446745 AW118189 

408308 AL033377 

450320 AW291775 

429597 NM_003816

449523 NM_000579

451110 AI955040 

431745 AW972448 

410781 AI375672 

419546 AA244199 
444330 AI597655 

408761 AA057264
409026 AL137554 
432055 AW972359 
432441 AW292425 
408045 AW138959 
427191 BE221825 
416965 N26223 
441594 AL041080 
406992 S82472 
431941 AK000106 



407942 AA378608 
417991 AA731452 
422589 AA312735 



Hs.132121 ESTs 

Hs.34871 KIAA0569 gene product 

Hs.255436 ESTs 

Hs.295448 ESTs 

Hs.293184 ESTs 

gb:UI-H-BI2-ahv-h-03-0-UI.s1 NCI_CGAP_Su

Hs.94122 ESTs 

Hs.27693 CGI-124 protein 

Hs.7360 ESTs 

gb:RC0-ST0174-191099-031-a07 ST0174 Homo 

Hs.233071 ESTs 

Hs.206832 EST, Moderately similar to ALU8_HUMAN AL
ls.272073 



Hs.25 



ESTs 



449494 AW237014 

444188 AI393165 

400297 AI127076 

410811 AW805687 

450584 AA040403 

428043 T92248 

436120 AI248193 

442324 R63578 

448693 AW004854 

425555 AA359291 

431385 BE178536 

408427 AW194270 

459587 AA031956 

438128 AA904430 

408938 AA059013 



428890 AA393167 

427374 AI150033 

434208 T92641 

446465 H38026 

451229 AW967707 

415511 AI732617 

408776 AA057365 

421110 AJ250717 

453636 R67837 

436578 AI091435 

426083 AW962712 

419231 AL046294 



s.103070 
Hs.87767 ESTs 

Hs.169943 Homo sapiens cDNA FLJ13569 fis, clone PL
Hs.156400 ESTs 

Hs.44197 hypothetical protein DKFZp564D0462 
Hs.213793 ESTs 

Hs.2442 a disintegrin and metalloproteinase doma
Hs.54443 chemokine (C-C motif) receptor 5
Hs.301584 ESTs 
Hs.163425 ESTs 
Hs.165028 ESTs 

gb:nc06c05.s1 NCI_CGAP_PM Homo sapiens 
Hs.49265 ESTs 
Hs.238936 ESTs 

Hs.49927 Homo sapiens mRNA; cDNA DKFZp434H1720 (f 

Hs.293334 ESTs 

Hs.163484 ESTs 

Hs.245123 ESTs 

Hs.97691 ESTs 

Hs.160436 ESTs

Hs.208765 ESTs 

gb:beta -pol=DNA polymerase beta {exon a 
Hs.272227 Homo sapiens cDNA FLJ20099 fis, clone CO
Hs.123369 ESTs 
Hs.294140 ESTs 
Hs.169391 ESTs 

gb:wd31f02.x1 Soares_NFl_T_GBC_S1 Homo s 
Hs.5894 hypothetical protein FLJ10305 
Hs.190008 ESTs 
Hs.179725 ESTs 
Hs.244627 ESTs 

1 smRNA;cDNADKFZp586N2424(f 



Hs.300648 ESTs 

Hs.60371 ESTs 

Hs.2240 uteroglobin 

Hs.119860 ESTs

Hs.28426 ESTs 

Hs.228320 Homo sapiens cDNA: FLJ23537 fis, clone L 

Hs.130767 Homo sapiens cDNA: FLJ23553 fis, clone L 

Hs.11090 high affinity immunoglobulin epsilon rec

Hs.177236 ESTs

gb:zk15e04.s1 Soares_pregnant_uterus_NbH

Hs.122049 ESTs, Weakly similar to U4/U6 small nucl

Hs.22607 ESTs

Hs.134682 Homo sapiens cDNA: FLJ23161 fis, clone L
Hs.200442 ESTs 
Hs.41294 ESTs 
Hs.143686 ESTs 
Hs.127648 hypothetical protein F r 



ls.48473 ESTs 
ESTs 



;tin) 



Hs.63356 

Hs.1355 cathepsin b 

Hs.169872 ESTs 

Hs.134859 ESTs 

l is 126712 ESTs, Weakly similar to AF1 91 020 1 L21G5 

Hs.136245 ESTs, Weakly similar to dJ202!21.4 [H.sa 
AI936450






























AA664273 


Hs.186104
















































AA640773 




















AA749382 


Hs.107233








AI219490












Hs.163582




















Hs.23196 










Hs.127842 








AW772449 










AA001713 




gb:zh86e08.s1 Soares_fetal_liver_spleen_








Hs.16986 


hypothetical protein FLJ11046 
















AA595800 










AL137526 


Hs.147472 


dynein intermediate chain 2 






AI620270 


Hs.129837 








AI351010 


Hs.102267


lysyl oxidase






AI886558 










BE007967 


Hs.155795 








AW081681 




















AW901804 








434812 AA649860 Hs.189496 ESTs 23

432583 AW023624 Hs.162282 ESTs 2.3
























AI949638 


































ESTs, Weakly similar to ALU1_HUMAN ALU S


























AF150179 










AF007835 










BE17S776 




gb:RC3-HT0586-110300-011-g09 HT0586 Homo






AL138043


Hs.293549 








AA523696 


Hs.222695 








AW102613 


Hs.152913


















AI863140 














Homo sapiens cDNA: FLJ21592 fis, clone C






AA129703 




gb:zn92b05.r1 Stratagene lung carcinoma








Hs.288381 


hypothetical protein DKFZp564O1278






AW975920 


Hs.283361 








AW009640 










NM_003462


















AA195764 














N-acylsphingosine amidohydrolase (acid c




447720 AL038765 Hs.161304 2.7

H83294 Hs.284122 Wnt inhibitory factor-1 2.7

442074 C17511 Hs.128430 2.7

424115 AA335497 Hs.293965 ESTs 2.7

417728 AW138437 Hs.24790 KIAA1573 protein 2.7

433803 AI823593 Hs.27688 ESTs 2.7

419247 S65791 Hs.89764 fragile X mental retardation 1 2.7

424310 AA338648 Hs.50334 ESTs 2.6

438504 AW665281 Hs.224625 ESTs

426486 BE178285 Hs.170056 Homo sapiens mRNA; cDNA DKFZp536B0220 (f

430417 AA461045 Hs.50701 ESTs 2.6

438297 AW515196 Hs.258238 ESTs, Moderately similar to ALU1_HUMAN A 2.6

422505 AL120862 Hs.124165 ESTs 2.6



457285 AI038858 

428667 AI375550 

431750 AA514986 

435575 AF213457 

413385 M34455 
403903 

407910 AA650274 

423424 AF150241 

436043 AW963838 

436645 AW023424 



Hs.74407 
Hs.283705 
Hs.44234 
Hs.840 



indoleamine-pyrrole 2,3 dioxygenase 



ns cDNA FLJ12136 fis, clone MA 
415122 D60708

416747 AW876523 

420159 AI572490 

444361 W76027 



452311 AW304029 

413802 AW964490 

417318 AW953937 

440028 AW473675 

437960 AI669586 

433687 AA743991 

430573 AA744550 

439737 AI751438 

453204 R10799 

436751 AA732217 

408165 AL137573 

431120 AA492588 

446638 AL133063 

438458 AW975186 

446063 AI720140 

430499 AW969408 

450496 AW449251 

441330 AI692984 

424433 H04607 

434677 AW444575 

445779 AI253104 

444649 AW207523 

415451 H19415 

432222 AI204995 
404288 

408572 AA055611

408727 AL137259 

408728 AL137379 
410095 AW589638 
410947 AK000305
418343 AA216372 
423401 NM_001992 
428637 AW979268 



438651 H64500 

443830 AI142095 

446800 AI341635 

450262 AW409872 



452412 AA029608 

454288 BE222648 

445745 AB007924 

424943 AU077260 

440106 AA864968 

458429 AV646559 

415261 T40928 

420026 AI831190 

431806 AF186114 

458722 AA741545 

419449 H18417 

436260 BE172762 

433644 AW342028 

419172 AW338625 

437982 N93466 

443348 AW873596 

417218 AA005247 

419236 AA330447 

448030 N30714 

417203 AA406341 

449275 AW450848 

436198 AK001125 



Hs.22245 
Hs.15929 
Hs.99785 



416778 M16505 
458332 AI000341 
448140 AF146761 



Homo sapiens cDNA FLJ12910 fis, clone NT
Homo sapiens cDNA: FLJ21245 fis, clone C
Homo sapiens cDNA FLJ13124 fis, clo
Human gene from PAC886K2,chi 



Hs.32241 ESTs 



Hs.191990 ESTs



gb:ny57g01.s1 NCI_CGAP_PM8 Homo sapiens
ins mRNA full length insert cDN



Hs.294054 ESTs 



Hs.15783 
Hs.162875
Hs.151079 



Hs.257131 ESTs 
Hs.129354 ESTs 
Hs.9218 ESTs 



Homo sapiens mRNA; cDNA DKFZp564A2463 (f 
gb:ng99c08.s1 NCI_CGAP_Thy1 Homo sapiens 
Homo sapiens mRNA; cDNA DKFZp434P1115 (f
E I Veakl solar L I r IMAN LU 



Hs.226568 ESTs, Moderately similar to ALU4_HUMAN A

Hs.47115 hypothetical protein DKFZp434D0513 

Hs.47125 hypothetical protein FLJ13912 

Hs.258947 ESTs 

Hs.67055 hypothetical protein FLJ20298

Hs.128087 coagulation factor II (thrombin) recepto 

gb:EST391378 MAGE resequences, MAGP Homo

Hs.2259',5 fucosyltransferase 9 (alpha (1,3) fucosy

gb:601286042F1 NIH MGC_44 Homo sapiens c 

Hs.192974 Homo sapiens cDNA FLJ12735 fis, clone NT 

Hs.123646 ESTs 

Hs.143273 ESTs 



ESTs, Moderately similar to ALU7_HUMAN A 



Hs.218933 ESTs 



Hs.61373 
Hs.279458 
Hs.13245 
Hs.153924 
Hs.127699 
Hs.12345 
Hs.8345 
Hs.166676 
Hs.270737 
Hs.282832 
Hs.57483 
Hs.292710 
Hs.256112 
Hs.22120 
Hs.121764
Hs.57572 
Hs.285754 



ESTs, Highly similar to c380A1.1b [H.sap 
KIAA0455 gene product
death-associated protein kinase 1 
KIAA1603 protein 

Homo sapiens cDNA: FLJ21399 fis, clone C 

ESTs 

ESTs 

tumor necrosis factor (ligand) superfami 



ESTs, Weakly similar to testicular tekti 
ESTs 



. . proto-oncogene (hepatocyte growth fa 

hs.lMlos HomosapienscDNAFU114ai lis, clone HE 

Hs.20161 HDCME31P protein 

Hs.269908 Homo sapiens cDNA FLJ11S91 lis, clone HE 

Hs.205457 KIAA1620 protein

Hs.300922 Homo sapiens cDNA FLJ10263 fis, clone HE

Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL

Hs.8136 endothelial PAS domain protein 1

Hs.2291 Probe hTg737 (polycystic kidney disease, 

Hs.7473 ESTs 

Hs.119471 ESTs
Hs.288617 
Hs.79876 
Hs.220491 
Hs.20450 



Hs.271004 ESTs 



313 
447610 AW296286

409519 AA075368 

441006 AW605267 

440817 AI341423 

420020 BE2958B6 

435395 AA729235 

424144 AA454033 

458145 AI239457 

408547 AA574291 

408941 AI452469 

409457 AW818081 

417137 U46265 

418950 T78517 

420756 AA411800

428316 AI860775

432896 NM_014097

436148 BE005252 

436284 AA708016 

437327 AL353942 

442611 BE077155 

456062 AI855286 

433014 NM_014711
401335 

428771 AB028992 

419140 AI982647 

454693 AW813428 

427785 X81053 

407339 AA777542 

408369 R38438 

427019 AA001732 

431089 BE041395 



419752 AA249573 

430073 U86136 

452401 NM_007115

430345 AK000282

407905 AW103655

427660 AI741320 

422355 AW403724 

453049 BE537217 



Hs.105323 ESTs

Hs.155981 mesothelin 

Hs.20887 hypothetical protein FLJ10392 

Hs.255534 ESTs 

gb:zm86h10.M Stratagene ovaria 
Hs.7627 CGI-60 protein 
ESTs 



Hs.94382 adenosine kinase 
Hs.117907 ESTs 

Hs.41644 Homo sapiens cDNA: FLJ23003 fis, clone L

Hs.130794 ESTs
Hs.57837 ESTs
Hs.165221 ESTs

gb:CM4-ST0276-101299-059-b09 ST0276 Homo 
Hs.81281 hypothetical protein 
Hs.13941 ESTs
Hs.189900 ESTs
Hs.93506 ESTs
Hs.279778 PRO1693 protein

gb:CM1-BN011M30400-171-g02 BN0116Homo 
Hs.190389 ESTs 

gb:Homo sapiens mRNA; cDNA DKFZp761 L2312 
Hs.177537 ESTs 
Hs.71962 ESTs 
Hs.279912 KIAA0419 gene product



Hs.132670 ESTs 



424711 NM_005795

446346 AI290205 

441974 AI683782 

444805 AB007899 

424027 AW337575 

419606 AW294795 

428613 AB037749 

434340 AI193043 

450297 AW901347 

432779 AW979241 

433650 AA503472 

419086 NM_000216 

428758 AA433988 

430153 AW968128 

418883 BE387036 

427669 AW451832 
400610 
402222 

407162 N63855 

415250 F02614 

421751 AW813731 

428552 AW274560 

432558 AW973769 

434742 M648302 

436586 AI308862 

441675 AI914329 

442039 AW276240 

443160 AI467915 

448764 AI568607 

449579 AW207260 

439810 AL109710

413714 AI560944 

400289 X07820 

413384 NM_000401 

438670 AI275803 

419991 AJ000098 



Hs.173233 
Hs.283676 
Hs.49169 
Hs.181022 
Hs.152618
Hs.232070
Hs.29352
Hs.239681
Hs.252905
Hs.114121
Hs.140
Hs.30343
Hs.11135
Hs.91453 
Hs.152175 

Hs.128245 
Hs.12017 
Hs.201591 
Hs.198529 
Hs.186928 
Hs.128685 



solute carrier family 15 (H+/peptide tra
hypothetical protein FLJ10970 
ESTs, Weakly similar to unknown protein 
KIAA1634 protein 
CGI-07 protein 
ESTs 



gb:ql79g06.x1 Soares_NhHMPu_S1 Homo sapi
ESTs 

KIAA0439 protein; homolog of yeast ubiqu 



Hs.98502 Homo sapiens cDNA FLJ14303 fis, clone PL 

gb:EST380338 MAGE resequences, MAGJ Homo 
Hs.1211 acid phosphatase 5, tartrate resistant
Hs.255938 ESTs, Moderately similar to KIAA1200 pro 



Hs.142634 zinc finger protein 

Hs.27319 ESTs 

Hs.159153 ESTs 

Hs.129520 ESTs 

Hs.162319 ESTs 

Hs.291695 ESTs 

Hs.167028 ESTs 

Hs.5461 ESTs 

Hs.128352 ESTs, Weakly similar to p80 [R.norvegicu

Hs.36053 ESTs 

Hs.182112 ESTs 

Hs.134014 prostate cancer associated protein 6 

Hs.85568 EST 

Hs.71428 ESTs 



Hs.75334 exostoses (multipl 
Hs.123428 ESTs 

Hs.94210 eyes absent (Drosophila) homolog 1
414888 AL039185

438474 AW865818 

453037 AA045175 

428467 AK002121 

413930 M86153 

422429 AA310527 

415083 AI632683 

417015 M83772 
406506 

448330 AL036449 

409719 AI769160 

423354 AB011130 

425188 AK002052 

427961 AW293165 

447357 AI375922 

412642 BE244598 

453716 AA037675

437370 AL359567 

407949 W21874 

427972 AA864870

453313 BE005771

426476 NM_003296

424238 AA337401

452930 AW195285

424527 AW13B558 

453095 AW295660 

449161 N53431 

429586 T73510 

423782 AI472209 

458124 AW005548 



Hs.77558 thyroid hormone receptor interactor 7

Hs.6232 KIAA0764 gene product 

Hs.177552 ESTs 

Hs.184465 hypothetical protein FLJ11259 

Hs.75618 RAB11A, member RAS oncogene family 

gb:EST181333 Jurkat T-cells V Homo sapie

Hs.27179 Homo sapiens cDNA FLJ12933 fis, clone NT 
Hs.80876 



Hs.207163 
Hs.108681 
Hs.127436 
Hs.155071
Hs.143134 



45 
50 
55 



441736 AW292779 
401049 

440727 AI073991 

419751 AW195581 



429399 AA452244 

444042 NM_004915

432343 NM_002960 

436772 AW975688 

428784 Y12851 

445268 AI218358 
402481 

412608 AA247995 

416521 H60929 

416624 H69044 

419780 AA713522 

421211 AA284966 

427541 AI798983 



436461 



Hs.15 
Hs.309 
Hs.152675
Hs.151962
Hs.247057
Hs.181304
Hs.153746
Hs.2042 
Hs.137635 
Hs.194097 
Hs.267158 
Hs.252756 
Hs.47647 
Hs.209153 
Hs.288369 



Hs.17967 ESTs 



511956 



hepatocyte growth factor (hepapoietin A;
ESTs 

Homo sapiens mRNA; cDNA DKFZp547D023 (fr 
ESTs 

putative gene product 

Homo sapiens cDNA: FLJ22490 fis, clone H 
testis specific protein 1 (probe H4-1 p3



ESTs, Weakly similar to KIAA0423 [H.sapi 



ATP-binding cassette, sub-family C (CFTR 
ESTs 



440312 AW614597 

440479 AA886461

441178 W90789 

441235 AI884586 

443314 AW771701 

422165 AL041199 

450696 AI654223 

432974 BE348793 
404200 

435990 AI015862 

421309 AI222086 

451558 NM_001089

416642 T96118 

406672 M26041 

417819 AI253112 



Hs.292737 
Hs.89433 
Hs.169799 



Hs.134268 ESTs 

Hs.93121 KIAA0761 protein 

Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi

Hs.103982 small inducible cytokine subfamily B (Cy

Hs.1473 gastrin-releasing peptide

Hs.124194 ESTs 

Hs.130554 Homo sapiens cDNA: FLJ23C89 fis, clone L 

Hs.34550 ESTs 

Hs.16727 ESTs 

Hs.10237 ATP-binding cassette, sub-family G (WHIT 

Hs.2961 S100 calcium-binding protein A3 

Hs.250867 zona pellucida glycoprotein 3A (sperm re 

Hs.193470 purinergic receptor P2X, ligand-gated io 

Hs.175048 ESTs 

Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ

Hs.44197 hypothetical protein DKFZp564D0462
gb:yr77h05.s1 Soares fetal liver spleen

Hs.87752 ESTs

Hs.266308 ESTs, Weakly similar to AF216312 1 type 

Hs.97961 ESTs 

Hs.162102 ESTs 

Hs.293261 ESTs 

Hs.201648 ESTs, Weakly similar to ZN42_HUMAN ZINC
Hs.72475 ESTs
Hs.208161 ESTs
Hs.153976 ESTs

Hs.135570 Homo sapiens cDNA: FLJ21268 fis, clone C



Hs.1481 

Hs.16026 Homo sapiens cDNA: FLJ23191 fis, clone L

gb:M70g02.x1 NCI_CGAP_Lu24 Homo sapiens 

Hs.131793 ESTs

Hs.270449 ESTs, Moderately similar to ALU1_HUMAN A

Hs.26630 ATP-binding cassette, sub-family A (ABC1

Hs.226313 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.198253 major histocompatibility complex, class 

Hs.133540 ESTs 



Hs.101810 Homo sapiens cDNA FU14232 fis, clt 
Hs.1 18726 ESTs 



459574 AI741122 
404274 

415086 AI597963 .. 

418210 R54575 Hs.13337 ESTs, Weakly similar to t 

419220 AA811938 Hs.291759 ESTs 

444314 AI140497 gb:ow76b09.s1 Soares_fetal_liver_spleen_
451050 AW937420



410976 R36207 
406673 M34996 
449677 AA002071 
449321 MO0 11 50 
418557 BE140602 
416320 H47867 
426384 AI472073 
414140 AA281279 
419520 AB009303 
446999 AA151520 
457447 X78261 
451099 R52795 
407366 AF026942 
410048 W76467 
400880 

418092 R45154 
428780 AI478578 
431067 AW574823 
432803 AA565398 
412104 AW205197 
422319 AL1220B4 
454359 N71277 
424806 AA382523 
434445 AI349306 
442994 AI026718 
410371 AA084482 
450232 BE300815 
417924 AU077231 
430899 BE018217 
431814 BE256242 
417543 AA203620 
444542 AI161293 
404593 

434803 AW974640 
451623 H77818 
452466 N34635 
402046 

434927 H46612 
436192 W93847 
401987 

423119 AA322201 
427112 Z32887 
414464 AI870175 
447829 AI433029 
449679 AI823951 
405472 

413621 AI808648 
432212 AW137742
404289 

415362 F06735 
427739 AW196755 
427772 AA412289 
430844 T94960 
434335 AA630107 
436052 AI021983 
442773 AB037722 
446799 AW978373 
450221 AA328102 
455673 BE065939 



447233 AW246333 

447197 R36075 

431087 H12723 

409064 AA062954 

427558 D49493 

426457 AW894667 

438118 AW753311 

427621 BE621182 

452114 N22687 



Hs.1BB08 ESTs, Weakly sin 



gb:tj85h03.x1 Soares_NSF_F8_9W_OT_PA_P_S



Hs.272177 H.sapiens mRNA for TRE17 5' extremity an
Hs.25954 interleukin 13 receptor, alpha 2

gb:Homo sapiens cig33 mRNA, partial sequ
Hs.274550 proline oxidase homolog

Hs.106604 ESTs
Hs.50636 ESTs 
Hs.200413 ESTs 

gb:nk41f01.s1 NCI_CGAP_GC2 Homo sapiens 
Hs.240951 ESTs 

Hs.121073 hypothetical protein FLJ10466
gb:2a36e03.s1 So 



Hs.11732 ESTs 
S.16954 ESTs 



Hs.11 



ESTs 



Hs.201326 ESTs 

Hs.82932 cyclin D1 (PRAD1: parathyroid a

Hs.183528 ESTs, Weakly similar to Bem46-like prote

Hs.270847 delta-tubulin

Hs.110153 EST, Weakly similar to BCGF_HUMAN B-CEL

Hs.146862 ESTs, Weakly similar to KIAA0525 protein

gb:EST386744 MAGE resequences, MAGM Homo

Hs.268991 ESTs 

Hs.29664 Human DNA sequence from clone 682J15 on 



no sapiens cDNA FLJ11846 fis, clone HE



gb:HSC1JB091 
Hs.98105 ESTs 
Hs.98123 ESTs 

gb:ye38d07.r1 
Hs.213220 ESTs 
Hs.271432 ESTs 

Hs.8707 Homo sapiens mRNA; cDNA DKFZp434N1131 (f
Hs.49221 ESTs, Weakly similar to zinc finger prot
Hs.24641 cytoskeleton associated protein 2

gb:RC3-BT0319-100100-012-c11 BT0319 Homo
Hs.181801 ESTs 



Hs.188912 ESTs 
Hs!l7901 



hypothetical protein FLJ12892



Homo sapiens cDNA: FLJ21974 fis, clone H
gb:yh88b01.s1 Soares placenta Nb2HP Homo
Hs.290791 ESTs 
Hs.141883 ESTs 

Hs.2171 growth differentiation factor 10 
Hs.169965 chimerin (chimaerin) 1 
Hs.259415 ESTs 

Hs.179882 Homo sapiens cDNA FLJ12437 fis, clone NT
Hs.8236 ESTs 
Hs.301550 KIAA0758 protein 



416402 NM_000715 Hs.1012 
452415 AA026115 Hs.114777 ESTs 
451609 AL046019 Hs.209276 ESTs 
435934 R19382

445158 AI992108 

407930 AA045847 

439335 AA742697 

443949 AW827419 

429716 R25685 

415817 U88967 

438675 AA813745 
405848 

416940 N75620 

442381 AI185136 

420036 R60336 

436252 AI539519 

413450 Z99716 

426572 AB037783 

439425 AF086244 

421168 AF182277 

449611 AI970394 



447482 AB033059 

457292 AI921270 

444974 AI203500 

456034 AW450979 

430634 AI860651 

426782 R14514 

452943 BE247449 

445326 AI220072 

421247 BE391727 

409994 D86864 

443268 AI800271 

455226 AW902103 

417321 N68722 

423778 Y09267 
404323 

448133 AA723157 

421047 AW514772 

426497 AA524696 

444623 AI103029 

412303 AW936336 



428330 L22524 

455807 BE141140 

425465 118964 

449424 AW448937 

427940 AA417812 

411502 AW946605 

411365 M76477 

412369 H80456 

452959 AI933416 

416580 T61572 

428775 M434579 



410011 AB020641 
411050 AW814902 
452453 AI902519 



Hs.117869 ESTs 

Hs.127206 ESTs 

Hs.188361 Homo sapiens cDNA FLJ12807 fis, clone NT 

Hs.62492 ESTs, Weakly similar to S59856 collagen 

Hs.235070 ESTs 

Hs.211933 collagen, type XIII, alpha 1 

Hs.78367 protein lj " 

Hs.123446 ESTs 



Hs.43157 ESTs 

Hs.48650 ESTs 

Hs.52792 Homo sapiens mRNA; cDNA DKFZp58S1 1 823 (f 

Hs.120969 Homo sapiens cDNA FLJ11562 fis, clone HE

Hs.75372 N-acetylgalactosaminidase, alpha-

Hs.170623 hypothetical protein FLJ11183

Hs.114659 ESTs

Hs.1360 cytochrome P450, subfamily IIB (phenobar

Hs.197075 ESTs 

Hs.14825 ESTs 

Hs.16810 ESTs 

Hs.269123 ESTs 

Hs.18705 KIAA1233 protein 

Hs.214178 Homo sapiens cDNA FLJ14251 fis, clone OV 

Hs.151612 ESTs 

gb:UI-H-BI3-ala-a-12-0-Ul.s1 NCI_CGAP_Su 

Hs.26685 ESTs 

Hs.191254 ESTs 

Hs.31082 hypothetical protein FLJ10525 



ESTs 

Hs.102910 general transcription factor IIH, polype 
Hs.57735 acetyl LDL receptor; SREC 
Hs.129445 hypothetical protein FLJ12496 

gb:QV0-NN1022-120500-220-c07 NN1022 Homo 
Hs.191368 ESTs 

Hs.132821 flavin containing monooxygenase 2

Hs.73769 folate receptor 1 (adult)
Hs.104473 ESTs
Hs.188644 ESTs 
Hs.202111 ESTs 

gb:CW4-DT0021-231299-070-g11 DT0021 Homo 
Hs.277901 ESTs 

Hs.2256 matrix metalloproteinase 7 (matrilysin,

gb:MR0-HT0075-021299-006-d07 HT0075 Homo
Hs.1904 protein kinase C, iota
Hs.197030 ESTs
Hs.38775 ESTs

Hs.250154 Homo sapiens cDNA FLJ12973 fis, clone NT
Hs.278242 tubulin, alpha, ubiquitous
Hs.285243 Homo sapiens cDNA: FLJ22029 fis, clone H
Hs.189674 ESTs 

Hs.79385 Human clone 23574 mRNA sequence 
Hs.143691 ESTs 

Hs.180726 Homo sapiens cDNA FLJ13543 fis, clone PL 

Hs.44205 cortistatin

Hs.57856 PFTAIRE protein kinase 1 

gb:MR1-ST0206-120400-022-f08 ST0206 Homo 
gb:QV-BT009-101198-051 BT009 Homo sapien

Hs.125445 ESTs

Hs.145268 ESTs 

Hs.158258 Homo sapiens mRNA; cDNA DKFZp434B1272 (f 



424368 AB037766 

421229 AI056590 

436304 AA339622 

453498 BE181412 

453280 AL157476

420193 AI460080 

444610 AI174783 
401575 

419092 J05581 

430129 BE301708 

410763 AF279145 

414783 AW069569 

411492 T46848 
405963 

418378 AW962081 

420831 AA280824 

424152 AL133591 

424641 AB001106 



Homo sapiens cDNA FLJ11767 fis, clone HE
ESTs, Weakly similar to unnamed protein
Homo sapiens mRNA; cDNA DKFZp761C082 (fr
ESTs

gb:HA2501 Human fetal liver cDNA library



Hs.23245 
Hs.26638 
Hs.32913 



Hs.89603 m 

Hs.233955 hypothetical protein FLJ20401

Hs.8966 tumor endothelial marker 8

Hs.75839 zinc finger protein 6 (CMPX1)

Hs.70337 immunoglobulin superfamily, mem



gb:EST374154 MAGE resequences, MAGG Homo
427616
435115 
437636 



445388 
447101 
448796 



AI821726 

AA764781 

AI394151

AF124250 

AI925280 

N72185 

AA147829

C00719 

AI702416 

R19992 

AI355009 

AA501760

AW971364 



Hs.33193 ESTs.h 
Hs. 120440 ESTs 
Hs.200771 



lartoAC0072283BC372 



gb:EST383453 MAGE resequences, MAGL Homo 



Unique Eos probeset identifier number
5c Gene cluster number 
Genbank accession numbers 



413136 
413875 



418378 
419546 
419807 



432222 
432507 
432779 
432803 



433687 
434803 
436148 
437327 



450522 
451024 
451381 
452163 



12881 30_ 
1350379 
137391CL 



216469J 
266211 1 



34025.1 
343347J 
348711." 



367934_1
370400_1
373061_1
393471_1
41500_1
43610_1
46684_1
491930_1
600667_1



AW818081 AW392887 AW514700 AW392881 
AA075368 AA075369 
AA079552 BE142525 BE142527

AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
AW814902 BE156656 BE156667 BE156590 BE156441 BE156447
AW872477 BE038101 T05990
BE265067 BE264978 AW875420
AW936336 AW936339



BE144884 H97942 

BE176776 H85072 

D53513 D59515 D80174 D59514 

F06735 R558S6R121 101103697 

H69044 T47567 H75691 T50292 

AW962081 AA218925 AA354237 

AA244199 AA244272 H57440

R77402 AA262462 AA250988 R06794

AW976153 AA278945 AA747691

AA310527 AW962295 Z44865 H06641

AL049377 AL079930 AL047223 AW885968 AA385235

AI472078 AA377209 AA865807

AW979268 AA878419 AA431342 AA431628

AW968128 AA468102 AA468165

T94960 AA487679 T95013

AA492588 AA492498 AA492571 

AW971240 AA493843 AA493723 

AW970622 AA503009 AA502998 AA502989 AA502805 T92188

AL137424 BE007148 T52277

AI204995 AW827539 AW969908 AW440776 AA528756

BE391093 AA551334 BE389643

AW979241 AA565006 AA847102

AA565398 AW894072 H97930

AW974094 AA569074 AA602574

BE348793 AA573113 N79366

AW605849 AW262898 N41060 AA594852

AW295399 AW207772 AW300641 AW070290 BE348854 AW170383 AA600968 AA778832

AA743991 AA604852 AW272737

AW974640 AA649516 N75626

BE005252 AK000786

AL353942 AW994305

AF085839 R69137 AW188788 R69254

AA879294 N67538 AI474541

AI140497 AW749625 AW749626 AW749644



673545_1 AI290205 AW235762 AI651268



761515_1 BE089973 AI498512 AW805032



AA299919 AW957012 AA001107 T83631 BE156389 
AA001713 HS3836 
AA002071 AA002232 T99209 



80945.2 
81270 1 
837264 1 
85565J 
867770_1 
902067 1 
909195_-1 

918300 1 

921410_1 AW812256 AW812257 AI906423 AI906422
930983_1 T05477 T07855 AI917711
1130674_1 N71277 AW390764



AA442176 AA259181

BE241831 AW249135 BE548847 AW250245
AI863140 W80703 R43474 
AI871833 
1229132_1
1249196_1
1262534_1
1265634_1



340916_1
371514_1
491768_1
966605_1



AW813428 AW813444 AW813367 AW813368 AW813429 AW813424
AW851309 AW850888 AW851419 AW851412 AW851299
AW902103 AW869012 AW869139

AW875951 AW875950 AW875936 AW875948 AW875939 AW875957
BE065939 BE066079 BE065956

BE141140 BE141139 BE141105 BE141143 BE141127 BE141202 BE141108

142696_1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367
BE011368 BE011362 BE011215 BE011365 BE011363
AW971364 AA525021 AA570759
AA602711 BE078290 

AW816379 AA888282 AA879046 AA879195 
AJ003631 AJ003650AJ003651 



Ref. Sequence source: 7 digit numbers in 

chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



led "The DNA sequence of hu 



400610 
400880 
401049 
401335 
401575 
401793 



9887671 

7232177 
9884881 
7229804 



9797406 
9931216 
7712202 



404043 
404200 
404274 
404288 
404289 
404323 
404488 
404548 



404916 
405041 
405095 
405472 
405494 
405848 



2769644 
9719753 
8113286 
8570305 
9944086 
8705107 
7341826 
7547195 
8072599 
8439781 



7711305 
7711374 
7711566 
8248611 



Nt_position

117606-117928,124040-124147 

29235-29336,36363-36580 

149157-150692 

15736-16352 

76253-76364 

102945-103083 

72893-73021,76938-77049 

166394-166556,168167-168395 

3261-3834,3939-4269 

87891-83991 

33641-33775,34182-34372,36003-36084,40343-40612 
4591046260,47563-47824 
101165-102597 
12609-12773 



7066-7210 

104127-104318 

3512-3691 

15049-15286,30267-30457 

31913-32219 

64835-64994 



74922-75788 

110443-110733 

91057-91188 

121230-121714 

138877-139066 

106297-106447,108462-108596 

70284-70518 

28135-28244 



125036-125422 

6843-8077 

106956-107121 



TABLE 28A: ABOUT 796 GENES DOWN-REGULATED IN LUNG FIBROSIS COMPARED TO NORMAL BODY 

Table 28A lists about 796 genes that are down-regulated in lung fibrosis (collection of IPF, HP, and NSIP) samples as compared with normal "body map" samples. These were
selected from about 59680 probesets on an Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" fibrosis sample expression level to "average" normal adult tissues
sample expression was less than or equal to 0.1. The "average" normal lung tissue level was set to the 75th percentile amongst normal lung tissues. The "average" fibrosis
expression level was set to the 95th percentile amongst fibrosis samples. In order to remove gene-specific background levels of non-specific hybridization, the 15th percentile value
amongst non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
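
The selection rule described above can be sketched in Python. This is only an illustration under stated assumptions, not the procedure actually used to build the table: the function names, the linear-interpolation percentile, and the choice to take the background from the same normal samples when no separate non-malignant set is supplied are all hypothetical. Only the percentile levels (75th, 95th, 15th) and the 0.1 cutoff come from the text.

```python
import math


def percentile(values, q):
    """Linear-interpolation percentile (q in 0..100) of a non-empty sequence."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one value")
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def fibrosis_to_normal_ratio(fibrosis, normal, non_malignant=None):
    """Background-corrected expression ratio for one probeset.

    Assumption: if no separate non-malignant sample set is given,
    the normal samples are used to estimate the background.
    """
    if non_malignant is None:
        non_malignant = normal
    background = percentile(non_malignant, 15)           # non-specific hybridization
    numerator = percentile(fibrosis, 95) - background    # "average" fibrosis level
    denominator = percentile(normal, 75) - background    # "average" normal level
    if denominator <= 0:
        return float("nan")  # ratio is undefined without signal above background
    return numerator / denominator


def is_down_regulated(fibrosis, normal, non_malignant=None, cutoff=0.1):
    """True when the corrected ratio is at or below the cutoff."""
    ratio = fibrosis_to_normal_ratio(fibrosis, normal, non_malignant)
    return ratio <= cutoff  # NaN compares False, so undefined ratios are excluded
```

Calling `is_down_regulated(fibrosis_values, normal_values)` for each probeset would flag candidates for a table of this kind. Subtracting the background from both terms keeps a probeset with a uniformly high non-specific signal from appearing unchanged when its specific signal has in fact dropped.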

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of normal lung to fibrosis

Pkey ExAccn UnigeneID Unigene Title R1

414002 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 18.18 

421218 NM_000499 Hs.72912 cytochrome P450, subfamily I (aromatic c 9.39 

404518 AI815601 Hs.79197 CD83 antigen (activated B lymphocytes, i 8.30 

404795 5.56 
425571 AJ007292 
406357 

407979 AA046306 

452378 AA025855 

408053 AW139474 

421770 AA374192 

425126 N32759 
402386 
402448 

448245 AI923551 

413778 AA090235 

419968 X04430 

447768 X86400 



Hs.158306 ephrin-A2 

Hs.62927 ESTs 

Hs.19597 ESTs 

Hs.246862 ESTs 

Hs.108124 ribosomal protein L41 

Hs.172944 chorionic gonadotropin, beta polypeptide 



Hs.170843 
Hs.75535 
Hs.93913 
Hs.19520 



402409 
456150 
427030 
426328 
429307 
400172 
431227 



myosin, light polypeptide 2, regulatory, 

interleukin 6 (interferon, beta 2) 

FXYD domain-containing ion transport reg 



437120 AI356125 

409020 AA062549 

431073 BE254470 

433495 AW373784 
403716 

424969 AW950928 
404348 

407070 Y10209 

412919 AI368680 



Hs.157767 ESTs, Weakly similar to hu 



se, mitochondrial 1 (ubiqui 



Z42308 

AA397600 

AW631296 



Hs.97531 

Hs.198951 

Hs.2743 
Hs.222312 
Hs.174303 



406821 AA977896 Hs.128873 ESTs, Highly similar to ALFA_HUMAN FRUCT 



401496 
421201 
402911 



416596 
400545 
418464 
426507 
403479 
406082 
401919 
449031 



AW241940 Hs.102500 hypothetical protein FLJ20481 

D25216 Hs.155650 KIAA0014 gene product 

AA774984 Hs.220649 ESTs, Weakly similar to TfE2 MOUSE LOWA 

AI633040 Hs.172730 ESTs 

AA584930 Hs.269451 ESTs, Weakly similar to XAP-5-like prote 

H67669 Hs.38564 ESTs 



AI867502 Hs.271462 ESTs 



457937 
406101 
407080 
419947 
421905 
454019 



H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL 

AW245207 Hs.5555 Homo sapiens cDNA FLJ13170 fis, clone NT 

D11747 Hs.177415 Finkel-Biskis-Reilly murine sarcoma viru 

X52638 Hs.739 6-phosphofructo-2-kinase/fructose-2,6-bi 

AI658621 Hs.127769 ESTs 

AW976930 Hs.128760 ESTs 

Z38133 Hs.113973 myosin, heavy polypeptide 8, skeletal mu 

AW298744 Hs.118894 ESTs 

AI660247 Hs.32699 ESTs, Weal ( , s i jla io IV I | otein III 

D31846 Hs.37025 aquaporin 2 (collecting duct) 

AA431734 Hs.104915 ESTs 



425182 AF041259 



Hs.155040 
NM_000218 Hs.156115 
AI244368 Hs.8124 



217 



potassium voltage-gated channel, KQT-lik 



Hs.88844 Homo sapiens hair and skin epidermal-typ 



AA779958 
BE614551 
AA432007 



Hs.249484 ESTs 
Hs.157425 double h 
Hs.158491 ESTs 




423548 
402051 
415196 
455446 
442428 



Hs.7989 KIAA1045 protein 

Hs.64211 ESTs, Weakly similar to similar to acyl- 

Hs.19413 S100 calcium-binding protein A12 (calgra 

" '89119 ESTs 

64257 ESTs 

gb:601275386F1 NIH_MGC_20 Homo sapiens t 

Hs.157492 Homo sapiens cDNA FLJ14079 fis, clone HE 

Hs.46794 ESTs 

Hs.30909 KIAA0430 gene product 

Hs.129782 mucin 3A, intestinal 



AK000150 Hs.78185 MAX-like bHLHZIP protein 

AW947749 gb:RC0-MT0005-130300-031-b01 MT0005 Ho 

BE464988 Hs.298302 ESTs 



459184 L35001 
402968 

417575 R00382 



420619 
447241 
448793 
453014 
446775 
455075 
406704 
457546 
410197 
433677 
405703 
408840 
413958 
454421 
406702 
408664 
402457 
403612 



415423 AA164743 



AI937242 

AI792836 

AW854850 

M21665 

AA568484 

NM_005518 

AI791912 



Hs.95669 ESTs 

Hs.191199 ESTs 

Hs.99430 testis zinc finger protein 

Hs.19322 ESTs 

Hs.215477 ESTs 

Hs.176590 ESTs 

Hs.232273 ESTs 

Hs.929 
Hs.153632 
Hs.59889 
Hs.190885 



ESTs, Moderately similar to ALU1_HUMAN A 
Hs.254880 ESTs 

Hs.172364 Homo sapiens mRNA for FLJ00085 protein, 
Hs.59563 Homo sapiens mRNA for FLJ00007 protein, 
Hs.278432 myosin, heavy polypeptide 6, cardiac mus 
gb:yg93c07.r1 Soares infant brain 1NIB H 



AL038170 
A13B3475 
AI591214 



AA056635 Hs.5356 



AA760849 
AW966864 
AW014139 



403997 
408704 
407005 
405075 
430728 



405895 
431929 
405217 
437569 
419822 
445918 
446149 
457829 
404282 
409778 
445353 
458764 
402195 
404247 



432301 U34249 

424958 AA984420 

442197 AW837912 

415003 M11437 

420767 AF072711 



Hs.180086 ESTs 

Hs.80756 
Hs.171697 
Hs.156336 
Hs.229570 
Hs.846 



interleukin 8 receptor, beta 



gb:EST380598 MAGE resequences, MAGJ Homo 
ESTs, Highly similar to S17112 interfero 



AW294163 Hs.146127 ESTs 



Hs.294052 
Hs.255780 
Hs.145656 
Hs.203181 
Hs.210843 



BE551465 Hs.175211 



ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to dJ1039K5.2 [H.sa 

gb:U1-HF-BR0p-ajk-b-05-0-U I .: 1 NIH_MGC_5 
ESTs 

gb:601473204F1 NIH_MGC_68 Homo sapiens c 



Hs.179718 v-myb avian myeloblastosis viral oncogen 

Hs.167075 ring finger protein 9 

Hs.283659 ESTs 

gb:QV3-LT0048-260100-06tHK)2 LT0048 Homo 

Hs.77741 kininogen 

Hs.99918 carboxyl ester lipase (bile salt-stimula 

Hs.121544 interleukin 12 receptor, beta 1 
Hs.189109 Homo sapiens cDNA: FLJ21458 fis, clone C 
Hs.272790 Homo sapiens cDNA FLJ20238 fis, clone CO 




400672 
430713 
449748 
453756 



AW139415 Hs.61906 ESTs 



438216 Z83952 



406118 
402165 

416982 J05401 Hs.80691 creatine kinase, mitochondrial 2 (sarcom 

425515 W26609 gb:35f12 Human retina cDNA randomly prim 
402951 

427886 AA417083 Hs.104789 ESTs 

447173 AW449385 Hs.157294 ESTs 

448703 BE613942 Hs.170890 Homo sapiens cDNA: FLJ21129 fis, clone C 

426344 H41821 Hs.169393 transcriptional activator of the c-fos p 

401840 

403731 

405378 

405555 Y09306 Hs.30148 homeodomain-interacting protein kinase 3 

416559 AI039195 Hs.128060 ESTs, Weakly similar to cDNA EST yk481g5 

Hs.252815 ESTs 

gb:601311130F1 NIH_MGC_44 Homo sapiens c 
gb:ws58g05.x1 NCI_CGAP_Brn25 Homo sapien 

Hs.299847 ESTs, Highly similar to A2AD_HUMAN ALPHA 

Hs.207423 ESTs 

Hs.281455 ESTs 

gb:NHTBCae03a05f1 Normal Human Trabecula 
Hs.254515 ESTs, Moderately similar to AF248953 1 g 
Hs.209542 ESTs, Weakly similar to B cell linker pr 

gb:FB12A9 Fetal brain, Stratagene Homo s 
Hs.197114 RNA binding protein; AT-rich element bin 
Hs.209431 ESTs 

Hs.188417 ESTs, Weakly similar to ZnT-3 [H.sapiens 
Hs.75741 amiloride binding protein 1 (amine oxida 
Hs.235390 Homo sapiens mRNA; cDNA DKFZp761B101 (fr 
gb:qj73c12.x1 NCI_CGAP_Kid3 Homo sapiens 



423011 NM_000683 

451172 AW206465 
401015 

414705 BE464157 

439894 AA853077 

446305 AW270149 

453512 AL040160 

418556 T02850 

457197 AB016092 

467276 AA463422 

458766 AW183618 

414075 U11862 

430210 AL157426 

442614 AI269030 
402538 
439891 



AL389940 Hs.109968 



440056 BE294828 Hs.13323 hypothetical protein FLJ22059 



406150 
426880 
447129 



AA453482 
AW014123 
BE161733 
AI458309 



gb:zx47a11.r1 Soares_testis_NHT Homo sap 
Hs.161402 ESTs 

Hs.97283 ESTs, Weakly similar to ALU1_HUMAN ALU S 
Hs.117406 ESTs 



427586 
454108 
429749 



437433 
401688 
441748 
453072 
400635 
417176 
427858 



408922 
423668 
440338 
403115 



BE293546 
AW450592 
AA021163 
AA363733 
AA609661 
AA161071 
AI685174 
AW511138 
AA724543 
R74016 



gb:601186671F1 NIH_MGC_15 Homo sapiens c 
Hs.300459 ESTs 
Hs.22287 ESTs 

Hs.1032 regenerating islet-derived 1 alpha (panc 



Hs.71465 

Hs.22293 ESTs 

Hs.256581 ESTs 

Hs.168824 ESTs 

Hs.121581 ESTs 



AW974475 Hs.143467 ESTs 

NM_001971 Hs.21 elastase 1, pancreatic 

AW837063 gb:QV1-LT0037-150200-069^08 LT0037 Homo 

BE217872 Hs.279537 ESTs 

R87388 gb:ym88g04.r1 Soares adult brain N2b4HB5 

Y10148 Hs.131138 neurotensin receptor 2 

R62431 Hs.12758 ESTs 

R17268 Hs.301560 ESTs 

AI971975 Hs.212892 ESTs 

BE168828 gb:QV1-HT0517-020400-145-f04 HT0517 Homo 

AI025315 Hs.131615 ESTs 

T06607 Hs.6846 hypothetical protein FLJ13055 
421996 
436130 
407243 



AW583807 
AA341497 
AA058357 






442792 AI352340 

454406 AA213605 

424648 AA344576 

433963 AI218808 
400736 
405343 

409702 AI752244 

432092 AF135026 

441915 AI566116 

453147 AA733098 

415604 Z44177 

422927 AW247388 

401211 AJ004832 

413808 J00287 

414433 BE407755 

421978 AJ243662 

431204 F28841 

433605 AI378012 

449383 AW444712 

455652 BE064675 
402382 

407282 AI345597 

457273 AI167145 

459073 AW968616 
402394 

428875 AW451624 

456634 AA609911 

434352 AF129505 

439281 AA100768 

444153 AKCC1610 
401122 

444340 AI143198 



Hs.1460 glucagon 

Hs.31408 ESTs 

Hs.74466 carcinoembryonic antigen-related cell ad 

Hs.37936 suppressor of variegation 3-9 (Drosophil 

Hs.131194 ESTs 

Hs.267861 ESTs 

gb:EST50478 Gall bladder I Homo sapiens 

Hs.187778 ESTs 



Hs.285749 Human DNA from chromosome 19-specific co 

gb:Homo sapiens kallikrein-like protein 

Hs.207066 ESTs, Weakly similar to FOG [M.musculus] 

Hs.279909 CGI-06 protein 

Hs.170434 Homo sapiens cDNA FLJ14242 fis, clone QV 

Hs.301423 calcium binding protein 1 (calbrain) 

Hs.5038 neuropathy target esterase 

Hs.182183 caldesmon 1 

Hs.169100 Homo sapiens cDNA FLJ12529 fis, clone NT 

Hs.110196 NICE-1 protein 

Hs.250760 cytochrome c oxidase subunit VIa polypep 

Hs.147953 ESTs 

Hs.196573 ESTs 

gb:RC1-BT0313-301299-012-h11 BT0313 Homo 

Hs.254727 ESTs 

Hs.165533 ESTs 

Hs.296234 ESTs, Highly similar to mitogen-activate 

Hs.178202 ESTs 

Hs.109012 ESTs 

Hs.86492 small muscle protein, X-linked 

Hs.48485 ESTs 

Hs.10414 hypothetical protein FLJ10748 

Hs.143561 ESTs 

gb:RC1-BT0313-110300-015-f06 BT0313 Homo 
gb:EST375158 MAGE resequences, MAGH Homo 

Hs.88523 ESTs 



409302 AW500732 
430144 AI732722 
444580 AI168365 
401704 
401810 

424473 AK001405 
438573 AW135084 
BE009345 
AB002058 
NM_000939 
AA083609 
NM_014759 






429785 H82114 



422664 AA315933 Hs.120879 
432247 
453820 
400675 
405556 



Hs.148584 Homo sapiens cDNA FLJ10543 fis, clone NT 
Hs.299865 ESTs 
Hs.128942 ESTs 

Hs.113275 purinergic receptor P2X-like 1, orphan r 
Hs.1897 proopiomelanocortin (adrenocorticotropin 
gb:zm63d05.r1 Stratagene fibroblast (937 
Hs.239500 KIAA0273 gene product 
Hs.255737 ESTs 

Hs.149797 hypothetical protein FLJ10979 

Hs.1872 phosphoenolpyruvate carboxylase 1 (sol 

Hs.301769 ESTs 

Hs.6846 hypothetical protein FLJ13055 
Hs.249260 ESTs 

heterogeneous i 
ESTs 

,105805 ESTs 



430176 
430631 
433114 



R77494 

Y09306 Hs.30148 homeodomain-interacting protein kinase 3 

M94891 Hs.278423 pregnancy specific beta-1-glycoprotein 4 

BE560553 Hs.205450 Homo sapiens cDNA: FLJ22570 fis, clone H 

AA088470 Hs.83135 p53-responsive gene 6 

W73753 Hs.58330 ESTs 

AW137262 Hs.192713 ESTs 

M81057 Hs.180884 carboxypeptidase B1 (tissue) 

AI825817 Hs.143272 ESTs 

AI762757 Hs.129869 ESTs, Weakly similar to AF113685 1 PRO09 

AI815978 Hs.160427 ESTs 

AA078492 gb:7P04D11 Chromosome 7 Placental cDNA L 

D45371 Hs.80485 adipose most abundant gene transcript 1 

AL161995 Hs.234775 neurturin 

AJ003147 Hs.278464 olfactory receptor, family 1, subfamily 

AA121579 gb:zn77f02.r1 Stratagene NT2 neuronal pr 

U57352 Hs.6517 amiloride-sensitive cation channel 1, ne 

AW166358 Hs.124979 ESTs 

AA010662 Hs.188639 ESTs 
449891 
400527 
428581 
443647 



Hs.37848 ESTs 



436752 
437405 
449174 
449887 
453261 
454243 



AA001778 

AW298529 

AA338837 

T66136 

AW080843 

AA034116 

AW241901 

AA216382 

AA393460 

AK000224 



421419 
405420 
405737 
414016 
415744 
420375 
426322 
421592 
401743 
405187 
442763 
451621 
413248 
423913 
439999 
440185 
450482 
413972 
420476 
428748 
431148 
447205 



Hs.104881 
Hs.126261 
Hs.282475 
Hs.288156 
Hs.255774 
Hs.42547 
Hs.12880 
Hs.200275 
Hs.118494 
Hs.250683 
Hs.30002 



Hs.190787 
Hs.31844 
Hs.104134 



ESTs 



CIT 



sapiens Chromosome 

Homo sapiens cDNA: FLJ21819 fis, clone H 
ESTs 

Homo sapiens cDNA FLJ13975 fis, clone Y7 

ESTs 

ESTs 

ESTs 

ESTs 

SH3-containing protein SH3GLB2 
gb:zt71e05.r1 Soares_testis_NHT Homo sap 
hypothetical protein FLJ20217 
gb:au94e02.y1 Schneider fetal brain 0000 
tissue inhibitor of metalloproteinase 4 
Homo sapiens cDNA FLJ12536 fis, clone NT 
homeo box (H6 family) 1 



AA134594 Hs.71528 ESTs 

AW964850 Hs.279307 ESTs 

AF182077 Hs.97244 glioma tumor suppressor candidate region 

J05068 Hs.2012 transcobalamin I (vitamin B12 binding p 

AF009801 Hs.105941 bagpipe homeobox (Drosophila) homolog 1 



AI017037 Hs.131121 ESTs 

AI879148 Hs.26770 fatty acid binding protein 7, brain 

T64358 Hs.21433 ESTs 

NM_016436 Hs.301055 hepatocellular carcinoma-associated anti 

AA115811 Hs.6838 ras homolog gene family, member E 

AW104546 Hs.270929 ESTs 

AI697844 Hs.221720 ESTs 

BE279548 Hs.162717 ESTs, Weakly similar to HPPD_HUMAN 4-HYD 

AW575863 Hs.136232 ESTs 

AW593206 Hs.98785 ESTs 

AA502653 Hs.28621 ESTs 

BE617015 Hs.11006 ESTs 

BE179190 gb:RC0-HT0613-210300-032-f07 HT0613 Homo 



409762 AW498884 Hs.257970 ESTs 



441212 AW242447 

445624 AW140103 

458619 AA872064 
401969 
403327 

407245 X90568 

417361 NM_000275 

436034 AF282693 

442682 AI014545 

458494 AI380906 
404682 

407402 AF035303 

409368 AA071059 

440362 AA883812 

448866 BE297743 



403186 
409543 
443672 



Hs.143884 ESTs 

Hs.21333 Homo sapiens mRNA; cDNA DKFZp566N1047 (f 

Hs.258507 ESTs 

Hs.146182 ESTs, Weakly similar to lactase phlorizi 

Hs.78880 ilvB (bacterial acetolactate synthase)-l 

Hs.301218 ESTs, Weakly similar to Unknown gene pro 



Hs.82027 oculocutaneous albinism II (pink-eye dil 

Hs.150185 inflammation-related G protein-coupled r 

Hs.231027 EST 

Hs.158436 ESTs 



Hs.284203 myogenic factor 3 



AA367019 Hs.241395 



409543 AW410200 



437237 
429134 
445041 



gb:fh05b12.x1 NIH_MGC_17 Homo sapiens cD 
Hs.9667 butyrobetaine (gamma), 2-oxoglutarate di 
Hs.202280 ESTs 
Hs.251836 ESTs 
Hs.181770 ESTs 
Hs.42456 ESTs 

Hs.92282 paired-like homeodomain transcription fa 
gb:601171435F1 NIH_MGC_15 Ho 



Hs.11398 ESTs 

Hs.284249 Homo sapiens cDNA: FLJ22334 fis, clo 

Hs.217493 annexin A2 

Hs.234234 aldolase B, fructose-bisphosphate 

Hs.191215 ESTs 

Hs.171941 ESTs 
Hs.46452 
AW809157 




gb:RC0-ST0118-041099-031-c07_1 ST0118 Ho 








ESTs, Weakly similar to AF170723 1 prate 




AC004597 


Hs.248088 


olfactory receptor, family 10, subfamily 




BE246440 




pre-B-cell leukemia transcription factor 








phospholipase A2, group VI (cytosolic, c 


459499 


AW402653 


Hs.28355 


Homo sapiens cDNA: FLJ22421 fis, clone H 








apolipoprotein C-III 




AI446586 


Hs.21835 






AI810474 


Hs.196945 


ESTs 


426998 


BE274360 




gb:601121063F1 NIH_MGC_20 Homo sapiens c 


428407 


NM_003963 


Hs.184194 


transmembrane 4 superfamily member 5 








gb:C75571 Human pancreatic islet Homo sa 








guanylate cyclase activator 2B (uroguany 




AW976183 




LI eaklysin JJ512E Hi 




BE246327 




gb:TCBAP1E1967 Pediatric pre-B cell acut 








aldehyde dehydrogenase 8 




AF196478 


Hs.188401 


















gb:Human mRNA clone with similarity to L 








gb yy78d01 r1 Soare .multiple c 




AW150835 




hypothetical protein FLJ21620 




AI867182 


Hs.202255 










gb:H.sapiens mRNA HTPCRX11 for olfactory 
















guanylate cyclase activator 1B (retina) 




AW998865 


Hs.186703 










engrailed homolog 1 




AI761930 


Hs.205127 


ESTs 


455366 


AW947563 




gb:RC0-MT0004-140300-031-g11 MT0004 Hom 


408421 


AW193734 


Hs.253067 


ESTs 


421907 


BE018556 


Hs.109358 


ATPase, Class V, type 10B 


432742 


AA564453 


Hs.162339 


ESTs 


436624 


T64297 




fatty acid binding protein 1, liver 


439543 


W75935 




ESTs 


443317 


AI051601 


Hs.200191 


ESTs 


449097 




Hs.95110 


ESTs, Weakly similar to PIP6_HUMAN 1-PHO 




AA194554 


Hs.183434 


ATPase, H+ transporting, lysosomal (vacu 


407387 


AB000895 




gb:Homo sapiens mRNA for cadherin FIB1, 




U48263 


Hs.89040 


prepronociceptin 


436749 


AA584890 


Hs.5302 


lectin, galactoside-binding, soluble, 4 


458475 


AI650322 


Hs.143249 


ESTs 




gb:PM3-ST0307-241 299-002403 ST0307 Homo 




417421 
420562 
425011 



gb:oa39c09.s1 NCI_CGAP_GCB1 He 

Hs.268857 ESTs, Weakly similar to ALU1_HUMAN ALU S 

Hs.194517 ESTs 

Hs.111401 very long-chain acyl-CoA synthetase homo 

Hs.209637 Homo sapiens cDNA FLJ12921 fis, clone NT 

Hs.232655 ESTs 

Hs.1815 myosin, light polypeptide 3, alkali; ven 

Hs.5338 carbonic anhydrase XII 

Hs.132278 ESTs 
Hs.78871 



Hs.129998 Homo sapiens cDNA FLJ14267 fis, clone PL 
Hs.69517 ESTs, Highly similar to differentially e 
Hs.S4310 



Hs.1340 colipase, pancreatic 

Hs.172004 titin 

Hs.172216 chromogranin 

Hs.194366 transthyretin I 

Hs.10755 dihydropyrimidinase 

Hs.25333 interleukin 1 receptor 

Hs.205350 ESTs 

Hs.146133 ESTs 

Hs.201778 ESTs 

Hs.252819 ESTs 

Hs.75442 albumin 

Hs.1 144 glycoprotein IX 

Hs.157195 peptide YY, 2 (i 

gb:601108143F1 NIH_MGC_16 Homo sapiens c 

AA203285 Hs.294141 ESTs, Weak') i a d I7CCD15.1 [H.sa 

AL138201 Hs.82120 nuclear receptor subfamily 4, group A, m 

AI345569 Hs.190046 ESTs 

T51986 Hs.283108 hemoglobin, gamma G 
454771 
415672 
418141 
406706 
418197 
431821 
455433 
407743 
418888 
434001 
441031 



AI612788 

T11738 

AW819939 

N53097 

AW845738 

X03740 

AA214253 

AW939463 
AW814118 
AU076801 



Hs.132348 
Hs.68137 
Hs.127574 
Hs.273629 
Hs.193579 
Hs.171118 
Hs.231581 



adenylate cyclase activating polypeptide 

ESTs 

ESTs 



ESTs 



408349 
420391 
421126 
449329 
453615 
417296 
420287 
427583 



AA305079 

BE546947 

AA456891 

M74587 

AW752783 

AA195712 

L36196 

AA740907 

M82962 

AW296134 

AB018255 

BE045897 

AA100573 

NM_002390 

BE011320 



Homo sapiens mRNA for FLJ00026 protein, 
myosin, heavy polypeptide 1, skeletal mu 
gb:zn58g02.M Stratagene muscle 937209 H 
Hs.271221 hypothetical protein FLJ20064 

gb:QV1-DT0072-310100-056-g02 DT0072 Homo 
gb:MR3-ST0203-151199-011-d09 ST0203 Homo 



serine (or cysteine) proti 
fibrinogen, B beta polypeptide 

C 1 Tut 1-150200-071 9 TO I Homo 



417084 

423276 
433787 
413830 
423576 



442775 
454748 
437744 
451997 
452340 
411879 



Hs.89436 
Hs.3697 
Hs.7645 

Hs.1342 
Hs.44276 
Hs.79123 
Hs.102122 

Hs.132696 
Hs.81884 
Hs.88297 
Hs.179704 
Hs.86999 
Hs.111138 
Hs.274454 
Hs.182421 



homeo box C10 
KIAA0084 protein 



gb:IL3-CT0219-221199-029-F03 CT0219 Homo 
ESTs 

sulfotransferase family, cytosolic, 2A, 
ESTs 

meprin A, alpha (PABA peptide hydrolase) 
ESTs 

KIAA0712 gene product 
ESTs 

troponin C2, fast 

a disintegrin and metalloproteinase doma 
gb:PM3-BN0218-090500-002-d09 BN0218 Homo 



H08370 Hs.33067 ESTs 

AC003034 Hs.126261 Homo sapiens Chromosome 16 BAC clone CIT 

AI472951 Hs.173688 ESTs 

BE263439 Hs.13144 HSPC160 protein 

NM_000383 Hs.129829 autoimmune regulator (autoimmune polyen 



chymotrypsinogen B1 




W22206 

BE142052 

AI902761 

AA315993 

AB011171 

AW959498 

AW862014 



Hs.272087 
Hs.105484 
Hs.198037 



Hs.505 
Hs.2737 
Hs.144879 



glutathione peroxidase 2 (gastrointestin 



mRNA; cDNA DKFZp5B4N2464 (f 
gb:63E10 Human retina cDNA Tsp509I cleav 
gb:CM3-HT0137-150999-011-b06 HT0137 Homo 
ESTs 

ESTs, Weakly similar to LITB_HUMAN LITHO 
KIAA0599 protein 
chymotrypsin C (caldecrin) 
gb:RC3-CT0347-160200-013-d09 CT0347 Homo 
ESTs, Weakly sir " " 
KIAA0724 gene product 



BE145354 Hs.273758 Homo sapiens cDNA: FLJ23112 fis, clone L 
NM_001395 " " 



418575 
419818 
429845 

407013 

429780 



F13040 

AA452636 

AI657122 
AB020337 
AI081980 
U35637 
AC002301 

AW731886 



429184 AF095735 

437389 AL359587 

444412 AI147652 

451139 AW293316 

431284 AA570148 

431959 AA366217 
408158 

419648 T73661 

430681 AW969675 



Hs.182937 peptidylprolyl isomerase A (cyclophilin 

Hs.131057 ESTs, Moderately similar to CRGD_HUMAN G 

Hs.222886 ESTs 

Hs.301931 ESTs 

Hs.225943 UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 

Hs.285829 solute carrier family 25 (mitochondrial 

gb:Human nebulin mRNA, partial cds 

Hs.184507 Homo sapiens Chromosome 16 BAC clone CIT 

Hs.300388 ESTs 

Hs.95196 ESTs, Weakly similar to T20B12.3 [C.eleg 



Hs.190042 ESTs 

Hs.198003 sarcosine dehydrogenase 

Hs.271586 hypothetical protein DKFZp752M115 

Hs.216381 Homo sapiens clone HH409 unknown mRNA 

Hs.205558 ESTs 

Hs.126783 Homo sapiens cDNA: FLJ22610 fis, clone H 
carboxypeptidase A1 (pancreatic) 



Hs.2879 

Hs.91877 ESTs, Highly similar to THIH_HUMAN THYRO 

Hs.291232 ESTs 

Hs.101 cytochrome P450, subfamily IVF, polypept 













440089 


















U5 




AI475490 


Hs.170577 


















AA127873 




















AW897265 




















AA766707 


Hs.153039 








AI827946 


Hs.189118 










Hs.169849 








AW130334 










AI763257 




















AW138211 








BE074165 




gb:PM3-BT0564-030300-002-e12 BT0564 Homo 


















Hs.283435 








AI653232 


Hs.195059 












gb:HSC35H091 normalized infant brain cDN 
















BE142384 










BE514778 




















AW970254 




















AI634534 










AI221409 




















AA829526 








































AA521268 






























AW014022 










AA700127 




















AI790425 




















AF039401 










AW972118 




















AI075877 












Hs.31818 


















AA280209 


Hs.165270 








AA437009 










AI215509 


Hs.143964 




























AW967420 










W28876 






















Hs.76277 


ESTs, Weakly similar to TB2 [H.sapiens]. 








Hs.75576 


plasminogen 




429201 


X03178 


Hs.198246 


group-specific component (vitamin D bind 






W20128 


Hs.296039 


ESTs 




439450 




Hs.125304 


ESTs 


1.07 


458963 


AI701393 


Hs.278728 


Rad and Gem-related 2 (rat homolog) 


1.07 


405161 








1.06 




AA058357 




carcinoembryonic antigen-related cell ad 


1.06 


424294 


BE299311 




gb:601119256F1 NIH_MGC_17 Homo sapiens c 


1.06 




M88700 


Hs.150403 


dopa decarboxylase (aromatic L-amino aci 


1.06 


444687 


AW972109 


Hs.135107 


ESTs 


1.06 


444754 


T83911 


Hs.11881 


transmembrane 4 superfamily member 4 


1.06 


421243 


AW873803 


Hs.102876 pancreatic lipase 


1.05 


444290 


AA262496 


Hs.29280 


ESTs 


1.05 


407984 


AW134708 


Hs.243569 ESTs 




439706 


AW872527 


Hs.59761 


ESTs 


1.04 


402194 








1.03 



427506 AK000134 Hs.179100 hypothetical protein FLJ20127 

428819 AL135623 Hs.193914 KIAA0575 gene product 

434590 T47232 gb:yb64b08.s1 Stratagene ovary (937217) 

416378 AW044467 Hs.73708 ESTs, Weakly similar to A57291 cytokine 

431912 AI660552 Hs.154903 ESTs, Weakly similar to A56154 Abl subst 

443316 AI478463 Hs.18443 ESTs 

428585 AB007863 Hs.185140 KIAA0403 protein 

400440 X83957 Hs.83870 

404619 BE514535 Hs.77171 

407168 R45175 gb:yg40f01.s1 Soares infant brain 1N 
408052 AW501117 Hs.283585 ESTs 

409187 AF154830 Hs.50966 carbamoyl-phosphate synthetase 1, mitoch 

409327 L41162 Hs.53563 collagen, type IX, alpha 3 

410234 NM_003837 Hs.61255 fructose-1,6-bisphosphatase 2 

410319 R23413 Hs.71935 putative zinc finger protein from EUROIM 

411000 N40449 Hs.201619 ESTs, Weakly similar to SEB4B [H.sapiens 

412098 AI493054 Hs.158968 ESTs 

412446 AI768015 Hs.92127 ESTs 

412637 AA115097 Hs.261313 ESTs 

413147 BE067271 gb:PM2-BT0349-161299-001-b05 BT0349 Homo 

413597 AW302885 Hs.117183 ESTs 

414117 W88559 Hs.1787 proteolipid protein (Pelizaeus-Merzbache 

Hs.75353 serine (or cysteine) proteinase inhibito 

Hs.81131 -- " -'- "-"-•"—< 



420923 
421100 
421204 
422189 
422792 
423371 
424208 
BE067271 
AW302885 
W88559 
AU076633 
Z49878 
AF133820 
T72104 
Z44245 
AF097021 
AW351839 
AW081587 
AF252292 
AI951548 



Hs.84665 titinimn 
Hs.93194 



A-l 



Hs.273321 differentially expressed in hematopoieti 

Hs.124660 Homo sapiens cDNA: FLJ21763 fis, clone C 

Hs.165051 ESTs 

Hs.112933 Tax interaction protein 40 

Hs.135163 ESTs 

AU076819 Hs.1650 solute carrier family 26, member 3 

AW583123 Hs.143113 pancreatic lipase-related protein 2 

424922 BE386547 Hs.217112 ESTs, Weakly similar to Similarity to Ye 

425206 NM_002153 Hs.155109 hydroxysteroid (17-beta) dehydrogenase 2 

425545 N98529 Hs.158295 Human mRNA for myosin light chain 3 (MLC 

425983 AK000226 Hs.166619 mucin and cadherin-like 

426004 AW600300 lls.124123 EST Weal I ( imila ncoltn [I 10 

427627 R87582 Hs.179915 guanine nucleotide binding protein (Gpr 

428848 NM_000230 Hs.194236 leptin (murine obesity homolog) 

AL022314 Hs.194750 Human DNA sequence from clone 1170K4 on 
gb:aj32e09.s1 Soares_testis_NHT Homo sap 

Hs.204096 lipophilin B (uteroglobin family member) 



429441 
429930 
429970 
430418 
431845 
433084 



AA813214 
AJ224172 
AI580809 
AK000072 



AA516469 
M18079 
F35430 



AB018274 



442403 AW207724 



448657 BE147857 
449238 AA428229 
450085 AW293791 
450390 N93227 
451661 
452093 
452528 
452624 



454517 
459367 
408021 
417435 
437206 
422890 
425878 
441888 



Hs.36029 heart are 

Hs.270554 ESTs 

Hs.282265 fatty acid binding protein 2, intestinal 

Hs.146070 ESTs, Weakly similar to ALU1_HUMAN ALU S 

Hs.116822 ESTs 

Hs.14148 ESTs 

Hs.6214 KIAA0731 protein 

Hs.129516 ESTs 

Hs.199917 ESTs 

AI277101 Hs.25890 ESTs, Weakly similar to transducin (H.sa 

AI277924 Hs.145199 ESTs 

AI439646 Hs.157494 ESTs, Weakly similar to KIAA0676 protein 

AA026880 Hs.25252 Homo sapiens cDNA FLJ13603 fis, clone PL 

NM_012093 Hs.18268 adenylate kinase 5 

BE066534 Hs.929 myosin, heavy polypeptide 7, cardiac mus 
29384 I e n oh C'i i e 

Hs.85524 muscle-specific RING-finger protein homo 

Hs.60162 Homo sapiens cDNA: FLJ21528 fis, clone C 

Hs.98403 ESTs 

Z28564 Hs.255950 ESTs, Weakly similar to AA64.HUMAN 64 KD 

AA447453 Hs.27860 Homosapiens """ ' 

AA742457 Hs.291479 ESTs 

Hs.30054 coagulation far 

Hs.172753 ESTs 

Hs.273741 ESTs 

gb:IL2-UM0079-090300-050-D02 UM0079 Homo 

BE148877 gb:CM4-HT0244-111199-040-h12 HT0244 Homo 
AW137133 Hs.245867 ESTs 



AW972580 
AW803340 



NM_005181 Hs.82129 

AW975934 Hs.283382 

Z43784 Hs.78713 

AW964806 Hs.38085 

AI733306 Hs.128071 

M25629 Hs.123107 

NM_014796 Hs.33187 



ESTs, Weakly similar to Protein sequence 
solute carrier family 25 (mitochondrial 
ESTs, Weakly similar to putative glycine 
hypothetical protein FLJ21302 



NM_000102 Hs.1363 
AJ010063 
L47726 



411908 
415067 
437156 



KIAA0748 gene product 
gb:ae92b04.s1 Stratagene schizo brain S1 
cytochrome P450, subfamily XVII (steroid 
Hs.111110 titin-cap (telethonin) 
Hs.1870 phenylalanine hydroxylase 
Hs.84905 cytokeratin 20 
Hs.190146 ESTs 

Hs.73680 ESTs, Weakly similar to AF198455 1 epith 

Hs.28555 programmed cell death 9 

Hs.72924 cytidine deaminase 

Hs.929 myosin, heavy polypeptide 7, cardiac mus 

Hs.121194 Homo sapiens cDNA: FLJ21569 fis, clone C 

Hs.423 pancreatitis-associated protein 
427450 
432440 
426651 
414910 
423317 
424735 
439751 
452689 



438461 
446525 
453341 
403740 

430304 
421142 
444107 
419415 



X12662 
AJ272204 
U31875 



AW449258 
AL122071 
AW503944 



AB013885 
432938 T27013 
433447 U29195 
403047 

406707 S73840 



Hs.178121 KIAA0626 gene product 

Hs.171683 nuclear receptor subfamily 1, group H, m 

Hs.29679 cofactor required for Sp1 transcriptiona 

Hs.64616 chromosome 12 open reading frame 3 

Hs.152677 Homo sapiens cDNA FLJ20338 fis, clone HE 

Hs.50794 Homo sapiens mRNA full length insert cDN 

Hs.284176 transferrin 

Hs.170165 ESTs 

Hs.277053 ESTs 

Hs.61920 ESTs 

Hs.183075 ESTs, Highly similar to Ca2+ATPase off 

Hs.286049 phosphoserine aminotransferase 

Hs.211556 Homo sapiens cDNA: FLJ23378 fis, clone H 

Hs.296341 adenylyl cyclase-associated protein 2 

Hs.6187 ESTs 

Hs.238927 Homo sapiens mRNA; cDNA DKFZp434H1235 (f 

Hs.130822 ESTs 

Hs.10319 UDP glycosyltransferase 2 family, polype 

Hs.192036 ESTs 

Hs.126926 beta-ureidopropionase 

Hs.3132 steroidogenic acute regulatory protein 

Hs.3281 neuronal pentraxin II 



437776 AA768037 Hs.291671 ESTs 

415505 R39870 Hs.12548 ESTs 

444436 N25871 Hs.177337 ESTs 

409096 AA194412 Hs.50550 

432134 AI816782 Hs.122583 Homo sapiens cDNA: FLJ21934 fis, clone H 

437066 AA743570 Hs.200935 ESTs 

427003 U19487 Hs.2090 prostaglandin E receptor 2 (subtype EP2) 

423634 AW959908 Hs.1690 heparin-binding growth factor binding pr 

413333 M74028 Hs.75297 fibroblast growth factor 1 (acidic) 

420567 AK000812 Hs.98874 similar to proline-rich protein 48 

447145 AA761073 Hs.192943 ESTs 

452103 R42764 Hs.3248 mutS (E. coli) homolog 6 

410929 H47233 Hs.30643 ESTs 

400301 X03635 Hs.1657 estrogen receptor 1 

415702 F28877 gb:HSPD18414 HM3 Homo sapiens cDNA clone 

411396 C04646 Hs.85428 ESTs 

431706 AI816086 Hs.296341 adenylyl cyclase-associated protein 2 



CAT number Accession 

1012151_1 AW814118 AW814257 AW072376 

1040435_1 AW163377 AW160398 

1058667_1 AW195262 R27868 AW811262 

1073340_1 R56362 AW248096 R07152 R07285 

109017_1 R87388 R84328 AA058916 

112377_1 AA071059 AA085201 AA085020 

1138723_1 AW410200 AW409705 AW411433 BE296786 BE270309 

114833_1 AA078492 AA078333 AA077450 AA077746 AA076896 

1154206_1 AW499705 AW502537 AW503016 

1155179_1 AW500732 AW504061 

119128_1 AA083609 AA083790 AA112048 

1225682_1 AW809157 AW812181 AW812175 AW812172 AW8121S1 AW812155 

AW821291 AW821254 AW821287 AW82129C AW821285 AW821280 AW821259 



1275641 1 
1350637J 
1355323.1 



BE067271 BE067266 BE067286 BE067278 BE067299 BEC67285 
BE074165BE075001 BE075009 
BE168828 BE168830 BE168823 BE1 
BE246327 BE244704 
BE293546 BE249848 
BE257021 BE258316BE257099 



151328 1 AW963085AA1 59005 AW963073 

1539861.1 F12119Z45475 T64832 

1547874.1 F28877F35687 

171375.1 AW967420 AA210915 AA236991 AA210916 

172864 1 AA214253 AA214259 Z28472 Z28881 Z1 7828 

1759038.-2 R87580 
424334 
424648 
425515 



437728 
439894 
442197 
442614 
444475 
448427 



237907 1 
238221 1 
241947.1 
252721.1 
264901 1 
268382J 



301463 1 
322706 1 
34124.1 
35904.1 
38931.1 
43506.-2 
441520.1 



M313322 BE408282AA465612BE073382 

AW897265AW897274AL119504AW897275 AW397?70AA'8S7312AW897318AW897317AA317240AW961361T06241M326794AL138130 

AW407975AW999277 

BE29931 1 AA338954 AA338468 AW96O907 

M393460 AA338940 AW966277 AA41 9006 

M344576 AA732430 AA344601 

W26609W27360AA358818 

AW63129BM375484 

AA380285 AW934727 AW934914 

BE253469 BE176417BE176415 AA384133 

M453482 AF012388 

BE274360 

AA813214 AI936567 AI743529 AA448279 M994476 A1807452 A218180 AA972858 

AW968522 AA4851 12 M4851 62 AW968698 

AF135026AA583908 

M121579 AB005217 

T47232AF147365T47231 

BE513073 

M766719 AA767041 AW977440 
AA853077AA852175 

AW837912 AW837934 AA984475 AW997490 
AI269030A1204085AI004047 
C75571 AI150469T10778 



80484.1 
87667.1 
918391.1 
1221063.1 



1284685.1 
1290311.' 
1291969.' 
1333944 J 



1574395.1 
18391 13 2 
1842730.1 



AW752783 H38266AA001166 
AW072057AI225096AA018702 

BEQ80763 T96699 BE081 135 A190263Q H49182 A1904021 AI902697 
AW803340 AW803280 AW803275 AW80341 5 AW803343 AW803422 

AW862014 AW858740 AW858735 AW818542 AW858765 AW862027 AW858775 AW858771 AW858763 

AW836004 AW836087 AW836163 AWB361 B2 AWB36085 AW836084 AW836079 AW836083 AW836082 AW836086 AW836088 AW8361 66 AW836164 

AW836002AW836078AW836161AW862135AWB3B155AW336003 

AW837063 AW935882 AW935957 

AW854850AW854848 

BE064863 BE1 53698 AW856751 BE1 53820 BEC64737 BE153674 BE064730 BE065062 BE1 53536 AW856622 BE1 55079 BE064651 BE1 53665 
BE064650 BE064691 

AW947563 AW947543 AW947553 AW947549 AW947717 AW902859 AW902927 BE011D32 
AW939463 AW939484 AW939480 AW939459 AW939546 AW939593 AW93354B AW939595 AW939106 
AW947749 AW947746 AW947754 AW946636 AWS46674 

BE011320 BE006381 BE006361 BE011180 BE011328 BE011325 BE011157 BE006384BE006387BE006385BE011160 BE01 1319 BE01 1346 
BE006370 BE006386BE011173 BE0063B9 BE006376 BEC06375 BE006364 BE011321 BE006379 
BE064675 BE064761 BE064809 BE064673 BE034572 BE064674 
BE142384 BE142387 

BE179190 BE179206 BE179182 BE1791B5 BE179186 BE17B194 
Z42308 H23514 
W22206 W22498 W26922 



457416 
458764 



BE244587 AW938684 AW176490 AI940102 AW844995 AW938670 A1909850 AI909885 AI940079 AI909873 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) nu
human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Indicates DNA strand from which exons were predicted.
Indicates nucleotide positions of predicted exons.



400489 

400545 

400635 
400672 
400675 



401122 
401238 
401442 
401496 



s. "Dunham I. et al." refers to the publication entitled "The DNA sequence of



Ref 



9800107 
7228177 
8567750 
8118724 
8118750 



Nt_position 

131475-131652 

160750-161007 

124618-124881 

94097-94755 

102800-102932,107482-107689 

148067-148503 

11223-11815 

94288-94442 

143447-143851 

39310-39474 

140821-141050 



82790-83002 
33547-33649 
116659-116780 

168318-168444,172964-173647
401743
401767 
401810 
401840 
401886 
401919 



8134856 Minus 

2865207 Plus 

9958312 Plus 

7342191 Plus 

76B4597 Plus 

7229913 Minus 

9502466 Plus 

312B777 Plus 

8082020 Minus 



402158 
402165 
402194 






24712-25374 
82117-82920 
115475-115640 

156823-156921,157364-157554 
129063-129476 



7527774 
9690314 
9799769 



9801061 
9801137 
9864675 



402762 9230904 



402911 
402951 
402968 
403047 
403115 



7263904 Plus 



9838273 Plus 
9838287 Minus 
7630841 Minus 



403479 
403526 
403540 
403605 



403716 
403731 
403740 



404348 
404668 
404682 
404795 
404825 
404938 
405075 

405161 
405163 
405187 



7708819 
9838113 
7406725 
2276311 
7630858 
9797204 
9797231 
4826439 
6478944 
7381808 
7770506 
9438278 



7229826 Plus 



79215-79393 

67536-67666 

44863-45366 

19346-19480,20041-20119 

207002-207288 

122580-122987 

148298-148429,148566-148677 



70917-71191 

147901-148884 

655-951 

10751-10919,18817-19052,22131-22 
155331-155528 



8571-10061 

112942-113069,114303-114521 



18518-18656 
142689-142979 

73252-73329,73718-73877,76217-76299,78195-78401 



59793-59968 

142952-143094,145474-145653,
197548-197712 



100391-100652 

174311-174451,174587-174705,175523-175592 
108698-108830 

148369-148533,150678-150809 



91514-91718 



59027-69375 

85899-87122 

144000-144618 

86504-87227 

3297-3536 



134427-134593 
76707-76891 

83949-84214,84312-84415,84499-84677,84878-85114



11332-11546,12584-12718 
40977-41150 
147501-147780 
210382-210494 



124680-12 



17-157785 
161171-161299 

117025-117170,118567-118735 

32646-33138 

125904-126063 



15850-16061 

104275-104508,104755-104877 

66990-67484 

13856-14004 
406101 9124019

406118 9143818 

406150 9886026 

406168 7144874 

406343 9255974 

406357 9256093 

406563 7711604 



59331-59701 

62393-63016,65012-65578 

17284-17440,18489-18646,18917-19004,19384-19538 

77181-77415 

34401-34538 



TABLE 29A: 2286 GENES UP-REGULATED IN IDIOPATHIC PULMONARY FIBROSIS (IPF) COMPARED TO NORMAL BODY 

Table 29A lists about 2286 genes that are up-regulated in idiopathic pulmonary fibrosis samples as compared with normal "body map" samples. These were selected from about
59680 probesets on an Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" idiopathic pulmonary fibrosis expression level to "average" normal adult tissues sample
expression was greater than or equal to about 2.0. The "average" fibrosis sample expression level was set to the 90th percentile amongst idiopathic pulmonary fibrosis samples. The
"average" normal adult tissue level was set to the 95th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the
15th percentile value amongst non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
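The selection rule described above can be illustrated with a minimal sketch. It is not the procedure actually used to build the table: the function names, the linear-interpolation percentile convention, and the handling of a non-positive denominator are all assumptions introduced here for illustration, since the text does not specify them.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between closest ranks.

    The interpolation convention is an assumption; the text only names
    the percentiles (90th, 95th, 15th), not how they are computed.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def ipf_ratio(ipf_levels, normal_levels):
    """Background-corrected ratio of IPF to normal expression for one probeset.

    numerator   = 90th percentile of IPF samples    - 15th percentile of normal tissues
    denominator = 95th percentile of normal tissues - 15th percentile of normal tissues
    Returns None when the denominator is not positive (hypothetical
    handling; the text does not say what happens in that case).
    """
    background = percentile(normal_levels, 15)
    numerator = percentile(ipf_levels, 90) - background
    denominator = percentile(normal_levels, 95) - background
    if denominator <= 0:
        return None
    return numerator / denominator


def is_up_regulated(ipf_levels, normal_levels, cutoff=2.0):
    """True when the ratio is at least the cutoff (about 2.0 in the text)."""
    ratio = ipf_ratio(ipf_levels, normal_levels)
    return ratio is not None and ratio >= cutoff
```

For example, calling `is_up_regulated(ipf, normal)` with two lists of expression levels for a single probeset reports whether that probeset would pass the stated cutoff under these assumptions.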

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of IPF to normal body






Pkey 






Unigene Title 


R1 








surfactant, pulmonary-associated protein 






























advanced glycosylation end product-speci




















N-acylsphingosine amidohydrolase (acid c




































AJ245671 


Hs.12844 








M13509 


Hs.83169 






457200 






thyroid transcription factor 1 


44'g 


432519 






ESTs, Weakly similar to BCHUIA S-100 pro 


42.7 






































small inducible cytokine subfamily A (Cy










solute carrier family 6 (neurotransmitte 




















protocadherin 17






AW451709 


Hs.271200 








NM_014398


Hs.10887 


similar to lysosome-associated membrane




447033 


AI357412 


Hs.157601 




194 


445885 


AI734009 


Hs.127699 


KIAA1603 protein 


18.9 


411880 






gb:hm3M03.x1 NCI CGAP Thy4 Homo sapiens 


17.9 


432437 


W07088 


Hs.293685 


ESTs 


17.8 


424105 


AI142336 


Hs.43977 


Human DNA sequence from clone RP11-196N1 


17.3 




AK000106 


Hs.272227 


Homo sapiens cDNA FLJ20099 fis, clone CO


17.2 


440807 


AW269421 


Hs.128093


ESTs 


16.7 


424917 


AI636208 


Hs.96901 


hypothetical protein FLJ23049 


16.4 


433365 


AF026944 


Hs.293797 


ESTs 


16.4 


445279 


R41900 


Hs.22245 


ESTs 


16.4 


417801 


AA417383 


Hs.82582 


integrin, beta-like 1 (with EGF-like rep 


16.4 


405654 
449328 




Hs.197647 


ESTs 


ie!i 


449494 


AW237014 


Hs.315369 


Homo sapiens cDNA: FLJ23075 fis, clone L 


15.7 


408826 


AF216077 


Hs.48376 


Homo sapiens clone HB-2 mRNA sequence 


15.5 


417728 


AW138437 


Hs.24790 


KIAA1573 protein 


15.0 


440452 


AI925136 


Hs.55150 


ESTs, Weakly similar to CAYP_HUMAN CALCY




452039 


AI922988 




ESTs 




408771 


AW732573 


Hs.47584 


potassium voltage-gated channel, delayed




421464 


AA291553 


Hs.190086 


ESTs 


14.1 


421554 


AW137676 


Hs.97775 


ESTs 


13.8




AA521277 


Hs.124946


ESTs, Weakly similar to A46010 X-linked


13.2 


434424 


AI811202 


Hs.325335 


Homo sapiens cDNA: FLJ23523 fis, clone L


13.2 


431924 


AK000850 


Hs.272203 


Homo sapiens cDNA FLJ20843 fis, clone AD


12.9 


459702 
421110 


AJ250717 


Hs.1355 


cathepsin E 


12.6 


407638 


AJ404672 




hypothetical protein FLJ23571


12.6


423575 


C18863 




Homo sapiens cDNA FLJ11576 fis, clone HE




423244 


AL039379 


Hs.209602


ESTs, Weakly similar to ubiquitous TPRm 


12.2


427585 


D31152 


Hs.179729


collagen, type X, alpha 1 (Schmid metaph


12.1


436982 


AB018305 


Hs.5378 


spondin 1, (f-spondin) extracellular mat 


12.1 


451561 


N52812 


Hs.177403 


ESTs 


12.0 


424086 


AI351010 


Hs.102267 


lysyl oxidase 


12.0 


435299 


AI745458 


Hs.122614 


ESTs, Weakly similar to T20593 hypothec" 


12.0 


429496 


AA453800 


Hs.192793


ESTs 


11.9 
432365 AK001106 Hs.274419 hypothetical protein FLJ10244



436061 AI248584 

431385 BE178535 

421470 R27496 

440209 H05049 

428927 AA441837 

448693 AW004854 

424717 H03754 Hs.152213 

416402 NM_000715

44699B N99013 

442176 AA983764 



,190745 Homo sapiens cDNA: FLJ21326 fis, clone C



Hs.1378 
Hs.22269 
Hs.90250



i W79123 

452883 X80031 

417015 M83772 

422022 M302420 

447724 AW298375 

406671 M129547 

458194 AW383618 

446232 AI281848 

416208 AW291168 

453382 AA709285 

412372 R65998 

426830 AA385751 

407568 AA740964 

414259 W44633 

400289 X07820 

422426 W79117 
406747 AI925153 
450050 AI681268
431337 N48107
408427 AW194270
447048 AW393080 
453636 R67837 
443450 N66045 
418735 N48769 
421160 AL080215 
449802 AW901804 
441233 AA972965 
469587 AA031956 
436246 AW450963 
445189 AI936450 
410781 AI375672 
446868 AV660737 
415817 U88967 
425664 AJ006276 
414968 C16096 
410334 AW979261 
442510 AF150179 
409238 AL049990 
431089 BE041395 
444929 AI6B5B41 
413802 AW964490 
444218 AF070641 
412719 AW016610 
453445 AL036532 
419261 X07876 
451110 AI955040 
433815 AI696602 
432203 AA305746 
451103 R52804 
425921 NM_007231
424989 AA985520 
433231 AB040926 
408217 AI433201 
431B06 AF186114 
436751 AA732217 
452891 N75582 
41304B M93221 
426803 AA362568 
409718 D86640 
423424 AF150241 
429597 NM_003816
421013 M62397 
437479 R61866 
416778 M16505 
421478 AI6B3243 
444396 T65213 
423629 AW021173 
450715 AI266484 



s.1012 
Hs.16762 
HS.12B910 
Hs.58561 
Hs.530 
Hs.80876 
Hs.200442 
Hs.24477 



hypothetical protein FLJ23537
wingless-type MMTV integration site fami
complement component 4-binding protein, 
Homo sapiens mRNA; cDNA DKFZp564B2062 (f 



collagen, type IV, alpha 3 (Goodpasture



Hs.265459 ESTs, Moderately similar to ALU2_HUMAN A
Hs.194691 
Hs.41295 
Hs.5997 
Hs.285243 
Hs.196379 
Hs.62699 
Hs.301296 
Hs.2258 



ESTs, Weakly similar to MUC2_HUMAN MUCIN

hypothetical protein FLJ13078

hypothetical protein FLJ22029 

ESTs, Weakly similar to putative p150 [H 

ESTs 

Homo sapiens cDNA: FLJ23131 fis, clone L



Hs.217493 annexin A2
Hs.257883 ESTs 
Hs.292593 ESTs 
Hs.177236 
Hs.228320 



hypothetical protein FLJ23537



Hs.133529 ESTs 
Hs.44609 ESTs 



gb:2k15e04.s1 Soares_pregnant_uterus_NbH



Hs.135100 
HS.78B57 
Hs.159003 



protein tyrosine phosphatase, receptor-1 



Hs.161354 
Hs.32241 
Hs.10684 
Hs.129911
Hs.91453
Hs.89791
Hs.265398 



Hs.23575
Hs.143552
Hs.279860 
Hs.270737 
Hs.294054 
Hs.212875 
Hs.75182 



KIAA1493 protein 

tumor protein, translationally-controlle
tumor necrosis factor (ligand) superfami
ESTs 

ESTs, Weakly similar to DYH9_HUMAN CILIA 
mannose receptor, C type 1
ecotropic viral integration site 5 
src homology three (SH3) an 




Homo sapiens cDNA: FLJ21909 fis, clone H 
ESTs, Weakly similar to KIAA1324 protein
ESTs, Weakly similar to I38022 hypotheti
arrestin 3, retinal (X-arrestin) 
449108 AI140BB3

422798 R92347 

416030 H15261 

426486 BE178285 

424906 AI566086 

448206 BE622585 

432133 AB033088 

447112 H17800 

446917 AI347863 

428227 AA321649 

431761 AW002846 

428743 AL080060 

413499 BE144BB4 

423909 AJ223183 

438122 AI620270 

449611 AI970394 

453616 NM_003462 

410060 NM001448 

442353 BE379594 

452571 W31518 

453736 AL118674

409203 AA780473 
405494 

442832 AW205560 

420193 AI460080 

434217 AW014795 

427356 AW023482 

436396 AI683487 

408308 AL033377 

442377 AA993807 

441143 AI027604 

445122 AW241632 

431353 AA828032 

426753 T89832 

445186 AW614544 

451963 AI825440 

400298 AA032279 

433426 H69125 

434377 AW137148 

415236 R41400 

409031 AA376B36 

427558 D49493 

437269 AI377755 

421952 AA300900 

447081 Y13B95 

430099 AW194988 

422475 AL359938 

413472 BE242870 

424750 D29956 
403574 

439759 AL359055 

415025 AW207091 

448104 AI674818 

444271 AW452569 

437157 BE048860 

444050 AW13B295 

414569 AF109298 

447505 AL049266 

424433 H04607 

407378 AA299264 

445424 AB028945 

422544 AB018259 

449765 N92293 

413930 M86153 

417246 AI76009B 

453652 AW009640 

411514 AW850178 

438909 AF0B5839 

446002 AI346468 

419236 AA330447 

419150 T29618 

424202 BE350295 

431723 AW058350 

415511 AI732617 

430510 AW162916 
416879 H98899 
432B03 AA56539B 
442862 BE080429 
435974 U29690 
441 0B2 AW444804 
404599 

453931 AL121278 



Hs.34574 
Hs.21948 
Hs.170056 
Hs.153716 



Homo sapiens mRNA; cDNA DKFZp535B0220 (f 
Homo sapiens mRNA for Hmob33 protein, 3' 
ESTs, Moderately similar to 138022 hypot 



Hs.7154 ESTs 



Hs.156672 
Hs.2248 
Hs.105468 
Hs.301549 

Hs.135194 
Hs.129837
Hs.197075 
Hs.33846 
Hs.58367 
Hs.49136 
Hs.34565 
HS.34B71 
Hs.687 



Hs.253559 ESTs 



ESTs 

small inducible cytokine subfamily B (Cy 
hypothetical protein FLJ22690 
Homo sapiens mRNA; cDNA DKFZp564H172 (fr 
gb:CM0-HT0182-041099.065-e11 HT0132 Homo
immunoglobulin superfamily, member 6 
ESTs, Weakly similar to Z263_HUMAN ZINC



ESTs, Moderately similar to ALU7_HUMAN A



epithelial antigen of 



Hs.152213 wingless-type MMTV integration site fami 

Hs.44197 hypothetical protein DKFZp564D0462 

Hs.167367 ESTs

Hs.159650 ESTs 

Hs.147377 hypothetical protein FLJ23598 

Hs.189076 ESTs 

gb:Human trophoblast hypoxia-regulated f

Hs.170278 ESTs 

Hs.123641 protein 

Hs.224952 ESTs 
Hs.61635 

Hs.133525 ESTs

Hs.306593 Homo sapiens cDNA FLJ113B2 fis, clone HE 

gb:yf94b12.s1 Soares infant brain 1NIB H 

Hs.76728 ESTs 

Hs.2171 growth differentiation factor 10 ' ( 

Hs.120695 ESTs

Hs.98849 ESTs, Moderately similar to AF161511 1 H

Hs.17287 potassium inwardly-rectifying channel, s

Hs.20537 hypothetical protein FLJ13942

Hs.117313 Meis (mouse) homolog 3

Hs.75379 solute carrier family 1 (glial high affi

Hs.152818 ubiquitin specific protease 8

Hs.67709 Homo sapiens mRNA full length insert cDN 

Hs.72307 ESTs 

Hs.315433 Homo sapiens cDNA FLJ11375 fis, clone HE 

Hs.149804 ESTs

Hs.120655 ESTs

Hs.135024 ESTs 

Hs.118258 prostate cancer associated protein 1 

Hs.18724 Homo sapiens mRNA; cDNA DKFZp564F093 (fr 

Hs.9218 ESTs 

Hs.57776 ESTs, Moderately similar to I38022 hypot 

Hs.12696 cortactin SH3 domain-binding protein 

Hs.118140 KIAA0716 gene product

Hs.206832 ESTs, Moderately similar to ALU3_HUMAN A

Hs.75618 RAB11A, member RAS oncogene family

Hs.21411 ESTs 

Hs.28368 ESTs, Moderately similar to S65657 alpha 

gb:IL3-CT0219-271099-022-H12 CT0219 Homo
gb:Homo sapiens full length insert cDNA 



Hs.145789 
Hs.135159 
Hs.89640 
Hs.15032
Hs.16762
Hs.182362
Hs.241576 
Hs.42599 



no sapiens cDNA FLU 1481 fis, etc 



TQKivr 
RAN binding protein 17 

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 
ESTs 

hypothetical protein PR02577 
ESTs 

gb:nk41f01.s1 NCI_CGAP_GC2 Homo sapiens
ESTs 

Homo sapiens beta-1 adrenergic receptor 



Hs.25144 ESTs 



415457 AW081710 

424693 BE169810 

419172 AW338625 

413384 NM_000401 

453037 M045175 

444042 NM_004915

431169 AW971240 

422352 AA766296 

433527 AW235613 

420077 AW512260 

429703 T93154 

433098 AW190593 

451099 R52795 

449416 AI651016

459023 AW968226 

450584 AA040403 

427660 AI741320 

429125 AA446B54 

450025 AK001875 

433479 AW511459 

443113 AI040686 

430414 AW365665 

419752 AA249573 

435420 AI928513 
404915 

424310 M338648 

448253 H25899 

430899 BE018217 

446967 AI699629 

435082 AA664273 

438842 AA827176 

437260 M747807 

410934 AW811114 
428043 T92248 

408045 AW138959 
450568 AL050078 



444805 AB007899 

446921 AB012113 

429784 M89796 

433225 AW816515 

416575 W02414 



430153 AW968128 



Hs.22120 ESTs 

Hs.75334 exostoses (multiple) 2

Hs.177552 ESTs 

Hs.10237 ATP-binding cassette, sub-family G (WHIT 

gb:EST383329 MAGE resequences, MAGL Homo 
Hs.99200 ESTs 



(in 13 receptor, alpha 2 

Hs.60798 ESTs 

Hs.60371 ESTs 

Hs.114121 Homo sapiens cDNA: FLJ23228 fis, clone C

Hs.271004 ESTs, Weakly similar to I38022 hypotheti

Hs.24321 Homo sapiens cDNA FLJ12028 fis, clone HE

Hs.249972 ESTs 

Hs.132908 ESTs

Hs.120388 ESTs 

Hs.152618 ESTs, Moderately similar to ZN91_HUMAN Z

Hs.59203 ESTs 



Hs.50334 

Hs.201591 ESTs 
Hs.1 8352B hypothetical protein FLJ1 4906 
Hs.156781 ESTs

Hs.186104 Homo sapiens cDNA FLJ13803 fis, clone TH
Hs.124316 ESTs
Hs.149500 ESTs

gb:MR2-ST0131-111199-016 a04 STG131 Horn, 
uteroglobin 



mscDNAFU10784fis,clo 



Hs.245123 ESTs 
Hs.25159 



Hs.1 10376 ESTs 

Hs.12017 homolog of yeast ubiquitin-protein ligas
Hs.1 ( mallinducil /lc ubfamiiyA (C 



2 NT 



434088 AF116677 

432113 AA935065 

446608 N75217 

419945 AW290975 

454024 AA993527 

420209 AA256444 

4393B2 BE247684 

428895 AA437124 

446577 AB040933 

419247 S65791 

427778 AA412323 

437138 AI935622 

431322 AW970622 

430437 AI768801 

435202 AI971313 

415076 NM_000857 



Hs.330778 ESTs 
Hs.30343 ESTs 
Hs.336679 ESTs 
Hs.300648 ESTs 
Hs.135223 ESTs

Hs.202289 hypothetical protein DKFZp434P1735 
Hs.11217 KIAA0877 protein 

gb:RC0-ST0174-191099-031-a07 ST0174 He 
cai protein PR01966 



Hs.293907 
HS.1254B5 
Hs.103070
Hs.187247
Hs.15420
Hs.89764
Hs.105323
Hs.271245 ESTs 



gb:EST382704 MAGE resequences, MAGK Homo 
Homo sapiens cDNA FLJ13569 fis, clone PL
KIAA0551 protein 

• ' Ie,beta3 



426269 H15302 Hs.168950 Homo sapiens mRNA; cDNA DKFZp556A1046 (f

416769 AI339257 Hs.115436 ESTs, Moderately similar to I54374 gene

414299 AA142989 Hs.71730 ESTs 

420362 U79734 Hs.97206 huntingtin interacting protein 1 

459664 

425509 AF079363 Hs.158213 sperm associated antigen 6 
401497 

440727 AI073991 Hs.134268 ESTs, Weakly similar to 
428434 AW363590

408776 AA057365 

419991 AJ000098 

451050 AW937420 

400297 AI127076 
404957 

452771 T05477 

438885 AI8B6558 

428244 AI5B4123 

420481 U50525 

455047 AW852530 

408729 AA195764

457100 AA417878 

426342 AF093419 

417154 AI674701 

411869 W20027 

427043 AA397679 

445635 AI769774 

442973 BE567665 

422063 BE156476

448299 AA497044 

408677 AI279892 
404097 

437536 AA764781 

452822 X856B9 

410733 D84284 

439140 W85737 

407366 AF026942 



411149 N68715 

432441 AW292425 

419807 R77402 

440615 AI733055 



Hs.65551 Homo sapiens, Similar to DNA segment, Ch

Hs.B3356 ESTs, Weakly similar to I38022 hypotheti

Hs.94210 eyes absent (Drosophila) homolog 1

Hs.69652 ESTs 

Hs.334473 hypothetical protein DKFZp56401 278 



Hs.184987
Hs.42500 
Hs.98201 

Hs.72639 

Hs.48401 

Hs.169378 

Hs.21388 

Hs.23439 

Hs.3991 

Hs.209831 

Hs.288550 



ADP-ribosylation factor-like 5 

Human BRCA2 region, mRNA sequence CG029 

gb:PM1-CT0243-071099-001-g06 CT0243 Homo

ESTs 

ESTs, Moderately similar to ALU8_HUMAN A



ESTs, Weakly similar to ALLH.HUMAN ALU S 
Homo sapiens cDNA: FLJ23156 fis, clone L
gb:QV0-HT0368 040100-082-C05 HT0368 Homo 
protein FLJ10392 



Hs.291844 
Hs.288617 
Hs.66052 
Hs.290830 



gb:Homo sapiens cig33 mRNA, partial sequ 
gb:Homo sapiens mRNA; cDNA DKFZp586H0718



Hs.28792 Homosa 



Hs.130806 
Hs.115740
Hs.34550 
Hs.148135 



gb:>i75f11.s1 Soares placenta Nb2HP Homo 
ESTs 

KIAA0210 gene product



v;cDNADKFZp761G2123(f 



428780 AI478578 



424638 AI472106 

451497 H83294 

427652 AI673025 

458722 AA741545 

407327 AA487182 

41 101 B AW813428 

415261 T40928 

453543 AA485425 

438014 N71183 

407829 AA045084 

441006 AW605257 

412222 AA528283 

424115 AA335497 

453197 AI916269 

439398 AA284267 

436397 AA715013 
427535 R29543 
410901 AW810001 
425916 NM_006786
447020 T27308 
427457 AW779105 
451620 AW449888 
408938 AA059013 



417181 L10123 
435347 AW014873 
425458 H89317 



1 0.x5 NCI_CGAP_Pr3 Homo sapiens 

Hs.105039 solute carrier family 34 (sodium phospha

Hs.27252 ESTs 

Hs.50636 ESTs 

Hs.143659 ESTs 

Hs.125300 ring finger protein 21, interferon-respo 

Hs.49303 Homo sapiens cDNA FLJ11663 fis, clone HE 

Hs.284122 Wnt inhibitory factor-1 

Hs.43874 ESTs, Moderately similar to I54374 gene 

Hs.232832 ESTs, Weakly similar to 1 24961 hypothec

Hs.259414 ESTs, Weakly similar to Z195_HUMAN ZINC

gb:MR3-ST01 92-01 0200-21 0-c05 ST0I92 Homo 

Hs/t8919 Homo sapiens cDNA FLJ11508 fis, clone HE

Hs.121806 Homo sapiens cDNA FLJ11971 fis, clone HE

Hs.29725 hypothetical protein FLJ13197

Hs.7627 CGI-60 protein 

Hs.292737 ESTs 
Hs.293965 
Hs.109057 
Hs.221504 



Hs.2164 

Hs.162200
Hs.16986
Hs.164682 
Hs.257224 
Hs.22607 
Hs.52792 
Hs.149770 
Hs.239681 
Hs.255938 
Hs.1071 



Homo sapiens mRNA; cDNA DKFZp58B!1823 (f 
Homo sapiens cDNA FLJ13658 fis, clone PL 
hypothetical protein FLJ20275
ESTs, Moderately similar to KIAA1200 pro 
surfactant protein A binding protein 
436594 AI419982

421237 U25029 

432731 R31178 

419750 AL079741 
426320 W47595 

419751 AW1955B1 
433515 AA595800 
451381 BE241831 
452784 BE463857 
438297 AW51519S 
406992 S82472 
431291 N25521 



Hs.183114 
Hs.169300 
Hs.93121 
Hs.190246 
Hs.172330 
Hs.151258 
Hs.258238 



447700 AI420183 
423735 AA330259 
424144 AA454033 
416258 N45551 
410011 AB020641 
454359 N71277 
422977 AA63149B 
433485 AI493076 
450192 AA263143 
432015 AL157504 
407266 AJ235664 
409041 AB033025 
434265 M846811 
452526 W38537 
403271 

450656 M010539 
446096 AI276454 
454036 AA374756 
437960 AI669586 
440862 H39048 
410615 AW772721 
413583 AL120806 
419449 H18417 
442324 R63578 
453080 AI423056 
435747 AI079519 
446509 AF169693 
448030 N30714 
414998 NM_002543
448089 AI467945 
434367 AB020700 
434757 AI038997 
413453 AA129640 
454438 AA224053 
458154 AW816379 
430417 AA461045 
434819 AA650099 
438796 W67821 
415451 H19415 
420931 AF044197 
414812 X72755 
451895 T93573 
435434 AA680387 
449623 C00719 
433563 AI732637 
444649 AW207523 
441594 AL041080 
443314 AW771701 
400292 M250737 
427972 AA864870 
446932 AA961459 
445640 AW969626 
452393 HB7398 
443204 AW205878 
400608 

411156 AW819939 
435772 AA700019 
439830 AA846666 
455511 BE144762 
443257 AI334040 
436033 H75391 
420214 AI146375 
410519 AW612264 



447863 AL047611 



Hs.279575 
Hs.131905 
Hs.171077 

Hs.41644 
Hs.90011 
Hs.57856 



Hs.201967 
Hs.24596 
Hs.159115 

Hs.50081 
Hs.130554 
Hs.280740 



Human glucocorticoid receptor alpha mRNA 
fibronectin 1 

Homo sapiens cDNA FLJ14236 fis, clone NT
transforming growth factor, beta 2
KIAA0761 protein 
ESTs 

hypothetical protein MGC2705 

hypothetical protein FLJ21062 

ESTs, Moderately similar to ALU1_HUMAN A

gb:beta -pol=DNA polymerase beta (exon a 

Kruppel-type zinc finger protein

ESTs 

ESTs, Weakly similar to I38022 hypothet
Homo sapiens G-protein coupled receptor
ESTs, Moderately similar to Z195_HUMAN Z
ESTs, Weakly similar to T21259 hypotheti
gb:EST33963 Embryo, 12 week II Homo sapi
AKAP-associated sperm protein 
adenylosuccinate synthase 
PCTAIRE protein kinase 1
gb:za36e03.s1 Soares fetal liver spleen
gb:np83h04.s1 NCI_CGAP_Thy1 Homo sapiens
aldo-keto reductase family 1, member C2
RAD51-interacting protein
Homo sapiens mRNA; cDNA DKFZp586O0724 (f
gb:Homo sapiens mRNA for immunoglobulin
KIAA1199 protein

Homo sapiens cDNA: FLJ23089 fis, clone L
1MGC3040 



Hs.93560 
Hs.222194 
Hs.127432 



Hs.23921 

Hs.134398 

Hs.132892

Hs.325960

Hs.77729

Hs.173696 

Hs.3830 

Hs.132921 

Hs.128065

Hs.172405



ESTs 
ESTs 

gb:hl95c01.x1 NCI_CGAP_Thy8 Homos; 
ESTs 

Homo sapiens cDNA FLJ1429<1 fis, clone 
ESTs 

hypothetical protein DKFZp547A023 
ESTs 

protocadherin 20

membrane-spanning 4-domains, subfami
oxidised low density lipoprotein (lectin 
ESTs 

KIAA0893 protein 

ESTs 

ESTs 

cell division cycle 27 



Hs.50701 ESTs 

Hs.291541 ESTs, Weakly similar to ALUB_HUMAN !!!!

Hs.109590 genethonin 1

HS.26B720 ESTs.ModeratelysimilartoALULHUMAN A 

Hs.100431 small inducible cytokine B subfamily (Cy

Hs.77367 monokine induced by gamma interferon 

Hs.16970 ESTs 

Hs.187850 ESTs 

Hs.120440 EST 

Hs.277901 ESTs 

Hs.19762B ESTs 

Hs.208765 ESTs, Moderately similar to ALU7„HUMAN A 

Hs.54646 ESTs 

Hs.72472 ESTs 

Hs.181304 putative gene product 

Hs.125644 ESTs

Hs.31704 ESTs, Weakly similar to KIAA0227 (H.sapi

Hs.99858 ribosomal protein L7a

Hs.29643 Homo sapiens cDNA FLJ13103 fis, clone NT

Hs.273629 ESTs

Hs.132992 ATP-binding cassette, sub-family G (WHIT

Hs.151489 ESTs, Weakly similar to XE7_HUMAN PROTEI

gb:CM0-HT01 80-041 099-055-b04 HT0180 Homo 

Hs.11614 HSPC065 protein 

Hs.255748 ESTs 

Hs.286073 ESTs, Moderately similar to ALU5_HUMAN A

Hs.131705 ESTs 

Hs.273294 hypothetical protein FLJ20069

Hs.159388 Homo sapiens clone 23570 mRNA sequence 

HS.28BB85 Homo sapiens cDNA FLJ14246 fis, clone OV
422429 AA310527

434577 AW444575 
403310 

451B30 H18433 

422222 AI699372 

435S27 W88774 

436461 AW511956 

452166 AI948607 

413998 AW103807 



407910 AA650274 
428042 AA419529 
424641 AB001106 
417412 X16896 
409629 AW449589 
458771 AW295151 
415929 M724373 
436645 AW023424 
426083 AW962712 
445268 AI218358 
429629 BE501732 
431917 D16181 
443348 AW873596 
443151 AI827193 
419255 AA235672 
452291 AF015592 
452551 AI692181 
421106 AA877124 
42426B AA397653 
425211 M18667 
421515 Y11339 
423045 AW967472 
428771 AB028992 
445745 AB007924 
417009 AA191719 
436517 BE080932 
425905 AB032959 
414083 AL121282 
45272G AI915676 
409920 BE169746 
441802 AA968636 
431956 AK002032 
413875 BE176776 
444009 AI380792 
410785 AW803341 
418882 NM_004996
43B993 AA828995 
435256 AF193766 
428104 AA421350 
439648 AW780192 
436194 AK001074 
446364 AB006624 
452744 AI267652 



gb:EST181333 Jurkat T-cells V Homo sapie



hypothetical protein DKFZp434A171 




Hs.41296 
Hs.76391 
Hs.151413 
Hs.82112 
Hs.279724 
Hs.163612 
Hs.49344 
Hs.156520 
Hs.126712 
Hs.175048
Hs.30622
Hs.2868
Hs.182278
Hs.132714
Hs.87491
Hs.28853
Hs.49169
Hs.172844
Hs.307438
Hs.1867



hypothetical protein FLJ11006
ESTs

ESTs, Weakly similar to AF191020 1 E2IG5
ESTs

Homo sapiens cDNA FLJ13010 fis, clone NT

peripheral myelin protein 2 

calmodulin 2 (phosphorylase kinase, delt 



Human DNA sequence from clone 495010 on
progastricsin (pepsinogen C)
GalNAc alpha-2, 6-sialyltransferase I, i
PCTAIRE protein kinase 2
KIAA1069 protein 

gene product 



Hs.183302
Hs.193143
Hs.13245
Hs.314714
Hs.135225

Hs.318584 novel C3HC4 type Zinc finger (ring finge
Hs.257786 



ESTs 
ESTs 



ESTs 



Hs.12504 likely ortholog of mouse Arkadia 
H5.127E77 ESTs 

Hs.272245 Homo sapiens cDNA FLJ11170 fis, clone PL

gb:RC3-HT0505-1 10300-01 1-g09 HT0586 Homo 

Hs.135104 ESTs 

gb:IL2-UM0079-09030(W)50-D03 UM0079 Homo 

Hs.89433 ATP-binding cassette, sub-family C (CFTR 

gb:od77b08.s1 NCI_CGAP_Ov2 Homo sapiens

Hs.13872 cytokine-like protein C17

Hs.191604 ESTs 

Hs.267596 ESTs 

Hs.333435 Homo sapiens cDNA FLJ10212 fis, clone HE
Hs.14912 KIAA0286 protein

Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E052 (fr



408369 R38438 Hs.182575 solute carrier family 15 (H+/peptide tra



AL046134 
AW291775 
AI375572 



409928 / 

410292 / 

415811 ; 

420218 I 

426625 " 

417708 I 

451024 i 

411745 , 



Hs.13944 
Hs.213793 
Hs.172634 
Hs.128245 
Hs.7473 
Hs.57549 

Hs.172963
Hs.286 
Hs.300642 
Hs.50495 



AA862231 
I AI734258 
I AK000725 



450297 AW901347 
421002 AF116030
445414 AV653692 



adrenergic, beta, receptor kinase 2 



hypothetical protein dJ473B4 
ESTs 

hypothetical protein FLJ14624



serologically defined colon cancer antig 



Hs.334443 
Hs.245367 
Hs.50579 



i • n UJ1_HUMAN Al 

hypothetical protein FLJ20718 
gb:an03c03.x1 Stratagene scluo brain S1 
peptidylprolyl isomerase B (cyclophilin



Hs.38592 hypothetical protein FLJ23342



447207 


AA442233 


Hs.17731 


hypothetical protein FLJ12892 


4.0


451353 


N21043 




ESTs 


4.0 


437075 


M743748 


Hs.40758 


ESTs 


3.9 


410505 


AW752139 


Hs.314323 


ESTs 


3.9 


449746 


AI668594 


Hs.17658B 


ESTs, Weakly similar to CP4Y_HUMAN CYTOC


3.9 




AA868729 




ESTs 


3.9 




N59294 




nucleosome assembly protein 1-like 1 


3.9 


436298 


AW293496 


Hs.18013B 


ESTs 


3.9 


417718 


T86540 


Hs.193981 


ESTs 


3.9 


436772 


AW975688 


Hs.74170 


metallothionein 1E (functional)


3.9 


401045 








3.9 


408767 


AA057279 


Hs.211928 


ESTs 


3.9 


407303 


AA016296 


Hs.165200


ESTs, Weakly similar to A56194 thromboxa


3.9 


432583 


AW023624 


Hs.162282 


potassium channel TASK-4; potassium chan 


3.9 


451623 


H77818 




ESTs 


3.9 


450063 


AI681509 


Hs.277133 


ESTs 


3.9 


416734 


H81213 


Hs.14825 


ESTs, Weakly similar to KIAA1503 protein


3.9 


419276 


BE165909 


Hs.306881 


MSTP043 protein 


3.9 


433132 


AB026264 


Hs.284245 


hypothetical protein IMPACT 


3.9 




AI754308 


Hs.159452 


ESTs 


3.9 


422667 


H25642 


Hs.133471 


ESTs 


3.9 


443486 


NM_003428


Hs.9450


zinc finger protein 84 (HPF2)


3.9 


458219 


H22195 


Hs.31874 


ESTs 


3.9 


443613 


AI079356 




gb:oz39b09.s1 Soares_NhHMPu_S1 Homo sapi


3.9 


439810 


AL109710 


Hs.85568 


EST 


3.9 


436578 


AI091435 


Hs.134859




3.9 


415598 


AI433155 


Hs.9856 


ESTs 


3.9 






Hs.126059


ESTs 


3.9 


454111 


AW081681 


Hs.269064 


ESTs, Weakly similar to T42689 hypotheti


3.9 


409719 


AI769160 


Hs.108681


Homo sapiens brain tumor associated prot


3.9 


452466 


N84635 


Hs.29664 


hypothetical protein DKFZp564B052 


3.9




NM0122BB 


Hs.153954


TRAM-like protein


3.9


435823 


R07856 


Hs.16355


ESTs


3.9


11 0533 


AI1406B6 


Hs.263320 


ESTs 


3.9 


429334 


D63078 


Hs.186180 


Homo sapiens cDNA: FLJ23038 fis, clone L


3.9 




M045648 


Hs.301957


nudix (nucleoside diphosphate linked moi 


3.9 


430039 


BE253012 


Hs.153400


ESTs, Weakly similar to ALU1_HUMAN ALU S


3.9 


417461 


R38403 


Hs.13305


ESTs 


3.9 


424051 


AL110203


Hs.138411


Homo sapiens mRNA; cDNA DKFZp536J1922 (f
ESTs 


3.8 


419140 


AI982647 


Hs.215725 


3.8 




T79213 


Hs.272073 


ESTs 


3.8 


430140 


AW296771 


Hs.221999 


ESTs 


3.8 


446896 


T15767 


Hs.22452 


Homo sapiens mRNA for KIAA1737 protein, 
hi i tin era boxylase 


3.8 


422165 


AL041199 


Hs.1481 


3.8 


417706 


T90797 


Hs.268623 


ESTs 


3.8 




AI631874 


Hs.155140 


casein kinase 2, alpha 1 polypeptide 


3.8 


450522 


AI698839 




gb:wd31f02.xt Soares_NFL_T_GBC_S1 Homo s 


3.8 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin,


3.8 




R72032 


Hs.29235 


ESTs 


3.8 


414700 


H63202 


Hs.38163 


ESTs 


3.8 


440899 


AW449445 




DKFZP4341092 protein 


3.8 


439335 


AA742597 


Hs.62492 


ESTs, Weakly similar to B39066 proline-r


3.8 


40B625 


AW243323 


Hs.266785 


ESTs 


3.8 


421987 




Hs.286131 


CGI-101 protein 


3.8 




AI474778 


Hs.118977




3.8 


410224 


M55513 


Hs.150208 


potassium voltage-gated channel, shaker- 


3.8 


429846 


AB023021 


Hs.225945 


fucosyltransferase 9 (alpha (1,3) fucosy 


3.8 






Hs.269805 


ESTs 


3.8 




BE221825 


Hs.97691 


ESTs 


3.8 


407942 


AA37860B 


Hs.5894 


hypothetical protein FLJ10305


3.8 




AA742577 


Hs.303781 


EST 


3.8 




AA417812 


Hs.38775 


ESTs 


3.7 




AI745185 


Hs.8939 


yes-associated protein 65 kDa 
tolloid-likel 


3.7


449579 


AI823951 


Hs.129700 


3.7 




NM_013240


Hs.163846


putative N6-DNA-methyltransferase


3.7 


458663 


AV658444 


Hs.280776 


tankyrase, TRF1-interacting ankyrin-rela


3.7 


456443 


AW967500 


Hs.133543 


ESTs 


3.7 


439957 


AI453184 


Hs.66357 




3.7 




AA151520 


Hs.334822 


hypothetical protein MGC44B5 


3.7 




AL0499B0 


Hs.1 B4216 


DKFZP564C152 protein 


3.7 




AWB60972 




gb:QVO-CT0387-180300-167-h07 CT0387 Homo


3.7 


418379 




Hs.137516






419720 


AA249131 


Hs.337778 


hypothetical protein FLJ11068 


3.7


443584 


AI807036 


Hs.287245 


hypothetical protein FLJ14B03


3.7 




AW975861 


Hs.47367 


KIAA1785 protein
ESTs 




417235 


AA810278 


Hs.24250 


3.7


441720 


AI346487 


Hs.28739 


ESTs 


3.7 


451421 


W16522 




Homo sapiens cDNA FLJ13539 fis, clone PL
endothelin receptor type B 


3.7 


417355 


D13168 


Hs.82002


3.7 


449321 


AA001150 


Hs.132937 


ESTs 


3.7 


424806 


AA382523 


Hs.105689 




3.7 


452338


AW608920


Hs.29159 


zinc finger protein 75 (D8C6) 


3.7 


409248


AB033035 


Hs.51965 


KIAA1209 protein


3.7 
421037 AI684808

427088 AA398085 

420637 AW976153 

420026 AI831190 

429419 AB023226 

447410 AI470235 
404274 

416320 H47867 

412642 BE244598

431716 D89053 

446025 AW305075 

450458 AA009926 

423099 NM_002837

438257 AW474419 

440887 AI799488 

454693 AW813428 

432189 AA527941 



Hs.166676 
Hs.202276 
Hs.172698 



Hs.809 
Hs.180948



448776 BE302464 

452293 AI871833 

428330 L22524 

443258 AI800271 

429208 AA447990 

458429 AV646559 
404476 



Hs.301152 
Hs.88594 
Hs.217081 
Hs.30057 
Hs.304609 
Hs.2256 
Hs.129445 
Hs.190478 
Hs.12346 



gb:EST388262 MAGE resequences, MAGN Homo
ESTs 

KIAA1009 protein 



hepatocyte growth factor (hepapoietin A;
fatty-acid-Coenzyme A ligase, long-chain 
KIAA0729 protein 

gb:zi07e05.r1 Soares_fetal_liver_spleen_ 
protein tyrosine phosphatase, receptor t 



gb:MR3-ST0192-010200-210-c05 ST0192 Homo 
gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 
Homo sapiens mRNA; cDNA DKFZp434F053 (tr 
ESTs 
ESTs 

MRS2 (S. cerevisiae)-like, magnesium hom



mamx meiauupruraiidse . u..<».i.j=.". 
hypothetical protein FLJ12496
ESTs 

Homo sapiens cDNA: FLJ21399 fis, clone C
438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl



437918

432408

437641

439635

446102

418384

425403

432030
446453 



432779 
440886 

449424
418076 



AW448937 

R61388 

AW449679 



438016 AI949638 

455201 AW947884

433293 AF007835 

455536 AW135986 

428679 AA431765 

414400 X06948 

435344 AA700326

445056 AB014530 

449444 AW818436 

442652 AI005163 

423121 AW864848

449540 AA001713

425734 AF056209 

428409 AW117207 

431087 H12723 

426920 AA393351 

427687 AW003867



451531 AA018311 

457620 AA602711 

410658 AW105231 

427865 AA416931 

453390 AA862496 

419983 W55956 

454600 AW810001 

427718 AI798680 



Hs.121629 
Hs.332557 
Hs.291911 
Hs.94891 
Hs.252956 
Hs.25130 
Hs.156406

Hs.188646 

Hs.190535

Hs.190515 

Hs.197030 
Hs.6724 
Hs.156739

Hs.336846 

Hs.32417 



ESTs, Weakly similar to A46010 X-linked 
ESTs 

hypothetical protein FLJ22729
ESTs 

Homo sapiens cDNA FLJ14923 fis, clone PL 
Human DNA sequence from clone 119BHS on 

ESTs, Weakly similar to ALU1_HUMAN ALU S 
hypothetical protein MGC10858 
ESTs 

gb:EST391351 MAGE resequences, MAGP Homo



Hs.159396 
Hs.98523 
Hs.290791 
Hs.132121 



gb:PM1-MT0010-200300-001-g08 MT0010 Homo
hypothetical protein MGC4309

gtawSOcOS.sl SoaresJestis.NHT Homo SF.p 
Fc fragment of IgE, high affinity I, rec 
ESTs

KIAA0630 protein 

solute carrier family 16 (monocarboxylic 
ESTs, Weakly similar to T12545 hypotheti

gb:PM2-SN0018-290300-003-c09 SN0018 Homo 
gb:zh86e00.s1 Soares_fetal_liver_spleen_
peptidylglycine alpha-amidating monooxyg



Hs.108972
Hs.231991 
Hs.114762
Hs.336753 
Hs.192035 



gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens
Homo sapiens mRNA; cDNA DKFZp434P228 (fr 



Hs.337616 
Hs.10592 
Hs.44565 



Homo sapiens mRNA; cDNA DKFZp586E1624 (f
gb:MR4-ST0124-270300-005-b11 ST0124 Homo
ESTs 

gb:yr47f06.r1 Soares fetal liver spleen
phosphodiesterase 3B, cGMP-inhibited
ESTs 

gb:Human nonspecific crossreacting antig





BE565892


Hs.83077 


interleukin 18 (interferon-gamma-inducin 




AW975944 


Hs.237396 


ESTs 


432868 


AW974093 


Hs.292775 


ESTs 




AW605849 




gb:MR0-HT0241-200100-006-g02 HT0241


410252 


AW821182 


Hs.61418 


microfibrillar-associated protein 1


428804


AK000713 


Hs.193736 


hypothetical protein FLJ20706 


428775


AA434579 


Hs.143691 


ESTs 


410004 


AI298027 


Hs.5057 


carboxypeptidase D 


422093 


AF151852 


Hs.111449 


CGI-94 protein 


441736 


AW292779 




ESTs 


423017 


AW178761 


Hs.227948


serine (or cysteine) proteinase inhibito 


405970 
431954 




Hs.272242


hypothetical protein FLJ11112


459482 


AA625339 


Hs.237052 


EST, Weakly similar to I38022 hypothetic 


410361 


BE391804 


Hs.62661 


guanylate binding protein 1, interferon- 


410804 


U64820 


Hs.66521 


Machado-Joseph disease (spinocerebellar 


402230 
436120 


AI248193 


Hs.119860 


ESTs 



434374 AA631439

428911 Z43846 

437783 AI683150

416057 AI927382 

435496 AW840171 

436088 AA704687 

408554 AA836381

454076 AW204712 

431733 AW298410 

432974 BE348793 

412576 AA447718 

446142 AI754693

447432 AW958473 



436111 AI803082

421236 AI287622 

433917 AI809325 
403515 

429657 D13626 

453375 AI990114 

448186 AA262105 

412209 AW901456 

421065 AA329711 

409642 AW450809 

420092 AA814043 

453365 AA035211 

437007 AA741300 

408031 AA081395 



433650 AA603472

410318 AA084050 

427019 AA001732 

413714 AI560944 

430887 N66801 

413618 BE154078 

420908 AL049974 

436168 AK000883 
405692 

432809 AA565509 

433805 AA706910

436192 W93847 

435451 AF195420 

411849 AW964970 



gb:np85d02.s1 NCI_CGAP_Thy1 Homo sapiens

Hs.194478 Homo sapiens mRNA; cDNA DKFZp434O1572 (f

Hs.201550 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.29857 ESTs 

Hs.265398 ESTs, Weakly similar to transformation-r

Hs.191294 ESTs 

Hs.315111 nuclear receptor co-repressor/HDAC3 comp

Hs.61957 ESTs 

Hs.21475 ESTs 

Hs.233331 ESTs 

Hs.107057 ESTs 

Hs.145968 ESTs 

Hs.301957 nudix (nucleoside diphosphate linked moi

Hs.124244 ESTs

Hs.184156 ESTs 

gb:nc06c05.s1 NCI_CGAP_Pr1 Homo sapiens

Hs.157212 ESTs 

Hs.151956 ESTs 

Hs.122814 Human DNA sequence from clone RP5-1028D1

Hs.2465 KIAA0001 gene product; putative G-protei

Hs.240091 ESTs 

Hs.4094 Homo sapiens cDNA FLJ14208 fis, clone NT

gb:RC0-NN1012-270300-031-c07 NN1012 Homo
gb:EST33382 Embryo, 12 week II Homo sap: 

Hs.257347 ESTs 

Hs.88045 ESTs 

Hs.17404 ESTs 

Hs.202599 ESTs, Weakly similar to I38022 hypotheti 

Hs.42173 Homo sapiens cDNA FLJ10366 fis, clone NT

Hs.35598 ESTs 

Hs.85112 insulin-like growth factor 1 (somatomedi 

Hs.190008 ESTs 

Hs.28456 ESTs 

Hs.269259 ESTs, Weakly similar to S23650 retroviru 

Hs.173233 hypothetical protein FLJ10970

Hs.71428 ESTs 

Hs.250287 KIAA1841 protein 

gb:PM0-HT0339-200400-010-F04 HT0339 Homo

Hs.100261 Homo sapiens mRNA; cDNA DKFZp564B222 (fr 

Hs.301645 Homo sapiens cDNA FLJ10021 fis, clone HE 

Hs.131703 ESTs 

Hs.112742 ESTs

Hs.24139 Homo sapiens cDNA: FLJ23137 fis, clone L

Hs.303006 ESTs, Weakly similar to gamma-heregulin

Hs.18861 ESTs, Moderately similar to KIAA1276 pro 

gb:RC6-BT0709-310300-021-G07 BT0709 Homo 

Hs.63668 toll-like receptor 2

Hs.79306 eukaryotic translation initiation factor

Hs.27693 peptidylprolyl isomerase (cyclophilin)-l

Hs.10760 asporin (LRR class 1)

Hs.97469 ESTs, Highly similar to A39769 N-acetyll

Hs.24427 DKFZP56601646 protein 



Hs.197693 ESTs

Hs.289005 Homo sapiens cDNA: FLJ21532 fis, cl
Hs.246645 ESTs 



449897 AW819642 

428467 AK002121 

408761 AA057264
403895 

414899 AW975433

409044 AI129586 

447233 AW246333 

422219 AW978073 

427119 AW880562 

437073 AI885608 

443830 AI142095

454962 AW847645 

433644 AW342028 

417561 AW974345 

446063 AI720140 

423609 AA328348 

428004 AA449563

453370 AI470523

435808 AA702866 

424001 W67883 

415635 F13168 

418946 AI798841 

431750 AA514986 

425188 AK002052 

428268 AA424957 

418878 W20090 
416565 AW000960 
454288 BE222648 
446428 AW082270 
404588 

413087 BE064655 
444910 AI201849 
407339 AA777542 
414093 BE544867 
438458 AW975186
419340 AA236590 
423448 AK000776 
457030 AI301740 
421187 NM_014721 



423841 AW753967 

438839 AW297945 

410085 AA428482 

427961 AW293165 

429228 AI553633 

431548 AI834273 

441839 AW975512 

410389 AW954049 

441274 AW593781 

452401 NM_007115

436154 AA764950 



434164 AW207019 

436739 BE208022 

451674 AA019104 

421166 AA305407

437872 AK002015 

440046 AW402306 

452824 W27643 

426457 AW894667 

424780 U39576 

456551 AW975051 

410763 AF279145 

431814 BE256242 

440099 AL080058 

436401 AI087958 

437439 H29796 
403277 

408547 AA574291 

424131 AA335714 

433222 AW514472 

434636 AA083764 

450519 AA010066 

415083 AI632683 

407905 AW103655

452311 AW304029 



Hs.191990 ESTs 

Hs.16026 hypothetical protein FLJ23191 

Hs.143686 ESTs 

Hs.215937 ESTs 

Hs.156294 ESTs

Hs.24135 transmembrane protein vezatin; hypotheti

Hs.184465 hypothetical protein FLJ11259

Hs.238936 ESTs, Weakly similar to (defline not ava

Hs.36288 ESTs 



Hs.1010 regulator of mitotic spindle assembly 1 

Hs.114574 ESTs

Hs.94122 ESTs 

Hs.143273 ESTs

gb:IL3-CT0213-280100-056-A04 CT0213 Homo 
gb:hb75d03.x1 NCI_CGAP_U12 Homo sapiens
gb:EST386449 MAGE resequences, MAGM Homo 

Hs.151079 ESTs

Hs.218289 ESTs 

Hs.151393 glutamate-cysteine ligase, catalytic sub
Hs.139336 ATP-binding cassette, sub-family C (CFTR
Hs.113150 ESTs
Hs.137475 paternally expressed 10 

gb:HSC3JF101 normalized infant brain cDN 
Hs.164525 ESTs

Hs.155071 hypothetical protein FLJ11190
Hs.294132 ESTs 
Hs.6616 ESTs 

Hs.44970 endoplasmic reticulum resident protein 5 
Hs.279458 ESTs, Highly similar to c380A1.1b [H.sap 
Hs.12496 ESTs, Weakly similar to ALU4_HUMAN ALU S



Hs.87530 
Hs.128753 
Hs.173381
Hs.102471 
Hs.93810 
Hs.198612 



Homo sapiens cDNA FLJ20769 fis, clone CO 

ih, 1 opyrimi Una 

KIAA0680 gene product 

cerebral cavernous malformations 1 

G protein-coupled receptor 51 

gb:RC2-CT0304-080100-011-M2 CT0304 Homo 




tumor necrosis factor, alpha-induced pr



KIAA1627 protein 

Homo sapiens cDNA: FLJ22016 fis, clone H
potassium inwardly-rectifying channel, s 
RNA binding motif protein 7 
hypothetical protein FLJ10483



butyrophilin, subfamily 1, member A1
ESTs, Weakly similar to I78885 serine/th
hypothetical protein FLJ21776 



Hs.243010 

Hs.148135 

Hs.127685 

Hs.175483 

Hs.102308 

Hs.5887 

Hs.6877 

Hs.73965 

Hs.169965 

Hs.153058 

Hs.293156 

Hs.8966 

Hs.270847 

Hs.6909 

Hs.29088 



Hs.57837 ESTs 

Hs.199665 ESTs

Hs.238415 ESTs, Moderately similar to ALU8_HUMAN A 

Hs.6101 hypothetical protein MGC3178

Hs.224849 Homo sapiens cDNA FLJ12583 fis, clone NT

Hs.27179 Homo sapiens cDNA FLJ12933 fis, clone NT 

Hs.252905 ESTs 

Hs.252744 ESTs 



446770 AV660309

424238 AA337401 

411643 AI924519 

447829 AI433029 
406506 

428301 AW628666 

428579 NM_005756

451229 AW967707 
401103 

433589 AA886530 

459370 AA889982 

438533 AI440266 
404288 
406195 






457447 X78261 

443773 AV646452 

459371 R20991 

421823 N40850 

447247 AW369351 

452896 AA831508 



407340 AA810168 
401862 

444325 AW152618 

408171 AA301228 

423949 AI014546 

419519 AI193719 

434683 AW298724 

418454 AA315308 

415066 AI597963 

419220 AA811938 

418849 AW474547 

443634 H73972 

429682 NM_006306
405090 

432267 AK000872 

443253 AI041212 

444974 AI203500 

445717 AW664658 

449347 AV649748 

452778 R71338 

414388 AL039185 

424405 D54120 

410371 AA084482 

426384 AI472078 

418200 AW629751 
427050 AA397789 
449579 AW207250 
411004 AW813242 
454032 W31790 



439416 W58294 

436635 AW104325 

419086 NM_000215 

412566 AW962574 

415452 F09134 

427874 AA732367 

447046 AA326187 

454193 BE141183 

454678 AW813089 

415122 D60708 

444665 BE613126 
400227 

411905 BE265067 

419503 AA243642 



434998 AW975157 



451494 AI799444 



Hs.154986 ESTs, Weakly similar to PLLP_HUMAN PLASM

Hs.137635 ESTs

Hs.192570 hypothetical protein FLJ22028

Hs.164104 ESTs

Hs.98440 ESTs, Weakly similar to I38022 hypotheti 

Hs.184942 G protein-coupled receptor 64

Hs.48473 ESTs 



Hs.22588 ESTs

Hs.29567 ESTs 

Hs.170623 hypothetical protein FLJ11183

Hs.24135 transmembrane protein vezatin; hypotheti

Hs.116957 ESTs

Hs.13941 ESTs

Hs.41294 ESTs 

Hs.272177 H.sapiens mRNA for TRE17 5' extremity ar, 

Hs.30941 calcium channel, voltage-dependent, beta 
gb:yg06h01.r1 Soares infant brain 1NIB H 

Hs.28625 ESTs 

Hs.287955 Homo sapiens cDNA FLJ13090 fis, clone NT

Hs.32553 ESTs 

Hs.151427 zinc finger protein 215

Hs.40919 Homo sapiens cDNA FLJ14511 fis, clone NT

Hs.284289 vitiligo-associated protein VIT-1 



Hs.16757 ESTs

Hs.43299 hypothetical protein FLJ12390

Hs.130912 ESTs 

Hs.176376 ESTs 

Hs.202639 ESTs 

Hs.195870 hypothetical protein FLJ14991 

Hs.118726 ESTs 

Hs.291759 ESTs 

Hs.53565 Homo sapiens PIG-M mRNA for 

Hs.134460 ESTs 

Hs.211602 SMC1 



Hs.274227 

Hs.132117

Hs.151612 

Hs.149332 

Hs.295901 

Hs.5921 

Hs.77558 

Hs.146409 

Hs.115850

Hs.303662 

Hs.205654 

Hs.161803

Hs.134014 



Homo sapiens cDNA FLJ10010 fis, clone HE 



cell division cycle 42 (GTP-binding prot 



Hs.56254 
Hs.272093 
Hs.89591 



ESTs, Weakly similar to alternatively sp 
ESTs 

ESTs, Weakly similar to T46425 hypotheti 
gb:MR3-ST0191-020200-207-g10 ST0191 Homo 
ESTs, Weakly similar to I54374 gene NF2
SRY (sex determining region Y)-box 2 
KIAA1233 protein 
ESTs 

ESTs, Weakly similar to I78885 serine/th 

Kallmann syndrome 1 sequence 

gb:EST374647 MAGE resequences, MAGG Homo 



Hs.17170 G protein-coupled receptor 4 

gb:MR0-HT0071-19119S-001-b04 HT0071 Homo 
gb:RC3-ST0186-240400-111-b05 ST0186 Homo

Hs.22245 ESTs 

Hs.47783 B aggressive lymphoma gene 

gb:601193893F1 NIH_MGC_7 Homo sapiens cD

Hs.137422 ESTs 

Hs.141454 ESTs 
Hs.130522 

Hs.26037 ESTs 

Hs.5076 Homo sapiens cDNA: FLJ22128 fis, clone H 

Hs.150402 activin A receptor, type I

Hs.18200 ESTs 

Hs.294140 ESTs 

Hs.247095 ESTs, Moderately similar to ALU7_HUMAN A 
442994 AI025718

408165 AL137573

421072 AI215069 

456273 AF154846
404548 

428201 AA424158 

441519 AA972740 

445413 AA151342 

418717 AI334430 



431023 AI283133

432596 AJ224741 

452412 AA029608 

421309 AI222086

438128 AA904430 



400880 

417014 AA251720
422278 AF072873 
406603 



423025 AA831267

422634 NM_016010

448966 AW372914 

408590 AW864542 

408525 AW206972 

412248 BE176480 

432507 BE391093 

447290 AI476732 

424188 AW954552 

431448 AL137517 

400325 M85292 

408408 AF070571 

423119 AA322201 

423717 AA330036 

424152 AL133591

431980 AA523696 

434980 AW770553 

444339 T96555 

446745 AW118189

459201 AW391177 

430573 AA744550 

451073 AI758905 

440575 AA889870 



453123 
418343 
431595



FM3143 Homo sapiens mRNA; cDNA DKFZp564A2463 (f

Hs.89113 ESTs 

Hs.1148 zinc finger protein 

Hs.206461 ESTs 

Hs.127092 ESTs 

Hs.12677 CGI-147 protein 

Hs.86984 ESTs 

Hs.82302 Homo sapiens cDNA FLJ14814 fis, clone NT 

Hs.38365 KIAA0125 gene product 

Hs.37486 ESTs 

Hs.259746 ESTs, Weakly similar to A46010 X-linked 

Hs.297420 ESTs 

Hs.278461 matrilin 3 

Hs.51373 ESTs 

Hs.270449 ESTs, Moderately similar to ALULHUMAN A 

Hs.122049 ESTs, Weakly similar to T2D4_HUMAN TRANS 

Hs.44205 cortistatin

Hs.2471 17 ESTs, Moderately similar to ALUFJWMAN ! 



Hs.158308 
Hs.181022 
Hs.29553 
Hs.333513 
Hs.12244 



Hs.253595 ESTs 



Hs.324667 
Hs.263912 
Hs.142634 

Hs.247924
Hs.44690 
Hs.131976 
Hs.152003 
Hs.141480 
Hs.324507 
Hs.14553 
Hs.31562 
Hs.156400

Hs.136345 
Hs.206063 
Hs.126006 



Hs.97365 ESTs 



gb:RC3-HT0585-160300-022-c02 HT0585 Homo



Homo sapiens clone 24739 mRNA sequence 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434N079 (fr 
hypothetical protein FLJ20986
sterol O-acyltransferase (acyl-Coenzyme 
ESTs 
ESTs 

gb:MR3-ST0203-221299-02W05 ST0203 Homo 



Hs.269543 
Hs.187636 
Hs.123369 
Hs.221349 
Hs.159501



ESTs, Weakly similar to A56194 thromboxa 



ESTs 



ESTs 



429826 
437913 
441330
443458



AK000998 
BE048054 
AI796320 
AW195317 
AW450979 
AA976718 
H75701 
BE485884
NM_003181 



gb:nh60f07.s1 NCI_CGAP_Pr8 Homo sapiens 
Hs.297221 Homo sapiens cDNA FLJ10135 fis, clone HE

gb:tz46c03.y1 NCI_CGAP_Brn52 Homo sapien 
Hs.10299 Homo sapiens cDNA FLJ13545 fis, clone PL
Hs.107718 hypothetical protein FLJ22344

gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su



408875 
424602 
426174 



Hs.99886 
Hs.280728 
Hs.143507 
Hs.40747 
Hs.121623 
Hs.129354 
Hs.143509 
Hs.124292 
Hs.11392
Hs.84520 
Hs.153205
Hs.168830
Hs.187810
Hs.84561 
S78234 Hs.172405 
NM_015434 Hs.48604



ESTs 

complement component 4-binding protein, 



AA904244 
AW953838 
AA830854 
AA299919



hypothetical protein FLJ21924

Homo sapiens cDNA: FLJ23123 fis, clone L

c-fos induced growth factor (vascular en

ESTs, Weakly similar to TRHY_HUMAN TRICH

ESTs 

Homo sapiens cDNA FLJ12136 fis, clone MA



cell division cycle 27 
DKFZP434B168 protein 
hypothetical protein FLJ1119: 
449318 AW236021

429950 AW081608 

412733 AA984472 

423637 AL137279

442655 AW027457 

420556 AA278300 

430447 W17064 

416871 H98716 

439737 AI751438 

406815 AA833930 
401094 
401526 

414140 AA281279 

417320 AA195667 

418282 AA215535 

442927 AI024347 

450006 AI241555 

419231 AL046294 



414696 AF002020 



Hs.78531 Homo sapiens, Similar to RIKEN cDNA 5730 

Hs.105053 ESTs 

Hs.74554 KIAA0080 protein 

Hs.130187 Homo sapiens mRNA; cDNA DKFZp434O1214 (f

Hs 30323 EST: We / simila" 'o E 3408 7 hypotheti 

Hs.124292 Homo sapiens cDNA: FLJ23123 fis, clone L

Hs.332848 SWI/SNF related, matrix associated, acti

gb:yx13d08.s1 Soares melanocyte 2NbHM He 

Hs.41271 Homo sapiens mRNA full length insert cDN

Hs.288036 tRNA isopentenylpyrophosphate transferas



Hs.23317 
Hs.86022 
Hs.98133 
Hs.131519 
Hs.60171 
Hs.136245 
Hs.38761 

Hs.7B9ia 
Hs.190325 
Hs.127699
Hs.195641 
Hs.163425



hypothetical protein FLJ14681



Niemann-Pick disease, type C1 



440106 AA864968 

448019 AW947164 

431745 AW972448 

421426 AA291101 HMaiKU 

433014 NM_014711 Hs.279912 

455100 BE160198 

441790 AW294909 
404443 

428129 AI244311



415786 AW419196 

411213 AA676939 

420896 AW149342 

409994 D86864 

430388 AA356923 

419530 X98330 

455092 BE152428 

456118 AA380267 

440192 AA372282 

448466 AI522109 

414869 AA157291 

440351 AF030933 

407594 AW057584 

439235 N45513 

417061 AI675944 

434812 AA649860 

409731 AA125985 



Hs.132208 ESTs 



Homo sapiens, clone IMAGE:3939163, mRNA,
KIAA0419 gene product 

gb:QV1-HT0413-010200-059-h03 HT0413 Homo 



Hs.26912 
Hs.54973 
Hs.283077 
Hs.224672 



Hs.24444 
Hs.57735 
Hs.240770 
Hs.90821

Hs.78277 

Hs.190596

Hs.171066 

Hs.21479 

Hs.7179 

Hs.160681

Hs.46608 

Hs.188691

Hs.189496 

Hs.56145 



hypothetical protein FLJ13782
neuropilin 1 

Homo sapiens cDNA: FLJ22165 fis, clone H

acetyl LDL receptor; SREC 

nuclear cap binding protein subunit 2, 2 

ryanodine receptor 2 (cardiac) 

gb:CMO-HT0323-151299-126-b04 HT0323 Homo 

DKFZP434F2021 protein 

ESTs 



ESTs 

ubinuclein 1

RAD1 (S. pombe) homolog

ESTs 

ESTs 

Homo sapiens cDNA FLJ12033 fis, clo



410672 AW794600 

432798 AA565309 

416288 H51299 

438886 AA827728 

451558 NM_001089

416940 N75620 

421750 AK000768 

438398 AA806526 

435313 AI769400 

414605 BE390440 

436508 AW604381

413195 AA127382

413829 NM_001872
401323 

408296 AL117452

428532 AF157326 

423454 AL110456 

436027 AI864053 

406970 M29994 

426172 AA371307 

452114 N22687 

439750 AL359053 

423130 AW897586

430660 R11884 

434138 AA625804 

427469 AA403084 

429881 T80112 
411492 T46848 



Hs.194015 ESTs



gb:RC6-UM0014-170300-022-C05 UM0014 Homo 



gb:yp07c05.s1 Soares breast3NbHBst Homo 
E T Wea I pilar t F 942 2unkr 
ATP-binding cassette, sub-family A (ABC1 
ESTs 

hypothetical protein FLJ20761



Hs.128705
Hs.26630 
Hs.43157 
Hs.107872
Hs.130277

Hs.189729 ESTs

gb:601283601F1 NIH_MGC_44 Homo sapiens c

Hs.121121 ESTs, Weakly similar to S00755 pleckstri

Hs.22404 protease, serine, 12 (neurotrypsin, moto

Hs.75572 carboxypeptidase B2 (plasma)

Hs.44155 DKFZP586G1517 protein 

Hs.184786 TBP-interacting protein 

Hs.469 succinate dehydrogenase complex, subunit 

Hs.39972 ESTs, Weakly similar to I38588 reverse t
gb:Human alpha-I spectrin gene, exon 12.

Hs.125056 ESTs 

Hs.8236 ESTs 

Hs.57664 Homo sapiens mRNA full length insert cDN 

Hs.21213 ESTs 

Hs.100826 ESTs 

gb:zu86h01.s1 Soares_testis_NHT Homo sap

Hs.269347 ESTs, Weakly similar to 2109260A B cell

Hs.192245 ESTs 

Hs.70337 immunoglobulin superfamily, member 4 
418912
422505 
427752 



AI924940 

AL036449 

NM_000685 

AL120862

AA470687 

AI566356 

AA210863 



430687 BE274217 

450400 AI694722 

456844 AI264155 

418342 BE002723 

420758 AA411800

423532 BE090503 

440320 AA879294 

457314 AA479597 

439831 AW136488 

425661 AL133627

407949 W21874 

418658 AW874263 

409978 D31897 

421340 F07783 

449071 NM_005872

409241 AF070602 

448219 AA228092 

408936 AL138043

410784 AW803201 

426471 U22440 

454455 AW752710 

455310 AW893961 
401335 

435577 W84774 

409519 AA075368 

421003 T72080 



436007 AI247716 



4357S6 R11673 

410327 T33130 

416805 F13271 

417177 NM_004458

423020 AA383092 

427134 AA398409 

428137 AA421792 

429710 AI337113 

430844 T94960 

417576 AA339449 

441928 AI370188 

409721 AW887732 

427112 Z32887 
403776 

420159 AI572490 

427839 AA608823 

432837 AA310693

438782 AA828380

449396 BE169100 

458043 AW979009 

438171 AW976507 

452959 AI933416 

439556 AI623752 

446152 AI292036 

434803 AW974640 

407771 AL138272 

411069 AL133092 

417543 AA203620 
401517 
403677 

416337 H48713 
423401 NM_001992 

446800 AI341635 

457906 AW975939 
452277 AL049013 
416913 AW934714 
416370 N90470 



Hs.95424 ESTs 

Hs.8136 endothelial PAS domain protein 1
gb:nv35f03.r1 NCI_CGAP_Br5 Hoi 

Hs.283713 ESTs, Weakly similar to S64054 hypotheti

Hs.22581 ESTs 

Hs.108082 ESTs, Weakly similar to T31636 hypotheti

Hs.207163 ESTs 

Hs.89472 angiotensin receptor 1 

Hs.124165 ESTs 

Hs.104772 ESTs

Hs.171437 ESTs 

Hs.3532 nemo-like kinase 

Hs.22559 KIAA0197 protein 

Hs.249247 
Hs.279744 
Hs.152981
Hs.226627 
Hs.189900 



CDP-diacylglycerol synthase (phosphatida



Hs.25545 
Hs.158923 
Hs.247057 
Hs.32468 



Hs.22960 
Hs.51649 
Hs.42656 
Hs.293549 



gb:RC6-BT0717-110400-011-F11 BT0717 Homo
gb:nw36e09.s1 NCI_CGAP_Pr12 Homo sapiens
hypothetical protein DKFZp586J1119 



gb:IL2-UM0077-070500-080-E06 UM0077 Homo 
transforming growth factor, alpha 
gb:IL3-CT0219-281099-024-A03 CT0219 Homo 
gb:RC4-NN0027-060400-01M11 NN0027 Homo 



Hs.17643 ESTs



Hs.95667 
Hs.209927 
Hs.195870 
Hs.232168 
Hs.252730 



Hs.81452 
Hs.1608 
Hs.173561
Hs.170999 



ESTs 
ESTs 

RAP2A, member of RAS oncogene family 
Human clone 23560 mRNA sequence 
fatty-acid-Coenzyme A ligase, long-chain
replication protein A3 (14kD) 



hypothetical protein FLJ23594
gb:ye38d07.r1 Stratagene lung (937210) H
phosphoribosylglycinamide formyltransfer



Hs.99785 
Hs.98244 
Hs.87329 
Hs.126733 



Homo sapiens cDNA: FLJ21245 fis, cl
ESTs 

HSPC072 protein 
ESTs 



Hs.326108 ESTs 

Hs.293515 ESTs 

Hs.189674 ESTs 

Hs.163603 ESTs

Hs.150028 ESTs 

Hs.303413 ESTs 

Hs.52713 ESTs 

Hs.68055 hypothetical protein DKFZp434l0428 

Hs.110153 ESTs



gb:yq78d02.M Soares fetal liver spleen 
Hs.128087 coagulation factor II (thrombin) recepto
Hs.156485 ESTs

Hs.153290 Homo sapiens cDNA FLJ14313 fis, clone PL
Hs.28783 KIAA1223 protein 

gb:RC1-DT0001-031299-011-a11 DT0001 Ho 
Hs.203697 ESTs, Weakly similar to I38022 hypotheti



WO 03/042661 
408715 AA768873
410743 AA089474 
427138 N77624 
436260 BE172762 
427565 AI287280
406092 

410008 AA079552 
438504 AW665281 
414783 AW069569 
411479 AW848047 
418686 Z36830 
413795 AL040178
457528 AW973791 
444230 H95537 
403760 

416624 H69044 
428904 AI312526
446311 AW007294 
458638 N78553 
459267 AJ003631 
424834 AK001432 
433906 AM 6781 6 
428966 AF059214
446554 AA151730
446035 NM_006558 
421585 U95626 
445158 AI992108 
421175 AI879099
401793 
410181 AI468210 
427038 NM_014633
451343 AW975057 
455992 BE179015 
438475 W03856 
455571 BE003714 
426298 AW965058 
407930 AA045847 
453891 AB037751 
451487 AA018072 
418269 AA806113 
419196 AF110908
459160 AI904723 
441963 AI733307
440273 AI805392 
426902 AI125334
414271 AK000275 
453313 BE005771 
445265 AI218295 
422988 AW673847 
428613 AB037749 
444619 BE538082 
457300 AW297436 
402800 

425071 NM_013989 
414729 BE466928 
453716 AA037575 
452693 T79153 
439818 AL360137 
443305 AI050693 
416709 R99369 
419077 AA233885 
453878 AW964440 
445660 AI702668 
446817 AI700684 
442137 AA977235 
410406 AI969703 
442242 AV647908 
407830 NM_001086 
415138 C18356 
407055 X89211 
408812 BE397160 
440310 AA878939 
425659 AK000590 
418217 AI910647 
428667 AI375550 
414573 AI821846 
420000 AB036063 
452821 AW471181 
440138 AB033023 
428483 A1908539 
441350 AB020690 



Hs.112250 hypothetical protein FLJ23518 

Hs.272153 ESTs 

Hs.173717 phosphatidic acid phosphatase type 2B

Hs.292710 ESTs, Weakly similar to ALU5_HUMAN ALU S

Hs.97933 ESTs, Weakly similar to T46370 hypotheti 

gb:zm20h12.s1 Stratagene pancreas (93720

Hs.224625 ESTs 

Hs.278270 unactive progesterone receptor, 23 kD 

gb:IL3-CT0214-291299-052-A12 CT0214 Homo

Hs.87268 annexin A8

Hs.142003 ESTs 

Hs.292784 ESTs 

Hs.146067 ESTs 

gb:yr77h05.s1 Soares fetal liver spleen 

Hs.46640 ESTs 

Hs.149795 ESTs, Moderately similar to ALU1_HUMAN A

Hs.282204 nucleosomal binding protein 1 

gb:AJ003631 Selected chromosome 21 cDNA 

Hs.153408 Homo sapiens cDNA FLJ10570 fis, clone NT 

Hs.43355 ESTs 

Hs.194687 cholesterol 25-hydroxylase 

Hs.301789 nudix (nucleoside diphosphate linked moi 

Hs.13565 Sam68 like phosphotyrosine protein, T-ST 

Hs.302043 chemokine (C-C motif) receptor-like 2 
Hs.127206 ESTs

Hs.102397 GIOT-3 for gonadotropin inducible transc 



Hs.261285 
Hs.173288 
Hs.293353 ESTs 



gb:RC3-HT0612-080500-013-h10 HT0612 Homo 
ESTs, Highly similar to Gene product wit 
gb:QV3-BN0096-200400-161-a01 BN0096 Homo 
ESTs, Weakly similar to I38022 hypotheti
Homo sapiens cDNA FLJ12807 fis, clone NT 
Homo sapiens mRNA full length insert cDN 
gb:ze51g02.r1 Soares retina N2b4HR Homo
ESTs 



Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L



Hs.186928 KIAA1328 protein 

Hs.8172 ESTs, Moderately similar to A46010 X-lin 

Hs.158849 Homo sapiens cDNA: FLJ21663 fis, clone C

Hs.154424 deiodinase, iodothyronine, type II

Hs.281901 ESTs 

Hs.152675 ESTs 



Hs.133318 ESTs 

Hs.283108 hemoglobin,; 

Hs.164526 ESTs

Hs.19025 DC32



Homo sapiens mRNA full length insert cDN



Hs.134166 
Hs.128830
Hs.1466 
Hs.90424 



Hs.295944 tissue I 



ESTs, Weakly similar to Z1 92.HUMAN ZINC 
glycerol kinase 

Homo sapiens cDNA: FLJ23285 fis, clone H
arylacetamide deacetylase (esterase) 

pathway Inhibitor 2 
gb:H.sapiens DNA for endogenous retrovir
ESTs, Weakly similar to A42442 integrin 
ESTs 

hypothetical protein FLJ20583



Hs.254753
Hs.125405 
Hs.158835 
Hs.13442 ESTs
Hs.74407 nucleolar protein p40; homolog of yeast
Hs.71999
Hs.94262 
Hs.160874 
Hs.318127 
Hs.321444 
Hs.7782 



425178 H16097 Hs.161027 ESTs 



442952 AI743261 
428692 AI372822 
456179 H75490 
414136 AA812434 
459456 AA485036 
425527 AL162032
424711 NM_005795 
417955 AA210704 
420621 AA278808 
425698 NM_016112
438295 AI394151 
445550 AI242754 
450469 AI955049 
458804 AL157625 
443657 R14973 
429250 H56585 
437906 AA771704 
426775 AA384564 
443372 AI792557 
453785 AI358236
425465 L18964 
422746 NM_004484 
413450 Z99716 
424527 AW138558 
414180 AI863304 
411402 BE297355 
445254 AI218263 
458861 AI630223 
415227 AW821113 
435429 AW592035 
434445 AI349306 
448570 AI923944 
452381 H23329 
422879 AI241409 
409026 AL137554
425717 X07282 
429127 AA749382 
438298 H23542 
442717 R88362 
443555 N71710 



Hs.131860 ESTs 

Hs.110103 RNA polyme

Hs.271930 ESTs 

Hs.119023 SMC2 (structural maintenance of chromoso

Hs.190124 ESTs 

Hs.158258 Homo sapiens mRNA; cDNA DKFZp434B1272 (f 

Hs.152175 calcitonin receptor-like

Hs.190465 ESTs 

gb:zs79c09.r1 NCI_CGAP_GCB1 Homo sapiens 

Hs.159241 polycystic kidney disease 2-like 1

Hs.37932 ESTs 

Hs.137306 ESTs

Hs.281326 ESTs 

gb:DKFZp761L2016_M 761 (synonym: hamy2] 
gb:yf42f10.s1 Soares fetal liver spleen 

Hs.198308 tryptophan rich basic protein 

Hs.194626 ESTs

Hs.108829 ESTs

Hs.133107 ESTs 

Hs.283732 ESTs, Moderately similar to ALU1_HUMAN A



3.1904 protein l< 



Hs.267158 ESTs, Weakly similar to I54374 gene NF2 

Hs.120905 Homo sapiens cDNA FLJ1144S fis, clone HE 

Hs.69855 NRAS-related gene

Hs.323472 EST 

gb:ad06g08.r1 Proliferating Erythroid Ce 

Hs.72402 ESTs 

Hs.254414 ESTs, Weakly similarto 1805195BRNW»n 

Hs.11782 ESTs

Hs.30913 ESTs 

Hs.290880 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.188092 ESTs

Hs.49927 protein kinase NYD-SP15

Hs.171495 retinoic acid receptor, beta

Hs.118797 ubiquitin-conjugating enzyme E2D 3 (homo

Hs.181783 ESTs 

Hs.180591 ESTs, Weakly similar to T23976 hypotheti

Hs.21398 ESTs, Moderately similar to A Chain A, H



Hs.14 



ESTs 



se A2-activafc 



451813 NM_016117 Hs.27182

452453 AI902519 gb:QV-BT009-101198-051 BT009 Homo sapien

455870 AW452631 Hs.313803 ESTs, Highly similar to AF1 578331 noncl 

437939 AW298600 Hs.141840 ESTs, Weakly similar to S59501 interfero 

430719 AA488988 Hs.293796 ESTs 

452854 AA033714 Hs.287629 hypothetical protein FLJ14260

432095 AW022273 Hs.105769 ESTs

431085 AI829692 Hs.211561 ESTs 

407783 AW996872 Hs.172028 a disintegrin and metalloproteinase doma

423952 AW877787 Hs.135102 KIAA0853 protein 

Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL

Hs.46039 phosphoglycerate mutase 2 (muscle) 

gb:MR3-SN0010-270300-103-h02 SN0010 Homo
Hs.275343 ESTs 
Hs.135173 ESTs 

Hs.1216BB ESTs, Moderately similarto PC4259 fern 

gb:Human forkhead family (AFX1) mRNA, pa 
Hs 269783 EST LU1 HUMAN ALUS 

gb:ye53h05.s1 Soares fetal liver spleen 
Hs.119699 hypothetical protein FLJ12969 
Hs.163230 ESTs

Hs.337620 Homo sapiens AFG3L1 isoform 1 mRNA, part 
Hs.285754 met proto-oncogene (hepatocyte growth fa
Hs.234478 Homo sapiens cDNA: FLJ22648 fis, clone H
Hs.76578 protein inhibitor of activated STAT3 



408172 W02488 

430933 AW863635 

420691 AA829433 

429761 AI276780 

437958 BE139550 

407494 U10072 

436464 AI016175 

407137 T97307 

445223 BE300091 

438647 AA813118 

438192 AI859065 

417218 AA005247 

440460 H92571 

414512 BE274552 

428170 H05530 

457343 NM_013936 

424020 R76946 

455226 AW902103 

411965 BE467339 

432656 NM_000246 

455488 AA102322 

434340 AI193043
404285 

418744 AI887288 

454714 AW815098 

429828 AB019494 

436387 AA714760 

448587 AI539652 
432865 AI753709 
440479 AA886461 
443160 AI467915 



... ESTs 
Hs.247862 olfactory receptor, family 12, subfamily
Hs.39738 ESTs 

gb:QV0-NN1022-120500-220-c07 NN1022 Homo
Hs.280115 ESTs 

Hs.3075 MHC class II transactivator 

gb:zl90f03.r1 Stratagene colon (937204)
Hs.128585 ESTs, Weakly similar to T17226 hypotheti

Hs.196379 ESTs, Weakly similar to putative p150 [H

gb:QV4-ST0212-091199-023-f10 ST0212 Homo

Hs.225767 IDN3 protein

Hs.240075 Homo sapiens cDNA FLJ13234 fis, clone CV

Hs.28338 KIAA1546 protein 

Hs.152484 ESTs, Weakly similar to I38022 hypotheti

Hs.208161 ESTs 

Hs.36053 ESTs 
418122 R42778

442875 BE623003 

414373 AW162907

458760 AI498631 

434131 AI858275

441805 AA285136 

457292 AI921270 

417351 T90278 

409695 AA296961 

432824 AK001783 

438038 AI732529

454836 AW833711 

453919 AW959912 

422487 AJ010901 

408727 AL137259

427491 R43279 

435102 AW899053 

409617 BE003760 



407756 AA116021
401078 

410365 AI287518 

425201 AA352111 

457112 AW772449 

455252 AW876627 

444542 AI161293 

419249 X14767 

428497 BE010877 

457336 AW969657 

427621 BE621182 

423782 AI472209

430403 AF039390 

429927 NM_001115 

408562 AI436323 

417137 U46265 

436787 AA908554 

440331 AL046412 

429716 R25685 

417169 R13550 

453020 AL162039

455286 BE144384 

450654 AJ245587 

400432 AX015809 

415747 AA381209

446346 AI290205 

450209 AW008921 

453202 AW085781 



Hs.125445 
Hs.332938 
Hs.257789 
Hs.22217 






111334 ferritin, light polypeptide 



Hs.143659 
Hs.301914 
Hs.334882 
Hs.15049 



401254 

426604 H53354 

449535 W15267 

433138 AB029496 

425804 BE501698 

429515 AL031228 

437267 AW511443 

454305 BE062633 

455631 BE063031 
401878 

450350 T97817 

436532 AA721522 

427304 AA761526

419721 NM_001650 

431699 NM_001173

446252 AI283125 

421229 AI055590

434273 AA913143 
456088 BE177320 
434353 AA630863 
436198 AK001125 
452232 AW020603 
433764 AW753676 
412050 H96503 
422342 AA309272 
427510 Z47542 
428336 AA503115 



Hs.268081 

Hs.280380 
Hs.89768 
Hs.98584 
Hs.291029 
Hs.179882 
Hs.323117
Hs.241382
Hs.2522

Hs.81281
Hs.192756
Hs.202151 
Hs.211933 
Hs.246773 
Hs.31422 



Hs.309940 

Hs.13138 

Hs.26270 

Hs.158244 

Hs.13775 

Hs.646 

Hs.119471



ESTs 

gb:EST112514 Adrenal gland tumor Homo sa
hypothetical protein FLJ10921
ESTs, Weakly similar to TA2R_HUMAN, BETA
gb:QV4-TT0008-251199-043-e11 TT0008 Homo 
KIAA1705 protein 
mucin 4, tracheobronchial

hypothetical protein DKFZp434D0513 
ESTs, Weakly similar to I38022 hypotheti
F-box only protein 8 

Homo sapiens mRNA; cDNA DKFZp434K0514 (1 
gb:CM0-HT0249-291099-084-c04 HT0249 Homo 
3 ESTs 

ubiquitin specific protease 18

Homo sapiens mRNA; cDNA DKFZp586D0923 (f 
gb:EST60061 Activated T-cells XX Homo sa
ESTs, Weakly similar to ALU1_HUMAN ALU S
gb:RC3-PT0028-120200-013-d11 PT0028 Homo 



gamma-aminobutyric acid (GABA) A recepto



tumor necrosis factor (ligand) superfami

adenylate cyclase 8 (brain) 

Homo sapiens mRNA for KIAA1568 protein,

mitochondrial ribosomal protein S21 

ESTs 

ESTs 

collagen, type XIII, alpha 1 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 
gb:MR0-HT0166-191199-004-c11 HT0166 Homo
Kruppel-type zinc finger protein 
Sequence 8 from Patent WO9950285 
gb:EST94257 Activated T-cells I Homo sap
ESTs 

Homo sapiens, clone IMAGE:3448343, mRNA, 
hypothetical protein FLJ11588
KIAA0479 protein 
hypothetical protein SMAP31 
carboxypeptidase A3 (mast cell) 



Hs.258189 
Hs.204370 
Hs.258110 



DNA segment on chromosome 6 (unique, pse 



Hs.129206
Hs.163853
Hs.288650
Hs.267831
Hs.150009
Hs.70 



gb:nv54h12.r1 NCI_CGAP_Ew1 Homo sapiens 



Hs.158423 
Hs.39982 
Hs.109087



Rho GTPase activating protein 6 
ESTs 

hypothetical protein MGC12435
ESTs 

hypothetical protein FLJ13231 

ESTs, Moderately similar to ALUB_HUMAN I

Homo sapiens cDNA FLJ10263 fis, clone HE

radial spoke protein 3 

ESTs 

Homo sapiens cDNA: FLJ22845 fis, clone K
454190 AW177821
445865 AI262584 
451800 AW977435 



412095 AI624707 

453619 H87648 

431071 AA491379 

407939 W05608 

409045 AA635062

444575 AI264847 

408420 NM_006915 

417318 AW953937 

413382 BE090689 

406748 AW339106 

445898 AF070623 

441817 AW969706 

450551 AJ010046 

457940 AL360159 

446135 AW130288 

436907 AA737171 

429399 AA452244 

448782 AL050295 

434404 AW445034 

428571 NM_006531 

448164 R51680 

442295 AI827248 

450705 U90304 



Hs.48295 RNA helicase family

gb:600943376F1 NIH MGCJ7 Homo sapiei 
Hs.26799 DKFZP564D0764 protein 
Hs.15973 ESTs, Highly similar to bA393J16.3 (H.sa

gb:AB214F6R Infant brain, LLNL array of 
Hs.116750 Homo sapiens cDNA FLJ13221 fis, clone NT
Hs.23196 ESTs 

gb:IL3-HT0059-180899-007-C05 HT0053 Ho
Hs.145575 ESTs 
Hs.323867 ESTs 
Hs.173536 ESTs 

Hs.187698 
Hs.240112 
Hs.259855

Hs.5921 Homo sapiens cDNA: FLJ21592 fis, clone C 

Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 
gb:aa65f05.r1 NCI_CGAP_GCB1 Homo sapiens 

Hs.312679 ESTs, Weakly similar to A49019 dynein he

Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f

Hs.22545 Homo sapiens cDNA FLJ12935 fis, clone NT 

Hs.44766 retinitis pigmentosa 2 (X-linked recessi 



Hs.217493 
Hs.13423 
Hs.293332 
Hs.25155 



gb:RC1-BT0720-280300-011-f08 BT0720 Homo 
annexin A2

Homo sapiens clone 24468 mRNA sequence 

neuroepithelial cell transforming gene 1 
Homo sapiens TRIpartite motif protein ps
hypothetical protein FLJ10147 



Hs.2291 Probe hTg737 (polycystic kidney disease, 

Hs.26904 ESTs, Moderately similar to Z195_HUMAN Z 

Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE

Hs.25351 iroquois homeobox protein 5

Hs.158205 basic leucine zipper nuclear factor 1 (J

Hs.136348 osteoblast specific factor 2 (fasciclin 



415250 F02514 

434627 AI221894 

443919 AI091284 

440400 AA994364 

400385 NM_020389 

411322 AW887330 

434638 H50758 

435559 AF209198 

447849 AI538147 

448005 AW207437 

454201 AB023191 

456869 BE467912 

449486 AI552715 

421516 AI362418 

412167 AW897230 

426910 AA470023 

429673 AA884407 
400641 

430576 AA767125 

434423 NM_006769

412104 AW205197 

441499 AW298235 

418113 AI272141 

417819 AI253112 

431728 NM_007351

425025 AW953168 

421168 AF182277

409432 D49372 

408867 AA437199 

439446 AI927629 



455107 BE154113 

458624 AI362790 

449523 NM_000579 

428784 Y12851 

453864 AW021407 

426497 AA379913 

418203 X54942 



Hs.27319 
Hs.39311 
Hs.135224
Hs.125594
Hs.283104
Hs.172405

Hs.42636
Hs.164277
Hs.170378
Hs.44131 
Hs.154294 
Hs.270811 
Hs.105379 



gb:IL2-BT0734-240400-071-B04 BT0734 Hom

ESTs, Weakly similar to A47582 B-cell gr
ESTs, Weakly similar to T25472 hypotheti
putative capacitative calcium channel 

gb:yp86e06.r1 Soares fetal liver spleen 



Hs.19 



FT005 protein 

gb:CM0-NN0057-150400-335-a11 NN0057 Homo
ESTs, Moderately similar to ALU1_HUMAN A
protein tyrosine phosphatase, non-recept



iNcDNA2210 

Hs.83484 SRY (sex determining region Y)-box 4 

Hs.133540 ESTs 

Hs.268107 multimerin 

Hs.12407 ESTs 

Hs.330780 cytochrome P450, subfamily IB (phenobar 

Hs.54460 small inducible cytokine subfamily A (Cy 

Hs.656 cell division cycle 25C 

Hs.57873 ESTs 

Hs.143917 dJ467N11.1 protein



gb:PM1-HT0340-151299-003-a08 HT0340 Homo 
Hs.278639 KIAA1684 protein; likely homolog of mous 
Hs.54443 chemokine (C-C motif) receptor 5
Hs.193470 purinergic receptor P2X, ligand-gated io
Hs.21058 hypothetical protein 

gb:EST92807 Skin tumor I Homo sapiens cD
Hs.83758 CDC28 protein kinase 2 

gb:EST95683 Testis I Homo sapiens cDNA 5 
447357 AI375922

452631 AI188658 
405041 

409744 AW675258 

433868 AA612960 

437119 AI379921 

455114 AW857121 

431613 AA018515 



446659 AI335361 

418636 AW749855 

434995 AW974995 

438005 BE151746 

444755 AA431791 

427131 AA448460 

442039 AW276240 

448595 AB014544 

432949 AA570749

444314 AI140497 

417420 T85150 



416100 H18700 

419637 W27493 

449432 AW451361 

454403 BE065985 

419179 AW275291 

436391 AJ227892 

449511 AI436187 

447499 AW262580 

407244 M10014 

412877 BE011168 

435985 AA703154 

440674 BE561546 

446476 AW294072 

444100 AA383343 

435731 AA699581 

437105 AA744554 
406091 

457024 AA397546 
404249 

419556 U29615 

424943 AU077260 

444229 AV648613 
404860 



Hs.56265 Homo sapiens mRNA; cDNA DKFZp586P2321 (f 



Hs.264482
Hs.183817
Hs.279708 ESTs
Hs.138215 Homo sapiens cDNA FLJ11400 fis, clone HE
Hs.79385 Human clone 23574 mRNA sequence
Hs.106747 serine carboxypeptidase 1 precursor prot
Hs.226376 ESTs 

gb:QV4-BT0534-281299-053-c05 BT0534 Homo 
gb:EST387100 MAGE resequences, MAGN Homo
gb:PM1-HT0305-061299-003-a06 HT0305 Homo 
ClpX (caseinolytic protease X, E. coli)



410467
420843
434927
413642
435998

441235
445748
451018
409073
418782

447870
437370
424765
407385
424049
425398
430702

441675
430884
445523
416972 



Hs.112017 
Hs.128352
Hs.21572 



Hs.268814 
Hs.184387 



GE36ge 
ESTs 

KIAA0644 gene product 
ESTs 

gb:ow76b09.s1 Soares_fetal_liver_spleen_
ESTs 

gb:ye48b07.r1 Soares fetal liver spleen 
ESTs 

gb:EST386997 MAGE resequences, MAGN Hon 
gb:nh34b02.s1 NCI.CGAPJW Homo sapiens 
TRABID protein

gb:31h10 Human retina cDNA randomly prim 



Hs.113009
Hs.146274 
Hs.296261 
Hs.147674 
Hs.75431 



Hs.19 



gb:601347208F1 NIH_MGC_8 Homo sapiens cD 
ESTs 

CDC14 (cell division cycle 14, S. cerevi 



AF102546

H96982 

H46612 

BE154837 

AA745625 



AA063458 
AI792648 
BE139479
AL359567 
AA428211 
AA610150 
AB014524 



AF053748 
Z30118 
BE019670 
AW271273 



Hs.291414 
Hs.135570
Hs.13252 
Hs.247324 



dachshund (Drosophila) homolog 
ESTs 

Homo sapiens HSPC285 mRNA, partial cds 
gb:PM1-HT0345-121199-001-c08 HT0345 Homo
ESTs, Weakly similar to ALU8_HUMAN ALU S
Homo sapiens cDNA: FLJ21268 fis, clone C 



Hs.161492 ESTs 



Homo sapiens mRNA; cDNA DKFZp547D023 (fr 
hypothetical protein FLJ14033 similar to
ESTs, Weakly similar to I38022 hypotheti 
KIAA0624 protein 

hypothetical protein similar to tenascin 



AW813004 
AI524673 
AW971205 



Hs.284256 
Hs.272072 
Hs.138380 



Hs.13277 hypothetical protein FLJ22054 

Hs.5461 ESTs 

Hs.248114 glial cell derived neurotrophic factor

Hs.293788 ESTs, Moderately similar to unnamed prot 

gb:bb28c01.x1 NIH_MGC_5 Homo sapiens cDN 

Hs.23767 hypothetical protein FLJ12666 

Hs.191284 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.180582 Homo sapiens cDNA: FLJ21836 fis, clone H

gb:RC3-ST0186-230300-0194i02 ST0186 Homo
Hs.38170 ESTs
Hs.114280 ESTs
457065 AI476318
407945 X59208
419865 NM_007020 
423596 AA328195 
455807 BE141140 
435867 AA954229 
440195 N72847 
401213 

407291 AA001464
442490 AW965078 
452943 BE247449 
438138 R98299 
440283 AI732892 
447039 AV661798 
412777 AI335773 
421424 AW452690 
406673 M34996 
440555 D31292 
451516 AI800515 
424690 BE538356 
421046 AA810854 
423604 AA486585 
409029 BE087807 
444206 AW301017 
451836 T53573 
454784 AW820626 
423673 BE003054 
436671 AW137159 
434988 AI418055 
452862 AW378065 
439480 AL038511 
410606 AW418779 
426535 AU077012 
432239 X81334 
430217 N47863 
417479 AI057052 
421253 AI188102 
438180 AA808189 



Hs.192480 ESTs 

Hs.606 ATPase, Cu++ transporting, alpha polypep

Hs.93502 U1-snRNP binding protein homolog (70kD)

Hs.234101 EST similar I TL1 protein [H.

gb:MR0-HT0075-021299-006-d07 HT0075 Homo

Hs.114052 ESTs 

Hs.125221 ESTs 

gb:ze45b01.r1 Soares retina N2b4HR Homo

Hs.30212 thyroid receptor interacting protein 15

Hs.31082 hypothetical protein FLJ10525

Hs.177502 ESTs

Hs.190489 ESTs 

Hs.282915 ESTs 

Hs.270123 ESTs 

Hs.258775 ESTs 

Hs.198253 major histocompatibility complex, class

Hs.6853 hypothetical protein FLJ22167 

Hs.12024 ESTs 



Hs.258901 ESTs 

gb:QV1-BT0681-290400-181-g02 BT0681 Homo 

Hs.146492 ESTs 

Hs.173220 ESTs

gb:RC0-ST0299-190100-012-e10 ST0299 Homo

Hs.1695 matrixmT " 

Hs.146151 ESTs 

Hs.161160 ESTs

Hs.8687 ESTs

Hs.125316 ESTs, Weakly similar to S33990 finger pr

Hs.114889 ESTs

Hs.288582 ESTs, Weakly similar to ubiquitous TPRm
Hs.2936 matrix metalloprotein
Hs.336901 ribosomal protein S2
Hs.133554 ESTs, Weakly similar to Z195_HUMAN ZINC
ESTs 



439715 
441398 AA932398 

413585 AI133452 

448831 AL080123 

412953 Z45794 

430789 AA632577 

422757 AI909935 

423003 AL120077 

428595 AB037795 

437887 AA811524 

447720 AL038765 

452355 N54926 

408374 AW025430 

440381 AA917808

425478 AB007953 

432231 AA339977 

431757 AA196930 

417517 AF001176 

452837 AL121053 
417426 NM_002291
423739 AA398155 
416847 L43821 
425876 AW005887 
457411 AW085961 
413136 BE066941 
420313 AB023230 
421751 AW813731 

436331 AI239495 

439275 AF086093 

449272 AW137656

454352 AW389668 

428758 AA433988 

407242 M18728 

445326 AI220072 

423778 Y09267 

452607 AI160029 

423161 AL049227 

418851 AI417828 
458332 AI000341 
432565 AA553477 
437511 AI807500 
430957 AI937072 
425898 AA365649 



Hs.272151 ESTs 



Hs.15536
Hs.75431 
Hs.22182 



ESTs, Weakly similar to ALU1_HUMAN ALU S
ESTs, Weakly similar to D34( hy li
hypothetical protein DKFZp761J139



zinc finger protein 23 (KOX 16) 

Hs.238809 ESTs 

Hs.310235 ESTs, Weakly similar to I78885 serine/th 

Hs.55551 Homo sapiens, Similar to DNA segment, Ch

Hs.122967 kelch (Drosophila)-like 2 (Mayven)

Hs.186547 KIAA1374 protein

Hs.29253 hypothetical protein FLJ11896

Hs.161304 ESTs 

Hs.29202 G protein-coupled receptor 34 

Hs.155591 forkhead box F1

Hs.190495 ESTs

Hs.268840 ESTs 

Hs.274127 CLST 11240 protein

Hs.268526 Homo sapiens chromosome 21q22.1 anonymou

Hs.82238 POP4 (processing of precursor, S. cerev

Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT

Hs.82124 laminin, beta 1

Hs.97600 ESTs 

Hs.80261 enhancer of filamentation 1 (cas-like do 
Hs.234058 ESTs 
Hs.130093 ESTs 

gb:PM0-BT0340-091299-002-a11 BT0340 Homo
Hs.96427 KIAA1013 protein
Hs.159153 ESTs, Moderately similar to S65657 alpha
Hs.96867 Homo sapiens cDNA: FLJ23155 fis, clone L
Hs.120189 ESTs



Hs.132821 flavin cont

Hs.61438 ESTs

Hs.124776 Homo sapiens mRNA; cDNA DKFZp554N1116 ((

Hs.192435 ESTs 

Hs.220491 ESTs 

Hs.152428 ESTs
448225 AI476429
408955 BE315170 
416509 N57713 



440348 
436340 
444190 



R15891 

AW015802 

R42246 



AA445973 
AA974491 
AW994005 



453901 
418565 
433404 
409517 



436547 
437770 
409064 
442607 



413638 
449232 



AW241510 

AJ297351 

AA767881 

AA062954 

AA507576 

W57990 

AA297914 

AA832123 

AA524190 

H71252 

R52414 

AW192780

BE174240

AW752571 



Hs.19238 ESTs 
Hs.80" 



Hs.260899 
Hs.173044 
Hs.194695 
Hs.281587
Hs.47023
Hs.21608
Hs.10526
Hs.89578
Hs.53929
Hs.13303
Hs.21734 
Hs.337534 
Hs.23837 

Hs.86149 



Human (clone CTG-A4) mRNA sequence 



general transcription factor IIH, polype 
ESTs, Weakly similar to ALUDHUMAN !!!! 
Homo sapiens cDNA: FLJ21784 fis, clone H



Homo sapiens cDNA FLJ11812 fis, clone HE
gb:RC2-BT0318-150200-011-b09 BT0318 Homo
phosphoinositide 3-phosphate-binding prot



Hs.149442
Hs.252713
Hs.30824
Hs.122897
Hs.141883

Hs.60059
Hs.111749
Hs.177723 
Hs.120777 



hypothetical protein FLJ23476 



ne zipper transcription factor-like 



ESTs, Weakly similar to ELL2_HUMAN RNA P
gb ;■: 12h 12 s1 Soares fetal liver spleen 
gb:yg80b05.r1 Soares infant brain 1NIB H



407809 AW082279 

412303 AW936336 

420478 AA521259 

422711 D60641 

424073 U03493 

426567 AA381579 

435708 AI362949 

441417 AI733297 

445117 AI208754 

447197 R36075 

434228 Z42047 

445527 W39694 

445280 AW055063 

420653 AI224532 

419926 AW900992 

447541 AK000288 

424408 AI754813 

411893 R82845 

428192 AA424051 

435634 T82384 

437637 AJ003029 

438018 AK001160 

446184 AW273539
450232 BE300815 



417648 R06552 

427690 AI253134 

414217 AI309298 

450229 R18717 
400756 

408447 AK002089 
403388 

433643 AI821787 



Hs.244106 ESTs 

gb:QV4-DT0021-281299-070-g11 DT0021 Homo
Hs.193796 ESTs

Hs.21739 Homo sapiens mRNA; cDNA DKFZp586I1518 (f
Hs.138959 gap junction protein, alpha 7, 45kD (con
Hs.182962 ESTs
Hs.75169 ESTs 
Hs.144474 ESTs 
Hs.147369 ESTs 

gb:yh88b01.s1 Soares placenta Nb2HP Homo
Hs.283978 Homo sapiens PRO2751 mRNA, complete cds
Hs.83286 ESTs, Weakly similar to S14747 sphingomy 
Hs.306088 v-crk avian sarcoma virus CT10 oncogene 
ESTs 



Hs.18800 hypothetical protein FLJ20281 

Hs.145428 collagen, type V, alpha 1 

Hs.273789 ESTs 

Hs.304742 ESTs 

gb:yc14f05.r1 Stratagene lung (937210) H 

Hs.65792 syntrophin, gamma 2 

Hs.5999 hypothetical protein FLJ10298

Hs.288750 hypothetical protein FLJ23577

Hs.201326 ESTs

Hs.187561 ESTs, Moderately similar to ALU1_HUMAN A

Hs.112982 ESTs

Hs.103159 ESTs 

Hs.148661 ESTs 

gb:yl09e12.r1 Soares fetal liver spleen

Hs.283410
Hs.279898
Hs.8929

Hs.45080 Homo sapiens cDNA FLJ11227 fis, clone PL



432242 AW022715 

439920 H05430 

432436 AW300248 

429493 AL134708 

425555 AA359291 

450101 AV649989 



Hs.152160
Hs.288433
Hs.181693
442757 AI739528 Hs.28345 ESTs

430212 AA459153 gb:nc67f04.s1 NCI_CGAP_Pr1 Homo sapiens

437146 AA730977 gb:nw55f05.s1 NCI_CGAP_Ew1 Homo sapiens

432101 AIS18950 Hs.123642 EphA3 
459644 

453887 BE564037 Hs.36237 hypothetical protein

431170 AW971246 Hs.291022 ESTs

428062 AA420683 Hs.98321 hypothetical protein FLJ14103

443682 AI383061 Hs.47248 ESTs, Highly similar to similar to Cdd 4

400441 M15530 Hs.99879 B-cell growth factor 1 (12kD) 

453874 AW591783 Hs.36131 collagen, type XIV, alpha 1 (undulin) 

425810 AI923627 Hs.31903 ESTs 

433037 NM_014158 Hs.279938 HSPC067 protein

407162 N63855 Hs.142634 zinc finger protein 

441826 AW503603 Hs.129915 phosphotriesterase related 

446901 AI347274 gb:tc05d02.x1 NCI_CGAP_Co16 Homo sapiens

454766 AW866497 gb:QV4-SN0024-170400-176-e07 SN0024 Homo

414221 AW450979 gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su

459608 AL119471 gb:DKFZp761M141_r1 761 (synonym: hamy2) 



406149 
424027 
427531 



419216 AU076718 

447748 AI422023 

410594 AW770778 

438550 AW976002 



439443 AF086261 

418824 AW751651 

451273 NM_014811

430607 AW973521 

432702 AW973953 

414195 BE263293 

425570 AA359558 

414935 C15671 

453153 N53893 

430832 AI073913

439867 AA847510

419780 AA713522 

433420 AI674093 

434690 AI867679 

436572 AA723274 

447044 AF030107 

431688 AA513906 
403133 

414885 AA157531 

432111 AW972777 

410073 AW408163 



406687 M31126 
403378 

431369 BE184455 

438580 AA811262 

409191 AW818390 

412282 BE160188 

411966 AA099113 

443915 Z40763 



Hs.97957 ESTs 
Hs.231510 ESTs 

gb:ye68g01.r1 So 



Hs.265165 ESTs 

gb:zh90h08.r1 Soares_fetal_liver_spleen_

Hs.24641 cytoskeleton associated protein 2

Hs.127892 ESTs

Hs.53542 choreoacanthocytosis gene; KIAA0986 prot

Hs.26163 KIAA0649 gene product 

Hs.247324 mitochon ' " 

Hs.293744 ESTs 



Hs.24360 ESTs 

Hs.100686 ESTs, Weakly similar to JE0350 Anterior

Hs.161292 ESTs 

Hs.87752 ESTs 

Hs.293961 ESTs, Moderately similar to putative DNA 

Hs.148410 ESTs 

Hs.279596 ESTs 

Hs.17165 regulator of G-protein signalling 13 

gb:ng67c08.s1 NCI_CGAP_Lip2 Homo sapiens 

Hs.269276 ESTs, Moderately similar to S65657 alpha 

gb:EST384871 MAGE resequences, MAGL Homo 

Hs.58433 catenin (cadherin-associated protein), a

Hs.12496 ESTs, Weakly similar to ALU4_HUMAN ALU S

Hs.205353 ectonucleoside triphosphate diphosphohyd

Hs.311 phosphoribosyl pyrophosphate amidotransf

Hs.115830 heparan sulfate (glucosamine) 3-O-sulfot
Hs.272620 pregnancy specific beta-1-glycoprotein 9



Hs.251754 
Hs.299202 
Hs.175613 

Hs.118609 
Hs.135292
Hs.180828



secretory leukocyte protease inhibitor (



409070 AA063003 

432713 AL118623 

419384 AA490866 

418858 AW961605 

408444 AW661839 

432128 AA127221

418027 AB037807 

446060 Z42061 

436196 AK001084 

411987 AA375975 



Hs.120633 

Hs.224560 
Hs.29494 
Hs.39429 
Hs.21145 
Hs.253204 
Hs.117037
Hs.83293 
Hs.27004 



gb:RC5-ST0293-061299-031-C(E ST0293 He 
ESTs 

PRO1912 protein
ESTs 

hypothetical pro einFr M0 

ESTs 

ESTs 

hypothetical protein 
426261 AW242243

414564 AA164803

427897 NM_017413 

419160 AA911342 

420111 AA255652 



453823 AL137967

432074 AA525248

431848 AI378857

447072 D61594 

409723 AW885757 

425627 AF019612 

435090 BE217923 

449369 AA001256 

425514 AF112345

455821 BE143341

427224 AL135554



Hs.168670 
Hs.71994 
Hs.303084 



glutaminyl-peptide c 
peroxisomal farnesylated protein
ESTs, Weakly similar to I38022 hypotheti
apelin; peptide ligand for APJ receptor 
KIAA1559 protein 

gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens
syntaxin binding protein 3 

I i-iel, ca'cium cctivad ~i 
gb:DKFZp761D2315_r1 761 (synonym: hamy2)



ESTs, Highly similar to AF1 75283 1 z 



Hs.149723 
Hs.126758 
Hs.17279 
Hs.257862 

Hs.297007 membrane-bound transcription factor prol 



integrin, alpha 10 

gb:MR0-HT0162-191099-002-dj4 HT0162 Ho
sine oculis homeobox (Drosophila) homolo



436032 AA150797 Hs.109276 latexin protein
404356

434205 AF119861 Hs.283032 hypothetical protein PRO2015






417549 AW239285 

431277 AA501806 

454056 AI368836 
401694 

423531 AW752782 

431364 AW971382 

445908 R13580 

448390 AL035414 

449939 T86420 

455678 BE066007 
404555 

418186 BE541042 

419981 AA897581

449581 AI989517 



Hs.178703 ESTs

Hs.82359 tumor necrosis factor receptor superfami

Hs.249965 ESTs

Hs.24808 ESTs, Weakly similar to I38022 hypotheti



Hs.129750 hypothetical protein FLJ10546

Hs.294016 ESTs, Moderately similar to B34087 hypot

Hs.13436 Homo sapiens clone 24425 mRNA sequence 

Hs.21068 hypothetical protein 

Hs.272139 ESTs 



453345 AA302852 
424335 AW021508 
451072 AA013451 
417845 AL117461
411571 AA122393 
438035 AA938198 
432374 W68815 
400241 

408908 BE296227 
409564 AA045857 
411151 AW866497 
446009 AI989885 
456855 AF035528 
416441 BE407197 
414891 R27124 
430172 AA468591 
422109 S73265 
422985 AU076411 
448552 AW973653 
422373 AK001843 
450726 AW204600 
438379 N23018 
412608 AA247995 
451270 AW341392 
412140 AA219691 
415992 C05837 
415533 T74009 
437132 AL049353 
415304 T66079 
450152 AI138535 
421147 AW592157 
421413 AI826128 
451750 AA280851 
417924 AU077231 



Hs.23240 Homo sapiens cDNA: FLJ21848 fis, clone H

Hs.128773 ESTs 

Hs.181605 ESTs

Hs.282884 ESTs

Hs.132136 solute carrier family 4, sodium bicarbon

Hs.9443 zinc finger protein 202

Hs.145444 Homo sapiens cDNA FLJ11494 fis, clone HE

Hs.90063 neurocalcin delta 

Hs.28170 ESTs 

Hs.117929 ESTs

Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f

Hs.70811 hypothetical protein FLJ20516

Hs.146123 poly(A) polymerase gamma

Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PL

Hs.250822 



Hs.153863 MAD (mothers against decapentaplegic, Dr

gb:601301552F1 NIH_MGC_21 Homo sapiens
Hs.268754 Homo sapiens cDNA FLJ11949 fis, clone HE
Hs.161889 ESTs
Hs.1473
Hs.1602
Hs.20104
Hs.115700
Hs.250505 
Hs.171391 
Hs.44898 

Hs.235795 ESTs 
Hs.73625 RAB6 interacting, kii 
Hs.145807 hypothetical protein FLJ13593 
Hs.268738 ESTs, Weakly similar to ALU7_HUMAN Al 
Hs.297268 ESTs 
Hs.12799
Hs.22968 
Hs.293299 
Hs.57637 
Hs.226930 
Hs.82932 



FLJ00052
Homo sapiens cDNA: FLJ23515 fis, clone
retinoic acid receptor, alpha
C-terminal binding protein 2
Homo sapiens clone TCCCTA00151



Homo sapiens clone IMAGE:451939, mRNA se
ESTs 

ESTs, Weakly similar to A49364 59 protei 



cyclin D1 (PRAD1: parathyroid i 



406945 K01383 
452449 AW068658 
436009 H57130 
418637 T86737 
420495 AI338247 
449203 AI634578 
437751 AA767373 
428412 AA428240 
425707 AF115402
441967 AA972742 
403317 U02687 
406018 

410566 AA373210 
416747 AW876523 
431229 AA496479 
435148 AI918049
436349 AI445255 
446895 AA166655
448582 AI538880 
442762 AF035119 
442738 AW002370 
451874 AW963137 
448076 AJ133123 
443484 AI091458 
430686 NM_001942
446100 AW967109 
407624 AW157431
435079 AA664192 
428923 BE047698 
422496 AA311301 
413385 M34455 
408418 AW963897
446733 AA863360 
427434 BE538374 
428822 W28418 
459325 AW088369 
416996 W91892 
426638 NM_012337
408000 L11690
418894 W73921 
424687 J05070 
437378 AI198823 
454100 AI693231 
452786 R61362 
437311 AA370041 
400631 AF173937 
440028 AW473676 
426490 NM_001621
424103 NM_001918 
407995 AI094748 
449911 AI262106
449509 AA001615 
452762 AW501435 
422839 AI674784



Hs.173451 metallothionein 1A (functional)

Hs.20943 ESTs

Hs.120925 ESTs

Hs.193536 ESTs 

Hs.98314 Homo sapiens mRNA; cDNA DKFZp586L0120 (f 

Hs.282121 ESTs 

Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A 

Hs.126083 ESTs 

Hs.11713 E74-like factor 5 (ets domain transcript

Hs.209786 ESTs

Hs.385 fms-related tyrosine kinase 3

Hs.43047 Homo sapiens cDNA FLJ13585 fis, clone PL

Hs.15929 hypothetical protein FLJ12910 

Hs.259929 ESTs 

Hs.124961 ESTs

Hs.115315 ESTs



Hs.94i 

Hs.131055 
Hs.194233 
Hs.20196 
Hs.134559



ESTs 
ESTs 



Hs.301732 
Hs.30715 
Hs.282184 



gb:ac05b03.s1 Stratagene lung (937210) H 

ESTs 

ESTs 

indoleamine-pyrrole 2,3 dioxygenase 
KIAA1435 protein 

ESTs, Weakly similar to fatty acid omega 
hypothetical protein MGC5306 
potassium voltage-gated channel, Isk-rel
ESTs 



bullous pemphigoid antigen 



Hs.151738 
Hs.160473
Hs.126043
Hs.106642 



ESTs 



21 open reading frame 51 
ESTs, Weakly similar to T09052 hypotheti
Hs.9456 SWI/SNF related, matrix associated, acti
Hs.109494 secreted protein of unknown function
Hs.125843 ESTs, Weakly similar to T17227 hypotheti
Hs.170087 aryl hydrocarbon receptor

dihydrolipoamide branched chain transacy
hypothetical protein FLJ12787



401200 

416248 H99169 

442262 BE170651 

449754 H00820 

453908 AW613920 

446965 BE242873 

412798 AW998657 

416085 H18072 

418378 AW962081 

455995 BE179408

422411 AW749443 

410888 AW861207 

446893 AI610818 

442992 AI914699

407021 U52077 

435938 AW139680

433194 AB040883 

454790 AW820852

431130 NM_006103

434739 AA804487
406468

457023 AA001732 

416226 N55342 

422306 BE044325 

432810 AA863400 

412894 R09778 

430602 D13752 



Hs.139410 
Hs.100134 
Hs.12653 
Hs.84561 
Hs.278582 
Hs.298908 
Hs.152825 



Hs.282178 
Hs.16677 
Hs.119120 
Hs.92576 



mitochondrial ribosomal protein S25 
deleted in liver cancer 1 
ESTs, Weakly similar to B34087 hypotheti 



Hs.22511 ESTs 



gb:RC1-CT0302-120200-013-d04 CT0302 Ho 



KIAA1450 protein 

gb:RC2-ST0301-120200-011-f12 ST0301 Ho
pididyi ( < hey-ai cp eii 
ESTs 



2inFLJ10970 



Hs.173233 hypothet

Hs.34372 ESTs

Hs.227280 U6 snRNA-associated Sm-like protein

Hs.23054 ESTs

Hs.186510 ESTs

Hs.184927 cytochrome P450, subfamily XIB (steroid

Hs.293316 ESTs 
449838
447160 
422156 
440137 
423998 
436471 
414680 
426782 



413252 
427115 
444610 
400451 
435255 
450159 
418375 



AB037791 

AB020653 

AA330310 

N34524 

AA866199 

H29138 

AA719813 

AA743331 

R14614 

AB023197 

AL050201 



Hs.171397 
Hs.157113 
Hs.117662 
Hs.272572 
Hs.191254 
Hs.227743 
Hs.70769 



gb:yy56d10.s1 Soares_multiple_sclerosis_ 
ESTs 

coenzyme Q, 7 (rat, yeast) homolog



AW972853 Hs.112237 ESTs



KIAA0980 protein 

hypothetical protein DKFZp586E1923 
gb:RC5-BT0580-170300-021-F12 BT0580 Ho



400196
415734 
415189 



W87434 
AI702416
NM_003081 
AI021989 



425349 AA425234 

448515 H68441 

410557 AA085803 

442562 BE379584 

413488 BE144017 

419088 AI538323 

447373 AI381922 

457465 AW301344 

413918 AW015898 
402820 

424872 AA347923 

428552 AW274560 

435464 BE548300 

449008 AW578003 

420838 AW118210

428231 U17989 

434933 R91095 

444870 AI200621 

425354 U62027 

429183 AB014604 

439155 H81076 

442787 W93048
429864 AA460039
438563 AA810665
437140 AA312799
421991 NM_014918



402318 

410878 AW809201 

414494 AA768491 

421306 AA806207 

427027 AI924294 

429088 D61542 

429859 NM_007050

428060 AA420616 

419953 BE267154 

443718 AI083580

444187 AW138466 

428048 AA705745 

420195 N44348 

417404 NM_007350

442833 AA328153 

430335 D80007

414618 AI204600 

434029 AA621763 

410945 AW811552 
421247 BE391727 

433374 AI821409 
445544 R77766 
450271 AI693900 
448084 AI467800
407864 AF069291
430998 AF128847
435542 AA687376
443544 AI076315
421103 AI625835 
405759 

446242 N66336 



gb:HA2501 Human fetal liver cDNA library 

Hs.106015 ESTs, Moderately similar to ALU1_HUMAN A 

Hs.200771 ESTs, Moderately similar to A Chain A, T

Hs.84389 synaptosomal-associated protein, 25kD

Hs.131903 ESTs

Hs.78748 KIAA0237 gene product

Hs.78146 platelet/endothelial cell adhesion molec

Hs.271609 ESTs 

Hs.79886 ribose 5-phosphate isomerase A (ribose 5 

Hs.13528 hypothetical protein FLJ14054

Hs.192997 ESTs, Mo - " or 

h ' - dolichyl-d Dspn 3 r i r i 

gb:MR0-HT0165-191199-004-d09 HT0165 Homo 

Hs.52620 integrin, beta 8

Hs.158781 ESTs

Hs.122908 DNA replication factor 



71245 

gb:EST54302 Fetal he 
Hs.129520 ESTs
Hs.192999 ESTs, Moderately simi



to 173885 serin 



Hs.5244 ESTs 
Hs.183105 nuclear autoantigen
Hs.4276 KIAA1701 protein
Hs.148504 ESTs

Hs.155935 complement component 3a receptor 1
Hs.197955 KIAA0704 protein
Hs.269001 ESTs

Hs.250723 hypothetical protein MGC2747

Hs.286 ribosomal protein L4 

Hs.134746 ESTs, Weakly similar to A46010 X-linked

Hs.283689 activator of CREM in testis 

Hs.110488 KIAA0990 protein

Hs.175225 ESTs 

Hs.40968 heparan sulfate (glucosamine) 3-O-sulfot
Hs.208558 ESTs 

Hs.314248 ESTs, Weakly similar to ALU4_HUMAN ALU S
Hs.6783 hypothetical protein FLJ22724
Hs.125889 ESTs

Hs.173259 uncharacterized bone marrow protein BM03 

Hs.227716 KIAA0934 protein 

Hs.225952 protein tyrosine phosphatase, receptor t 

Hs.249483 ESTs 

Hs.125752 ESTs

Hs.221373 ESTs 

Hs.151274 ESTs 

gb:zf41b11.s1 Soares_fetal_heart_NbHH19W
Hs.26243 Homo sapiens cDNA FLJ11177 fis, clone PL
Hs.82101 pleckstrin homology-like domain, family
Hs.88201 ESTs, Weakly similar to A Chain A, Cryst
Hs.239499 KIAA0185 protein
Hs.96978 hypothetical protein MGC10764
Hs.170434 Homo sapiens cDNA r 11424: - Ion 

gb:QV2-ST0145-071299-017-h10 ST0145 Homo 
Hs.102910 general transcription factor IIH, polype 
Hs.332789 EST 

Hs.271593 ESTs, Moderately similar to A47582 B-cel 
Hs.200920 ESTs 

Hs.271000 ESTs, Weakly similar to I38022 hypotheti
Hs.40539 chromosome 8 open reading frame 1 
Hs.204038 indolethylamine N-i 
Hs.269533 ESTs 



Hs.7360 ESTs 
432839 AA579465

439224 AW471088 

410976 R36207 

454574 AW809109 

411020 NM_006770

452279 AA286844

446891 AL036877 

434294 AJ271379 

449057 AB037784 

432769 AA620814 

441224 AU076964 

407891 AA486620

429017 AA463605 

406817 AI936028 

430566 AA481282

449208 AW253635 

451397 AA017432 

452042 H38857 

444779 AI192105 

433672 BE281165 

415964 AA171850

420273 AI652864 

411354 AW992424 

422389 AF240635 

446994 AV650435 

417793 AW405434 

422654 AA3U316 

425999 AW513051 
405634 

451562 H04150 

431958 X63629 

422095 AI868872 

442010 AI032680 

442991 BE281238 

425312 AA354940 

415191 AA190381 

416406 D86961 

425316 AA354977

413753 U17760 

452241 AL050204 

433571 AA765256 

417094 NM_005895

409190 AU076536 

413783 AA314337

423867 AA331886

429418 AI381023 

445829 AI452457 

452366 AK000464 

425704 U79293 

446593 W79572 
400462 

422003 AA361760 

444585 AW170015 

444898 AI201548 
403525 

443031 AW134696 

430818 AI311928 

423690 AA329648 

440941 BE258362 

409627 AW997628 

433258 AI806626 

412863 AA121673 

435476 AA326108 



445019 AI362520 

435472 AW972330 

424882 AI379461 

430473 AW130690

431353 M86528 

438118 AW753311 



Hs.67726 
Hs.61250 
Hs.282878 
Hs.76194 
Hs.22941 
Hs.144959 
Hs.7753 
Hs.41135 
Hs.66295 

Hs.190149 
Hs.48643 
Hs.84529 
Hs.243901 
Hs.147170 
Hs.288038 
Hs.42251 
Hs.197257 
Hs.288141 
Hs.115897 
Hs.16755 
Hs.82575 
Hs.163725 
Hs.332981 



hypothetical protein KIAA1335 
ESTs, Highly similar to T08692 hypotheti
hypothetical protein MGC10744 
gb:MR4-ST0117-070100-027-a04 ST0117 Ho



calumenin 
endomucin-2 

multi-PDZ-domain-containing protein
gb:wo47a09.x1 NCI_CGAP_Gas4 Homo sapiens



hypothetical protein MGC3156 




Hs.132213 
Hs.8886 
Hs.145958
Hs.120810 
Hs.79299 
Hs.191565 
Hs.75517 
Hs.28540 
Hs.135191 
Hs.81182 
Hs.50984 



Hs.29276 
Hs.159264
Hs.13277 



hypothetical protein FLJ20424



lipoma HMGIC fusion partner-like 2 
ESTs, Moderately similar to T14342 NSD1 
laminin, beta 3 (nicein (125kD), kalinin
Homo sapiens mRNA; cDNA DKFZp586F1223 (f 
ESTs, Weakly similar to unnamed protein 
histamine N-methyltransferase 
sarcoma amplified sequence 
ribosomal protein S7
gb:EST35757 Embryo, 8 week



hypothetical protein FLJ20457 
" i clone 23948 mRNA sequence 
hypothetical protein FLJ22054



Hs.49418 ESTs 



Hs.23804 

Hs.7535 

Hs.313637 

Hs.207300 

Hs.59757 

Hs.33829 

Hs.127826 

Hs.262206 

Hs.300938 

Hs.153203



Hs.153636 
Hs.59962 
Hs.266902 
Hs.259415 



gb:qo89h04.x1 NCI_CGAP_Kid5 Homo sapiens
ESTs, Weakly similar to PN0099 son3 prot 
COBW-like protein 
ESTs 

ESTs, Weakly similar to ALUB_HUMAN !!!! 

zinc finger protein 281

bHLH protein DEC2 

RalGEF-like protein 3, mouse homolog 



histone deacetylase 3 

triggering receptor expressed on myeloid 

far upstream element (FUSE) binding prot 

ESTs 

neurotrophin 5 (neurotrophin 4/5) 



415258 AW752247 

420314 H81671 

437733 AI792574 

452019 AL157503

453118 AW195849
430706 NM_003540

420568 F09247 

452759 AW590773 

408496 AI683802 

420674 NM_000055 

410358 AW975168 

450086 AW016343 

410853 H04588 

438607 AW080237 

422232 D43945 

432801 NM_016260 
402490 

446551 AI308176

438315 R56795 

445261 T79759 
401985 

420335 AA258771 

424698 AA164366

435413 AI267476 

458175 AW295024 

458433 AL135352

417494 AI369494

416045 H15990 

424181 AL039482 

445873 AA250970 
411578 AW238524 
453116 AI276680 
425692 D90041 
435608 AW183971 
402102 

458308 AI828155 
438177 BE327015 
415205 H71616 
427244 AA402400 
441384 AA447849 
438979 AW976218 



Hs.247816 



histone family, member C 

Hs.247735 protocadherin alpha 10 

Hs.258996 ESTs 

Hs.136182 ESTs 

Hs.1327 butyrylcholinesterase 

Hs.13337 ESTs, Weakly similar to unnamed protein 

Hs.233301 ESTs 

Hs.30469 ESTs 



Hs.65636 ESTs 

Hs.82419 ESTs 

Hs.282939 ESTs, Weakly similar to 138022 hypotheti 

Hs.43616 Homo sapiens mRNA for FLJ00029 protein,

Hs.151973 hypothetical protein FLJ23511

Hs.46559 ESTs

Hs.150434 ESTs

Hs.255883 ESTs, Weakly similar to 138022 hypotheti 

Hs.222137 ESTs 

Hs.31403 ESTs 

Hs.142517 Homo sapiens mRNA; cDNA DKFZp434P0810 (f

Hs.251946 poly(A)-binding protein, cytoplasmic 1-1

gb:xp27c05.x1 NCI_CGAP_HN10 Homo sapiens 



Hs I <l 



ESTs 



Homo sapiens cDNA: FLJ22182 fis, clone H



gb:Homo sapiens putative mitochondrial s 
NYD-SP14 protein

gb:UI-H-DI2-ahv-h-03-0-UI.s1 NCI_CGAP_Su
cofactor required for Sp1 transcriptiona 



408873 
426226 
401157 
407721 
425646 

419839 
402855 
443644 
445572 



AI311457 
N72185 
AL046017 
AA769045 



Hs.161712 
Hs.99472 
Hs.44189 
Hs.182278 



gb:oa80h07.s1 NCI_CGAP_GCB1 Homo sapiens 

dual-specificity tyrosine-(Y)-phosphoryl 
cyclin-dependent kinase-like 2 (CDC2-rel 
beta-site APP-cleaving enzyme 
phospholipase A2, group VII (platelet-ac 

ESTs, Moderately similar to S65657 alpha 
ESTs 

S100 calcium-binding protein A3 
ESTs, Weakly similar to FAT DROME CADHER 
ESTs

hypothetical protein FLJ10875



423347 AI650412 



437373 H67505 
439217 AF086041
446609 BE395090 



Homo sapiens cDNA FLJ13542 fis, clone PL
gb:Homo sapiens Ras-GRF2 mRNA, partial c 
KIAA1733 protein 



TABLE 3B: List of accession numbers for primekeys lacking UnigeneID's for Table 3A. For each such probeset is listed a gene cluster
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered
and Alignment Tools (DoubleTwist, Oakland California). Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
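
As a rough illustration of how one row of this listing could be handled programmatically, the following Python sketch splits a row into its Pkey, CAT number, and accession list. The whitespace-separated row layout, the `ClusterRow` type, and the `parse_cluster_row` helper are assumptions made for illustration and are not part of the original table. The sample row pairs Pkey 411578 with the cluster number and accessions that appear to correspond to it in the flattened listing.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    """One Table 3B style row (hypothetical container, for illustration only)."""
    pkey: str              # Unique Eos probeset identifier number
    cat_number: str        # Gene cluster number
    accessions: List[str]  # Genbank accession numbers comprising the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into Pkey, CAT number and accessions.

    Assumes the row is laid out as: Pkey, CAT number, then one or more
    accession numbers. Raises ValueError when fewer than three fields exist.
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return ClusterRow(pkey=fields[0], cat_number=fields[1], accessions=fields[2:])


# Sample row assembled from the listing; the pairing is inferred, not confirmed.
row = parse_cluster_row("411578 125013.1 AW238524 AA094059 R59022")
print(row.pkey, row.cat_number, len(row.accessions))
```

Keeping the accessions as a list makes it straightforward to look up every sequence that contributes to a given gene cluster.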
410008
410615 
410672 
410784 
410785 
410888 
410901 



113722.1 

115812_1
1212203_1
1214882_1
1221005_1
1221055.1 
1225955.1 
1226077.1 



AW864542M056567AW882724 . 
BE087807 AW301579 
AA063458 AA063018 AI444822 
AA075368 AA075369 

AA296961 AA296B89 AA076945 M077528 AA077497 
AA079552 BE142525 BE142527 
AW772721AW873372 H89212 
AW794600AW794730 
AW803201 BE079700 BE062940 

AW803341 AW803265 AWB03403 AW803466 AW803402 AW80341 3 AW803268 AW303396 AW803334 AW803355 



411018 1229132.1 

411139 1233781.1 

411151 1234022.1 

411479 1247077.1 

411514 1248638.1 

411578 125013.1 

411745 1256160.1 

411880 1263110.1 

411905 1265181.1 

412167 1280605.1 

412209 1283610.1 

412248 1285000.1 

412282 1287679.1 

412303 1288130.1 

412566 1306469.1 

412877 1333953.1 
412879 1334272.1 
413087 1348720.1 
413136 1350379.1 
413252 1355877.1 
413382 1365954.1 
413488 1373234.1 
413499 1373910.1 
413618 1380384.1 
413538 1381238.1 
413642 
413875 
414109 
414221 



417859 
413378 
418536 
419546 
419637 



1381386.1 
1396766.1 
1417307.1 



1465790.-' 
1509157.1 
1531080.1 



168747.2 
1691070.1 
1692157.1 
1703724 1 
174656.1 
177402.1 
185766.1 



420621 
420637 
421065 
422063 
422156 

422342 
422429 
422977 



AW810001 AW810092 AW81 01 70 AW809884 AW80S664 AW81 0353 AW81 0428 AW81 0209 AW81 0429 AW810154 AW8101 68 AW809786 

AW810006AW809672AW809694AW.il A'A L " 810432 AW8099S0 

AW81 1 1 14 AW81 1095 AW81 1087 AWO 1 1 1 24 AW81 1054 AW81 1094 AW81 1 1 57 

AW81 1552 AW81 1470 AW81 1523 AW81 1482 AW81 1514 AWB1 1456 AW81 1525 

AW813004AW812962AW812961 

AW813242 BE146089AW813195AW813173AW813205BE145953BE146212AW813196AW854582AW813241 BE061582 
AW813428AW813444AW813367AW813368AW813429AW813424 
AW81 9461 AW819478 AW819479 AW81 9559 AW81 9505 
AW866497 AW819775 AW819868 AW866602 AW866561 

AW848047 AW848202 AW848631 AW848142 AW348702 AW848121 AW848632 AW848140 AWB48571 AW848009 AW848067 AW848069 

AW848905AW848214 

AW8501 78 AW850233 AW850445 AW850446 

AW238524 AA094059 R59022 

AW867826AW859896 

AW872477BE088101 T05990 

BE265067 BE264978AW875420 

AW897230 AW897252 AW897244 AW897231 AW897263 

AW901456AW901450AW901441 

BE176480 AW903298 AW903313 

BE160188 AW935785 BE160401 BE160319 BE160313 BE160395 

AW936336AW936339 

AW962574 BE073261 

BE01 1 168 BE006371 BE011172 BE01 1313 BE011 166 



BE066941 BE066911 BE066979 BE055929 BE066925 



BE144017BE185527BE144023 
BE144884 H97942 

BE154078 BE1 54006 BE153891 BE153523 BE153983 

BE154O37 E BE15487TdIi54850 BE154877BE154835BE154849 BE1 54902 BE1 54905 BE154867 BE154901 BE154904 BE154899 
BE176776 H85072 
BE250744 BE294770 

AW450979AA136653AA136656AW419381AA984358AA492073 BE168945M809054AW238038BE011212BE011359BE011367BE011368 

BE011362 BE011215BE011355 BE011363 

BE390440 

C15671 C15333 C15833 D81433 C14557 C14811 

R41400 H05559 Z38637 F02023 T16516 

F13168 R21289T77628 

AA381209AA381245AA167683 

R52414 R61681 H12009 T74429 

H51299 H44619 H46391 R86024 H51892T72744 

H48713H83057 H48694 H55247 

BE407197 AA182474 AA180369 BE275628 BE275131 

H62953 N76608N72413 

H69044T47567 H75691 T50292 

AW934714 BE161007 BE162500 AW749902 AW749864 BE162498 BE161005 AA190449 AW513465 BE161006 BE162499 

BE019670AA191284M191255 

AW974345 BE145400 T97088 AA639076 

R06552T98985R08656 

T99898 T92080 T91447 R07806 R07779 

T26453Z44226 R20425 

AW962081 AA218925AA354237 

AW749855 AA225995 AW750208 AW750206 

AA244199AA244272 H57440 

W27493 AA248063 AA249685 AA429679 

R77402 M262462 AA250988 R06794 

M255652 AA280911 AW967920 AA262684 

AA278808 BE082076 BE081812 BE081581 

AW976153AA278945M747691 

AA329711 AA287436AA283148 

BE156476 BE156473BE156474BE156475AA302839 „ 

N34524 AA305071 AW954803 AA502335 AI43343C A1203597 AW025570 AW255323 AW850787 AA317554 AW993643 AW835572 AW385512 

AI334955 W32951 H62656 H53902 R38904 AW835732 

AA309272AA309312AW961837 

M631498 A™ 2 91 AA491211 AA761823 M7M555 M768099 AA808286 AI934069 AA570223 AA574389 M582438 A1745346 AW964510 

AA319642AW853758 H56414 

AW864848 AA322213 AA322209 AW961624 
22769_1
229362_1
231498_1
232507_1
232732_1
244505_1
247933_1



BE090499 BE090502 BE090504 BE090497 AA3272B5 



430818 
430844 
430933 
431071 



432189 
432222 
432779 
432803 
432869 
433347 
433492 



434613 
434638 
434950 
434995 
435079 
435463 
435634 
436532 
437146 



440320 
440674 
443613 
443657 
444314 
444610 
444910 
446096 
446901 
447197 
448404 
449540 



325757_1
327550_1
328799_1
331543_1
335512_1

341618_1
342819_1
343347_1
354024_1
354267_1
355475_1
36388_1
367934_1
370400_1
371919_1
373061_1
380572_1
384889_1



575391_1
575685_1
600667_1
612257_1
624951_1



711523_1
761515_1
80945_2



1129667_1
1130674_1
115682_1
1170435_1



AL049377 AL079930 AL047223 AW885968 AA385235 

BE090503 BE090490 BE090483 BE090501 BE090500 BE090365 BE090505 BE090509 
AA330259 AA661806 M502431 AW974633 AA649496 
AW753967 AA370795 AA331630 AW962550 
AA331886 AW962S59 AW962655 T89841 
AA347923 AA347928 AW961769 
AA3521 1 1 AW962247 AA429695 
AA524596AA358494AA513507 
AA359558AA359819AW963122 
AA769045 AA372590 AW963633 
AA379913AA379981 AW963523 
AA382291 AA994657 
T96203AA405343 T96121 
AA705745 AA420850 AA420433 
AA431765AA432015 
AA459153AI7185Q3M469225 

AI311928AA936030T51931M609816AA487195AA66/-2C7 
T94960AA487579T95013 

AW863635BE179901 BE179402AW863789 M489179BE010747 BE010748 
AA491379H86020AW969148 
AW971 240 M493843 AA493723 

AW970622 AA503009 AA502998 AA502989 AA502805 T92188 
AA508196BE142920 AI280311 AI205616 D61709 
AA513906 AA847734 AI357044 
AW972777 AA526795 M526305 
AA527941 AI810608 AI620190 M635266 
AI204995 AW827539 AW969908 AW440776 AA52875B 
AW979241 AA565006 M847102 
M565398AW894072 H97930 
AW974094 AA569074 M602574 
AF023130 AF181 250 AA984703 AA694303 AA351792 
AW605349 AW262898 N41060 AA594852 

AW295399 AW207772 AW300641 AW070290 BE348854 AW1 70383 AA600968 AA778832 
AW342028 AA641080 AA603282 
M743991 M604852 AW272737 
AA625804 AW418787 AW074833 AI575642 AI393368 
AA631439 AI086355 AI082577 
AI821826AA64C657 M658836AA650055AI821079 
H50758AF147434 H50666 
AW974892AA654375 

AW974995AI821880AI821932A!791195AA659617A1821137AA658925 
AA664192 H60250 T71388 
AA682507AW851124 
T82384R05307 M693714 
AA721522AW975443 T93070 

AA730977AI261584 AA334473 Z43283 AWB75861 AW938044 BE150701 AW936262 
AI239729 AI251752 M485791 BE568425 AW962Q58 
BE151745 BE336853 D53271 T94955 AA774994 
AW975186M807807D29548 
AF085839 R69137 AW188788 R69254 
AA828995 AA834879 AI926361 
AA879294 N67538 AI474541 
BE561546 Z25124 AI307139 Z28800 
AI079356 W23287 
R14973R14967AI081006 
AI140497 AW749625 AW749626 AW749644 
AI174783 R83559 R12271 
AI201849BE069007AW946544 
AI27S454 AI633717 AI27S116 
AI347274AW844024 
R36075AI366546 R36167 
BE089973AI498612AW805032 
M001713 H63836 , 
AA005129 AA679084 AA694399 
AA009926 BE149301 
AI698339 AI909260 AI909259 
M442176AA259181 
AA018072 N46370 R84847 
AI902519AI902518AI902516 
AW812256 AW812257 AI906423 AI906422 
AL137967 BE064160 BE064186 
BE065902 AW749032 AW003637 
I AW177821 AW177896 AW177867 

I BE141183 AW178157 AW178162 AW178166 AW178172 AW845393 AW173159 AW178222 AW178213 AW178215 AW178090 AW178091 
AW178161 AW178207 AW178210 AW173214 AW178212 BE140913 BE140917 AVY178135 AW178205 AW178209 AW178223 AW178220 
AW178206AW178203AW178165AW178168AW178160AW178136AW845878AW178131AW178138AW178105AW845894AW178129 
AW845810AW845828AW178216AVM 8112 W17821 I 2 BE 40315 AW178221 AW178130 AW178134AW178095 AW178108 
AW178133AW178164AW178218AW173rU.\1,> A,'.P8156 W178103 BE141189 AW178170 AW845816 BE141586 AW178156 
AW178104 AW178163 AW178093 AW178208 .' W178137 AV P8140 AW178219 BE141592 AW845901 BE141580 AW178155 BE141598 BE140957 
AW389668 AW389657 AW609198 AW389649 
N71277AW390764 
AW752571 AW847602 AA077979 
BE065985 BE065944 BE066008 BE066083 BE066093 
1229132.1
1230493J 
1234022.1 



1259748.1 
1262534.1 
1266222.1 
1273576.1 
1278158.1 

129372_"1 
1321229.1 
1321443.1 



1331885.1 
1347545.1 
1349716J 

1370914J 
1372714.1 
1377119.1 
398552 1 



75803.1 
798085.1 
920051.1 
925883.1 
956605.1 



AW807605 AW807690 AW807839 AW807752 AW807673 AW807667 AW807955 AW807760 AW807615 AW807898 AW807849 AW807821 
AW807832aAEC76V ") W807829AW8C W07825 AW807603 AW807612 AW807908 AW807595 AW807617 

AW807678 AA8C7687 / ^ 1 1 AW 17921 AW807596 AW8 f 2 Ai 07688 AW807609 AW807584 AW807770 AW807593 AW807754 
AW807679 AW807957 AW807633 AW807763 AW807902 AW807840 AW807819 AW807836 AW807769 AW807685 AW807847 AW807674 
AW807686 AW807670 AW807917 AW807577 AW307680 AW807900 AW807669 AW807952 AW807907 AW807846 AW807756 

AW807835 AW807608 AW8C7753 AW807601 AW807956 

AW809109AW809112AW809122AW80912o/tt r J-'l ' U 1 AW809131 AW809113 AW809111 AW809132 

AW810001 AW810092AW810170AW809884AVW) 54 « J U c i W810428 AW810209 AW810429 AW810154AW810168 AW809786 

AW810006 AW809672 AW809694 AW810552 AW81 0345 AW310432 AW809960 

AW813089W28102 

AW813428 AW8I3444 AW813367 AW813368 AW813429 AW813424 

AW81 5098 BE154843BE1 54831 

AW866497 AW319775 AW819868 AW866602 AW866561 

AW820626AW820621 AW820608 

AW820852 AW820773 AW821088 

AW833711 AW833620AW833699 

AW847645 AW847791 AW854083 AW853945 



BE160198 AW935898 T1 1520 AW935930 AW856073 AW861034 
BE1541 13 AW856797 AW856847 AW861128 AW856817 

AW860972 AW862598 AW862599 AW860988 AW860983 AW860898 AW860925 AW860922 AW860986 AW860984 AW860989 

AW947884 AW947918 AW947888 AW947883 AW947897 AW947910 AW947905 AW864751 AW947878 

AW902103 AW869012 AW869139 

AW876627 AW876630 AW876631 AW876625 

BE144384 AW887474 AW887403 BE144386 

AW893961 AW893998 AW894034 AW894019 



AA102322 

AW9 4 83 7 6 6 0 2 8 AvSiS AW983610 AW983688 AW983501 AW933645 AW983607 AW983640 AW983625 AW98361 2 AW983642 AW983687 
AW983602 AW98362'. AW983634 AW983637 AW983632 AW983617 AW983635 AW983630 AW983635 AW983639 AW983616 AW983689 
AW98364 1 AW983621 AW983603 AW983609 AW983623 AW983544 AW9335 1 8 AW98361 5 AW98361 1 AW983504 AW983686 AW983622 
AW98351 9 AW933633 AW9835B9 A'A'983605 AWS83626 AW9B3643 AW9B3631 AW983627 AW983613 AW983614 AW983685 AW983593 
AW9B3590AW983594AW9836A if 
BE003714 BE003721 DE003720 BE003716 
BE063031 BE063002BE063008BE 
BE066007 BE066017BE066C74 



BE141140BE141139 BE 141 105 BE14 1 143 BE141127 BE1412C2 BE141108 
BE143341 BE143344BE143378BE143358 

BE149024 BE149056 BE152826 BE149025 BE149057 BE152819 BE149030 BE149062 BE149023 BE149055 
BE179015 BE178965 BE1 79010 BE179002 BE178961 BE179005 BE1789B4 BE179012 BE179011 BE178963 BE178997 

AW4OT 4 9^AAiSmSaW419381 AA984358AA492073 BE168945AA809054AW238038BE011212BE011359 BE011357 BE011368 
BE011352 BE011215 BE01 1365 BE011353 
AL157525 N72696 BE622492 
AI630223 AI630470 

AI904723 AI904725 AI904729 AI90W2 A S04758 AI904736 

AW391177W45021 v 
AJ003531 AJ003650AJ003651 



;i) numbers. "Dunham I. et al." refers to the publication er



ed The DNA sequence of 



Nt_position: Indicates nu



es DNA strand from which ex> 



400451 8113550 Minus 

400452 9929659 Minus 
400608 9887666 Minus 
400639 9887597 Plus 
400641 8117693 Plus 
400756 8119084 Minus 
400859 9757499 Minus 
400880 9931121 Plus 
400889 9958234 Minus 
400983 8081198 Plus 
401045 8117619 Plus 
401049 7232177 Plus 
401078 3687273 Plus 
401094 9965511 Plus 
401103 8568122 Minus 
401157 9438289 Minus 
401189 9690246 Minus 



197610-197785 
96756-97558 
23150-23580 



91888-92018,98131-9829^,99474-99570 



107903-108832 
90044-90184,91111-91345
149157-150692 
105052-105171 

127130 10-302,139283-139506 



114133-114247,11 
111586-111306,114791-114916,115419-115583,116351-116446,116847-116907,122853-123067,124982-125407



152209-152383 
213509-214450 
15736-16352 
92607-92813 



402102 
402103 
402230 
402318 



402820 
402855 
403133 
403271 
403277
403310 
403329 



403574 
403637 
403677 
403691 
403760 
403776 



404249 
404274 
404285 
404288 
404356 
404443 
404476 
404488 
404513 
404548 
404555 
404561 
404588 
404593 



9662953 
7331427 
7230852 



162268-162474,163089-163195 
31137-31293 

166394-166556,168167-168395 



76516-76690 

43921-44049,46181-46273 
25026-25091,25844-25920 
82274-82443 



38314-38634 

134283-134485 

27494-27642 



96450-96598 
92839-93036 

112733-113001,114599-114735 1 

73431-73602 

173358-179553 

162431-153243 

46652-47332 



5542-6176 

142647-142771,145531-145762 
55008-55083,6286(K3051 



45910-46260,47563-47824 
1414-1513,1624-1756 
3502-4002,4070-4308 
12609-12773 

29042-29135,46597-46699 

55512-55781 

7066-7210 

64270-64633 

104127-104318 

32282-32416 

3512-3691 

126433-126623 

87198-87441 

101841-102043 

64835-64994 

112837-113339 

83896-84162 

63963-54157 

69039-70100 

40059-40210 

74922-75788 

110443-110733 



91057-91188 
147512-148011 
121230-121714 



73121-7 



106297-106447, 1 08^62-1 085S6 
70284-70518 

124361-124520,124914-125050 
59362-59607 

17856-17957,18302-18412,18837-1892 
53624-53759 
405970
406018 
406091 
406092 
406149 
406195 
406333 
406468 
406506 



9213235 Plus 

9795553 Plus 

7711374 Minus 

7711566 Plus 

8272559 Minus 



51762-51978 
45795-46295 
37795-38168 
197370-197935 

251370-251797,252168-252882 

44464-45164 

36293-36827 



TABLE 30A: ABOUT 1840 GENES UP-REGULATED IN IDIOPATHIC PULMONARY FIBROSIS (IPF) COMPARED TO HYPERSENSITIVITY PNEUMONITIS (HP) 

Table 30A lists about 1840 genes that are up-regulated in idiopathic pulmonary fibrosis (IPF) samples as compared with hypersensitivity pneumonitis (HP) samples. These were
selected from about 59580 probesets on an Affymetrix/Eos Hu03 Gene Chip array such that the ratio of "average" idiopathic pulmonary fibrosis sample expression level to "average"
hypersensitivity pneumonitis sample expression was greater than or equal to about 2.0. The "average" idiopathic pulmonary fibrosis level was set to the 90th percentile amongst
idiopathic pulmonary fibrosis samples. The "average" hypersensitivity pneumonitis level was set to the 90th percentile amongst hypersensitivity pneumonitis samples.
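
The selection rule described above can be sketched in Python as follows. This is only an illustrative reading of the stated criterion: the `percentile` and `select_upregulated` helpers, the linear-interpolation percentile, the skipping of probesets whose HP level is not positive, and the expression values in the usage lines are all assumptions and do not come from the original analysis.

```python
from typing import Dict, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Return the pct-th percentile (0-100) using linear interpolation."""
    if not values:
        raise ValueError("need at least one value")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def select_upregulated(
    ipf: Dict[str, Sequence[float]],
    hp: Dict[str, Sequence[float]],
    min_ratio: float = 2.0,
    pct: float = 90.0,
) -> Dict[str, float]:
    """Map each selected Pkey to its ratio R1 (IPF "average" / HP "average").

    The "average" of each group is taken as its pct-th percentile, and a
    probeset is kept when the ratio is at least min_ratio.
    """
    selected = {}
    for pkey, ipf_levels in ipf.items():
        hp_levels = hp.get(pkey)
        if not hp_levels:
            continue  # no HP measurements for this probeset
        hp_avg = percentile(hp_levels, pct)
        if hp_avg <= 0:
            continue  # assumption: ratio is undefined for non-positive levels
        ratio = percentile(ipf_levels, pct) / hp_avg
        if ratio >= min_ratio:
            selected[pkey] = ratio
    return selected


# Made-up expression levels, keyed by Pkey, purely to exercise the functions.
ipf_levels = {"450478": [10, 20, 30, 40, 50], "440209": [5, 5, 5, 5, 5]}
hp_levels = {"450478": [1, 2, 3, 4, 5], "440209": [4, 4, 4, 4, 4]}
print(select_upregulated(ipf_levels, hp_levels))
```

Using a high percentile rather than a mean lets a probeset qualify when a subset of samples expresses it strongly, which appears to be the intent of setting the "average" to the 90th percentile.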

Pkey: Unique Eos probeset identifier number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of IPF (idiopathic pulmonary fibrosis) to HP (hypersensitivity pneumonitis)

Pkey ExAccn UnigeneID Unigene Title

450478 AW451709 Hs.271200 ESTs

hypothetical protein FLJ10244



440209 H05049 

407811 AW190902 

439606 W79123 

425259 AL049280 

426230 AA367019 

416653 AA768553 

420481 U50525 
403574 

415817 U88967

419519 AI198719 

435256 AF193766 

423017 AW178761 



428766 AA477989 

441802 AA968635 

453549 Y07494 

447410 AI470235 

442353 BE379594 
405494 

442377 AA993807 

409928 AL137163 

420407 AA814732 

415236 R41400 

451562 H04150 
403310 

445189 AI936450 

409632 W74001 

439780 AL109688 
402076 

415025 AW207091 

406690 M29540 

438557 AW364104 

428042 AA419529 

418007 M13509 

409545 BE296182 

445619 AU076643 

411966 AA099113 

440274 R24595 

442879 AF032922 

419236 AA330447 

420185 AL044055 

415672 N53097 

455488 AA102322 

420026 AI831190 



cysteine knot superfamily 1, BMP antagon 
G protein-coupled receptor 87 

p mRN DNAf i K1 i 



Hs.78867 
Hs.176376 
Hs.13072 
Hs.227948 
Hs.30622 



Hs.34114 
Hs.172698 
Hs.49136 

Hs.157367
Hs.57549 
Hs.145010 



Human BRCA2 region, mRNA sequence CG029 

protein tyrosine phosphatase, receptor-t 
ESTs 

cytokine-like protein C17 

serine (or cysteine) proteinase inhibito 

Homo sapiens cDNA FLJ13010 fis, clone NT

ESTs 
ESTs 

ATPase, Na+/K+ transporting, alpha 2 (+)
EST 

ESTs, Moderately similar to ALU7_HUMAN A 



Hs.107708 ESTs 



hypothetical protein dJ473B4 
lipopolysaccaride-specific response 5-li 
gb:yf34b12.s1 Soares infant brain 1NIB H 



carcinoembryonic antigen-related 
hypothetical protein FLJ21924 
myxovirus (influenza) resistance



hypothetical protein MGC 167 



Hs.166576
Hs.135100 
Hs.293184 



gb:z!90f03.r1 Stratagene colon (937204) 



gb:Homo sapiens mRNA for immunoglobulin
414175 AI308876

424693 BE169810 

436397 AA715013 

440504 AI948966 

409718 D86640 
403525 

418986 AI123555 

416035 H42314 

400292 AA250737 

442849 R10099 



gb:yo09e02.s1 Soares adult brain N2b5HB5



444963 AI916973 

447530 AW192063 

444992 R37658 

416575 W02414 

431211 M86849 

451830 H18433 

446466 H38026 
404043 

423454 AL110456

455540 BE080231 

434683 AW298724 

445898 AF070623 

422306 BE044325 

428895 AA437124

450018 AA421642 

419249 X14767 

455047 AW852530 

454039 AW079064 
403537 

414725 AA769791 

409073 AA063458 
403329 

434001 AW950905 

459664 

401497 

410797 AW857191 

411402 BE297855 

448844 AI581519 

435202 AI971313 

439418 AI282149 

443584 AI807036 
434362 AF129505
430838 N46664 
430882 BE174240 
440129 AA865818 
437636 AA764781 
455747 BE074910 
455464 AW983901 
418771 AA807881 
434820 AI821863 
440615 AI733055 
454482 BE147919 
400432 AX015809 
436508 AW604381 
423507 AA328329 
407415 AF073323 

443152 T49951 

451325 AA021283 

440515 AJ131245 
406333 

409105 AW467539 

408000 L11690 

421482 AL135462 

442757 AI739528 
459717 

436637 AI783529 

412222 AA528283 

450101 AV649989 

410901 AW810001 

426217 AW131888 

441640 AI733345 

422977 AA531498 

425361 AA355933 

414955 C15506 
411965 BE467339 



Hs.2164 

Hs.251871 
Hs.213603 
Hs.248865 
Hs.21375 
Hs.38383 



ESTs, Moderately similar to JC5238 gala: 



gap junction protein, beta 2, 26kD (conn 

KIAA1035 protein 

arrestin 3, retinal (X-arrestin)



Hs.469 

Hs.202639 
Hs.13423 
Hs.227280 
Hs.187247
Hs.24309 
Hs.89768 



Hs.245540 ESTs 



hypothetical protein FLJ11106 
gamma-aminobutyric acid (GABA) A recepto
gb:PM1-CT0243-071099-001-g06 CT0243 Homo



Hs.177164 
Hs.170204 
Hs.56213 
Hs.267245 
Hs.86492 
Hs.169395 
Hs.79024 
Hs.174936 
Hs.291844 



serine (or cysteine) proteinase inhibito



KIAA0551 protein 

ESTs, Highly similar to FXD2_HUMAN FORKH
hypothetical protein FLJ14803



hypothetical protein FLJ12015

ESTs, Weakly similar to S71885 Ste20-lik



gb:ns90f05.x5 NCI_CGAP_Pr3 Homo sapiens
ESTs 

gb:RC3-HT0230-160200-016-a08 HT0230 Homo 
Sequence 8 from Patent WO9950285 
ESTs, Weakly similar to S00755 pleckstri



SEC24 (S. cerevisiae) related gene famil



Hs.255877 
Hs.620 
Hs.104715 
Hs.28345 

Hs.26766 
Hs.292737 
Hs.24385 



Hs.132221 
Hs.280115 



Human hbc647 mRNA sequence 
gb:MR4-ST0124-270300-005-b11 ST0124 Homo
ESTs, Weakly similar to hypothetical pro 



gb:np83h04.s1 NCI_CGAP_Thy1
hypothetical protein FLJ12401 
gb:C15506 Clontech human aorta 



gb:CM3-CT0341-190400-152-h12 CT0341 Homo 
ESTs 



417181 L10123 

426097 BE327369 

439199 R40373 

440728 AW086077 

434381 AA631834

417428 N87579 

431291 N25521 

431242 AA987742 

426985 BE394849 

442360 AI374521 

452171 AI863302 

440801 AA906366 

411738 AW859353 

431447 AA505138 

433485 AI493076 
401365 

408281 BE141183 

411657 AW855583 

423065 R96158 

428528 AI004034 

454036 AA374756 

417252 AA195014 

417135 AA422067 
403089 

420691 AA829433 

412147 AW395984 

425578 U65652 

430403 AF039390 

454438 AA224053 

435434 AA680387

420828 AA280778 

435586 AI279137 

452393 H87398 

416170 H42454 

408691 AW250525 

428912 AW103117 
455511 BE144762 
413849 BE173561 
401189 

425733 F13287 
447863 AL047611 



417919 AI928203 
405784 

431853 AA521034 

409629 AW449589 



427173 
433717 
406777 
410481 



439382 
404957 
436332 
446393 
452728 
456386 



Hs.112238
Hs.26299 
Hs.153272 



Hs.26299 ESTs 



Hs.25275 
Hs.251278 
Hs.131905 
Hs.29055 
Hs.211930
Hs.190535 



416972 
427099
403344 
438993 



surfactant protein A binding protein 
ESTs 



Homo sapiens cDNA FLJ22715 fis, clone H
gb:np77h05.s1 NCI_CGAP_Pr2 Homo sapiens 
gb:LL2030F Human fetal heart, Lambda ZAP 
Kruppel-type zinc finger protein 
KIAA1201 protein 

ESTs, Moderately similar to Z195_HUMAN Z

ESTs 

EST 

ESTs 

gb:MR1-CT0353-150300-102-a12 CT0353 Homo 



AF063536 

T23625 

R34107 

AA429750 

AF015592 

N93057 

AA877124 

BE247684 

AL049679 
AW014174 
AI915676 



gb:MR0-HT0071-191 199-001 -bO* HT0071 Homo 
gb:CM4-CT0278-221099-027-f07 CT0278 Homo
Homo sapiens, clone MGC5406, mRNA, comp 
ESTs 

Homo sapiens mRNA for KIAA1771 protein,



Hs.158313 
Hs.241382 
Hs.172405 
Hs.187850 
Hs.186878 



Hs.220645 

Hs.98949 

Hs.15384 

Hs.159388 
Hs.288885 
Hs.163725 

Hs.86379 

Hs.70834 
Hs.279724 

Hs.97540 

Hs.150580 
Hs.321450 
Hs.75113 
Hs.28853 
Hs.54888 
Hs.172844
Hs.103070 

Hs.82302 
Hs.301956 
Hs.239708 

Hs.311054 

Hs.173560 



gb:QV4-NN0039-040500-197-e08 NN0039 Homo 




gb:2821626.5prime NIH_MGC_7 Homo sapiens
ESTs, Weakly similar to MEA6 [H.sapiens]
gb:CM0.HT0180-04-.099-065-b04 HT0180 Homo 
AP1 gamma subunit binding protein 1 



gb:zj18f08.s1 Soares_fetal_liver_spleen_



ESTs 

gb:AF063536 Homo sapiens library (Yu Y) 
putative translation initiation factor 
pregnancy specific beta-1-glycoprotein 2 
general transcription factor IIA
CDC7 (cell division cycle 7, S. cerevisi 
ESTs 
ESTs 



gb:47e1 Human retina cDNA randomly prime 
Homo sapiens mRNA full length insert cDN
M b bb28c 1 1 x1 NIH_MGC_5 Homosapiens cDN 
odd Oz/ten-m homolog 2 (Drosophila, mous 

gb:od77b08.s1 NCI_CGAP_Ov2 Homo sapiens
Homo sapiens cDNA FLJ13752 fis, clone PL
Rag C protein



Homo sapiens mRNA; cDNA DKFZp43^E082 (fr 



441201 
434377 
440472 
418379 
435878 



Homo sapiens cDNA FLJ11382 fis, clo



gb:nx97a04.s1 NCI_CGAP_GCB1 Homo sapiens
ESTs

gb:MR0-SN0039-300300-001-c02 SN0039 Homo
438660 U95740

411597 AW852925 

415655 W05433 
404822 

441107 AA917075 
404834 

412768 AW996044 

428102 AA968441 

436511 AA721252

441247 AW118681

453098 225935 



431071 AA491379 

436298 AW293496 

440356 AI933184 

452768 AW069459 

455241 AW876249 

409070 AA063003 

409044 AI129586 

419091 T85332 

422591 L07648 
403188 

418857 D10216 

413585 AI133452 

436149 AI754308 

443582 AI383061 

437916 BE566249 

439818 AL360137 

438361 AA805666 

451221 AI949701 

455475 AW948126 

433197 AB040889 

429881 T80112 

415598 AI433165 

431220 N52937 

433132 AB026264 

424029 AB014594 

407340 AA810168 
410318 AA084050 
412400 AW948066 
427167 AI239607 
438090 AA777534 
407938 AA905097 
440454 AI733037 
417706 T90797 
428692 AI372822 
407762 AW235638 
420727 H75701 
417508 BE163512 
413525 BE145899 
425798 AA364002 
459429 AA278779 
430205 AB025904 
437458 AL390131 
451073 AI758905 
452786 R61362 
429846 AB023021 
444414 AW293214 
402615 

410585 AW770523 
425168 R95366 
449729 R72032 
459359 N99545 
456443 AW967500 
439001 AF075068 
443657 R14973 
404193 

416379 N38857 
422511 AU076442 
426603 AA382291 
412589 R28660 
421037 AI684808 
427088 AA398085 
429927 NM_001115 
453375 AI990114 
435451 AF195420 
451882 AI821324 
419983 W55955 
405001 U58196 
422182 AL043892 



Hs.49890 
Hs.190520 ESTs 



Hs.126866
Hs.291502 
Hs.128051 
Hs.85379 
Hs.300648 
Hs.164502

Hs.180138
Hs.127922
Hs.61539 



Homo sapiens, clone IMAGE:3010666, mRNA,
gb:PM0-CT0248-131099-001-f10 CT0248 Homo



Homo sapiens thymic stromal lymphopoieti

ESTs 

ESTs 

ESTs 

gb:aa65f05.r1 NCI_CGAP_GCB1 Homo sapiens 
ESTs 

ESTs, Moderately similar to S65657 alpha 
ESTs 

gb:PM4-PT0019-131299-005-B05 PT0019 Homo
ESTs 

hypothetical protein FLJ14623
ESTs 



ESTs, Highly similar to similar to Cdc14 
hypothetical protein FLJ23142
Homo sapiens mRNA full length insert cDN
Homo sapiens cDNA: FLJ23077 fis, clone L



Hs.281022 

Hs.192245 

Hs.9856 

Hs.102679 

Hs.284245 

Hs.137579 



Hs.99196 
Hs.191992 
Hs.85050 
Hs.129990
Hs.268623 
Hs.110103 
Hs.29475 
Hs.99886 
Hs.180877 



vitiligo-associated protein VIT1

ESTs, Weakly similar to S23650 retroviru

gb:RC0-MT0012-290300-031-h10 MT0012 Ho
hypothetical protein MGC11324



RNA polymerase I transcription factor RR 



H3 histone, family 3B (H3.3B)
gb:MR0-HT0208-221299-204-b10 HT0208 Homo
gb:EST74529 Pineal gland II Homo sapiens 



Hs.106642 
Hs.225945 
Hs.8752 

Hs.337501 

Hs.29235 

Hs.133543 



Hs.24305 



gb:yq37d04.s1 Soaresfet 
ESTs 

gb:za40a05.r1 Soares fetal liver spleen



Hs.197553 ESTs 
Hs.142390 ESTs 



cyclase 8 (I 

ESTs, Weakly similar toga 
ESTs 

Hjmosapiei RNA r N L FZp53 I 1 (' 
Hs.180582 Homo sapiens cDNA: FLJ21836 fis, clone H 




Hs.240091 
Hs.303005 
Hs.100445 
Hs.94030 



451917 AW391351 

432781 NM_014133 

443773 AV646452 

406964 M21305 

430682 AW971949 

449804 AI535663 

411505 AF155659 

430503 AA533574 

443305 AI050693 

415076 NM_000857 

452280 AI911410 

432189 AA527941 

406992 S82472 

441416 AI990139 

448776 BE302464 

413998 AW103807

440385 AA884283 

431673 AW971302 
401887 
404793 

422054 AA322506 

432030 AI908400 

449645 AI961092 
404476 

449336 AL119995
401200 



Hs.291252 
Hs.39379 
Hs.70565 
Hs.152274 



Homo sapiens j -v.ovr. mRNA 
PRO0618 protein

calcium channel, voltage-dependent beta 
]b:Hmsi-B|i'?.sati lite and satellite 3 
ESTs, Weakly similar to ZN91_HUMAN ZINC
ESTs

molybdenum cofactor synthesis 2



guanylate cyclase 1, soluble, beta 3 



Hs.30057 MRS2(S.. 



gb:EST25146 Cerebellum II Homo sapiens c 



Hs.15260 ESTs, Highly similar to AC007228 2 BC372



Hs.133485 ESTs 



417229 AA975096 

425403 AL023753 

436269 AA707472 

453823 AL137967

416394 H64111 

432779 AW979241 

439326 W07140 

423035 AW449679 

435766 R11673 

448067 R68568 

441605 AA984647 

414400 X06948 

418405 AI868282 

437642 AL079309 

450350 T97817 

451704 AI755209 

459037 AW439497 

419247 S65791 



434273 AA913143 

438042 AW296971 

410500 R09442 

416154 Z46122 

418432 M14156 



444338 AI937026 

427687 AW003867 

415929 AA724373 

416009 Z43062 

421515 Y11339 
403515 

435793 AB037734 

439953 AA918129 

457620 AA602711 

442006 AW975183 

453931 AL121278 

453128 AW026516 

413468 BE504766 

454600 AW810001 

451065 AW295132 

444493 R59410 

426447 AV555843 

410908 AA121686 

440364 AA910460 
406190 

430762 AI343652 

451182 D52552 

432437 W07088 

442137 AA977235 
405970 

407676 AW064111 



Hs.190760 ESTs 



adrenergic, beta, receptor kinase 2 
hypothetical protein PRO2849
Human DNA sequence from clone 119EH6 on



gb:DKFZp761D2315_r1 761 (synonym: hamy2) 
gb:yr57t03.r1 Soares fetal liver spleen 
gb:EST391351 MAGE resequences, MAGP Homo 



Hs.186498 ESTs 



Hs.174880 
Hs.205616 
Hs.290656 
Hs.89764 



Hs.49344 
Hs.105352 



Fc fragment of IgE, high affinity I, rec 
ESTs, Highly similar to KIAA1370 protein
gb:Homo sapiens mRNA full length insert 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S
EST 

gWM2-S^00^ SN0018 Homo 



insulin-like growth f; 
gb:QV3-HT0460-230200-101-M8 HT0460 Homo 
' 5inGKAP42 



hypothetical protein FLJ11006
gb:HSC12E041 normalized infant brain cDN 
GalNAc alpha-2, 6-! 



Hs.169919 
Hs.10592 
Hs.128626

Hs.105667 
Hs.296317 
Hs.293685 
Hs.128830



ESTs, Weakly similar to S72482 hypotheti 
ESTs 

acylphosphatase 2, muscle type 
gb:hz40g01.x1 NCI_CGAP_GC6 Homo sapiens
gb:MR4-ST0124-270300-005-b11 ST0124 Homo
ESTs, Weakly similar to granule cell mar
ESTs, Moderately similar to I38022 hypot



KIAA1789 protein 
ESTs 

ESTs, Weakly similar to Z192 HUMAN ZINC 



Hs.279823 ESTs 



WO 03/042661 



PCT/US02/36810 



428062 
423841 
420430 
443921 



X68242 

AK001974 

R20991 

AA420683 

AW753967 

AI703192 

AI091310 

AW379394 

AI066470 

AA812434 

R91778 

AW861207 

AA224872



Hs.134848 
Hs.145126 
Hs.134482 



gb:QV4-HT0492-270100-086-e12 HT0492 Homo
Hin-1

hypothetical protein FLJ11112
gb:yg06h01.r1 Soares
hypothetical protein FLJ14103
gb:RC2-CT0304-080100-011-h12 CT0304 Homo 
gb:wd92h04.x1 NCI_CGAP_Lu24 Homo sapiens 
ESTs 



iinlNIDH 



SMC2 (structural maintenance of chromoso



3.115088 ESTs 



)2-120200-O13-d04CT0302Homo 



446672 
431548 
416182 
422899 
417663 
405455 



451426 
408172 
401626 



AW001835 
AW296771 
AA331906 
AW294659 
T05514 
AI834273 
NM_004354 
D16471 
R07483 

AI531964
AI056618 
AW451176 



Hs.13323 hypothetical protein FLJ22059 
Hs.221999 ESTs 



Hs.9711 
Hs.79069 
Hs.121571 
Hs.180461

Hs.34447 
Hs.134314 
Hs.195954 
Hs.208063 ESTs 






404518 AI815601

413581 BE150618 

426701 AI968103 

445510 AA946676 

418663 AK001100 

447617 AI400762 

448150 AI472167 

410140 AL134435 



410767 AJ001873 

433183 AF231338 

436168 AK000883 

438456 AA913381

411185 AW821257 

411880 AW872477 

433567 AF073299 

433805 M70691C 

409434 AF278761 

440184 AB002297 

456555 AW592167 

419189 T95862 

428648 AF052728 

407995 AI094748 

413200 AA127395 

416421 AA134006 

416737 AF154335 

428356 AL046991 

429216 AI369472 

432488 AA551010 

433386 AW360833 
400889 

415294 D86980 

446190 AI279299 

417801 AA417383 

418122 R42778 

418375 NM_003081 

443367 AW071349 

446645 AI336596 

434294 AJ271379 

452372 AI885742 

414241 AA425085 

436982 AB018305 

430548 AW450575 

427119 AW880562 

437073 AI885608

437845 AA769578 



FLJ20481 



Hs.79197 CD83 antigen (a , , . 

gb:RC3-HT0272-110'.00-013-c06 HT0272 Homo 

Hs.209461 Homo sapiens cDNA FLJ12836 fis, clone NT

Hs.282824 ESTs 

Hs.41690 desmocollin 3 

Hs.176675 ESTs 

Hs.302739 ESTs 

Hs.22269 neurexin3 

gb:601342622F1 NIH_MGC_53 Homo sapiens c 
gb:QV0-ST0294-240300-173-g04 ST0294 Homo

Hs.66185 Homo sapiens mRNA, partial cDNA sequence

Hs.222024 transcription factor BMAL2 

Hs.301645 Homo sapiens cDNA FLJ10021 fis, clone HE 

Hs.190513 ESTs 

gb:PM3-ST0307-231299-001-b11 ST0307 Homo 
gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens 

Hs.103132 solute carrier family 9 (sodium/hydrogen 

Hs.112742

Hs.131581 

Hs.7022 

Hs.293299 

Hs.112318 



Hs.79306 
Hs.79691 
Hs.10338 ESTs
Hs.66407 ESTs 
Hs.216640 ESTs 

gb:PM1-CT0243-201099-004-d08 CT0243 Homo 



Hs.22217 

Hs.84389 

Hs.215337 

Hs.156294 

Hs.76194 

Hs.228474 

Hs.4007 

Hs.5378 

Hs.163203

Hs.114574

Hs.94122 

Hs.90488 



Sarcolemmal-associated protein 



ESTs, Weakly similar to B34087 hypotheti








AW847645 




gb:IL3-CT0213-280100-056-A04 CT0213 Homo


3.2 




AI904738 


Hs.76053 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


3.2 










3.2 


456536 


MW1 5 35986 


Hs.257859 


ESTs 


3.2 










3.2 




BE008347 




gb:CM0-BN0154-08040O-325JiO4 BN0154 Homo 


3.2 




M449563 




glutamate-cysteine ligase, catalytic sub


3.2 




AI745400 


Hs.204662 




3.2 




AI827786 


Hs.259044 




3.2 


452879 


AW905328 


Hs.180842 


ribosomal protein L13 


3.2 




AW819191




CM ! 2:13 C71299-06'-d08ST0283 Homo 


3.2 




AA453488 






3.2 




AI608821 


Hs.270289 




3.2 


401324 








3.2 


404731 












AI792788 




gb:o!91d05.y5 NCI_CGAP_Kid5 Homo sapiens


3.2




BE003714 




gb:QV3-BN0096-200400-161-a01 BN0096 Homo






AA889328 


Hs.112950 










Hs.139270




3.2 






Hs.6616 










Hs.191223 




3.2 
















Hs.129206




12 






Hs.62798








AB040933 




KIAA1500 protein 


3.2




AW969834 


Hs.303303 




3.2 




AW084991 


Hs.26".00 




3.2 




BE064655 




gb:RC1-BT0313-301299-012-c09 BT0313 Homo 


3.2 




AW593050 






3.2




AW958859




Homo sapiens cDNA FLJ12141 fis, clone MA 


3.2


421916 


R34441 


Hs.101007 


Hoi ir n cDN/ Fl 546 fis, clorc- 1 


3.2 


419261 




Hs.89791 


wingless-type MMTV integration site (ami 






AA236590 






a2 




AB023201 




KIAA0984 protein 


3.2 




AV653034 


Hs.297559 




3.2




AI301740 


Hs.173381


dihydropyrimidinase-like 2 


3.2 




AW514652


Hs.321637 




3.2 




AA428482 


Hs.58589




3.2 


411018 


AW813428 




gb:MR3-ST0192-010200-210-c05 ST0192 Homo


3.2 










3.2 


432223 


AA333283 


Hs.121001 


Homo sapiens, clone IMAGE:3460280, mRNA


3.2 




AW138295 


Hs.135024








AA810560 






3.2 


404404 








3.1






Hs.253800 








AU076633 




serine (or cysteine) proteinase inhibito






AW851980 


Hs.262348 


ESTs, Weakly similar to S72482 hypotheti








Hs.131357 








AA852713 










NM_007115 




















AF022913 


Hs.62187 


phosphatidylinositol glycan, class K 


3.1 




AW328038 








440046 


AW402306 




hypothetical protein FLJ10483






AI693900 






3.1 




AA450191 




hypothetical protein FLJ14624 








Hs.22229 








AA010066 


Hs.224849


Homo sapiens cDNA FLJ12583 fis, clone NT


11 








Homo sapiens cDNA FLJ13539 fis, clone PL






AB006624 




KIAA0286 protein 


3.1 




AI271945 


Hs.134984


ESTs 






R40058 












Hs.171066 








BE277929 




















AI470790 


Hs.34494 






45602i 


BE327387 


Hs.13913


KIAA1577 protein 




458023 


AW978161 


Hs.268555 






417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 




421308 


AA687322 


Hs.192843 


leucine zipper protein FKSG14 






R54418 


Hs.183745


hypothetical protein FLJ13456 




449138 


AW294215 


Hs.195631 


ESTs 




455756 


BE079307 




gb:RC1-BT0623-120200-011-g09 BT0523 Homo




428170 


H05530 


Hs.12555 


ESTs 




429878 


AA460188 


Hs.127263 


ESTs 




455000 


AW850283 


Hs.324429 


Homo sapiens cDNA FLJ14015 fis, clone HE




438369 


T77885 


Hs.83428 


nuclear factor of kappa light polypeptid 




415840 R15955 


Hs.21758 


ESTs 




444955 


AW002844 


Hs.148641 


ESTs 




43602C 


AA778177 


Hs.121724 


ESTs 




45305 


AW196690 


Hs.224259 


ESTs 





425178 H16097
402145 

410685 M497117 

449238 AA428229 

456737 BE247203 

438214 H06076 

436250 AY004867 

411622 AI807894 

418454 M315308 

449357 AI076363 



452778 R71338 

455577 BE006341 

408235 AA053381 

436194 AK001074 

452073 AA625150 

427050 AA397789 

427244 M402400 

448405 AW207634 

433767 AA609245 

421376 M287948 

441519 AA972740 
404367 

453502 AL039786 

421948 L42583 

438165 AA779344 
400608 
404042 



Hs.161027 ESTs
Hs.129500
Hs.331561 
Hs.124831
Hs.26320



447046 AA326187 

455851 BE146879 

429014 AI800518 

400227 

439037 AF075084 

439693 AI741816 

427533 R36022 

418355 L42563 

433536 AI732163 

448446 AI521251 

449623 C00719 

445568 H00918

440448 AA885428 

428201 AA424158

444148 AW003204 

447972 AL137275 

432584 AA928829 

440925 AW511090 

428398 AI249368 

415913 H70302 

418145 AF121260 

413252 BE074910 

400335 Y13187 

426132 AA370501 

436938 AW139680

437980 R50393 

455955 BE162394 



Hs.21273 
Hs.334309 
Hs.138136 



Hs.83577 
Hs.248067



ESTs, Moderately similar to ALU1_HUMAN A
muscle-specific RING-finger protein 3
CGI-67 protein 
TRABID protein 




Hs.1165 

Hs.171030 
Hs.120440
Hs.268744 
Hs.125646
Hs.206461 
Hs.151167 
Hs.20137 
Hs.47099 
Hs.130419 



I i i H DNA D 1B176 (f 

hypothetical protein FLJ14991 
Homo sapiens cDNA FLJ11778 fis, clone HE 



Homo sapiens cDNA FLJ10212 fis, clone HE



gb:af13a11.s1 Soares_testis_NHT Ho



transcription factor NYD-sp10 

ESTs, Weakly similar to 151 0254A L1 rej 



Hs.55950 
Hs.12839 
Hs.240112 KIAA0276 



Weakly similar to KIAA1330 protein



gb:Homo sapiens full length insert cDNA
ESTs 

hypothetical protein FLJ22624
ATPase, H+/K+ tr
ESTs, Weakly sin 
ESTs 



EST 



ESTs 

gb:yr95f07.r1 Soares fetal liver spleen 
cysteine and glycine-rich protein 3 (car 
gb:RC5-BT0580-170300-021-F12 BT0580 Homo 
Homo sapiens dmd gene, intron 11

11 Prostate gland I Homo sapier. 



Hs.36288 
Hs.158989



445900 AF070526 

457041 AA399018 

415716 N59294 

422336 AI751322 

451664 AA889081 

407244 M10014 

455249 AW876538 

428862 NM_000346

406076 AL390179 
405302 

400325 MB5292 

408408 AF070571 

423119 AA322201 

424152 AL133591

431980 AA523696 



Hs.13429
Hs.250835 
Hs.179662 
Hs.115285 



fibrinogen, gamma polypeptide 
gb:RC3-PT0028-190100-012-b06 PT0028 Homo
SRY (sex determining region Y)-box 9 (ca 
Homo sapiens mRNA; cDNA DKFZp547P134 (fr 



Hs.247924 
Hs.44690 
Hs.131976 
Hs.141480 
Hs.324507 






425793 AA363946 
401462 

458817 AI522129 

422163 AF027208 

419875 AA853410 

423047 NM_005323

425349 AA425234 
401368 

418531 R96760 

447290 AI476732 

441143 AI027604 

431292 AA370141 
405783 

444459 AI680624 

402112 R58624 

425745 U44060 

444827 R09764 

451195 U10492 

411417 AW845481 

418343 AA216372 

431595 AA508196 

436187 AK000998 

455699 BE068121 

459440 BE048054 

428832 AA578229 

423492 AF020761 

424235 NM_003181

437913 AI140825



Hs.93557 proenkephalin 

Hs.123064 H1 histone family, member T (testis-spec

Hs.79886 ribose 5-phosphate isomerase A (ribose 5

Hs.183758 ESTs 



Hs.263912 ESTs 
Hs.159650 ESTs 



chromogranin B (secretogranin 1) 



Hs.2186 
Hs.14427 
Hs.20415 
Hs.438 



Hs.121623 ESTs 



gb:MR1-CT0056-201199-008-b04 CT0056 Homo 
ESTs 

gb:nh60f07.s1 NCI_CGAP_Pr8 Homo sapiens 
Homo sapiens cDNA FLJ10136 fis, clone HE
gb:CM1-BT0368-061299-060-302 BT0368 Homo
gb:tz46c03.y1 NCI_CGAP_Brn52 Homo sapien 
ESTs, Moderately similar to ZN91_HUMAN Z
ubiquitin-conjugating enzyme E2D 1 (homo
T brachyury (mouse) homolog





NM_006134














hypothetical protein FLJ21924 
















AI928184 














ESTs, Weakly similar to S43603 RNA bindi






























Homo sapiens cDNA FLJ14177 fis, clone NT






AF210317 














gb:HSC3ID041 normalized infant brain cDN






AL137279 


Hs.130187


Homo sapiens mRNA; cDNA DKFZp4340'214 (f 


2.9


443246 


T75157 




EST Vv Jyoiiiii!ar'toT08680hypotheti 


2.9


450877


AI799608 


Hs.29178 


ESTs 


2.9




AF085922 


Hs.113968


ESTs 




401526 








2.9 


408751 


N91663 


Hs.258343 


ESTs 


2.9 


417320 


AA195667 


Hs.86022 


ESTs 


2.9


442927 


AI024347 


Hs.131519 


ESTs 


2.9


444125 


AI124882 


Hs.118121 


ESTs 


2.9 


452148 


AF007143 


Hs.28205 


Homo sapiens clone 23733 mRNA sequence


2.9 


453901 


BE065902 




gb:RC2-BT0318-150200-011-b09 BT0318 Homo


2.9 


452589 


BE159915 


Hs.61406 


ESTs, Weakly similar to 2004399A chromos


2.8 


403011 






2.8


436154 


AA764950 


Hs.119898 


ESTs 


2.8


408221 


AA912183 






2.8


430345 


AK000282 


Hs.239681


hypothetical protein FLJ20275


2.8 


415399 


T26994 


Hs.177198 


ESTs 


2.0 


441817 


AW969706 


Hs.293332 


ESTs 


2.8 


443556 


AA256769 


Hs.94949 


methylmalonyl-CoA epimerase 


2.8 


455092 


BE152428 




gb:CM0-HT0323-151299-126-b04 HT0323 Homo


2.3 


439703 




Hs.196245


ESTs 


2.8 


411024 


BE062590 




gb:QV1-BT0260-281099-023-f05 BT0260 Homo 


2.8


414546 


BE379492 




gb:601236215F1 NIH_MGC_44 Homo sapiens c


2.8 


434715 


BE005346 


Hs.116410


ESTs 


2.8 


407594 


AW057584 


Hs.160681 


ESTs 


2.8 


439235 


N45513 


Hs.46608 


ESTs 


2.8


453736 


AL118674






2.8 


404967 








2.8 


437783 


AI683150


Hs.201550 


ESTs, Weakly similar to ALU1_HUMAN ALU S


2.8 


412887 


BE007420 




gb:PM3-BN0142-200300-001-c04 BN0142 Homo 


2.8 


426942 


AA393551 


Hs.97450 


ESTs 


2.8 


403513 








2.8 


419077 


AA233885 


Hs.164526 


ESTs 


2.8 


421823 


N40850 


Hs.28625 


ESTs 


2.8 


425664 


AJ006276 




transient receptor potential channel 5 


2.8 




H38108 




ESTs 


2.8 


407803 


AW081681 


Hs.269064


ESTs, Weakly similar to T4268S hypotheti


2.8 


409642 


AW450809 


Hs.257347 


ESTs 


2.8 


439492 


AF086310 


Hs.103159 


ESTs 


2.8 


420814 


AA721156 


Hs.190440 


ESTs 


2.8


449508 


AK001566 


Hs.23618 


hypothetical protein FLJ10704 


2.8


428350 


AW873520 


Hs.112017 




2.8 
2.8 


405456 
442459 


AI264628 


Hs.125428 


ESTs 


2.8 


415763 


Z42285 


Hs.5181 


proliferation-associated 2G4, 38kD


2.8
428532 AF157326

436720 AW975902 

449539 W80363 

415533 T74009 

408749 H65489 
404652 

423130 AW897586 

424960 BE245380 
402131 

419530 X98330 

456118 AA380267 

444217 AV648751 

449579 AW207260 

412323 AW937143 

418912 NM_000685

433513 AI566356 

448912 D83781 

451496 AW503407 

420273 AI652864 

451949 U03884 

420756 M411800 

423532 BE090503 

425012 T77666 

441609 AA946764 

448870 BE181783 

451206 H86228 

457314 M479597 

416192 NM_005036

418888 AU076801 

455310 AW893961 

459450 AA426429 

424188 AW954552 

423575 C18863 

427443 M402713 



445611 AW418497 

412494 AL133900 

408243 Y00787 

407308 H67394 

423728 AW891294 

404587 M99587 

410483 BE163567 

415431 AW384459 

416805 F13271 

417177 NM_004458 

427134 AA398409 

428137 AA421792 

430844 T94960

441218 BE327561 

440911 AA909536 

411131 AW819212 

438602 AI167149 

441191 AI593930 
403776 

420159 AI572490 






Hs.58446 
Hs.268738 
Hs.250659 



gb:EST388011 MAGE resequences, MAGN Homo
ESTs 

ESTs, Weakly similar to ALU7_HUMAN ALU S 



Hs.153952 5' nucleotidase (CD73)

Hs.90821
Hs.78277 
Hs.282395



Hs.171437 ESTs



Hs.172004 
Hs.79981 
Hs.81452 
Hs.173561 
Hs.170999 



449396 BE169100 
450777 AA255646 
458043 AW979009 



452755 AW138937 

438055 AA776655 

420908 AL049974 

417806 AI867277 

430698 AA492071 

441969 AI733386 

446092 N33522 

456869 BE467912 

413617 BE155373 

444931 AV652066 

412236 AW902583 

453264 AA034137 

438370 AA843242 
406092 

454874 AW836407 

455880 BE153208 

459275 AI808913 



potassium inwardly-rectifying channel, s 



toA47582 B-cel' gr 
to I38022 hypotheti
hypothetical protein DKFZp586J1119



Homo sapiens cDNA FLJ20325 fis, clone HE 
ESTs 

ADP-ribosylation factor domain protein 1 



ESTs, Weakly similar to I38022 hypotheti
solute carrier family 4, sodium bicarbon 
homeo box (H6 family) 1 
gb:QV3-HT0460-230200-101-b08 HT0460 He 



Hs.202345 ESTs 



gb:ye38d07.r1 Stratagene lung (937210) H 



Hs.99785
Hs.98244 
Hs.225998 



Hs.213436 
Hs.270942 
Hs.100261 



Homo sapiens cDNA: FLJ21245 fis, clone C
ESTs 

DKFZP434C153 protein 
ESTs 

ESTs, Moderately similar to S47073 finge



Homo sapiens mRNA; cDNA DKFZp564B222 (fr 



Hs.145894 
Hs.154294
Hs.279518 



amyloid beta (A4) precursor-like protein 
general transcription factor IIIA
gb:QV3-NN1024-260400-171-f10 NN1024 Homo
411987 AA375975

441884 AW172630 

416211 R14625 

433128 AB021923 

452259 AA317439 

453696 AI989482 

456208 AW299698 

425876 AW005887 



410181 AI468210 

410871 D78367 

412706 R97106 

422897 AA679784 

436329 AI798750 

436679 AI127483 

455992 BE179015 

452594 AU076405 

419296 AA236115 

454747 AW818535 



426662 AA879474 
400268 

438782 AA828380 

443764 F23283 

412486 AF210650 

411514 AW850178 

457900 AW976692 

417375 AA253314 

426582 AV660038 

435608 AW183971 

413627 BE182082 

432415 T16971 

445660 AI702668 

441396 AW293677 

452046 AB018345 

454936 AW846082 

454434 AA083558 

436888 AI942357 

431613 AA018515 

408812 BE397160 

416690 H84078 

436471 AA719813 



407494 U10072 

452821 AW471181 

434222 AF119886 

429864 AA460039 

456273 AF154846



Hs.23357 
Hs.28707 
Hs.146286
Hs.334525 
Hs.234058 



ESTs, Moderately similar to ALU8_HUMAN A
ESTs 

gb:yg45c03.r1 Soares infant brain 1NIB H
EST-YD1 protein 

signal sequence receptor, gamma (translo

kinesin family member 13A 

Homo sapiens cDNA FLJ14890 fis, clone PL

ESTs 

gb:zi07e05.r1 Soares_fetal_liver_spleen_



Hs.4290 

Hs.163960 

Hs.120451 



Hs.122710 ESTs 
Hs.126733 ESTs 



Homo sapiens heat shock transcription fa 
ESTs, Weakly similar to unnamed protein 
gb:RC3-HT0612-080500-013-h10 HT0612 Homo
solute carrier family 26 (sulfate transp

gb:RC1-ST0278-140300-016-f05 ST0278 Homo
gb:RC1-BT0720-280300-011-f08 BT0720 Homo 
gb:RC3-CN0056-170300-015-f08 CN0056 Homo 



gb:HSPD22980 HM3 Homo sapiens cDNA clone
NAG19 protein 

gb:IL3-CT0219-271099-022-H12 CT0219 Homo 



Hs.2056 

Hs.250896 

Hs.246973 

Hs.289014 

Hs.201955 

Hs.186890 

Hs.27657 



Hs.187870 
Hs.264482 
Hs.254763 
Hs.108551 
Hs.117662 
Hs.158836 
Hs.168241 
Hs.201219 



ESTs, Weakly similar to A43932 mucin 2 p 



hypothetical protein FLJ10242
ESTs, Weakly similar to S18946 ultra hig 
gb:RC3-BN0034-240400-017-d09 BN0034 Homo 
gb:Human forkhead family (AFX1) mRNA, pa

Homo sapiens PRO2591 mRNA, complete cds



435113 AA665469 

438188 AA779975

438295 AI394151

450181 H05254 

433764 AW753676 

433229 AB040925 

443718 AI083580 

453930 AA419466 

400365 Y10259 

419318 AW969742 

428527 AI902398 
404414 

446444 AI743737 

411354 AW992424 

417918 AA209205 

418310 AA814100 

454481 AW794878 
441216 BE299830
438257 AW474419 
442264 AI278777 
419505 AA243660 
417596 R07343 
443555 N71710 
444517 AI939339 
454867 AW835924 



Hs.128859 ESTs



Hs.221373 
Hs.121276 
Hs.36727 
Hs.274501 



ESTs, Weakly similar to R5HU7A ribosomal
hypothetical protein FLJ10903
H.sapiens ACTH receptor mRNA 3'UTR 



Hs.34492 Cyt19 protein

Hs.24370 ESTs 
Hs.288141 
Hs.163754

Hs.86693 ESTs 

Hs.314230 ESTs, Highly similar to clock [H.sapiens 

Hs.192908 ESTs 

Hs.224794 ESTs 

Hs.263455 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.143061 ESTs 

Hs.226823 ESTs, Moderately similar to I54374 gene

Hs.21398 ESTs, Moderately similar to A Chain A, H 

Hs.146883 ESTs 

gb:PM1-LT0018-250200-002-e09 LT0018 Homo 

Hs.313803 ESTs, Highly similar to AF157833 1 noncl
424210 T71397

447748 AI422023 

411970 AA099142 

441233 AA972965 
400706 

435033 H75391 

440836 AW370882 

431086 AI829692 

455110 BE154505 

455678 BE066007 

413088 BE064962 

436195 AK001084 

437395 BE140396 

432374 W68815 

442690 AI014727 

441700 AA233556 

410286 AI739159 
403271 

429761 AI276780 

437085 AA743935 

450822 AW771850 

457506 AF131757 



417249 N58198 

423554 M90516 

440400 AA994364 

440460 H92571 

446302 AI285848 

424012 AW368377 

428944 M780181 

419647 M348947 

455500 AW963582 

419435 AI200540 

452450 AW854891 

434699 AA643587 

436421 AI678031 

447505 AL049266 

419758 U31973 

426698 AA394104 

446861 AI696519 

423025 AA831267 

447524 AI640326 

411736 AW859089 

416334 H53139 

446818 AI342668 

454836 AW833711

442278 AI733477 

420854 AW296927 

408729 AA195764 

455675 BE065984 

411660 AW855718 

455252 AW876627 

409156 N76186 

423175 W27595 

430291 AV660345 
401785 
402369 

439079 AF085937 

412566 AW962574 

411463 AW847645 

413758 BE162391 
404988 

409446 AI561173 

412613 AA653507 

417909 R35514 

454743 AW818456 
406364 
404108 

411934 AW876538 

415747 AA381209 

443526 AW792804 

415319 AA659823 

454854 AW835775 

458771 AW295151

414349 BE512968 

426589 AW954460 

429515 AL031228 



Hs.112627 ESTs

Hs.222707 KIAA1718 protein 

Hs.161338 ESTs 

Hs.13804 hypothetical protein dJ462O23.2

Hs.135568 ESTs 



Hs.255748 
Hs.222080 
Hs.211561 




Hs.182898 
Hs.1674 
Hs.125594 
Hs.234478 
Hs.149757 
Hs.137569 
Hs.41182 
Hs.91816 

Hs.14877 
Hs.19472C 
Hs.149425 
Hs.122813



ESTs 

gb:PM0-HT0343-281299-003-e06 HT0343 Homo
gb:RC3-BT0319-120200-014-dOS BT0319 Homo 
gb:RC1-BT0313-130400-016-c02 BT0313 Homo
Homo sapiens cDNA FLJ10222 fis, clone HE
hypothetical protein DKFZp762O076 
Homo sapiens cDNA FLJ11346 fis, clone PL
ESTs, Weakly similar to B28096 line-1 pr
hypothetical protein FLJ12994
DKFZP586N2124 protein 



glutamine-fructose-6-phosphate transamin
ESTs, Weakly similar to T25472 hypotheti 
Homo sapiens cDNA: FLJ22648 fis, clone H
ESTs 

tumor protein 63 kDa with strong homolog 
Homo sapiens DC47 mRNA, complete cds 
hypothetical protein 

gb:EST375655 MAGE resequences, MAGH Homo
ESTs, V I i 3 (define not av 
ATP-binding cassette, sub-family G (WHIT
Homo sapiens cDNA FLJ11980 fis, clone HE
ESTs, Weakly similar to ZN22_HUMAN ZINC
Homo sapiens mRNA; cDNA DKFZp564F093 (fr 
phosphodiesterase 6C, cGMP-specific, con 
ESTs 



Hs.62713 ESTs 



Hs.279765 
Hs.155313 



!1-CT0350-150200-002-d02 CT0350 He 



gb:QV4-TT0008-25119< 



gb:UI-H-BW0-ajc-c-07-0-UI.s1 NCI_CGAP_Su
ESTs 

gb:RC3-BT0319-120200-014-a06 BT0319 Homo
gb:RC1-CT0279-070100-021-a06 CT0279 Homo 
gb:RC3-PT0028-120200-013-d11 PT0028 Homo
M-phase phosphoprotein homolog 
hypothetical protein FLJ14627 
CGI-49 protein 



gb:EST374647 MAGE resequences, MAGG Homo 
gb:IL3-CT0213-280100-056-A04 CT0213 Homo
gb:PM2-HT0451-090100-002-f04 HT0451 Homo



hypothetical protein FLJ13089
gb:yg66e08.r1 Soares infant brain 1NIBH
KIAA0211 gene product 



Hs.163612 ESTs 



Hs.164007 ESTs 



01172296F1 NIH_MGC_15 Homo sapiens c
gb:EST365530 MAGE resequences, MAGC Homo 
DNA segment on chromosome 6 (unique, pse 
fibrinogen, B beta polypeptide 
gb:QV4-HT0536-040500-193-f05 HT0536 Homo
429322 D86984

446252 AI283125 

453361 AA035197 

455275 AW977806 

449410 AA001356 

451403 AA885569

458861 AI630223 

416944 N22809

423010 W25436 

412505 AA974491 

446399 AI298405 

412139 BE044975 
403691 

424025 AI701852 

420352 BE258835 

422342 AA309272 

447343 M256641 

457770 BE065030 

427731 AA411750 

426920 M393351 

427794 AA709186 

429903 AL134197 

454190 AW177821 

414550 BE379808 

436391 AJ227892 
401989 

423346 AI267677 

444905 AW135853 

424539 L02911 
400861 

458426 AI084514 

429520 AA160142 



Hs.199243 KIAA0231 protein 



gb:EST389810 MAGE resequences, MAGO Homo 
ESTs 

Homo sapiens cDNA FLJ14511 fis, clone NT
gb:ad06a00.r1 Proliferating Erythroid Ce
gb:yw41e07 1 Weiz i nn ill cd , Ep I "I 
ESTs, Moderately similar to I38022 hypot



gb:hn25b10.x1 NCI_CGAP_Thy7 Homo sapie

Hs.301296 Homo sapiens cDNA: FLJ23131 fis, clone L
gb:601117374F1 NIH_MGC_16 Homo sapier 
gb:EST180209 Liver, hepatocellular carci 

Hs.236894 ESTs, Highly similar to S02392 alpha-2-m 

Hs.124179 ESTs 

Hs.20943 ESTs

Hs.132121 ESTs 

Hs.282963 ESTs 

Hs.93597 



432253 AW090822 

439786 AV652707 

445432 AV653771 

453052 R63050 

454137 AW500340 

459608 AL119471 

452843 AI796769 

433222 AW514472 

449595 AA164569 

431532 AI537817 

425967 NM_007159
400641 

430982 R17432

432808 NM_015985

410845 AW807182 

411561 H81164 

421083 AA283628 

423513 AF035960 

434627 AI221894 



427888 AA417088 

425541 AA359119 

422840 U44059 
404708 
405008 

453772 BE281431 

411035 AA857218 

444575 AI264847 

449311 AI657014 

454277 AW295069 

454565 AW807605

454597 AW809648 

416208 AW291168 

407851 NM_014496 

446554 AA151730 

452850 H23230 
406468 

407300 AA102616 

408617 R61736 

409627 AWS97628 

416665 H72974 

417404 NM_007350

418994 AA296520 



Hs.146274 ESTs 

Hs.127416 
Hs.209228 
Hs.150402 



Hs.205058 hypothetical protein FLJ20075 

gb:H.sapiens endogenous retrovirus HERV- 
HS.20447Q ESTs 

Hs.150555 protein predicted by clone 23733 
Hs.137598 ESTs 



Hs.16323 
Hs.297007
Hs.22545 



Hs.164318 
Hs.274174 
Hs.33756 



Homo sapiens, Similar to G antigen 8, cl 
membrane-bound transcription factor prot 
r ii i 9,n I i Fl M 3351 I MIT 
gb:tt49a12.x1 NCI_CGAP_GC5 Homo sapiens 
ESTs, Weakly similar to Z157_HUMAN ZINC
gb:MR4-ST0098-120100-CG1-b06 ST0098 Homo 
gb:MR4-ST0124-2S1099-015-d01 ST0124 Homo 
EST \ ly nnlar to MUC2_HUMAN MUCIN 
ribosomal protein S6 kinase, 90kD, polyp 
nudix (nucleoside diphosphate linked moi 
ESTs, Moderately similar to A46010 X-lin 

gb:zn43e07.s1 Stratagene HeLacells393 



gb:yu28a10.s1 Soares fetal liver spleen 
pleckstrin homology-like domain, family
selectin E (endothelial adhesion molecul 
hypothetical protein FLJ21940 
ESTs 

transcription elongation factor (SIII) e
Homo sapiens mRNA full length insert cDN 
gb:AV653771 GLC Homo sapiens cDNA clone



Hs.208320 
Hs.238415 
Hs.34550 
Hs.270311 



ESTs, Moderately similar to ALU8_HUMAN A 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S 



Homo sapiens clone IMAGE:32106, mRNA seq



Hs.285017 
Hs.298015 
Hs.129719 
Hs.39311 
Hs.134273 

Hs.333513 



gb:MR4-ST0062-180200-001-b04 ST0062 Homo

hypothetical protein FLJ21799

lT vi mlartol3B022hypotheti
409041 AB033025

423244 AL039379 

453874 AW591783 

448889 BE140902 



Hs.50081 
Hs.209602 
Hs.36131 



403053 R58624 

409298 AA070211 

411322 AW887330 

447640 AI417187 

447849 AI538147 

458763 AI693417
404638 

413986 Z43567 

407721 Y12735 

422321 AA906427 

408238 W959C1

436747 AW977192 

437048 AA743240 

413143 BE067232 
404561 



ESTs, Weakly similar to ubiquitous TPR m 
collagen, type XIV, alpha 1 (undulin)
gb:IL1-HT0028-240699-001-C11 HT0028 Homo 
ESTs 

ADP-ribosyltransferase 1 
eukaryotic translation elongation factor 
gb:zm68c04.s1 ' 



Hs.293309 ESTs 



gb:HSC1FC021 normalized infant brain cDN 
dual-specificity tyrosine-(Y)-phosphoryl
hypothetical protein MGC11296 
gb:ze43d11.r1 Soares retina N2b4HR Homo



gb:PM3-BT0347-170200-001-b05 BT0347 Homo 



Hs.135104 ESTs 



417002 T79613 
439446 AI927629 
441227 AW295407 
445038 AI635444 
4551Q7 BE154113 
458624 AI362790 
459344 AW499533 
452605 AW968557 
457652 AF116656 
450068 AW207212 
444750 AW242684 
414591 AI888490 
407264 L34727 
443169 AI038687 
426536 AI949749 
449752 AI668626 
459592 AL037421 
429504 X99133 
429063 AW363845 
430484 D82880 
423673 BE003054 
447375 AI376660 
444230 H95537 
439911 AA854024 
421295 NM_002666
449385 AI650471 
430044 AA464510 
427131 AA448460 
409103 AF251237 
421354 AA766485 
423740 Y07701 
440048 AA897461 
441358 AW173212 
453857 AL080235 
414290 AI568801 
427342 AL110150
459459 AA460445 
434638 H50758 
442717 R88362 
419637 W27493 
431169 AW971240 
449432 AW451361 
458734 AI554946 
449529 AI990559 
425088 AF038007 
420195 N44348 
418105 AW937488 
430957 AI937072 
418188 AW139413 
424103 NM_001918



400462 

413697 AA131315 

421755 AW169454 

424195 U50536 

434163 AW974720 



Hs.14613 
Hs.57873 
Hs.128893
Hs.143917 

Hs.278639 
Hs.257976 
Hs.90012 
Hs.273809 
Hs.280925 
Hs.243623
Hs.65902 

Hs.133338 

Hs.44441 

Hs.61773 

Hs.208746 

Hs.204238 

Hs.122142 

Hs.241548

Hs.1695 

Hs.257822 

Hs.146067 

Hs.189110 

Hs.103253

Hs.270370 

Hs.152812 

Hs.112017

Hs.112208



dJ467N11.1 protein 

gb:PM1-HT0340-151299-003-a08 HT0340 Homo 
KIAA1684 protein; likely homolog of mous 



EST; 
ESTs 

Homo sapiens cDNA FLJ11648 fis, clone HE

ESTs, Moderately similar to pot. ORF I [ 

lipocalin 2 (oncogene 24p3) 

ESTs, Weakly similar to A46010 X-linked

RAS p21 protein activator 2 

matrix metalloproteinase 12 (macrophage 



Hs.328737 
Hs.129041 
Hs.35861 
Hs.71721 
Hs.176680 



Hs.196529 
Hs.158794 
Hs.232033 
Hs.166196 
Hs.26243 
Hs.178000 
Hs.55043 
Hs.151880 
Hs.139410 
Hs.52184 
Hs.121742 

Hs.25954 



ESTs, Weakly similar ti 
ESTs 

DKFZP586E1621 protein 
ESTs 

Homo sapiens mRNA; cDNA DKFZp585D0724 (f 

gb:zx66h11.r1 Soares_total_fetus_Nb2HF8_

gb:yp86e06.r1 Soares fetal liver spleen

ESTs, Weakly similar to T23975 hypotheti

gb:31h10 Human retina cDNA randomly prim

gb:EST383329 MAGE resequences, MAGL Homo

ESTs 

ESTs 

ESTs 

ATPase, Class I, type 8B, member 1
Homo sapiens cDNA FLJ11177 fis, clone PL
ESTs, Weakly similar to FV1_MOUSE FRIEND
Homo sapiens cDNA FLJ13277 fis, clone OV



interleukin 13 receptor, alpha 2 



Hs.47144 DKFZP585N0819 protein 

Hs.207422 ESTs, Weakly similar to S71949 metallopr 

Hs.142907 Human BRCA2 region, mRNA sequence CG011

Hs.25205 group XII secreted phospholipase A2
435985 AA703154

449916 T60525 

458661 AI299789 

159023 AW968226 
105005 

55561 AI868634 

452161 R43077
AI393115 
AW968203 

445635 AI769774

433479 AW511459 

441676 BE564206 

407965 W21483 

450682 Z42993

452958 AA883929 

454032 W31790 
405347 



Hs.166999 ESTs, Moderately sin 



ESTs, Weakly similar to T32250 hypotheti 



Hs.246358 
Hs.221747 
Hs.127655 



Hs.194293 ESTs, Weakly similar to I54374 gene NF2



440577 


AA889945 


Hs.325381 


EST 




455780 
457024 


BE088828


Hs.119151 


gb:CM2-BT0693-230300-129-g09 BT0693 Homo 
ESTs 




404249 
437511 


AI807500 


Hs.125247 


ESTs 




421338 


AA287443 




gb:zs52c10.r1 NCI_CGAP_GCB1 Homo sapiens




425146 


AW954627 




gb:EST366697 MAGE resequences, MAGC Homo 




428277 


AA425220 


Hs.179203 


ESTs 




444870 


AI200621 


Hs.148504 






402090 
458507 


AI185703 


Hs.206957 


ESTs 


2.4 




AI745185 




yes-associated protein 65 kDa 




446534 


AI307356 


Hs.175225


ESTs 


2.4 


453111 


AB014598 


Hs.31720 




2.4
2.4


405230 
405935 








2.4


413642 


BE154837 




gb:PM1-HT0345-121199-001-c08 HT0345 Homo 


2.4


420724 


AA279694 


Hs.191540 


ESTs 


2.4


436998 


AA745625 




ESTs, Weakly similar to ALU8_HUMAN ALU S


2.4 


445748 


U80766 


Hs.43252


Human EST clone 22453 mariner transposon


2.4 


434283 


AW235341 


Hs.58715 


thiamine pyrophosphokinase 


2.3 


407404 


AF040257 




gb:Homo sapiens TNF receptor homolog mRN 


2.3 


440621 


AW296024 


Hs.150434 


ESTs 




423417 


AP000365 


Hs.128342 


potassium large conductance calcium-acti


2.3 


424131 


AA335714 


Hs.199665 


ESTs 


2.3


450737 


AW007152 


Hs.203330 


ESTs 


2.3


453687 


T55674 


Hs.283108 


hemoglobin, gamma G 


2.3 


442704 


AI015463 


Hs.130987 


ESTs 


2.3


457756 


AA126136 


Hs.38125 


interferon-induced protein 75, 52kD 


2.3 


412732 


AW993300 




gb:RC2-BN0033-180200-01Sg06 BN0033 Homo 


2.3 


418998 


F13215 


Hs.287849


ESTs, Weakly similar to T22074 hypotheti


2.3 


419751 


AW195581 


Hs.93121 


KIAA0761 protein 


2.3 


429485 


AW197086 




ESTs 


2.3 


433377 


AI752713 


Hs.43845


ESTs 


2.3


434896 


AW022054 


Hs.136591 




2.3 


441675 


AI914329 


Hs.5461 


ESTs 


2.3 




AI188739 


Hs.148488 




2.3 


445621 


AI733818 


Hs.145549 






449182 


AW292381 


Hs.224150 




2.3


404068 


Y08564 


Hs.248190 


UDP-N-acetyl-alpha-D-galactosamine polyp


2.3 
2.3 


414366 


BE549143 




gb:601076455F1 NIH_MGC_12 Homo sapiens c


2.3


438315 


R56795 


Hs.82419 


ESTs 


2.3 


423161 


AL049227 


Hs.124776 


Homo sapiens mRNA; cDNA DKFZp564N1116 (f


2.3 


447998 


AI768289 


Hs.304389 


ESTs 


2.3 


410150 


AW382942 


Hs.6774 


ESTs 


2.3


432792 


AA448114 


Hs.278950 


protocadherin beta 1 , 


2.3


443363 


AI792629 


Hs.133293


ESTs 


2.3


440729 






ESTs 


2.3 


411045 


AW854691 


Hs.115325


RAB7, member RAS oncogene family-like 1


2.3 




AW138410


Hs.45051 


ESTs 


2.3 




AW301478 


Hs.184592 


protein kinase, lysine deficient 1


2.3 


458584 


BE281115 


Hs.98855 


hypothetical protein FLJ20909 




427952 


AA946582 


Hs.8700 


deleted in liver cancer 1 


2.3


401899 








2.3


432116 


AA902953 


Hs.308538 


ESTs 


2.3 


404195 








2.3 


410999 


AW813004 




gb:RC3-ST0186-230300-019-h02 ST0186 Homo




413308 


W28131 




gb:42f7 Human retina cDNA randomly prime 


2.3 








gb:nc71(10.s1 NCI_CGAP_Pr1 Homo sapiens


2.3 


443482 


AW188093


Hs.250385 


ESTs 


2.3 


453305 


R39224 


Hs.267997 


EHM2 gene


2.3 


451963 


AI825440


Hs.224952 


ESTs 


2.3 


453043 


AW136440 


Hs.224277 


ESTs 


2.3 


435559 


AF209198 


Hs.42636 


zinc finger protein 277 


2.3 


440727 AI073991 


Hs.134268 


ESTs, Weakly similar to 2109260A B cell 


2.3
434120 AI436050

429768 AA805719 

425292 NM_005824 

455841 BE145836

411093 BE067650 

430706 NM_003540

428268 AA424957 

458833 AW236702 

452215 AK002043 

444109 AI124553

428411 AW291464 



37 kDa leucine-rich repeat (LRR) protein 
gb:MR0-HT0208-101299-202-b08 HT0208 Homo
gb:MR4-BT0358-090300-003-e01 BT0358 Homo
H4 histone family, member C
ESTs 

ESTs, Weakly similar to A46010 X-linked 
hypothetical protein FLJ11181 
Homo sapiens cDNA: FLJ21693 fis, clone C



Hs.10338 ESTs



smenl (FUSE) binding prot 



413841 M34276 

424068 U50531 

433532 AW975367 

442710 AI015631 

444206 AW301017 

451264 AI758235 

454784 AW820626 

429080 AA446228 
404166 

416327 R99822 

400631 AF173937 

438504 AW665281 

435325 AI038388 

421253 AI188102 

427046 BE245180 

432711 AA563785 

439715 AA524504 

441398 AA932398 

448458 AW614367 

452542 AW812256 

417768 R24732 

427374 AI150033



407639 AW205369 

455121 BE156459 

448117 H49129 

443931 H23213 

450795 AW173371 

418632 AW118745 

419441 AW023731 

455067 AW854538

418291 BE300369 

455964 BE166924

445944 H06336 

424827 AI057094

449272 AW137656 



Hs.36172 ESTs 



Hs.119309 
Hs.31028 
Hs.121385
Hs.152465 



_ T / H, milartol 022 hvp n I 

I Li HI Lt 

ESTs, Weakly similar to B34087 hypotheti
ESTs 

gb:RC0-ST0174-191099-031-a07 ST0174 Homo



Hs.82845 
Hs.310359 
Hs.34298 



Hs.22657 
Hs.60435 
Hs.9450 
Hs.274368 



gb:hm46f02.x1 NC1_CGAP_RDF1 Homo sapiens 
ESTs 

gb:QVO-HT0368-040100-082-f06 HT0368 Homo

ESTs 
ESTs 



; cDNA DKFZp547C244 (fr 
MSTP032 protein 

gb:RC3-CT0255-200100-024-b02 CT0255 Homo 
hypothetical protein MGC4126 
gb:CM4-HT0501-240300-519-f01 HT0501 Homo 
Homo sapiens clone 24875 mRNA sequence 
Homo sapiens cDNA: FLJ23155 fis, clone L



Hs.282383 ESTs 



hypothetical protein FLJ12820



424565 AW102723 

455201 AW947884 

429180 AA806287 

418849 AW474547 



AL133887 
AI693269 
AA195601 



405747 

412105 H07971 

420457 AA482280 

407726 AA435679 

423720 AL044191 



Hs.193691 ESTs



Homo sapiens, Similar to DNA segment, Ch 



Hs.58893 
Hs.53565 
Hs.158244 



Hs.94319 
Hs.191556 
Hs.88594 



Hs.202273 
Hs.26771 
Hs.334840 



Homo sapiens PIG-M mF 
KIAA0479 protein 

ESTs, Moderately similar to ZNS1_HUMAN Z
ESTs 

ESTs, Weakly similar to I38022 hypotheti 
ras homolog gene family, member I 
gb:zg07b07.s1 Soares_pineal_gland_N3HPG
Human (clone CTG-A4) mRNA sequence 
hypothetical protein 



VPS10 domain receptor protein 
AI056599












hypothetical protein FLJ23476 




AV647969 




KIAA1451 protein 










417712 


AA205S69 


Hs.194193


ESTs, Moderately similar to ALU1_HUMAN A 


449015 


AL038958 


Hs.22868 


protein tyrosine phosphatase, non-recept




AW841462 




gb:RC6-CN0014-080300-012-B09 CN0014 Homo




AF056085 








AW864609 




gb:PM3-SN0017-240300-001-h03 SN0017 Homo




AW886156 




gb:RC5-OT0078-150300-021-E08 OT0078 Homo


407809 


AW082279 


Hs.244106 


ESTs 


420478 


AA521259 


Hs.193796 


ESTs 












AI208754 


Hs.147369










gb:601276347F1 NIH_MGC_20 Homo sapiens c




AL034344 


Hs.284186 


forkhead box C1 




AW850267 




gb:IL3-CT0219-161199-031-A09 CT0219 Homo 


403S78 
457003 


S78234 


Hs.172405


cell division cycle 27 


404531 


Z25884 


Hs.121483 


chloride channel 1, skeletal muscle (Th


423045 


AW967472 


Hs.183302


PCTAIRE protein kinase 2 


409427 


AW389668 




gb:RC2-ST0168-071299-013-(06ST0168Homo 


434745 


AW974445 


Hs.185155 


ESTs, Weakly similar to T12482 hypotheti


400696 
407259 


L02256 




gb:Human Fab fragment binding syncytial 


411893 


R82845 


Hs.273789


ESTs 


428192 


AA424051 


Hs.304742 


ESTs 


435634 


T82384 




gb:yc14f05.r1 Stratagene lung (937210) H 


438018 


AK001160 


Hs.25428


hypothetical protein FLJ10298


458303 


AI264628 




ESTs 


405692 
403572 

























417859 


T26453 




gb:AB214F6R Infant brain, LLNL array of 


456472 




Hs.95744 


hypothetical protein similar to ankyrin 




AI123922 


Hs.138215 


Homo sapiens cDNA FLJ11400 fis, clone HE


428231 


U17989 


Hs.183105 


nuclear autoantigen 


454086 


AW885909 


Hs.6975 


PRO1073 protein


425071 


NM_013989 




deiodinase, iodothyronine, type II


416348 


H65887 


Hs.272163 


ESTs 


403780 








414262 






ESTs 


419423 


D26488


Hs.90315


KIAA0007 protein 


442078 


AW268583 


Hs.262629 


ESTs 


452975 


M85521 


Hs.244482 


Homo sapiens, clone IMAGE:3611719, mRNA,


419216 


AU076718 


Hs.154021 


3, i II nj il cytokin ubl mily B ( 


416588 


H66558 




gb:yu16e04.r1 Soares fetal liver spleen 
cullin4B 


425368 
4256B6 
441638 


AB014595 
M73531 


Hs.155976
Hs.1937


retinal degeneration, slow (retinitis pi 
ESTs 


446845 


AW293202 
AI343645 


Hs.133451 
Hs.156108


ESTs 


422563 


BE299342 


Hs.19348 


hypothetical protein FLJ13119


436574 


AW293527 


Hs.126455 


ESTs 


424584 


H10692 


Hs.13310 


ESTs 


456347 


U00803 


Hs.89426


fyn-related kinase 


446901 


AI347274 




gb:tc05d02.x1 NCI_CGAP_Co16 Homo sapiens


459364 


W69284 




gb:zd46c03.r1 Soares_fetal_heart_NbHH19W


430686 


NM_001942


Hs.2633 


desmoglein 1 


414831 


M31158 


Hs.77439 


protein kinase, cAMP-dependent, regulate 


425707 
403525 


AF115402


Hs.11713 


E74-like factor 5 (ets domain transcript


453343 


AA905353 


Hs.121622 


ESTs 


421574 


AJ000152 


Hs.105924 


defensin, beta 2 


449327 


AI638743 
AW819848 


Hs.224672 


ESTs 

gb:QVO-ST0294-070300-15',-b04 ST0294 Homo 


420493 


AI635113 


Hs.270366 


ESTs, Weakly similar to I78885 serine/th


401614 








404767 








410594 


AW770778 


Hs.281238 


ESTs 


436193 


AA706059 


Hs.255286 


ESTs 


439626 


N22415 


Hs.189080 


ESTs 


455481 


AA258033 


Hs.108110 


DKFZP547E2110 protein


441453 


AW176106 


Hs.285459 


ESTs 


424946 


M64572 


Hs.153932 


protein tyrosine phosphatase, non-recept 


437332 


AA814943 




gb:oc07d06.s1 NCI_CGAP_GCB1 Homo sapiens 


454419 


AA082211 


Hs.233936 


myosin, light polypeptide, regulatory, n


415225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 ferritin 


450579 


AW136774


Hs.48614 


ESTs 


400654 
447613 


AL041057 


Hs.33353 


DKFZP434N093 protein 
402689 AK001334


Hs.15470 


putative ring zinc finger protein NY-REN 


2.2 


430884 AF053748 


Hs.248114 


glial cell derived neurotrophic factor 


2.2 


432797 AA565264 


Hs.136443 


ESTs 


2.2 
2.2 


405608 

426365 AA376667 


Hs.10283 


RNA binding motif protein 8B 


2.2 


405634 






2.2 


423646 H02364 




gb:yj35d06.r1 Soares placenta Nb2HP Homo 


2.2 


434690 AI867679 


Hs.148410 


ESTs 


2.2 


436572 AA723274 


Hs.279596 




2.2 


447044 AF030107 


Hs.17165 


regulator of G-protein signalling 13 


2.2 


448828 AI580296 


Hs.174782


ESTs, Weakly similar to KIAA1437 protein


2.2


457802 T78013 


Hs.167279


FYVE-finger-containing Rab5 effector pro 


2.2 


444585 AW170015 


Hs.6594 


ESTs 


2.2


433781 AA609379 


Hs.192083


ESTs 


2.2 


450587 AI828854 




striatin, calmodulin-binding protein


2.2


434077 AF116659


i 32 151 


Homo sapiens PRO1412 mRNA, complete cds


2.2 


448756 AI739241 




ESTs 


2.2 


430388 AA356923 




nuclear cap binding protein subunit 2, 2 


2.2 


454471 AW902125 




gb:QVO-NN1022-12050J-220-h12 NN1022 Homo 


2.2 


419107 AW085152 


Hs.292987 


ESTs 


2.2


455114 AW857121 




gb:RC1-CT0302-040400-017-a',2 CT0302 Homo 


2.2


416548 H62953 




gb:yr47f06.r1 Soares fetal liver spleen 


2.2


454117 BE410100 




adaptor-related protein complex 1, sigma 


2.2


455056 AA463550 


1K337532 


ESTs, Weakly similar to A47582 B-cell gr 


2.2 


409998 M78345 


Hs.98265 


KIAA1877 protein 




422352 AA766296 


Hs.99200 


ESTs 


2.2 


409191 AW818390


Hs.175613 


homolog of Xenopus Claspin


2.2 


433919 AA746311 




gb:oaS6d12.r1 NCI_CGAP_GCB1 Homo sapiens


2.2 


455771 BE084820 


Hs.185711 


hypothetical protein FLJ20070 


2.2 


431632 AK000992 


Hs.333144 


Homo sapiens cDNA FLJ10130 fis, clone HE


2.2 


454716 AW850684 




gb:IL3-CT0219-160200-063-D12 CT0219 Homo


2.2 


413752 BE161807 




gb:MR3-HT0446-300300-203-h01 HT0446 Homo


2.2 


458037 AF074982 


Hs.226031 


r H miartoKIAA053 r 


2.2 


434239 AF119910


Hs.283047


hypothetical protein PRO2964


2.2 


435133 AJ010482 


Hs.31412


Homo sapiens cDNA FLJ11422 fis, clone HE


2.2 


442772 AW503680 




Homo sapiens clone 24416 mRNA sequence 


2.2 
2.2


400697 

455685 BE066976 




gb:PM0-BT0340-211299-003-c12 BT0340 Homo


2.2


447039 AV661798 


Hs.282915


ESTs 


2.2








2.2 


422728 AW937826 


Hs.103262 


ESTs, Weakly similar to ZN91_HUMAN ZINC


2.2 


421976 AL138443 


Hs.23450 


mitochondrial ribosomal protein S25 










2.2 


425001 U55184 


Hs.154145 


hypothetical protein FLJ11585


2.2


447816 NM_007233




TP53 target gene 


2.2 


416143 AI9SS6S0 


Hs.79033 


glutaminyl-peptide cyclotransferase (glu


2.2


419118 AA234223 


Hs.139204 


ESTs 


2.2


426261 AW242243 


Hs.168670 


peroxisomal farnesylated protein 


2.2 


449808 AA694220 


Hs.15403 


1 Mrd r I r o It tH t 


2.2


454749 AW818649 




gb:RC1-ST0278-040400-018-e02 ST0278 Homo


2.2 


456933 AA363946 


Hs.20969 


ESTs 




402942 






2.2 


437064 AI023264




gb:ov64h08.s1 Soares testis NHT Homo sap 


2.2 


458623 AI305223 


Hs.148056 


ESTs 


2.2


415257 F03016 


Hs.27513 






425269 H15302


Hs.158950 


Homo sapiens mRNA; cDNA DKFZp566A1046 (f


2.2


442783 AI017586 


Hs.131181 


ESTs 


2.2


444313 AI140494 


Hs.197955


KIAA0704 protein 


2.2 


453444 AL036531 




gb:DKFZp564IH62_r1 564 (synonym: hfbr2)


2.2 


422757 AI909935


Hs.65551 


Homo sapiens, Similar to DNA segment, Ch 


2.2


430013 AA463833 


Hs.151275


ESTs, Weakly similar to TRHY_HUMAN TRICH




437138 AI935622 


Hs.271245 


ESTs 


2.2 

2.2 


406298 

409723 AW885757 


Hs.257862


ESTs 


2.2


414481 AW451955 


Hs.8383 


bromodomain adjacent to zinc finger doma 


2.2 


433266 AI863224


Hs.31476 


Homo sapiens cDNA FLJ13872 fis, clone TH


2.2 
2.2 


435090 BE217923 
457187 AA443927 


Hs.149595 
Hs.144360 


ESTs 
EST 


2.2 


445061 AI253094 






2.2 


442617 AW340093 


Hs.130538


ESTs 


2.2 


438298 H23542 


Hs.181788 


ESTs 


2.2 






gb:PM1-BT0348-151299-001-dO4 BT0348 Homo 
ESTs 


2.2 


428017 AA424983 


Hs.98312 


2.2 


451149 AL047586 


Hs.10283 


RNA binding motif protein 8B 


2.2 


418076 R61388 


Hs.6724 


ESTs 


2.2 


403306 NM_006825 


Hs.74368 


transmembrane protein (63kD), endoplasmi 


2.2 


441811 AI073548 


Hs.164597


ESTs 




434763 AA648618 




gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens


2.2 


447453 AW608645 


Hs.18800


hypothetical protein FLJ20281


2.2 


420931 AF044197 


Hs.100431


small inducible cytokine B subfamily (Cy 


2.2


415424 Z44766 




gb:HSC28G081 normalized infant brain cDN 


2.2


408332 H91230 


Hs.234794 


Homo sapiens mRNA; cDNA DKFZp564B083 (fr


2.2 


421216 AV649282 


Hs.102664 


vesicle-associated membrane protein 4


2.2 
429609 AF002246

448700 BE614182 

457741 BE044740 

437927 AI039789 
401694 

424419 AK001563 

436540 AA724411 

438290 AA843719

445908 R13580 

455735 BE161124 

458455 AV648310 

430680 AW138724 

447147 AA910353 

424063 NM_002019

441874 AA970389 

448045 AJ297436 

433629 R13140 

416266 AA164199 

440633 AI140686 

442789 AW904361 

417663 AA203701 

407788 BE514982 
401240 

408908 BE296227 

411151 AW866497 

414275 AW970254 
436992 AA741074 
439634 W79377 
411770 NM_014278



Hs.129750 hypothetical protein FLJ10546
Hs.146589 hypothetical protein FLJ10701 
Hs.156065 ESTs 



Hs.292815 
Hs.138671 
Hs.128055
Hs.20166 



Hs.270152 ESTs 



ESTs, Highly similar to ALU7_HUMAN ALU S
ESTs, Weakly similar to T23482 hypotheti
fms-related tyrosine kinase 1 (vascular 
ESTs 

prostate stem cell antigen 



Hs.131191 ESTs, Weakly similar to ALU7_HUMAN ALU S

gb:zx52a10.r1 Soares_fetal_liver_spleen
Hs.38991 S100 calcium-binding protein A2 



heat shock protein (hsp110 family)



410648 AW792909 

423717 AA330Q36 

436683 AW991278 

445225 AI216555 

410991 AW812790 

412639 AW961284 

447777 AI424223

451270 AW341392 

404526 AI912555 

452492 BE063096 

417154 AI674701 

428162 AA422030 

442312 AI820617 

456513 AA279143 

430712 AWC44647 

441445 AI221959 

420288 AW071225 

412329 AW937445 

447033 AI357412 

436853 BE328074 

455189 AW864176 



457040 N77624 

424480 AA341442 

403317 U02687 
406018 

410566 AA373210 

413801 M62246 

415871 R55995 

416747 AW876523 

417725 R25257 

424856 AA347746 

439474 AI824060 

446895 AA166655 

448582 AI538880 

452783 AA028167 

442430 R89164 

428908 AW303529 

427335 AA448542 

428336 AA503115 

419290 AI128114 

416951 AA190926 

439950 AW937417 

458227 Z40670 

447179 AW015633 

454950 AW847460 



420844 



Hs.152003 
Hs.57787 
Hs.202398 

Hs.296235 



gb:CMO-UM0001-0103CO-258-c05 UM0001 Homo 



Hs.235795 ESTs 
Hs.157195 

Hs.21388 

Hs.129216 
Hs.88561 
Hs.196284 
Hs.187937 



gb:RC3-ST0186-141299-014-g08 ST0186 Homo
ESTs 

gb:te95a05.x1 NCI_CGAP_Pr28 Homo sapiens



gb:QV3-DT0043-09020C-080-c09 DT0043 Homo 



Hs.131575 ESTs 



Hs.43047 
Hs.35406 
Hs.283309 

Hs.21503 ESTs 



Hs.211501 
Hs.282803 
Hs.94812 
Hs.61486 
Hs.48320 
Hs.144955 
Hs.251677 
Hs.183752 
Hs.112885 
Hs.190785 
Hs.293561 
Hs.181340 
Hs.157299 



phosphatidic acid phosphatase type 2B
ESTs 

fms-related tyrosine kinase 3 

Homo sapiens cDNA FLJ13585 fis, clone PL
ESTs, Highly similar to unnamed protein 
ESTs, Moderately similar to ALU1_HUMAN A
hypothetical protein FLJ12910



double ring-finger protein, Dorfin



gb:RC3-CT0208-270999421-eO4 CT0208 Homo 
gb:nh22c09.s1 NCI_CGAP_Pr1 Homo sapiens 
426456 AA58074B

428B22 W28418 

430879 BE149423 

444584 AI168422

446296 AA985662 

453853 AL040600 

414083 AL121282 
401645 

436577 W84774

427469 AA403084 

409168 N94037 

410276 AI554545

443372 AI792557 

422093 AF151852 
402333 

409374 R87083 

412011 NM_000406 

412798 AW998657 

416085 H18072 

418378 AW962081 

437846 AA773866 

452374 AL037405 

450061 AI797034 

450180 AW449644 
405120 

407378 AA299264 

458890 AW865523 

435600 AL047034 

440964 AI733106 

417455 AW007066 

436461 AW511956 

436777 AA731199 

427521 AW973352 

413646 BE155042 
413231 D87461 



411518 AW850246 
443777 AV646510 
416148 H22453 
402528 

431215 AA496078 
436820 AI684535 
446209 AI375025 
453362 H14988 
417430 AA984546 
401069 

454078 AA601518 
410966 AW812088 
447124 AW976438 
449939 T86420 
411593 AW857271 
438005 BE151746
443486 NM_003428
407884 BE075316 
404694 

406668 T62745 
441092 T99289 
454643 BE006345 
426646 AA382787 
431605 AW972407 
414452 AA454038 
401991 

457176 AA435837 
436464 AI016176 
428208 AA442327 
445049 AV652718 
419116 AF292402 
427894 AL135709 
424296 AI631874 
424323 AA338791 
404582 

418631 AA225921 
424872 AA347923 
452539 AW105321 
45465B AWB12330 
440310 AA878939 
433297 AV658581 
410900 AW810169 
419386 AA236867 



Hs.130658 ESTs

Hs.30715 potassium voltage-gated channel, Isk-rel
Hs.10554 hypothetical protein FLJ12761

gb:ok30e11.x1 Soares_NSF_F8_SW_OT_PA_P_S 



Homo sapiens cDNA FLJ13155 fis, clone NT

MS.I00U3J tslS 

Hs.257786 ESTs 

Hs.17643 ESTs 

Hs.269347 ESTs, Weakly similar to 2109260A B cell 

Hs.312938 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.68301 ESTs 

Hs.133107 ESTs 

Hs.111449 CGI-94 protein 



Hs.19081 
Hs.73064 
Hs.119120
Hs.92576 

Hs.244569 
Hs.339639 
Hs.201115 
Hs.257182 



ESTs 



Hs.130218 
Hs.18949
Hs.293261
Hs.293130



E3 ubiquitin ligase SMURF1

gb:EST374154 MAGE resequences. MAGG Homo 



iar lo CA2B„HUMAN COLLA 



ESTs 



Hs.290585 ESTs



Hs.169187 ESTs 



gb:PMO-HT0349-101299-002-E04 HT0349 Homo 
BCL2-like 2

hypothetical protein DKFZp565N034 
gb:IL3-CT0219-251099-021-E07 CT0219 Homo
Homo sapiens mRNA; cDNA DKFZp564Q12B2 (f 



Hs.121554 
Hs.200811 
Hs.153368 
Hs.107375 



Human DNA sequence from clc 



gb:am88e08.s1 Stratagene schizo brain S1 

Hs.22209 secreted modular calcium-binding protein 

gb:RC4-ST0173-191099-032-a07 ST0173 Homo 

Hs.17428 RBP1-like protein 

Hs.272139 ESTs 

gb:CMO-CT0307-210100-158 g09 CT0307 Homo 
PM1-HT 05-061293-1 HT0305 Homo 

Hs.9450 zinc finger protein 84 (HPF2) 

Hs.95011 syntrophin, beta 1 (dystrophin-associate

Hs.184411 albumin 
Hs.126555 EST 

gb:RC2-BN0127-240300-011-d05 BN0127 Homo 
Hs.122713 ESTs 

gb:EST38449B MAGE 
Hs.29032 ESTs 



Hs.283093 
Hs.28921 
Hs.155140 
Hs.177788



gb:AV652718 GLCHomo sapiens cDNA clone 
neuromedin U receptor 2 
zinc finger protein 

casein kinase 2, alpha 1 polypeptide 



gb:EST54302 Fetal heart II Homo sa



Hs.11123 
Hs.125406 
Hs.282633 



447842 AW160B04 Hs.247302 
453880 AI803166 Hs.28462 
425189 H16622 
457225 AW820C35



414494 AA768491 

427027 AI924294

444498 AI151413 

435191 R15912 

425324 M89470 

430719 AA488988 

432577 BE208545 

407593 AW044083 
401098 

440299 AI871778 

414145 BE549372 

428627 BE002993 

451806 NM_003729



Hs.6783 

Hs.173259 

Hs.26330 

Hs.4817 

Hs.155644 

Hs.293796 

Hs.317590 

Hs.237008 

Hs.250112 
Hs.317596
Hs.187660 
Hs.27076 
Hs.154903 
Hs.25545 
Hs.247377 



gb:QV0-NN1071-283400-207-g07 NN1071 Homo 
ESTs, Weakly similar to ALU4_HUMAN ALU S
gb:RC5-OT0078-280300-022-F01 OT0078 Homo 
hypothetical protein FLJ22724 



421498 AA292084 

456083 U46922 

440527 AV657117 
406413 

439483 T69980 

446242 N66336 

449625 NM_014253

457938 AI373638 

413101 BE065215 

408350 AW183350 

419812 NM_000562 

430881 NM_000809

429682 NM_006306

409955 U60655 

435579 AI3323/3 

436088 AA704687 

430223 NM_002514

416100 H18700 

403218 AL134878 



paired box gene 2 
ESTs 

hypothetical protein FLJ14640 



Homo sapiens cDNA FLJ12927 fis, clone NT 
putative Rab5 GDP/GTP exchange factor ho 
RNA 3'-terminal phosphate cyclase 
ESTs, Weakly similar to A56154 Abl subst 



Hs.191575 ESTs, Moderately similar to ALU2_HUMAN A

Hs.77252 fragile histidine triad gene 

Hs.184164 ESTs, Moderately similar to S65657 alpha 

Homo sapiens cDNA FLJ11613 fis, clone HE



Hs.7360 
Hs.23796 
Hs.133900 

Hs.250127 
Hs.93210 
Hs.248112 
Hs.211602 
Hs.57692 
Hs.156924



gb:RC1 -BT0314-310300-01 S-fOI BT0314 Home 
ESTs 

complement component 8, alpha polypeptid
gamma-aminobutyric acid (GABA) A recepto
SMC1 (structural maintenance of chromoso 
i 6 open reading frame 10 



Hs.331250 ESTs 



nephroblastoma overexpressed gene 
ESTs 

ribosomal protein, large P2 



411940 AW876686 

412446 AI768015 

414012 AW452334 

421966 AA904519 

430566 AA481282 

456606 AA292862 

451604 T65365 

440926 AW196772 

420687 AA279392 

459082 BE551721 

413241 BE073771 

426917 AA913814 

447552 AI394125 

420905 AA521307 

428052 AA420477 

424308 AW975531 

432527 AW975028 

430202 T85775 

445610 AV659433 

427961 AW293165 

455290 U75810 

445554 AB028957 

412811 H06382 

413783 AA314337 

423867 AA331886 

42941 B AI381028 

431511 NM_012386

445829 AI452457 

452366 AK000464 



Hs.92127 
Hs.128148 
Hs.130710 
Hs.190149 
Hs.275369 
Hs.172851
Hs.131323 



Hs.302414 
Hs.172854
Hs.160413 
Hs.186651 
Hs.26993 
Hs.154443
Hs.102754 



Hs.12896
Hs.21400 
Hs.301547 



ESTs 
ESTs 
ESTs 

arginase, type II 
ESTs 

Homo sapiens cDNA FLJ13427 fis, clone Pt 
ESTs 

) sapiens clone FLB8945 PR0241 1 ml 



ESTs 
ESTs 



Hs.258581 Homosapiensp95oM[lin»i,n-li,i> r 
Hs.145526 ESTs 
Hs.29276 
Hs.221849 



406666 V00495 

445688 AI248205 

446131 NM_000929 

440388 AI693520 



Hs.223000 ESTs 






457128 


AI932995 


Hs.183475 


Homo sapiens clone 25061 mRNA sequence 


404416 










AW138466 






431552 


AI815863 


Hs.259873


axonal transport of synaptic vesicles 


455814 


BE141689 




gb:CM1-HT0092-220999-01 B-bOS HT0092 Hono 


454759 


AW819455 




i i j ^ r 


426497 


AA379913 




gb:EST92807 Skin tumor 1 Homo sapiens cD


404420 










AW451982 


Hs.248613 


ESTs 


432702 


AW973953 


Hs.293744 


ESTs 


448587 


AI539652


Hs.28338 




446854 


BE268103


Hs.208914 


hypothetical protein MGC10999


410569 


AA766825 


Hs.205676 


ESTs 


432596 


AJ224741 


Hs.278461 


matrilin 3








gb:EST374240 MAGE resequences, MAGG Homo 


433532 


AA649921 


Hs.112553




435079 


AA664192 




gb:ac05b03.s1 Stratagene lung (937210) H 


451927 


AL355687 


Hs.27261 


Hon no i 1 1 I j h rt ull 


432839 






hypothetical protein KIAA1335 




N66727 


Hs.10957 


ESTs 


408459 


H09701 


Hs.278366 


ESTs, Weakly similar to I38022 hypotheti 


400842 








400859 


BE091833 




gb:IL2-BT0731-260400-076-F04 BT0731 Homo 


405829 
411863 


BE075244 


Hs.12420 


ESTs 


415258 
416093 


AW752247 
R60685 


Hs.293853 
Hs.268698 


ESTs 

ESTs, Moderately similar to ALUC HUMAN ! 


420314 


H81671 


Hs.320921 


ESTs, Weakly similar to T22688 hypotheti 


428002 


AA418703 




gb:zv98c03.s1 Soares NhHMPu_S1 Homosapi 


437733 


AI792574 


Hs.122876


ESTs 


453118 


AW195849 


Hs.252757 


ESTs 


457039 


H29990 


Hs.101937 


sine oculis homeobox (Drosophila) homolo


454678 


AW809178 




gb:MR4-ST0118-261099-012-c07 ST0118 Homo




AW851728 




222-1 n7d(-hrT0222Homo 


419986 


AI345455 


Hs.78915


GA-binding protein transcription factor,


415173 


AW501735 


Hs.253015 


ESTs 


449011 


AI655376 


Hs.192693 


ESTs 


410365 


AI287518 


Hs.62669 


Homo sapiens mRNA; cDNA DKFZp586D0923 (f 




AI927382 


Hs.29857 


ESTs 


455688 


BE067238 




gb:PM1-BT0348-151299-001-a12 BT0348 Homo 




AW207553 


Hs.253639 


434663 


AA641972 


Hs.130058


ESTs 




AA421081 


Hs.12388 




425006 


R38685 


Hs.332622 


ESTs 




H77395 


Hs.39749 


ESTs 


400049 








428333 


AW972668 


Hs.293044 




429458 


BE161832 


Hs.292689 


ESTs 


425087 


R62424 


Hs.126059


ESTs 


457122 


AI026157 


Hs.33728 


ESTs, Weakly similar to ALU1_HUMAN ALU S


400310 


X63956 


Hs.135631


H.sapiens synthetic gene for platelet-de 




AI968300 


Hs.208220 


ESTs 


401986 
415318 


T06544 




gb:EST04433 Fetal brain, Stratagene (cat 


417756 


Z43056 




gb:HSC12B021 normalized infant brain cDN


418301 


AW976201 


Hs.53913 


hypothetical protein FLJ10252


424698 


AA164366 


Hs.151973 


hypothetical protein FLJ23511


429110 


129301 


Hs.2353 


opioid receptor, mu 1 




AW085934 


Hs.120868


ESTs 


434118 


AF116715 


Hs.256256 


Homo sapiens PRO2829 mRNA, complete cds


435413 


AI257476 


Hs.46669 


ESTs 


443748 


AW206447 




gb:UI-H-BI1-afg-g-02-0-UI.s1 NCI_CGAP_Su


445205 


D83776 


Hs.12413 




458175 




Hs.150434 


ESTs 


446419 


AW576760 


Hs.160726 


Homo sapiens cDNA FLJ11680 fis, clone HE


441627 


AA947552 


Hs.58086 


ESTs 


457653 


AI820719 


Hs.154662 


DnaJ (Hsp40) homolog, subfamily A, membe




AI693369 


Hs.202274 


ESTs 


449899 


AI610700 


Hs.103280 


ESTs 


420111 


AA255652 




gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens


437354 


AA749215 


Hs.291886 


ESTs 


412228 


AW503785 


Hs.73792 


complement component (3d/Epstein Barr vi






Hs.193521


ESTs 


439724 


AF086565 


Hs.60351 


EST 




BE0B8812 




gb:CM2-BT0693-230300-129-dC6 BT0593 Homo 


453652 


AW009640 


Hs.28368 


ESTs, Moderately similar to S65657 alpha 


434361 


AF129755 


Hs.117772


ESTs 


442479 


AF059484 




gb:AF069484 Homo sapiens astrocytoma lb 


413554 


AA319146 


Hs.75426 


secretogranin II (chromogranin C) 


459323 


AW062490 




gb:MR0-CT0065-100899-001-d01 CT0065 Homo


449438 


AA927317 


Hs.554 


Sjogren syndrome antigen A2 (60kD, ribon






400285 








2.0 


407407 


AF050198 




gb:Homo sapiens putative mitochondrial s 


2.0 


411459 


BE142707 




gb:MR0-HT0157-191199-002-g12 HT0157 Homo


2.0 


417383 


W02642 


Hs.136102 


KIAA0853 protein 


2.0 


447153 


AA805202 


Hs.315562 


ESTs 


2.0 


447313 


U92981 


Hs.18081 


Homo sapiens clone DT1P1B5 mRNA, CAG rep
gb:RC0-BT0362-021299-031-b06 BT0362 Homo


2.0 
2.0 


455696 
456510 


BE067870 
AK001652 


Hs.99423 


ATP-dependent RNA helicase


2.0 


449815 


AI671000 


Hs.199739


ESTs 


2.0 


425398 


AL049689 


Hs.156369


hypothetical protein similar to tenascin 


2.0 


400238 








2.0 


451678 


AA374181 


Hs.26799 


DKFZP564D0764 protein 


2.0 




AW291389 


Hs.13056 


hypothetical protein FLJ13920




455221 


AW867751 




gb:MR0-SN0038-290300-001-a03 SN0038 Homo


2.0


413174 


AA723564 




ESTs 


2.0 


435810 


BE349853 


Hs.2785 


keratin 17 


2.0


418687 


R61550 


Hs.22581 


ESTs 


2.0


438563


AA810665 


Hs.134746


ESTs, Weakly similar to A46010 X-linked 




431750 


AA514986 


Hs.283705 


ESTs 


2.0


453242 


T98327 


Hs.18343 


ESTs 


2.0




AI286235 


Hs.128905


hypothetical protein FLJ13204 


2.0 








gb:yv34h09.s1 Soares fetal liver spleen 


2.0 


AK001432 


Hs.153408


Homo sapiens cDNA FLJ10570 fis, clone NT


2.0 


409929 


R38772 


Hs.172619 


myelin transcription factor 1-like


2.0
2.0


406378 
459208 


BE261314 


Hs.149039 


ESTs, Weakly similar to I38022 hypotheti


2.0 


445260 


AI218133 


Hs.147617 


ESTs 


2.0 


416240 


H99169 


Hs.23450 


mitochondrial ribosomal protein S25 


2.0 




AI205655 


Hs.147221 


ESTs 


2.0 
2.0 


402048 
41269G 


AW984439 




gb:PM3-HN0011-220300-002-c05 HN0011 Homo


2.0


416408 


R94725 


Hs.35354 


ESTs


2.0 


423347 


AI660412 


Hs.234557 






427836 


AA416642 


Hs.116176


ESTs 


2.0


433347 


AF023130 




gb:Homo sapiens Ras-GRF2 mRNA, partial c 


2.0


436902 


AW247145 


Hs.192729




2.0 


440122 






ESTs 


2.0


442901 


AI023654 


Hs.114191




444097 


AW517412 


Hs.150757 


ESTs 


2.0


447278 


AI934935 


Hs.158669 


ESTs 


2.0 


451361 


AA053854 




gb:zf52f02.r1 Soares retina N2MHR Homo


2.0


451813 


NM_016117


Hs.27182 




2.0


454423 


AW603985 




gb:RC4-CN0048-140100-011-a04 CN0048 Homo


2.0 


458801 


N98648 


Hs.276860 


ESTs, Weakly similar to C Chain C, Human 


2.0



TABLE 30B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



407594 
407676 



1017498J 
1017982J 
10180J 



AW044083 AW044094 AW370634 
AW057584 AW057585 AW044153 R34370
AW205369 AW058599 AW207508 

YV2735 1 NM^00358lA/ rt " 01 38 141 1 R41419T16717 AA002193 H62028 AI359545 AW105201 AW087158 AA699728 A1095264 
AA002065 H62029 AI289101 AA884804 AA904950 M609672 AI139874 H77896 

AA435679 AA470655 H22526 AA044031 M876426 W63767 A1421 140 AI418990 H42329 H88910 AL041066 H88909 W94610 AW352277 W94648 
W94167 AW952568 AI419553 AA335501 AA3936-1 AA ) 3 I 11 2 F 2525 R58582 AW297645 C75230 AW368034 AW468904 AI272755 
AW235638 AA346882 AW856803 AA361281 AW963163 AA044373 AA136755 
BE008347 BE008320 BE083307 BE083311 AW075968 

BE514982BE614814AW393078AW2? I OA 05 637 N 644 AA64115B MB7068 AA1 51019 AA161003 AA587315 AA716746 AA593632 
AI354870 AW1 83492 Y07755 NM.005973 AW872948 AI608987 H64556 AF086003 AA6431 49 AI819402 H64555 AA858398 AA594885 AI436747 
BE122811 AA587026AW857106A1953579AA5S6012M654004 AW238238AI971395AA459074AA458884AI608591 BE181995 BE181970 
AW1 95797 C00271 BE1 82043 
AW081681 N36967 N36959
AW082279 AW082688

AW190902 AI802788 AI041650 AI037867 AF110137 NM 01 3372 H99469 N35377 AW151676 AI678451 AW078795 AW087935 AI884505 
AW044602AB032372AF 5B0OAI621 I7505 JJ AI3104^7 4A373348 AA373673 AI752124 AL359060 W43619 AA373298 AA373975 

AW604409 AW604404 AW368603 AA545734 AW36141E AA373557 AW9561 64 AW853798 AI750595 AL359059 AA344024 N31127 AW580737 
N27040AA071133AI751 5 1801 W48E J AA112124 AI940705W52686 AA084001 C01826 AI940729 AA373544 AW957491 

AW383224 AW3831 54 AW383' 92 AW3831 25 AW3831 94 AW842507 AI940795 H80042 AW531 1 1 9 W47324 W428B4 AI750594 AI754387 AI753734 
AA372833AW069006 A^ r - \. M ( r - i 0082 AI041803 W51909 W25447 AI521673 AI087351 AA670070 W47325 AA071381 
W42791 AI090390 AI0 51 i 12 -TP 8723 AW069430 AW0B9485 AI754608 AA373014 BE140150 BE140166 BE140102 BE140143 
BE140157 BE140117 Et 0( 05 3 BE i 1 I E. L140 -6 BE140171 BE140144 BE140175 BE140160 BE140152 BE140099 BE140177 
BE140167 BE140145 BE".40109 BE140".63 BE140172 EE140161 BE140179 BE140147 BE140107 BE140146 BE140155 BE140173 BE140148 
BE140174 BE140158 BE140149 BE140", "6 BE140156 BE14C105 BE140103 BE140164 AW138508 BE140153 AW806557 BE140121 BE140162 
C01953AW806500BE140124AA329219AW9b 2 W39165 AI9661C N21 ^I754594AW069264AI754660BE551937AA543066AA703927 
AW1304B6 AI814434 AI924946 AA789056 AW173667 AW06964 1 AI022235 A1753523 AI753558 AI753482 AW063940 AI753002 AA669866 




407834 
407851 
407884 
407938 



408112 
408172 
408221 



1020170.1 
10212J 
1023626.1 
102759.1 



AI753593 AI753469 AI753506 AW008360 AI753255 A19491 1 1 AI752123 W19275 AI679005 AI888455 AI677772 , 

AI754023 N20040 A17M354 AI752878 AA836970 AW631233 A1440410 AWO',6646 AI801326 AI610424 AI521669 A1446171 AI453455 AI753087 
AA670062 AA59S863 / 1 > J A!80 71 ' r 25AA66 8 A A 'I AIB11571 AI309543 H80043 AI623845 AI623852 AI075634 AI919521 
AA729459 C01795 AA577421 AA668620 AA600003 M653400 AI445339 AW468974 AW130343 AA600104 AW970482 AW806615 BE140122 
AW084991 R54179 N64486
NM_014496 AF184965 H82896 H82897
BE075316 AW090150 AW589417 H47391 D61911 BE173252 

AA905097 AI734242 BE044545 N86431 AA34404", C03245 AW953097 AI567324 AA045934 AW079233 R58274 AI744425 AI167427 R58176 
R58593/I9o99 ( 5^i TC13864AI2D455? ^ C C C 648 N84115 AA897468 AI668637 BE/'' '53AI355307 M139542 N80934 

10298 1 W2148 U1 r 9) 1 0063081 W126673M4265C7Y17782W24740W05062 W21042AA649552AA093507AA092088AA476830N88299 

AA429090 AA09564 a 2491 A443775 \ 204315 AI333690 AAS86584 AI27533? AI193915 AA659647 T82641 AA1 36048 T82643 T82647 
F36041 N74099 AI572217 T82653 Tb f 82 0 R473 0 R47361 R47357 N74672 R47356 R47355 N93086 R47354 R47353 R47342 R47362 
R47358 N93441 N56384 N93566 T82622 T82618 
103177 1 AI094748 AW241797 AI759976 T10509 AI814441 AI421977 A1423762 N73686 T1 1 392 AA04640B 

10311" L11690NM_001723 M69225H999L C A r j» BE000:9 ' 618 N 237 J2i r 395 AA088909 N21507 X58677 U04850 AWB64903 

A1830854 AW361101 AW379356 AA301 170 T29232 AW96599B H26216 H44230 H44784 AA808916 AA514765 H44575 AA705179 AA586735 
AW589403 H25843 AA654993 AI783826 BE087370 

1039850.1 AW451982 AW157117 AW161544

1044105 1 W02488 AW167479 AW361027 AW833243

104826 T AA912183AA541622AI673341 AI2661 09 AI335896 AI421 010 A^053327 AI982962 AI372854 AI674348 AA531087 AIB0101 6 AI372855 A1817816 
N52151 D62928 AI401633 AI378549 AI378800 AI337972 D62536 AW972977 AA135826 BE222491 D52195 BE326453 AA225293 AA236461 
R85586 AA625141 AA058641 AA135825 M234644 H50784 R71863 

104868.-1 M053381 

104872 1 W95901 AA053387 AA121501 

1048 f Y00787 M28130 D14283 BE439795 BE439924 AA362187 BE439515 M17017 C05929 AA381897 AA346136 AA381472 C06304 AA381670 

M381679 AA381965 T1 1274 AA381318 AA381601 AA381700 AA381952 AW950718 AA331992 D82198 M381914 AV653399 AA362140 
AA381592 M381967 AA381454 M26383 M362188 AA382057 AA381781 AA362246 AA381669 AA3B1395 AA381845 AA381848 AA381 603 
W40425 AW663238 AA381702 AA381355 AA381356 AA381653 AA362127 AA374516 AA381426 AA381767 AA362270 AA381459 AA362358 
M362160 AA304441 T27482 D83848 AA381B77 AA381623 D83851 AA362133 M381999 AA372837 AA362192 AA362196 AA331806 D82132 
AA362107 AA363589 AA296403T11316 D82218 AA381456W56111 AA381435 W40163T11379W40283 D82615 AA3B2121 W40401 AA381699 
T63485 D62C v T 7 C C 3813Z116B6T10915 AW367405 AW604306AA363550AA363514BE004230AA381218W40279 

AA363545 AW377023 AA363453 AA363559 AA331833 AW850620 AA381B13 AA381710 AA381584 AA3B1577 AA381715 AW376990 T11334 
AA305271 AA363554 W52491 AW631242 AA363599 D82254 AA363540 AA363596 AW9511B0 BE122704 AA3B0965 AW850702 AW630501 
AA381229 AA381228 AA3B1S76 AA3B1795 AA3B1816 AI274317 AA360958 AA3B1517 AA38161BAA381B91 AI310352T11037 BE122702 
M834388W38334C01358AVV059B70AA371394AW075973AI922031 AA381189AW468452AI375262AI870149 C06055AA381309AA381625 
AA381252AI620395AW467642AJ2279-3 AW780131 AI74227BA1564251 C75559AW272612A1634748AW339521 AI07924BC75557AI569162 
C75493 AW591509 AW769268 C75524 AI459809 A1579895 C75486 AI697109 AI475413 AI355474 AI282685 AI087B73 AI469815 C06100 
AA568576 AW014817C05914AW272671 C75615 AW337841 AI3805BOAW467270AA847352W46290AW780211 AW440851 W45324 C06017 
C75643 AI346866 W45040 C05698 C05704 AA381931 AA381 1 77 AA382015 AI476632 M381 274 C05758 C75413 W45362 AW951601 C05910 
C75382 W60423 AA095727 C7541 0 AW1 88453 AW341742 W60450 AW613454 AI459266 AI473944 BE042737 AW2731 12 AW467973 C05693 
AW272835 W45409 C05663 AI431 254 T291 57 AI953422 AW510601 W45271 C75538 BE138893 AW662358 AA102526 AI457250 AW615017 
AW662707 AW051250 AW470168 AI952442 AA362337 AW015490 AA362269 BE122713 C06363 C06353 AW779405 T68419 AA3B1217 M362139 
T1 1 14C T1 1 198 AA363591 AA3621 33 AA362357 AA352344 D56519 AA3621 32 AA361981 M362159 AA3621 10 D83856 DB2102 D83842 

1050255 1 W/1 \Ui ,\Wdfb2AW173156AW178172 AW845893AW178159 AW178222AW17B213 AW178215 AW1 78090 AW178091 

AW178161AW17B207AW178210AW178214 AW178212BE140918 BE140917AW178135AW178205AW178209AW178223AW178220 
AW178206 AW1782D3 AW178165 AW178168 AW178160 AW178136 AW845B78 AW178131 AW178138 AW178105 AW845894 AW178129 
AW845810AW84582BAW178216AW1 78112 AW178211 AW178224 BE140915 AW178221 AW178130 AW17B134 AW178096 AW178108 
AW178133AW178164AW178218AW178171AAn7E157Aft178158AW178103BE141189AW178170AW845B16BE141586AW17B155 
AW178104AW178163AW178093AW178208AW178137AW178140AW178219BE141592AW845901 BE141580AW178155BE141598 BE140957 

1052056_1 H91230 AW182672

1052079_1 AW514652 AW612492 AW182595

1052676_1 AW183350 C03487 C05115

10566 f AF070571 H70949 AW149569 AI804276 AA936447 M987912 N35082 N35065 N43822 N43804 H69776 T80170 
1059830_1 H09701 R35799 R19028 AW196589
1064578_1 AW207553 BE221925 BE551043
1070070_1 R61736 AW242757 R53961 T33418 Z45363

107722J 1 m%mM^mLmSlWa M166917 AA055984 AI206035 AW023639 AA195520 M329675 AA678589 AI784331 



BE296227AF008551 Z43879AA313688AW4O9753 M2985O1AW957623BE299015 BE093498AA431134N64029AI038260AW732512AI917458 
AA628301 AA291 514 BE092451 H63889 AI126792 BE092286 H721 66 H57643 AI884735 R96793 AW51 6673 W78726 H784B3 BE01 1847 AI63B439 
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AA463550 AW958131 AA329302 AW264037 AI979134 BE163190 AA252153AA148799 AA148530 

U46922 NM.002012 AA256122 T95733 R11128 R05417 T85004 R08282 U76272 AI086356 AI741386 AI338651 AI024929 AI400094 AI401445 
AI803972 A1493307 AI245155 AA91 8722 R50713 M35686 A1278165 AW025706 AI885436 R8B314 Al 582390 AA2561 23 AI468521 R53094T89183 
AA621493 AW372683 R05964 AW819303 AI417763 AW393203 BE247001 AA864207 AW819302 AW393209 AW393195 AI870522 R05963 
AA380267 AL117573 R5147B AA131283 AA305730 AW960438 T27291 AW504410 W77831 AA243239 AA426041 AA92763S AW792877 AA055133 
AW291315AI589104AI949122AW02" 6 A'27S M r r E AA131190 AA723731 N86864 AA091367 AI738620 AA345291 AA774160 R51372 
AI373904 R37681 W72195 AA476597 AI025596 AI242078 AA384072 BE349121 AI203442 AA885019 AI424214 AI9S4081 AI219465 AW103904 
AA017216 AW591839 AI890043 BE247650 AW793376 AA224322 AI784198 AW190649 BE247646 AA055073 D19892 AA428076 AA165625 
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456472 
456481



R34233 AA469355 AI30214B AIS90055 AA946756 AI697829 BE218045 AA027965 AI734957 AA436215 AW362506 AA469277 AA436214 

AW362508 R79964 AW362504 AW362505 AA027984 AI417795 BE0S6043 AW967592 AA165661 AA251567 C05877 R19957 R53161 R52318 

Z44555 BEO6603O AW769414 R35560 H08481 M017309 

H91010 N39092 N48447

AW299698 BE270312 AI333779 R68946 

AF154846AA400924AF075093 AA626790 AI202783 H53869 H 53499 AI0011B3 AA714166 L32164 AA768900AW182419 AW977136 AI915355 

AA994468 AI377886 AI654131 AI674302 AW074683 

AA224872 AI791642 AA22830B AI821503 AA230172 AA224838 

U00803 NM_002031 AW195632 C05922 

W28481 

AW957500AW966876AW976565AI417020AI057022AA25131SAA744820AI806037AW956871 AA251713AF080576T49825 AA281037T49826 
AK001714 AW960785 AK001 831 BE07531 1 AW273355 AA928445 AI698734 AW977909 AW268863 AI392846 AA81 1 609 AI565035 AW007245 
AI754385 AI092244 AA906509 AW438768 AW751668 AA459681 AAB61 761 AW1 1 8314 D63077 AI537049 AA78271 8 AW050550 AA430988 
AA1 30556 AA383461 AA337522 M130593 
AA258033 AA459485

AK001652 AKQ01245 AJ010840 AA443492 AA361206 AA334559 AA20521 6 A1990275 BE075308 AA927343 AA372625 AI572799 BE466582 
N30525 M813191 AA456509 AA854999 N30537 AA463668 BE1 49580 AA301443 AI366930 AA010769 AA248189 AW043957AA011050 AI193868 
AA508116 N20812 AI190726 AW663652 AI161405 AA909896 H52712 Al 081806 AI862934 AM 67935 AA460738AI345960 AA460140 AI346920 
BE565501 AA456419 AA953554 AA806622 A1242924 AI619442 AI082684 AA947786 AA601594 BE0441 19 BE349995 AA091744 AW959301 
H70392 

AA279143 AA419462

AW135986 BE467356 BE221523 BE468135 AA262464 T70755 

AW5921 87 AA284280 W47266 AW972765 AW059452 W47267 AI186593 AA526308 AA525432 

AI858634 BE465909 AI057197 AI968927 AI969377 N21608 W25410 AI792468 AI734237 N28843 N58992 AI820B02 N93619 AI668704 AI367790 
AI204569 

AI827786 AW303478 AI216657 AW592056 AA928804 AA890107 
AA292862 AA398885

BE247203 BE244171 AW205198 AI245208 Al 284388 BE048356 AW293524 NM.016014 AF151825 AI026799 BE048322 AI558420 AI041696 
AI79631 3 AW003505 AI492609 AI564777 AW5903BB AW236586 AI1 91469 AI571 792 AA490835 AA843378 AA8431 50 AA932149 AA6871 36 
AI829606AA490836BE005634 AW965621 F12793R56751 R52429 T75124 R58675 AI205826 AA258787 

BE467912 A1744S57 AA5S4436 AI64074S A1077333 BE221 602 Al 538508 AA96B61 8 AI671 472 AI290360 A1863656 AI91 0555 AW1 041 95 AI648463 
H24708 AA280869 AA279428 AA987634 AA480878 AA836072 AW87791 1 AA521 155 AA305609 AI1 1 0686 AF063561 AA480937 AA281 170 H96495 
BE005402 AW367567 AW367372 AW367351 A W367585 AW601 230 AA8471 83 AA442049 AW89601 3 BE568756 T09279 T35094 AA287495 
Z42413 AA328761 AW139131 AA252554 AA323777 AW950315 AA999676 AI815004 AI203B69 AI016834 AW1367B2 AA999661 AAB25909 
AW071 102 AA8261 27 AA838505 AA885039 AA287392 AI470496 C01794 AA249575 AL1 1 8B43 BE53861 1 R56578 R5601 4 R1 7173 AW135076 
R45909AW370767 R7-~" 



S78234 T87689 NM.001256 U00001 M78440 BE280314 BE56856B AI075160 W31 156 AA256407 AA255817 R16014 AI203880 AW1 17283 
AA504316 AW105046 AW37047B A1378553 AA7321 00 AAB15109 AW297995 BE5D1979 AW2934B0 AA972D91 BE0B2548 N98929 AA255B18 
AA256301 AI559887 R161 15 AA3B49B2 AW949B7B AA0B5260 AL1 35314 AA21 1035 AI6B9616 BE004600 BE004737 BE004597 AW800545 
AW197262AA996213AW452890 AI458977 AA 1 88860 AA491 202 BE3271 43 AA489098 AW172396 AI399903 AA830687 AI916706 AW860246 
AW006356 AI288363 AW193267 AA670452 AA445924 AI276764 AA428407 AAB36037 AI886571 AA922445 AA557273 AA599272 AA937432 
AW594254 AV650497 AI889557 BE221 1 49 AA934793 AA373059 AABB3B9B AA491 005 TB769D AA1 67622 AA137635 AA0B4655 AA207057 
AA633879 AA706479 AA706213 AA706197 AA679805 AA683208 AA679343 AA613172 AA679833 AA085131 AA679793 AA601666 AA210946 
AA583228 AA601229 AA084718 AA584501 AA224217 AA706452 AA4B9324 AA587310 AA082786 AA586B92 N2307D AA21 1405 AA0B2906 
AA489789 AA209324 AA206G31 AA167B35 AA188956 AA566070 AA1 9971 B AA214195 AA1 1 3B62 AA206791 AA633796 AA223986 AA0B3048 
AA085009 AA085307 AA207042 AA21 4235 AA 1 66892 AA1 88689 AA21 1 1 01 AA706139 AA773961 AA773972 AA1 89090 AA66859 1 AA21 3387 
AA206041 AA199719 AA223995AA085386AA199670AA199904AA199551 AA084336AA167637AA666175AA223978AA707341 AA214110 
AA633741 AA205951 AA214421 AA666402 AA188669 AA161379 AA0B4747 AA21 1 176 AA664225 AA169244 AA714142 AA579457 AA66B573 
AA206000 AA780452 AA454542 AA706364AA706507 AA705570AA722308AA707242AA704053 AA707423AA722261 AA703686 AA703777 
AA707238 AA706366 AA707433 AA706380 AA703761 AA705572 AA7O4109 AA706275 AA7041 17 AA707250AA774043 AA707349 AA6794B5 
M214199M666241 M779M5M668507 M668597 T03211 M664313M214112M66B51BAA214159M214297AA0B5030AA2110BB 
AA209254AA214326AA213476AA205166AA223713AA223626AA213473AA199706AA224108AA223973AA167729AA213641AA211125 
AAigg730AA223972AA161387AA668335AA434073AA679431 T92447AA085177AA633986T49882AA214251 AA085153 AA214425 AA223838 
AA160951 AA224107 AA214341 AA3725B1 AA3091 69 AA1 99705 AA346242 AA21 1015 AA2241 56 AA0B3161 AA1767B7 AA224038 AW374738 
AA224043 AA209284 AA224039 AA213681 H55496 AA213640 AI267801 AA206853 T82077 BE150566 AA81 1950 
AA397546 AA699372 AA699363 AA399064 

AI301740 AL119574 AW504484 BE560571 BE2691 12 AW897591 T31233 U17279 AW504792 T32181 Z4733B T0571 1 AL121247 AW504876 
AW1 51725 AI204992 D38743 D38651 AW8951 30 AA985067 AL120318 AA326654 AW955387 AA775607 D38742 AI26781 2 AL03591 0 BE008426 
BE008367 AA423862 BE348785AA249436 AA326809 AA325529 AW361428 N24212 AW302831 AI583363 AW470460 AL038861 AA234531 
AI376141 AA194854W35263 N3924BD52746 D55494 AA083749 D52828 T30063 T31624 D54617 AA1 12066 BE01 8814 AWB82900 M78702 
T31558 T31691 AW859799 AA31061 6 AW389747 AA317897 T31 1 58 AI077640 AA386250 T1 0323 D53660 F05613 AL1 20458 H1 5866 AW895727 
AW895802AW899191 AA128798 N31267 H14732BE208125 N64235 H97218AA236449BE087062F11649F11648Z42310T32845 U47717 
H43263 Z44143 T05325 T1B335 AL13B10B H84437 AA1 97265 N46693 AW0202S9 AA070B47 AW022743 AWBB5792 M78703 AA486340 T07147 
N36340 T35393 AW841 129 AI288455 BE005370 AI497892 C14978 AA487674 T30747 AW022976 AA860826 AA730653 AA857741 T08861 N67128 
AI127B18 AA88691B AI424496 AI004685 D550B4 AA747954 AI128832 W23585 Z41668 H8443B A161 1575 AI350797 AA502260 AI364537 
AA638970 U97105 NMJ01386 BE009933 BE546254 BE396357 BE513B31 W26313 H10568 T97273 T33131 T07524 AW276666 R15142 T30810 
F07479 T3381 2 H2421 9 AV647683 AV646892 
H29990 N44636 N44265 AW292774 AA398365 R92869 AA403200 

N77624 AW607635 H03232 N46965 H94749 AW382351 R02203 BE159946 N56B08 AA045401 AA046564 AA658111 N59303 H47303 R09001 
AW023738 AW507549 H68307 R73086 W6732 1 AA1 26540 R27493 AA40021 2 AA044323 AA32941 6 AL037969 N54710 AA1 01 647 D60371 N58245 
D61743 H46463 AA126414 BE501495 AI765781 AW237498 BE219266 AA716110 AA031291 AA400123 C16598 AA045283 C16444 D62466 
AW189929 AI143224 AA530868 AW194998 AW593426 AA813315 C16384 AI332909 AI335037 D57149 D45498 C16198 C16113 AA705203 
AA777104AI360091 C16168 N89583 W57248 AI469064 A1753912 N92420 AI862009 AI093246 AA101648 AI138331 AA296430 AI302151 R25756 
N34045 AW338083 AW139636 AI559099 AI184816 AI200476 AI127576 T82460 AI358362 AI435204 AI478453 AI093082 AI469352 AW338517 
AI200250 AI031 694 AI887286 AI03375B AW080370 AW0291 40 N32140 N52949 N50055 H47004 N76054 R51 887 R73087 AI160363 H6401 8 
W8874BAI0B1703AA149098 R40542 N77209 H04380AA234491 R08896 D57030AI767209 AA600348 N70842 AA044242 AI888195 AA777676 
AI53875B C16174 R38565 AI799219 H7431 2 F10055 W37697 F10343 AA610106 R52523 R22142 H88895 R77674 R44673 AA045199 R31 568 
AA505916 AI360773 F10337 AA733130 AI016479 AW1B3400 A'A779199 D58102 H64348 D57909 R26653 AW051234 Z41044 AA876777 F03579 
AI680446 AA745688 T24075 T35 1 1 6 T351 1 5 R02091 Z30229 A1872774 AW772549 AI54041 1 AI784282 R25646 AA541 81 7 T77442 R26596 
BE1 66424 T77229 R23696 

AA399018 AW242359 AI868208 AW020951 AA412180 

At026157 AA838752 A1146272 AA9 1 0825 AI266100 N24580 H97503 BE327388 A1697814 W56675 AA423842 AA423880 H40134 
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457314 
457460 
457506 
457620 
457630 
457652 
457653 
457741 
457756 



37972_1
37979_2
395757_1



AI932995 BE064464 AW371 902 AW371 841 AI885885 BE064457 M5241 1 3 AA721 037 AA504343 AA778099 AI800598 AI6931 1 2 AI864633 
AI690228 AI400990 AW9690B9 AW371 927 AW371 912 AW383562 BE151089 AW383568 BE218503 AW383570 AW371899 BE151097 AW371900 
AW293095 AW292008 M4341 79 M714780 R45868 W011 B2 AW957767 AW1 1 9223 AI207864 W01 578 AA354403 M8051 77 AI613299 AW269636 
AA461528 AW079101 AF131777 R60489 T31289 AA481594 BE161020 AA465433 AW808125 T84992 AA749191 
AA436837 AA442594
AA443927 AA444106

AWB20035 T20260 T20259 AL049415 AA737756 W46965 W00799 AW340968 AW027417 AW263261 AI420674 AA814921 AA736509 N69991 
AW368643 W47065 AI090172 AI924139 BE468071 AA375842 AA375767 AW628849 AI422731 AA494558 AA969233 AI142954 AI161089 AI167233 
W94484 AI681 576 M249694 AI695943 AA332347 AA475621 AA937792 AI702870 AA455748 AW195100 AI638530 BE502479 AI383418 AA039630 
C20777 

AA479597 Z45151 H28821 

AI143312 AI394343 AW205239 AA523980 AI831223 AI3471 80 BE2631 97 AA765781 AI654500 

AF131757 T79901

M602711 BE078290 

AI680803 AI703329 AA609004 AI305245 Al 457796 AW295787 

n IP M33134AW 2 r AW885447 H83251 AW838349AW838378AW838175 H83252 

AI82071 9 AI27351 5 AW592687 AI263784 AI351926 R46866 

BE044740 AW827360 AW827623 BE161439 BE044718 BED46207 BE046551 AA653908 BE1 86581 

AA126136 H62964 BE245159 AF280094 AA431918 BE386201 M707576 AI074267 AA969194 AA693596 H62844 AW1 34991 AA126014 AI077443 
AA699881 AI037956 M961277AW204185AI540791 AI273273 AW627957 AA926890 AI014851 AW081056 AA443705 AW137571 BE139390 
AI583851 AI583822 AI583814 AW268341 AI590502 AW302642 AI053B71 AI254692 AI591255 AI590260 AI583359 AI583341 AI583752 AI224227 
AW302089 AW466960 AA810124 AW302684 AI272921 Al 1 41 003 AW589738 M074714 AA621 482 AI796501 AW024557 AA621 074 T62627 
AA639206AI913538AA075135 R54613AA352975AW205892AI868280AW449243AA907317AW134573AI590492AI610050AI834309AI375556 



457900 
457938 
458023 
458037
458043 



458684 
458734 
458762 



498744_1
517913_1
526654_1
543175_1
555234_1
579269_1



633259_1
662297_1
662652_1



73076_1
731371_1
737397_1
75398_2



764459_1
777505_1
798085_1
812733_2



459037 
459082 
459124 
459207 
459208 
459275 
459278 



AI373638 AI073389 AI087143 AA764776 AA913318 

AW978161 AW978165 AI016938 AI539270 AW294958 AW51 1089 AA814849 

AF074982 R27905 R31333 R31591 R27812 

AW979009 AA828038 AA828148

AA393603 Z19481 AA252342 AI807614 AI913804 AA040176 AA971879H53388 AF085972 AI291424 H53349 AW015078 AA768307 AA127921 

AA723700 AA040841 AA993954 AA21 3655 AA127972 AA9 13063 BE327712 AI017585 AA9881 86 AAB2B1 83 AI205930 AAB33558 AA9741 07 

AI004390 H48931 AA724004 

AW296024 AA897109 AI015000

Z40670 BE504110 BE219908 AW468668AI002334 

U48351 AA9GS182 BE327312 AW138276 BE457567 AI680815 AI422668 

AI264628 AF017648 AI872732 

AI024865 AI024877

AI084514 N46645 AW183984

AV64831 0 AV661871 AI928475 AV64781 9 T55845 

AI185703 AI805813 AW292764

AW136139 AI216724 

AI305223 AI458577 AI275569

AI362790 AI275996 N48887 

AI299789 BE551384 

BE281 1 1 5 T53860 AV559439 AV659421 BE539929 

AI554946 AW362008 AW362553 AI683342 AI376781 

AW802754 BE619228

AI69341 7 AI416256 AW627792 F32979 

AW2951 51 A1425004 AW470228 A1693738 H50554 R99198 H50553 R991 97 

AW001835AW612725AW136670AI798956 BE46736B AA280216 AI216754 BE622057 AL121193 AWB53470 AWB53450AW369075 AW369108 
AW578479 AW369106 AW361242 AW361190 N79183 

N98648 AI458157 BE041652 BE218014 BE622355AA369340 AA369515 AW962780 AW962704 
AI522129 T56009 R53849 
AW236702 AI566105 Z40396 



R15891 R61471 R61469 N69765 AI014624 M007214 AW592075 H09780 AA709038 AI335898 H11055 AI559229 F09750 T72573 AA935558 
AA988654 AA826438 AI002431 AI299721 

AW968226 Al 139249 AI701692 AA01 7303 AW469622 AA2591 48 AA81 1 690 AA807995 AA744260 AAB24494 AA731 71 0 T25332 AA258101 
AA970687 

AW439497 AI826059 AA018402 AA837392 
BE551721 H51878 AI823338 C01 488 AW8 13562 
AW301478 AW301560 AI889207
AW138410 AI912712 N40185
BE261 31 4 AI243406 AA027322 

AI80891 3 AW028342 W81 290 AI571379 AI382808 AA037071 W79688 R48751 N31808 AI870233 R48752 AW024895 AI333754 
AW294659 AI204928 AA351653 H51220 R85843 AA993182 U79298 R15294 F05089 Z429B3 R17818 T77498 AA332319 W56049 AA331 586 
AW881873 AW881865 AW881876 R52345 AI652070 AA400044 AA401512 T08151 W05486 N68378 T33846 AI190920 R43021 AI949980 Z39084 
T63413 W37269 F01343 R86669 AI621055 AW117593 AI19321 1 AW297932 AI500709 AA400056 



Table 30C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of
human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons.
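The exon-position column defined above holds comma-separated start-end ranges (for example 13558-13721,13942-14090,14554-14679). As a minimal sketch, assuming the coordinates are inclusive and that each range is written as start-end, such a field could be parsed as follows. The function names are hypothetical and are not part of the source.

```python
def parse_nt_position(field):
    """Parse a field such as '13558-13721,13942-14090' into (start, end) tuples."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Example using a row from Table 30C.
exons = parse_nt_position("13558-13721,13942-14090,14554-14679")
print(exons)
print(total_exon_length(exons))
```

A field containing OCR damage (stray spaces or a period in place of a hyphen) would raise a ValueError here, which can serve as a simple check on extracted rows.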

Pkey Ref Strand Nt_position
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400696 
400697 
400706 

400842 
400859 
400861 
400889 



401200 
401240 
401324 



9887666 Minus 

9929646 Minus 

8247025 Minus 

8117693 Plus 

8118496 Plus 

8118812 Minus 

8118812 Minus 

7249204 Minus 

8569993 Plus 

1927148 Plus 



9965518 
8705350 
9690246 
9743387 
3355450 
9863791 
9796180 
8670914 
9212270 
6682291 
7381770 
7-?Q5Ci 



197610-197785 



401645 
401673 
401694 
401785 
401878 
401887 
401899 
401986 



402112 
402131 
402145 
402318 



7229981 
7230209 
4406829 
4309964 
4156128 
8072512 
8117410 
7249154 
8139750 
7704961 
8018280 
7582559 



151513-151662 

56203-56313,56424-56432,57073-57185,57513-57593,57747-57941
4786-4992 

13558-13721,13942-14090,14554-14679 

77737-77899 

79073-79198 

78299-78686 

161221-162078 

90462-90673 

91888-92018,98131-98294,99474-99570
163855-164016 
169782-170036 
45682-45831 
85632-86174 
85679-85795 
90815-90929 
111586-111806,11 
77433-77636 

119572-119672 



11-114915,115419-115583,116351-116446,116847-116907,122853-123067,124982-125407 



182001-183323 
112763-112909 
92607-92813 
91570-93177 



155620-155815 

31137-31293 

118611-118821 

2398-2513 

43936-44078 

128316-128627 

104806-104939,106898-107052 
10607-10713 
33114-33209,33 
113086-114800 



402528 
402603 
402615 



169609-169742 
141663-141852 
131390-132157 






403310 
403317 
403329 
403341 



8139936 
8318526 
8516120 
8569175 



7556757 
7656757 
7960440 



146568-146659,147539-147811 
171964-172239 
157618-157755 
58039-58149 



92839-93036 

155310-155436,158402-15 
173358-179553 
152431-153243 
46652-47332 



403625 
403637 
403678 



142647-142771,145531-14 

119573-120430 

88280-88463 
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30 






403776 
403780 
403786 
403891 
403937 

404043 

404108 



404367 
404404 
404414 
404416 
404420 
404443 
404453 
404476 
404518 



404708 
404731 
404767 
404793 



405001 
405008 
405090 
405120 



405455 
405456 
405494 



405738 

405780 
405783 
405784 



405970 
406005 
406018 



406378 
406413 
406468 



1414-1513,1624-1756 



8152087 

9795980 
7249169 



9796969 Minus 



7882827 
7232206 
3810614 
6911603 
7407927 
7523744 



9454543 
9454643 
5836195 
5815499 



6758795 Minus 
8247789 Minus 



6758904 Minus



73028-73217 
191508-193220 
12609-12773 
5140-5208,8633-8763 



18123-18766 

63603-64942 

86147-86509 

94185-94322 

67928-68109 

64270-64633 

114391-114628 

82112-82244 

143127-143398 

143042-143216,144704-144853,145800-146048

129817-130586 

87198-87441 

27768-29179 



104257-104348,104822-104970



69781-70096 
40059-40210 
74922-75788 
16764-16900 

99433-99528,100035-100161 
108172-108296 
128092-128227 
77522-77658 

168609-168781,182951-183081 

23244-23759 

61087-61590 

7541-8132 

37948-38226 

147612-148011 

89944-90729 

72406-72600,72779-72856 



121688-121840 
977-1116 

90716-90887,101420-101577 
134112-134671 
150052-150208 
70284-70518 

65096-65247,77508-77637,81242-81354,84245-84395

114550-114688,117265-117407,119490-119599,123237-123395,131140-131217 

117070-117270 

66822-66925 

17856-17957,18302-18412,18837-18927,22790-22989



61379-62562 



153933-154060 



27238-27885 
77798-78000 
15628-16127 
120621-120971 
163112-163652 



37795-38168 
89972-90319 

251370-251797,252168-252882 



111718-112008 

30084-30770 

64689-64798 
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ABOUT 1884 GENES UP-REGULATED IN IPF COMPARED TO NSIP 



TABLE 31 

Table 31A lists about 1884 genes whose expression levels are up-regulated in idiopathic pulmonary fibrosis (IPF) samples as compared to non-specific interstitial pneumonia (NSIP)
samples. These were selected from about 59680 probesets on an Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" idiopathic pulmonary fibrosis
expression level to "average" non-specific interstitial pneumonia expression level was greater than or equal to about 2.0. The "average" idiopathic pulmonary fibrosis level was set
to the 90th percentile amongst idiopathic pulmonary fibrosis samples. The "average" non-specific interstitial pneumonia level was set to the 90th percentile amongst non-specific
interstitial pneumonia samples.
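The selection rule stated for this table (a probeset is kept when the ratio of the 90th-percentile level in one sample group to the 90th-percentile level in the other is at least about 2.0) can be illustrated with a short sketch. The source does not say which percentile definition was used, so the linear-interpolation percentile below is an assumption, as are the function names, the guard against a non-positive denominator, and the example values.

```python
def percentile(values, q):
    """q-th percentile (0-100) by linear interpolation between ranks (assumed method)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no expression values supplied")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


def is_up_regulated(ipf_levels, nsip_levels, min_ratio=2.0, q=90):
    """Return True when the ratio of the two group 'average' levels meets min_ratio."""
    ipf_avg = percentile(ipf_levels, q)    # "average" level for the IPF samples
    nsip_avg = percentile(nsip_levels, q)  # "average" level for the NSIP samples
    if nsip_avg <= 0:
        return False  # hypothetical guard; the source does not address this case
    return ipf_avg / nsip_avg >= min_ratio


# Made-up expression values for one probeset, for illustration only.
ipf = [100, 200, 300, 400, 500]
nsip = [50, 60, 70, 80, 90]
print(is_up_regulated(ipf, nsip))
```

Using an upper percentile rather than a mean makes the comparison sensitive to the higher-expressing samples in each group, which is consistent with the wording of the criterion, though the source does not give its rationale.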




ExAccn 
AW451709 



UnigeneID Unigene Title

Hs.271200 ESTs 

Hs.274419 hypothetical protein FLJ10244

Hs.9711 novel protein 



416653 AA768553 Hs.74170 metallothionein 1E (functional)

441233 AA972965 Hs.135568 ESTs 

415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t 

409632 W74001 Hs.55279 serine (or cysteine) proteinase inhibito

432437 W07088 Hs.293685 ESTs 

407266 AJ236664 gb:Homo sapiens mRNA fori 

423017 AW178761 Hs.227948 serine (or cysteine) proteinase inhibi 
403329 

429629 BE501732 Hs.30622 Homo sapiens cDNA FLJ13010 fis, clone NT

441519 AA972740 Hs.127092 ESTs 

453823 AL137967 gb:DKFZp761D2315_r1 761 (synonym: hamy2) 

406690 M29540 Hs.220529 carcinoembryonic antigen-related cell ad 

416379 N38857 Hs.203933 ESTs 

428862 NM_000346 Hs.2315 SRY (sex determining region Y)-box 9 (ca

407305 AA715284 gb™35f03.r1 NCI_CGAP_Br5 Homo sapiens 

434683 AW298724 Hs.202639 ESTs 

441802 AA968636 Hs.127877 ESTs 

431242 AA987742 Hs.251278 KIAA1201 protein

442377 AA993807 Hs.167367 ESTs 

420407 AA814732 Hs.145010 lipopolysaccaride-specific response 5-li 

428908 AW303529 Hs.144955 ESTs 

445898 AF070623 Hs.13423 Homo sapiens clone 24468 mF 

457673 AA551569 Hs.272034 hypothetical protein PRO2822

458771 AW295151 Hs.163612 ESTs 

AA385085 gb:EST98959 Thyroid Homo sapiens cDNA 5' 

AI948966 Hs.130017 ESTs, Weakly similar to JN0908 H+-transp

AW207091 Hs.72307 ESTs 

AW364104 Hs.143509 hypothetical protein FLJ21924

AA173632 Hs.22116 CDC14 (cell division cycle 14, S. cerevi 

AA457011 gb:aa90c11.r1 Stratagene fetal retina 93

AW021173 Hs.18612 Homo sapiens cDNA: FLJ21909 fis, clone H 



440504 
415025 
438557 
416128 
457242 
423629 
404793 
435563 
415672 



431622 
405523 
424693 



AF210317 Hs.95497 solute carrier family 2 1 
N53097 Hs.193579 ESTs 
AA102322 gb:zl90f03.r1 Stratagene colon (937204) 

AA367019 Hs.241395 protease, serine, 1 (trypsin 1) 

gb:QV1-HT0413-010200-059-o.05HT0413Homo 



AW979271 Hs.29. 



ESTs 



451519 
412474 
457081 
444827 
404822 



AA715013 
AA256753 
AA649051 
AA631498 
R10099 



AW976692 
AA250737 
AW811114 



Hs.47557 ESTs 
Hs.169835 ESTs

gb:zs22b12.r1 NCI_CGAP_GCB1 Homo sapiens
Hs.164007 ESTs 

gb:np83h04.s1 NCI_CGAP_Thy1 Homo sapiens 
Hs.269805 ESTs 
Hs.209573 ESTs 

gb:ni50c09.y5 NCI_CGAP_Ov2 Ho
Hs.180610 splicing factor proline/glutamine rich (
Hs.20416 ESTs 



Hs.291665 ESTs 
Hs.72472 ESTs 

gb:MR2-ST0131-111199-016-a04 ST0131 Ho
Hs.126154 ESTs 
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431374 BE258532 
409816 AW500954 
447613 AL041057 






Hs.251871 CTP synthase

gb:UI-HF-BP0p-air-h-12-0-UI.r1 NIH_MGC_5
Hs.33363 DKFZP434N093 protein
Hs.86379 ESTs
Hs.155397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr



439063 AF085922 Hs.11 
406053 

431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 

451830 H18433 Hs.21542 KIAA1035 protein 

416035 H42314 gb:yo09e02.s1 Soares adult brain N2b5HB5 

413849 BE173561 Hs.15384 AP1 gamma subunit binding protein 1 

459458 AW270957 Hs.254577 ESTs, Weakly similar to B34087 hypothec 

416154 Z46122 



AA437124 



AW511090 

X14767 

BE512572 



Hs.187247 ESTs 

Hs.89764 fragile X mental retardation 1
Hs.816 SRY (sex determining region Y)-box 2 
Hs.130419 ESTs 

Hs.89768 gamma-aminobutyric acid (GABA) A recepto 

gb:601452090F1 NIH_MGC_66 Homo sapiens c 



AW079064 Hs.245540 ESTs 



404957 
436445 
442617 
416045 
425178 
441918 
455464 
420929 
448844 
430686 
405229 



AV652066 
AL137758
AW812256 
BE297855 



H15990 

H16097 

AI733373 

AW983901 

AI694143 

AI581519 

NM_001942 

AA205015 



455747 
417420 
409545 
426750 
440615 
408959 
454482 
436508 
452046 



T85150 

BE296182 

AA383950 

AI733055 

AW890878 

BE147919 

AW604381 



446466 
405455 
411387 



415257 
441107 
419519 
410901 
426217 
424188 
456987 
405303 
414955 



AA917075 
AI198719 
AW810001 
AW131888 



on factor IMA 
. : mRNA; cDNA DKFZp434H245 (fr 
gb:RCO-ST0174-191099-031-a07 ST0174 Homo 
NRAS-related gene 



Hs.31403 ESTs 
Hs.161027 ESTs 
Hs.128119 ESTs 

gb:RC1-HN0003-220300-011-f10 HN0003 Homo
Hs.296251 programmed cell death 4 
Hs.177164 ESTs 



Hs.253800 ESTs 

gb:48f1 Human retina cDNA randomly prime
gb:RC5-BT0580-170300-021-F12 BT0580 Homo 

Hs.266814 ESTs 

Hs.19002 hypothetical protein MGC4675 

gb:EST97403 Thymus II Homo sapiens cDNA 
Hs.130806 ESTs 

Hs.211610 CUG triplet repeat, RNA-binding protein 

gb:RC3-HT0230-16020C-016-aOS HT0230 Homo 

Hs.121121 ESTs, Weakly similar to S00755 pleckstri 

Hs.27657 KIAA0802 protein 

gb:Homo sapiens tetracycline transporter-

Hs.202259 ESTs 

Hs.281680 peroxisomal trans 2-enoyl CoA reductase; 

Hs.308 arrestin 3, retinal (X-arrestin) 

Hs.130815 hypothetical protein FLJ21870 
Hs.112238 ESTs
Hs.174880 ESTs 

gb:MR3-ST0192-010200-210-c05 ST0192 Homo 



Hs.27513 
Hs.190520 
Hs.176376 

Hs.172792 
Hs.142634 
I- " 



gb:MR4-ST0124-270300-005-b11 ST0124 Ho 
ESTs, Weakly similar 
zinc finger protein 



Hs.257224 ESTs 



417181 L10123 



431508 
415236 
413101 



AA461032 
NM_012481 
R41400 
BE065215 
AW052174 Hs.196030 



surfactant protein A binding protein 

hypothetical protein DKFZp586F1122simil 
ribosomal protein L12 
gb:yf94b12.s1 Soares infant brain 1NIB H
gb:RC1-BT0314-310300-015-f01 BT0314 Homo
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408172 
426985 
404638 
447617 



Hs.148661 ESTs 

Hs.9475 glucose transporter protein 10

Hs.46039 phosphoglycerate mutase 2 (muscle) 

Hs.131905 ESTs, Moderately similar to Z195_HUMAN Z



Hs.176675 ESTs 

422182 AL043892 Hs.180582 Homo sapiens cDNA: FLJ21836 fis, clone H 
Hs.29055 ESTs 

gb:MR1-CT0353-150300-102-a12 CT0353 Homo
Hs.153351 ESTs 



444157 AI125785 



459448 
452090 
414899 
443764 
444898 
417428



429927 
453080 
412147 



AL037421 Hs.208746 ESTs, Moderately similar to pot. ORF I [ 

AA707472 Hs.190760 ESTs 

AA416773 Hs.275012 EST 

AA022684 Hs.124673 Homo sapiens cDNA FLJ11477 fis, clone HE

AW975433 Hs.36288 ESTs 

F23283 gb:HSPD22980 HM3 Homo sapiens cDNA clone 

AI201548 Hs.308338 ESTs 

N87579 gb:LL2030F Human fetal heart, Lambda ZAP 

AI004034 Hs.98638 ESTs 

AW856093 Hs.183617 ESTs 

AA411750 Hs.20943 ESTs 

AA829433 Hs.275343 ESTs 

NM_001115 Hs.2522 adenylate cyclase 8 (brain)

AI423056 Hs.23921 hypothetical protein DKFZp547A023 

AW895984 gb:QV4-NN0039-040500-197-e08 NN0039 Homo 

AI079519 Hs.134398 ESTs 

AL138012 Hs.183840 ESTs, Moderately similar to ALU7_HUMAN A 

T05095 Hs.19597 KIAA1694 protein

AW439497 Hs.290656 EST



453043 
456293 
447879 
426646 
454864 
404898 
435434 
443314 
408000 
441700 
455000 
404767 
445189 



AW970017 
AJ251509 
AW136440



Hs.158313 



Hs.272345 phosphodiesterase 11A

Hs.224277 ESTs 

Hs.311561 ESTs, Weakly similar to CYA7_HUMAN ADENY

Hs.170437 ESTs, Weakly similar to PRP4_HUMAN SALW 

Hs.122713 ESTs 

gb:QV4-LT0016-240200-110-dC4 LT0016 Homo 

Hs.187850 ESTs

Hs.54646 ESTs 

Hs.620 bullous pemphigoid antigen 1 (230/240KD) 

Hs.126908 hypothetical protein FLJ12994

Hs.324429 Homo sapiens cDNA FLJ14015 fis, clone HE



H87398 

AA433838 

AA385751 

AW772721 

AU076643 



gb:zw53e12.r1 Soares_total_fetus_Nb2HF8
Hs.196379 ESTs, Weakly similar to putative p150 (H

gb:hl95c01.x1 NCI_CGAP_Thy8 Homo sapiens 
Hs.313 secreted phosphoprotein 1 (osteopontin,



436033 H75391 Hs.255748 ESTs 

410733 D84284 Hs.66052 CD38 antigen (p45) 

455587 BE007829 gb:QV0-BN0147-280400-213-d03 BN0147 Homo

459084 H01699 Hs.27289 CGI-125 protein
401189

435451 AF195420 Hs.303006 ESTs, Weakly similar to gamma-heregulin

456407 AW968614 gb:EST380690 MAGE resequences, MAGJ Homo

425733 F13287 Hs.159388 Homo sapiens clone 23576 mRNA sequence

447863 AL047611 Hs.288885 Homo sapiens cDNA FLJ14246 fis, clone OV

436659 AI217900 Hs.144464 ESTs

435463 AA682507 gb:zj18f08.s1 Soares_fetal_liver_spleen_

455675 BE065984 gb:RC3-BT0319-120200-014-a06 BT0319 Homo

439481 AF086294 Hs.125844 ESTs



405784 
436461 
437636 
409629 
412999 



AW511956 
AA764781 
AW449589 
BE046255 

AA405097 
AI821324 
AA362858 



Hs.291844 ESTs 
Hs.279724 ESTs 

gb:hn38g10.x2 NCI_CGAP_RDF2 Homo sapiens 

Hs.97957 ESTs 
Hs.100445 ESTs

gb:EST72900 Ovary II Homo sapiens cDNA 5 



414539 BE379046 
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5 




421106 AA877124 Hs.172844 ESTs 

409076 N57559 Hs.82273 hypothec r .., 

419563 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HE

411688 AW953440 gb:EST365510 MAGE resequences, MAGB Homo 

416614 T83391 Hs.111849 ESTs 

454434 AA083558 Hs.261286 ESTs 
404526 
445393 
405302 



423841 AW753967 
427099 AB032953 
430484 D82880 



420457 AA482280 
438993 AA828995 
458421 AI279978 



411382
422373 
430749 



Hs.267115 ESTs 

gb:bb28c01.x1 NIH_MGC_5 Homo sapiens cDN 
gb:RC2-CT0304-080100-011-h12 CT0304 Homo

Hs.173560 odd Oz/ten-m homolog 2 (Drosophila, mous

Hs.241548 RAS p21 protein activator 2 



BE067246 
AK001843 
AJ242956 



Hs.282832 ESTs, Weakly similar to T24961 hypotheti 

gb:PM1-BT0348-151299-001-d04 BT0348 Homo
Hs.115700 Homo sapiens cDNA: FLJ23515 fis, clone L
Hs.25960 v-myc avian myelocytomatosis viral relat




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20 
25 



431333 
451073 
417663 
432363 
436975 



436020 
424989 
426447 



AI758905 
R07483 
AA534489 
AA740723 



AA009647 

H62953 

M778177 



Hs.120127 Homo sapiens cDNA: FLJ22769 fis, clone K 
Hs.206063 ESTs 
Hs.180461 ESTs

gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens 
Hs.212644 ESTs 

Hs.163846 putative N6-DNA-methyltransferase 
Hs.145990 ESTs, Weakly similar to 138022 hypotheti 
Hs.8850 a disintegrin and metalloproteinase doma

gb:yr47f06.:1 Soares fetal liver spleen
Hs.121724 ESTs 



AW967500 
R58624 

W16522 



AI990114 

AW021633 

M21305 

AK001103 

C0C719 



400462 
449804 
443305 



Hs.2186 eukaryotic translation elongation factor

Hs.237689 Homo sapiens cDNA FLJ13539 fis, clone PL

Hs.197653 ESTs

Hs.142390 ESTs

Hs.240091 ESTs



gb:Human alpha satellite and satellite 3 
Hs.274274 hypothetical protein FLJ10246 
Hs.120440 EST
Hs.193521 ESTs 

Hs.72325 Human DNA sequence from clone RP1-187J11 
Hs.221849 ESTs 



AW821257 
AW102723
AA527941 



Hs.133318 ESTs 



gb:PM3-ST0307-231299-001-b11 ST0307 Homo
guanylate cyclase 1, soluble, alpha 3 
gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 

hypothetical protein FLJ12606



X99225 
AI36513C 
AW104432



414146 
417401 
401200 
411560 
42630C 
437918 
447917 



AW851186 
AA447310 
AI761449 
AL048037 
BE466506 
AI476732 
AA975096 
AL023753 



418019 
417490 
423035 
416575 
414400 
418405 
450350 
451704 

407404 
423121 
430533 



;.284153 

.149761 
Hs.192908 
Hs.152104 

Hs.36172 
Hs.317596 
Hs.187615 

Hs.179909 
Hs.164059
Hs.121629
Hs.164588



U50525 

R09442 

W07140 

R14454 

AW836491 

R68911 

AA203335 



AI755209 

M62397 

AF040257 

AW864848 

AA480895 

BE170301 
AI375167 
AI915676 



Homo sapiens cDNA FLJ12927 fis, clone NT



ESTs, Moderately similar to neuronal Ihr 



Hs.93701 
Hs.185708
Hs.98201 



Hs.54721 ESTs 



Hs.176275 ESTs 



Homo sapiens mRNA; cDNA DKFZp434E232 (fr 
early B-cell factor 



Hs.38383 ESTs 



Hs.174880 ESTs 



Hs.205615 ESTs, Weakly similar to ALU1_HUMAN ALU S
mutated in colorectal cancers 
gb:Homo sapiens TNF receptor homolog mRN 
gb:PM2-SN0018-290300-G03-cCg SM0013 Homo 
ESTs, Weakly similar to T1728B hypotheti 



Hs.193574 ESTs
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413543 AA130228 Hs.324611 ESTs, Moderately similar to ALU8_HUMAN A 

454447 BE163567 gb:QV3-HT0460-230200-101-bC8 HT0460 Hcmo 

458067 AA393603 Hs.35752 protein kinase anchoring protein GKAP42 

437608 AA761605 H 232308 EST v I I ! LU 1 HUMAN ALUS 

415549 F11942 gb:HSC33F061 normalized infant brain cDN

420910 AL049437 Hs.100292 Homo sapiens mRNA; cDNA DKFZp586E1120 (f

435793 AB037734 Hs.4993 KIAA1313 protein 

453211 W84829 gb:zh53f04.r1 Soares_fetal_liver_spleen_ 



412281 
443204 
416616 



Human glucose transporter pseudogene 



431632 
403306 
422093 



AI937026 
AW137748
AK000992
NM_006825
AF151852 



Hs.125956 
Hs.333144 
Hs.74368 



442137 AA977235 
405970 

409434 AF278761 

416100 H18700 

431418 X68242 

431954 AK001974 

440388 AI693520 

421072 AI215069 

424578 AK001973 

435331 AI239495 

444063 AH 2261 4 

444453 AW379394 
404196 

421262 AA286746 

409555 AW410788 

417669 T99898 

416057 AI927382 

425206 NM_002153

447733 AI871000 

430664 AW969834 



Hs.131681 Homo sapiens testis transcript Y 7 (TTY7 
"~~ ESTs 
Hin-1 

hypothetical protein FLJ11112



Hs.252722 
Hs.272242 
Hs.223000 
Hs.89113 
Hs.150890 
Hs.120189 



Hs.145126 ESTs 



hypothetical protein 

gb:qa96b05.x1 Soares_fetal_heart_NbHH19W



s.2985 7 
Hs.155109
Hs.161330



431474 AL133990 



Hs.5912 
Hs.278054 
Hs.190642



Homo sapiens cDNA FLJ14265 fis, clone PL
ESTs 

gb:ye68g01.r1 Soares fetal liver spleen 
ESTs 

hydroxysteroid (17-beta) dehydrogenase 2

ESTs 

ESTs 

gb:RC6-CN0014-080300-012-B09 CN0014 Homo 



ESTs, Wsakl/s milarta I3BQ22 hy| 



436577 
412209 
417153 



440727 
428766 
439567 
456231 
454318 



W84774 

AW901456 

X57010 



Hs.131241 ESTs

Hs.157212
Hs.162434
Hs.17643 



Hs.81343 

Hs.102971 

Hs.79069 

Hs.121571 
Hs.50852 
Hs.282284 
Hs.134268



hypothetical protein FLJ14129 



H73183 
AW367764 
AA099113 
AI080491 
T63804 

AW063476 Hs.279080 ESTs 



425567 AA381579 
400432 AX015809 



404518 AI815601 



437B75 AA770151 
444870 AI200621 



Hs.112704
Hs.126424
Hs.148504



CD83 antigen (activated B lymphocytes, i 
gb:RC3-HT0272-110100-013-c06 HT0272 Homo
gb:qa58b06.s1 Soares_NhHMPu_S1 Homo sapi
453242
415131 
412040 






AK001751 

AJ237589

AF231338 

AK000883 

AA913381 

T98327 

D61119 



Hs.132221 
Hs.48295 
Hs.171835 

Hs.222024 
Hs.301545 
Hs.190513 
Hs.18343 



444443 AI149286 

434001 AW950905 

454145 AA046872 
405264 

411849 AW964970 

416816 T71168 

435325 AI038388 

440184 AB002297 



Hs.55099 
Hs.3697 
Hs.62798 

Hs.18861 
Hs.119567 
Hs.119309
Hs.7022 



hypothetical protein FLJ10889 

gb:Homo sapiens mRNA for T-box transcrip

transcription factor BMAL2 

Homo sapiens cDNA FLJ10021 fis, clone HE

ESTs 

ESTs 

gb:HUM158C11B Clontech human fetal brain
neuropeptide Y receptor Y6 (pseudogene) 
Homo sapiens cDNA FLJ14364 fis, clone HE
rab6 GTPase activating protein (GAP an
serine (or cysteine) pn 
ESTs 



429216 
429106 
405720 
400889 
416294 

425374 

427374 



AI369472 Hs.65407 ESTs 



412321 
422219 
454962 
441705 



AF129535 

AI904013 

R42778 

At 150033 

AW071349 

AI336596 

AI004397 

AF015592 

R34107 

AA411303 

U49250 

AL390159 

AA533574

AA579465 

T89677 

AW936913 



Hs.143686 
Hs.215937 
Hs.156294
Hs.334552 
Hs.28853 
Hs.321450 
Hs.30022 
Hs.210862 
Hs.269812 
Hs.152274
Hs.45207 



gb:zw85g07.s1 Soares_total_fetus_Nb2HF8_



F-box only protein 5 
gb:MR-BT041-220199-104BT041Ho 
Homo sapiens clone IMAGE:32106, mRNA seq



Homo sapiens cDNA FLJ14930 fis, clone PL
CDC7 (cell division cycle 7, S. cerevisi
| g ic> pecfic I 3-1-glycop in: 
ESTs, Weakly similar to NAH6_HUMAN SODIU 
T-box, brain, 1 

Homo sapiens mRNA; cDNA DKFZp761M0415 (f
ESTs 

hypothetical protein KIAA1335

ESTs 

gb:RC1-DT0029-030200-012-f08 DT0029 Homo 
regulator of mitotic spindle assembly 1
gb:IL3-CT021 3-2801 00-05&-A04 CT021 3 Homo 
ESTs 



426701 
401132 
407764 



neNT 

gb:CM0-BN0154-080400-325-h04 BN0154 Homo 
U40462 Hs.54452 zinc finger protein, subfamily 1A, 1 (Ik 
AA449563 Hs.151393 glutamate-cysteine ligase, catalytic sub 
BE502601 Hs.134289 ESTs, Weakly similar to KIAA1063 protein
AI792788 gb:ol91d05.y5 NCI_CGAP_Kid5 Homo sapiens 

BE003714 gb:QV3-BN0096-200400-161-a01 BN0096 Homo 



454466 
401449 
431196 



AA687322 
AA448542 
AW875972 
AA351364 



438315 
425523 
419340 
425636 
430553 
457030 
447375 
408334 
410085 



NM_002719 
R56795 
AB007948 
AA236590 



AI301740 
AI376660 
AW514652 



Hs.154929 ESTs

Hs.112794 Human DNA sequence from clone 1068E13 on 
gb:zx66ti 1 1 .r1 Soaresiotalfetus _Nb2HF8_ 

Hs.192843 leucine zipper protein FKSG14 

Hs.251677 G antigen 7B 

gb:CM3-PT0014-071299-051-b05 PT0014 Homo 
gb:EST59099 Infant brain Homo sapiens cD 

gb:RC1-BT0313-301299-012-c09 BT0313 Homo
gb:qs76g04.x1 NCI_CGAP_Pr28 He 

Hs.171734 protein phosphatase 2, regulatory sub

Hs.82419 ESTs 

Hs.158244 KIAA0479 protein 

Hs.87530 ESTs 

Hs.158370 hypothetical protein FLJ10381 

gb:CM4-ST0275-021299-053-h09 ST 

Hs.173381 dihydropyrimidinase-like 2

Hs.257822 ESTs 

Hs.321637 ESTs 

Hs.58589 glycogenin 2 

gb:yv27d04.s1 Soares fetal liver sple 

Hs.208515 ESTs 



AW501465 Hs.249230 ribonuclease L (2',5'-oligoisoadenylate



WO 03/042661 



PCT/US02/36810 



421581 
440633 
453264 



AW851980 
AA431127 
C0144B 



Hs.271955 

Hs.262346 
Hs.98685 ESTs 



455170
416208
423657 



AF022913 
W27643 



Hs.62187 
Hs.6877 
Hs.73965 



tumor necrosis factor, al
gb:QV0-CT0387-180300-167-h07 CT0387 Homo
ESTs, Weakly similar to MUC2_HUMAN MUCIN
glucan (1,4-alpha-), branching enzyme 1

phosphatidylinositol glycan, class K
hypothetical protein FLJ10483
splicing factor, arginine/serine-rich 2
mRNA 




448835 
415979 
434479 
426724 
418105 



H16427 
AI138213 
AA383623 
AW937488 



Hs.22229 ESTs 

Hs.224849 Homo sapiens cDNA FLJ12583 fis, clone NT

Hs.163725 ESTs 

gb:601283601F1 NIH_MGC_44 Homo sapiens c

Hs.99879 B-cell growth factor 1 (12kD) 

Hs.134746 ESTs, Weakly similar to A46010 X-linked 

Hs.45177 ESTs 

Hs.11081 UBX domain-containing 2 

Hs.271501 ESTs, Weakly similar to I54374 gene NF2 

Hs.162035 olfactory receptor, family 52, subfamily 

Hs.293616 ESTs 

Hs.178000 ESTs, Weakly similar to FV1_MOUSE FRIEND



400579 
402953 
404285 



AW807573
BE045604 
BE169746 



423454 
452762 
401344 



AA393167 
BE065030 
BE218708 



AW752389 
AB019494 
AL110456 
AW501435 



433132 
423600 
427667 
407257 
457041 
421482 



438394 
443741 
405090 
432267 
445409 



AK001279 
AB006834 
AA399018 



gb:MR1-ST0088-021299-004-g01 ST0088 Homo
ESTs 

likely ortholog of mouse Arkadia 



Hs.41294 

Hs.124179 

Hs.117270

Hs.146274 

Hs.77252 

Hs.79306 

Hs.233363 

Hs.331561 

Hs.90012 

Hs.87296 

Hs.225767 

Hs.469 

Hs.278582 



hypothetical protein FLJ14345



eukaryotic translation initiation factor 
guanylate cyclase activator 1C 
muscle-specific RING-finger protein 3 
hypothetical protein FLJ23441 
Homo sapiens cDNA FLJ20269 fis, clon
IDN3 protein 



gb:CM0-HT0180-041099-O6:>bO4 HT0180 Homo 
gb:RC5-OT0078-150300-021-E08 OT0078 Homo 
hypothetical protein IMPACT 



Hs.180171

Hs.250835 
Hs.104715 
Hs.110293
Hs.86149 
Hs.47274 
Hs.22573 
Hs.195870



phosphoinositol 3-phosphate-binding prot 
Homo sapiens mRNA; cDNA DKFZp554B176 (fr 
ESTs, Weakly similar to ALU1_HUMAN ALU S
hypothetical protein FLJ14991
gb:PM3-HT0344-071299-003-c08 HT0344 Homo
peptidylprolyl isomerase (cyclophilin)-l



Homo sapiens cDNA FLJ10010 fis, clone HE






449347 
428301 
406364 
418409 
427050 
431778 
448405 
429846 
438165
418888 
418432 
426424 
419505 
403743 



431292 
403513 
425745 
440122 
448446 






AA219332 

AA397789 

AL080276 

AW207634 

AB023021 

AA779344 

AU076801 

M14156 

BE081745 

F09134 
NM_005756
AA326187 
BE146879 

AK001695 
AF075084 

AA370141 



Hs.120869 ESTs, Weakly similar to R107_HUMAN H-REV

Hs.161803 ESTs

Hs.268562 regulator of G-protein signalling 17

Hs.170849 ESTs

Hs.225945 fucosyltransferase 9 (alpha (1,3) fucosy 

Hs.138136 ESTs, Weakly similar to 1510254AL1 repe 
I 3 herin 17. LI iheri /(= n' I. 

Hs.85112 insulin-like growth factor 1 (somatomedi

Hs.272188 Homo sapiens cDNA FLJ12090 fis, clone HE

Hs.143061 ESTs

Hs.12839 ESTs

Hs.184942 G protein-coupled receptor 64

Hs.17170 G protein-coupled receptor 4 

gb:QV4-HT0222-261099-014-c11 HT0222 Homo

Hs.146589 hypothetical protein FLJ10701 

gb:Homo sapiens full length insert cDNA 

Hs.125897 ESTs

Hs.2281 chromogranin B (secretogranin 1) 



U44060 
AI733011 
AI521251 



Hs.14427 Homo sapiens cDNA: FLJ21800 fis, clone H

Hs.127678 ESTs

Hs.171030 ESTs

Hs.19348 hypothetical protein FLJ13119

448130 AW271635 Hs.170717 ESTs 



Weakly similar to IPP1_HUMAN PROTE




458090 AI282149 
400706 

432779 AW97924' 
444600 R41398 
403786 




FORKH 

gb:EST391351 MAGE resequences, MAGP Homo 



gb:yg61g02.r1 Soares infant brain 1NIB H 
gb:ne09a06.s1 NCI_CGAP_Co3 Homo sapiens
gb:MR0-HT0067-201099-002-h11 HT0067 Homo



Homo sapiens mRNA; cDNA DKFZp434N079 (fr
Homo sapiens clone IMAGE:3210B, mRNA sequ
Homo sapiens cDNA FLJ10145 fis, clone HE 
gb:34b8 Human retina cDNA randomly prime 



gb:tz46c03.y1 NCI_CGAP_Brn52 Homo sapien

ESTs, Weakly similar to ALU1_HUMAN ALU S
ESTs 

gb:CM&HT0249-29irmO84-cO4 HT0249 Homo 
FU21924 

dated p-otein BAP29 
solute carrier family 22 (organic cation
420879 N31165

446447 AI300402 

439953 AA918129
400643 



438402 
451353 
419791 
415628 
423637 
454747 
452778 
457178 
401526 
408751 






N21043 

AI579909 

F13080 

AL137279

AW818535 

R71338 

AL039101 



N91553 
AA281279 
417320 AA195667 
442927 AI024347 



NM_014717

AI811202 

BE245652 



Hs.238837 ESTs, Weakly similar to S43603 RNA bindi

Hs.202250 ESTs 

Hs.124638 ESTs 

Hs.156189 ESTs 

Hs.42915 ARP2 (actin-related protein 2, yeast) ho

Hs.42932 ESTs 

Hs.105104 ESTs 

gb:HSC3ID041 normalized infant brain cDN 

Hs.130187 Homo sapiens mRNA; cDNA DKFZp434O1214 (f

gb:RC1-ST0278-140300-016-f05 ST0278 Homo 

Hs.5921 Homo sapiens cDNA: FLJ21592 fis, clone C

Hs.194625 dynein, cytoplasmic, light intermediate 

Hs.258343 ESTs 

Hs.23317 hypothetical protein FLJ14681 

Hs.86022 ESTs 

Hs.131519 ESTs 

Hs.118121 ESTs 

gb:RC2-BT0318-150200-011-b09 BT0318 Homo

Hs.108884 KIAA0390 gene product

Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L

Hs.118281 zinc finger protein 266



405170 
435878 
436194 
400394 
411244 
441817 
456118 
417801 
442717 



454778 
451203 
450180 
409432 
442264 



436720 
447410 
414652 
430454 
412417 
423130 
430660 
401098 
454036 



433513 
446577 
457756 



R08330 

AK001074 

AF040257 

AW833768 

AW969706 

AA380267 

AA417383 



Hs.20152 ESTs 

Hs.333435 Homo sapiens cDNA FLJ10212 fis, clone HE 

Hs.283818 Homo sapiens TNF receptor homolog mRNA, 

gb:QV4-TT0008-130100-077-e06 TT0008 Homo

Hs.293332 ESTs 

Hs.78277 DKFZP434F2021 protein 

Hs.82582 integrin, beta-like 1 (with EGF-like rep

Hs.180591 ESTs, Weakly similar to T23976 hypotheti



AF100779 

AA779332 

AW057584 

AI025829 

N45513 

AA016255 

AA744693 

BE174240 

D31897 



AW380579 

AW820199

AW070604 

AW449644 

D49372 

AI278777 

AA777809 



Hs.8632D ESTs 
Hs.46608 ESTs 

Hs.31856 ESTs, Weakly similar to KIAA1453 protein

gb:ny26c10.s1 NCI_CGAP_GCB1 Homo sapiens

Hs.79G24 heterogeneous nuclear ribonucleoprotein 

Hs.57714 double C2-like domains, alpha

gb:RC6-IJMO014-17030O-022-C05 UV0014 Homo 
gb:QV3-NN1024-260400-171-f10 NN1024 Homo 

Hs.1082E8 actin binding protein; macrophin (microf 

Hs.95958 solute carrier family 2 (facilitated glu 

Hs.209657 ESTs 

gb:QV2-ST0296-190100-029-a07 ST0296 Homo

Hs.46517 ESTs 



gb:HSC1FC021 normalized infant brain cDN 
Hs.297221 Homo sapiens cDNA FLJ10136 fis, clone HE
Hs.272073 ESTs 

Hs.10667 ESTs 
Hs.190440 ESTs

gb:zl71h04.r1 Soares_testis_NHT Homo sap
Hs.89739 cholinergic receptor, nicotinic, beta po
Hs.98265 KIAA1877 protein
Hs.57549 hypothetical protein dJ473B4
Hs.301917 YDD19 protein 

gb:EST38801 1 MAGE resequences, MAGN Hoi 
Hs.172698 EST 



AI620599 Hs.72068 ESTs 



AW469011 
AA102268 
AW897586 



Hs.105635 ESTs

Hs.158622 ESTs 

Hs.21213 ESTs 

Hs.100826 ESTs 



AA374756 Hs.93560 Homo sapiens mRNA 



AI904738 

AW937143 

AI566356 

AI800311 

AA126136 

N66727 

AF129505 



Hs.156291 ESTs 

Hs.38125 interferon-induced protein 75, 52kD

Hs.10957 ESTs 

Hs.86492 small muscle protein, X-linked 



422816
420756 
423532 






gb:RC6-BT0717-110400-011-F11 BT0717 Ho
Hs.175358 ESTs, Weakly similar to A47582 B-cell gr
Hs.271780 ESTs, Weakly similar to I38022 hypotheti
iinDKF2p586J1119 




427134 
435689 
429282 



AW858612 

AW391351 

F13271 

AA383092 

AA398409 

AA694284 

N27596 



gb:QV0-CT0387-180300-167-h07 CT0387 Homo
Hs.998 peroxisome proliferative activated recep 

gb:RC4-NN0027-060400-011-<l11 NN0027 Homo 
gb:beta-pol=DNA polymerase beta {exon a
Hs.95667 F-box protein 30
Hs.209927 Homo sapiens cDNA FLJ20325 fis, clone HE
Hs.145583 ESTs 

Hs.145328 ESTs, Weakly similar to ALU1_HUMAN ALU S 
Hs.176860 ESTs 

gb:CM3-CT0341-190400-152-h12 CT0341 Homo
Hs.50820 Homo sapiens unknown mRNA 
Hs.79981 Human clone 23560 mRNA sequence 
Hs.1608 replication protein A3 (14kD) 
Hs.173561 EST 

gb:zi35o02.s1 Soares_fetal_liver_spleen_
Hs.21342 ESTs 
Hs.186811 ESTs 



421274 
403776 
409526 
410201 
427839 
447884 



BE298751 
AA126129 
AA608823 



435448 
445706 
422171 



410181 
422897 
427038 



Hs.98244 ESTs 

gb:ym60d10.r1 Soares infant brain 1NIB H
Hs.195029 ESTs
Hs.97321 ESTs 

Hs.5957 Homo sapiens clone 24416 mRNA sequence 
Homo sapiens HSPC082 mRNA, partial cds
,.RC0-MT001 2-290300-031-1 10 MT001 Homo 



larto2N91_HUMANZINC 



ESTs, Highly similar to FXD3_HUMAN FORKH
gb:RC2-ST0301-120200-011-f12 ST0301 Homo
hypothetical protein FLJ20281
gb:QV3-BT0379-010300-105-g03 BT0379 Homo
gb:RC5-BT0580-170300-021-F12 BT0580 Homo 



AB021923 Hs.23367 EST-YD1 protein 

H17132 Hs.27085 ESTs 

AA305520 Hs.108812 hypothetical protein FLJ22004

U50529 Hs.112434 Novel human gene map

AW968226 Hs.60798 ESTs

T75157 Hs.337603 ESTs, Weakly similar to T08680 hypotheti



AI468210 

AA679784 

NM_014633 

AI673006 

BE179015 




446554 
452512 
440728 



Hs.261285 pleiotropic regulator 1 (PRL1, Arabidops

Hs.4290 ESTs

Hs.173288 KIAA0155 gene product

Hs.231948 ESTs 

gb:RC3-HT061 2-0B050C-O1 3-h 1 G HT0612 Homo 

K53935 Hs.287827 ESTs, Highly similar to MDR3_HUMAN MULTI

AA151730 Hs.301789 nudix (nucleoside diphosphate linked moi

AW363486 Hs.337635 ESTs

AW086077 Hs.153272 Homo sapiens cDNA: FLJ22715 fis, clone H






419481 AI879195 Hs.90606 2.7
454352 AW389668 gb:RC2-ST0168-071299-013-ID6 ST0168 Homo 2.7
422831 R02504 Hs.332943 ESTs
413646 BE155042 gb:PM0-HT0349-101299-002-E04 HT0349 Homo 2.7
426872 AA410446 Hs.112011 ESTs, Weakly similar to unknown [H.sapie
459160 AI904723 gb:CM-BT066-120299-092 BT066 Homo sapien 2.7
421338 AA287443 gb:zs52c10.r1 NCI_CGAP_GCB1 Homo sapiens 2.7
446002 AI346468 Hs.145789 ESTs
454716 AW850684 gb:IL3-CT0219-160200-063-D12 CT0219 Homo
406664 L34041 Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so
453128 AW026516 Hs.31791 acylphosphatase 2, muscle type 2.7
408691 AW250525 gb:2821526.5prime NIH_MGC_7 Homo sapiens
454754 AW819191 gb:CM1-ST0283-071299-051-d08 ST0283 Homo
439451 AF086270 Hs.278554 heterochromatin-like protein 1
445225 AI215555 Hs.202398 ESTs
427175 H06924 Hs.23782 hypothetical protein FLJ12847
411816 AW864609 gb:PM3-SN0017-240300-001-h03 SN0017 Homo
438135 AI253025 Hs.190426 ESTs
405981
406005
430762 AI343652 Hs.105667 ESTs
438361 AA805666 Hs.146217 Homo sapiens cDNA: FLJ23077 fis, clone L
412105 H07971 Hs.94319
434684 AA737282 Hs.190911 ESTs
44E660 AI702668 Hs.201955 ESTs
400844
415725 BE219771 Hs.237146 hypothetical protein FLJ12752
420159 AI572490 Hs.99785 Homo sapiens cDNA: FLJ21245 fis, clone C
408812 BE397160 Hs.254763 ESTs, Weakly similar to A42442 integrin
430052 AF102850 Hs.227933 Alg5, S. cerevisiae, homolog of
440310 AA878939 Hs.125406 ESTs
425659 AK000590 Hs.158836 hypothetical protein FLJ20583
417252 AA195014 Hs.85971 ESTs 2.5
427167 AI239607 Hs.99196 hypothetical protein MGC11324
431613 AA018515 Hs.264482 Homo sapiens mRNA; cDNA DKFZp761A0411 (f 2.6
414546 BE379492 gb:601236215F1 NIH_MGC_44 Homo sapiens c 2.6
407494 U10072 gb:Human forkhead family (AFX1) mRNA, pa 2.6
429643 AA455889 Hs.167279 FYVE-finger-containing Rab5 effector pro
442240 AI791883 Hs.292719 ESTs
452821 AW471181 Hs.160874 ESTs 2.6
410238 N94320 Hs.144225 ESTs
419236 AA330447 Hs.135159 Homo sapiens cDNA FLJ11481 fis, clone HE 2.6
440801 AA906366 Hs.190535 ESTs
Hs.7122 scrapie responsive protein 1 2.6
411597 AW852925 gb:PM0-CT0248-131099-001-f10 CT0248 Homo 2.6
417956 AA210704 ESTs 2.6
420621 AA278808 gb:zs79c09.r1 NCI_CGAP_GCB1 Homo sapiens 2.6
425176 Hs.155005 TEA domain family member 1 (SV40 2.6
425492 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) 2.6
425698 NM_016112 Hs.159241 polycystic kidney disease 2-like 1 2.6
426098 NM_014906 Hs.166351 KIAA1072 protein 2.6
435113 AA665469 Hs.117136 ESTs 2.6
438188 AA779975 Hs.128859 ESTs 2.6
445550 AI242754 Hs.137306 ESTs 2.6
458804 AL157625 gb:DKFZp761L2016_r1 761 (synonym: hamy2)
448299 AA497044 Hs.20887 hypothetical protein FLJ10392
436407 T88803 Hs.271507 ESTs, Weakly similar to TIM_HUMAN PROBAB 2.6
AA352026 Hs.94319 VPS10 domain receptor protein
418282 AA215535 Hs.98133 ESTs 2.6
413470 N20934 gb:yx54c11.s1 Soares melanocyte 2NbHM Ho
428527 Hs.34492 Cyt19 protein
441209 AA922939 Hs.135742 2.6
458679 AW975460 Hs.143563 ESTs 2.5
442279 AW867006 Hs.159970 ESTs 2.5
407244 M10014 Hs.75431 fibrinogen, gamma polypeptide 2.5
gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens 2.5
404845
411693 AW857271 gb:CM0-CT0307-210100-158-g09 CT0307 Homo 2.6
438298 H23542 Hs.18178! ESTs 2.6
AI939339 ESTs





455870 
457630 
424015 
417563 
413174 
438875 
440700 



431086 AI829692 



AA203701 
AA723564 
AA827640 



IS.313CC3 ESTs, Highly similar to AF157833 1 r 
Hs.112627 ESTs
Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112 (fr

gb:zx52a" 
Hs.191343 ESTs



Hs.296184 guanine i 
Hs.125878 synapsin
Hs.211561 ESTs 



hypothetical protein FLJ10879 
451531 AA018311

428085 AA421081 

429761 AI276780 

437958 BE139550 






gb:CM0-HT0395-280100-169-b09 HT0395 Ho

Hs.158196 transcriptional adaptor 3 (ADA3, yeast h
Hs.173519 ESTs
Hs.114762 ESTs
Hs.12388 ESTs 
Hs.135173 
Hs.121668 
Hs.303720 



435766 R11673 
452879 AW905328 
440400 AA994364 
440460 H92571 
424146 AA705092 
439950 AW937417 
410366 AI267589 
417485 AA203304 
AW962574 
U33632 
AA884448 



412566 
416498 
440397 

411819 
430357 
432869 



Hs.292987 
Hs.186498 
Hs.180842
Hs.125594
Hs.234478 
Hs.202368 
Hs.293561 
Hs.302689 
Hs.32826 



ESTs, Moderately similar to PC4259 ferri 
ESTs 

gb:RC1-BT0313-130400-016-c02 BT0313 Homo 



hypothetical protein 
CGI-130 protein 

gb:EST374647 MAGE resequences, MAGG Homo 
potassium channel, subfamily K, member 1 



454836 
458589 
459716 
436340 
428020 






AW947884 
AW976789 
AW974094 
AK000134 



AI467915 



Hs.121483 

Hs.97489 ESTs 

Hs.208161 ESTs 

Hs.36053 ESTs



AI018435 Hs.270970 ESTs 

AK001084 Hs.333498 Homo sapiens cDNA FLJ10222 fis, clone HE

BE540937 Hs.20104 hypothetical protein FLJ00052 

AA649162 Hs.236456 ESTs 

AW833711 gb:QV4-TT0008-251199-043-e11 TT0008 Homo

AV654623 Hs.288141 hypothetical protein MGC3156

R42246 Hs.21606 ESTs

L19058 Hs.181581 glutamate receptor, ionotropic, kainate

AA19C926 Hs.190785 ESTs, Moderately similar to S65657 alpha 



450402 
411156 
431673 



452768 
411355 
458890 
400074 
405241 



AW902125 
AW855718 
AA352111 
AW876627 
AA830185 
AI027990 
AF220188 

BE218027 

AW819939 

AW971302 

BE182587 

BE006919 

R35063 

BE075035 

AA541957 

U46265 

R35614 

AW979009 

AW673606 

AW975183 

BE079307 

W31790 

AI916973 

AW792804 

AA344685 

AA578229 

AW297497 



gb:QV0-NN1022-120500-220-h12 NN1022 Homo
gb:RC1-CT0279-070100-021-a06 CT0279 Homo
gb:EST60061 Activated T-cells XX Homo sa
gb:RC3-PT0023-120200-013-d11 PT0023 Homo



Hs.273629
Hs.293233
Hs.57485
Hs.134106
Hs.181536



gb:PM3-BT0584-260300-002-g05 BT0584 Homo 
gb:ns18d08.r1 NCI_CGAP_GCB1 Homo sapiens
gb:QV2-PT0010-160<00-133-g01 PT0010 Homo 



Hs.326108 
Hs.80758 
Hs.292663 

Hs.194293 
Hs.213603 
Hs.134002 



gb:yg66e08.r1 Soares infant brain 1NIB H
ESTs 

aspartyl-tRNA synthetase 
- T: » ,1/ similar to S72482 hypotheti 
gb:RC1-BT0623-120200-011-g09 BT0623 Homo
ESTs, Weakly similar to I54374 gene NF2



gb:PM4-SN0020-010400-009-b05 SN0020 Homo 



BE512968 

AW860975 Hs.13256 

AL031228 Hs.204370 DNA segment on chromosome 6 (unique, pse

AK000890 gb:Homo sapiens cDNA FLJ10028 fis, clone 
442653 BE269247



gb:601185486F1 NIH_MGC_8 Homo sapien
ESTs, Weakly similar to S18946 ultra hig



436359 
429322 
431699 
437107 
441953 
442777 
453361 
455275 
457824 



AF086310 

BE385725 

Z83B06 

D86984 

NM_001173 

AA745598

H11695 

AW341541 

AA035197 

AW977806 

R84938 

AW297880 

AF070526 

AI967991 

BE549143 

AA742457 



AI217097 
AA682573 
AA412446 
AF231919 



Hs.199243
Hs.267831 
HS.29184C 
Hs.322901 
Hs.271153 
Hs.107375 



Hs.98661 
Hs.13429 
Hs.93574 

Hs.291479 
Hs.253204 
Hs.191146 
Hs.1466 
Hs.173108 



455556 AW995423 



Hs.98138 
Hs.18759 
Hs.127655



gb:601276347F1 NIH_MGC_20 Homo sapiens c
gb:H.sapiens mRNA for axonemal dynein he 
KIAA0231 protein 
Rho GTPase activating protein 5 

, Weakly similar to 178885 serine/th 



a N2b4HR Homo 
ESTs 

Homo sapiens clone 24787 mRNA sequence 
homeo box D3 

gb:601076456F1 NIH_MGC_12 Homo sapiens c 

ESTs 

ESTs 

glycerol kinase 

Homo sapiens cDNA: FLJ21897 fis, clone H
gb:qd43h07.x1 Soares_fetal_heart_NbHH19W
ESTs, Weakly similar to organic anion tr 



183-g08BN0042llomo 



448015 
454190 
436154 



R17315 gb:yg12g11.r1 Soares infant brain 1NIB H

AI458065 Hs.23196 ESTs 

AW177821 gb:IL3-HT0059-180899-007-C05 HT0059 Homo

AA754950 Hs.119898 ESTs



437030 AA742577 Hs.303781 EST

420815 AA280684 Hs.270584 ESTs 

418421 R58620 Hs.85050 

423638 AI003521 Hs.130310 Homo sapiens mRNA for cyclin B3 isoform

415425 F08365 gb:HSC2SA121 no - 
404577 






449899 
451078 
453343 
428728 
409642 
426235 
452043 

419910 
411036 
444575 



454437 
419187 
444493 
405547 
454086 
417508 
416277 



Sarcolemmal-as 



AA905353 Hs.121622 ESTs 

NM_016625 Hs.191381 hypothetical protein 

AW450809 Hs.257347 ESTs 

AI631964 Hs.34447 ESTs 

H86231 gb:yt03f02.r1 Soares retina N2b5HR Homo



AA857218 
AI264847 
AI657014 
AW807605 
AW809648 
BE176776
AA293333 
AK001104 
AI248173 



BE163512 
W78765 
AI924940 

R61736 
AA296520 
AV653771 
AW500340 



403501 
446845 
401775 



Hs.190173 ESTs, Weakly similar to A46010 X-linked
Hs.297007 membrane-bound transcription factor prot
Hs.22545 Homo sapiens cDNA FLJ12935 fis, clone NT
gb:tt49a12.x1 NCI_CGAP_GC6 Homo sapiens
gb:MR4-ST0098-120100-001-b06 ST0098 Homo
gb:MR4-ST0124-261099-015-d01 ST0124 Homo
gb:RC3-HT0586-110300-011-g09 HT0586 Homo
gb:zt53c09.r1 Soares ovary tumor NbHOT H
Hs.168241 hypothetical protein FLJ10242
Hs.191460 hypothetical protein MGC12936
Hs.44693 ESTs 

Hs.282094 ESTs, Moderately similar to I38022 hypot 

Hs.6975 PRO1073 protein

Hs.180877 H3 histone, family 3B (H3.3B)

Hs.180145 HSPC030 protein

Hs.108082 ESTs, Weakly similar to T31636 hypotheti

Hs.124128 ESTs

Hs.89546 selectin E (endothelial adhesion molecul 

gb:AV653771 GLCHomo sapiens cDNA clone 

Hs.313876 ESTs, Weakly similar to I38022 hypotheti

gb:PM1-NN0047-040400-001-d09 NN0047 Homo

Hs.54576 sulfotransferase, estrogen-preferring

gb:zu86h01.s1 Soares_testis_NHT Homo sap

Hs.75113 general transcription factor IIIA

Hs.278436 KIAA1474 protein

Hs.6838 ras homolog gene family, member E 



AI343645 Hs.156108 ESTs 
410845 AW807182 gb:MR4-ST0062-180200-001-b04 ST0062 Homo

411836 AW901879 Hs.314453 ESTs 

412879 BE092219 gb:IL2-BT0734-240400-071-B04 BT0734 Homo

421083 AA283628 Hs.298016 ESTs, Weakly similar to I38022 hypotheti

423513 AF035960 Hs.129719 transglutaminase 5

428882 AA436915 Hs.131748 ESTs, Moderately similar to ALU7_HUMAN A

428945 AW192803 Hs.98974 ESTs, Weakly similar to S65824 reverse t

434627 AI221894 Hs.39311 ESTs

435256 AF193766 Hs.13872 cytokine-like protein C17

435079 AA664192 gb:ac05b03.s1 Stratagene lung (937210) H

Hs.283389 ESTs

HsJ76906 ESTs 



414093 
441262 
402076 
427962 
400587 
403053 
411203 
447849 



417976 
447179 
412977 



AA946582 Hs.8700 deleted in 



AW872430 
AI538147 
AB023191 

NM_007231
AA884283 
BE565892 



Hs.2186 eukaryotic translation elongation factor

Hs.273743 ESTs 

Hs.164277 ESTs 

Hs.44131 KIAA0974 protein 



Hs.192136 ESTs 

Hs.83077 interleukin 18 

AW015633 Hs.157299 ESTs 
AA125910 



Hs.293539 Homo sapiens mRNA for KIAA1758 protein, 






408419 AW250092 
417002 T79613 
439446 AI927629 



450947 
423073 
438142 
409239 



429063 
433868 
401645 
432149 



NM_003181 
AW363845 
AA612960 



425433 AA357471 



Hs.305953 zinc finger protein 83 (HPF1) 

Hs.14613 ESTs 

Hs.57873 ESTs 

Hs.12627 TJ6 protein 

Hs.278639 KIAA1684 protein; likely homolog of mous

Hs.257976 ESTs

gb:MR0-HT0165-191199-004-d09 HT0165 Homo

Hs.240833 ESTs, Weakly similar to I38022 hypotheti

Hs.3327 Homo sapiens cDNA: FLJ22219 fis, clone H

Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

Hs.294132 ESTs 

Hs.204662 ESTs 

Hs.123119 MAD (mothers against decapentaplegic, Dr 

Hs.269651 ESTs 

Hs.44307 ESTs, Moderately similar to I38022 hypot

Hs.143507 T brachyury (mouse) homolog

Hs.122142 ESTs, Weakly similar to A46010 X-linked

Hs.337300 ESTs 

Hs.157022 ESTs, Weakly similar to T34549 probable 



gb:EST66274 LNCAP cells I Homo sapiens c 
Hs.194193 ESTs, Moderately similar to ALU1_HUMAN A
Hs.25328 ESTs, Moderately similar to ALU7_HUMAN A



458734 AI554946 

442179 AA983842 

444313 AH 40494 

440448 AA885428 

441498 AI379248 

438205 AA780365 



421755 
449007 
449916 
418857 
422486 



421515 Y11339 



gb:PM0-OT0019-150300-002-d01 OT0019 Homo

Hs.158794 ESTs

Hs.333555 chromosome 2 open reading frame 2

Hs.197955 KIAA0704 protein

Hs.125646 ESTs

Hs.58742 ESTs 

Hs.122161 ESTs 



* transcript 



Hs.11713 E74-like factor 5 (ets dom

Hs.227280 U6 snRNA-associated Sn

Hs.47144 DKFZP586N0819 protein

Hs.207422 ESTs, Weakly similar to S71949 metallopr

Hs.193201 EST, Weakly similar to NIP2_HUMAN BCL2/A

Hs.299221 pyruvate dehydrogenase kinase, isoenzyme

Hs.89394 POU domain, class 1, transcription facto

Hs.117487 gene near HD on 4p16.3 with homology to

Hs.280922 ESTs

Hs.163944 ESTs

gb:zp53f03.r1 Stratagene NT2 neuronal pr

Hs.283732 ESTs, Moderately similar to ALU1_HUMAN A

Hs.105352 GalNAc alpha-2,6-sialyltransferase I, I






403003 
405347 
406091 
428402 
438762 
4E5780 
457024 
404249 
443921 
407055 
417154 
419720 
405230 



Hs.326876 Homo sapiens SOX6 mRNA, complete cds 

AW844412 Hs.65450 reticulon 4 

BE088828 gb:CM2-BT0693-230300-129-g09 BT0693 Ho

AA397546 Hs.119151 ESTs 



Hs.134848 ESTs

gb:H.sapiens DNA for endogenous retrovir
Hs.21388 ESTs

Hs.337778 hypothetical protein FLJ11068



X89211 
AI674701 
AA249131 



ESTs, Weakly similar to ALU8_HUMAN AL



gb:601186291F1 NIH_MGC_8
hypothetical protein FLJ11210
zinc finger homeobox 1B
gb:RC1-CT0302-120200-013-c04 CT0302 Homo
hypothetical protein PRO2964
gb:af69g08.r1 Soares_NhHMPu_S1 Homo sapi
cholinergic receptor, nicotinic, alpha p 




441622 
411004 
411093 
428548 
404059 
446861 
413640 
423554 
435338 
442710 
444206 
451250 
454784 
458455 
458521 



439546 AF088056 



Hs.224864 ESTs 



AA430058 Hs.98649 EST 



AI696519 

BE158118 

M90516 

AA678071 

AI015631 

AW301017 

AA491275 



AW452065 
AF157326 
AW842115 



Hs.1674
Hs.194300
Hs.23210
Hs.146492



ESTs, Weakly similar to I38022 hypotheti 



AV648310 



gb:Homo sapiens full length insert cDNA 
ESTs 

gb:QV3-DT0044-221299-045-c03 DT0044 Homo 



Hs.184786 
Hs.283108 
AL134435 Hs.22269
NM_014707 Hs.116753
AW316932 Hs.181982 



443718 
4455BB 
400582 
411252 
401145 
407440 



443751 AI285B39 



Hs.31028 
Hs.292036 ESTs, Weakly sir 
Hs.171054 ESTs

disintegrin and 



Hs.278679 
Hs.221373 
Hs.258744 KIAA1796 



gb:MR2-TT0014-151199-011-b07 TT0014 Homo

gb:Homo sapiens candidate taste receptor
gb:QV0-HT0368-040100-082-f06 HT0368 Homo 

Hs.235883 ESTs 

Hs.172982 ESTs

Hs.8895 ESTs 



Hs.275675 katanin p80 (WD40-containing) subunit B 
Hs.197645 ESTs 



AA853410 Hs.93557 






430444 AW296421 

456208 AW299698 

430388 AA356923 

416509 N57713 

419337 AW291112 

419699 AA248998 



AW811380 

AW449838 

R15891 

AA503115 

BE144152 

AA056060 

R78581 

AI125334 

AA632577 

AF007143 
AW044647 
AW780192 
AA282119 
AV647969 



421227 
430789 



421813 
411994 
443476 
452463 
404936 
442833 
440836 
405120 
400238 



AW936336 
AA521259 
AI733297 
AI208754 
AW971180 
AI571514 
BE146866 
AI640326 



447732 
404257 



Hs.138263 
Hs.121035 
Hs.334625 
Hs.240770 



Hs.97562 
Hs.281587 
Hs.183752



Homo sapiens clone 24528 mRNA sequence

Homo sapiens cDNA FLJ14890 fis, clone PL
nuclear cap binding protein subunit 2, 2
ESTs, Moderately similar to ZN91_HUMAN Z
ESTs

ESTs, Weakly similar to I38022 hypotheti
gb:IL3-CT0220-150200-068-H08 CT0220 Homo
gb:IL3-ST0143-290S99-019-D05 ST0143 Homo
ESTs

Human (clone CTG-A4) mRNA sequence



R67298 

AW068594 

R36452 



Hs.97408
Hs.310235
Hs.158992
Hs.28205
Hs.196284
Hs.267596
Hs.88975
Hs.109694
Hs.149442
Hs.252713
Hs.117895
Hs.154662
Hs.87517
Hs.37415



ESTs, Weakly sin 
ESTs 

Homo sapiens clone 23738 mRNA sequence



similar to ALU1_HUMAN ALU S

gb:te49b05.y1 NCI_CGAP_Brn52 Homo sapien
Homo sapiens cDNA: FLJ22845 fis, clone K
ESTs, Weakly similar to YCD1_HUMAN HYPOT



AW3708B2 Hs.222080 ESTs 



Hs.244106 ESTs 

gb:QV4-DT0021-281299-070-g11 DT0021 Homo
Hs.193796 ESTs 



ESTs 

gb:EST383268 MAGE resequences, MAGL Homo 
ESTs 

gb:QV4-HT0222-211099-014-f06 HT0222 Homo
ESTs

gb:Homo sapiens mRNA full length insert



Hs.133022 
Hs.62713 



AI758398 Hs.161318 ESTs

BE539853 Hs.22452 Homo sapiens mRNA for KIAA1737 protein,
420678
430692 
446453 



AI631655

AW592167 

AK000184 

AA424051 

T82384 

AK001160 

AI276454 

AI800470 

BE300815 

AK000618 

AI522109 



Hs.122872 ESTs, Weakly similar to JU0033 hypotheti
Hs.197919 ESTs 
Hs.293299 ESTs 



Hs.304742 
Hs.5999 



gb:yc14f05.r1 Stratagene lung (937210) H
hypothetical protein FLJ10298
gb:ql71a12.x1 Soares_NhHMPu_S1 Homo sa



Hs.201326 
Hs.123784 
Hs.171066 



AW138295 
AI537817 
H12402 



Hs.188646
Hs.129908
Hs.135024
Hs.270311
Hs.119122



TLS-associated serine-arginine protein 2
gb:H.sapiens endogenous retrovirus HERV-
ESTs, Weakly similar to ALU1_HUMAN ALU S
KIAA0591 protein 



419423 D26488 Hs.90315 KIAA0007 protein 

424719 H90452 gb:yv01o03.r1 Soares fetal liver spleen

431453 AW753917 gb:RC0-CT0299-291199-031-F02 CT0299 Homo

442078 AW268583 Hs.262629 ESTs

452975 M85521 Hs.244482 Homo sapiens, clone IMAGE:3611719, mRNA

426197 AA004410 Hs.100009 acyl-Coenzyme A oxidase 1, palmitoyl

427119 AW880562 Hs.114574 ESTs 
400486 

448482 AW294078 Hs.171092 ESTs
402621

408363 NM_003389 Hs.44396 coronin, actin-binding protein, 2A

424584 H10692 Hs.13310 ESTs

445061 AI253094 Hs.145227 ESTs 

431065 AA491286 Hs.128792 ESTs 

411908 L27943 Hs.72924 cytidine deaminase 

441826 AW503603 Hs.129915 phosphotriesterase related 

446901 AI347274 gb:tc05d02.x1 NCI_CGAP_Co16 Homo sapiens 

422677 AL046388 Hs.208206 hypothetical protein FLJ21162

455534 AW991925 gb:PM3-BN0011-130100-002-b07 BN0011 Homo

400163

418882 NM_004996 Hs.89433 ATP-binding cassette, sub-family C (CFTR

gb:QV3-DT0044-221299-045-c03 DT0044 Homo 

gb:yh67a07.s1 Soares placenta Nb2HP Homo 

Hs.47099 hypothetical protein FLJ21212



410556 
432584 
439482 
447877 



444540 
450024 
456481 
435138 
412887 
454204 
408253 
432887 
448063 
416171 
433098 
409781 

423646 
436572 
447044 

444585 
437334 
431917 
400843 



AA928829 
W70045 
AI435184 
R91254 

AW770778 

BE242814 

NM_014099

AA761594 

AW976002 

N22415 

AI693927 

AA005129 



gb:yp94e12.s1 Soares fetal liv- 



Hs.281238 ESTs 



Hs.122440 
Hs.258402 
Hs.189080 I 



AA258033 Hs.108110 



H23896 

AW190593 

AW812266 



Hs.159818 
Hs.125790
Hs.151143
Hs.15220
Hs.278359 



ESTs 

gb:zh90h0ar1 Soares_fetal_liver_spleen_

DKFZP547E2110 protein

gb:601152976F1 NIH_MGC_19 Homo sapiens c

gb:PM3-BN0142-200300-001-c04 BN0142 Homo

gb:QV0-ST0236-171299-075-b02 ST0235 Homo

Homo sapiens mRNA for FLJ00012 protein,

ESTs 



AA723274 
AF030107 
AI580296 
AW170015 
AL353947 
D16181 



Hs.279596 ESTs 
Hs.17165 i 
Hs.174782
Hs.6594 
Hs.283780 



zinc finger protein 106 
absent in melanoma 1 like 
gb:yj35d06.r1 Soares placenta Nb2HP Homo 



gb:PM1-BT0348-151299-001-a12 BT0348 Homo 
Hs.17924 ESTs, Moderately similar to ALU1_HUMAN A 
Hs.662 cerebellin 1 precursor 

gb:PM1-MT0010-200300-001-g08 MT0010 Homo
Hs.279518 amyloid beta (A4) precursor-like protein

gb:601076707F1 NIH_MGC_12 Homo sapiens c
Hs.173540 ATPase, Class V, type 10D



444547 AV650207 

417156 N49476 

416761 H85422 

408867 AA437199 

406748 AW339106 



427473 
433919 
431058 
428679 
415260 
440253 

418849 
432463 
400861 
407287 
414817 
416143 



425187 
408221 
411480 
459681 
414784 



Hs.656 cell division cycle 25C 
Hs.217493 annexin A2
Hs.97872 ESTs 



AW274439 Hs.252709 ESTs 

M74631 1 gb:oa56d12.r1 NCI_CGAP_GCB1 Homo sapiens 

AW968865 gb:EST380941 MAGE resequences, MAGJ Homo 

AA431765 gb:zw80c03.s1 Soares_testis_NHT Homo sap 

F02614 Hs.27319 ESTs 

AI651329 Hs.160289 ESTs 

AA634818 Hs.298138 ESTs 

AW474547 Hs.53565 Homo sapiens PIG-M mRNA for mannosyltran

AA548518 Hs.186733 ESTs

AI678812 gb:tu59d08.x1 NCI_CGAP_Gas4 Homo sapiens

AW902892 Hs.23782 hypothetical protein FLJ12847

AI955650 Hs.79033 glutaminyl-peptide cyclotransferase (glu

AA694220 Hs.15403 ESTs, Moderately similar to ALU7_HUMAN A

■ n r " Hs.250899 heat shock factor binding protein 1 



AI743261 
AW014486 
AA912183 
AW848022 

NM_000344
AW136066 
AW444538 
AL079309 



Hs.131860 ESTs 



457187 
446534 
403764 
442735 



441679 
432781 
448470 
419637 
443180 
422213 
423119 
450192 



gb:Homo sapiens mRNA full length insert 
Hs.257862 ESTs 

Hs.31476 Homo sapiens cDNA FLJ13872 fis, clone TH
Hs.149595 ESTs 
Hs.144360 EST 
Hs.175225 ESTs 



BE502267 Hs.65996 ESTs 
NM_014133 Hs.278940 PRO0618 protein



gb:31h10 Human retina cDNA randomly prim



AA322201 Hs.131976 ESTs 

AA263143 Hs.24596 RAD51-interacting prote

AA419529 Hs.76391 myxovirus (influenza) re! 



AW752782 

AK001563 

AW971382 

AA724411 

N34486 

AI094805 

R13580 

AW135313 

AL035414 



412700 
453125 
422757 
452864 
452441 



Hs.129750 hypothetical protein FLJ10546

Hs.146589 hypothetical protein FLJ10701

Hs.294016 ESTs, Moderately similar to B34087 hypot

Hs.156065 ESTs

Hs.170504 ESTs

Hs.135522 ESTs, Weakly similar to S38038 hypotheti

Hs.13436 Homo sapiens clone 24425 mRNA sequence

Hs.150098 ESTs

Hs.21068 hypothetical protein

Hs.272139 ESTs

Hs.201262 ESTs, Weakly similar to I38022 hypotheti

Hs.115497 hypothetical protein FLJ22655

Hs.65551 Homo sapiens, Similar to DNA sei

Hs.287629 hypothetical protein FLJ14260

Hs.113069 ESTs



428186 
438432 
409446 



AW504300 
AW444990 
AI561173 



ESTs 
Hs.302415 ESTs 

Hs.250822 serine/threonine kinase 15 
Hs.889 Charcot-Leyden crystal protein
Hs.120750 ESTs

Hs.167 microtubule-associated protein 2
Hs.260681 ESTs, Moderately similar to ALUF_HUMAN I
Hs.231926 ESTs 

gb:MR0-HT01 64-1 51 299-01 2-b08 HT01 54 Homo 
414428
445712 



404443 
433645 
414456 



AI000792 Hs.108209 ESTs

AW807281 gb:MR4-ST0062-240300-003-g01 ST0062 Homo

AF086161 Hs.114611 hypothetical protein FLJ11808

AA292084 Hs.191575 ESTs, Moderately similar to ALU2_HUMAN A

BE296906 Hs.182625 VAMP (vesicle-associated membrane protei

X58140 (NONE)

AI458246 Hs.167451 ESTs

AW341392 Hs.235795 ESTs

AA885569 Hs.40919 Homo sapiens cDNA FLJ14511 fis, clone NT

AI885508 Hs.94122 ESTs

AW292515 Hs.194317 ESTs, Weakly similar to T08680 hypotheti

AF053748 Hs.248114 glial cell derived neurotrophic factor 

H06336 Hs.13480 Homo sapiens clone 24875 mRNA sequence 

H30351 Hs.207982 ESTs 



425457 
403317 
413801 



AW948353 
AB006624 
AI827815 



gb:EST376285 MAGE resequences, MAGH Homo 
fms-related tyrosine kinase 3
ESTs, Highly similar to unnamed protein
ESTs, Moderately similar to ALU1_HUMAN A



456313 
450271 



417448 
402103 
450579 






AI433201 
S78234 
AA203135 

AW136774
NM_003816
AA291834 
H24415 



ls.27986C 
ls.172405 
fe.130186 



Hs.2442 

Hs.78950 

Hs.13273

Hs.248144 

Hs.291228 

Hs.281901 



jin keto acid dehydrogenase E 
otein ^ 



420844 
427434 
432188 



Hs.301732 

Hs.2928 

Hs.53131 



gb:nh22c09.s1 NCI_CGAP_Pr1 Homo sapiens



Homo sapiens cDNA FLJ13155 fis, clone NT
ESTs

ESTs, Weakly similar to I38022 hypotheti



AI453184 Hs.66357 ESTs 



H02217 
AI732271 
AL109712
AF092047 
AL079800 



429986 
432919 
434791 
445273 



412798 AW998657 



429430 AI381837 



gb:yj38d11.r1 Soares placenta Nb2HP Homo 
Hs.25557 ESTs 

Hs.296506 Homo sapiens mRNA full length insert cDN
Hs.227277 sine oculis homeobox (Drosophila) ho



Hs.119120 
Hs.92576 
Hs.244569 



E3 ubiquitin ligase SMURF1 
ESTs 

esophagus cancer-related gene-2 
ESTs, Weakly similar to I38022 hypotheti
DKFZP564C152 protein



430909 
453116 
415312 
423019 
414007 
459535 
448776 



AI276680 
W02640 
AI640185 
AI733895 
AV654907 
BE302464 



Hs.16247 ESTs, Weakly sim



Hs.103813 
Hs.30057 



gb:AV654907 GLC Homo sapiens cDNA clone
421279 AW564878 Hs.106645 ESTs 2.1

443167 AI202009 Hs.132087 ESTs 2.1 

459124 AW301478 Hs.184592 protein kinase, lysine deficient 1 2.1 

448078 AI460117 Hs.170464 ESTs, Highly similar to A53933 myosin I 2.1 

436858 BE545498 gb:601070344F1 NIH_MGC_12 Homo sapiens c 2.1

430521 NM_016383 Hs.242183 HOM-TES-85 tumor antigen 2.1

431089 BE041395 Hs.283676 ESTs, Weakly similar to unknown protein 2.1

407401 AF029325 gb:Homo sapiens laminin beta-4 chain pre 2.1

426336 AA375802 gb:EST88135 HSC172 cells II Homo sapiens 2.1

451124 AI186203 Hs.31432 cardiac ankyrin repeat protein 2.1

425541 AA359119 gb:EST68172 Fetal lung II Homo sapiens c 2.1

406504 2.1

410626 BE407727 gb:601299771F1 NIH_MGC_21 Homo sapiens c 2.1

415186 AA160945 Hs.14479 Homo sapiens cDNA FLJ14199 fis, clone NT 2.1

416175 H24230 Hs.271498 ESTs, Moderately similar to ALU1_HUMAN A 2.1

436820 AI684535 Hs.200811 ESTs 2.1

442095 AI733162 Hs.128470 ESTs 2.1

451878 AI821027 Hs.8429 ESTs 2.1

449178 AI633748 Hs.197597 ESTs 2.1

427307 AF117947 Hs.174795 PDZ domain-containing guanine nucleotide 2.1

415857 AA855115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 2.1

425154 NM_001851 Hs.154850 collagen, type IX, alpha 1 2.1



449746 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 

441543 AI733014 Hs.269715 ESTs 
403065

428811 AA436052 Hs.99487 ESTs

451803 BE541174 Hs.252058 ESTs, Moderately similar to PC4259 ferri

442906 AW296888 Hs.170939 ESTs

409171 R17125 jb:y Mi r nfart brain 1 NIB H

414175 AI308876 Hs.103849 hypothetical protein DKFZp761D112

450785 AA852713 Hs.25459 Homo sapiens, alpha-1 (VI) collagen

412039 AW887384 gb:RC0-OT0089-130300-021-d07 OT0089 Homo

453055 AW291436 Hs.31917 Homo sapiens, clone MGC9658, mRNA, comp

443268 AI800271 Hs.129445 hypothetical protein FLJ12496

455022 AW850845 gb:IL3-CT0220-111199-028-D11 CT0220 Homo

447972 AL137275 Hs.20137 hypothetical protein DKFZp434P0116

422942 AF054839 Hs.122540 tetraspan 2
400451

406658 T62745 Hs.184411 albumin

450159 AI702416 Hs.200771 ESTs, Moderately similar to A Chain A, T

448732 BE614063 Hs.334689 KIAA1838 protein

423453 AW450737 Hs.128791 CGI-09 protein

421447 AB005216 Hs.104481 Nck, Ash and phospholipase C binding pro

408774 AW270899 Hs.254569 ESTs, Weakly similar to B34087 hypotheti

419986 AI345455 Hs.78915 GA-binding protein transcription factor,
405732

417848 AA206581 Hs.39457 ESTs, Weakly similar to JC5314 CDC28/cdc

442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ

420344 BE463721 Hs.97101 putative G protein-coupled receptor

455778 BE088746 gb:CM2-BT0693-210300-123-d09 BT0693 Homo

426953 AI769281 Hs.97439 ESTs 

440454 AI733037 Hs.129990 ESTs 

433917 AI809325 Hs.122814 Human DNA sequence from clone RP5-1028D1 

424872 AA347923 gb:EST54302 Fetal heart II Homo sapiens

454658 AW812330 Hs.11123 DKFZP564G092 protein

441963 AI733307 Hs.128002 ESTs

439498 AA908731 Hs.58297 CLLL8 protein

456224 AW292905 Hs.128770 ESTs

413525 BE145899 gb:MR0-HT0208-221299-204-b10 HT0208 Homo

444702 AI220122 Hs.326560 hypothetical protein MGC2780

417787 R14948 Hs.23883 ESTs

400612

410878 AW809201 Hs.314248 ESTs, Weakly similar to ALU4_HUMAN ALU S

414494 AA768491 Hs.6783 hypothetical protein FLJ22724

427027 AI924294 Hs.173259 uncharacterized bone marrow protein BM03

451067 BE172186 gb:MR0-HT0559-110300-005-h11 HT0559 Homo

455032 AI830890 Hs.192422 ESTs

417945 R29072 gb:F1-101D 22 week old human fetal liver

438268 AA782163 Hs.293502 ESTs

424754 R09692 gb:yf23b12.r1 Soares fetal liver spleen

404599 



AW294659 Hs.34054 Homo sapiens cDNA: FLJ22488 fis, clone H

AI133161 Hs.286131 CGI-101 protein

X57131 Hs.248209 H2A histone family, member F, pseudogene

AA780385 Hs.187885 ESTs

AW297181 Hs.195922 ESTs

AW953935 Hs.30837 ESTs

AI828595 Hs.250691 ESTs

W00973 Hs.334728 ESTs 



402455 
459278
421987 



447534 
417687 
412717 
437083
409466 
433523 
446868 
445882 
438005 
406817 
410486 






AA401863 

AI373638 

AA279392 

W28418 

F13168 

BE272110 

M769123 

AW082597 

AA436207 

H29882 

AV660737 

AI948717 

BE151746 



AI768015 

AW573204 

Y13187 

AW296243 

AW001835 

AA639902 

AI703192 

AW664658 



Hs.22380 ESTs 
Hs.133900 ESTs 

Hs.88605 Homo sapiens cDNA FLJ13427 fis, clone PL
Hs.30715 potassium voltage-gated ch Isk-rel 
gb:HSC3JF101 no ' 



ESTs, Moderately similar to I54374 gene 



ESTs, Weakly similar to A46302 PTB-assoc 
gb:PM1-HT0305-061299-003-a06 HT0305 Homo 
gb:wo47a09.x1 NCI_CGAP_Gas4 Homo sapiens 
zinc finger protein 

gb:CM4-PT0031-180200-507-e05 PT0031 Homo 




Hs.92127 
Hs.137078 
Hs.248067 
Hs.118375
Hs.13323 
Hs.104215 

Hs.149332 



ESTs 



hypothetical protein FLJ22059

ESTs, Moderately similar to SPCN_HUMAN S

gb:wd92h04.x1 NCI_CGAP_Lu24 Homo sapiens 






441996 
412194 
444229 
441635 
421387 



443520 
409248 
444518 

409316 
402725 
413783 
423867 



429540 
459456 
409840 



445627 
440299 
401236 
429996 
455135 

433449 
454197 
445297 



Hs.38383 ESTs 
Hs.115412 hypothet



BE349537 

AW900282 

AV648613 

AI908538 

AF059566 

AW162907

AA424197 

W90022 

AB033035 

AI160278 

U28251 

AA314337 

AA331886 

AW675764 

AW195922

AV647953 

AI452457 

AK000464 

AF116656 

M85776 

AA486036 

AW502122 

AA913880 

T78013 

AW818475 

AI871778 

H24185 

N90822 

AW857989 

BE073250 

AW772282 



BE544163 Hs.87128 



iin FLJ13881 



Hs.133000 ESTs, Weakly similar to S26689 hypotheti

Hs.103983 solute carrier family 5 (sodium iodide s

Hs.75969 proline-rich protein with nuclear target

Hs.98947 ESTs, Weakly similar to 033496 trypsin |

Hs.186809 ESTs, Highly similar to LCT2_HUMAN LEUKO

Hs.51965 KIAA1209 protein

Hs.146884 ESTs

Hs.1498 histidine-rich glycoprotein

Hs.53237 ESTs, Highly similar to Z169_HUMAN ZINC



ls.301547 ribosomal pre 



lnS7 



Hs.174248 
Hs.188758 
Hs.83077 
Hs.145526 
Hs.29276 
Hs.273809 



gb:EST35757 Embryo, 8 week I Homo sapien



Hs.190124 ESTs 



hypothetical protein FLJ20457 
Homo sapiens PRO1167 mRNA, complete cds
gb:EST02297 Fetal brain, Stratagene (cat



Hs.176379 
Hs.167279
Hs.7363 
Hs.250112 
Hs.92918 



gb:UI-HF-BR0p-ajr-c-08-0-UI.r1 NIH_MGC_5

FYVE-finger-containing Rab5 effector pro 

ESTs 

ESTs 

hypothetical protein 
ESTs 

gb:PM2-CT0328-281299-003-e04 CT0328 Homo 
gb:MR0-BT0551-060300-102-e05 BT0551 Homo
gb:hn71b05.x1 NCI_CGAP_Kid11 Homo sapien
gb:MR0-HT0055-081199-002-b06 HT0065 Homo
hypothetical protein FLJ23309



458948 AI695359 Hs.280943 ESTs 



411479 AW848047 gb:IL3-CT0214-291299-052-A12 CT0214 Ho 

426536 AI949749 Hs.44441 ESTs 
442765 BE567353 Hs.99480 ESTs 



411863 BE075244 Hs.12420 ESTs 

415258 AW752247 Hs.293853 ESTs 

416093 R60685 Hs.268698 ESTs, Moderately similar to ALUC_HUMAN !

416184 R48481 Hs.269177 ESTs, Weakly similar to ALU6_HUMAN ALU S

437733 AI792574 Hs.122876 ESTs

453118 AW195849 Hs.252757 ESTs

457039 H29990 Hs.101937 sine oculis homeobox (Drosophila) homolo

444292 AI139794 Hs.146569 ESTs

431360 NM_000427 Hs.251680 loricrin

407644 D16815 Hs.37288 nuclear receptor subfamily 1, group D, m

412029 AW886238 gb:RC5-OT0078-280300-022-F01 OT0078 Homo

438522 AA809431 Hs.258886 ESTs

422634 NM_016010 Hs.118821 CGI-62 protein
418790
442950 
457040 
436464 
402674 
408733 
408767 
432801 
418205 
404604 

402341 
438090 
421303 

401986 
415318 
417756 
418301 
433755 
435413 
435648 
447555 
458175 



gb:yl95d11.s1 Soares_pineal_gland_N3HPG
Hs.46764 ESTs

Hs.173717 phosphatidic acid phosphatase type 2B
Hs.269783 ESTs, Weakly similar to ALU1_HUMAN ALU S



NM_016260 Hs.278963 zinc finger DNA binding protein Helios 



BE182082 Hs.246973 ESTs 



T06544 
Z43056 
AW976201 



T57448 

BE385257 

AI472167 

AL036532 

AI148157 

AK002032 

BE162391 

U17989 



gb:EST04433 Fetal brain, Stratagene (cat
gb:HSC12B021 normalized infant brain cDN

Hs.53913 hypothetical protein FLJ10252

Hs.120868 ESTs

Hs.46669 ESTs

Hs.27524 ESTs

Hs.160963 Homo sapiens, clone MGC:12318, mRNA, com
Hs.150434 ESTs

ESTs, Weakly similar to I38022 hypotheti



447678 
448150 
453445 
444420 
431956 
413758 
428231 
455873 
430970 
412277 
413025 
424083 
427654 
410483 
423942 
430340 
425686 
425075 
400285 



407407 AF050198 
411459 BE142707 
415105 D60166 



Homo sapiens dopamine receptor interacti 



Homo sapiens cDNA FLJ11170 fis, clone PL
gb:PM2-HT0451-090100-002-f04 HT0451 Homo
nuclear autoantigen 

gb:QV4-HT0316-091199-028-(12 HT0316 Homo 
ESTs 

guanine nucleotide binding protein (G pr 
ESTs 

Homo sapiens clone 24442 mRNA sequence 






400617 AF151064 

418647 AA226198 
401785 

433417 AA587773 

420777 AA280223 

439509 AF086332 

430203 L36140 

450382 AA397558 

455540 BE080231 

437620 AW976930 

407528 X64990 



Hs.144083
Hs.73799
Hs.291646
Hs.139137
Hs.137475

gb:QV3-HT0460-230200-101-b08 HT0460 Homo
Hs.135723 glycolipid transfer protein

1 |11.r1 SoaresJ k I if 



gb:Homo sapiens putative mitochondrial s
gb:MR0-HT0157-191199-002-g12 HT0157 Homo
gb:HUM089G11B Clontech human fetal brain

Hs.116369 ESTs

Hs.315562 ESTs

Hs.99601 hypothetical protein FLJ12553

gb:RC0-BT0362-021299-031-b06 BT0362 Homo
Hs.99423 ATP-dependent RNA helicase
Hs.36069 hypothetical protein

gb:nc26a07.s1 NCI_CGAP_Pr1 Homo sapiens

Hs.8859 Homo sapiens, Similar to RIKEN cDNA 5830
Hs.130865 ESTs
Hs.58314 ESTs

Hs.235069 RecQ protein-like (DNA helicase Q1-like)
Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL

gb:RC4-BT0629-120200-012-f11 BT0529 Homo 
Hs.128760 ESTs 

gb:H.sapiens mRNA HTPCRX16 for olfactory 



403623 
411518 
417531 
422600 
423347 



gb:IL3-CT0219-291099-021-E07 CT 



AWI I246 

NM_003157 Hs.1087

BE143586 Hs.87 retinoblastoma-like 1 (p107)

AI560412 Hs.234557 ESTs

AA158727 Hs.150555 protein predicted by clone 23733

AA578512 gb:nh22e11.s1 NCI_CGAP_Pr1 Homo sapiens

AF023130 gb:Homo sapiens Ras-GRF2 mRNA, partial c

AW665538 Hs.117689 ESTs

AI026130 Hs.131683 ESTs 

AL049266 Hs.18724 Homo sapiens mRNA; cDNA DKFZp564F093 (fr 

AW603985 gb:RC4-CN0048-140100-O11-a(M CN0048 Homo 
409816
409840 
410201 
410483 
410500 
410536 
410556 
410615 
410626 



101 4849 J 
104999J 
1074933J 
1104879J 
1108161J 

1149033J 
1155396 1 
1156071 1 
118365J 



1207322.1 
1208157J 
1212203 1 
1212621.-1 
1213795_1 
1214882J 



410934 
410999 
411004 
411018 



1227240J 
1228809J 
1 2 5 J 



1242702.1 
1245075 J 
1246724 1 
1247077J 

1247089J 
1248692.1 
1248899J 
1250843.1 
1252973J 
1253078J 

1254206J 



BE008347 BE008320 BE083307 BE083311 AW075968

R17315 Z43964 AA053547

AW250525 BE265117

R17126 R38456 H02771

AW364844 AW364847 AW937534 AW937593 AW937659

AW449185 AW449665 BE220971

BE154650 BE154785 AW468343 BE154816 BE154567

AW500954 AW501111 AW501394

AW502122 AW502125 AW501663 AW501720

AA126129 AA126033 AA082561

BE163567 BE073689 BE073747 BE073780 BE073739 BE073748 BE163495 AW750178 BE163491 BE073763 BE073671

R09442 AW846115 AW846108 AW751967 AW846083 AW846087 AW846090

N39533 AW753094 AW753093

R32158 AW754055 AW754054 AW754053 AW754045 AW857320

AW772721 AW873372 H89212

BE407727

AW902125 AW792853 AW837703 AW837710
AW794600 AW794730

AW807182 AW807328 AW807063 AW807183 AW807192 AW807033 AW807061 AW807286 AW807097 AW807270 AW807372 AW807280



AW810001 AW810092 AW810170 AW809884 AW8096S4 AW310353 AW810428 AW810209 AW810429 AWB10154 AW810168 AW809786 
AW810006 AW809672 AW809694 AW810552 AW81 0345 AW81 0432 AW309960 
AW81 1 1 1 4 AW81 1 095 AW81 1087 AW81 1 1 24 AW81 1 054 AW31 1 094 AW81 1 1 57 
AW81 3004 AW81 2962 AW81 2961 

AW813242BE146089AW813195AW813173AW813206 BE145953BE146212AW81319BAW854582AW813241 BE061582 

AW813428 AW81 3444 AW81 3367 AW81 3368 AW81 3429 AW81 3424 

BE062590AW813565BE147101 



AW821 257 AW821 267 AW821 283 

AW833768 AW833631 AW833435 AW833533 AW833554 AW833543 AW833557 

AW834480 AW834531 AW834637 AW83461B AWB34653 AW834487 

AW836491 AW836461 AW836673 

AW841462 BE156657 BE1 56668 BE092475 

BE067246 BE067241 BE067254 BE067249 AW841960 

AW842115AW842111AW842103 

AW845481 AW845474 AW850849 

BE142707BE142B52BE142723 BE142905AW847712AW647792AW847858AWB47852AW847654AW8476O8BE142961 

AW848047 AW848202AW848631 AW848142 AW848702 AW848121 AW848532 AWB48140 AW848571 AW848009 AW848067 AW848069 

AW848905AW848214 

AW848022 AW848704 AW8481 68 AW848959 AW848476 AW848699 AW84B700 AWB48761 

AW850246 AW850251 AW850302 

BE073250 BE073378 BE073379 AW850533 AW850529 



AW855576 AW855650 AW855578 AW855577 AW855542 AW85561 9 AW855624 AW855521 

AW855718AW855740AW855748 

AW953440 T08189AW857085 

AW857271 AW857308 AW857296 AW857258 

AW85861 2 AW861964 AW858600 AW861945 AW858445 AW861901 AW85854B AW8B1 921 AW858395 AW86196B AWB5B394 AW858401 



411745 
411762 
411772 
411816 
411819 



1256160.1 
1256906.1 
1257386 1 
1259632.1 
1259748 1 
1263110 1 
1266262.1 
1272597.1 
1273497J 
1279411 1 
1283B10.1 
1284501J 
1287B79J 
1288130.1 
1288608J 



BE170301 AW861539 AW904851 BE154336 BE 
AW864609AW864592AW864594 
AW947884 AW947918 AW947888 AW947883 AW947897 AW94791 0 AW947905 AW864751 AW947878 
AW872477BE088101 T05990 

AW876686 AW876717 AW87721 5 AW87B591 AW376722 AW877218 AW876694 AW876725 
AW886238AW886229 AW886244AW887B91 AW386289 AW886146 
AW8B7384 AW887372 AW887461 AW88731 5 

AW895984 AW895982 AW89S983 AW89591 8 AW395772 AW895843 AW895B47 AW895653 AW895652 AW895648 AW896054 M85583 
AW901456AW901450AW901441 

AW902583 AW902585 AW902695 AW902568 AW902580 AW902692 AW902684 AW902595 AW902573 AW902681 

BE160188 AW9357B5 BE160401 BE160319 BE1S0313 BE160395 

AW936336AW936339 

AW93B913 AW936959 AW936918AW936909 

AW9371 43 AW9371 50 AW9371 41 AW9371 51 AW9371 32 AW9371 60 AW9371 91 AW937174 AW9371 95 AW937173 AW937159 AW9371 39 
AW9371 71 AW9371 42 AW9371 45 AWS371 65 AW9371 63 AW9371 64 AW937137 AW9371 79 AW9371 56 AW9371 40 AW937135 AW937170 
AW937445 AW937457 AW937364 AW937374 AW937395 

AW948066 AW9481 06 AW948020 AW948047 AW94801 1 AW948033 AW948034 AW948041 AW948028 AW948048 AW948046 AW948056 
AW948025 AW948053 AW948043 AW948032 AW948037 AW948050 AW948060 AW948059 AW948051 AW948027 AW948038 AW948052 
AW948036 AW948031 AW948030 AW948035 AW94S039 AW948026 AW948044 AW948049 AW948042 AW948054 AW948055 AW94801 3 
AW948058 AW948045 AW948067 AW948029 AW948003 AW94801 6 AW9481 23 AW948008 AW948001 AW948022 AW948014 AW948023 
AW948004 AW948000 AW948018 
413470
413488 
413499 
413525 
413581 
413640 
413646 
413758 
413875 
413986 
414277 
414349 
414366 
414455 
414539 
414546 
414605 
414955 
415105 
415131 
415236 
415318 



416154 
416458 
416548 
416616 
416972 
417428 
417490 
417563 
417669 
417756 
417909 
417945 
418297 
418647 
418790 
418856 
418948 
419637 
419936 



1371600.1 
1373234J 
1373910J 
1374635.1 
1378335.1 
1381286J 
1381488J 
1386900.-1 
1396766.1 
140720.1 
1432122.1 
1437515.-1 
1438636.1 
1447655.1 



1522674.1 
1523680.1 
1531080 1 
1533859.1 
1536408.1 



1540853.1 
15S72G4 1 
1573965.1 
159583.1 



1697154.1 
1707657.1 
1711125 1 
1736343.1 
177521.1 
179028.1 
179649.1 
180808.1 



190755.1 
193538.1 
195113.1 

197072J 
198936 1 
201039.1 
201378.1 
204220.1 
207654.1 
223410 1 



229362.1 
230597.1 
231479.1 



BE007420 BE007419 BE007421 BE007422 

BE046255 BE046611 BE046716BE046732BE046273 

BE064032 BE064039 BE064027 BE064040 BE064045 BE063868 BE063890 

BE064655 BE1 53953 

BE064962 BE064979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804 BE064816 BE064850 

BE064806 BE064796 BE064818 BE064975 BE064819 BE06481D BE064668 BE065059 

BE065209 BE065354 BE065110 BE0651 1 1 

BE065215 BE155544BE155541 BE155540 BE155542 EE155543 

BE074910 BE074913BE074911 BE074903 BE074892 BE074935 

BE075035 BE074999 BE075006 BE075005 BE075032 EE075008 BE075037 

BE141022 BE141513BE141532 BE141488 BE141431 EE141526 BE141516 BE141471 BE141478 BE141476 BE141475 BE141021 BE141533 
BE141464 BE14149Q BE141472 BE141480 
N20934 BE141875 BE141877 
BE144017 BE185527 BE144023 
BE144884 H97942 

BE145899 BE145848 BE145849 BE145853 BE145927 BEK5S25 

BE15Q618 BE1S0616 BE150626 

BE1581 18 BE158034 BE154709 BE154825 

BE155042 BE155040 BE154987 BE155012 



C15506D80541 D80506 

D60166 D60893 D60101 

D61119D81508 D81734 

R41400 H06659 Z38637 F02023 T1651 6 

TQ6544 Z44549 F05465 R51 1 28 

F08365Z43395 R54298 

F11942T78047 R13266 

F13080 R34751 T75411 

F1316BR21289 T77628 

H42314 H43080 H45217 H15384 

Z46122H 23303 R56554 H23C55T80310 R13090 Z45524 

M1 80511 M649022AW974566 

II62953 N76608 N72413 

H68270 H68271 R07781 

BE019670 M191284AA191255 

N87579 T93706 

AA203335 R11393 

AA203701 R86895 

T99898 T92080 T91447 R07805 R07779 
Z43056 R13398T80474 
R35614 R24837 R52440 

R29072 R29717 R29699 R29709 R29751 R29609 R29060 R2971B R29057 R29591 

R91254 T97156R58711 

AA226198AA226513AA383773 

H95693 W02322 AA2281 40 H95692 

AA362858 AW863761 AA229428 

AI217097 AW886090 W38035 W38792 AA232835 AW936043 
W27493 AA248063 AA249685 AA429679 
AI7927B8 BE142230 AA252019 



AA278808 BE082076 BE081812 BE081581 
AA595522 M280981 BE464358 
AW296927 AI68451 4 AI2631 68 AA281 079 
AA329711 AA287436 M283148 
T06464 C17931 

AA287443 AA419385 BE084078 AI478347 
AA293333AI820910AA293403 
BE048255 M3 1 3083 AA29841 9 

M631498 AI017191 /V49121 1 AA761 e23 AA714555 AA73S099 AA808286 AI934069 AA570223 AA574389 AA582438 AI745346 AW964510 
AA319642 AW853758 H56414 
AW864848 AA32221 3 AA322209 AW961 624 
BE090503 BE090490 BE090483 BE090501 BEC 
H02364 M329065 AW958111 AW961436 
H02217AA330235AW955673 
AW753967 AA370795 AA331630 AW962550 



AA331906M332484 
H90452 AA345767 AW964302 H90399 
R09692 R09414AA346353 
AA347923 M347928 AW961769 



429B75 
430175 
430340 



430850 
431071 



432189 
432363 
432779 



433919 
434098 
434138 






250946 1 
251775.1 
252038J 
252945J 
256586 1 
260504J 
265003J 
268121J 
271252 1 
272088 1 
273843J 
294049J 
294655J 
299886J 
305828J 



32200J 
3221 00_1 
322947_1 



3361 89_1 
338082_1 
33905J 



355475_1 
356290J 
359936J 
36388J 
366532J 
377243_1 
380006J 
380572_1 
39041 5_1 



406582.1 
407286J 
409239J 

425676J 
428095_1 
431828_1 
433234_1 
43997J 
44635_-1 
447553_1 
467651J 
46803J 
47360J 

54B12J 



443764 
444063 
444910 



447884 
448477 
449311 



661959J 
697809J 
740749J 



921410J 
95527J 
97021J 



AA351364 H1S138 R12960 
AA352111 AW962247AA429695 
AI90401 3 AI90401 5 AA356332 
AA357471 AA360564 AW962961 
AW96421 2 AA357832 AA357B31 
M359119AW963014 D798B4 
AA364002AI522307 



M3B5085AW966617TB7798 
AA393713AA400726 
AA431765AA432015 
AA433838 AA450385 AA683244 
AA446612 AA456438 AA476655 
M85776 AA454535 AA456208 H90189 
AI091815AA460162AA460761 
AA46B724 AA468772 AA468643 AW969231 
AA476777T86049 

AW968485 AW968670 AA480922 BE350425 

AW392821 AW392809AW843258 AW843049 AW6031 56 BE165656 AW821728 
XB0240 AI064750 AI133594 
M492071 AA484229AW969850 
AI458623 M639708 AA485409 R22065 M485570 
BE144152 AA937952 AA487799 
AW968865AA491199C1714B 
AA491379 H86020 AW969148 
AW971180 AA551515 AA493610 AW089533 
AW971240 AA493843 AA493723 
AW753917 BE1 52926 AA505333 BE1 55673 
AW971119AA574265 M51326B 
AA516049AW004922 
AK000890 BE1 82413 AW8 
AA527941 AI810608 AI620190 AA635266 
AA534489 AW970240 AW970323 
AW979241 AA565006 AA847102 
AW974094 AA569074 AA602574 
AL079800AA570294L25459 
AA578512AA595535BE177533 
AF023130 AF181250 AA984703 M694303 AA351792 



AA746311 AA927492AA617995 

AA625499 AA625269 AA6251 84 

AA625B04 AW41 B787 AW074833 AIG75G42 AI393368 

AA641 957 AW749897 AW749866 AW749887 AW749890 

R34758AA642317 

AA664192H60250T71388 

BE314734AA666393 

AA682507AW851124 

BE143837 AW749652 AA683327 

T82384 R05307AA693714 

AA694284 H68267 H68264 

Z83806 AJ132091 AJ132090 

AW975902AA729344AI557342 

BE545498 AA830720 AI87301 5 AA732679 

T63B04T63768AA742849 

AA744693AW750059 

AL079309AA281819 

BE396279 

BE151746 BE336853 D63271 T94955 AA774994 

AA828995AA834879AI926361 

AF0750B4 H53157 H53054 

AF088056W76297W72448 

AL109688 R23665 R26578 



F232B3AI084941 F35774 
AI122614AW869134 
AI201849 BE069007AW946544 
AV653771 BE089370 
AI276454 AI633717 A12751 16 
AI347274AW844024 

H29505 R1 8575 Z43580 T48738 AI435454 BB 

BE612572 AL040190 F08S14 

AI65701 4 AW594035 AI657036 AI638390 

AA0051 29 AA679084 AA694399 

BE1721 86 AA059279 M02081 5 AA01 3437 

H86231 AA021632 H38271 

AW81 2256 AW81 2257 AI906423 A1906422 

W84829AA033900AW573557 

AW021633M036730AI866854 

AL137967 BE064160 BE064186 
454423
454447 
454456 
454482 
454560 



1108161J 
1129667.1 
1183079J 






BE141030 BE141474BE141467BE141753 BE141024 BE141761 AW177583 AW177579 AW177582AW177585 AW177587 AW807582 AW177581 
BE141477 BE141S20 BE141456 BE14U92 BE141028 BE141775 BE141489 BE141751 AW177599 BE141750 AW177597 BE141512 BE141460 
BE141749AW177598 
AW177821 AW1 77896 AW177867 

BE140966 BE140961 BE140967 BE141006 BE140985 BE140970 BE141669 BE141653 BE141664 BE141655 BE141661 BE141660 BE140969 

BE141673 BE141650 BE141674 BE141550 BE141688 AW178241 BE140994 BE141BBS BE140998 BE141008 BE140988 BE141011 BE140975 

BE141667 BE141675BE141657BE141681 BE141656 BE141572 BE141680 AW178237 BE141012 BE140990 BE141658 BE141648 BE141013 

BE141668 BE140973 BE1410Q4 BE140963 BE140984 BE141D09 AW1 78232 BE141007 BE141649 AW178293 BE1 40993 AW1 78233 BE141646 

BE141005 BE141691 BE141000 BE141652 BE140965 BE1415S2 BE140960 BE140962 BE141001 BE140978 AW1 78229 AW1 78239 BE 141671 

AW178230 BE141547 AW178235 BE141663 BE141549 BE140996 BE141003 AW178236 BE141002 BE141556 

AW81 649B AW80B791 AW80851 5 AW808379 AW808532 AW808605 AW808977 AW808816 AW178676 AW17B486 AW808514 AW1 78483 

AW178485 AW809007 AWB08524 

AW364844 AW364847 AW937534 AW937593 AW937659 



455778 
455780 
455849 
455851 



AW603985AW854350 

BE1 63567 BE073689 BE073747 BE073780 BE073739 BE073748 BE1 63495 AW750178 BE1 63491 BE073763 BE073671 
AW850984AW752836 M86124 
BE147919AW794884 BE147847 

AW807281 AW807092 AW807425 AW307330 AW807174 AW807171 AW807274 AW807278 AW807367 
AW807573 AW807566 AW807572 

AW807605 AW807690 AW807839 AW807752 AW807673 AW807667 AW807955 AW807760 AW807615 AW807898 AWB07849 AW807821 
AW807832 AW807842 AW807827 AW307822 AW807829 AW807830 AW807825 AW807603 AW807612 AW807908 AWB07595 AW80761 7 
AW807678 AW807687 AW80791 8 AW307921 AW807596 AW807602 AW807688 AW807609 AW807684 AW807770 AW807593 AW807754 
AW807679 AW807957 AW807683 AW807763 AW807902 AW807840 AW80781 9 AW807836 AW807769 AW807685 AW807847 AW807674 
AW807686 AW807670 AW807917 AW807677 AW807680 AW80790D AW8D7669 AWB07952 AW807907 AW807846 AW807756 AW807835 
AW807608 AW807753 AW807601 AW807956 

AW809648 AW809704 AW809643 AW809653 AW809709 AW809949 AW809939 AWB1001 0 AW809705 AW809950 AW809B22 AWB09667 

AW810093AW810076AW809673AW810349AW809895 

AW811380AW811385 

AW850684AW850150 

AW818535AW818588AW818651 

AW819191 AW819252AW819183AW819175AW819177AW819186AW819180BE158470AW819242AW819269AW819244AW819190 
AW819265AW819268AW819246 BE152602AW819249AW819251 AW819263 AW819194 

BE069199 AW936012 AW877466 AW81 9782 AW93579B AW835546 AW936042 BE0691 21 AWB35625 AW877536 AW935885 BE069202 
AW820019 AW935937 BE160180 AW935946 BE069101 BE0691 25 AW877527 BE16031BBE160398AW935794AW835701 AW935784 
AWB20199AW820434BE174743 
AW820626 AW820621 AW820608 



AW833711 AW833620AW833699 

AW835775 AW845768 AW845764 AW845773 AW845757 AW845758 AW845780 

AW8461 34 AW846467 AW846468 AW846386 AW84646 1 AW84621 1 AW846179 AW846205 AW84B320 AWB46379 AW846367 AW846561 
AW846556 

AW847645 AWB47791 AW854083 AW853945 

AW850B45 BE144010 AW855164 

BE156459 BE156469 BE 1 56468 AW857447 

AW857989 AW85801 6 AW861 677 AW86 1 689 AW861 691 AW858056 

AW860972 AW862598 AW862599 AW860988 AW860983 AW860898 AW860925 AW860922 AW860986 AWB60984 AW860989 
AW9478B4 AW94791 8 AW947888 AW947883 AW947B97 AW94791 0 AW947905 AW864751 AW947878 
AW879403AW867707 
AW867751 AW867770AWB67763 



AW876627AW876630AW876631 AW876625 

AW877139 AW8771 35 AW87701 8 AW991 835 AW8771 28 AW8771 08 AW87701 7 AW877107 
AW977806 AW887923 AW8B6321 



AW893961 AW89399B AW894034 AW894019 



AW983901 AW984485 AW947715 

AW948353 AW948351 AW948331 AW948303 AW948336 AW948305 AW948299 AW948346 AW948352 
AA102322 

BE144762 AW979091 
AW991925 AW991919 



AW995423 AW995373 
BE003714 BE003721 BE003720 BE003716

BE007829 BE007815 BE007822 BE007996 BE007835 BE007837 BE007824 BE007836 BE007827
BE011437 BE011402 BE011395 BE011423 BE011407 BE011421 BE011406



BE067238 BE067235 BE067240 BE067255 BE067263 BE067236 BE067260 BE067253 BE067248 BE067252 
BE067870 BE067866 BE165133 BE165334 BE165329 BE165332 
BE074910 BE074913 BE074911 BE074903 BE074892 BE074935
BE079307 BE079309 

BE088746 BE088802 BE088755 BE088876 BE088947 BE088881 BE088952
BE088828 

BE146866 BE146865 BE146867 
BE146879 BE146914 BE146918 

BE149024 BE149056 BE152826 BE149025 BE149057 BE152819 BE149030 BE149062 BE149023 BE149055
BE152239 BE152242 BE152230 
BE153208 BE153146 BE152981
1 843275_-2 W28506



BE179085 BE179084 BE179086 BE179264 
W26642 H88394 

41 AI734056 AI820965 AI732153 AA259201 AA225731 



812733_2 
920051J 
925883J 



AW968614 AA243209 AA281411
AA256753 AW628680
M457011 AI978850
R84938 AL047151 AA310309 AW063200 AI569528 AI307823 N49975
AL157625 N72696 BE622492
AW865523 AW865128 AW865467 AW865127 AW865«c
AI904723 AI904725 AI904729 AI904722 AI904758 AI904736
AW391177 W45021



Unique number corresponding to an Eos probeset 
Sequence source. The 7 digit numbers in

nc r I t I Nature (1999) 402:439-495. 

Indicates DNA strand from which exons were predicted. 



400510 
400514 
400579 
400582 
400587 



400734 
400BT6 
400843
400844 



401132 
401145 
401189 
401200 
401344 
401361 
401365 
401449 
401497 
401521 
401526 
401602 
401614 
401645 



401882 
401887 
401986 
401992 
402038 

402076 
402103 
402131 
402176 
402230 



402395 
402429 
402430 



8569993 Plus 

9188605 Plus 

9188605 Plus 

9757499 Minus 

9757506 Plus 

9958234 Minus 

3687273 Plus 



9926411 Minus 

9958052 Plus 

9796180 Minus 

8574316 Minus 

7381770 Plus 

7705251 Plus 

7770561 Plus 



7249203 
7704961 
7543687 
9966312 
8844110 
7656696 
9929693 
9796372 
9796372 
9796753 



197610-197785 
181108-181605 

139633-139910,140469-140979 
78844-79025,80850-80991,89754-89941,93750-93891
21323-21526 

88642-88726,89716-89866 

25435-25588,25668-25747 

96756-97558 

151513-151662 

92278-92472 

4786-4992 

12818-13016 

78299-78686 

122853-123971 

161221-162078 

5863-5970,7653-7784,8892-9023,9673-9807,10634-10789,15254-15403,23827-23958

24746-24872,25035-25204 

91888-92018,98131-98294,99474-99570

163855-164016 

169782-170036 

85632-86174
85679-85795 
17599-17776 



144928-145030 



9127-9234 

91570-93177 

101096-101253 

17350-17735 

34986-35133 

64056-64168 

110228-110340 

165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 

86466-87077 

93973-94120 

31137-31293 

31452-31649 



128316-128627 
14453-15414 

33114-33209,33495-33678

10-750 

29782-29932 

165693-165856

22583-23699 

131016-131998
131390-132157
130806-131036 
39290-39602 
107231-107383 



7331517 
7387384 
3135242 Minus 



52806-53106,53500-53818 
122603-122743 
79403-79560,79712-80021 
3468-3623 

71615-71773,73930-74144 
157618-157755
134283-134485 



26009-26178 

112733-113001,114599-114735 

952-1160 

108903-110438 

155310-155436,158402-158535 



137150-137362 

85509-85658 

5542-6176 

62501-62653 

3519-5426 

142647-142771,145531-14576 

1344-1442,1545-1697 

55008-55083,62860-63051 

88280-88463 

143467-143634 

136463-136646 

45910-46260,47563-47824 

118692-118853 

1414-1513,1624-1756 

93160-93409 

73028-73217 



4020145 
6456726 
8705107 
9212537 
9796751 
7882827 
7232206 
3810614 



3512-3691 

114391-114628 

87198-87441

27768-29179 

101841-102043 

112837-113339 

69039-70100 

104257-104348,104822-104970 

17991-18420 

40059-40210 

110443-110733 

72019-72509 

99433-99528,100035-100161 

23244-23759 

61087-61590 

7541-8132 

37948-38226 

47174-47326,52928-53146,53312-53602

177015-177328 

191519-191664 

147512-148011 



349-822 



140176-140340 
37047-37198 
51081-51701 
97493-97682
7249178 Minus



2078453 
6094635 
2979602 
6552772 
7408143 
7656675 
8050952 

9454643 
1054740 
5836195 
5815499 
4508116 
5306288 
4895155 
4314424 
4165003 
9797144 
7534017 
3288022 
7248203 
7417368 
7109593 
6758731
6758795



8247786 



6758997 
9119150 
9123919 



9256114 
9256135 
9256407 
9795553 
9795562 
7711360 
7711374 
4567182 



121688-121840 

130607-130802 

33267-33563 

977-1116 

48332-48454 

90716-90887,101420-101577 

134112-134671 

70284-70518 

65096-65247,77508-77637,81242-81364,84246-84395 

114550 114688 117 107,119490-11 v 1^3395,131140-131217 

124361-124520,124914-125050 

117070-117270 



61379-62562 

44307-44431,49619-49802 

13409-13861 

146981-147316 



48204-48371 
77798-78000 
15628-16127 



4771-5338 
39912-40220 
30921-31532 
60495-60610 
197370-197935 
251370-251797, 
30084-30770 
168241-168492 
64689-64798 
50715-50833 

126826-126979,129755-129942 
43858-44003,46993-47136 
43734616,8870-9046,11366-11509,11625-11880



Table 32A lists about 969 genes upregulated in lung fibrosis relative to normal body tissues. Types of pulmonary fibrosis samples included in this analysis were idiopathic pulmonary
fibrosis (IPF), hypersensitivity pneumonitis (HP), and non-specific interstitial pneumonitis (NSIP). These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative levels of
mRNA expression.



Table 33A lists about 800 genes upregulated in lung fibrosis relative to normal lung. Types of pulmonary fibrosis samples included in this analysis were idiopathic pulmonary fibrosis
(IPF), hypersensitivity pneumonitis (HP), and non-specific interstitial pneumonitis (NSIP). These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip
array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative levels of mRNA
expression.

Table 34A lists about 703 genes upregulated in idiopathic pulmonary fibrosis (IPF) relative to hypersensitivity pneumonitis (HP) or non-specific interstitial pneumonitis (NSIP). These
genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as
average intensity (AI), a normalized value reflecting the relative levels of mRNA expression.

Table 35A lists about 323 genes upregulated in hypersensitivity pneumonitis (HP) relative to idiopathic pulmonary fibrosis (IPF) or non-specific interstitial pneumonitis (NSIP). These
genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as
average intensity (AI), a normalized value reflecting the relative levels of mRNA expression.

Table 36A lists about 52 genes upregulated in non-specific interstitial pneumonitis (NSIP) relative to hypersensitivity pneumonitis (HP) or idiopathic pulmonary fibrosis (IPF). These
genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as
average intensity (AI), a normalized value reflecting the relative levels of mRNA expression.

Table 37A lists about 206 genes downregulated in lung fibrosis relative to normal lung. Types of pulmonary fibrosis samples included in this analysis were idiopathic pulmonary
fibrosis (IPF), hypersensitivity pneumonitis (HP), and non-specific interstitial pneumonitis (NSIP). These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative levels of
mRNA expression.

Table 38A lists about 207 genes upregulated in lung fibrosis relative to normal tissues. Types of pulmonary fibrosis samples included in this analysis were idiopathic pulmonary
fibrosis (IPF), hypersensitivity pneumonitis (HP), and non-specific interstitial pneumonitis (NSIP). These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative levels of
mRNA expression.



TABLE 32: About 969 genes upregulated in lung

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: 90th percentile of lung fibrosis AIs divided by 9C
R2: 90th percentile of lung fibrosis AIs divided by 9C
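
The R1 and R2 columns defined above are ratios of percentiles of average intensity (AI) values. The following is a minimal sketch of how such a ratio might be computed. The legend is cut off after "divided by", so the denominator percentile and the reference sample set are left as explicit arguments rather than guessed. The function names, the linear-interpolation percentile rule, and the sample values are assumptions made for illustration only; they are not taken from the source and do not represent the procedure used to build the table.

```python
from typing import Sequence


def percentile(values: Sequence[float], q: float) -> float:
    """Return the q-th percentile (0-100) of values by linear interpolation.

    The interpolation rule is an assumption; the source does not state one.
    """
    if not values:
        raise ValueError("values must be non-empty")
    if not 0.0 <= q <= 100.0:
        raise ValueError("q must be between 0 and 100")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def percentile_ratio(
    fibrosis_ais: Sequence[float],
    reference_ais: Sequence[float],
    denominator_q: float,
    numerator_q: float = 90.0,
) -> float:
    """Ratio of a percentile of fibrosis AIs to a percentile of reference AIs.

    numerator_q defaults to 90 because the legend names the 90th percentile
    of lung fibrosis AIs; denominator_q must be supplied by the caller
    because the legend is truncated at that point.
    """
    denominator = percentile(reference_ais, denominator_q)
    if denominator <= 0:
        raise ValueError("reference percentile must be positive")
    return percentile(fibrosis_ais, numerator_q) / denominator


if __name__ == "__main__":
    # Hypothetical AI values for one probeset, invented for illustration.
    fibrosis = [10.0, 20.0, 30.0, 40.0, 50.0]
    reference = [5.0, 10.0, 15.0, 20.0, 25.0]
    print(round(percentile_ratio(fibrosis, reference, denominator_q=90.0), 2))
```

A ratio above 1 under this sketch would indicate a higher AI in the fibrosis samples than in the chosen reference set for that probeset.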



UnigenelD Unigene Title 




Homo sapiens mRNA; cDNA DKFZp564B2062 (f 



cathepsin E

thyroid transcription factor 1
progastricsin (pepsinogen C)



Homo sapiens cDNA: FLJ23494 fis, clone L
EGF-like-domain, multiple 6
ESTs, Weakly similar to BCHUIA S-100 pro
N-acylsphingosine amidohydrolase (acid c



complement component 4-binding protein,



Homo sapiens cDNA: FLJ23075 fis, clone L
(axon guidance receptor, Dros
chemokine (C-C motif) receptor 5
'. ' light chain 2B(ONLC2B)
surfactant, pulmonary-associated protein
cytochrome P450, subfamily IVB, polypept
advanced glycosylation end product-speci
ESTs, Weakly similar to ALU4_HUMAN
neutrophil cytosolic factor 2 (55kD, chr
ESTs, Weakly similar to T12545 hypotheti
monokine induced by gamma interferon
matrix metalloproteinase 1 (interstitial



ALUS 



lily 34 (soc 



intron of hepatocyte nuclear factor-3 al 



low density lipoprotein (lectin 
small inducible cytokine subfamily A (Cy 
lymphocyte cytosolic protein 2 (SH2 doma 



ESTs, Weakly similar to 2109260A B cell 
hypothetical protein MGC4309 

NYD-SP22 



gb:an03c03.x1 Stratagene schizo brain S



sapiens cDNA: FLJ23123 fis, clone L



Hs.553 solute carrier family 6 (neurotransmitte 

Hs.41644 AKAP-associated sperm protein

Hs.163443 intron of periostin (OSF-2os)

Hs.346868 nucleolar protein p40; homolog of yeast
446932
408369 
409435 

452561 






425259 
427019 
420556 
428467 
432731 
439398 
409153 
412584 



413385 
452416 
428434 



BE218886

AA961459 

R38438 

AI810721 

NM_006144 

AI692181 



N75217 

AA321549 

AI204995 

AI734009 

M36125B 



Hs.125644 ESTs 

Hs.182575 SLC15A2 Solute carrier family 15 (H+/pep 

Hs.95424 ESTs 

Hs.90708 granzyme A (granzyme 1, cytotoxic T-lymp



Hs/9 



ESTs 



R31178 

AA284267 

W03754 

X54870 

AI248193 

M650274 



Hs.253495 

Hs.257846 ESTs 

Hs.2248 small inducible cytokine subfamily B (Cy 

gb:an03c03.x1 Stratagene schizo brain S1 

Hs.127699 KIAA1603 protein

Hs.237868 interleukin 7 receptor

Hs.155397 Homo sapiens mRNA; cDNA DKFZp5B4K143 (fr

Hs.173233 hypothetical protein FLJ10970

Hs.124292 Homo sapiens cDNA: FLJ23123 fis, clone L

Hs.184465 hypothetical protein FLJ11259

Hs.287820 fibronectin 1

Hs.221504 ESTs

Hs.50813 hypothetical protein FLJ20022

Hs.74085 DNA segment on chromosome 12 (unique) 24

Hs.119860 ESTs
Hs.41296 



HSPC065 protein 



AW13855B 
AA02611! 



422173 
425638 
406672 



Hs.104624
Hs.11614
Hs.106511
Hs.144442 
Hs.7369 

Hs.18912 ESTs

Hs.30 membrane-spanning 4-domains, subfamily A

Hs.334873 ESTs, Weakly similar to I54374 gene NF2

Hs.840 indoleamine-pyrrole 2,3 dioxygenase

Hs.114777 ESTs

Hs.65551 Homo sapiens, Similar to DNA segment, Ch

Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL

Hs.191990 ESTs

Hs.16026 hypothetical protein FLJ23191

Hs.250619 phorbolin-like protein MDS019 (CEM15)

Hs.158450 nasopharyngeal epithelium specific prote 

Hs.198253 major histocompatibility complex, class 
• . 1RX2 



432606 NM_002104 Hs.3066 granzyme K (ser

436260 BE172762 Hs.292710 ESTs, Weakly similar to ALU5_HUMAN ALU E

414821 M63835 Hs.77424 Fc fragment of IgG, high affinity Ia, re

428820 AA436187 Hs.172631 integrin, alpha M (complement component

458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992

419556 U29615 Hs.91093 chitinase 1 (chitotriosidase)

417412 X16896 Hs.82112 interleukin 1 receptor, type I

426174 AA547959 Hs.115838 ESTs

408727 AL137259 Hs.47115 hypothetical protein DKFZp434D0513

435990 AI015862 Hs.131793 ESTs

427621 BE621182 Hs.179882 hypothetical protein FLJ12443

425555 AA359291 Hs.130767 Homo sapiens cDNA: FLJ23553 fis, clone L

419086 NM_000216 Hs.89591 Kallmann syndrome 1 sequence



424054 AA334511 



L22524 
AI073913 
AW953937 



440273 AI805392 



ESTs 



ESTs 



is, subfamily A 



416847 
448019 
447183 



major histocompatibility co 
Hs.334485 hemicentin (fibulin 6)
Hs.2256 matrix metalloproteinase 7 (matrilysin,
Hs.100686 ESTs, Weakly similar to JE0350 Anterior
Hs.240845 ESTs

gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su
Hs.145807 hypothetical protein FLJ13593

gb:yh85d01.s1 Soares placenta Nb2HP Homo
Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L
Hs.152175 calcitonin receptor-like
Hs.88974 cytochrome b-245, beta polypeptide (chro
Hs.80261 enhancer of filamentation 1 (cas-like do
Hs.195641 ESTs, Moderately similar to I38022 hypot
Hs.173182 ESTs



407756 
410606 
450726 
430573 



AI560944 Hs.71428 ESTs 

AA744522 Hs.292645 ESTs, Weakly similar to ALU5_HUMAN ALUS 

AW963838 Hs.168830 Homo sapiens cDNA FLJ12136 fis, clone MA 

AW500775 Hs.24817 hypothetical protein FLJ20136

AA116021 Hs.38260 ubiquitin 

AW418779 Hs.114889 ESTs 



AA744550 Hs.136345 ESTs






454076 
452039 
454024 



L03678 

AW204712 

AI922988 



424238 

448869 
426083 
451099 
440452 
422109 
430378 
413802 
408761 



451820 
440028 
448030 
437866 
428513 
438607 
445034 



407930 
407192 
452960 



Hs.302043 chemokine (C-C motif) receptor-l ke 2( 
Hs.156110 immunoglobulin kappa constant 
Hs.61957 ESTs 



M7673B2 

AA320014 

AL117428

AI940675 

AA337401 

AL133011 

AI792798 

AW962712 

R52795 

AI925136 

S73265 

Z29572 

AW964490 

AA057264 



ls.1 26712 
ls.25954 
is.55150 



AA974603 

M18728 

AK001514 

AU077260 

M731533 



AF146761

AW964897 

U24577 

T61572 

AW058357 

AW473675 

N30714 

AA156781 

BE220806 

AW080237 

AW293376 

AI000341 

AI632683 

M045847 

AA609200 

AK001335 

AF079363 



410434 
436396 
434194 
435800 
420000 
449057 

436198 

444020 



Hs.236844 
Hs.153924 
Hs.270751 I 
Hs.74126 
Hs.76845 [ 
Hs.20450 
Hs.290825 
Hs.93304 
Hs.79385 



Homo sapiens mRNA; cDNA DKFZp434P201 (fr

ESTs, Weakly similar to ALU4_HUMAN ALU S

r I e milaftoAri91023 1 E2I

interleukin 13 receptor, alpha 2

ESTs, Weakly similar to CAYP_HUMAN CALCY

gastrin-releasing peptide

tumor necrosis factor receptor superfami 

ESTs, Weakly similar to S65657 alpha-1C- 

ESTs, Weakly similar to (defline not ava 

major histi mpalihiliiy irpl d 

Wnt inhibitory factor-1 

pleckstrin homology domain interacting p 

KIAA1685 protein 

gb:op34f05.s1 Soares_NFL_T_GBC_S1 Homo s 
Human nonspecific crossreacting antig
" I protein FLJ10652 



fatty acid binding protein 6, ileal (gas



Hs.252884 ESTs 



ESTs, Weakly similar to T17227 hypotheti
membrane-spanning 4-domains, subfamily A
metallothionein 1E (functional)
plexin C1



H12723 

AL039743 

AI962493 

AA641201 

AL039185 

NM_016929 

AI241019 



Hs.31137 
Hs.158213 
Hs.290791 



Homo sapiens cDNA FLJ12933 fis, clone NT
Homo sapiens cDNA FLJ12807 fis, clone NT
gb:at12e02.s1 Soares_testis_NHT Homo sap 
protein tyrosine phosphatase, receptor t 



Hs.222051 ESTs 
Hs.77558 
Hs.283021 
Hs.145544 



AB018549 
AI348408 
AA830168 
AI624707 
AF051152 

AF119B47 



Hs.38489 
Hs.69328 
Hs.124675

Hs.5921 * 
Hs.63668 
3.152213 



gb:nj21d02.s1 NCI_CGAP_AA1 Homo sapiens 
Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
ESTs, Weakly similar to I38022 hypotheti 
MD-2 protein 

ESTs, Weakly similar to T14742 hypotheti
ESTs

Homo sapiens cDNA: FLJ21592 fis, clone C



AB037784 
AA127382 
AK001125 
T4684B 



p53-inducible ribonucleotide reductase s 
KIAA1363 protein 

protease, serine, 12 (neurotrypsin, moto
Homo sapiens cDNA FLJ10263 fis, clone HE
immunoglobulin superfamily, member 4 



AA353776 
AW081608 
AB023161 



CD48 antigen (B-cell membrane protein) 



Hs.97403 
Hs.73931 
Hs.190325



ESTs, Moderately similar to ALU8_HUMAN A 

hypothetical protein FLJ21877

NM_000611*:Homo sapiens CD59 antigen p18
429597
419981 
400419 

413283 
444339 
429272 
435047 



421554 
422770 
434658 
440248 
442006 
430515 
446063 



AA477908 

NM_002110

M742697 

NM_003816 

AA897581 

AF084545 

AA744875 

R78669 

T96555 

W25140 

AA454985 

AI831760 

AW137676 
AL117544 
AI624436 
M876138 
AW975183 
AA746503 
AI720140 



Hs.282267 ESTs, Moderately similar to I38022 hypot 
Hs.89555 hemopoietic cell kinase 
Hs.62492 NM_052863:Homo sapiens secretoglobin, fa 
Hs.2442 a disinta 
Hs.128773 ESTs 
Target 
Hs.189413 ESTs

Hs.23756 hypothetical protein similar to swine ac 
Hs.31562 ESTs 
Hs.110667 ESTs 

Hs.54973 cadherin-like protein VR20
Hs.155111 hypothetical protein FLJ14428

NM_004079:Homo sapiens cathepsin S (CTS 
Hs.97775 ESTs 
Hs.120021 DKFZP434I092 protein 
Hs.310286 ESTs 

ESTs 

ESTs, Weakly similar to S72432 hypotheti 






Hs.227817 BCL2-related protein A1 



AW136134 
AA813745 
AF213457 



415788 AW628686

428065 AI634046 

434340 AI193043 

451558 NM_001089 



Hs.220277 
Hs.123446
Hs.44234
Hs.193161
Hs.78851
Hs.157313
Hs.128685
Hs.26630
Hs.130177
Hs.171096
Hs.14372



Hs.13500 ESTs 



406387 






Target Exon 


2.77 




421168 


AF182277


Hs.330780 


cytochrome P450, subfamily IIB (phenobar


2.76 


3.24 




NM_004469


Hs.11392


c-fos induced growth factor (vascular en


2.76


2.11 


427484 


N32859 


Hs.3/288 


nuclear receptor subfamily 1, group D, m 


2.76 


1.94 


417728 


AW138437 


Hs.24790 




2.76 


1.78 


435154 






ESTs 


2.76 


2.10 




AI971131 


Hs.23889 


ESTs, Weakly similar to ALU7_HUMAN ALU S


2.76


2.21 


423387 


AJ012074 




vasoactive intestinal peptide receptor 1 


2.76 


2.36 




AW971364 


Hs.324775 


ESTs 


2.75 


2.02 


434164 


AW207019 


Hs.148135 


serine/threonine kinase 33 


2.74


2.48 


423706 




Hs.131924 


G protein-coupled receptor 65 


2.74




442703


AL044949


Hs.116298


ESTs


2.74


1.89


450247 


AF123303 


Hs.24713 


hypothetical protein 


2.74


1.73 


430998 


AF128847 


Hs.204038 


indolethylamine N-methyltransferase 


2.74


2.85 


426535 




Hs.288582 


ESTs, Weakly similar to ubiquitous TPR m 


2.74 


1.88 


409196 


NM_001874


Hs.334873


carboxypeptidase M


2.73


1.86 


422389 


AF240635 




protocadherin 12 


2.72


2.26 


444324 


AI301330 


Hs.143838


ESTs 


2.72 


1.74 


417831 


H16423 


Hs.82685 


CD47 antigen (Rh-related antigen, integr


2.72


2.40


428769


AW207175 


Hs.106771 


ESTs 


2.72 


2.19 


404277 






NM_019111*:Homo sapiens major histocompa


2.72 


3.12 


409653 


AW451693 


Hs.220826 


ESTs 


2.72 


2.62 


437211 


AA382207 


Hs.5509 


ecotropic viral integration site 2B


2.72 


2.25 


430299 


W28673 


Hs.106747 


serine carboxypeptidase 1 precursor prot 


2.72 


2.09 


444381 


BE387335 


Hs.283713 


hypothetical protein BC014245 


2.71 


2.26 


443547 


AW271273 




hypothetical protein FLJ12666 


2.71 


1.74 




M73720 


Hs.646 


carboxypeptidase A3 (mast cell) 


2.70 


2.39 


402674 






Target Exon 


2.70 


1.95 


438068 


AI927209 


Hs.306210 


Homo sapiens cDNA: FLJ23133 fis, clone L 


2.70 


2.23 


415075 


L27479 


Hs.77889 


Friedreich ataxia region gene X123 


2.69 


2.11 


444314 


AI140497 




gb:ow76b09.s1 Soares_fetal_liver_spleen_


2.69 


2.28 


428656 


AB037798 


Hs.188790


KIAA1377 protein 


2.68 


1.S1 


418883 


BE387036 


Hs.1211


acid phosphatase 5, tartrate resistant 


2.63 


3.95 


443951 


F13272 




ferritin, light polypeptide 


2.68 


2.66 


427581 


NM_0147BB 


Hs.179703 


KIAA0129 gene product


2.68 


1.74 




AW973785 




gb:EST385886 MAGE resequences, MAGM Homo 


2.68 


1.78 


446423 


AW139655 


Hs.150120 


ESTs 


2.68


2.2S 


407939 


W05608 


Hs.312679 


ESTs, Weakly similar to A49019 dynein he 


2.67 


2.07




AW971178 


Hs.268571 


apolipoprotein C-I


2.67 


3.00 


458124 


AW005548 


Hs.124590 


ESTs 


2.67 


3.78 


432882 


NM_013257


Hs.279696 


serum/glucocorticoid regulated kinase-li 


2.66 




445745 






KIAA0455 gene product 


2.66 




425188 


AK002052 


Hs.166071


hypothetical protein FLJ11190


2.65 


1.92 


432231 


AA339977 


Hs.274127 


CLST 11240 protein 


2.64 


4.23 


442200 


AW590572 


Hs.235768 


ESTs 


2.64


2.46 


426828 


NM_000020


Hs.172670


activin A receptor type II-like 1


2.64


2.00 


448569 


BE382657 


Hs.21486 


signal transducer and activator of trans 


2.63


3.23
Hs.248549 ESTs, Moderately similar to S65657 alpha

Hs.69285 neuropilin 1

Hs.41271 Homo sapiens mRNA full length insert cDN

Hs.127160 ESTs

Hs.67726 macrophage receptor with collagenous str

Hs.132458 ESTs

Hs.33846 ESTs
Hs.155981

Hs.161304 ESTs

Hs.202111 ESTs
Hs.74122

Hs.280380 aminopeptidase

" ' ESTs 

membrane-spanning 4-domains, subfamily A 




triggering receptor expressed on myeloid 

similar to cytokine recepto
Human DNA sequence from clone RP11-196N1
hypothetical protein DKFZp564D0462
ESTs, Weakly similar to I38022 hypotheti
Homo sapiens mRNA for FLJ00111 protein,
KIAA0624 protein 



440594 
450295 
431316 
438564 



BE301708 Hs.233955 hypothetical protein FLJ20401 
N94126 Hs.12969 hypothetical protein 
T79759 Hs.250651 ESTs, Weakly similar to I38022 hypotheti
AA610649 Hs.333239 ESTs

H00656 Hs.29792 ESTs, Weakly similar to I38022 hypotheti
lin DKFZp434l0428 



AA502663 
AA381553 
BE073597 



ESTs 



major histocompatibility complex, class 



448782 
404240 
450843 
434137 
438315 



AA632649 
AA917808 
AL050295 



Hs.205383 ESTs 
Hs.124895 — 
Hs.82419 
Hs.1334 
Hs.103002
Hs.132781 
Hs.194397 



428970 
433138 
415757 



AF272899 
U58913 
N90470 
AA166963



AW516704 

N93466 

NM_001295 



Hs.208726 
Hs.121764 
Hs.301921 
Hs.9286 
Hs.198253 



ESTs, Weakly similar to testicular tekti 
chemokine (C-C motif) receptor 1



Hs.245497 
Hs.201591 
Hs.132821
Hs.135118 



small inducible cytokine subfamily A (Cy
CD38 antigen (p45)

gb:zo86d01.s1 Stratagene ovarian cancer
451406
437479 
445784 
418300 
413753 
418945 
416140 
418262 
420943 
442762 
429747 
420460 
420137 
439018 
427250 
452194 
411027 
407242 



408048 
415189 
437442 
41C577 






AI253155
AI433074 
U17760 
BE246762 



ESTs, Weakly similar to S26689 hypotheti
gb:ne26b03.s1 NCI_CGAP_Co3 Homo sapiens
hypothetical protein FLJ14281
SRY (sex determining region Y)-box 4
Homo sapiens cDNA: FLJ22743 fis, clone H
ESTs, Weakly similar to T17248 hypotheti
ESTs
ESTs



ls.75517 
ls.89499 
ls.301198 



major histocompatibility complex, class 
deleted in liver cancer 1 
caspase 1, apoptosis re ate 4 - t r 1 
Homo sapiens clone HB-2 mRNA sequence 
CD3D antigen, delta polypeptide (TiT3 co 
membrane-spanning 4-domains, subfamily A 



Hs.48376 
Hs.95327 
Hs.26638 
Hs.25418 

Hs.67845 



Hs.233459 ESTs 

Hs.154210 EDG-1 (en< 

Hs.170935 ESTs 

Hs.42322 A kinase (PRKA) anchor protein 2 

Hs.78146 platelet/endothelial i 

Hs.222779 ESTs, Moderately similar to similar to 
Hs.64639 



Hs.111518 hypothetical protein

Hs.176663 Fc fragment of IgG, low affinity IIIb, r

Hs.82921 solute carrier family 35 (CMP-sialic aci

Hs.100194 arachidonate 5-lipoxygenase-activating p

Hs.271605 ESTs, Highly similar to AF175283 1 zinc 



423961 
410798 
467250 



AB011130 
D13666 
BE178622

BE397753 
M74782 
NM_004484



ESTs 



S.127436 
s.136348 perio; 



im channel, voltage-dependent, alph 



Hs.16291 gb:PM3-HT060b-270200-001-a02 HT0605 Homo 

Hs.125779 ESTs 

Hs.14623 interferon, gamma-inducible protein 30

Hs.172689 interleukin 3 receptor, alpha (low affin

Hs.119651 glypican 3



414942 
419092 
424878 
406687 
411605 
416965 



435106 AA100847 



H57111 
M31126 
AW006831 



407137 
410361 
447150 
421114 



409190 
414221 
435272 
414991 
424623 
424665 



427535 
447197 
446142 
410503 
435523 
437629 
429688 



BE391804 
AA330310 
AW975051 
AL110326



N77630 

U62027 

AU076536 

AW450979 

AA906415 

C17898 

AW368576 
W79117 
NM_001872
R29543 



Hs.192986 
ls.89603 
Hs.221132 



Hs.62661 
Hs.24181 
Hs.293156 
Hs.304679 
Hs.8236 
Hs.82002 
Hs.293815 
Hs.8700 
Hs.13895 
113.155935 



ESTs 



Homo sapiens cDNA FLJ11554 fis, clone HE



gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su
ESTs 

gb:C17898 Human placenta cDNA (TFujiwara 



AW842182 
AI357412
AA453800 



Hs.145968 
Hs.188662 
Hs.11090 
Hs.121692 
Hs.211610 
Hs.241392 
Hs.157601 
Hs.192793
Hs.29567 



CUG triplet repeat, RNA-binding protein 
I inducible cytokine A5 (RANTES) 
icted gene: Eos cloned; secreted w/V
422404 AL133571 Hs.336189 Homo sapiens mRNA; cDNA DKFZp434F1135 (f
423526 AB011086 Hs.129739 
436485 X59135 Hs.156110 

s1(CD54) 



443441 
416458 
408705 
419150 



432620 
424321 
406646 
424450 
426410 
416975 
421077 
424563 
405102 
452436 
416206 



AW291196 Hs.92195 ESTs

AA332941 Hs.85226 lipase A, lysosomal acid, cholesterol es

AA312135 Hs.46967 HSPC034 protein

T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous

AA488953 gb:aa55e05.r1 NCI_CGAP_GCB1 Homo sapiens

AA935633 Hs.194628 ESTs

M777749 Hs.5978 LIM domain only 7

W74048 Hs.1765 lymphocyte-specific protein tyrosine kin

M33600 Hs.308026 major histocompatibility complex, cl



BE298446 
NM_004131
AK000061 
AA446932 



AW972301 
AW970937 
AI694722 
M376836 
R12375 



Hs.305890 BCL2-like 1 

Hs.1051 granzyme B (granzyme 2, q 

Hs.101590 hypothetical protein 

Hs.151428 ret finger protein 2

C15001220*:gi|4469553|gb|AAD21311.1| (AF

Hs.31447 ESTs, Moderately similar to A46010 X-lin

Hs.111092 hypothetical protein FLJ22332

Hs.83393 cystatin E/M

Hs.310286 ESTs

Hs.293843 ESTs

Hs.279744 ESTs 



ESTs 



g of yeast ubiquitin-protein ligas 



418918 
424006 
437581 



419216 
459680 
449677 
443071 
443021 
437838 



408410 
436293 
410730 



AF054815 
N59284 
R36207 



Hs.83968 

Hs.344096 

Hs.80887 

Hs.8881 

Hs.29191 

Hs.89476 

Hs.137548 



407202 N58172 



AW963624 
AA447438 
AI601188 



AA002071 
AL080021 
AA368546 
AI307229 



Hs.283349 
Hs.84561 
Hs.71962 
Hs.164021
Hs.42321 



444929 
408873 
437634 
400277 
443601 
432212 



408788 
421057 
413936
431924 
449444 



v-yes-1 Yamaguchi sa 
ESTs, Weakly similar to S41044 eh, 
epithelial membrane protein 2 
CD2 antigen (p50), sheep red blood
CD84 antigen (leukocyte antigen)
ESTs

hypothetical protein MGC10744
collagen, type XIII, alpha 1
adenosine A2a receptor



gb:zh85d01.s1 Soares_fetal_liver_spleen_
complement component 1,
Ig superfamily protein



Hs.109370
Hs.134587 
Hs.31707 



AI494291 
AK002016 
AW975114 



Hs.109653
Hs.9414
Hs.155029
Hs.61311 



DnaJ (Hsp40) homolog, subfamily B, membe
ESTs

Homo sapiens, clone MGC:16327, mRNA, com
ESTs

KIAA1488 protein
ESTs

ESTs, Weakly similar to S10590 cysteine



430478 NM_014349 



Hs.295944 
Hs.268107 
Hs.161354 



apolipoprotein L, 3
ESTs

tissue factor pathway inhibitor 2

multimerin

ESTs 

calmodulin 2 (phosphorylase kinase, delt 
Eos Control 



Hs.8966 hypothetical protein FLJ21776 
Target Exon 

Hs.157367 EST,'" klysi '78335 serine/th 

Hs.213956 Homo sapiens BAC clone RP11-102O5 from V
Homo sapiens cDNA: FLJ22063 fis, clone H



AK000850 Hs.272203
421464
424831 
434542 
418323 
418836 
431315 
400750 



411653 
417916 
421757 
441942 
431843 
432006 
414154 
449987 
418452 
406645 
414516 
417032 
414875 
414522 
410511 



426722 
421229 
410491 






AA291553 
H61453 
AA769310 
NM_002118 



AW768399 
AA452244 
AF070578 
NM_006416



Hs.190086 ESTs 
ESTs 

hypothetical protein FLJ13164 
Hs.1162 major histocompatibility complex, class 
Hs.161712 ESTs 

Hs.163986 Homo sapiens cDNA: FLJ22765 fis, clone
Target Exon 



gb:C147 
Hs.257789 ESTs



Hs.16727 
Hs.71168 
Hs.82921 
Hs.296259 



M192469 

H42679 

AW518944 

AA743475 

NM_014339



Hs.323060 
Hs.184719 
Hs.85201 
Hs.814 

Hs.271838 
Hs.77522 
Hs.76325 



IK cytokine, down-regulator of HLA II
ESTs, Weakly similar to I38022 hypotheti 
Homo sapiens mRNA; cDNA DKFZp434L1226 (f 



ESTs, Weakly similar to T43458 hypotheti 
ESTs 

major histocompatibility complex, class 



-in 17 let 



453108 
432967 
441390 



414629 
418460 
415443 



431193 
432485 
450293 



420838 
436252 
430702 
456804 
439195 



447131 
430887 
414700 
417874 



AA465131 
AW499834 
AW293413 
AI623752 



AI311457 
AA572949 
AI692560 



N90866 

N36754 

BE243915 

M446167 

AA723157 

AI539519 



Hs.296770 
Hs.276770 
Hs.171118 



ESTs 

ESTs, Weakly similar to I38022 hypotheti 

hypothetical protein MGC12435

Homo sapiens clone 25218 mRNA sequence

interleukin 10 receptor, alpha



adenylate cyclase 9 

cadherin 5, type 2, VE-cadherin (vascula 
carboxylesterase 1 (monocyte/macrophage
CD8 antigen, alpha polypeptide (p32) 
ESTs 

myeloid cell nuclear differentiation ant 
ENSP00000241075:TRRAP PROTEIN. 
gb:yv68g06.s1 Soares fetal liver spleen 
ESTs, Weakly similar to I55211 salivary
ESTs, Moderately similar to ALU8_HUMAN A
hypothetical protein FLJ10936
KIAA1719 protein

CDW52 antigen (CAMPATH-1 antigen)
hypothetical protein FLJ00026
leukotriene A4 hydrolase



Homo sapiens cDNA FLJ11E62 fis, clone HE 



Hs.82030 
Hs.209276
Hs.17466 



ESTs 

Williams-Beuren syndrorr 
Homo sapiens clone HB-2 mf 
hypothetical protein FLJ10147
ESTs, Moderately similar to unnamed prot

gb:no97h03.s1 NCI_CGAP_Pr2 Homo sai
tryptophanyl-tRNA synthetase



ESTs 



BE616160 
AU076484 
425252 AW391162 
428758 AA433988 
425810 AI923627
409745
426780 

438670 
414359 
446566 
424528 
444745 

426721 



437751 
449618 
430634 



M602539 

AI539443 

Z37976 

AI064821 

AA077391 

BE242284 

R12499 



AF117754
AA423849 
AA383588 
AW338067 
AI458213 
AA767373 
AI076459 
AI860651 



440099 AL080058 



Hs.345494 
Hs.137447 
Hs.83337 
Hs.318535 

Hs.172199 
Hs.20468 
Hs.123428 
Hs.75929 
Hs.17914 
Hs.238954 
Hs.11861 
Hs.79530 
Hs.131816 
Hs.7869 
Hs.77542 

Hs.15978 
Hs.26685 
i.247112 



Homo sapiens cDNA FLJ12169 fis, clone MA

latent transforming growth factor beta b

ESTs, Highly similar to 1318357A EWS gen

gb:7B14E12 Chromosome 7 Fetal Brain cDNA

adenylate cyclase 7

ESTs

ESTs

cadherin 11, type 2, OB-cadherin (osteob



444051 
414464 
427792 
415801 
430027 
425771 
412443 
408771 



444083 
449679 
418183 
414776 
414803 



Hs.10247 
Hs.13957 
Hs.180841 
Hs.278443 
Hs.227743 
Hs.159494
Hs.130767
Hs.47584
Hs.194718
Hs.75596
Hs.170121



FLJ10902
DKFZP564G202 protein
major histocompatibility complex, class
activated leucocyte cell ad



Fc fragment of IgG, low affinity IIb, re
KIAA0980 protein

Bruton agammaglobulinemia tyrosine kinas
Homo sapiens cDNA: FLJ23553 fis, clone L



N48373 

AI870175 

M63928 

R24219 

AB023197 

BE561776 

AW951103 

AW732573 

N92054 

NM_000878 

Y00062 

AA995787 

AA169685

AI123195 



NM_001772 Hs.83731 CD33 antigen (gp67) 

AA155598 Hs.212839 hypothetical protein FLJ14195; KIAA1714

X03100 Hs.914 Human mRNA for SB class II histocompatibi

AI493591 Hs.78146 platelet/endothelial cell adhesion molec

AW97616S gb:EST388274 MAGE resequences, MAGN Homo 

AA402307 Hs.322844 hypothetical protein DKFZp564A176



protein tyrosine phosphatase, receptor t 



N46243 
AF020202 



Hs.155001
Hs.226573 
Hs.243081 



AF026944 Hs.293797 ESTs 



ESTs, Highly similar to T42626 secreted 

ESTs, Weakly similar to S55024 nebulin, 

UNC13 (C. elegans)-like

inhibitor of kappa light polypeptide gen 

ESTs 

ESTs 



449161 
453107 
418371 
432946 



AF104266 
BE076537 
NM_006433



Hs.169895
Hs.105806 
Hs.47647 
Hs.279746 
Hs.84298 
Hs.279854 



latrophilin
ubiquitin-conjugating enzyme E2L6 
granulysin 

vanilloid receptor-like protein 1 ^ 
CD74 antigen (invariant polypeptide of m 
mannosidase, alpha, class 2B, member 1 
Homo sapiens mRNA; cDNA DKFZp434A119 (fr
troponin I, cardiac 




ESTs, Moderately similar to ALU2_HUMAN A
lartoCGHUIEcoJaoen 
421461
430850 
424377 
443884 
423057 
448262 



Hs.49500 KIAA0746 protein 

Hs.77348 hydroxyprostaglandin dehydrogenase 15-(N 

C5001 S32-:gi| 1 064E3DB|gb|A«,G21 430 1 1AC00 
ESTs, Weakly similar to A46010 X-linked
gb:MRO-HT0165-OS0200-006-e02 HT0155 Homo 

Hs.146322 killer cell lectin-like receptor subfami

Hs.194397 leptin receptor 

Hs.130816 ESTs, Moderately similar to I38022 hypot

Hs.1B6273 ESTs 

Hs.271986 integrin, alpha 2 (CD49B, alpha 2 subuni
Hs.265829 integrin, alpha 3 (antigen CD49C, alpha
Hs.308026 major histocompatibility complex, class
Hs.17518 gb:Homo sapiens cig33 mRNA, partial sequ
Hs.41502 hypothetical protein FLJ21276
Hs.128899 ESTs; hypothetical protein for IMAGE:447
Hs.6336 KIAA0672 gene product
Hs.4147 translocating chi



solute carrier family 21 (prostaglandin 

Homo sapiens clone IMAGE:713177, mRNA se

Homo sapiens mRNA; cDNA DKFZp564O0862 (f 



407775 
437119 
426836 
453498 



AF072873 

J05070 

AF161442 

AK000136 

NM_004914

AI379921 

N41720 

BE181412 

M26301 

AI904740 

W76467 

AI928445 

U73727 

AI922189 

U2S029 

BE563152 

AWC07280 

N64458 

AI807671 

AW972642 



Homo sapiens, clone IMAGE:4054868, mRNA 
ESTs 

frizzled (Drosophila) homolog 6 



AL049227 

H58721 

AF186114 

X57522 

M62505 



Homo sapiens HSPC324 mRNA, partial cds 

asporin (LRR class 1)

RAB36, member RAS oncogene family



hypothetical protein FLJ11767



Target Exon 
receptor (calcitonin) activity modifying 
proline oxidase homolog 



Hs.19718

Hs.288390 hypothetical protein FLJ22795
Hs.102761 Human glucocorticoid receptor alpha mRNA 
Hs.10362 Homo sapiens cDNA: FLJ2C944fis,cloneA 
Hs.115537 putative dipeptidase
Hs.143345 ESTs

Hs.24040 potassium channel, subfamily K, member 3 

Hs.293055 ESTs 

transporter 2, ATP-binding cassette, sub
gb:yb9Sa12.s1 Stratagene lung (937210) H 

Hs.191358 ESTs 

Hs.318893 ESTs, Weakly similar to A47582 B-cell gr
Hs.87497 butyrophilin, subfamily 3, member A2 
Hs.112278 arrestin, beta 1 

eukaryotic translation initiation factor
Hs.2441 KIAA0022 gene product
Hs.21103 Homo sapiens mRNA; cDNA DKFZp564B076 (fr
Hs.301746 RAP2A, member of RAS oncogene family 
Hs.48353 ESTs 

Hs.124776 downstream of cadherin 6 (by 3.3kb) 
Hs.271628 ESTs 

Hs.270737 tumor necrosis factor (ligand) superfami 
transporter 1, ATP-binding cassette, sub

Hs.2161 complement component 5 receptor 1 (C5a l
nuclear receptor subfamily 1, group I, m 

Hs.50002 small inducible cytokine subfamily A (Cy

Hs.8859 Homo sapiens, Similar to RIKEN cDNA 5830 



Hs.64016 protein S (alpha)

Hs.68877 cytochrome b-245, alpha polypeptide 

Hs.334873 ESTs, Weakly similar to ALU8_HUMAN ALU S



D50402 Hs.182611 solute carrier family 11 (proton-coupled

N51517 Hs.47282 ESTs 

AF106941 Hs.18142 arrestin, beta 2 

AW373564 Hs.251928 BANP homolog, SMAR1 homolog 

AJ271684 Hs.125355 C-type (calcium dependent, carbohydrate-

AW138797 Hs.132906 19A24 protein 

AA290712 Hs.82407 CXC chemokine ligand 16

H95990 Hs.181244 major histocompatibility complex, class 

AA804698 Hs.82547 retinoic acid receptor responder (tazaro
431921
437400 
426274 
415078 
417929 
401854 
406850 
433815 
431130 
453870 
414763 
428281 



457941 
419542 



AA505445 
AF054910 
S78187 



CD74 antigen (invariant polypeptide of in 
immunoglobulin heavy constant gamma 3 (G 
tektin 2 (testicular) 



Hs.58879 
Hs.5599 
Hs.2007 
Hs.283091 
Hs.74647 



ESTs 



NM_006103

AW385001 

U97275 

AA194554 

N22788 



Hs.183434 
Hs.82407 
Hs.37129 
Hs.279772 
Hs.14587 
Hs.302110 
Hs.90911 
Hs.13775 
Hs.198267 
Hs.211092 
Hs.82128 



429109 AL008637 Hs.196352 
409361 NM_005982 Hs.54416






427585 
445672 
432210 



409524 
406787 
419452 
406422 
421341 
421195 



415765 
427732 
449746 
40756B 
422573 
427138 



AF006823 

BE184455

X12451 

BE465754 

D31152 

AI907438 

AI567421 

AI380016 

AA2B0722 

L36531 

AA838727 

AW403448 

NM_002406 

AW971070 
AW402151 
AW090702 
U33635 



Hs.24040 
Hs.251754
Hs.78056 
Hs.17778 
Hs.179729 
Hs.282862 
Hs.273330 



Hs.91296 
Hs.124405 
Hs.16725 
Hs.151513 
Hs.184634 
Hs.291160 
Hs.54673 
Hs.240615 
Hs.90572 



EGF-like-domain, multiple 5 
tumor necrosis factor (ligand) superfami
found in inflammatory zone 3 
Human T-cell receptor active alpha-chain 
Target Exon 
collagen, type I, alpha 1 

HE4; epididymis-specific, whey-acidic p
Homo sapiens cDNA: FLJ23173 fis, clone L
quiescin Q6

ATPase, H transporting, lysosomal (vacuo 
CXC chemokine ligand 16
sodium channel, nonvoltage-gated 1, beta
brain specific protein 

ESTs, Weakly similar to AF151859 1 CGI-1 

ESTs, Weakly similar to MUC2_HUMAN MUCIN

solute carrier family 16 (monocarboxylic 

hypothetical protein SMAP31 

mucin 4, tracheobronchial 

LUNX protein; PLUNC (palate lung and nas

5T4 oncofetal trophoblast glycoprotein 

neutrophil cytosolic factor 4 (40kD) 

sine oculis homeobox (Drosophila) homolo

Human DNA sequence from clone RP1-261G23

potassium channel, subfamily K, member 3 

secretory leukocyte protease inhibitor ( 

cathepsin L 

neuropilin 2 

collagen, type X, alpha 1 (Schmid metaph
ESTs 

Homo sapiens, clone IMAGE:3544BS2, mRNA, 
b I v ! r 1 I T c HUM II ft 'it 
ESTs, Weakly similar to I38022 hypotheti
integrin, alpha 8 

ESTs, Weakly similar to A46010 X-linked 
interferon-stimulated transcription fact
mannosyl (alpha-1,3-)-glycoprotein beta-
hypothetical protein FLJ20005
ESTs, Weakly similar to ALU1_HUMAN ALU S
tumor necrosis factor (ligand) superfami 
tubulin alpha 1 

PTK7 protein tyrosine kinase 7 
Target Exon 

deleted in malignant brain tumors 1 
ESTs 



FAT tumor suppressor (Drosophila) homolo 
novel SH2-containing protein 3
Homo sapiens mRNA; cDNA DKFZp564G112 (fr
gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens 
KIAA1262 protein 



BE091833 gb:IL2-BT0731-260400-076-F04BT0731 Homo 

NM_005424 Hs.78824 tyrosine kinase with immunoglobulin and 

NM_002980 Hs.2199 secretin receptor

AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC



AB033088 Hs.272567 



N77624 
AL359590 
Z92546 



406654 M90686 

407603 AW955705 

445417 AK001058 

436982 AB018305 

427507 AF240467 



Hs.295726 integrin, alpha V (vitronectin receptor

Hs.173717 phosphatidic acid phosphatase type 2B

Hs.162604 hypothetical protein DKFZp762M186 

Hs.131819 Sushi domain (SCR repeat) containing

Hs.343475 cathepsin D (lysosomal aspartyl protease 



456637 
422129 
417785 
414849 
436986 
410598 



AW407157 

AU076635 

X59812 

AW372721 

AA740983 

AI817130 

X14008 



Homo sapiens cDNA FLJ12951 fis, clone NT
HLA-G histocompatibility antigen, class 

Hs.62604 Homo sapiens, clone IMAGE:4299322, mRNA,

Hs.12680 a disintegrin-like and metalloprotease w

Hs.5378 spondin 1, (f-spondin) extracellular mat

Hs.179152 toll-like receptor 7 

Hs.156781 ESTs 

Hs.8997 immunoglobulin lambda locus 

Hs.109201 CGI-8B protein

Hs.1478 serine (or cysteine) proteinase inhibito

Hs.82568 cytochrome P450, subfamily XXVIIA (stero 

Hs 291623 ESTb "I imil inamed protein 

Hs.210792 ESTs, Weakly similar to ALU8_HUMAN ALU S 

Hs.9195 Homo sapiens cDNA FLJ13698 fis, clone PL 

Hs.234734 lysozyme (renal amyloidosis) 
438374
417801 
425883 
428458 



433283 
426759 
435446 
421467 



451385 
400496 
409432 






AA013353 
AA720997 
AA321866 
AA417383 



BE041135 
AI590401 
AW016809 



Hs.128295 
Hs.6193 
Hs.82582 

Hs.251399 
Hs.258576 
Hs.148135 
Hs.279898 
Hs.175622 
Hs.21213 
Hs.119021 
Hs.97252 



hexabrachion (tenascin C, cytotactin)
Homo sapiens cDNA FLJ14438 fis, clone HE 
major histocompatibility complex, class 
gb:ze28h10.s1 Soares retina N2b4HR Homo 
ESTs 

hypothetical protein FLJ14590

integrin, beta-like 1 (with EGF-like rep

Homo sapiens mRNA; cDNA DKFZp434K0322 (f 

neurogranin (protein kinase C substrate, 

claudin 12

serine/threonine kinase 33 

Homo sapiens cDNA: FLJ23165 fis, clone L



AA402107 
AA045175 
AA761605 
AI392640 
AA017656 



426486 BE178285 



NM_004403

AA305746 

AA565309 

NM_002776

N40027 

AF025692 

H25899 

D86640 



Hs.257146 ESTs, Moderately similar to I38022 hypot 

Hs.17914 ESTs 

Hs.292308 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.18272 amino acid transporter system A1

gb:ze39h01.r1 Soares retina N2b4HR Homo
ENSP0000022471 S":GTP-binding protein SAR 

Hs.54460 small inducible cytokine subfamily A (Cy 

Hs.67846 leukocyte immunoglobulin-like receptor, 

Hs.170056 Homo sapiens mRNA; cDNA DKFZp586B0220 (f 

Hs.72901

Hs.6727 

Hs.6853 

Hs.13530 

Hs.49 

Hs.194015 ESTs 



Hs.132937 ESTs 



Ras-GTPase activating protein SH3 domain 
hypothetical protein FLJ22167 
deafness, autosomal dominant 5 



L12350 

AJ006276 

AW296778 

AW277221 

AW273539 

AI631310 

AI732629 

AK001679 

U73531 

AK000998 

AF085839 



431169 
425916 
419721 



415094 
434088 
420727 



Target Exon 
Hs.108623 thrombospondin 2 
Hs.159003 transient receptor potential channel 6
Hs.144734 Human DNA sequence from clone RP3-416F21 

nel (chicken)-like 2

ESTs 

hypothetical protein FLJ23577
Hs.196955 ESTs

ESTs, Weakly similar to TA2R_HUMAN BETA
Hs.202289 hypothetical protein DKFZp434P1735 
Hs.34526 G protein-coupled receptor 
Hs.297221 Homo sapiens cDNA FLJ10136 fis, clone HE

gb:Homo sapiens full length insert cDNA 
Hs.218289 ESTs 

Hs.89791 wingless-type MMTV integration site fami 
Hs.190389 ESTs 
Hs.182080 ESTs 

Target Exon 

gb:EST383329 MAGE resequences, MAGL Homo
Hs.162200 urotensin 2



AW971240 
NM_006786 
NM_001650
AL120297
AF001622 



AF116677 Hs.249270 hypothetical protein PRO1966

H75701 Hs.99886 complement component 4-binding protein, 

AW277085 Hs.99619 ESTs 

AV660737 ESTs 

AI796317 Hs.203594 Homo sapiens uncharacterized gastric pro 



AJ227892 Hs.146274 ESTs 




BG9401 89 AW063489 AA71 5980 BF001091 BF880066 AA6661 02 AA621946 AA491826 
421798 3042.4
400269 2726_1 



418259 133853_1 



429228 215430J 



BC017829 AW27S646 A1984209 AA663933 AA634104 AA551528 AA634041 AA298038 BG483990 T89297 BF853958 H64685 T90329 T60644 
T57747 BF852694 T92529 BG482852 BF883064 BF883066 N74880 AA829796 N90716 

X65018 BC022318 NM.003019 BE465060 AI732255 BF446634 AI820S77 AI002217 AI924488 BI821373 BI770406 BI823937 BI820265 BI489632 
BG48291 1 AA617783 AI807697 AW205576 T94427 AA487101 T94513 BI819407 BI822450 BI82061 8 BI824619 BG542824 BG537862 

BG207209 BE1 66299 AI204995 BG1 99355 AW9S9908 AA52875S AW440776 BI044354 

BG292389 C06094 AI668930 AW104534 AA310513 AA830127 AW134897 AA046953 AW965490 AI810530 BF092924 AA334151 AA334725 
D31302 R20723 AA263003 BI824635 AI276287 AI684423 A!52423^ AI33E035 AW014704 AI911443 AA972102 AI367512 AI126670 AW016017 
AI286003 Al 1 47163 AA626033 A153S156 AA565542 Al09^253 AW51261 2 BE889628 AA744752 BE646306 AW471324 AA999975 AA863400 
H17550 AI991439 R46187 BE929954 AA33397S D63102 BF744491 

BM31 0925 AA4261 1 0 BM31 0629 BF434286 AW01 509 1 BF475996 AW1 1 8867 BE6751 86 AI688568 AI453594 AW590589 AI652425 AI827969 
BF056946 AI802866 AI393380 AI476224 AW590639 AW136271 AI458252 AI524726 AA843768 AA7821 58 AI336058 AI097532 AW451563 
AA459408 AA459633 AA418444 W23607 BG9«1S0 AI493445 AW054729 AI221929 AI868744 AA215405 AA766713 AA621546 BF928317 
BE464132 AI990909AW271459AI262061 AA215404 N74332 BG940151 BG952261 AA972115W96315AA689586 R59057 BF766885 BE769254 
W05240 

BG6761 55 BM009591 AI479075 AI025794 A101 7967 AA448270 BE46681 2 AA853422 AI392649 BG952034 AA51 3384 BF8401 24 BE714620 
AW969505AI553633 

BG207209 BE166299 AI204995 3G199355 AW969908 AA528756 AW440776 BI044354 

AI758223 AW469334 BF940841 AW080348 Al 270363 AI055892 BE464168 BF431797 BE350144 BF448739 AI693409 BF432999 D62848 AA398070 
AI383375AW611490 

M1 36653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE1 68945 

AW969880 M484613 AA501874 R34356 

AI732297AW204600T95017 

AW340495 AI98431 9 AA974603 

M18728 

AW473675 AI190744 W69997 AW104913 AI221098 W69996 AA885487 AA861491 

U52054 AL581 000 AA1 56850 AW293839 B1335865 AA024963 BF149420 BE073977 AW602574 BE164012 BE163992 BE163974 AW402161 
BM1941 34 AW966609 W84374 BF91 6380 AA385173 W84366 AA383743 BF903598 AA043776 W84421 AA778446 AW444904 BF446960 
AA837481 AV755539AW468444AW468002 AA811830AA581806A1866686AI572124AA687333D20160AA812489 AU185248AU166004 
AA155781 AI536733 BM144850 AI471883 AA04092S BF507639 AA043777 AW874142 BE832523 BE1 63972 BI022546 BI021204 
AI000341 AI766341 AW873274 
AA602964AA609200 

AI884781 AI652306 AI651694 AI638744 AI962493 






440248 2616908J 

442006 1239046.1 

438177 9337.12 

435154 126605.1 



443647 137089.2 
444314 1027984.1 



413638 1525406.1 
436772 1239464.1 
448782 34980.1 



AW975183 AA973583 AI365103 A1699495 AI301787 

AK024965 AU158033 M978370 R79120 BE327015 AA779740 R79121 

AW972063 AA668764 AA804491 AW665583 AA765069 

L1 3288 AA928785 AI608912 AW872978 AA565655 AI02291 5 AI304920 AI564356 AI668793 AI094557 T60038 R72302 H45409 AA508805 R46356 
AA418798 BM1 29553 BM129126 BM129292 BM128865 AI808418 AI689932 AI806573 BF431808 AW872985 AW 1 66269 H73241 T16182 AI264547 
R73391 R72085 R72840 T83751 X75299 BF754348 R94105 AW449839 R73300 NM.004624 AI797007 BE045543 BF110021 BF754250 T83923 
AW884084AA903896M418962 L20295 R72351 K45098 AA961010 R73210 R46451 AWBB4085 BI022902 BI763932 BI91 0138 
AV646808 AA701 657 AW271 273 AI796734 AI47231 6 AI01 7 531 AI061 1 78 BF109096 AA548964 N33805 AA131648 AA1 56589 BE703349 
AW952494 D30877 AV684717 Z24837 F00167 BF576150 T63841 R78995 N87474 
AW749625 AW749626 AW749644 AI140497 

AI334106 R63583 AI028079 AI140098 AI91 1625 AI890637 F34815 T65959 N40935 W52768 AA854747 AA861945 AA878472 AA77B270 W32249 
AA026061 V\526t2,\1 2 V7967I 'AC6384 T94283 AA002155 R82052 BE825493 BE825520 
AW973785H60163 M557608 

AW975851 AI91 1033 AI540093 AA932284 AW297855 AA829228 AA814442 AW105017 
H71252 BE154S42 BE154668 

AA156151 Z25109 C05177 AW975688 AA731063 N67084 
418262 5824.2
446354
406687 
411605 



831448.1 
10026.3
AA432067AW572442

AA136653 AA1 36656 AW450979 AA984358 AA809054 AW238038 AA492073 BE168945 
D78831 C17898 D78863 
R36075 R361 67 Al 366546 

BG425760 BF997600 N48516 W73454 BF816344 BF997601 AA488953 
BF036043 AW190446 BG194731 AW662036 AI445021 BE937550 AW818972 AW393132 AA834685 BF1 12058 AV721682 H1 6423 AI270167 
AIB57345 AA937302 AW818444 BE929780 BG498678 3F155010 BI598271 BI599811 BE1 61728 AW578737 AW75371 1 AW379707 AW381918 
BG506608AW028637AW994240BF887392 BF790073 AW381624 AV727105 BF439618 AA443174 AI018009 N42850 AW573242 AI417258 
418304
449677 
437838 
429421 
410730 
427876 
434987 
442369 
408873 



79505J 

2512601J 

117617 1 

114639J 

1373914J 

121985_1 

2691713J 



B3 AI676131 AI167170 AA836627 AA443828 AW592922 AA235129 AA730278 AW439062 AW474332 BI043239 AW474342 BG708553 

256 R16550 R39478 R39479 R94368 BG540916BM314745 AA251087 D54231 D55274 BF085805 D31589 AW966405 

AW994425 D81 879 BE093545 AW901 1 07 AA383E29 BI021 552 RE6420 N39976 AA573281 H82595 AA234955 BE093539 AW357006 BF358697 
BF366318AA663856BE70209S3F0- I 9AI 1AI2 E 4 J AA521 574 AA861212 BF083343 BF083341 AV745131 D53074 
AW954476 AW954472 AA376836 AV724531 D53C63 C14928 AA093287 AA062638 BG483558 BE940050 AA765954 T70171 BE938775 BE940057 
D53502 AW373300 AL118798 BM1 28728 AA193411 AW444709 AW952455 AI887612 BF431 948 BI496876 AI264159 BM128481 AI624657 
AI689301 AI969467 M861685 AA251595 AA6257S1 AA872090 AI826790 AA328366 BE827416 R75951 D56918 R68122 BE827384 AL1 18797 
AI184164 AA16441 1 BI495332 BE8581 13 AI863860 H00660 T69849 AW780389 C14667 BE934995 BI018652 R92801 AA16441 0 H00752 
AW373305 AW373299 AW373302 
BE883520 BI057842 AA215702 AA215703 AA368006BE 
AA002232 T99209AA002071 






431843 
414516 



445334J 
60847J 



409745 
437751 



AW341668 AI201382 AW104364 M648367 AA897604 AW977914 AA81 1957 AI352198 AL040620 

BG427950 AA82601 6 AI903441 BG939868 AW9791 54 AA640872 BG5471 34 AA457091 AW368860 W88852 R94779 AA088823 

AI494291 AI582807 AA417018 AA608841 

AW9751 14 AI096634 AI767001 AA658364 AW517542 

AA99356B AI52 1 958 AI565071 AI864217 

AL046017 AW504804 AA532433 AI634701 AM 26421 AW1 51 275 AA620782 AA933047 AI828223 AI624555 AI624489 AV749032 BI495751 
AI672286 AA342944 AV748076 AV747586 AA058597 H56073 W87367 R71630 H84499 AI866297 D25918 AV746788 N78995 H63752 H82985 
R46600 BE786692 R71 1 28 AA608909 R29728 BG534018 

Y00281 NMJ02950 BC010839 BC007995 BG675232 BM453552 AL555484 BG831516 BF035300 BG677277 BF852972 BE314901 BF850656 
AI371 81 6 AA292474 AA375747 AA30841 4 BM454E44 BI333370 BM049921 BI461428 BI465007 BI223401 BE856245 AW821 1 64 BF914775 
BF914761 AU125835 B1222678 BI091137 BF340536 BM462798 BI224452 BG707915AL569160 AA443815 AW572867AW363410BF739268 
BG010283 BI013120 BF818845 BF763468 AA30E165 AI530370 AA039826 R24906 H02046 T96891 BF981330 AW936510 AA478169 H04587 
BG166574 BI869342 BE562482 BE539637 AA165C89 AL579118 AL553699 BED44054 AW1 17440 AI520674 BF435417 AW245648 AI952404 
T29534AU153459 AU152168 AW591591 AU146918AI393187 AA478013 AU148143 AI224471 AI640728 AI871537 AW264752 N93787 AI189357 
AV756134 AI471659 AU147466 AA779205 AU149419 AU149104 AU159135 AA312221 AW445119 AW021912 AI799771 F04407 AI285530 
AI91 4643 AW068751 AA51 3325 AA1 64627 AA639285 AA569B44 T96892 AI923594 BF4391 80 BI770936 BF032438 AU154884 AAB82793 
AW072992 AU1S8815 AI884444 AL048031 AU158922 AU152545 AIS95187 AL048033 AI245650 AU148507 AW467451 BE536868 BF913001 
BF062707 AL573082 AW067993 AA523354 BE886727 AI890705 AU159092 AI982693 AI81 7553 AA236729 AI687858 BG1 63767 AI524675 
AI6781 55 AA1 27100 AI762661 AU 1 5971 8 AI469720 AA-183627 AW1 31696 R26868 Al 199885 AW87561 4 AW938694 AW578974 BI763988 
BG819168 BE874767 BG978292 BE162948 AL555483 AW189719 T56783 AI018819 AI476552 BI492837 AI824440 BG996262 AA932887 AI380726 



AW972351 AW182936AI478370AA528309BG997292AW137742AI632006AA775020AA961625 W86628 BF112014 AI275423 AI680786 



U59185 NMJ04696 AV734324 AI2453« AA3B9517 1 188753 D79128 AA970406 H01059 H88761 H03446 BG620383 AU135008 AU136895 
AU1 58158 AU155762 R73608 R65751 R23756 N74630 AW078687 BE439761 BE786351 R68994BE7B5B67AW297502AW297553 BG431545 
AW814843 BF382644 BG429539 BE929862 BF811258 
H61453 AA838765 AI693104 AA721107 

AF392454AK023074A1884890AI814455 AW966220BF736545AA026021 AA286843AA251918BG197710AA026294AA337356 BC010422 
AK023226 NMJ22776 BM459496 AA76931 0 AI826463 AU153650 AU160375 AW16621 1 AW292992 BF433538 AI823888 AI684798 AI655985 
AW770982 AI400454 AI276257 AA63951 0 AI6B981 8 AW772604 AA807639 AU13029B AU132028 BF9008B9 BF904822 AY007102 AU143256 
BG621 450 AA829630 A'864665 AI0B4922 AA025234 AI350060 AA766554 AA026295 AA825817 AA251762 BE180751 BM464530 AV715833 
AA779447 AI45251 9 AW41 8525 AA435643 D25894 AA435651 AA286844 N643B9 AI702262 BG288063 BE1 70545 BM466232 BE2991 60 AA1 69573 
AL5B7428BF217285BE967276 
AA609784 R97304 
C14774C17911D79033 

BC017171 BC012195 NM_007126 AF100752 AL137377 Z70768 BM474865 BG754806 AU124376 BG757203 BG764420 BG775028 BG824418 
BM045810 AU120387 BG770238 BG685740 BG913323 BI759980 BG395998 BM048B75 BE881070 BE313589 BE879144 BM309834 AW245847 
AW978796AA7B7373AW1 73343 AA8361 63 N 27563 AA905328 R97032




458194 
421461 
430850 



455797 
407244 
BM475665 BE644917 AW770789 AW952971 N64863 BM263259 AI224545 AI184866 N691 14 AW518902 AI440169 AA809472 AV654440 
AA281642 AU185230 AW337382 AI872923 AI537113 N73882T83378 H63731 BF671764 AW897824 AI811204AA344646 BE0091 12 BG899664 
H91240 R60548 N41701 

BI826340 BI868674 R1261 5 AW887767 BF439409 Al«4995 BG059893 AA417003 AI220270 AA418740 AI190974 Z39070 AA742556 AA835058 
AA694436 R20520 AA418795 BG460307 BI560147 
AI990640 AI380016 BM273298 BM273060 

NM.007329 AF159456 AJ243212 AJ297935 AA295769 NM_017579 AJ243224 AI492875 AI796676 AI749838 AA918144 AI814590 AI923531 
BE088101 T05990AW872477
AI928355 AI709178 AA436447 AI431274 BF946000 



1241 1 26_1 AW977507 M503803 AA7671 37 M828032 M828033 
85022J M019761 AA017656 M017374 

450370J AF150283AW182000AW277221 AV735848 
41648.2 AK026817 AI559708 AW273539 AW892986 

2523501.1 AI732529AI732831 AA776249 
4045.1 AF085839 R69254 R69137AW188788 

1235760.1 AW971240AA493723AA493843 
1488711J BE063078 BE151503 BE151498



Unique number corresponding to an Eos probeset

Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "[
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Indicates DNA strand from which exons were predicted.
Indicates nucleotide positions of predicted exons.



404277 
402674 
404240 
405102 
406122 
400750 
404394 



Ref Strand 
9931121 Plus 



7143420 Plus 



Nt_position



53526-53628,55755-55920,57530-57757 

116229-116371,117512-117651 

91665-91946 



116132-116407,116653-116922 

120922-121296 

30940-31386 

198991-199168,199316-199548 
37121-37205,37491-37752,41053-41140,4132 
126609-126773,139986-140205 
101165-102597 
401854
406422 
400496 



7770538 Plus 



7711431 Plus 



151483-151637,151902-152008,152146-15231
41515-41695 



TABLE 33A: About 800 genes upregulated in lung fibrosis relative to normal lung 
Pkey: Unique Eos pri 




and denominator was set to 50



AJ250717 
L22524 
X63629 



453874 
443884 
444040 



UnigeneID Unigene Title

FGENES predicted no 
ESTs, Weakly similar I 
Hs.1355 cathepsin E
Hs.2256 matrix metalloproteinase 7 (matrilysin,
Hs.2877 cadherin 3, type 1, P-cadherin (placenta
Hs.283713 hypothetical protein BC014245
Hs.172928 collagen, type I, alpha 1
'" ""9114 hexabrachion (tenascin C, cytotactin)




Hs.7358 
Hs.94262 
Hs.282070 



ESTs, Weakly similar to KCC1_HUMAN CALCI
serine carboxypeptidase 1 precursor prot
nuclear receptor interacting protein 1
gb:ow76b09.s1 Soares Jet 1 _ r 1 i _ 
small inducible cytokine subfamily A (Cy 
protein tyrosine phosphatase, receptor t 
alpha 1 (undulin) 

golgin-67

ATPase, H transporting, lysosomal (vacuo 
hypothetical protein FLJ13110
p53-inducible ribonucleotide reductase s 
422573
430665 
445006 



AW015318 
AA001615 
AW206248 
AL133396 



Hs.295726 
Hs.157367 
Hs.13530 
Hs.145807 

Hs.23165 
Hs.84561 
Hs.111092 
Hs.74621 
Hs.9414 
Hs.11861 



integrin, alpha V (vitronectin receptor

ESTs, Weakly similar to I78885 serine/th 

deafness, autosomal dominant 5 

hypothetical protein FLJ13593 

KIAA0980 protein

ESTs 

ESTs 

hypothetical protein FLJ22332 
prion protein (p27-30) (Creutzfeld-Jakob 
KIAA1488 protein 



418005 AI186220 



424917 
424408 
418113 



418259 
426716 
432810 
407112 
436100 



AI636208 

AI754813 

AI272141 

F11875 

AW016343 

AF027208 

AI005163 

AA215404 
NMJ06379 
AA863400 
AA070801 
AA704806 
AI801777



R19015 



410273 
442048 
435120 
423575 



434340 
400076 
431049 



AA974603
AI248193 
C18863 



426458 
408369 
432476 
434963 
436446 



409977 
441297 
421229 



Hs.145428
Hs.83484 
Hs.5534 
Hs.233301 



collagen, type XV, alpha 1 
gb:yd78d06.s1 Soares fetal liver spleen 
hypothetical protein FLJ23049 
collagen, type V, alpha 1 
SRY (sex determining region Y)-box 4 
Homo sapiens cDNA FLJ12961 fis, clone 



AA852773 
AW118210 
AF217518 
AI193043 

AA846576 

H73456 

AI199510 

AF019226 

AW803564 

AI878927 

AW276914 

AU077058 



Hs.135624
Hs.79067 
Hs.120969 
Hs.164225 
Hs.266957 
Hs.134269
Hs.24758 



Hs.119860 

Hs.163443

Hs.24605 

Hs.6727 

Hs.211610 

Hs.301064 

Hs.334838 

Hs.42321 



hypothetical protein FLJ14281 
ESTs, Highly similar to cytokine recepto 
ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to I38022 hypotheti 
hypothetical protein FLJ10120
KIAA1634 protein

Homo sapiens cDNA: FLJ23523 fis, clone L
Qb:o P 34f05.3l Soares_NrL 7 GDC 31 Homos 
ESTs 

intron of periostin (OSF-2os)



Hs.103257

Hs.13299 

Hs.267912 

Hs.8036 

Hs.288850 

Hs.79284 

Hs.326714 

Hs.54089 



hypothetical protein FLJ22548 similar to 
Homo sapiens mRNA; cDNA DKFZp761M0111 (f 
ESTs, Weakly similar to ALU7_HUMAN ALU S 
glioblastoma overexpressed 
Homo sapiens cDNA: FLJ22528 fis, clone H
mesoderm specific transcript (mouse) hom 
Homo sapiens clone IMAGE:713177, mRNA se
BRCA1 associated RING domain 1 



R38438 
T94344 
AW974957 
AW016809 



AW960454 

N75217 

AA337401 

AI064821 

NM_000055 



Hs.288719 Homo sapiens cDNA FLJ12142 fis, clone MA

Hs.119021 ESTs

Hs.163603 ESTs 

Hs.279696 serum/glucocorticoid regulated kinase-li 

Hs.10338 ESTs 

Hs.183817 ESTs

Hs.75182 mannose receptor, C type 1 

Hs.3066 granzyme K (serine protease, granzyme 3; 

Hs.8364 Homo sapiens pyruvate dehydrogenase kina 

Hs.97056 hypothetical protein FLJ21634

Hs.7766 ubiquitin-conjugating enzyme E2E 1 (homo

Hs.7086 hypothetical protein MGC12435

Hs.152981 CDP-diacylglycerol synthase (phosphatida

Hs.222830 ESTs 

Hs.257846 ESTs 

Hs.137635 ESTs 

Hs.318535 ESTs, Highly similar to 1818357A EWS gen
jin SI



us mRNA; cDNA DKFZp434G227 (fr 
gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens 
hypothetical protein FLJ20022
DKFZP434I1735 protein
transcription factor 7-like 2 (T-cell sp
ESTs 



AU076643 
AW749505 
AA032197 



Homo sapiens mRNA; cDNA DKFZp564K143 (fr
Homo sapiens cDNA: FLJ22902 fis, clone K
ubiquitin-activating enzyme E1C (homolog
ESTs 

hypothetical protein FLJ22833
KIAA0202 protein 

ESTs, Weakly similar to KIAA0822 protein
collagen, type III, alpha 1 (Ehlers-Danl 






Hs.348024 
Hs.35086 
Hs.115175 



an leukemia viral oncogene hom 



NM_006235 Hs.24C7 POU do 



collagen, type VIII, alpha 2 



Hs.116 

Hs.78521 

Hs.72325 

Hs.43213 

Hs.75825 

Hs.301885 

Hs.292996 

Hs.6968 



metallothionein 1E (functional)
KIAA1717 protein

ESTs, Weakly similar to I38022 hypotheti
ESTs, Weakly similar to IEFS_HUMAN TRANS 
pleiomorphic adenoma gene-like 1 
Homo sapiens cDNA FLJ11346 fis, clone PL
DKFZp434A0131 protein 
KIAA146C protein 



ESTs 
ESTs 

myotubularin related protein 1 



AA045464 

Z36842 

L03678 



immunoglobulin kappa constant 



454076 
431183 
437158 



AW968078 

R10799 

AW204712 

NM_006855

AW090198 



Hs.173524 ESTs 

Eos Control 
Hs.87773 protein kinase, 
Hs.191990 ESTs 
Hs.61957 ESTs 
Hs.250596 KDEL (Lys-Asp



.eu) en 



Hs.166571 ESTs 

441633 AW958544 Hs.112242 normal mucosa of esophagus specific 1

452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL

444057 AA316896 Hs.257267 FYVE and coiled-coil domain containing 1 

411495 AP000693 Hs.70359 KIAA0136 protein

Hs.288959 hypothetical protein FLJ20920



AA148710 
AW972594 
AA748675 
AI253112 



Hs.335499 ESTs 
ESTs 

Hs.133540 ESTs 





M1 26419 


Hs.32944 


inositol polyphosphate-4-phosphatase, iy 


3.86 




NM_014071 


Hs.159613 


nuclear receptor coactivator RAP250; per


3.86 




BE385828 


Hs.250619 


phorbolin-like protein MDS019 (CEM15) 


3.86 




AW629965 


Hs.234983 


ESTs, Weakly similar to 2109260A B cell 


3.85 




AW800444 


Hs.76507 


LPS-induced TNF-alpha factor


3.85 


432705 


NM_013230 


Hs.286124 


CD24 antigen (small cell lung carcinoma 


3.85 


412636 


NM_004415




desmoplakin (DPI, DPII) 


3.84 


436169 


AA888311 


Hs.17602 


Homo sapiens cDNA FLJ12381 fis, clone MA 


3.84 




M740616 




gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 


3.84 


436110 


AA704899 


Hs.291651 


ESTs, Weakly similar to 138022 hypotheti 


3.84 






Hs.239189 




3.84 




AW294522 






3.84 


414320 


U13616 


Hs.75893 


ankyrin 3, node of Ranvier (ankyrin G) 


3.82


430512 


AF182294




U6 snRNA-associated Sm-like protein LSm8


3.82 


427051 


BE178110 




Homo sapiens cDNA FLJ10500 fis, clone NT 


3.82




AA744550 




ESTs 


3.82 


453394 


AW960474 


Hs.40289 


ESTs 


3.81 


431266 




Hs.105411


ESTs 


3.80 




AW975114 






3.80




AI634651 


Hs.30250 


v-maf musculoaponeurotic fibrosarcoma (a


3.79 


435176 


AA744875 


Hs.189413 


ESTs 


3.78 




M349944 


Hs.42915 


ARP2 (actin-related protein 2, yeast) ho 


3.77 


430709 


R34356 




gb:yh85d01.s1 Soares placenta Nb2HP Homo


3.77 






Hs.173824 


thymine-DNA glycosylase 






M306207 




protein kinase, cAMP-dependent, regulato


3.76 




AL134324


Hs.7312 


ESTs 


3.76 




AA224827 




gb:nc32g04.s1 NCI_CGAP_Pr2 Homo sapiens 








Hs.156044


ESTs 


3.75 




D51103 




ATP synthase, H transporting, mitochondr 


3.75 






Hs.9927 


Homo sapiens mRNA; cDNA DKFZp564D156 (fr


3.75 






Hs.77367 


monokine induced by gamma interferon 


3.75 


429494 


M769365 


Hs.126058 


ESTs 


3.75 


447118 


AB014599 


Hs.330988 


Homo sapiens, Similar to Bicaudal D (Dro


3.75 




AW500715 


Hs.57079 


Homo sapiens cDNA FLJ13267 fis, clone OV 


3.74 


419591 


AF090900 


Hs.91393 


Homo sapiens cDNA: FLJ21887 fis, clone H 


3.74 






Hs.128653 


hypothetical protein DKFZp564F013


3.74 






Hs.292710 


E 1 Hv mi'ar to ALU5 HUMAN ALUS 


3.74 




AA243499 


Hs.104800 


hypothetical protein FLJ10134


3.73 


414883 


AA926960 




CDC28 protein kinase 1 


3.72 


416178 


AI808527 




serologically defined breast cancer anti 


3.72 




BE618654 


Hs.28607 


hypothetical protein A-211C6.1 


3.72 


444099 


D87432 


Hs.10315 


solute carrier family 7 (cationic amino 


3.72 




AW080237 


Hs.252884 


ESTs 


3.72 




AA912183 


Hs.47447 


ESTs 


3.72 


418699 


BE539639 


Hs.173030


ESTs, Weakly similar to ALU8_HUMAN ALU S 


3.70 


419900 


AI469960 


Hs.170698


3.70 


446342 


BE298665 


Hs.14846 


Homo sapiens mRNA; cDNA DKFZp564C016 (fr


3.70 






Hs.13804


hypothetical protein dJ462023.2


3.70 




AW947164 


Hs.195641 


ESTs, Moderately similar to I38022 hypot 


3.70 




R78669 


Hs.23756 


hypothetical protein similar to swine ac 


3.70 


447846 


AA324057 


Hs.77955 


Homo sapiens cDNA: FLJ23527 fis, clone L


3.70 




AK001125




Homo sapiens cDNA FLJ10263 fis, clone HE 


3.70 


418300 




Hs.86682 


Homo sapiens cDNA: FLJ21578 fis, clone C 


3.69 


408495 


W68796 


Hs.237731


ESTs 


3.69 




N41367 


Hs.173002


ESTs, Weakly similar to I38022 hypotheti 


3.68 


448479 


H96115 


Hs.21293 


UDP-N-acteylglucosamine pyrophosphorylas


3.68 


431974 


AW972689 


Hs.200934 


ESTs 


3.68 




NM_000633 


Hs.79241 


B-cell CLL/lymphoma 2 (BCL2)


3.68 




X16896 


Hs.82112 


interleukin 1 receptor, type I 


3.68 




AA130992




gb:zo15e02.s1 Stratagene colon (937204)


3.67 




BE513171 




mitochondrial ribosomal protein L3


3.67 


419111 


AA234172 


Hs.137418 


ESTs 


3.67 


423979 


AF229181 


Hs.136644 


CS box-containing WD protein 


3.66 






Hs.233459 


ESTs 


3.66 


451690 


AW451469 


Hs.209990 


ESTs 


3.66 




AI684746 


Hs.119274


RAS p21 protein activator (GTPase activa


3.66 


414888 


AL039185 


Hs.77558 


3.66 


428347 


AI264161 


Hs.183773 


golgi autoantigen, golgin subfamily a, 4


3.66 




AA384577 


Hs.93714 


ESTs, Weakly similar to T00365 hypotheti 


3.66


435335 


AI693150 


Hs.137928 


ESTs 


3.66 






Hs.64639 


glioma pathogenesis-related protein 


3.66 


452933 


AW391423 








429105 


D87077 


Hs.196275


KIAA0240 protein


3.64 




AL120247


Hs.40109 


KIAA0872 protein 


3.64 


425863 




Hs.159901 


Human unidentified mRNA, partial sequenc 


3.64 


451678 


AA3741B1 


Hs.26799 


DKFZP554D0764 protein 


3.64 


452420 


BE564871 


Hs.29463 


centrin, EF-hand protein, 3 (CDC31 yeast 


3.64 


452408 


AA306477 


Hs.29379 


hypothetical protein FLJ10687


3.64 


441466 


AW673081 


Hs.54828 


ESTs 


3.63 


414013 


AA766605 


Hs.47099 


hypothetical protein FLJ21212


3.62


420056 




Hs.99804 


ESTs 


3.62


424886 


H88584 


Hs.96900 


hypothetical protein; KIAA1830 protein 


3.62


431774 


BE348813 


Hs.268561 


hypothetical protein FLJ10726


3.62
AW499770
AA868729 
AV660976 



protein tyrosine kinase 9 
amylo-1,6-glucosidase, 4-alpha-glucanotr
hypothetical protein FLJ11264
Homo sapiens cDNA FLJ20848 fis, clone AD 



ESTs, Moderately similar to I38022 hypot
ESTs 

Homo sapiens cDNA FLJ14752 fis, clone NT
ESTs 

hypothetical protein FLJ20220



KIAA0127 gene product 



Protein kinase C-binding protein NELL2
granzyme A (granzyme 1, cytotoxic T-lymp
Homo sapiens cDNA FLJ13266 fis, clone OV



ESTs 
KIAA0379 protein 
hemicentin (fibulin 6)
FEM-1 (C.elegans) homolog b
ESTs, Weakly similar to I38022 hypotheti
AKAP-associated sperm protein
ESTs 

Homo sapiens cDNA FLJ11997 fis, clone HE
ESTs 

KIAA0944 protein 

Down syndrome critical region gene 1-lik
hypothetical protein FLJ12666



418836 
421462 
414555 
432401 



Hs.185675 ESTs



Hs.29052 
Hs.87773 
Hs.284296 



aquaporin 9 

phospholipase A2, group IIA (platelets, 
NME7 

KIAA1541 protein 
ESTs 

hypothetical protein FLJ20189



NM_014897

D81448 

AW074266 

AW293134 

H05072 

AI620677 

U20158 



Hs.32597 
Hs.124984
Hs.73105 



ESTs 
Target Exon 

leiomodin 1, smooth muscle (LMOD1) (Thy
gb:n)21d02.s1 NCI_CGAP_AA1 Homo sapiens 
hypothetical protein FLJ13159 
ATP-dependant interferon response prote: 
ESTs 
ESTs 

tumor protein p53 (Li-Fraumeni syndrome) 
caspase 6, apoptosis-related cysteine pr 
supervillin 

Homo sapiens clone 25194 mRNA sequence 
ESTs, Weakly similar to 2004399A chromos 
seven in absentia (Drosophila) homolog 1 
nuclear localization signal deleted in v 
HHGP protein 

gb:nl17c05.s1 NCI_CGAP_HSC1 Homo sapiens
KIAA0924 protein 

Homo sapiens brother of CDO (BOC) mRNA, 
ESTs 



Hs.186937 ESTs



lymphocyte cytosolic protein 2 (SH2 doma 
454070
432572 
442425 
428412 
448772 
417057 
413714 
415663 
407904 
421114 
440214 



AF080158 

T81819 

AA002008 

X89984 

AI792014 

AA504595 

AA847510 

BE622187 

AL157466

AI745458 

AB020638 

AW903988 

AW473675 



W44735 
AW975051 
AA247118 
AL042005 
AI916058 
AW500775 



AA353776 
AA676939 
AI873871 



441623 
420729 
440010 



410344 
417259 
421379 
434210 
431923 



412262 
425071 
446094 



409758 
417958 
410763 
419543 



449378 
455657 
420126 
444291 
424084 
437330 
443774 



Hs.226573 
Hs.302251 
Hs.188633
Hs.264610 
Hs.211563 
Hs.13809 



inhibitor of kappa light polypeptide ge; 



Hs.343026 
Hs.103000 
Hs.62119 



Hs.126083 
Hs.301528 
Hs.81086 
Hs.71428 
Hs.313332 



ESTs, Moderately similar to Ibd1 [H.sapi
B-cell CLL/lymphoma 7A
hypothetical protein FLJ10648
ESTs 
ESTs 

ESTs, Weakly similar to I38022 hypotheti 

Homo sapiens mRNA; cDNA DKFZp751C2423 (f

ESTs.Weakl milerlol 3 hyp thel 

KIAA0831 protein 

hypothetical protein FLJ14800

ESTs, Weakly similar to T17227 hypotheti

collagen, type IV, alpha 3 (Goodpasture 

i i l r hum 

hypothetical protein MGC5370 



L-kym 



Hs.144583 ESTs 



Homo sapiens cDNA: FLJ21278 fis, clone C 
ESTs, Weakly similar to I78885 serine/th 
hypothetical protein FLJ11305
tripeptidyl peptidase II 



Hs.69285 
Hs.7041 
Hs.82251 



hypothetical protein FLJ20136 
J i lerc r r C 

Homo sapiens cDNA FLJ12136 fis, clone MA 
CD48 antigen (B-cell membrane protein) 



AA315805 
AW964897 
AA534930 

N46161 Hs.35274 

AA903424 Hs.6786 

AI767756 Hs.82302 

X04011 Hs.88974 

AA325633 Hs.136102 

AI381687 Hs.39526 

AB020681 Hs.27973 

AW083789 Hs.124620 
BE539101 

AW978436 Hs.62515 



alpha-actinin-2-associated LIM protein

desmoglein 2 

ESTs 

hypothetical protein FLJ12879 



Homo sapiens cDNA FLJ14814 fis, clone NT
cytochrome b-245, beta polypeptide (chro 
KIAA0853 protein 



AA665612 

AI741770 

AI336266 

AA443691 

BE267795 

AI241331 

AB007883 

W26406 

NM_013989 

AK001760 

AK001389 



Hs.32353 
Hs.90858 
Hs.22595 
Hs.131765 



AA767382 
AF279145 
AA244170 



NM_016255

AI598022 

AI940675 



Hs.15144 
Hs.186647

Hs.274368
Hs.239489
Hs.136245

Hs.182258
Hs.193417
Hs.8966 

Hs.293907 
Hs.172510 
Hs.188522
Hs.59892 

Hs.95260 
Hs.193989 
Hs.20914 
Hs.50115 
Hs.9740 



ESTs, Weakly similar to I38022 hypotheti 
mitogen-activated protein kinase kinase 
Homo sapiens clone 25023 mRNA sequence 
hypothetical protein FLJ10637 
ESTs, Moderately similar to I38937 DNA/R 
KIAA0423 protein 

seven in absentia (Drosophila) homolog 1 
deiodinase, iodothyronine, type II 
KIAA1685 protein 

hypothetical protein DKFZp564O043 
ESTs 

Homo sapiens cDNA: FLJ23020 fis, clone L



ESTs, Weakly similar to I78885 serine/th 

hypothetical protein FLJ21776
gl ic! h02.s1 NCI_CGAP_Pr1 Homo sapiens 
' 1 -Ttein FU23403 



gb:RC1-BT0314-310330-015-b12 BT0314 Homo 
Autosomal Highly Conserved Protein 
TAR DNA binding protein
hypothetical protein FLJ23056
Homo sapiens mRNA; cDNA DKFZp761J1112 (f
DKFZP434A236 protein 
425657
406672 
419905 
425332 



AU077196 
AA602539 
AA748009 
AA306033 



426083 
420059 
448412 
436758 
438011 



414405 
445893 
434423 
408951 
408949 
410337 
409010 
400419 



AL134708

AA332145 

AA286887 

AW962712 

AF161486 

AI2190B3 

AW977167 

BE466173 

NM_004010

U04953 

AB020658 

W20295 

AA463437 

AW851121 

W0560B 

S79895 

AI362533 

AI610702 

NM_006769

AW407227 

AF189011

M83822 

AI648575 

AF084545 

M601518 

NM_003417

AI683049 

R77776 



Hs.119471 ESTs

Hs.193253 major histocompatibility complex, class

Hs.93659 protein disulfide isomerase related prot

Hs.127279 ESTs

Hs.250897 TRK-fused gene 

Hs.82985 collagen, type V, alpha 2 

Hs.345494 ESTs 



GA-binding protein transcription factor, 
excision repair cross-complementing rode 
acetyl-Coenzyme A acetyltransferase 2 (a 



Hs.77929 

Hs.145998

Hs.13392

Hs.24724 

Hs.125712 

Hs.94769 

Hs.42532 

Hs.155272 

Hs.145696 

Hs.169470 

Hs.172801

Hs.5867 

Hs.288178 

Hs.11556

Hs.75497 

Hs.312679 

Hs.83942 

Hs.202613 

Hs.3844 

Hs.227591 

Hs.49163 

Hs.62354 



ESTs, Weakly similar to AF191020 1 E2IG5
RAB23, member RAS oncogene family
ESTs, Moderately similar to ALU8_HUMAN A
ESTs 

splicing factor (CC1.3) 
dystrophin (muscular dystrophy, Duchenne 
isoleucine-tRNA synthetase 
KIAA0851 protein; suppressor of actin 1 
Homo sapiens cDNA FLJ11968 fis, clone HE
Homo sapiens cDNA FLJ12566 fis, clone NT
Homo sapiens cDNA: FLJ22139 fis, clone H
ESTs, Weakly similar to A49019 dynein he
cathepsin K (pycnodysostosis)
KIAA0306 protein 

ESTs, Weakly similar to TRHY_HUMAN TRICH 
LIM domain only 4 
hypothetical protein FLJ11088

M I ii t I 
cell division cycle 4-like
Homo sapiens, Similar to RIKEN cDNA 1700 



Hs.117077 zinc finger protein 26
Hs.201282 ESTs 



427196 
439776 
417727 
410853 
411962 



R52795 

AI925136 

BE148769 

AI183829 

AF054815 

AA868688 

AW206942 



Hs.118258 
Hs.191593 
Hs.176005
Hs.82501 
Hs.30469 

Hs.25954 
Hs.55150 
Hs.334477 
Hs.202111 
Hs.137548 
Hs.93102 
Hs.253594 



gb:zk85d12.r1 Soares_pregnant_uterus_N 



NM_002430 Hs.268515 

Z29572 Hs.2556 

S73265 Hs.1473 

AA805417 Hs.64753 

AA215295 Hs.77578 
AI187366 

AA436187 Hs.172631 

AA172106 Hs.110950



AI670057 

R61448 

AI362520 

AW021726 

AA631977 

AI807320 

AW964490 



454042 
407755 
447183 
437446 



AA116021
AI554733 
AA788946 



Hs.238936 
Hs.199882 
Hs.106728



Hs.227630 
Hs.32241 
Hs.31720 

Hs.38260 
Hs.173182 
Hs.101302 
Hs.302743 
Hs.99824



meningioma (disrupted in balanced transl 
tumor necrosis factor re( 
gastrin-releasing peptide 



Rag C protein 
ESTs 

ESTs, Weakly s 



ESTs 



nilar to (defline not a»a 



ESTs, Weakly similar to KIAA1353 protein

histone deacetylase 3 

gb:df27e02.y1 Morton Fetal Cochlea Homo 

KIAA0643 protein 

RE1-silencing transcription factor 

ESTs, Weakly similar to S65557 alpha-1 C- 

hephaestin 

hypothetical protein FLJ20093 
ubiquitin specific protease 18

ESTs, Moderately similar to CA1 C RAT COL 
ESTs 

BCE-1 protein 

ESTs, Weakly similar to putative p150 [H 
AI075345
AI912410 
AB007972 
AW085690 



451475 T19093 



430253 
414183 
433194 
453915 
407725 
437412 
440937 
449057 
446126 
407204 
419145 
418757 
430000 
437296 
441381 
457250 
422900 
442787 
430589 



417308 
409799 
429160 
452679 

444034 
451518 
435702 



408089 
453686 
426167 
452195 



409245 
414290 
400294 
429819 
448873 
428471 
436288 
433376 
416051 
453468 



ESTs 

Homo sapiens cDNA FLJ12749 fis, clone NT
myosin phosphatase, target subunit 2 
:::T: ■.Vc,--:I, s ni'.irloZIDO human 7!T. 
hypothetical protein FLJ20274
:!:iU Ir.-ccriunal rg enzyme Z2f> tRADS h 
Homo sapiens, Similar to zinc finger pro
H 1 ; 1 ■ rRIIA CON.. LHI-Zp5 .y.,1 

KIAA0725 protein 

ESTs, Weakly similar to A43932 mucin 2 p



AI188803 
Hs.250723
Hs.246315 
Hs.90061 
Hs.322710 



Homo sapiens mRNA; cDNA DKFZp547C136 (fr
KIAA0810 protein
KIAA1363 protein 

pleckstrin homology domain interacting p 
ESTs, Weakly similar to ALU1_HUMAN ALU S 
hypothetical protein MGC5338

ESTs 



AW663083 
Z42387 
AA235108 
AL161957 



Hs.10177 
Hs.174918 
Hs.121001 



hypothetical protein MGC2747 
SEC24 (S. cerevisiae) related gene famil
hypothetical protein DKFZp434H247
Homo sapiens cDNA: FLJ22547 fis, clone H
pyruvate dehydrogenase kinase, isoenzyme
hypothetical protein FLJ20375; KIAA1797
TRAF and TNF receptor-associated protein
N-deacetylase/N-sulfotransferase (hepara
NS1-binding protein 
GCN1 (general control of 



Hs.24831 ESTs 



gb:te54e10.x1 Soares_NFL_T_GBC_S1 Homo s 



Hs.24095 ESTs 






WO 03/042661 PCT/US02/36810 






407910 
436005 
449458 

411118 

416311 
433068 
429272 



415788 
448481 
410491 
443441 



AI805078 
AW293413 
N27944 
AW237014 



Hs.292645 
Hs.41296 
Hs.158126 
Hs.208261 
Hs.132906 
Hs.221476
Hs.315369 



ESTs, Moderately similar to I38022 hypot
ESTs, Moderately similar to ALU8_HUMAN A
ESTs, Weakly similar to ALU5_HUMAN ALU S
fibronectin leucine rich transmembrane p
Homo sapiens cDNA FLJ13350 fis, clone OV
ESTs 

19A24 protein 

ESTs, Weakly similar to AF108460 1 ubinu 
Homo sapiens cDNA: FLJ23075 fis, clone L 
gb:HUM081H05B Human fetal brain (TFujiwa 




AW628686 

W15284 

AA465131 

AA315703 
AW972724 
AA741024 



AW972771 
AL044949 
N71463 



Hs.215658 
Hs.78851 
Hs.74832 

Hs.92195 



Hs.88378
Hs.122834 
Hs.144924 
Hs.171096 
Hs.12496 
Hs.26163 

Hs.292471 
Hs.116298
Hs.118888



cDNA FLJ11654 fis, clone HE



ESTs, Weakly similar to I54374 gene NF2
ESTs, Moderately similar to ZN91_HUMAN Z
KIAA0217 protein
ESTs

Homo sapiens clone 25218 mRNA sequence



ESTs, Weakly similar to B Chain B, Solut 
Homo sapiens EST from clone DKFZp434A041 
ESTs, Weakly similar to ALU4_HUMAN ALU S
KIAA0649 gene product 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S
ESTs

ESTs, Weakly similar to ALU1_HUMAN ALU S
AA772418 AI033892 AA100926 AU154749 AI459432 AI423513 AI094597 AA740817 AI991988 AI090262 AI312104 BI256707 AA459522 AA416871
AI075239 AI339996 AA701623 AI139549 AI336880 AA633648 AI989380 A1362835 AA399239 AI146955 BF514270 N92892 AI348243 AI278887 
AA459292 AI494230 BF507531 AI492600 AA962596 AW613002 AA293140 M235549 BF108854 M954344 N49682 AI457100 AW5894C7 
AW300758 BE220715 BE22Q698 BE569091 BM009647 BF900351 AI537692 AI203723 AI857575 AA584410 AW371667 BM172363 
AK001125 AU120581 AU146612 AW301393 
AA130992 AW969537 AA503835 
H61453 M8 38765 AI693104 AA721 107 
AA961586 AI86373S AAS88325 

AV645808 AA701657 AW271273 A(796734 AI472316 AI017531 AI051 178 BF109096 AA548964 N83805 AA131648 AA156589 BE708349 
AW952494 D30877 AV684717 Z24837 F00167 BF576150 T63841 R78995 N87474 
AW292984 AA724304 AA074346 AA074368 

BF090249 AW954947 AW890487 AI305236 D60845 D50537 AA825429 W36294 AW890410 AW088235 BF740240 M448709 AI350279 AA8791 19 
AA319510 BE702077 BE699015 BE702046 AW901293 T99319 D81708 BF475488 D60383 D81751 BE699260 
AA534222AA632632 T81234 

BF963346 BI460763 BI599382 AI188089 AY055003 AW959185 AA187681 AI692282 AU153608 BI493898 BI493899 AI797349 BE550679 AI701137 
AA744561 AA082682 BE218816AW511501 BF447881 AI859101 BF059295 AI914038 AI346564 AI685003 A1676076 A1125336 AW469637 T10225 
AI831296 R53451 AI453440 AA983739 AW470873 AI348290BE857670 D55901 243908 T34429 F07305 AK074340 AL538118 BG201484AI334192 
T35535 AW176751 BI496132 BI496133 AA469961 H72176 BF326265 AI770016 AI693177 BF223634 BF963661 BF962265 BI034894 AI475851 
N69071 AI910707 BE219038 AI918036 BE670589 BM193515 AJ33B497 AA805525 AA487625 AA830336 AA749368 AW104323 AI628055 AI932332 
A1272788 AI536849 AW162414 AW151923 T23854 AA610763 AA912188 AW339028 239946 T10224 F03171 AI205478 AI638791 F02580 F01551 
AW207551 AA421030 BE246012 BI034937 BI035373 BF939581 H19984 BE867247 H92677 N55988 BF957332 H18615 R16442 BF956229 
BF961886 R12698 AA101186 AA365932 R53452 BF062714 BF959364 H41634 AW086187 H19985 BF054881 F02581 H18616 H41527 AW003446 



M677593M618150M557952 

AK024780 NMJ24549 AW440385 AW965502 BF477855 Al 149742 BE669424 BE349941 AI376717 AA977493 AW592564 M8841 11 AA865463 
N51840BF593777M044645AW512183 BF059034 AI984320 AA733139 AI884563 AI559573 AI421262 AI052068 AI085891 BI963100 AA479883 
A1271 696 AI53S839 AI370924 AI147781 AA535765 BI966896 AA877448 AW152259 AA938992 AA996022 AI800410 AA613303 M58B323 R53623 
AW571973 A1203770 BI494224 BI494225 D45504 AA705824 AI625183 AA437118 AA640325 AA235528 AA731411 AA626263 AI560519 AW071 1 14 
AW005691 AI055933 AA971813 AA334937 N58441 AA536020 AW874174 AL565960 AW607532 H44647 
BC017965 AW969075 AA279982 AA50451 1 AI219979 AA504595 AI245579 AA278181 BG485019 BI049312 

AK07415G AI401548 BE002083 BE002085 BF.002034BG113650 AW580909 AA632959 BG610861 BG436183 BF438017 AI568608 AI095503 
AI204208 AW244158 BE622614 N52349 BE042940 AIG95B58 AA9761 19 M835838 AI620854 AA456105 AA971569 AI246289 AI335636 M994082 
AI250060 AI095710AA605125 AA291572 BE463947 N52870 AA287553 Al 149093 AA581802 W31684 AA610159 BF247061 BG258954 BE17686G 
BE177041 AA332106 BE967020 

AW473675 AI190744 W69997 AW10491 3 AI221098 W69996 AA885487 AA861491 

BE972667 AW069569 AI392889 H85161 AI074962 M423935 BE671254 AA912627 H39B45 H69083 AA280259 N67746M1 55722 
BM423504 BE075913 AI492369 AI831729 AA807134 A1057292 BF059751 AA918209 BE858647 AW016707 AI675289 AI452916 BF593014 
AI480099 AI522283 A1628681 AI470402BF056350AW236830AA514206BF055812AI796201 BF055617 BF054827 AI288301 AA705763 
BC022413 BE395396 BF754175 AA506621 BE706665 BE706678 M723159 BE153169 BE706729 BE706558 BE153312 BE706706 AW371853 
AW371849 BE153241 BC017410AI337912AI090244AW090300BE219837AI623661 BE501576 BE501734 AI742232 AI023964 AI458424 
AA975373 AI288904 AI984583 AA890325 N32562 AI358102 AW241694 AI038448 AI672071 AI018389 AA576391 AA977874 AW189392 W37448 
AA612894 AI277548 H89551 AI699774 H89365 A\3 15805 AW579186 BC014584 BC014581 AW780125 AI672414 BE328145 AW600919 BF031306 
AW172758 BE708322 M345675 BE875779 H28241 H25318 BF540913 BG179688 BF110202 AA528775 W37573 BE041644 AW366504 BM129522 
BM129322 AI122760 BE718200 AW887496 AA149420 BE706307 BE539395 BE748765 AI373653 R75904 BF979185 BF691393 BG495595 
BI094458 BE706702 BG496559 BF248373 BG494800 

BC015150 BM471660 BM478941 BI759769 BI094332 BE005819 BE005834 BM023432 BE1 79013 BE675522 AI680275 AA515794 AA256503 
F01634 AF002990 BG288697 AI091615 AW002917 AI631085 AW 182864 BE501232 AI922188 BF838783 AI631078 BE464544 AI376986 AW503774 
AA424131 AW51 1849 AA872189 BF0B4885 AI432024 AI491908 BF197906 M580599 AI274418 AW269225 AW058288 AI61 1346 AA236200 
C00417 BE669699 AI468149 BM460137 AI6B5282 AI902714 BG990066 BF894391 T83393T84298 BF809721 BE925494 AV755948 BF928512 
BF894494 BF308642 AA256710 BF894347 BE928708 H05072 M199607 BI859300 AI983233 AI521122 BE168896 BE302846 X76770 BC000927 
NM.032632 BG707103 BG913857 BG613824 BG613257 BM313077 BI561132 BI461426 BE389811 BE391509 BE164646 AA746371 AV750611 
BE175813 AA332040 BG200997 BG485576 AL043706 AV716994 BF903156 AW820369 AU132824 BF941 136 BE54321 1 AA521432 AW837568 
BF809623 AW614670 AU129338 AW880782 AA371070 BG615427 BI667976 B1598706 BI599341 BE869166 BG502432 NM 020144 AF218840 
BI461106 BI562235 BI462594 BI458753 BF195561 BF832738 BE890146 AI678629 AL135238 AU129571 AW500045 AA382478 AW502738 
BI561031 BE280153 BF087226 BM480024 AW804395 BE707050 R72666 BF038881 BF895070 BG036594 BE551809 BF326901 BG286526 



AK057015AI026834 BE857936 M149091 AI742972 AW439172 AI253168 AA255613 BF513175 AI00500S T0340S AW338149AA836442 
M420530 R88566 AI61 1672 AA433916 AA442855 BF063008 AA812568 AI889706 AA715313 AA768539 AA767620 AA665471 AA404380 
AA553612 EFD55442 A.A70! 388 AIS53 >76 A ' •" . 7^43 AI141 '69 H7822) AW901352 H78221 BE701982 BF689273 A4397454 N33072 R60218 
AW968247H1483 ■ A\ft>m>".WKt3 k»M7C 13=739332 ^51827 A!4?4963 DC40457'. AA'-'-OCCC 0 "233*.S4 A1802210 BE000129 BF734513 
R419-4 1-21055 P3 r ^ I 1DF790215I vr EFC03337AA703138AA377348 W24822 

AK055051 A1971258 AI681134 AU146 ' :,<■> 3303 A •? :732! ■ VA923662 1! F89 306 8 AI304442 AI167464 AI234188 AA054272 AA829262 AI351910 

I 1 I 1 1 r r 1 ! 'I , ' 

BE349103AI9122S4C- i ! I J V4 AA418512 

BE771304 EF894539 m S5f?3 ,\B7?.V.I ■' I mw "M-Mfri F 1 729= Mil »,4C E -808607 3E674633 AV/00132 AA227789 AA253099 
AW751SSAA9354'£ T?431SF12556A/4.i2i23 -'o8MC W, A .7»706 AA101321 R41382 H14479 AA2S3044 R54810 

R42734R44804R41278 
AI525877AA489525AW088177 






AI018087AA244170M244355 



409010 10331J 

411962 2307710 1 

434982 121871 1 

432676 3503.22 

446019 658727.1 

454042 30254.1 



419145 248375.1 



AL047596 AA393792 AI670731 AI037957 AW874364 AI038137 N62286 AI241379 BE501096 AW090696 AI927369 AI669226 AI369437 AI371075 
AW612409 AI686711 AI183289 AA477717 AI075122 AA635190 M700984 AA781508 D81020 BF575223 AI356183 D79312 AI375558 H61111 
BG283489 BE090666 BE090664 BE090662 H26545 

AL575207 AL551714 BM014781 BG542863 BG771232 AA429722 AI37751 1 AI770155 M716665 BG003427 M810811 AA442760 AA128610 
AA059411 AI796263AI494075AI572127 AA420SS2 BF436083 AI648675 AA878813 BI488614 BG700886AA128609AV702879M731146 
AI580336AI373224AA919169 AI758175 AA976350 BG701414 BF057794 AW135598 AA062583 B1549631 AI185077 AA933879 AW024454 
AA193289 M045194 BG928396 BE856883 BF435859 AA195423 AW237471 R99289 D61992 BE856637 BF368270 AA194235 N51319 AA383499 



425481 
440638 
442495 



435154 

437908 1 
TABLE 33C: 



928718.1 
1280744.1 
1237041.1 



AA099050AA099526T47733 
AW975084 T90204 AA658177 
AI187366 AA618478 AA558869 
AI308202 AI307229 AA769348 

AI362520 D25917 AI670784 AI742347 AW269789 AI270700 AW610541 AW793036 AW793035 AW610540 AW362220 AW362166 AW362214 
AW362225 AW362228 AL1 19827 

AJ420458 AI018523 AA708686 BF949633 AL1 19553 BF945960 AI081305 AA041432 AI921013 AI684910 AI654847 AW874199 AI206120 
AW241428 R43035 T66767 AW103715 W28478 BF953052 H45926 BF807568 AW903943 BE170143 BI040435 BF931989 BI600000 AV722350 
W27787 H45331 BI549761 R53955 BI549855 BG991583 BI491075AW020049 AW129293 H45263 AA410309 AA340613 R42410AA707199 
A1431587 BE858679 AW2922S7 A1421678 A^041 195 BE466753 AI243913 AI358894 AW137298 AI366468 N64350 AA779107 AW025969 R49056 
AA347011 R55722AW771106F04969 Z38381 F01659 H17396 BI493714 AI880103 AW771447 AI202561 M788851 AI494436 BF856114 H22570 
AI076345 AI887648 A<\572691 

BF946219 BF946218 BF8S1494 AL536879 AA457150 AI53D194 AI582629 AA464515 AA916242 AA337109 AA336509 N46906 AA336322 
M336407 AA337222 AA319240 BI026817 BI027058 AL536880 AI693827 AA651730 AI701013 BM068789 AW339506 AA293021 BF891108 
AI458885 AW361203 AW974652 AI761251 AI655763 AA628063 BE047125 AW085916 AI129587 N52070 AW172361 AA052951 AWC85909 
AI000008AA962570AI371 4 Al * 120 A4 4 51 4 AI962506 AI824603AW376300AA058439AW361192AV65666ON50282BF820514BF891008 
H40784 BF891112 BE708029 AW043567 AA056762 

BM456602 AV706711 BF379357 H90994 AA234435 AA558020 BF351723 AA328271 R94815 N99638 BG223375 AW973750 N59599 
AF030234 BC017465 BG008526 AW505550 BM4S0141 N47324 AA361037 AA321632 N45606 AV752798 AV657116 AA296632AU 137857 
AW467027 AI742080 AI624350 H58206 AA478518 AW439997 AW393555 AW393523 AI559753 AIB08732 R66856 H01374 BI257369 BI259830 
AW960845 BM466252 AW956813 BE768647 AV658853 BM055248 BF372070 BF372055 BF372061 M347852 AA905863 BG505078 AV654024 
BF093291 AW021929 H22650 AA459715 BG496341 BE697763 BI254209 BG499543 H42946 BI059780 BI086741 H87896 H87599 BF691752 
BE76851 1 BG940948 W37195 BF372041 BE883796 BF372082 BF3G7329 QFS09744 AW966003 AV714014 BI492868 BI495144 AA921845 
AI693426 AI652147 AI435449 N47325 AI434429 AA573137 AI183429 A1829952 AI332526 BF513937 AM89561 AI221962 AI378034 AW1 18897 
AW665247 AW340077 N41605 AA478519 AA463875 AI858260 AA463379 AI292305 BE045947 AA971089 AI125820 BG940947 AI080245 
AA884954 AI125702 AI382934 AA931835 AI3G8631 AW439905 AI027833 AI399648 AI014533 AA347851 M738261 NS7374 N69081 AI763667 
AA948472 AI819214 AA293133 AI186725 AAB89214 AI222635 BI495143 N29605 N48812 AA769041 AI492769 D56771 AA095911 BF222062 
D56772AW372265BM054985 D12465 BG534562 AW00351 1 H87486H42880 AW190293 BF594697 BF377611 H22043 BI255749 BI492848 
H16217 H21980 H22651 H88179 H87354 H44052 H25165 H44128 

U52054 AL581000 AA156850 AW293839 BI335365 AA024963 BF149420 BF073977 AW602574 BF164012 BF163992 BF163974 AW402161 
BM194134 AW966609 W84374 BF916380 AA385173 W84365 AA383743 BF903598 AA043776 W84421 AA778446 AW444904 BF446960 
AA837481 AV755539AW468444AW458002M811830AA581806A:866686 AI572124AA687333D20160AA812489 AU185248AU186004 
AA156781 AI536733 BM144850 AI471333 AA040926 BF507639 AA043777 AW874142 BF832523 BF163972 BI022546 BI021204 
AI000341 AI766341 AW873274 
AV704062 BE1C2284 AI03294G BF360636 
AI884781 AI652306 AI651694AI638744 AI962493 
M18728 

BG116781 BI914326BI030196 N58885 N63406AV683374 N58892 BG110501 AA333708 AA359583 AW963123N95562 N95696 N95587 

AK074291 AW293424 BE676135 AI832125 BE019146BE465019 AI761124 M617778 AI279232 AW575897 AI672039 F28618 BF924261 

AA722184 BF934174 BE004328 AV749301 BE880282 BI01 9798 BI019389 BF928776 AW813409 AV726604 AA077560 BE272975 BF949119 

AW81419S BE87912G AIG97926 BF594155 BE205787 BF063513 N35826 AI948557 AI433839 AI379679 BG056182 AI589094 N23123AA588805 

AW316581 AI080272 AI421980 AI493318 BF194830 N87590 AA495993 N32996 AA699844 H96845 H9B592 N28741 BI035539 BF747723 

BF171066 W01350 H05495 AI243785 Z39622 AA887432 A1350659 R46102 

AL520496 AW978162 AIS10475 AI688990 AW470054 AA609426 AI167391 AA815231 AA358241 

BG009500 AI376551 AA897445 T87714 

AI18471 7 AW518883 AF121 173 

M179446 AA357794 D81719 D80529 C14833 

AW972724 AA877998 AA522631 AU185388 

AW972063 AA668764 AA804491 AW665688 AA765069 

AI740586 M771806 BE500996 AW204531 AI082424 AI033879 BF093176 AA771764 D38676 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey     Ref       Strand   Nt_position
401403   7710966   Plus     146180-146294
406387   9256180   Plus     116229-116371,117512-117651
405268   4156151   Minus    24404-24521
406122   9144087   Minus    30940-31386
402550   7652009   Minus    80413-80673
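The Nt_position column of Table 33C packs one or more predicted exon ranges into a single comma-separated string. The following is a minimal sketch of how such a row might be read programmatically, assuming each row is whitespace-separated in the order Pkey, Ref, Strand, Nt_position, and assuming the ranges are inclusive at both ends. The names `ExonPrediction`, `parse_row`, and `total_exon_length` are hypothetical and are not part of the disclosure.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ExonPrediction:
    pkey: int                      # Eos probeset identifier
    ref: int                       # Genbank Identifier (GI) number
    strand: str                    # "Plus" or "Minus"
    exons: List[Tuple[int, int]]   # (start, end) nucleotide positions


def parse_row(line: str) -> ExonPrediction:
    """Parse one whitespace-separated Table 33C row (assumed layout)."""
    pkey, ref, strand, positions = line.split()
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand: {strand!r}")
    exons = []
    for span in positions.split(","):
        start, end = span.split("-")
        exons.append((int(start), int(end)))
    return ExonPrediction(int(pkey), int(ref), strand, exons)


def total_exon_length(pred: ExonPrediction) -> int:
    """Sum of exon lengths, assuming both endpoints are inclusive."""
    return sum(end - start + 1 for start, end in pred.exons)


row = parse_row("406387 9256180 Plus 116229-116371,117512-117651")
print(row.exons)
print(total_exon_length(row))
```

Under the inclusive-endpoint assumption, the two exons of probeset 406387 span 143 and 140 nucleotides respectively.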



es upregulated in idiopathic pulmonary fibrosis rel 
UnigeneID: Unigene number
Unigene Title: Unigene gene title 
R1: 90th percentile of 



Pkey ExAccn 
405443 

418007 M13509 

446619 AU076643 

422511 AU076442 

406964 M21305 

425259 AL049280 

407244 M10014 

421823 N40850 

419875 AA853410 

409542 AA503020 

418310 AA814100 



408000 
447033 
420185 
420195 
426682 
408221 
417079 
419216 
422163 
422404 
445745 



430712 
453111 
451099 
414290 
417801 



411880 
401645 
401673 



NM_013989

W74001 

AF119903 

L11690 

AI357412 

AL044056 

N44348 



UnigeneID UnigeneTitle



Hs.83169 
Hs.313 
Hs.117938 

Hs.155397

Hs.28625 
Hs.93557 
Hs.36563 
Hs.86693 

Hs.82419 



secreted phosphoprotein 1 (osteopontin,



Homo sapiens mRNA; cDNA DKFZp564K143 (fr
fibrinogen, gamma polypeptide 



AU076718 
AF027208 
AL133571



AL047586 

U95031 

R57171 



418663 
412622 
424012 



402333 
429609 
454078 



AK001100 
AW664708 
AW368377 
AI017208 



Hs.261385 

Hs.2056 

Hs.81134 

Hs.112360
Hs.336189 
Hs.13245 
Hs.85050 
Hs.163443 



hypothetical protein FLJ22418
ESTs 

ESTs, Weakly similar to S72482 hypotheti 
ESTs 

deiodinase, iodothyronine, type II 
serine (or cysteine) proteii - L j 
hypothetical protein PR02834 
bullous pemphigoid antigen 1 (230/240kD) 
Predicted gene: Eos cloned; secreted w/V 
ESTs 



X54162 Hs.79386 

AF115402 Hs.11713 
AW044647 

AB014598 Hs.31720 

R52795 Hs.25954 

AI568801 Hs.71721 

AA417383 Hs.82582 

AW961284 Hs.203838 

AL044191 Hs.23388 

AW452355 Hs.256037 

X99133 Hs.204238 

AW503785 Hs.73792 

NM_002514 Hs.235935
AW872477 



small inducible cytokine subfamily B (Cy 
prominin (mouse)-like 1 

Homo sapiens mRNA; cDNA DKFZp434F1135 (f

KIAA0455 gene product

phospholamban

intron of periostin (OSF-2os)

ESTs 

Down syndrome critical region gene 1-lik 
ESTs 

ESTs, Weakly similar to ALU8_HUMAN ALU S 
leiomodin 1, smooth muscle (LMOD1) (Thy 
E74-like factor 5 (et- J — ■ -- 



integrin, beta-like 1 (with EGF-like rep
ESTs 

hypothetical protein DKFZp434F0318 
ESTs 

lipocalin 2 (oncogene 24p3) (NGAL) 
complement component (3d/Epstein Barr vi 
nephroblastoma overexpressed gene 
gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens 
C1 6001440":gi| 1 2330704|gb|AAG52890. 1 |AF3 
C16001416*:gi|12743112|ref|XP 010131 2j 
similar to S68401 (cattle) glucose indue 
ESTs 

collagen, type XIV, alpha 1 (undulin) 
RNA binding motif protein 8B 
mucin 5, subtype B, tracheobronchial 
calsequestrin 2 (cardiac muscle)
ectonucleoside triphosphate diphosphohyd
C4001445:gi|12697999|dbi|BAB21818.1|(AB 



Hs.208358 
Hs.41690 
Hs.171959 
Hs.137569 
Hs.131149 

Hs.103042
Hs.2164 
Hs.135104 



Hs.87409 thrombospondin " 



tumor protein 63 kDa with strong homolog
ESTs 

NM_002275*:Homo sapiens keratin 15 (KRT1 
microtubule-associated protein 1B 
pro-platelet basic protein (includes pla 



secreted modular calcium-binding protein 



gb:an03c03.x1 Stratagene schizo brain S1 
Homo sapiens cDNA FLJ14814 fis, clone NT
phosphatidic acid phosphatase type 2B






422363 


T55979 




replication factor C (activator 1) 3 (33 
ESTs 


2.30 


2.30 


456536 


AW135986 


Hs.257859 


2.28 


22S 




M423849 


Hs.79530 


M5-14 protein 


2.27 


1.86 


456936 


M81349 


Hs.1955 


serum amyloid A4, constitutive 


2.25 


2.16 


417728 


AW138437 


Hs.24790 


KIAA1573 protein 


2.25 


1.37


453070 


AK001465 


Hs.31575 


SEC63, endoplasmic reticulum translocon 


2.24 


2.« 


409159 


AW673312 


Hs.50848 


hypothetical protein FLJ20331 


2.24 


2.24 


404942 






splicing factor, arginine/serine-rich 9


2.24 


2.64 


410286 


AI739159 


Hs.61898 


DKFZP586N2124 protein


2.24 


2.46 


440516 


S42303 


Hs.161 


cadherin 2, type 1, N-cadherin (neuronal


2.24 


1.94 


421574 


AJ000152 


Hs.105924 


defensin, beta 2


2.23 


1.36 


418005 


AI186220 


Hs.83164 


collagen, type XV, alpha 1 


2.22 


1.37 


421948 


L42583 


Hs.334309 


keratin 6A 


2.20 


2.20 


407788 


BE514982 


Hs.38991 


S100 calcium-binding protein A2 


2.19 


1.61 


417563 






gb:zx52a10.r1Soares_feialjiverj;ptesn 


2.18 


240 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


2.16 


246 


447333 


BEO9058O 


HsJ0704 


hypothetical protein dJ616B8.3 


2.16 


200 


455797 


BE091833 




gb:IL2-BT0731-260400 076-F04 BT0731 Homo 






414987 


AA524394 


Hs.294022 


hypothetical protein FLJ14950


2.16 


1.99 


40 62 






NM_001615*:Homo sapiens actin, gamma 2,


2.16 






W68815 


Hs.301885 


Homo sapiens cDNA FLJ11346 fis, clone PL


2.15 




407137 


T97307 




gb:ye53h05.s1 Soares fetal liver spleen


2.15 


2.24 


402641 






C1002296:gi|6677817|ref|NP_0331 26.1 1 rep 


2.14 


214 


418236 


AW994005 


Hs.337534 


ESTs 


2.14 


2.14 


413059 


BE151498 




gb:RC0-HT0295-291199-031-E11 HT0295 Homo 


2.14 


2.14 


432437 


W07088 


Hs.293685 




2.14 


2.14 


428398 




Hs.98558 


ESTs 




2.14 


428336 


AA503115 


Hs.183752 


microseminoprotein, beta- 


2.12 


1.43 


421853 


AL117472


Hs.108924 


SH3-domain protein 5 (ponsin) 


2.12 


1.93 


436391 


AJ227892




ESTs 


2.12 


2.12 


417430 


AA984546 




gb:am88e0as1 Stratagene schizo brain S1 


2.11 


2.17 


407443 


AF227138 




gb:Homo sapiens candidate taste receptor 


2.11 


236 


428434 


AW363590 


Hs.65551 


Homo sapiens, Similar to DNA segment, Ch 


2.10 


1.29 




D49372 


Hs.54460 


m llinducibl cy kino ibfoi il VL, 


2.10 


2.10 


456614 


AV653110 


Hs.106650


hypothetical protein FLJ20533


2.13 


2.00 


■140273 


AI805392 


Hs.325335 


Homo sapiens cDNA: FLJ23523 fis, clone L


2.10 




450271 


AI693900 


Hs.200920 


ESTs 


2.09 


2.34 


432222 


AI204995 




gb:an03c03.x1 Stratagene schizo brain S1


2.09 




458208 


AI380016 




ESTs, Weakly similar to T4S4 HUMAN TRANS 


2.08 




405600 






C1 2001673:gi|9631 264|ref]N PJ48045. 1 1 or 


2.07 




434654 


AI825942 


Hs.139366


Homo sapiens clone L5 polyadenylated HER


2.07 


2.52 


439261 


AM26020 


Hs.145674 


basic transcription factor 3 


2.05 


1.45 


421515 


Y11339 


Hs.105352


GalNAc alpha-2, 6-sialyltransferase 1, 1


2.04 


2.04 


457741 


BE044740 




gb:hm55g10.x1 NCI CGAP_RDF1 Homo sapiens 


2.04 


2.04


423161 


AL049227 


Hs.124776


downstream of cadherin 6 (by 3.3kb) 
ESTs 




1.33 


412505 


AA974491 


Hs.21734 


2.02 


2.02 


443180 


R15875 


Hs.258576 


claudin 12 


2.02 


2.02 


431605 


AW972407 




gb:EST384498 MAGE resequences, MAGL Homo 


2.02 


2.02 


415938 


BE383507 


Hs.78921 


A kinase (PRKA) anchor protein 1 


2.02


2.17 


452571 


W31518 


Hs.34665 


ESTs 


2.02 


2.03 








Target Exon 


2.01 


2.52


439343 


AF086161 


Hs.114611 


hypothetical protein FLJ11808


2.01




402327 






Target Exon 


2.00


2.44 


418736 


AI796317 


Hs.203594 


Homo sapiens uncharacterized gastric pro 


2.00 


2.00 




AA256641 


Hs.236894 


ESTs, Highly similar to S02392 alpha-2-m 


2.00


2.00 


438634 


AW340400 


Hs.126728


ESTs 


1.99


2.43 


416127 


N49843 


Hs.79022


GTP-binding protein overexpressed in ske 


1.97 


1.41 


423961 


D13666 


Hs.136348 


pc lii C 2c 


1.96


1.48 


430397 


AI924533 


Hs.105607 




1.96 


1.31 




AW813339 




gb:MR3-ST0192-101299-013-c05 ST0192 Homo 


1.96 


2.73 


439628 


W81007 


Hs.58628 


ESTs 


1.96 


1.28 


444301 


AK000136 


Hs.10760 


asporin (LRR class 1) 


1.96


1.58 


431726 


NM_015361 


Hs.268053 


KIAA0029 protein 


1.95 


1.72 


410418 


D31382 


Hs.63325 


transmembrane protease, serine 4


1.95


1.87 


452814 


AI092790 


Hs.334703 


hypothetical protein FLJ14529


1.95 


1.06 


417562 


AW888754 


Hs.134126 


crystallin, gamma S


1.95 


2.14 


424480 


AA341442 


Hs.205299 


ESTs 


1.94


1.S4 


404342 






C7002192*:gi|7299207|gb|AAF54404.1|(AE0


1.92 


1.32


443320 


AI051607 


Hs.16335 


ESTs 


1.91 


2.18 


449780 


AA443241 




ribosomal protein L44 


1.90 


1.76 


423337 


NM_004655 


Hs.127337 


axin 2 (conductin, axil) 


1.89


2.16






Hs.59498


cell division cycle 2-like 5 (cholineste






457505 


AL044659 




ESTs 


1.89 


2.34 


425912 




Hs.162189 


serine/threonine kinase with Dbl- and pi 


1.86 


1.26 


413585 


AI133452 


Hs.75431 


fibrinogen, gamma polypeptide 


1.88 


1.88 


428231 


U17989


Hs.183105 


nuclear autoantigen 


1.88 


1.88 




AA578229 


Hs.324239 


ESTs, Moderately similar to ZN91_HUMAN Z


1.86 


1.68


404429 






Target Exon 




2.18


447644 


AW861622 


Hs.108646 


Homo sapiens cDNA FLJ14934 fis, clone PL


1.88 


3.04 


406641 


AJ235667 




gb:Homo sapiens mRNA for immunoglobulin 


1.86 


2.57


417059 


AL037672 


Hs.81071 


extracellular matrix protein 1


1.86 


1.48 


454565 


BE141231 




gb:MR0-HT0075-081199-003-a09 HT0075 Homo


1.86 


1.21 


415115 


AA214228




hypothetical protein 


1.85 


1.23 






432306 Y18207 Hs.303090 protein phosphatase 1, regulatory (inhib



414085 AA114016 Hs.75746 
403344 

447245 AK001713 



406461 
455657 
400609 



422867 L32137 

417412 X16896 

426521 AF161445 

423915 AF039018 



Hs.282804 
Hs.1584 
Hs.82112 

Hs.211092
Hs.135281 
Hs.58561 



412429 AV650262 



452272 AW292249 

418205 L21715 

400425 AY004252 

400419 AF084545 



Hs.169191 
Hs.227750 
Hs.13503 
Hs.252739 



Hs.157289
Hs.29206 
Hs.272227 



aldehyde dehydrogenase 1 family, member 
NM_000341:Homo sapiens solute carrier fa
hypothetical protein FLJ10851 
deafness, autosomal dominant 5 
Target Exon 

Homo sapiens cDNA: FLJ23435 fis, clone H
hypothetical protein, clone 24751 
gb:RC1-BT0314-310300-015-b12 BT0314 Homo
C10001147:gi|12698926|gb|AAK01739.1|AF33

hypothetical protein FLJ22704

cartilage oligomeric matrix protein (pse
interleukin 1 receptor, type I
hypothetical protein 

LUNX protein; PLUNC (palate lung and nas 
alpha-actinin-2-associated LIM protein 
G protein-coupled receptor 87 
gb:IL-BT198-010499-007 BT198 Homo sapien
GRO2 oncogene
Target Exon 

HE4; epididymis-specific, whey-acidic pr 

gb:QV4-TT0008-161199-033-d09 TT0008 Homo

BAI1-associated protein 3

small inducible cytokine subfamily A (Cy 

Target CAT 

inactivation escape 2

hypothetical protein DKFZp434P0316 

troponin I, skeletal, fast 

PR domain containing 12 



451027 AW519204 Hs.40803 

413524 BE145837 

409099 AK000725 Hs.50579 



450569 AW192334 

450912 AW939251 

445261 T79759 

454231 AW450669 



456596 AA291834 Hs.78950 
418387 R18086 Hs.22279 



AA296874 
U38945 
AB021179 
AA203637 



Homo sapiens, Similar to RIKEN cDNA 2810
gb:MRO-HT0208-101299-202-c07 HT0208 Homo 
hypothetical protein FLJ20718
C22000151:gi|6806921|ref|NP_004165.1| so
CX001015:gi|11322384|emb|CAC16687.1| (AJ
ENSP00000243337*:CDNA FLJ13984 fis, clon



v-fos FBJ murine os 
ESTs, Weakly similar to I38022 hypotheti
hypothetical protein DKFZp434l143 
hexabrachion (tenascin C, cytotactin)
gb:EST178498 Colon carcinoma (HCC) cell 
gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su
ESTs 

Target Exon 

branched chain keto acid dehydrogenase E 
gb:EST95683 Testis I Homo sapiens cDNA 5 
gb:yg16b12.r1 Soares infant brain 1NIB H 
Homo sapiens, Similar to RIKEN cDNA 5830
C1001634:gi|12621136|ref|NP_075245.1| Ba
deoxyguanosine kinase 
cyclin-dependent kinase inhibitor 2A (me 



AA446539 Hs.339024 




Hs.296506 Homo sapiens mRNA full length insert cDN 
Hs.231967 ALL1 fused gene from 5q31

ENSP00000224716*:GTP-binding protein SAF 



ESTs 

collagen, type VII, alpha 1 (epidermolys
myopodin 

S100 calcium-binding protein P 

ESTs, Weakly similar to T09052 hypothet

nidogen (enactin)



Hs.79385 
Hs.60435 
014717 Hs.108884 



Human clone 23574 mRNA sequence 
ESTs 

KIAA0390 gene product 






444381 BE387335 Hs.283713 
442802 AL133035 Hs.8728 
442424 AI342715 Hs.129569 



hypothetical protein BC014245 
hypothetical protein DKFZp434G171 
ESTs, Moderately similar to B34087 hy| 
Target Exon 



AA773187 Hs.294027

AA033627 Hs.21858 

AW630023 Hs.76893 

AA827569 Hs.153 

AA939347 Hs.127223 

NM_001972 Hs.99863 

NM_002206 Hs.74369 integrin, alpha 7

W37862 Hs.274368 MSTP032 protein 



no sapiens Rab geranylgeran 



AW009886 Hs.118258

BE018682 Hs.166196 

AA088500 Hs.170298 

BE257672 Hs.42949 

D17408 Hs.21223 

AW894667 Hs.22660 

AA410183 Hs.137475 

D60541 Hs.285519 

AA884208 Hs.30484 

M29540 Hs.220529 

J06070 Hs.151738 

AI928445 Hs.92254 

AB029000 Hs.70823 

AF017986 Hs.31386 

X57348 Hs.184510

X81479 Hs.2375 

AA295052 Hs.38516



AI470451 Hs.99075 
AW903838 Hs.81800 
NM_003816 Hs.2442



AA019290 Hs.110489

AF001548 Hs.78344 

AI702223 Hs.107253
T51823 

M73720 Hs.646 

T92363 Hs.178703 

W72424 Hs.112405

AA420450 Hs.292911 

AI400746 Hs.62187 



ATPase, Class I, type 8B, member 1 
ESTs 

hypothetical protein HES6 
calponin 1, basic, smooth muscle 
chimerin (chimaerin) 1 
ESTs 

Homo sapiens cDNA FLJ11904 fis, clone HE
ESTs 

carcinoembryonic antigen-related cell ad

matrix metalloproteinase 9 (gelatinase B 

synaptotagmin-like 2 

KIAA1077 protein 

secreted frizzled-related protein 2 

stratifin

egf-like module containing, mucin-like,
Homo sapiens, clone MGC15887, mRNA, com 
fatty acid binding protein 4, adipocyte 
Homo sapiens cDNA: FLJ22712 fis, clone H
Homo sapiens cDNA FLJ14459 fis, clone HE 
nucleoside phosphorylase 




i mRNA from cd34 stem ce 



Target Exon 

myosin, heavy polypeptide 11, smooth m
hypothetical protein DKFZp761F241 



S100 calcium-binding protein A9 (calgran 
Plakophilin 

phosphatidylinositol glycan, class K 



FAT tumor suppressor (Drosophila) homolo
ESTs 

myosin, light polypeptide kinase 
ESTs 

KIAA0182 protein 

Homo sapiens cDNA FLJ11489 fis, clone HE
ESTs, Weakly similar to ALU1_HUMAN ALU S 
• -J12572 



U48959 Hs.211582 
AW450544 Hs.220751 
D80004 Hs.75909 



AB001325 Hs.234642 
M73700 Hs.105938 

opioid receptor, kappa 1 
AA412442 Hs.98132 ESTs 

AK001092 Hs.302480 Homo sapiens cDNA FLJ10230 fis, clone HE
NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo
vaccinia related kinase 2 



AF086120 
AA007629 
AA774445 



BE313241 
AL048678 



457734 BE394365 



glycerol-3-phosphate dehydrogenase 1 (so 
ESTs, Weakly similar to KIAA1397 protein 
hypothetical protein FLJ10134
insulin-like growth factor binding prote 
hypothetical protein 

ENSP00000252105*:CDNA FLJ12240 fis, clon
gb:601151545F1 NIH_MGC_19 Homo sapiens c 
C19001982:gi|3043638|dbj|BAA2548ai| (AB
H.sapiens mRNA for 3'UTR of unknown prot
hypothetical protein FLJ11526 



429295 AA682377 Hs.99216 ESTs, Moderately similar to ALU8_HUMAN A 






430920 U96402 

409368 AA071059 

431958 X63629 

427719 AI393122 



AI123195 
AW604447 
AW468397 






Hs.248132

gb:zm66a10.:1 Stratagene neuroepithelium
Hs.2877 cadherin 3, type 1, P-cadherin (placenta
Hs.134726 ESTs 

ESTs 

gb:CM2-HT0323-171199-033-a03 HT0323 Homo 
gb:oo17a10.x1 Soares_NSF_F8_9W_OT_PA_P_S
Hs.339403 ESTs, Weakly similar to S26689 hypotheti 
Hs.100000 S100 calcium-binding protein A8 (calgran 
metallothionein 1G 



452316 
413048 
457462 
452679 



426125 
418742 



Hs.187884 ESTs 
Hs.296098 ESTs 



L22524 

AW451197 

R63267 

AW192518 

BE296174 



ESTs, Moderately similar to G736_HUMAN P
mannose receptor, C type 1 
Homo sapiens mRNA; cDNA DKFZp434J2235 (f 
transmembrane, prostate androgen induced 
Target Exon 

gb:nh26e06.s1 NCI_CGAP_Pr3 Homo sapiens 
Homo sapiens cDNA FLJ11041 fis, clone PL
FAT tumor suppressor (Drosophila) homolo
matrix metalloproteinase 7 (matrilysin,



Hs.28399 ESTs



AF207664 
AW834536 
AF076612 



Hs.24192 
Hs.8230 
Hs.258549 
Hs.166186 



gb:xt45h08.x1 NCI_CGAP_Pan1 Homo sapiens
hypothetical protein FLJ13102 
uroplakin 1B 

gb:yv68g06.s1 Soares fetal liver spleen 
Homo sapiens cDNA FLJ20767 fis, clone CO
a disintegrin-like and metalloprotease (
gb:MR2 110014 241133 C12 tOG TT0014 Homo 



Homo sapiens clone 23928 mRNA sequence 
NM_002463*:Homo sapiens myxovirus (influ



AA152106
AW578420 
AA448787 



AU077196 Hs.8291 
Z75331 Hs.821: 
AW450572 Hs.257316 ESTs 



collagen, type V, alpha 2 



455885 BE153524 
405650 

451385 AA017656 

424925 NM_002432

431022 AA490815 

439781 AA845538 

429379 NM_014840

435310 AA705075 

430702 U56979 

451331 AK002039 

459198 AI0BB347 



gb:FM0-HT0339-241199-002-C03 HT0339 Homo
C7001981*:gi|565157|gb|AAB31881.1|T-cel

gb:ze39h01.r1 So 

myeloid cell nude 



glial cells missing (Drosophila) homolog 
KIAA0537 gene product 
Rhesus blood group-associated glycoprote 
H factor 1 (complement) 
Homo sapiens cDNA FLJ11177 fis, clone PL
ESTs 

epithelial membrane protein 1 



418211 BE244746 Hs.247474 

437158 AW090198 

427373 AB007972 Hs.130760

433911 AI923092 Hs.8899

402504 

409465 AW393810 Hs.78054 

449426 T92251 Hs.198882 
405491 

442410 
407701 
400818 
406475 

426935 NM_000088 Hs.172928 



hypothetical protein FLJ21032
KIAA1150 protein

myosin phosphatase, target subunit 2 



M18728 

AW996503 Hs.197680 
AW375009 Hs.164407 



447918 
421314 
412992 
401025 



Hs.10587 
Hs.115175
Hs.180324



428415 
407242 
410741 



M97639 
AA337211 



ESTs 

Target Exon 

C15000508*:gi|2558825|gb|AAC53387.1|(AF
collagen, type I, alpha 1 
RAP1A, member of RAS oncogene family 
KIAA0353 protein 

ESTs, Highly similar to JC5818 gamma-xt
Homo sapiens, clone IMAGE:4133312, mRNA,
protease, serine, 11 (IGF binding) 
NM_004055*:Homo sapiens calpain 5 (CAPN5 
ADAMTS2 (a disintegrin-like and metallo 
receptor tyrosine kinase-like orphan rec
Target Exon 

Down syndrome critical region gene 1 
gb:Human nonspecific crossreacting antig
mitogen-activated protein kinase 1 
NM_052863:Homo sapiens secretoglobin, fa






447990 
421535 
453914 

430385 
447731 
400740 
410481 
440274 



BE048821 Hs.20144 

AB002359 Hs.105478 

NM_000507 Hs.574 

C03577 Hs.9615 
AA113437

AA373527 Hs.19385

R34107 Hs.321450 

R24595 Hs.7122 

AA157857 Hs.182265 

AF035528 Hs.153863 
BE246335 
AA863400 

AJ245671 Hs.12844 

AI807320 Hs.227630 



417640 
436027 
407409 
400221 
437751 
450008 
441591 
405973 
424604 



AA852504 Hs.125359 

D30857 Hs.82353 

AI864053 Hs.39972 
AF060168 

AA767373 

H52970 Hs.36688 

AF055992 Hs.183 



439791 
431385 
407266 



407711 
417043 
420136 



424137 
434868 
424408 
433750 
447299 



421982 
407207 
416956 
413624 

452304 
414359 
452934 



AL135743 Hs.25566

AI085846 Hs.25522 

NM_004369 Hs.80988

AW801090 Hs.195851

X54942 Hs.83758 

H68441 Hs.13523 

AL034417 Hs.11169 



AA262105 Hs.4094

R57988 Hs.10706

T77886 Hs.83428 

AW855861 Hs.8025 

AA852596 Hs.250S41 

R56378 Hs.181223 

AA976565 Hs.297753 

AI815733 Hs.114360 

AA335769 Hs.16262 

FB0032 Hs.159263 

AI754813 Hs.146428 

H15448 Hs.31330 

AF043897 Hs.18075 

AI042101 Hs.294107 
BE349470 

NM_015310 Hs.6763

AF208019 Hs.110347

T03651 Hs.336780 

AA810664 Hs.101650

BE177019 Hs.75445 

AU076728 Hs.8867 

AA025336 Hs.61311 

M62194 Hs.75929 

AA581322 Hs.4213 
AJ243212 

AI624300 Hs.172923 

N75238 Hs.13075 

M59371 Hs.171596 



small inducible cytokine subfamily A (Cy 
phosphoribosylformylglycinamidine syntha
fructose-1,6-bisphosphatase 1 
myosin regulatory light chain 2, smooth 
N-myc downstream-regulated gene 3 
CGI-58 protein 

hypothetical protein FLJ14280 
pregnancy specific beta-1-glycoprotein 2
scrapie responsive protein 1 



ESTs 

EGF-like-domain, multiple 6 
RE1-silencing transcription factor 
DKFZP564G202 protein 
WNT1 inducible signaling pathway protein 
gb:yj41a11.r1 Soares placenta Nb2HP Homo 
Target Exon 

collagen, type III, alpha 1 (Ehlers-Danl 
Thy-1 cell surface antigen 
protein C receptor, endothelial (EPCR) 
ESTs, Weakly similar to I38588 reverse t
gb:Homo sapiens AS10 protein mRNA, parti 
NM_002082*:Homo sapiens G protein-couple
ESTs, Moderately similar to ALU1_HUMAN A 
WAP four-disulfide core domain 1 
Duffy blood group 
Target Exon 

gb:MR4-ST0124-241199-026-h09 ST0124 Homo
CX001073:gi|4176497|emb|CAA20116.1| (AL0
Target Exon 

Down syndrome critical region gene 2 
integrin, alpha 2 (CD49B, alpha 2 subuni



membrane-spanning domains, subfamily A 

gb:Homo sapiens mRNA for immunoglobulin 

Homo sapiens cDNA FLJ13357 fis, clone PL

gb:RC2-BN0127-240303-011-b05 BN0127 Homo

ESTs, Weakly similar to 2004399A chromos

KIAA1808 protein 

collagen, type VI, alpha 3 

actin, alpha 2, smooth muscle, aorta 

CDC28 protein kinase 2 

hypothetical protein FLJ14054

Gene 33/Mig-6 

lipoprotein lipase 

hypothetical protein FLJ14566

ESTs 

Target Exon 
ESTs 

Homo sapiens SIB 297 intestinal mucin (M 
Homo sapiens cDNA FLJ14208 fis, clone NT
Eos Control 

epithelial protein lost in neoplasm beta
nuclear factor of kappa light polypeptid 
Homo sapiens clone 23767 and 23782 mRNA 

hypothetical protein PRO2801 



collagen, type VI, alpha 2 
collagen, type V, alpha 1 
Homo sapiens clone HQ0319
chromosome 9 open reading frame 3 
ESTs 

mucin 6, gastric 

KIAA0942 protein 

REV1 (yeast homolog)-like

tubulin, beta polypeptide 

hypothetical protein MGC5391 

SPARC-like 1 (mast9, hevin) 

cysteine-rich, angiogenic inducer, 61

ESTs, Weakly similar to S10590 cysteine

cadherin 11, type 2, OB-cadherin (osteob

hypothetical protein MGC16207 

deleted in malignant brain tumors 1 

collagen, type I, alpha 1 

Homo sapiens cDNA: FLJ23013 fis, clone L

EphA2 






401899 






Target Exon 




1.22 


403579 






Target Exon 


1.22


2.34 


415954 


AA171850 




ESTs 


1.22 


2.24


429171 


AI743173 


Hs.169095 


ESTs, Weakly similar to ARL2_HUMAN ADP-R 


1.21 


444071 


AI627808 


Hs.110524 


ESTs 


1.21 


2.23 




AF036973 


Hs.145477 


HCGIV-5 protein 


1.21 


2.15 


434051 


AF116622 


Hs.114948


gb:Homo sapiens clone FLB4217 mRNA seque
cytokine receptor-like factor 1


1.21 


1.25 




AF073515 


1.21 


1.39 








Target Exon 


1.2S 


429751 


M55210 


Hs.214982 


laminin, gamma 1 (formerly LAMB 2) 






430392 


NM_000627


Hs.241257 


latent transforming growth factor beta b 






422687 


AW068823 
AW204725 


Hs.119206 


insulin-like growth factor binding prote 






424855 


Hs.25560 


ESTs 


1.20 


1.98 




AA232134 


Hs.190028


ESTs 






414154 


BE073258 




hypothetical protein FKSG28 


1.20 


2.18 


AW205314 


Hs.323060 


ESTs 




1.34 
1.27 




AA334592 










AF018081 


Hs.78409 


gb:C15974 Clontech human aorta polyAmRN
collagen, type XVIII, alpha 1 


1.19 
1.19 


2.05 


406731 


AI559131 
BE536115 




gb:tq31g07.x1 NCI_CGAP_Ui1 Homo sapiens 


1.19 


US 




Hs.160983


1.19


1.14 


405531 






Target Exon 


1.19 


2.02 


400363 


NM_001403




eukaryotic translation elongation factor 


1.19 




426611 


BE178050 


Hs.171271 


catenin (cadherin-associated protein), b 


1.19 




427676 


AA394062 


Hs.300772 


tropomyosin 2 (beta) 




413929 


BE501689 


Hs.75617 


collagen, type IV, alpha 2 


1.18 


1.20 


413855 


D13639 


Hs.75586 


cyclin D2 


1.18 


1.20 


427111 


AA351026 




serine (or cysteine) proteinase inhibito 


1.18 


1.12 




F16365 


Hs.114346 


cytochrome c oxidase subunit VIIa polype




1.05 




Y07818 


Hs.74566 


dihydropyrimidinase-like 3(ULIP) 
ESTs 


1.18 


1.06 


446868 


AV660737 




1.18 


1.18 




AV654351 


Hs.82306 


destrin (actin depolymerizing factor)




1.17 


405542 






Target Exon 
ESTs 


1.18 


1.98 


419908 




Hs.293315 


2.02 




AA011117 


Hs.3745 


milk fat globule-EGF factor 8 protein 




1.19 
1.36 


407230 


AA157857


Hs.182265


keratin 19 




448413 


AI745379 


Hs.42911 


ESTs 




2.87 


426653 
424572 


AA530892 


Hs.171695 


dual specificity phosphatase 1 




M19650 


Hs.179600 


2',3'-cyclic nucleotide 3' phosphodieste 




2.08 


440109 


AK001138 


Hs.333149 


hypothetical protein FLJ10276 




1.06 








C1002509:gi|9938010|ref|NP_064684.1| odo




2.22 
1.19 




U20982 


Hs.1516 


i (■ i- ir Vlli l 




Hs.129013


Homo sapiens cDNA FLJ14309 fis, clone PL 






400080 






Eos Control 


1.16


2.53




AK000850 


Hs.272203 


Homo sapiens cDNA FLJ20843 fis, clone AD


1.16 


2.00 


412802 




Hs.74602 


aquaporin 1 (channel-forming integral pr 


1.16 


1.30 




AA447941 


Hs.123423


ESTs 




1.27 


415149 


X12451 


Hs.78056


cathepsin L


Ub 


1.12 


400231 






Eos Control 


1.16 




416653 


AA768553 


Hs.193145 


metallothionein 1E (functional)
transmembrane 4 superfamily member 6 




422813 


AV656571 


Hs.121068 




1.10 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


1.16 


1.15 


422424 


AI185431


Hs.296638 


prostate differentiation factor 




1.21 


432745 


AI821926 




gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 


1.15 


2.00 


412477 


AA150864




microsomal glutathione S-transferase 1 


1.15 


1.23 






Hs.239926 


sterol-C4-methyl oxidase-like 


1.15 


2.31


424512 


X53002 


Hs.149846


integrin, beta 5


1.15 


1.15 


449924 


W30681 


Hs.146233 


Homo sapiens cDNA: FLJ22130 fis, clone H


1.15 


1.24 


414682 




Hs.76884 


inhibitor of DNA binding 3, dominant neg 


1.15 


1.07 


456076 


BE243877 




ATPase, Na+ transporting, beta 3 polypep
Target Exon 


1.15 
1.15 


2.00 


422545 


X02761 


Hs.287820 










AW016610 


Hs.816 


ESTs 




1.05


421848 


X15880 


Hs.108885 


collagen, type VI, alpha 1 


1.15








Hs.111301




1.15 




413936 


AF113676


Hs.297681 


serine (or cysteine) proteinase inhibito 


1.13


449845 


AW971183 


Hs.6019 


DnaJ (Hsp40) homolog, subfamily C, membe 


1.14 


2.07 


430202 
418806 






gb:yd60g02.r1 Soares fetal liver spleen




AA485970 


Hs.191718 


ESTs 






424017 


AA333789 
AA361760 


Hs.296326 


gb:EST37925 Embryo, 9 week Homo sapiens 
ESTs 


1.14


2.16


437272 


AW975957 




gb:EST388066 MAGE resequences, MAGN Homo






4383S7 


N79688 


Hs.204354


ras homolog gene family, member B 




1.23


453152 


AK001933 


Hs.31945 


hypothetical protein FLJ11071 


1.13 
1.13 


2.36 


406849 


AA454809 


Hs.172928 


collagen, type I, alpha 1 


1.33


422110 


AI376736 




secreted protein, acidic, cysteine-rich
follistatin-like 1 


1.13 


1.06


425335 


BE394327 


Hs.296267 


1.13 


1.08


434795 


BE620794 


Hs.4147 


translocating chain-associating membrane 


1.13 


1.08 


417426 


NM_002291 


Hs.82124 


1.13 




416379 


AW580939 


Hs.97199 


complement component C1q receptor 


1.13 


1.01


N38857 


Hs.34145 


ESTs 


1.12 


1.12


421464 


AA291553 


Hs.190086 


ESTs 


1.12 


2.01
1.15 


442420 


AI024834 


Hs.131729 


ESTs 


1.12 






421730 
453542 



407085 
447191 
406713 



414191 
452219 
430108 
402174 
416952 
410199 






AW449808 Hs.334534 

AW836724 
AW976857 

AA417813 Hs.44208 

AW875786 Hs.25734 
Z70759 

NM_014521 Hs.17657

U02629 Hs.77385 

AI791855 Hs.105334 

R31178 Hs.287820 
AA435468 

AW946276 Hs.6441 



AW160644 Hs.118695

M69241 Hs.162 

X91911 Hs.64639 

AW250089 Hs.75807 



442275 
406786 
439403 
428043 
406722 
432242 

424125 
432077 
427687 



455128 
401404 



440400 
444963 
410934 
442849 
420407 
454600 
418454 



NM_005569*:Homo sapiens LIM domain kinas
glucosamine (N-acetyl)-6-sulfatase (Sanf
' I500030E i I8i 122|gb| TO9193.1|(A 0 
Homo sapiens mRNA expressed only in plac 
ESTs 

hypothetical protein FLJ23153 
ESTs, Weakly similar to BING1 [H.sapiens
gb:H.sapiens mitochondrial 16S rRNA gene
SH3-domain binding protein 4 

myosin, light polypeptide 6, alkali, smo



425371 
429925 
406711 
409407 
406109 
418597 
403162 



446921 
406712 
406773 
452082 
419150 
417204 
408339 



AI767736 Hs.290070 

AW377424 Hs.205125 

BE410050 Hs.11859 

AF033199 Hs.8193 

AF230877 Hs.152701

AW178761 Hs.227943 

D49441 Hs.155981

NM_000786 Hs.226213 

N2SS14 Hs.77335 

AW967370 Hs.342655 

AK001678 Hs.86337 

NM_006770 Hs.67726 
J04617 

AW889150 Hs.80595 

AF169797 Hs.27413 
AI554545 

AB012113 Hs.16530

M31212 Hs.77385 

AA812424 Hs.76067 

M51905 Hs.125133 

T29618 Hs.89640

N81037 Hs.1074

R97502 Hs.30443 

BE300094 Hs.227751 

AW449467 Hs.54795 

AW161678 Hs.111334
BE265745 

T92248 Hs.2240 

H27498 Hs.293441 

AW022715 Hs.152160 
R55428 



Hs.1735
AL134685 

AW003867 Hs.1570 

AF193766 Hs.13872

AI831190 Hs.165676

AW861555 Hs.314372 

AA497117 Hs.58893 

NM_014253

R05385 Hs.143509

AI267652 Hs.246107 

U2563 Hs.1165

N33033 Hs.270215 

AA330447 Hs.135159 
AW852530 

AA994364 Hs.125594

AI916973 Hs.213603 
AW811114 

R10099 Hs.269805 

AA814732 Hs.145010 
AW810001 

AA315308 Hs.195870

N69101 Hs.40730 



DNA fragmentation factor, 45 kD, alpha p
Homo sapiens mRNA; cDNA DKFZp586J021 (fr
NM_003213*:Homo sapiens TEA domain famil



ESTs 
ESTs 

Target Exon 

gelsolin (amyloidosis, Finnish type)
Homo sapiens cDNA: FLJ22667 fis, clone H
hypothetical protein FLJ13188 

zinc finger protein 204
NM_005805:Homo sapiens 26S proteasome-as 
microtubule-interacting protein that ass 
serine (or cysteine) proteinase inhibito 
mesothelin 

cytochrome P450, 51 (lanosterol 14-alpha
myosin, light polypeptide 6, alkali, smo
Homo sapiens cDNA FLJ13289 fis, clone OV
Target Exon 

similar to DNA-directed RNA polymerase I 
C2000231*:gi|9802031|gb|AAF99597.1|AF239
macrophage receptor with collagenous str 
eukaryotic translation elongation factor
NM_004552':Hr ......... - 



ng pH domain, PT 



small inducible cytokine subfamily A (Cy 
myosin, light polypeptide 6, alkali, smo
heat shock 27kD protein 1 
hypothetical protein FLJ22501 
TEK tyrosine kinase, endothelial (venous 
surfactant, pulmonary-associated protein 
sentrin/SUMO-specific protease 
Eos Control 

lectin, galactoside-binding, soluble, 1 
ESTs 



Homo sapiens SNC73 protein (SNC73) mRNA,
ESTs, Weakly similar to ALU4_HUMAN ALU S
gb:yi79b05.r1 Soares breast 2NbHBst Homo
inhibin, beta B (activin AB beta polypep
gb:DKFZp547M126_r1 547 (synonym: hfbrl) 
' -~ ; lor H1 



cytokine-like protein C17



ESTs, Moderately similar to ALU1_HUMAN A
Target Exon 

odz (odd Oz/ten-m, Drosophila) homolog 1
hypothetical protein FLJ21924
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
ATPase, H+ transporting, no



Homo sapiens cDNA FLJ11481 fis, clone HE
gb:PM1-CT0243-071099-001-g06 CT0243 Homo
ESTs, Weakly similar to T25472 hypotheti 
ESTs 

gb:MR2-ST0131-111199-016-a04 ST0131 Homo



411745 AW867825 






AK001106 
U88967 
BE258532 
T49951 
AA127221 
AW005866 
AA806794 
AA701357 



Hs.21586 
Hs.274419 
Hs.78867 
Hs.251871 
Hs.9029 
Hs.296502 
Hs.193969
Hs.131511 
Hs.192759 
Hs.1867 
Hs.137476 



ESTs 



progastricsin (pepsinogen C) 
paternally expressed 10 
gb:xq01d05.x1 Soares_NHCeC_cervical_tumo
AW749895 Hs.332520 Homo sapiens mRNA; cDNA DKFZp434A1014 (f
AA332941 Hs.85226 lipase A, lysosomal acid, cholesterol es
NM_006979 Hs.278721 HLA class II region expressed gene KE4
H28383 gb:yl52c03.r1 Soares breast 3NbHBst Homo

AW379378 protein tyrosine phosphatase, receptor t 

Target Exon 
Target Exon 

esophageal cancer related gene 4 protein 
XIAP associated factor-1 



Hs.43125 
Hs.139262
Hs.221575 
Hs.296234 



AI478427 
X99699 
N38913 
AW968616 
AA609784 

AW972594 Hs.335499 

AK000517 Hs.6844 

NM_005411 Hs.177582 

Z43405 Hs.75668 



AW970623 



AW663632 

AA761546 

NM_000399

M33600 

NMJH1430 

AW081710 

AA037675 



ESTs 



NALP2 protein; PYRIN-Containing APAF1-li
surfactant, pulmonary-associated protein 
Hon piens. Si.ni R'KEN cDNA 1 )0 
NM_024085*:Homo sapiens hypothetical pro
citrate synthase 

uncharacterized hypothalamus protein HT0
major histocompatibility complex, class 
gb:EST382705 MAGE resequences, MAGK Homo 

Homo sapiens mRNA; cDNA DKFZp434A119 (fr
ESTs, Weakly similar to ALU1_HUMAN ALU S 
early growth response 2 (Krox-20 (Drosop 



Hs.75615 
Hs.152675
Hs.122910 
Hs.114309
Hs.62661 
Hs.96901 



apolipoprotein L 

guanylate binding protein 1, interferon-
hypothetical protein FLJ23049



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 



430712 
411880 
451149 



417563 
431089 
455797 



432222 
458208 
457741 



301999 1 
1139083 1 
4941 2 



2243443 1 
125941_2 
1511159 1 
1483711 1 
40161J 
539529 1 
45807J 
120741 1 



AW975183 AA973583 AI365103 AI699495 AI301787 

AK002039 AL117524 AV714494 AW954901 AL045243 BF955185 AU137860 AW880615 AW880496 AA256290 BE767078 N44348 AI886676
AA455877 N66571 AA999864 AU157344 AI817146 R54821 BE223107 AA455880 AI355752 BF589210 N63487 AI924033 AI923020 AI306145
AI919421 AI584169 AI250173 AI440227 AA669696 AW244040 AI358104 AI570333 AI413315 N94787 R72348 N94780 BF944396 BF754698
AW005707 N98831 BF001047 BF588691 AA318076 AW501474 AW883910 H10056 N63481 BE838574 BF909132 BI084973 BG257295 BG818471
BE348449 AI420623 AW271213 BE048764 W44682 AI887849 AW903942 AA975919 AA312915 BF948057 R55120 H10110 BI045196 AW880645
AW976930 AW292808 AW451796 BF514112 AI806378 AI658903 AI769457 AW593455 AI625525 AI538551 AI660509 AA761825 AA973287
AA861483 W73065 AI735361 W60499 W76653 BG959557
AW044647 AI670953 AI656180 AA484715 AI659205 BF923472
BE088101 T05990 AW872477

AF231512 AW300273 BG779015 AW510935 AI989816 AA137069 AI748878 AW150861 AI362628 AI805872 AI575382 BE855437 AW044703
AI677769 AA886718 AI753144 AA626885 AI018092 AI263010 AW026173 BE221138 AA256268 AW571932 AW276137 AI634216 AW296259
AA977716 AI302589 AA348340 AI720838 AI311733 AA015867 N73713 AL047586 AW840354 AA256196 AW840357 AW840504 T35664 Z36755
AW954421 AA247424 AI056930 T31380 BI910428 H88489 BG675223 AA443427 BE879501 AA478530 R72977 AA298568 BF792417 AA356982
BF922499 BE764808 BE555636 BF903986 BF331381 N42207 BG523760 BG611090 BF735387 BE697757 BE697755 BE718853 N78560 AI984095
AA137140 AA053711 N59865 AI078134 AA643796 T57803 AA018842 N66799 AI004600 BG896323 BF895104 N73684 N73806 N73811 AW900287
AA018641 

BG207209 BE166299 AI204995 BG199355 AW9699C8 AA528756 AW440776 BI044354 
AA203701 R86895 

BG940189 AW063489 AA715980 BF001091 BF880066 AA666102 AA621946 AA491826

BE091833 BE091874 BE091871 

BE063078 BE151503 BE151498 

AW872732 AW827432 AA199662 AA610519 R54983 

BG207209 BE166299 AI204995 BG199355 AW959908 AA528756 AW440776 BI044354 
AI990640 AI380016 BM273298 BM273060

BI017968 BE044740 BI017768 AW JF380597 B 17970 BF746974 BF380582 BF380592 BF908552 BF907924 BF380784 BF380651
BF380634 BE165581 BE161439 BF908606 BI017961 BE044718 AW827623 BF907758 BI017967 AW827621 AA653908 BI017765 BI017955
BI017960 BI017798




411010 1066474.1 
449780 31099.2 



444986 704733.1 









456054 1263570.1 
409368 110612 1 
433430 2181751.1 



408404 658475.1 



437158 59575.1 



443144 16112.3 



432810 101919.1 



AW313381 AW816094 AW813357 AW814469 AW813293 AW816099 AW813295 AW813425 AW813331 AW813325 AW813351 AW813427
AW813339 

BG721806 BG623574 AA367501 BG436403 BG619828 BG570704 BF086115 BF086118 R78932 BG620360 BG571920 BF997723 AA353244
BG620631 BG621967 BG435818 BG620442 BG621518 H12650 BG573175 H61600 R67494 H01715 D78811 BG435953 BF107255 D79043 R67255
H01310 BG570941 BG570693 R21776 AA327133 R32578 R30775 BG570963 T86946 H61601 W86279 BF991104 R21732 BF990905 BG622861
BE929694 AI090290 BE929277 BE929284 AA367783 AA082531 D78839 H78318 N91085 BE929344 D63217 BE929334 H53536 R80360 H54070
C17064 AW962470 R00900 BG613698 BG623946 H94918 BE929345 AA004267 BF957177 BG620685 BF086421 187029 C17044 H60972
BG573514 AA131924 D78838 BG003560 C18615 W86323 R09737 R02529 AA367502
AJ235667 AJ235668 AJ235669 AJ235670

BE141160 BE141231 BE141793 BE141791 BE141167 BE141807 BE141806 BE141805 AW807591 AW807590 AW807586 AW807583 BE141803

AW845918 BE141207 BE141158

BE065209 BE065364 BE065110 BE065111

AV683451 AK057494 BG718853 BM152866 BG390826 BE709644 AI864727 BI045181 BI459637 AI909102 AI909090 BG722507 BI023834
AW833783 AW833646 AW833525 AW833351 AW333526 AW833825 
AW268472 AI204197 AW592537
BE145894 BE145837 BM263472 

BF821471 AW795791 BF844843 BF821371 AA307584 AW795790 BF833724 BE154067 BE064709
AA136653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE168945 
AA994657 AA382291 
AA203637 AA832266 H67452
AW873613 AA620338

AL527514 AI732432 AA133309 AI225224 AV700997 BF589361 AW291763 AL121500 AA129708
AW821260 BE162466 BE161168

BG402352 BG545066 AA150252 AL036760 AA452480 AI033256 W68776 W93372 N31248 AI052219 AI367635 W69374 N88610 R53194 BI524854 
BI497111 BF940043 AI129268 AI359798 AI056480 AA121421 AI042150 AW449003 AI418180 AI419420 AI356058 BF832243 AI349330 AI359448
W76647 BF477170 AA099163 BF994549 AW608256 AA045418 H03770 AL574791 AW069455 BE302148 AW022281 AW960273 AA121263
AI336371 AI989381 AI131425 AI147433 AI311537 AW338638 AI141649 AA709414 AI187177 AA780884 AI333805 AA045312 AI623918 AI349421
W63753 W70299 AA557276 AA299007 N98212 W74064 N24823 T54392 AA054724 W73059 AI869152 N93462 N71839 AI537432 R71628
AA303089 AI498550 T60941 AV706417 AW067848 AI150677 AW338118 AI336313 AA826256 AI139518 AA662948 AA902723 AI970175 W68682
AI089380 AI148372 H99951 AW133001 AI270317 AA532767 AA044727 AA931652 R82469 AA150261 W67788 H57495 R80715 AW149812 N78914
AI862034 W61122 AW023118 W69375 T88917 T47984 N21531 R35646 AA055544 H15534 AA688295 AA090586 AA044764 BF994641 R79547 
N21313 BF674610 H02874 AW975323 R16904 AA328030 AA054S71 R79546 BF832310 AI249109 

Y03200 NM_004581 BC003093 BE733834 BI753321 BG773890 BF091906 BI917541 AI023762 AA587230 BF435086 AI264262 AI687392 AI810536
AW589886 AI244419 AA749261 AA535435 AW205689 AI755770 AI765431 C02465 AW305347 AI818456 AA322111 AW381845 AW381829
AV749407 AA811636 AU159893 AA603065 AA652542 AI468678 R49615 AW381863 BE389867 BE182387 BF087771 AA527551 AA134051
AA831504 AA134052 AI871759 AW089048 BI913532 AA367709 BG828155 BF093014

BM471219 BE093160 BG171781 BI254009 A!905474 AA453162 AA829759 AI086559 AA776022 AI377446 BF5B9018 AA452822 AW614566 

AA443680 AA476733 AW970674 AI393291 AA988283 AI905528 AW384956 D78656 

AF150286 AV739062 AA835857

AA136653 AA136656 AW450979 AA984358 AA809054 AW238038 AA492073 BE163945 

AA774785 AA584875 AA577705 AA683178 AA083204 AW382057 T92332 T51823 T02858 AA083375 T92381
R48772 H42892 H42537 R47898 N28263 H25721 F32386 H43971 R48205 F21390 H45809 AA007629 R47897 R83734 H45844 AW983653 H43970
H42536 H24495 R48875 H42961 H22079 R36018
BF314481 BE313241 BI195333 BE383148
AA071059 AA085201 AA085020
AA961586 AI863735 AA588325 
BE152393 AA330984 BE073904 BF176271 
BI836699 AM 23195 

BG168298 AA247945 AA528295 AW971284 

BF308898 BE298629 BE298765 AW192518 BE299614 BE300025 BF307463 



AA019761 AA017656 AA017374
AA845538 AA890229

AK002039 AL117524 AV714494 AW954901 AL045243 BF955185 AU137860 AW880815 AW880496 AA258290 BE767073 N44348 AI886676
AA455877 N66571 AA999864 AU157344 AI817146 R54821 BE223107 AA455330 AI355752 BF589210 N63437 AI924033 AI923020 AI306145
AI919421 AI584169 AI250173 AI440227 AA669596 AW244040 AI358104 AI570333 AI418315 N94787 R72348 N94780 BF944396 BF754698
AW005707 N98831 BF001047 BF588691 AA313076 AW601474 AW883910 H10056 N63481 BE838574 BF909132 BI084973 BG257295 BG818471
BE348449 AI420623 AW271213 BE048764 W44682 AI887849 AW903942 AA975919 AA312915 BF943057 R55120 H10110 BI045196 AW880645
AL050068 AA160485 AW173544 AW296506 AW439860 AI521563 AI702529 AI393606 AW138323 AA570109 H19504 BM021968 BF063327
BF593552 AA630766 AI597717 AI807123 AA523012 AI358250 AW451857 AA974203 AI762577 BF512552 AW007307 BE575286 AW450602
AA962057 AW516069 AI582546 BF221924 BF222543 AI891808 AW468599 AW000736 AI866625 AW235356 BM021837 AA911956 AI680606
W86515 T03370 AW611634 H41653 AI458349 H19588 AW090198 AW043993 R39847
M18728 

NM_032013 AB044943 AF308609 B',769254 AI519262 AA845639 AI289073 BE670187 AA207127 T33098 AA633492 BE328324 AI083620 AI204945
AA903471 AI372602 AI049836 AI049737 BF000101 AI686167 BF436896 AI659189 T32971 AA311302 AW733149 AI435981 T03438 AA879206
AI972177 T33083 AA613910 AI971019 AI027140 AA853739 AI520528 AI637519 AI972307 BF054861 AK027665 BF082751 AI598127 T33663
AA204743 BE086660 F12132 T26372 BF765338 AA323106 AA322907 BI037062 BE315235 AV722868 BE254381 T66212 BE566142 T09034
AA406202 BF843940 AI743855 AA113356 BG990683 BE086729 BE535539

AB053726 AI651414 BE245990 BE245765 BF439734 AA648422 AA040639 AI340155 AA255928 AA278365 BE766296 AA280771 AL555562
AI474638 AI863068 BI260946 AK027039 BG615852 AI698039 AA252016 AA258886 BE905205 AW501167 BF514117 BI857400 AW297001
AI624923 AA125900 AW272165 AA190967 AA280729 AA035532 AW129692 AA125899 BG528645 BE614599 BE454693 AI560128 AA551511



BG292389 C06094 AI668930 AW104534 AA310513 AA830127 AW134897 AA046953 AW965490 AI810530 BF092924 AA334151 AA334725
D31302 R20723 AA263003 BI824635 AI275287 AI684428 AI524234 AI335035 AW014704 AI911443 AA972102 AI367512 AI126670 AW016017
AI286003 AI147163 AA626033 AI539156 AA565542 AI094253 AW512612 BE889628 AA744752 BE646306 AW471324 AA999975 AA863400
H17550 AI991439 R46187 BE929954 AA333976 D63102 BF744491
AF075017 R66779 R22463 H02780 

NM_002082 L16862 BG828886 BE795217 BE904064 BE294526 BE297283 BE394617 BE935127 BE935106 F12351 BG823182 H16710
AW978796 AA767373 AW173343 AA336163 N27563 AA905328 R97032



400161 2656_2 



421341 1407_1

434051 73505_1

410933 1064624_1

406731 0_0

446863 15525_1

400231 MH494_5 



BF374577 AW809840 AW809996 AW809798 AW309695 AW809646 AW809738 BF374582 AW809716 AW809826 AW809302 AW809747
AW810152 

BG571303 AA410586 AA035018 BG572117 BG620022 AA147247 BG005785 BG014448 R31981 H02668 H12498 R36203 BF992089 R73999
T49904 R75732 BI057974 T53681 AA147933 N50695 R83588 R25671 R31935 R25110 R36105 AK055628 BE157467 AW663674 AA190993
H01642 BF510304 AA626915 AA746952 AI161014 AA099554 BG572534 AI803329 AI309932 AI808765 AA411449 AI378760 AA976929 AI378620
AA909684 R75632 AI360919 AI350463 AW069127 AA411621 AA742532 H12451 BE208298 H03612 H12839 N58781 R75957 BF996484 AI240665
BF989591 BI056086 BG001590 BF107035 

AK023419 BE464935 BE221668 AI148835 BI359909 AI374780 AA766392 AI347967 AI582917 AA477117 AA229236 AA652637 AA636109
AU134580 F21298 BF802607 BE769124 AV658891 H89616 AV659853
BB006307 BE006341 BE006311 

AF007190 AW820705 BE168488 AF007192 BF753303 BG973971 BG978568 

1)44839 NM_004651 BC000350 BM58316 AU117940 BG759024 BG749694 BE799505 BG831537 AI816335 AA325352 AL547005 AW157038
AI859331 AI816186 AM 50786 AL043E49 AW162880 AU159233 AI143169 T03478 BE727648 AA764725 BE206603 AI369814 AI984369 AW157545
BE221486 H99016 AU159025 AI074496 AI494516 BE245950 AA704385 AA280852 AI479595 AI369776 BE671398 T05538 AA682249 BI677303
BE645335 AI359434 H92868 D52599 D53609 D54715 T06015 BE222174 AI954706 D53218 D53787 R69889 W86896 AI497670 R70771 BF309414
BE620147 BG910597 AW964968 BE836120 AL579715 H56512 D55956 BI044097 AL555239 BF220278 AA081991 AI819544 AW001573 AW131600
AI858764 D52367 W22034 BG818979 BG024561 BE702779 BI458863 BI910399 BG707755 BF348284 H10055 BI086315 BE620574 H41088
BG119517 W23267 W21941 AA328817
AW513804 BE179199 BE179195 BE179198 BE179204 BE162686 

NM_007329 AF159456 AJ243212 AJ297935 AA295769 NM_017579 AJ243224 AI492875 AI795676 AI749838 AA918144 AI814590 AI923531
BF513992 AI720725 AI150879 AI279072 AW612904 AI492104 AI284510 AI141231 AA613554 AW662148 AW769047 AA565985 AW612888
AU100513 BG955585 BG955588 AA295763 BE829414 BF760645 BG954398 AA295332 AA295795 BE932867 AW769569 T89953 BE934311
AF116622 AI114507 AA640834 BF111602 AA377999
C16024 C15974 AW811066 AW811052 AW811020
AI559131 

AK074473 BC017997 BI831060 BF971101 AI886394 AI082824 AV708785 W86073 W07772 AV660737 AI816793 R52250 BG183529 AA633473 
AI191256 R44763 R19947 BF571346 W86257

BC013310 AF261085 BC004109 AY007133 BC009081 BC001601 NM_002046 M33197 BC020308 J02642 M36164 BE794233 AV721080 BE255459
BG926429 BG389312 BG477333 AI031799 BI763443 BI260432 AA989106 AV728576 BI091380 AA402499 AI200513 AI284734 AI223995 AI289749 
BG283291 BM013814 AW438544 BM450203 F35435 F33262 BE890952 AA401181 BG939668 F35525 BI088182 F34674 F33506 BM471326
F34677 AW276712 AA187508 F34866 AA114245 AA522581 N23935 AI076923 AI018505 BE879774 BM465637 AI753078 BG222159 AA595947
BF970917 BI094125 AA719841 BE893087 BG775178 BE793983 BE797071 BF339134 BE409272 BE266456 BE796770 BE745957 BG755835
BE266758 BE259342 BM450181 BG748174 BE299322 BM423567 BM467637 BM452667 BM479516 BM452420 BE273297 BM466364 BM450640 
BM478743 BM459394 BM455306 BM472001 BM473247 BM478771 BM480379 BM459071 BM450106 BM467584 BM464548 BM465044 BM450176
BF569359 BM462924 BM455329 BM471815 BI862301 BG331735 HO',S03 AA374894 BE902964
AA658826 AI821926 AI791191 AA535129 AA554492

AI220117 AI857837 AI218371 BM091400 AI304954 AI198508 AI400738 AW571549 AW950042 AI089943 AA437280 AU150878 BF197070 AI267984
BF594181 BF196688 AI433152 AI338921 AI620364 AI280197 AA652531 AI674938 AI342447 AI620350 AI281295 AI148621 N54787 AI338121
AI281153 N51899 AI087072 AA954788 AW069054 AI346309 BG529329 AI340135 BF083036 AI1673G5 AW819G57 AA935468 AI467868 AW148701
AI383720 BE047685 AW015498 AA937149 AA708346 AW771478 AW802508 H53334 AW389204 AW798230 AI553922 AI560688 AW950043
AI961682 AV708506 R01853 AA126514 N52757 AI536893 AI926052 AI418720 N99964 AI588933 AI915737 AI080691 AI185358 N48996 N68575
H82824 H60037 AI247247 T95664 BF593863 AI749637 AW088541 AA991294 AA887452 AI073726 AA633132 AA629674 AA629649 AA629656
AA578595 AI168758 AA804572 AI085786 AA994396 AA991209 AA948663 AA929054 AA927952 T87001 AA928210 AA629296 AW802267
AW384129 BF744400 AA194110 AI382839 AA194837 AA406284 AI250750 R37035 AI525586 W01244

BG210376 BG217800 BE925778 W39114 BG682395 N70544 BE709097 AW275615 BC001469 AL564888 AI088126 AW003852 BF792438 
AA161295 AW970131 AI127310 AW029307 AW192534 AA843144 AW606235 BE221541 AW008111 AA224203 AA604507 AW794761 AA134005
AA126850N99165AW769391 AV\bW B71 BE 027AW401527AA486231 AA486417 AA191542AA028128 AA159991 AI490090 

AI241024 BM145449 AA774661 AI626021 C18251 BE185311 AA291517 N38896 N59222 AI245611 AA159207 AI298572 AA169585 AI131139
AA157960 AI439983 AI208276 AA938061 W67305 AW337587 AI357055 W04739 AI214517 AA617789 AW241277 AI880213 AI582789 AI143996
BE814848 N49964 AI936222 AI817819 F09976 AA039349 AW805002 T35117 N94388 AI689530 AW384573 AW384555 AW384539 AW384473 
AA129709 AW384466 BG194342 BG204579 BG027536 AL573075 AA399553 AW794949 T88865 AW511211 R26588 R36111 BG170598 BE937009
BG678833 AA862899 H96612 H02273 AA758487 BF211173 N32570 AA088287 R68451 AA297563 H16847 R80655 AA421891 W68402 R28379
R64119 R70109 R77661 R67963 B6701 844 H68670 AA169664 AA114111 BE715243 R69317 BE715252 BE713804 BG336586 NM_001679
BC011835 U51478 BM463117 AU119746 BI462090 BI227086 BG706303 BF059073 BG706532 BI544715 AI568735 BE858747 AU122881
AU126210 AI186547 AU134705 BE231323 AI147220 BE253820 AW973937 BG281863 BE858367 BE278941 AI262814 AI001194 AI391616
AI200862 N32564 AI161064 AI089818 BF514359 M370915 AI341797 BE263168 AI218415 AM31098 AI255310 AI928440 AI066642 AI375442
AW028327 AI217792 AI161020 AI342854 AI221544 AI304700 AI343005 BM148839 AA831536 AW074258 BF940569 AI149876 AI914574 AA298442



430202 1233214_1
424017 888651_1
437272 176_2 






430108 1233254_1
410276 641443_1
400247 2764_1



AW966107 AA334317 AA333789

AB075828 BG107783 AW021313 BI492345 AW385707 AW580860 AW296117 BE072066 AW580775 BF679498 AW580828 BE003655 BE003672
AW580851 AW580908 BI037146 AW580894 AW580864 H17858 N50343 N54223 F05579 F07386 F05578 AA773248 AA354359 AA330257 C20685 
BE548300 AW968728 AW968554 AA777644 AA706896 AA682517 AA832267 BG165087 N51097 N51567 R95837 BI256603 W89042 R95836 
R97364 AW975957 AA747943 AA811289
BF568186 BF899745 AW836724 BE243668
AW976857 AI809001 AA769369 AW102732 AA761235
AL578301 AI337389 AI671302 AA978135 BF591738 AI785912

BF036043 AW190446 BG194731 AW662036 AI445021 BE937550 AW818972 AW393132 AA834685 BF112058 AV721682 H16423 AI270167
AI857345 AA937302 AW818444 BE929780 BG493678 BF155010 BI598271 BI599811 BE161728 AW578737 AW753711 AW379707 AW381918
BG506608 AW028637 AW994240 BF887392 BF790073 AW381624 AV727105 BF439618 AA443174 AI018009 N42850 AW573242 AI417258
AA463483 AI676131 AI167170 AA836627 AA443828 AW592922 AA235129 AA730278 AW439062 AW474332 BI043239 AW474342 BG708553 
AW362423 BF090028 BE827256 R1 6550 R39478 R39479 R94368 BG540916 BM314745 AA251087 D54231 D55274 BF085805 D31589 AW966405 
AW994425 D81879 BE093545 AW901 1 07 AA333529 BI021 552 R56420 N39976 AA573281 H32595 AA234955 BE093539 AW36700S BF358697 
BF366318 AA663856 BE702099 BF035969 AI267384 AI257232 BE348320 AA621574 AA861212 BF083343 BF083341 AV745131 D53074 
AW954476 AW954472 AA376838 AV724531 D53053 C14928 AA093287 AA062638 BG483558 BE940050 AA765954 T70171 BE938775 BE940057 
D53502 AW373300 AL1 18798 BM128723 AA19341 1 AW444709 AW952455 AI837612 BF431948 BI496876 A1264159 BM128481 AI624657 
AI689301 A1969467 AA861685 AA251595 AA625761 AA872090 A1826790 AA328366 BE827416 R75951 D56918 R68122 BE827384 AL118797 
AI184154 AA164411 BI495332 BE8581 13 AI863860 H00660 T69849 AW780389 C14667 BE934995 BI018652 R92801 AA164410 H00752 
AW373305 AW373299 AW373302 
AA928810 AW968393 AA465294 AA811301

AA083514AI554545AW169852AI3638.' \] 6A f f !2b AI7B5624 AA147545 AA147552 

BC022339BC009610BC010537 X73 Mf-_i 1671 1.979 BM467814 BM450743AU1 32951 AU137129BG493425 AV758819 BG708412 
BG705885 BG702217 AV716638 BG777009 BI545589 BI5521 53 BM47671 2 BG770858 BG527656 BG528277 BG391 3B8 AV71 6861 BI602926 
BG290073 BI667399 BM451469 BI667173 BI602139 BG532171 BI669216 BI544727 BG721852 AV716503 AV701327 BM090738 BI492000
A1308656 B!544904 AL C ^ C 1 - 1 ' 1 ^ v T 1 5505 AV714587 AV717902 BF668072 AV71S3B5 B1461927 BM090954 AV71 7826 BG503676 
AV647719 BG501392 BG428433 BE895629 BM3131 17 AW021050 BG435032 BM1 5291 0 AA31 3503 AA872377 BG574714 AV712054 AV732696 
AA252476AV712759AL599643 BE79'^..rCl/."<-' A/7333; 575 ' 7 AV76Q376 AV725139 AV716379 AA887155 BE830003 AW023795 
AL599291 AI902948 BG944042 F00781 M352483 BG217897 N33888 AW581924 BG554730 D31410 AA353088 D31288 M295029 H95170 
BE935104 AU139980 BG772963 BG776470 BG532512 BG1 05449 BI545421 AV715456 AW386083BG599714 AL535832 AL514940 BG190861 
BG210593 AW999254 H95138 AA353363 EE764809 N50375 BE091363 EG701255 BI860846 BI832485 BG168150 BG028647 BE545301 
BG900321 BI909737 BG70 I 11 1113 BG7001 21 BF031492 N85802 AV715940 N51590 BG993478 BE172016 AW893622 

439403 4937 10 AK056692AF086220AI375066AA284293W32566AW797951 AA960897AA504145 

450724 4016? 1 BF060834 BI01 9048 AW862002 AK056737 AA429538 AW06331 1 AW440655 AW231 970 AA423581 50 AW I 33AA984131 AA552874 

AA564758 AA528743 BG927275 BE0451 17 AW975234 AA664937 BF749390 BG928948 AA984546 BE009153 AV732487 BE078167 AW872716 
AW063245 AA578441 AI820704 Al '3228; P c 5<2i EF999154 BF880485 

432077 452948_1 AL134685 AW972760 AA525319 AA526644 AA525342 AA526632 AA525296 AV739423 AW972755

449625 249224 J BI918168 AW779760 N48674 AI375997 AA235370 BG699146 AI913631 AI498402 AI016320 AA323193 R49021 D59344 BG985750 N45526 

BG986917 T61382 R49391 R45432 AI203107 R35004 F07491 R25094 R35360 
455047 1092329_1 AW852530 AW852526 AW852527 AL597969

410934 1064652_1 AW811114 AW811095 AW811094 AW811124 AW811157 AW811054 AW811087

454600 1063489 1 AW810001 BF374633 AW809694 AW809572 DF374748 BF374562 AW810209AW810092 AW809786 AVU1M, 1 W 10 9A\ R 39884 BF374650 
BF374641 BF374764 BF374744 AW81 0552 BF374B78 BF374653 BF374579 AW810168 AW81 0170 BF37461 1 BF374676 BF374667 AW810353 
BF374682 AW810154 BF374688 BF374565 BF374757 BF374637 BF374743 BF374719 AW809664 BF374643 BF374680 BF374714 BF374708 
BF374716 AW810432 BF374691 BF358066 AW810006 AW810345 AW809960 

455500 1228737_1 AW963582 BE064192 BE064169 BE152580 AW963587

411745 43036_8 AV704306 BF368780 AW867826 AW859896 

418869 12789_14 AA229762 AA230035

432093 1237011_1 AW972670 AA525808 H28359 H28383

452239 10116 4 BG034853 AW17331 5 AW303375 BG190225 BG939 1 53 BF057308 AA600736 AI751 258 AI090486 BE939504 AW631 492 AI768270 AI3521 33 

AA417652 BE378218 AA599207 AW794702 AA024968 AA445024 AI148235 AI191710 BI493797 AI272646 BI493796 M634323 AI754332 
AA25841 4 C05155 AI218226 AI039656 AI350380 AI084698 AI754989 AI673545 AI43201 0 AI751 035 AA375571 AA446297 BG21 6743 

430353 10288 1 BC010106 ALS60552 AU133296 AU133086 BE268667 BE258523 BI544879 BE398161 BG473088 BI544445 BE258021 BE296339 BE255040 

BE263020 BG706790 AL598627 AW952337 BG758113 AW512753 BE267666 BE253415 BI225718 BE268350 BE258245 BI224965 AW772605 
BG723903 BE397282 BI196341 BG702880 AI878949 AL500437 AA416759 BE259917 AI031582 BF512142 AI088248 BE560328 AI802623 
AI288613 AL597585 AW768553 AI816352 BF732831 BI225687 AA833686 AA722593 AA807750 AW068064 AA405187 AI923236 N51593 AL527710 
BG282576 AL525927 AL525971 BI869547 AI064725 R91B56 H46814 H20112 W01682 AW848870 AW848585 AW376662 AW848985 AW848937 
AW848862 AW848581 AW848512 AW8481 75 AW752B23 AW75261 8 AW376822 AW376821 AW376684 AW376623 AW376622 BE706047 
AW752602AW752691 AW752E74A" \3 916 AA/t2610 AW752584AW752613AW752660AW848709AW84B576AW849155 

AW848931 AW348980 AW848979 AW848978 AW840973 AW34891 6 AW848713 AW8487D8 AW84B642 AWB48641 AW848639 AW848573 
AW84B493 AW848492 AW848489 AW848488 AW8484B7 AWB48353 AW848352 AW848220 AW752698 AW752697 AW752682 AW752681 
AW752680AW752b79 I r . 64 / 51 AW75263BAW752637AW752636AW75262BAW752B26AW752624AW752619AW752596 
AW752608 AL582019 BE875587 AL5291 75 AW965868 BGB85208 AA259073 EE696S73 AA459543 AA358314 W40564 BF926427 AW849000 
AW84871 8 AW84851 5 AW848507 AW848444 AW848440 AW848232 AW348222 AW752657 AW376786 AW376781 AW37661 5 AW376614 
BI752581 M534520 AI748906 AA047799 AI014753 AL514450 AL5819B2 BG743146 W24171 H20102 H1 1227 AW752607 AW006596 AW13037B 
BE71 651 9 AW752661 AW84B2B9 BF349557 AW75261 2 AW752532 AWB4891 0 NM.004077 AF047042 AL560606 BI765896 BI1 96831 BI855656 
BE906674 BG749937 BE535486 BE019810 AA313713AA992542 AA332541 AA6B29B5 AA3561 25 BE140478 BG750945 BI457548 BG025661 
BF326302 AA325019 BG980676 AA337465 AA321974 BG949285 BG427585 R23979 BG61 1485 BE5B0B78 W 16977 N50379 BG824101 BG471 750 
BI463171 W04691 AU099360 BG471 590 DM011999 BE262945 BE559801 BF75643B BE861957 BE314546 BG911831 BG150811 BG112017 
AA157518T92368AW752620 
431323 1235269_1 AW970623 AA502839 AA502819

442195 15007 1 U81984NM 001430 BE9D7085 B1333232 A1021986 AU138476 C18601 U51626 AU100517 BI054387 AU076970 BE786454BG010080AW377189 

BF998789 AA368139 R1 1396 T8361 3 BG006324 BI01 2404 BG001643 BF757957 AL549361 AL544018 BE0D2870 BE929314 BE090199 AL046650 
BI053717 BE929315 BI054967 BF960055 BF925432 R05421 BF922073T70331 BI004403 



405443 
401645 
401673 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position
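By way of illustration only, the nucleotide positions of predicted exons listed in this table (for example, 10368-10572,11293-12356) could be read into exon intervals as sketched below. The function names are hypothetical, and treating each range as an inclusive start-end pair is an assumption that the legend does not state.

```python
def parse_exon_positions(field):
    """Parse a comma-separated list of 'start-end' ranges into (start, end) tuples.

    Assumes each range is written as two integers joined by a hyphen.
    Entries that do not fit this pattern (e.g. truncated ones) raise ValueError.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, sep, end_text = part.partition("-")
        if not sep:
            raise ValueError(f"not a start-end range: {part!r}")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"end precedes start: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, counting both endpoints (assumed inclusive coordinates)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_exon_positions("10368-10572,11293-12356")
    print(exons, total_exon_length(exons))
```

An entry that was cut off in the table, such as one ending in a lone number, is rejected rather than guessed at, so damaged rows can be flagged for manual review.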



8844110 
7382153 
8571772 
9958129 



405061 
402327 
404342 
404429 
403344 
401593 
406461 



9887671 
8077108 
9965537 
6456174 



122596-125136 

26662-27225 

132492-132932 

108675-108770,10 

115854-116033 



70823-70990 

10368-10572,11293-12356 
158842-159136 
403010
406387
402885 
404501 



405491 
400818 
406475 
401025 
402308 
405213 
400740 
402825 
405973 
405818 
402621 
401311 
401899 
403579 
404600 



9966252 
7684483 
9801137 
7534014 
9743564 
3132346 
9256180 
9926751 
7229859 
9797538 
7577926 
5754724 
7630361 



1552494 
7405502 
9797871 
5801545 



79659-79804 
97621-98084 
96314-96539 



8117518 Minus 



9212516 Minus 



9857564 
8516051 
7670575 



Plus 41515-41695



26052-26803 

174540-174634,175449-175558 

123579-124447 

17175-17373 

91720-92115 

1034-1177,3143-3266 

12366-12614 

81857-82045 

172644-172755,173085-173200

125417-125563,128052-128180 

179287-179483,181044-181166,181844-18203 

92080-93638 

50267-51151 

79920-80510,80576-80746 

78572-78807 

103859-104254 

29055-29196 

130806-131036 

180124-180754 



34183-34357,35635-35751 
123525-123713 

146733-146860,147899-147961,153127-15325 
253499-253674 

164091-164162,164397-164516,166720-16679 



82552-83613 

136474-136646 

162599-162935 

53315-53472 

63140-63319 



150411-151484 



TABLE 35A: About 323 genes upregulated in hypersensitivity pneumonitis relative to idiopathic pulmonary fibrosis or non-specific interstitial pneumonitis 

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 90th percentile of HP AIs divided by 90th percentile of IPF AIs, where the 15th percentile of normal tissue AIs was subtracted from both the numerator and denominator.

R2: 90th percentile of HP AIs divided by the median of IPF AIs, where the minimum value for the numerator and denominator was set to 50.
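The two ratios defined above can be sketched as follows. This is a minimal illustration, not the procedure actually used to generate the table: the function names are hypothetical, the input lists are assumed to hold per-sample AI values for one probeset, and the linear-interpolation percentile is an assumed convention because the legend does not name one.

```python
from statistics import median


def percentile(values, pct):
    """Percentile by linear interpolation between order statistics (assumed convention)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def ratio_r1(hp, ipf, normal):
    """R1: background-subtracted 90th percentile of HP over that of IPF.

    The background is the 15th percentile of the normal tissue values and is
    subtracted from both numerator and denominator. A zero denominator is not
    handled here and would raise ZeroDivisionError.
    """
    background = percentile(normal, 15)
    return (percentile(hp, 90) - background) / (percentile(ipf, 90) - background)


def ratio_r2(hp, ipf, floor=50.0):
    """R2: 90th percentile of HP over the median of IPF, each floored at 50."""
    numerator = max(percentile(hp, 90), floor)
    denominator = max(median(ipf), floor)
    return numerator / denominator


if __name__ == "__main__":
    hp = [100 * i for i in range(1, 12)]    # hypothetical HP values
    ipf = [10 * i for i in range(1, 12)]    # hypothetical IPF values
    normal = [10 * i for i in range(0, 11)] # hypothetical normal tissue values
    print(ratio_r1(hp, ipf, normal), ratio_r2(hp, ipf))
```

Flooring both terms at 50 in R2 keeps very low denominators from inflating the ratio, which is presumably why the legend specifies a minimum value.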



421553 
424325 
417957 



412610 
452194 
447709 



UnigeneID Unigene Title



NM_006433 
NM_014479
BE244373
AA456454
AF188625



AI694413 
U97145 
AW810204 



Hs.336908 
Hs.94814 
Hs.334489 



Target Exon
granulysin

ADAM-like disintegrin protease, decysin
nuclear receptor subfamily 4, group A, m
cell division cycle 2-like 1 (PITSLRE pr
phospholipase A2, group IID
HLA-G histocompatibility antigen, class
ESTs, Weakly similar to androgen recepto
fatty acid binding protein 6, ileal (gas
olfactory receptor, family 2, subfamily
GDNF family receptor alpha 2
gb:MR4-ST0125-021199-017-d08 ST0125 Ho



ESTs 



=in (Fl 



R1 



R2
415452 R52692

447028 AI973128

412394 AW984150

450185 AA007235

431093 AB031038

444090 S69115

413682 BE156991

441320 AI768724

456766 R87310

420340 NM_000734

459721 AI299050 



Hs.7740 
Hs.97087 
Hs.143835 



gb:PM2-HN0008-170300-001-h09 HN0008 Homo 




oxysterol binding protein-like 1
CD3Z antigen, zeta polypeptide (TiT3 com
gb:qn14d12.x1 NCI_CGAP_Lu5 Homo sapien
Target Exon



408544 AW293825 

413454 BE141162 

444404 M31525 

418460 M26315 

436063 AK000028 

429212 NM_001504
400712 

417929 R27219 
403478 

418747 AJ249977 

429712 AW245825 

451668 Z43948 

414020 NM_002984

456057 AA947457 

444346 AI142274 

418918 X07871 

451318 AA029888 

458935 Y16521 

417105 X60992 

408219 BE061111 

420137 AA306478 

443711 N67861 

423234 AA323534 

416975 NM_004131

425280 U31519 
405827 

406909 L20777 

437295 AW779318 

424281 AA766243 

430413 AW842182 

423901 AA333006 
405075 

457423 AK000642 



ENSP00000175238":A 
ESTs 






gb:MR0-HT0076-021299-001-<J03 HT0076 Homo
major histocompatibility complex, class 
Hs.85258 CD8 antigen, alpha polypeptide (p32) 

ribosomal protein S24 
Hs.198252 G protein-coupled receptor 9
Target Exon

Hs.74647 Human T-cell receptor active alpha-chain

NM_022342:Homo sapiens kinesin protein 9 



Hs.88049 
Hs.211914 
Hs.326444 
Hs.75703
Hs.135560 

Hs.89476 

Hs.95071 

Hs.24812 

Hs.81226 

Hs.254211

Hs.95327 

Hs.49390 

Hs.296162

Hs.1051 

Hs.1872 



451404 AA460775 
424463 AW195353 
400107 



404088 

414991 C17898 

429073 AA446157 

426274 D38122 

431094 AW972276 

424899 AL119387

419711 C02621 

459019 AA017156 
405453 
402516 

457365 AA577297 

407928 NM_002262

436553 AW407157 
406266 

419409 AW297831 

435028 AW193035 
404696 
403533 

411573 BE064863 

424148 BE242274 

419833 AA251131 



ENSP00000233627*:NADH-ubiquinone oxidore

cartilage acidic protein 1 

small inducible cytokine A4 (homologous 

ESTs, Weakly similar to T43458 hypotheti 

ESTs 

CD2 antigen (p50), sheep red blood cell 
CDP-diacylglycerol synthase (phosphatide 



Hs.257168 
Hs.201227 
Hs.121036 
Hs.345792 



AD037 protein 

granzyme B (granzyme 2, cytotoxic T-lymp 
phosphoenolpyruvate carboxykinase 1 (sol
Target Exon

gb:Human MHC class I HLA-G gene (HLA-A33

gb:oa13M1.s1 NCI_CGAP_GCB1 Homo sapiens
small inducible cytokine A5 (RANTES)
gb:EST37064 Embryo, 8 week I Homo sapien
Target Exon

hypothetical protein FLJ20635 
Target Exon 



Hs.119903 ESTs



Eos Control 

NM_021096:Homo sapiens calcium channel, 
Target Exon 

gb:C17898 Human placenta cDNA (TFujiwara



Hs.116195
Hs.119062
Hs.159282
Hs.40719



hypothetical protein KIAA1164 
NM_005748*:Homo sapiens YY1-associated f 

EST 

killer cell lectin-like receptor subfami 
immunoglobulin lambda locus 
Target Exon 

hypothetical protein MGC2556 
ESTs 

NM_013443:Homo sapiens CMP-NeuAC:(beta)-
Target Exon

gb:RC1-BT0313-110300-015-f06 BT0313 Homo
integrin, beta 7



WO 03/042661 



PCT/US02/36810 



423196 AK001866 

426416 AW512744 

449317 AW293413 

424321 W74048 

422109 S73265 

424218 AF031824 
406303 

438676 AA813745 

404240 

404056 

425508 AA991551 

429819 AL133011 

416941 BE000150 

446998 N99013 

409153 W03754 






Hs.125139 
Hs.169824
Hs.132906 
Hs.1765 
Hs.1473 
Hs.143212 



Hs.123446 ESTs 



hypothetical protein FLJ11004 
killer cell lectin-like receptor subfami 
19A24 protein 

lymphocyte-specific protein tyrosine kin 

gastrin-releasing peptide

cystatin F (leukocystatin) 

C16000922:gi|7499103|pir||T20903 hypothe



422994 
424517 
433671 
412116 



427527 
431574 
436485 



N75217 
BE271188 
X72755 
AW891802 



421379 Y15221 



426451 
447131 
414512 
426752 
444793 



Hs.50813 
Hs.90708 
Hs.257846 
Hs.155975
Hs.77367
Hs.296276
Hs.137447
AW138797 Hs.132905
AW402166 Hs.784
NM_003726 Hs.19126
AW402155 Hs.3003
AL045294 Hs.136245
AI809057 Hs.153261
AW572659 Hs.261373
X59135 Hs.156110
NM_002104 Hs.3066

Hs.103982
BE382657 Hs.21486
L01087 Hs.211593
X54870 Hs.74085
NM_000878 Hs.75596
M26041 Hs.198253
X57522

AI908165 Hs.169946
NM_004585 Hs.17466
AL044336 Hs.5831 



ens major histocompat
Target Exon 

Homo sapiens, Similar to RIKEN cDNA 2310 
Homo sapiens mRNA; cDNA DKFZp434P201 (fr 
niban protein

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 
hypothetical protein FLJ20022 
granzyme A (granzyme 1, cytotoxic T-lymp 



19A24 pn 
Epstein-E 



ns cDNA FLJ12169 fis, clo



us induced gene 2 (lymph 
dated phosphoprotein of 
CD3E antigen, epsilon polypeptide (TiT3 
ESTs, Weakly similar to T17227 hypotheti
immunoglobulin heavy constant mu 
hypothetical protein dJ43401 4.3 



granzyme K (serine protease, granzyme 3; 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans
protein kinase C, theta

DNA segment on chromosome 12 (unique) 24
interleukin 2 receptor, beta
major histocompatibility complex, class
transporter 1, ATP-binding cassette, sub
GATA-binding protein 3 (T-cell receptor




421924 
427307 
444929 
439237 
418196 



U44385 

BE513934 Hs.1583

BE266484 Hs.82916 

M353776 Hs.901 

NM_002407 Hs.97644

BE616731 Hs.80645 

R81864 Hs.205103 

BE514514 Hs.109606 

AF117947 Hs.174795 

AI685841 Hs.161354

AW408158 Hs.318893 

AI745649 Hs.26549 

AA321649 Hs.2248 

R98865 Hs.11135



neutrophil cytosolic factor 1 (47kD, chr 
chaperonin containing TCP1, subunit 6A (
CD48 antigen (B-cell membrane protein) 




430308 BE540865 



413472 
425762 
412472 
451406 



AW273261 Hs.216292 

AI076765 Hs.269899 

AI952797 Hs.10888

AI765697 Hs.13231

M16714 Hs.89643

M12529 Hs.159401
N80139 
AF088033 

AF151103 Hs.112259 

BE241624 Hs.82401 

M90391 Hs.82127 

AB029488 Hs.272100 

BE242870 Hs.75379 

BE244076 Hs.159578 

AW975398 Hs.293836

AI694320 Hs.6295 



ESTs, Weakly similar to T17248 hypotheti



449835 AW979300 Hs.293813 ESTs 






424243 
457500 
424541 
439039 
449523 
457718 
415198 
431594 
432656 
422426 
414372 
427247 






AI656707 Hs.48713 

NM_000579 Hs.54443

F18572 Hs.22978 

AW009480 Hs.943 
AI823999 

NMJM0245 Hs.3075 

W79117 Hs.58559 
M1 43554 

AW504221 Hs.174103 

W57554 Hs.125019 

X57809 Hs.8997 

AA234538 Hs.293584

NM_002118 Hs.1162

H10709 Hs.269524 

AI638627 Hs.105685 

R65954 Hs.334873 

W25140 Hs.110667



Target Exon

ESTs, Weakly similar to T20593 hypotheti
ESTs, Moderately similar to ALU5_HUMAN A
ESTs, Highly similar to els Golgi-locali
protein kinase, interferon-inducible dou
ESTs, Weakly similar to A56194 thromboxa



433231 
408847 
405441 



431433 
422934 
409799 



427648 
418870 
437479 
425345 
416030 
419886 
443951 
414875 
412471 
428782 



424168 
431723 
426437 
446566 



430998 
419092 
451864 
421140 



404277 
413497 
441835 



ESTs 



ne [C-C 



MHC class II transactivator 
ESTs 

gb:zo65a02.r1 Stratagene pancreas (S3720
integrin, alpha L (antigen CD11A (p180),
lymphoid nuclear protein (LAM) mRNA 



AB040926 Hs.143552 



AW392550 

X65018 Hs.253495 

BE244189 Hs.122492

D11928 Hs.76845 

X03068 Hs.73931 

T82331 Hs.182278

AW410235 Hs.75348 

AA743415 Hs.291368 

L20971 Hs.188 
AW768399 

AI376722 Hs.180062 

AF147204 Hs.89414 

R61866 Hs.101277 

AU077297 Hs.155894 

H15261 Hs.21948 

AA251562 Hs.146168 
F13272 

H42679 Hs.77522 

M63193 Hs.73946 

X12830 Hs.193400 

M26301 Hs.2253 

AA932689 Hs.233304 

M67466 Hs.814 

AW175787 Hs.334841

N32536 Hs.42645 

U97602 Hs.87497 

AB011086 Hs.129739

L29277 Hs.321677

AW058350 Hs.16762

BE076537 Hs.169895

H95741 Hs.17914

C18825 Hs.29191 

AW772298 Hs.21103 

AA100847 Hs.5978 

AF111713 Hs.286218 

AF128847 Hs.204038 

J05581 Hs.89603 

N20370 Hs.69547 

AA298741 Hs.102135

NM_014767 Hs.74583

BE268912 Hs.14601

AW972300 Hs.118110

T95594 Hs.187435

AA907305 Hs.36475

BE177661

AB036432 Hs.184

M13560 Hs.84298 
AA837085 

U29344 Hs.83190 

AI582743 Hs.94953 

H98071 Hs.4055 



Target Exon

proteasome (prosome, macropain) subunit, 
gb:xrti38b01.x1 NCI_CGAP_GC6 Homo sapiens 
surfactant, pulmonary-associated protein 
hypothetical protein



major histocompatibility complex, class 
ESTs, Weakly similar to CGHU5C collagen 
proteasome (prosome, macropain) activate 
Eos Control 
ESTs 

phosphodiesterase 4B, cAMP-specific (dun 



chemokine (C-X-C motif), receptor 4 (fus 
ESTs 

protein tyrosine phosphatase, non-recept 



endothelial cell growth factor 1 (platel 

NM_014207:Homo sapiens CD5 antigen (p56-
complement component 2 
ESTs, We n lo I 8022 hypot i 
major histocompatibility complex, class 
;el um binding olein 1 
solute carrier family 16 (monocarboxylic
butyrophilin, subfamily 3, member A2 
KIAA0514 gene product



Homo sapiens mRNA; cDNA DKFZp564B2062 (f 
ubiquitin-conjugating enzyme E2L 6
membrane-spanning 4-domains, subfamily A
epithelial membrane protein 2 
Homo sapiens mRNA; cDNA DKFZp564B076 (fr 
ESTs, Highly similar to AF174600 1 F-box
junctional adhesion molecule 1 



KIAA0275 gene pre 
hematopoietic cell-specific Lyn substrat
bone marrow stromal cell antigen 2 



NM_019111*:Homo sapiens major histocompa
gb:RC1-HT0598-020300-011-h02 HT0598 Homo
advanced glycosylation end product-speci 
CD74 antigen (invariant polypeptide of m 



Homo sapiens, Similar to complement comp 
chromosome 21 open reading frame 50 






421071 
431779 
416047 
406826 
426836 
415661 
406824 
420679 
443071 
418090 
430250 
406825 
436906 
422241 
408279 
411372 
444342 
406906 
451558 
432805 



447023 
421481 
406868 
412819 
418253 
431243 
448133 
407112 



AW971178 

BE439894 

AW516005 

N41720 

AF057307 

AW515961 

X57152 

AL080021 

U57059 

NM_016929 

AI982529 



Hs.104476 
Hs.268571 
Hs.78991 
Hs.84298 
Hs.172684 
Hs.78575 
Hs.84298 
Hs.99853 
Hs.8986 
Hs.83429 
Hs.283021 
Hs.84298 
Hs.181244 



AI147861 
NM_014398
Z25424
NM_001089



Hs.170121
Hs.44095



Hs.3107 

Hs.177582 

Hs.83623 

Hs.9641 

Hs.288472 

Hs.6101 

Hs.278613 



414552 AL036058 



X03100 Hs.914 
M361258 Hs.237868 
M447849 Hs.288660 
X54486 Hs.151242 
AW966405 

NM_006762 Hs.79356 
M33680 
AA356764 
AW391972 
M505445 
T25829 
AA215539 
U46455 
AA723157 
AA070801 
W17066 



Hs.54457 
Hs.17109



406722 
438091 
407018 
412896 



AW804157 

T62849 

U24578 



ESTs, Weakly similar to CGHU1E collagen 
apolipoprotein C-I

DNA segment, numerous copies, expressed 
CD74 antigen (invariant polypeptide of m 
vesicle-associated membrane protein 8 (e 
prosaposin (variant Gaucher disease and 
CD74 antigen (invariant polypeptide of m



l,q: 

r (ligand) superfami 
chloride intracellular channel 5 
CD74 antigen (invariant polypeptide of m 
major histocompatibility complex, class
protein tyrosine phosphatase, receptor ;
Homo sapiens, clone MGC:12617, mRNA, co
low density lipoprotein receptor (famil
similar to lysosome-associated membrane
gb:H.sapiens protein-serine/threonine ki
ATP-binding cassette, sub-family A (ABC1 
CD97 antigen 



nuclear receptor subfamily 1, group I, m 



ESTs, Weakly similar to UBPF_HUMAN UBIQU 
hypothetical protein MGC3178
interferon, alpha-inducible protein 27
gb:zv/20f1 1.s1 Soares ovary tumor NbHO'I H 
major histocompatibility complex, class 
adipose specific 2 

Human mRNA for SB class II histocompatibi
interleukin 7 receptor 

serine (or cysteine) proteinase inhibito 
EST 

Lysosomal-associated multispanning membr
CD81 antigen (target of antiproliferativ
integral membrane protein 2A
KIAA1324 protein

immunoglobulin heavy constant gamma 3 (G 
FK506 binding protein precursor 
Homo sapiens cDNA FLJ11606 fis, clone HE
syndecan 4 (amphiglycan, ryudocan) 
folate receptor 1 (adult) 

ESTs, Weakly similar to ALU7_HUMAN ALU S 
nuclear receptor subfamily 1, group I, m 
nuclear receptor subfamily 1, group I, m
ATP-binding cassette, sub-family A (ABC1
Homo sapiens SNC73 protein (SNC73) mRNA,
nuclear receptor subfamily 1, group I, m
NM_018955:Homo sapiens ubiquitin B (UBB) 
major histocompatibility complex, class 
membrane-spanning 4-domains, subfamily A 



NM_001928 Hs.155597
M34516 

D49441 Hs.155981 

BE184455 Hs.251754 



secretory leukocyte protease inhibitor (



Unique Eos probeset identifier number 



408544 
413454 
444404 



683260_1
1515217_1
16136_1



BI009308 BI009893 BF922023 BF922909 BF922913 BF922096 BF957733 BE701791 AA456454 AA579876 BF933710 AA091294 BI007291 
AW905577 AW975593 AA71 3730 AW8367B1 AA666384 M551 1 06 BF594606 AI082382 AI955808 AI679895 A1679386 BF435555 AA585369 
AA551351 AA595822 AA565188 BF808855 AA584921 N86077 AA601031 AA633188 AA514764 M454562 M551297 AA936109 BI009389 
AW897806 BE815442 BF739374 BI00931 0 BF925422 BF933709 BF922034 BF925465 BI009680 

A1694413 AW9S4700AI912946 N73511 M082035 i f £ 2 W241 B9 W241 82 AI71 971 8 AA024658 AW8 1 0 1 20 AW01 5394 T79755 AA988043 
AI709339 

AW810196 AW810555 AW810507 AW81 0204 AW81 0619 AW810534 

AW947794 AW947793 AW947802 AW947798 AW9477S2 AW984150 AW9341 66 AW9841 67 AW984168 AW984179 AW9841 34 AW9841 60 
AW984180 AW984194 AW984202 AW984190 
BE156943 BE157375 BE156965 BE156949 BE156956 

AI346734 AI377971 BG193341 BG548375 AA928353 A1768724 BG215700 AA449370 BI4621 57 BI060283 BG677508 AA318802 BG719160 






WO 03/042661 PCT/US02/36810 



436063 5483_1

444346 47906_14

424281 892055_1

423901 887384_1

414991 1785136_1

411673 1096744_1

452203 2630_1



458332 1139685_1

431594 1236764_1

414372 484211_1

459644 662385_1

400269 2726_1

430015 713_2






413497 1518002_1

434747 117643_1

406782 0_0

419200 9531_1



AK000028 AA494483 AI298674 AA720773 AV761 529 A '68-1670 A1936202 AW294235 D61552 BF881184 AV71 1384 N27154 AI926970 AV734970 

N40094 N2B596 AA884747 AA512890 BG436593 

AI147991 AI142274 AI198553 

AA338252AA338213 

AW962691 AA333006 AA332289 

D78831 C17898D78863 

BF330730 BF350539 BE153665 BE065062 BE064650 BE064863 BF330763 BE153820 BE064737 BE155079 BE054651 AW856751 AW856622 
BE054691 BE153674 BE153698 BE064730 BE153536 

BC014081 NM 000593 X57522 L21208 L21 207 L21206 L21205 L21204 AL561404 AL546423 AL560492 AL556B82 AL541575 AL550654 BI823519 
BI770023 AL564969 BI489906 AI304693 AW295947 BM146642 X57521 BG820143 BE898390 F06770 F12630 BM423610 AL561518 BM009470 
BG742981 AA279685 AA847441 M313737 BF172639 BF097216 BF914190BF903647S70277 AI569694 AW073296 AI361433 AA554644 
M487429 BE858232 AA838610 AS5391 1 4 A171 9375 AI8291 29 BG057675 AI423422 AU158860 BE300655 AW170777 AA586956 AL571 889 • 
AL556850 AL576404 AL582800 BI256544 BF342301 BG875994 AA054458 AA353161 AI940434 BE816522 AL577636 AI479650 AW1 50377 ' 
AU1 54395 AW951 271 AI032220 AI81 9778 AI345733 AW771 1 50 AW51 2525 AI249904 AA279809 AI352549 AW51 251 7 BG056280 AA521 222 
BE271141 AL581932AL541575 BI819184 AV660190 AL556475 A1620020 AW089883 AW079179 Z21518 AA687601 F04651 AI783961 T57198 
AI433367 T78652 AL554968 AA355648 AL58261 9 BE874601 BF804669 AL574458 BM145502 AI266514 AI538823 AI475626 AA948210 AA884054 
AA487637 AA031844 AA535221 AW794256 AW361447 BE788505 AI682892 M830989 AA862356 AA653084 BM0091 54 AA135727 H05927 
H23433 R42244 N79997 AW366665 AW366601 AA678742 AL556474 AA135770 BE774050 BF914200 H88457 AA627746 BI560216 BI753586 
AI000341 AI766341 AW873274 

AI823999 AA970060 AA5081 76 AW972585 AI873427 AW972389 

BI093452 AW970B65 BG1 18285 AA559075 AA492132 AW75314D AA213770 AA143554 

W03900AW197203AW753300 

X65018 BC022318 NM.003019 BE465060 AI732255 BF446634 AI820677 AI002217 AI924488 BI821373 BI770406 BI823937 BI820255 BI489532 
BG482911 AA617783 AI807697 AW205576 T94427 AA487101 T94513 BI819407 BI822450 BI820613 BI824619 BG542824 BG537862 
BC017171BC012195NM_00712b/Fi 2 0.1 )7b8 BM474865 BG754806AU124376 BG757203 BG764420 BG775028 BG824418 

BM045810 AU120387 BG770238 BG586740 BG91 3323 BI759980 BG395998 BM048875 BE881 070 BE313B39 BE879144 BM309834 AW245847 
AI770171 BF196861 BE856897 AA463876 AI375927 AA648810 AA9481 93 AA49091B AI459893 AI458188 AI2404D3 AI1 91843 AI131029 AW768399 
AI365196AW337984AW026150BE466591 BE674599 AI818438 AA772197 A1651927 AW151 143 BI198825 BG819083 BM458764 BE903567 
BE732715 BM043200 BE900263 BE900706 BE731097 BE390023 BG875384 BF996406 BF988930 BM475542 AW246215 BE501897 BE903610 
BE561530 BE560537 BE903782 BE732947 BI227204 BG761305 BE252542 BE391848 BE382475 BG008253 BI547991 BI459099 BE391391 
BE259420 BE298109 AW245422 AI423847 AI91 461 8 H 80534 BE301004 AL531791 AI435581 BF7931 12 AL577303 AA373265 BE746965 BF743630 
BE879296AI359493 BM018598 AI6892B0 AW072450 F20201 AW 151 405 AW51 7572 AA773468 BG259694 BE391153 BG621529 AI421728 
BG767231 BM462953 BG340524 W52648 AA1 1 3434 BE785431 BI041 981 BG832385 BG2531 68 BG759470 BF369329 BF981332 BE25941 8 
BE785738 BI091658 N72512 W58732 W85690 BG958989 AI20520B H19721 W17051 W77958 BI262010 AA844319 W74143 W72214 N85194 
BE734033 BG1 64099 AA931 069 F1 3645 R4 1 394 AK025750 BG1 80977 BE349455 AA81201 8 AA740241 AI027722 AM 50356 AA886395 AW977B27 
BE220225AA884082AW518114AI243844AA809493AA481029 AA825718 AI347866 AI431670 AA81 4436 AI251 109 R07704 AA765B0B AA724593 
AI91 8399 A1537550 AA49 1103 AW0081 88 R07703 AA9891 20 AA746235 AW028983 AA789 1 02 AU 1 85751 AW971465AA489681 AW971893 
AW612086 BE077936 BI860809 BE002750 BG746251 BE962912 BM454584 AL134894 BF104082 H80591 

AI334106 R63583 AI028079 AI14C098 AI911625 AI890537 F34815 T65959 N40935 W5276B AA854747 AA861 945 AA878472 AA778270 W32249 
AA026061 W52662W15352 W79670 W95384T94283 AA002155 R82052 BE825493 BE825520 
BE177661 H06215 BE144709 BE144829 



AA430373AA968771 

BF036043AW190446BG194731 AW662036AI445021 BE937550 AW818972 AW393132AA834685 BF112058 AV721582 H16423 AI270157 
AI857345 AA937302 AW818444 BE929780 BG498678 BF155010 BI593271 BI599811 BE161728 AW578737 AW753711 AW379707 AW381918 
BG506608 AW028637 AW994240 BF887392 BF790073 AW381 624 AV727105 BF439618 AA443174 AI018009 N42850 AW573242 AI41 7258 
AA463483 AI6761 31 Al 1671 70 AA836627 AA443828 AW592922 AA235129 AA730278 AW439052 AW474332 BI043239 AW474342 BG708553 
AW362423 BF090028 BE827256 R16550 R39478 R39479 R94358 BG540916 BM314745AA251087 D54231 D55274 BF085805 D31589 AW966405 
AW994425 D81879 BE093545 AW901 107 AA383529 BI021 552 R5B420 N39976 AA573281 H82595 AA234955 BE093539 AW367006 BF358697 
BF366318 AA563856 BE702099 BF035969 AI267384 AI257232 BE34832D AA621574 AA85121 2 BF083343 BF083341 AV745131 D53074 
AW954476 AW954472 AA376835 AV724531 D53053 C14928 AA093287 AA062B38 BG483558 BE940050 AA765954 T70171 BE938775 BE940057 
D53502 AW373300 AL1 18798 BW12872B AA19341 1 AW444709 AW952455 AI887B12 BF431948 BI496876 AI264159 BM12B481 AI624657 
AI689301 AI969467 AA851685 AA251595 AA625761 AA872090 AI326790 AA328366 BE827416 R75951 D56918 R68122 BE827384 AL1 18797 
AI184164 AA164411 BI495332 BE8581 13 AI863860 H00660 T69849 AW780389 C14667 BE934995 BI018652 R92801 AA164410 H00752 
AW373305 AW373299 AW373302 

BM475665 BE64491 7 AW770789 AW952971 N64863 BM2B3259 AI224545 AI1 84866 N691 1 4 AW51 8902 AI4401B9 AA809472 AV654440 
AA281642 AU185230 AW337382 AI872923 Ai5371 1 3 N73882 T83378 H63731 BF671764AW897624AI811204AA344646BE009112 BG899664 
H91240 R60548N41701 

AK054860 AV6521 98 AV652192 AV652 1 38 AV6521 27 AV652194 BE935919 AV65201 7 AV651 995 AV551 548 AV6460B3 AV651985 AV5461 84 
AV646179AW880409AA345002 BF155189 BE068931 X56197 AL603014 AW953629 BM263546 BE550772 AA701084 AI681352 AA358589 
AW938841 BF438147 W05391 H7531 3 BF3261 85 AV646335 AV651589 AV64B340 AV651992 AV646384 AV64B364 AV687497 BF155183 
AV546370 AW797876 AI906821 X56196 BE833835 AA628440 BE833808 BF224205 AA709126 BE673807 AI923885 AA947932 AI276125 AI185720 
AW510698 AA987230 BE457708 AW898628 AW898544 AI146984 AWC43642 AI288245 A1186932 AI635262 AI139455 AI298739 AI813854 
AI024768 BE699445 BE699444 AI707807 D52B54 AI214518 A 004723 AI698085 AW087420 A1565133 AA845571 AW898622 BF110144 AW513280 
AI061126 BF362770 AI268939 AI435818 BF475313 AI024767 BE174213 AA757598 AA513019 AA902959 A1860794 AI334784 BF10841 1 
BM310532AW513771 AI951391 AI337671 BF095606 BF095601 BF095468 AW890091 BF095753AW243400AW898607 AW898616 BF362762 



AI299190 W40186 BE174210 BF939091 BF434180 AW579001 T55S62 H01811 T52522 BF945037 BF955938 D54679 D53933 R57100 BG925552 
BF999056 R83430 Z29922 T85791 W03942 H63289 A1091537 BF086583 AA345570 H48870 H80720 T83523 BI039626 BI037700 R00353 
BF155184 N98343 N79072 H01812 T55581 



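Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position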
Minus 80413-80673 
Plus 175792-176144 
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401447 
405097 
400712 
403478 
405827 
405075 



403533 
406303 
404240 
404056 
402736 



171191-171360 



7109593 
7770506 
7528342 



8101229 
9958257 
8569218 
7656675 
9798099 
7528342 
9800109 
8076874 
8575868 
5002624 
3548785 
9212044 
1054740 
7408124 
8118752 
1834458 
7143420 



116458-116564 
10279-10972 
124680-125321 
2570-2731 

104382-104527,106136-106372 

26424-26596,28854-28987 

5-330 



195342-195511 
2365-2518 

60037-60144,62675-63081 

162922-163658 

173522-173786 

116132-116407,116653-116922 

75843-76980,77146-78263 

66876-67010 

118677-118807,119091-119296,121626-12182
100952-101283 

118343-118684,120720-121013 

91665-91946 

14260-14537 



TABLE 36A: About 52 genes upregulated in no 
Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number
Unigene Title: Unigene gene title

R1: 90th percentile of NSIP AIs divided by 90th percentile of HP AIs, where the

R2: 90th percentile of NSIP AIs divided by 90th percentile of IPF AIs, where the mi



Pkey 



UnigeneID Unigene Title



406785 
449048 
444179 
430223 
451099 

418512 
457311 
402274 
453222 
447261 

406714 
418333 
404090 



Hs.204238 lipocalin 2 (oncogene 24p3) (NGAL) 
ESTs 

Hs.99949 prolactin-induced protein 

Hs.154424 deiodinase, iodothyronine, type II

Hs.103253 perilipin

Hs.112885 spinal cord-derived growth factor-B

Hs.31110 ESTs, Weakly similar to MAGE-B4 [H.sapie

Hs.120721 ESTs 



AW503785 Hs.73792 complement component (3d/Epstein Barr vi 

AI809163 Hs.9908 nitrogen fixation cluster-like 

R67516 ESTs 

AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 

AW969847 Hs.292718 ESTs.V afcl, mila I LTJ.l r. N RUTIN 

BE247354 Hs.16400 ESTs, Weakly similar to KIAA1435 protein 

AW204429 Hs.155033 ESTs

AF002246 Hs.210863 cell adhesion molecule with homology to

' Hs.71721 ESTs 

Hs.26799 DKFZP564D0764 protein 

gb:nk10d03.s1 NCI_CGAP_Co2 Homo sapiens 



AI568801 

AA588061 
Z45051 
W35132 



se indue 



Hs.267442 ESTs 
NM_002514 Hs.235935 nephroblastoma overexpressed gene
13 receptor, alpha 2 



R52795 Hs.25954 
AA830599 
AW498974 



Homo sapiens, clone MGC17333, mRNA, com 



diacylglycerol kinase, zeta (104kD)
niis/oi i ns. i/<£/oj Homo sapiens chromosome 19, BAC 41 195 (C 
C19000498*:gi|4567 1 79|gb|AAD23607.1 [AC00 
AA033929 Hs.19156 ESTs 
NM_006691 Hs.17917 
AW292593 Hs.334907 
AI219304 Hs.266959 hemoglobir 
W92113 gb:zh48e01.ri 
Target Exon 

AA342329 Hs.115920 Homo sapiens cDNA: FLJ22316 fis, clone K
X00442 Hs.75990 haptoglobin
AF088033 Hs.159225 ESTs

NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3;

AW975398 Hs.293836 ESTs 

AW167668 Hs.279772 brain specific protein 

R20893 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A

R34657 Hs.80658 uncoupling protein 2 (mitochondrial, pro 
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424310 AA338648 Hs.50334 testes development-related NYD-SP22 

422109 S73265 Hs.1473 gastrin-releasing peptide 

420440 NM_002407 Hs.97644 mammaglobin 2

418196 AI745649 Hs.26649 KIAA1708 protein

418707 U97502 Hs.874S7 butyrophilin, subfamily 3, member A2 

419231 AL046294 Hs.136245 ESTs, Weakly similar to T17227 hypotheti

446608 N75217 Hs.257846 ESTs 

418918 X07871 Hs.89476 CD2 antigen (p50) , sheep red blood cell 

412610 X90908 Hs.74126 fatty acid binding protein 6, ileal (gas 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

435375 130020_1 AI733610 AI049989 AA678769
414009 438978_1 BE221268 R67515 AV730582 R67516
406785 0_0 AA588061
439134 2581476_1 AA830599 AA970659 AA883802

418512 12225_6 BM046773 AA224297 T33786 T08951 T09274 T08592 T33936 AA350905

418333 73080_1 AF264624 AW668618 AV731446 R93353 AA584550 AV732728 BF802614 BF434359 AA077092 BI027317 AA199812 AW629027 AA831618 AI124782
AA765804 AA055698 AA677404 AA055366 AA889402 AA765533 BE503126 BE457367 AW139964 W81697 AI887846 W81696 AA447817 AA447667
F13631 BF055573 AW268271 AW088477 BF677839 AL601859 AW502118 AW502624 AA574189 BI020104
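The three-column layout described in the legend above (Pkey, CAT number, Accession) can be read mechanically. The following is a minimal sketch, assuming each record sits on one whitespace-separated line with the probeset key first, the gene cluster number second, and one or more Genbank accessions after that; the names `CatRow` and `parse_cat_row` are hypothetical and are not part of the original disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CatRow:
    pkey: str              # unique Eos probeset identifier number
    cat_number: str        # gene cluster number
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_cat_row(line: str) -> CatRow:
    """Split one table row into its three logical columns.

    Assumption: columns are separated by whitespace and every token
    after the second one is an accession number.
    """
    tokens = line.split()
    if len(tokens) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return CatRow(pkey=tokens[0], cat_number=tokens[1], accessions=tokens[2:])


# Sample row adapted from the table above.
row = parse_cat_row("439134 2581476_1 AA830599 AA970659 AA883802")
print(row.pkey, row.cat_number, len(row.accessions))
```

Rows whose accession list wraps onto following lines would need to be joined before parsing; that step is left out of the sketch.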



TABLE 36C: 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402274 2935596 Plus 5604-6527 

404090 9967450 Minus 100815-100966 
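The nucleotide-position column lists one or more predicted exons as comma-separated `start-end` ranges. A minimal sketch of reading such a field follows; it assumes the coordinates are inclusive at both ends (the legend does not say), and the function names `parse_nt_position` and `total_exon_length` are hypothetical.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Turn a field such as '60037-60144,62675-63081' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in range {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


single = parse_nt_position("5604-6527")              # row for Pkey 402274
double = parse_nt_position("60037-60144,62675-63081")
print(single, total_exon_length(single))
print(double, total_exon_length(double))
```

Fields that were cut off at the column width in the printed table (a final range with a shortened end coordinate) cannot be recovered this way and would need the original sequence record.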



TABLE 37A: About 205 genes 



ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title

R1: 90th percentile of normal lung AIs divided by the median of IPF AIs, where the



UnigeneID Unigene Title



454229 
432128 
421218 



406714 
406673 
416539 
418021 
409385 
450847 
404518 
413951 
407570 
456525 
429509 
445769 
414002 
425571 
423168 
401234 
402181 
403479 
435424 
402911 
442195 



AW957744 
AA127221 
NMJB0499 



Hs.278469 

Hs.295502 

Hs.72912 

Hs.553 

Hs.66 



AI219304 Hs.266959 



lacrimal proline rich protein 



solute carrier family6(neu'ot ar.smitte 



Y07909 
M15881 
AA071267 
NM_003155

AW051200 

Z19002 

AW468397 

AI741471 

NM_006732

AJ007292 



AW083883 
NM_001430



Hs.198253
Hs.79368 
Hs.1137 

Hs.25590 



Hs.204354 
Hs.23666 
Hs.75678 
Hs.158306
Hs.124940 



Hs.37896 Homo: 



epithelial membrane protein 1
uromodulin (uromucoid, Tamm-Horsfall gly
gb:zm61g01.r1 Stratagene fibroblast (937 
stanniocalcin 1 

CD33 antigen (activated B lymphocytes, i 
natriuretic peptide precursor A 
zinc finger protein 145 (Kruppel-like, e
S100 calcium-binding protein A8 (calgran
ras homolog gene family, member B 



viral oncogene h 
ephrin-A2 

GTP-binding protein 
mitogen-activated protein kinase 8 inter
Target Exon 

NM_007064:Homo sapiens serine/threonine
sapiens cDNA FLJ13510 fis, clone PL
NM_021158*:Homo sapiens protein kinase d
endothelial PAS domain protein 1 
Eos Control 



tor and denominator was set to 50 
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438564 
413956 
431319 



C05145 

M3B1553 

AI821351 

M873350 

AF124368 

AA379454 



401590 
403645 
403376 
447966 



Hs.75636 
Hs.198253 
Hs.193133 
Hs.302232 
Hs.306551 



myosin light chain 2a 
' histocompatibilit] 
Weakly similar t* 



NM_002675:Homo sapiens promyelocyte leu 



Hs.154073 gb:EST923B6 Skin tur 



Target Exon
Target Exon
NM_024513*:Homo sapiens FYVE and coiled
Target Exon 

ESTs, Weakly similar to Homolog of rat Z 
CD69 antigen (p60, early T-cell activati 
small inducible cytokine subfamily A (Cy 
NM_006856*:Homo sapiens activating trans 



Target Exon 

NM_012448*:Homo sapiens signal transduce
ENSP00000236574*:Hypothetical 21.3 kDa p




420575 
457275 
432559 
402483 
416069 
445445 
436232 
418773 



404026 
400881 
430314 



449293 
416157 
446122 



BE263301 
AA463422 
AW452948 

R37101 

AF238870 

AA707006 



gb:MR1-BT0371-050500-009-a1 2 BT0371 Homo 
gb:C16931 Ck ' ' ' 
Hs.168357 T-box2 
Hs.98005 ESTs 
Hs.323817 DKFZP547E1010 protein 
Target Exon 

Hs.99029 CCAAT/enhancer binding protein (C/EBP), 
Hs.257631 ESTs 



Hs.20982 
Hs.275706 
Hs.187863 
Hs.325474 



NM_020389:Homor 



Homo sapiens clone GLSH-3 similar to gli 



Hs.117898
Hs.239138 



BE304709 
M001088 
NM_003243



Hs.146550 
Hs.29739 
Hs.342874 
Hs.278639 

Hs.196945 
Hs.343586 
Hs.247888 



ESTs 

Target Exon 
NM_025080:Homos, . 
pre-B-cell colony-enhancing factor 
Target Exon 
C1 002296:gi|667781 7|ref|NP_0331 26. 1 1 rep 
myosin, heavy polypeptide 9, non-muscle 
ESTs, Weakly similar to C34323 GTP-bindi 
transforming growth factor, beta recepto
KIAA1684 protein; likely homolog of mous 
gb:Homo sapiens SIB 276 intestinal mucin 
ESTs 

zinc finger protein homologous to Zfr-36 
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402217 
427700 
455674 



440034 
418168 
417295 
406305 
427886 
435409 
413861 



403591 
406193 
420813 
442941 
400703 
415026 
400334 
445878 
404975 
436370 
400513 
424008 
415405 
407612 



449125 
418922 
402404 
421993 



428193 
422166 
439453 



AJ012074 
AI809182 
R43409 



AA706937 
X75042 
AW444551 
BE251104 

AI557280 

AJ243425 

A1826502 

X04430 

AA460150 

AW449385 

AI908639 



AA417083 
AJ238982 
BE175424 



R01220 

R02740 

R59141 

U26403 

AW501504 

AV657446 

AI671439 



NM_004235 
W72424 
BE264974 
AI925688 



AI864581 
AF253468 
AA530892 



N80129 

AW500896 

NM_000691 



vasoactive intestinal peptide receptor 1 
Hs.130907 ESTs

Hs.6829 Homo sapiens mRNA for KIAA1644 protein,

C19001662*:gi|6753B72|reqNP 034345.1 |i 
Hs.180383 dual specificity phosphatase 6 

gb:RC3-BT031 9-1 001 00-01 2-d 1 2 BT031 9 Homo 
Hs.1 20802 ESTs, Moderately similar to A26641 Na?ex 

Hs.35380 x 00 



capping protein (actin filament) muscle 

Target Exon 

early growth response 1 

ESTs 

interleukin 6 (interferon, beta 2)



Hs.73217 
Hs.157294 
Hs.246781 I 
Hs.101368 I 
Hs.43148 



VNN3 protein 

gb:RC4-HT0578-170300-012-d01 HT057B Homo 
C3000142*:gi|450301 5|ref|NP_003900. 1 1 co 
C1002603':gi|9387091|gb[AAG01738.1[AF248 
NM_001397:Homo sapiens endothelin conve- 



ESTs, Weakly similar to T28770 hypotheti



Hs.8867 
Hs.72308 
Hs.145587 
Hs.185679
Hs.137555
Hs.37142 



gb:zm66a10.r1 Stratagene neuroepithelium 
NM_003213*:Homo sapiens TEA domain famil 
hypothetical protein FLJ22169 
Target Exon 
Target Exon 

prolactin-induced protein 
cysteine-rich, angiogenic inducer, 61 

11001794 |i n i I |t f ( 1| 
ESTs 



Homo sapiens dmd gene, intron 1 1 



HT0 
Target Exon 

putative chemokine receptor; GTP-binding 
gb:yg96d11.r1 Soares infant brain 1NIB H 
ephrin-A5 

gb:UI-HF-BPOp-ajd-h-04-0-UI,r1 NIH_MGC 5 
gb:AV657446 GLC Homo sapiens cDNA clone
Homo sapiens mRNA for KIAA1557 protein,
ESTs 



Hs.110571
Hs.75511 
Hs.153863 



MAD (mothers against decapentaplegic, Dr
Kruppel-like factor 4 (gut)
S100 calcium-binding protein A9 (calgran



Hs.222312 ESTs 



or 13 



msh (Drosophila) homeo box homolog 2 
Target Exon 
Target Exon 
Target Exon 

Homo sapiens cDNA FLJ14242 fis, clone OV 
ESTs 

delta-like 4 homolog (Drosophila) 



Target Exon 
gb:UI-HF-BR0p-ajp-f-08-0-U!.r1 NIH_MGC_5 
nuclear receptor subfamily 4, group A, m 
Target Exon 

C5000561":gi|7513700|pir|rT14151 Inv pro 



Hs.575 



gb:UI-HF-BP0p-air-a-03-0-Ul.r1 NIH_MGC_5 
aldehyde dehydrogenase 3 family, member 
Target Exon 
Hs.192090 ESTs 
Hs.198951 jun B proto-oncogene
Eos Control 
Target Exon 
Hs.246862 ESTs 
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414580 BE385918 gb:601 275386F1 NIH_MGC_20 Homo sapiens c 

428800 M57627 Hs.193717 interleukin 10

451676 R84770 Hs.33538 ESTs, Weakly similar to oxygen-regulated

402394 Target Exon 

404818 Target Exon 

436364 X06096 gb:Human macrophage alpha1-antitrypsin c

420369 U96759 Hs.97220 chondroadherin 

405590 CX001 497':gi|4557543|ref|NP_001384.1| ex 

402448 Target Exon 

433495 AW373784 Hs.71 alpha-2-glycoprotein 1, zinc

409020 AA062549 Hs.21162 retbindin 

405443 Target Exon 



455674 1490762_1

413861 1561647_1

409368 110612_1

415405 1872126_1

409837 915621_1

458637 395206_1

428193 430_1



448793 3006936_1

409859 916430_1

405811 58948_2

400116 5269_1



T65940T64515 AA071267 AA071334 

U81984 NW_00143Q BE9Q7085 BI333232 AIQ21936 AU138476 C13601 U51626AU100517BI054387AU076970BE7B6454BG010080AW377189 

BF998789 AA368139 R1 1396 T83613 BG006324 BI012404 BG001643 BF757957 AL549361 AL544018 BE002870BE929314 BE090199 AL046650 

BI053717BE929315 BI054967 BF960055 BF925432 R05421 BF922073 T70331 BI004403 

AI023587AA889354AA846791 

AV711317 AI809938 AI8Q8768 A1240593 AI915771 

AW971347 AA524886 AA211537 BF903005 BF357120 

BE068758 BE068745 BE068689 BE068778 BE068529 BE068683 BED68445 BE068392 BE068719 BE068473 BE068521 BE068628 BE068422 
BE068618 BE068354 BE068390 BE05B414 BE068433 BE068369 BE068384 BE068661 BE068324 BE068301 BE068436 BE068754 BE068329 
BE068672 BE068494 BE068596 BE068332 BE068347 BE068588 BE058328 BE068493 BE068740 BE068685 BE068759 BE068307 BE068429 
BE068303 BE068693 BE068374 BE068295 BE068625 BE068302 BE068663 BE068675 BE068579 BE06831 1 BE068674 BE068547 BE068602 
BE068605 BE068352 BE068306 BE068401 BE068537 BE068552 BE068450 BE068723 BE068393 BE068671 BE068748 BE068317 BE068447 
BE068568 BE068632 BE068357 BE0B8330 BE06B49B BE06B631 BE068540 BE068410 BE068626 BE0B8591 BE068522 BE068676 BE068499 
BE068361 BE068598 BE068350 BE068299 BE068580 BE068567 BE06B692 BE068321 BE068327 BE068739 BE068526 BE068538 BE068765 
BE06B340 BE068733 BE068293 BE068535 BE068480 BE068476 BE06B761 BE068712 BE068706 BE068549 BE068419 BE068383 BE068434 
BE068418 BE068525 BE068543 BE068752 BE06B550 BE068623 BE06B470 

BG563152 BF846777 BFB49354 BF849359 BF846636 BF849201 BF349356 C15931 M056717 AW864542 AWBB2724 AA056567 
AI910738AW139227 AA932891 AA622104 
AF007191 AWB20706 BG978594 BF872238 

L1 3288 AA928785 A160891 2 AW872978 AA565655 AI02291 5 AI304920 AI564366 AI668793 AI094557 T60038 R72302 H45409 M508805 R46356 
AA418793BM129553BM129126 BM129292 BM128865 AI808418 AI6B9932 A1806573 BF431808AW872985 AW166269 H73241 T16132 AI264547 
R73391 R72085 R72B40 T83751 X752SS BF75434B R941 05 AW449839 R73300 NWL004624 AI797007 BE045543 BF1 10021 BF75425D T83923 
AW884084 AA903896 AA4 18962 L20295 R72351 H4509B M961010 R73210 R46451 AW884035 BI022902 BI763932 BI910138 



BF352282 BE175424 BE175418 BE175383 
AA071059 AA085201 M085020 

W18191 R59141 R54142 R12130 F11352Z42794 F08242 F07925 H21084 R54090 R59142 

AW501504 AW501656 AW50304B AW502449 AW502098 

AV657752AV657446 

AF105036 U70663 NM.004235 AF0221 84 AU141767 AU141110 AL040569 D44830 BI01 1 351 AL575805 AI290876 AI01 4784 AI393429 AI26621 1 

AW074303 AA620711 BF197792 AW008766 D25944 A!687397 AA621680 AV714408 BF446905 BM314505 BF514079 BM314197 AAB45201 

AW874084 BE720622 AI127241 AA236239 AI679709 AI679135 A1572470 AA573434 AI568487 BE049325 M637950 BG925989 AI338031 

A1365073 AI024576 AA298805 H04001 H45668 BG682146 AL552388 BI462361 BG547513 BG896863 BI256661 

AI936948 A1864581 AI570641 

AW501926AW502566AW501927 

AK057581 AW500962 AW500896 AW501 1 05 

D42041 NM.014610 AJ000332 BI75B702 BG720650 AU141 1 29 AU130711 AU141 380 AU132402 BM048556 AU127520 BE259984 AU128952 
BE614151 AL501516 BM146777 AU128103 BM194094 BE937951 BE695396 AK026997 AK026567 BF969293 BE798100 BI086881 BG166248 
BE877845 BG38541 4 AW886747 BF093789 AW3901 59 BF82031 1 AA421676 AW880845 AW404827 BF726465 BE1 61 190 BE2541 02 AW406002 
BE161223 A1912055 BF930228 AW374357 AW794531 BE720524 BE933982 BE933658 BE933694 BE933978 BE933654 BE933583 BE166557 
BE933874 BE933641 BE933859 BE933626 BE933B56 BE933633 BE933864 BE933631 BE933867 BE933634 BE933857 BE933624 BE933883 
BE933650 BE720491 AA420426 BE7204 1 0 BE720453 BE720444 BE72041 1 AW368748 BF87461 6 BE933498 BE835979 BF926667 AW849921 
AW850025 AW850022 AWB49977 AW849900 BG250251 W876B9 AI192825 A1692824 M426263 A109031 5 AI309537 AA877437 AA478438 
AI538868 AW2761 62 AI279916 AA50031 8 Al 1 88836 AW662284 AI26251 9 AA293457 BF347442 AA421 677 AA658063 AA565510 AA937060 
A1142684 AA788940 AA827426 AU152614 AI342784 AU148738 AA219664 M047835 H99450 M018563 AI073634 BM475120 BG875251 
BG248778 U46372 AA383858 AU140355 BG821891 BF935049 BI760656 BI054103 BF982309 BE872215 BI257291 AU158469 AU160599 
AU152469 AU152375AU152059AU148575 N32267 AU149554 AI627459 AI71984OAW779017 AI291493 AW304181 AW470055 AI086491 
AI311387 A1634232 AH 51241 AI288848 AW050588 AW589580 AI241353 AI880219 AA039309 AA026517 M016238 AA013444 H86822 R87530 
AA058462 N27082Z39679 BE544309 W52619 AA018076 Al 81 3668 AW1B99D7 AI41 31 04 AU1 59878 AU 1 50087 R21 754 AA015932 H67274 
AU153097AI961344AA018208W32429 R45344 R77453BM470129AU130415BI227374BE298179AW844963AW844983AI904066AA379006 
BF850571 AA355641 BG7471 56 AL547262 AW367941 BE550004 BI116061 BG899031 BE56031 8 BF1 74 1 77 BI051 455 BE001967 BE386446 
BF969326 BF808765 BF684480 BG421 61 7 A1940607 AW875483 BE789632 BF80871 1 BI192691 AW904249 BI91 1430 BE265407 BE730343 
BE397808 BI226516 

BG333973 BE385437 BE408833 BE387650 

X05826 X06096 BG468890 AW951 851 W23562 T28392 H56742 H58030 T69205 



"The DNA sequence of 
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NLposition: Indicates nucleotide positions of predicted exons. 
Pkey Ref Strand Nt_position



404518 
401234 
402181 
403479 
402911 
401540 



401126 
401904 
401919 
406443 
406016 



402483 
405448 
404439 
404026 
400881 
405429 
402642 
402217 



400703 
404975 
400513 
402404 



405063 
405163 
402386 
400489 
404795 
402394 
404818 



8671966 
9502466 
9280765 
8272661 
6758795 
8218035 
7229900 
7768597 
9103984 
7574980 
7582529 



449746-450040 

148369-148533,150678-150809 
142689-142979 
105838-107310 
13758-13922,14658-14762 



1308-1416 

13856-14004 

65578-66119 

136347-136532 

55316-55585 

79674-79968 

91446-91603,92123-92265 

51577-51723 

125599-125756 

21521-21757 

26564-26819 

108239-108386,112216-112373,115388-11557

91614-91718 

103082-103414 



146733-146860,147399-147951,153127-15325 



6794-7396 
86899-87122 
130204-130806 
19346-19480,20041-20119 



33671-33839 
90492-90818 

112942-113069,114303-114521 
90716-90887,101420-101577 



TABLE 38A: About 207 genes upregulated in lung fibrosis relative to noi 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 90th percentile of lung fibrosis AIs divided by 70th percentile of normal tissue AIs, where the minimum value for the numerator and denominator was set to f
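The R1 score defined in this legend is a ratio of two percentiles with a floor applied to both terms. The sketch below illustrates that arithmetic under stated assumptions: the percentile uses linear interpolation between ranked values (the legend does not specify a method), and the floor is a caller-supplied parameter because the value is cut off in the legend above. The names `percentile` and `floored_ratio`, and all sample intensities, are hypothetical.

```python
import math
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (an assumed method)."""
    if not values:
        raise ValueError("no values supplied")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    low, high = math.floor(rank), math.ceil(rank)
    return ordered[low] + (ordered[high] - ordered[low]) * (rank - low)


def floored_ratio(case_ais: Sequence[float], control_ais: Sequence[float],
                  floor: float, case_pct: float = 90.0,
                  control_pct: float = 70.0) -> float:
    """Ratio of case percentile to control percentile.

    Both terms are raised to `floor` when they fall below it, so that
    near-zero intensities cannot produce arbitrarily large ratios.
    """
    numerator = max(percentile(case_ais, case_pct), floor)
    denominator = max(percentile(control_ais, control_pct), floor)
    return numerator / denominator


# Made-up intensities; the floor of 50.0 is purely illustrative.
fibrosis = [10.0, 20.0, 30.0, 40.0, 500.0]
normal = [5.0, 8.0, 12.0, 15.0, 20.0]
r1 = floored_ratio(fibrosis, normal, floor=50.0)
print(round(r1, 2))
```

In this example the control percentile (14.4) lies below the floor and is replaced by it, which is the situation the minimum-value rule is meant to handle.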



ExAccn UnigeneID Unigene Title

AW976035 Hs.292396 ESTs, Weakly similar to A47582 B-cell gr

M486620 Hs.41135 endomucin-2 

NM_002984 Hs.75703 small inducible cytokine A4 (homologous 

T98226 Hs.171952 occludin

AF151103 Hs.112259 T cell receptor gamma locus 

X58288 Hs.154151 protein tyrosine phosphatase, receptor t 
Target Exon 

F09247 Hs.247735 protocadherin alpha 10 

NM_013390 Hs.160417 transmembrane protein 2
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43B797 C16161 Hs.283040 h/poihetical protein PR02543 

410315 AI638871 Hs.152519 Homo sapiens cDNA: FLJ22524 fis, clone H

446714 W73818 Hs.110028 ESTs

412326 R07566 Hs.73817 small inducible cytokine A3 (homologous 

430259 BE550182 Hs.127826 RalGEF-like protein 3, mouse homolog 

412790 NM_014767 Hs.74583 KIAA0275 gene product



451035 
436473 
406714 
414586 
427274 
427527 
452813 
442831 
427774 
445330 
436001 
431681 



407151 
422607 
421205 
428582 
423582 
424880 
421233 
429350 
428727 



416114 

435869 
444212 
422442 
442870 
424456 
429673 
445107 



428403 
431830 
417512 
423067 
437457 
415000 
437145 
418838 
419660 
409956 
414493 



W78816 
AI093480 
X00442 
AU076785 
AI193122 
AI219304 
AA306160 
NM_0052V 
AI809057 
U54727 
AI798959 



442085 
427704 
427247 
441965 



ESTs, Weakly similar to S65657 alpha-1C-
l FLJ11896 
Hs.75990 haptoglobin 
Hs.430 plastin 1 (I isoform)
Hs.132275 ESTs
Hs.266959 hemoglobin, gamma G 

lymphocyte cytosolic protein 1 (L-plasti 



Hs.16488



Hs.174142 colony stimulating factor 1 receptor, fo 

Hs.153261 immunoglobulin heavy constant mu 

Hs.191445 ESTs 

Hs.131686 ESTs 

AA278583 Hs.180737 Homo sapiens clone 23664 and 23905 mRNA 

R52656 Hs.21691 ESTs 

AW903849 Hs.173840 HUEL (C4orf1)-interacting protein

AK000378 Hs.267566 hypothetical protein FLJ20371

AA533447 Hs.312989 ESTs 

AI381659 Hs.267086 ESTs 

H25836 Hs.301527 ESTs, Moderately similar to unknown [H.s

Z45471 ' 



AL137540 Hs.102541 



BE000831 
NM_000328 
AA209534 
AI754634 
AF078847 
Z43161 
AW630088 
AI123118 
AI695549 
AF255910 
AW503976 
AA324998 
N45018 



in 4 
BENE protein 

Homo sapiens cDNA FLJ11812 fis, clone
retinitis pigmentosa GTPase regulator 
tetraspan NET-6 protein 



Hs.191356 general transcription factor IIH, polype 
Hs.283714 30 kDa protein 

Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f



3.10649 



edgene 



Hs.147066 signal transducer and activator of trans

Hs.8769 hypothetical protein DKFZp761J17121

AA341017 Hs.25549 hypothetical protein FLJ20898 

AA884407 Hs.211595 protein tyrosine phosphatase, non-recept

AI208121 Hs.147313 ESTs, Weakly similar to I33022 hypotheti 

AL134275 Hs.6434 hypothetical protein DKFZp761F2014 

BE620016 Hs.182470 PTD010 protein 

AI393048 Hs.326159 leucine rich repeat (in FLII) interactin

Y16645 Hs.271387 small inducible cytokine subfamily A (Cy 

X76534 Hs.82226 glycoprotein (transmembrane) nmb 

AA321355 Hs.285401 colony stimulating factor 2 receptor, be 

AA757900 Hs.270823 ESTs, Weakly similar to S65657 alpha-1C-

... Hs.239812 Homo sapiens serologically defined tres 



AF007216 Hs.5462 
AW385224 Hs.35198 



He earner family 4, sc 



Hs.62 protein tyrosine phosphatase, non-recept 

Hs.2463 angiopoietin 1

Hs.24030 solute carrier family 31 (copper transpo 

Target Exon 
Hs.263395 HT018 protein



H03109 
AA605012 

AB028981 Hs.8021 KIAA1058 protein

BE264974 Hs.6566 thyroid hormone receptor interactor 13 

R80137 Hs.302738 Homo sapiens cDNA: FLJ21425 fis, clone C

AI289619 Hs.13040 G protein-coupled receptor 86 

BE241624 Hs.82401 CD69 antigen (p60, early T-cell activati

W95006 Hs.269559 ESTs, Weakly similar to S65657 alpha-1C-

AA302741 Hs.25786 ESTs, Moderately similar to JC5233 galac

AA748418 Hs.164577 ESTs 

AA975688 Hs.159955 ESTs

AW971063 Hs.292882 ESTs 

AW504221 Hs.174103 integrin, alpha L (antigen CD11A (p180),

AA972712 Hs.269737 ESTs 

AK000737 Hs.237480 hypothetical protein FLJ20730

BE047394 Hs.8208 ESTs, Weakly similar to S71512 hypotheti 

X90568 Hs.172004 titin 

AA452970 Hs.239527 E1B-55kDa-associated protein 5

AI312783 Hs.155772 Homo sapiens thymic stromal co-transport

AJ243669 Hs.8127 KIAA0144 gene product 
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449088 


AI654048


Hs.196556






416511 


NM_006762 




Lysosomal-associated multispanning membr


2.30 




AA563730 








412116 


AW402166 






2.78 


433793 


AW975959 




ESTs, Moderately similar to KIAA1058 pro 


2.78 




AI932285 


Hs.160569 






410057 


R66634 


Hs.268107 


multimerin 


2.77 


417497 


AW402482 




CD53 antigen 


2.77 




AA863360 


Hs.26040 


ESTs, Weakly similar to fatty acid omega 




431884 


AA521246 


Hs.210792


E T v, y imila t L!< I It II LUE 


2.75 


409969 


AW514668 


Hs.194258 


ESTs, Moderately similar to ALU5_HUMAN A


2.75 




BE621807 










AA761378 


Hs.192013 










Hs.44829 


ESTs, Weakly similar to 138022 hypotheti 




447818 


W79940 


Hs.21906


Homo sapiens clone 24670 mRNA sequence 


2.74 


418818 


AA228899 


Hs.101307 








AA345051 




ESTs, Weakly similar to 138022 hypotheti 






At954968 






2.71 








ESTs, Weakly similar to 138022 hypotheti 






AW502761 




KIAA0430 gene product 






Al 82471 9 


Hs. 143251 








AL080208 


Hs.306325 


Homo sapiens mRNA; cDNA DKFZp586C1523 (f








Hs.274260 


















NM_004736


Hs.227656 








AI735482 








448749 


AW859679 


Hs.21902


Homo sapiens clone 25237 mRNA sequence 


2.69 




AA015879 






2.69 




AW392551 




ESTs, Weakly similar to A56194 thromboxa






NM_005397




podocalyxin-like 




418031 








2.68 




BE502341 








405121 






mitogen-activated protein kinase 8 inter 


2.65 


410163 


AF151977 


Hs.59260 


NTT5 protein 


2.67






Hs. 148910 


















AW850303 




gb:lL3-CT0219-1 91 199-030-F09 CT0219 Homo 






NM_016245 




















AF071076 










AW370213 




ESTs, Moderately similar to A46010 X-lin






AW270037 








408988 


AL1 19844 


Hs.49476 


Homo sapiens clone TUA8 Cri-du-chat regi 


2M 




AJ 269529 


Hs.301871 








AJ 224864 










BE179030 




Human DNA sequence from clone RP5-1174N9
















Al 568170 


Hs.96886 


















AW068287 


Hs.301175 












ESTs, Moderately similar to MAS2 HUMAN M 




419777 




Hs.270975




2'S3 








hypothetical protein FLJ20992 similar to




427596 


AA449506 


Hs.270143


extracellular glycoprotein EMILIN-2 prec 


2.62 














AB033059 




KIAA1233 protein






AA234171 


Hs.1 87626 








A1244661 


Hs.103296 


ESTs, Weakly similar to S65657 alpha-1C-
















BE272922 


Hs.1 73936 












RAB5A, member RAS oncogene family 


2.61




AA281959 




glia maturation factor, gamma 


2.61 




AW629386 










AF1 67706 






2.61






Hs.334514 




2.60 




NM_00228( 








430539 


AK001489 




ADP-ribosylation factor-like 1


2.60
















Hs.74076 








AW954011 


Hs.180711 






410883 


D43767 


Hs.66742 


CCL17 chemokine (TARC) (SCYA17)


2.58


441028 


AI333660 


Hs.17558 


Homo sapiens cDNA FLJ14446 fis, clone HE


2.58 


413949 


AA316077 


Hs.75639 


Human TB1 gene mRNA, 3' end 


2.58 


434943 


AI929819 


Hs.92909 


chromosome 21 open reading frame 50 


2.58 


443605 


H06865 


Hs.134131 


ESTs 


2.57


425017 


AL1 19305 


Hs.26409 


ESTs 


2.57 


440334 


BE2761 12 


Hs.7165 


zinc finger protein 259 


2.56 


426075 


AW513691 


Hs.270149 


ESTs, Weakly similar to 2109260A B cell


2.56 


425345 


AU077297 


Hs.155894


protein tyrosine phosphatase, non-recept


2.56


407174 


T79938 


Hs.77062 


leukocyte immunoglobulin-like receptor,


2.56 


443834 


AI741510 


Hs.173548 


ESTs 


2.55 


427557 


NM_002659 Hs.179657


plasminogen activator, urokinase recepto


2.55


420539 


AA282735 


Hs.44004 


AD031 protein 


2.55 
AW070211 Hs.102415 Homo sapiens mRNA; cDNA DKFZp586N0121 (f
D63209 Hs.5944 solute carrier family 11 (proton-coupled
AW891802 Hs.296276 ESTs




BG743181 AI830050 BE695688 AA126591 A1903503 R26045 N62B94 NB3950 AA131619 AISB1480 N7962BAA461603 R7B979 AW608865 NSB622 
BF448838AA779000AA460314AI092721 A18701B2 AI436234 AI4941 51 AI127704 AI1277D2 BE349350 AI0934B0AA115264AA131567 R26840 
R78885 

AI223854 AI1 29852 AA60501 2 

X75684 AL573167 AI445461 AI453743 AI983655 A1564644 AA977180 AI694111 AI59135BAW071625 AI678712 AI720939 AI927769 BE43979B 
AI963432 AA292956 AW192593 AIB55838 AI696905 AI424384 AI15131 2 A191 1921 AI5978D1 BI494959 AI240988 AI492554 AW252737 BE044033 
AW008570 AW629505 BI49495B AA0B8439 AA7C6057 BF222B20 BF59360B BE501957AA524526 BE044134 AW572531 AW015724 BE349186 
AA043217 BE219784 AI799814 AA1 29575 AI671 727 AI470033 BE546195 AW779725 AA903050 AA14722B AA404570 AI075B78 W38161 AI972739 
AW673152 AA723200 C061 23 BF0571 47 AA627686 AA1 57944 AI990245 AA55251 7 T324B7 AI800106 AI33317D AIB591 60 W454 1 0 AI990827 
AW275048 AA1 82640 AA478328 AI2909 r P , fl)6 1 58 AW 47 1421 AW103470 AW300456 AW191997 AIB23465 AA962397 AA1 36658 AI251 817 
AW339104AA724739AA411100AA191349AA757735AA037596A!769516AW772283AA010631 AI692846AI051065H8rj983R79933AI950693 
AI245632 AI349390 AA148284 AI798502 AA487893 AI621320 AW194272 C06365 AA953883 BE858936 AI918523 AIB72628 AI927217 AI453453 
AI189366AW338678AI26135SAl r CC r ^ I032569 AI97 I9055I 7 1 M » 93030 AA603586 BE773488 AW339301 

BE773489 BE773462 BE773495 A1650338 BE773499 AI745717 BE81 1475 BEB1 1470 BE81 14S4 BE81 1418 BEB1 1415 BE811400 BE81 1398 
BE811388 BE811352BE773501 BE773494BE773486BE773474 BE773473BE773470BE773461 BE811350 BE811337 BF593847 BG055071 
AW675302 BF003068 AA71 91 73 BE81 1348 AI582462 AI686240 BE773500 AI244845 AI565439 AI91 3453 AI472527 AI446740 AA035576 AA191 41 4 
AW674145 C05782 AI589264 D57558 AI468237 AI432033 M989662 R21752 BF002457 AA98B297 AL574095 AL576200 AL571074 AL574525 
AL578810 BG498381 AI928364 BE879732 AA 179834 AA47971 2 C17732 BM091258 BF843901 AWB20230 C17476 BE327120 AA129574 
AA136645 BF843900 AW8061 93 AA502832 AA549494 AL558520 AL547950 BE706937 BE81 1 360 BE773498 BE81 1401 BE773484 BE81 1437 
BE81 1380 BE81 13S9 BF997171 BF757734 BE926037 AI377596 C061 11 AW0BB9SB BEB114D4 BE81 1472 AI865912 AI925607 AI871950 AI093510 
BE905927 BE811435AA191387 AW7720CC BE81 1453 BE614379 BF844522 BI044896 AI744233 AW984527 C17504 BF843883 AI248307 
BE773483 AI567995 W60075 BF941 1 83 AI73BB44 BE81 1 458 BE7734B1 AI262930 AA948565 BE706942 BE1 56360 T65026 AW242958 AW1 97954 
BE905184 AA722206 AI344943 AI348877 AI334860 BE621 857 BE1562B0 AA454099 AA037722 BFB43897 AWB061 83 M04321 6 BG482896 
AA182734 AA877242 AW372926 H27252 R381 14 BF851858 BE156214 AA190427 T91752 AA035067 AA837326 T10930 BF9065B7 BI755027 
BG506731 BC003442 BC0101 ES AL5 0134 I r C 6 L r 48700AL550751 AL547978 AL545286 AL540643 AU1 1 8627 AL601379 BI259821 
BG741786 BI868522 AU135866 BI552770 BI259210 BI256520 BI2555S9 BG485098 BI258228 BG498501 BM044512 AU133984 AL556586 
BE745111 BI222533 AU133917 BG2B8151 BI260715 BI550550 BG500773 BI551761 BG707601 BI818593 BF691383 BG721129 BG541578 
BE906666 BG751098 BI224135 BG400746 BG478065 BE790436 AW0B023B AU137549 BG429896 BE392486 AW961686 BG721056 BE908365 
BE546656BG541235AW583735 BG528290BI260895AW651691 BM048974 BMD43805 BG1421B5 AA315188 AI446615 C06300 BG497644 
AA088544 AI815987 BG528631 BE51 91 82 AW2391 85 AW062910 AW062902 AA347236 F1 1933 AA4BB005 AA301631 AA376800 D56120 
AA343532 AA308636 F00242 AA3760B6 M31696B AA343799 BIB70221 BE910282 BG53B748 AW960564 AV732879 D16854 AA192519 BF922148 
AA216013 BG624091 BE544387 BG507008 AW1 76446 BF790033 BE088925BE088854AA921353 R21 BOO AA01 1222 T97525 
BI754027 BF696071 AI351939 BG15129BAI919334 AI401620 BI770165 W72057T96158T29478 AA181252 BG927793 AA714431 AA600749 
AA181247 AAS14756 AA081092 H52207 BG925934 BF222579 BGB99001 N64245 AA953040AIB32406 AA102441 BG928081 AA993445 AA916041 
AA987847 AA933329 M737219 AA916443 AW128994 AI492560 AI761847 BC005272 NMJJ00900 X53331 M58549 BI758966 AL598829 BI754530 
BGS99770 BE439699 BE440148 AV704365 AV733S52 BG21 2015 BG1B4149 BG200180 B6212690 BI761222 BG182079 AW338822 AI925631 
AI423041 AW071 181 AI889836 AW1231 1 2 BG92533S A1017633 AA568964 BF725590 AI004210 AI809799 BE0B3097 BGB96220 AW997681 
BF668788 BE0831 34 AW631281 BG193052 BG183095 BE440088 BG1 85728 AI499579 AA188162 AA864282 B1493352 AA1 55854 AA836749 
AA836844 AA9B5478 AW082299 AI816747 AA450221 AA971 294 BE327509 AI719662 BG576669 AI4793B2 BF824747 AI741 800 BG982962 
AI0BB473 AA916151 AW473324 BG901 177 BE439998 AW023269 BE813B71 AW999947BE839108AV707983AA369722 AW796627 AW890608 
AI341 771 AA302459 BI493353 AA366332 AA37 1 i 04 AA357277 AL547972 BG92801 1 AI678903 AI699886 AI956165 AA484893 AA643953 
AW591063 BG203275 BG21 1093 AI334791 AA916589 AW05B266 A1362370 AI143352 AA508721 AI928079 D57214 BE045265 AA541785 
?? S AW01 " 04 A ™ 89242 M953322 AI68669B F27562AA614749 D56B45 F20774 F30660 F25646 AW023542

1611592.1 A142241 9 AA51 4370 A1741678 A!735482 AI73503 1 A1371436 
10891 14_1 AWB50587 AW850589 AW85Q318 AW850303 

35068.1 AB01B322 BC012480 BI524873 AW665554 AI934469 A1479916 BF096179 BF096162 BF096132 AA744972 AI9519BB AI858339 BE076331 

AA8E699BAI 70 M91 , IE7B811 AI6931 39 AI308135 AA669045AA961064AI018062H80618BE221942 R52609 AI915164 AA365625 

F882486 BG2861E4 ,1 9 , R , , h 

5S?5S? 76773H8SS2 7M765 5"AA081^ B J ( 3AL037576AA534314 BE8149B4BE973713N49493 BE006634 
SfS 22S AW 2™3;AA234765AI33t,ll F0571 18 7450AI341191 AI434143AI91 7449 AW51 7207 AA255424AW008334AA847572 
ra090AM^299AI6oSo 3 51 

41421.1 BC0.0 1 r" AW29420'BF849776AA459054AI917452AW403072W27419BF914568 BF798468AW370558 

124009.1 AI33263B AA663215 AW629386 

31268.1 AK001489 AU129447 BF959274 BG565452 A1245327 AU1 16B48 BF358559 BF358554 BF358570 BG6781 19 AL51 5852 AU1546D7 AI3S7SB7 

AW874359 A1122554 M4064 7E AICS10, : ; IB55 ; 79 Al6861 63 AA662158 AA91 1580 D31095 AI302576 BF5M761 AWS^WflSlMO 
AW169600 D31 161 AA905362 C21 179 BE327258 D31474 AW439053 D31309 BF756901 BI83EE2b C )7 ,1 9 EM142 16 *1« 5 
AA452023 AW473972 AW468490 AM1D271 A1475D44 BF821359 AA0-E1BBAI3 33 F < h ™ 

BF375775 AW321784 AW975085 W16475 D31 031 BG696392 AW860B76 AW752864 BI013705 BF96571 5 BF326604 AW821 786 

Pkey: Unique number corresponding to an Eos probeset

Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons

Pkey Ref Strand Nt_position

406617 8439858 Plus 36430-36552 

401958 3258613 Plus 108411-108629 

405121 8102330 Minus 35816-36004,36587-36684 

401113 9966541 Minus 19419-19959 
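By way of illustration only, rows of the foregoing form (probeset identifier, reference number, strand, and one or more comma-separated nucleotide ranges) may be read programmatically. The following is a minimal sketch and not part of the disclosed methods; the names `ExonRecord` and `parse_exon_row` are hypothetical, and the treatment of each range as inclusive at both ends is an assumption that the table itself does not state.

```python
from typing import List, NamedTuple, Tuple


class ExonRecord(NamedTuple):
    """One row of the exon table (hypothetical container for illustration)."""
    pkey: int                      # probeset identifier
    ref: int                       # reference sequence number
    strand: str                    # "Plus" or "Minus", as printed in the table
    exons: List[Tuple[int, int]]   # (start, end) nucleotide positions


def parse_exon_row(row: str) -> ExonRecord:
    """Split a whitespace-delimited row into its four fields."""
    pkey, ref, strand, positions = row.split()
    exons = []
    # Multiple predicted exons are separated by commas, e.g. "a-b,c-d".
    for span in positions.split(","):
        start, end = span.split("-")
        exons.append((int(start), int(end)))
    return ExonRecord(int(pkey), int(ref), strand, exons)


record = parse_exon_row("405121 8102330 Minus 35816-36004,36587-36684")

# Assumption: ranges are inclusive, so each exon spans end - start + 1 positions.
total_length = sum(end - start + 1 for start, end in record.exons)
print(record.strand, record.exons, total_length)
```

Keeping each range as a (start, end) pair, rather than flattening the positions, preserves the exon boundaries for rows that list more than one predicted exon.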



TABLE 40A: 656 genes upregulated in fibrosis relative to normal body tissues 

phosphatase, ion transporter). Certain predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar accession number, GenBank accession number

UniGeneID: UniGene number

Pred.Prot.Domains: Certain predicted protein domains. Abbreviations lis* 



likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research 2(
UniGene Title: UniGene gene title

R1: Ratio based on the 50th percentile of AIs, where the 10th percentile of all normal tissue AIs was subtracted from both the numerator and the denominator

Pkey; ExAccn; UniGeneID; UniGene Title; Pred.Prot.Domains; R1

442275; AW449467; Hs.54795; Homo sapiens secretoglobin, family 3A, m; Uteroglobin;TM=M;SS=Y; 39.47 
428434; AW363590; Hs.65551; Homo sapiens, Similar to DMA segment, Ch; LBP.BPi CETP C;TM4*SS-Y; 32 35 
439335; AA742697; Hs.62492; NM_052863:Homo sapiens secretoglobin, fa; none;; 28.49
406964; M21305; ; FGENES predicted novel secreted protein; none,none; 27.90
425211; M18667; Hs.1867; progastricsin (pepsinogen C); asp;TM=M;SS=M; 27.90 

1 bt1 11 1 » i rill v ILf 24 97 

428330; L22524; Hs.2256; matrix metalloproteinase 7 (matrilysin, ; Peptidase_M10;; 24.38
431723; AW058350; Hs.278966; Homo sapiens mRNA; cDNA DKFZp564B2062 (f; PMP22_Claudin,none; 23.35
4091^ W03-54 Hs.50313;hypot I n FLJ I C 232Q ' 

431089; BE041395; H [ r , i , r r ne . 1 

42S371 C4Q441 H 1 n fM=M;SS=M; 18.17 

448133; AA723157; Hs.73769; folate receptor 1 (cuukl r I i 1 1=M;SS=M; 17.64 

« ™ A ,7^In 5 u H ,«^ 3 M w 1M " U ' ph f F 0 1 ^Na.P,_colransTM-Y,SS=N; 17.33 

SS^ N-acylsphingo , , ( a PA Surfactant_B,none; 16.81 

419556; U29615; Hs.91093; chit 1 (chitoti lyco.hyr CBM_14;TM=M;SS=Y; 16.24 

419092. J05581 H 303, mucin 1. trar nembrane;S \;TM < M 1606 

426174; AA547959; Hs.115838; Homo sapiens similar to Echinoid n (LOC1; none,none; 15.84

X Sfi^JWH? m T hist ° com P at ; b "jt/ complex, ctes ; ig MHCJI_alpha;TM=M;SS=M; 15.42 

I'll 13 ahepsnE a=p 15.08 
444342; NM_014398; Hs.10887; similar to lysosome-associated membrane ; Lamp;TM=Y;SS=M; 14.94
406621; X57309; Hs.181125; immunoglobulin lambda locus; ig,HSP70,Ppx-GppA;TM=M;SS=N; 14.36
443709; AI082692, Hs i 2; EST Mr fn 3, none; 14.05 

426970; BE276891; Hs.194691; retinoic acid induced 3 (RAIG1); metabo; 7tm_3;TM=Y;SS=M; 13.88
' , I 19, H5.1S7764; thyroid Iran plion factor 1; homeo 1M M;£ 13.6 
432519; AI221311; Hs.130704; ESTs, Weakly similar to BCHUIA S-100 pro; none,none; 13.82
422355; AW403724; Hs.300697; coagulation factor VII (serum prothrombi; none,ig; 13.62
430280; AA361258 I rt n 7 receptor fn3 none 13 47 

415457; AW081710; Hs.7369; Homo sapiens testes specific A2 homolog ; MORN,sugar_tr;TM=Y;SS=M; 13.35 
431164; AA493650; Hs.94367; thyroid transcription factor 1; none,homeobox; 13.32
414998; NM_002543; Hs.77729; oxidised low density lipoprotein (lectin; lectin_c;TM=Y;SS=M; 12.83
400269; ; Hs.253495; Eos Control; lectin_c,Collagen,Xlink;; 12.30

424310; AA338648; Hs.50334; testes development-related NYD-SP22; none;TM=M;SS=N; 11.81

451558; NM_001089; Hs.26630; ATP-binding cassette, sub-family A (ABC1; ABC_tran,SRP54;TM=Y;SS=M; 11.79

452304; AA025386; Hs.61311; ESTs, Weakly similar to S10590 cysteine ; none,none; 11.68

445537; AJ245671; Hs.12844; EGF-like-domain, multiple 6; EGF,MAM;; 11.56

423778; Y09267; Hs.132821; flavin containing monooxygenase 2; FMO-like,pyr_redox;TM=Y;SS=M; 11.41

414812; X72755; Hs.77367; monokine induced by gamma interferon; IL8;TM=M;SS=Y; 11.31

430832; AI073913; Hs.100686; ESTs, Weakly similar to JE0350 Anterior ; none,none; 11.25

407910; AA650274; Hs.41296; fibronectin leucine rich transmembrane p; fn3,LRR,LRRCT,LRRNT;TM=Y;SS=M; 11.15

451497; H83294; Hs.284122; Wnt inhibitory factor-1; EGF,WIF;; 11.07

430250; NM_016929; Hs.283021; chloride intracellular channel 5; none;TM=M;SS=N; 11.07

411020; NM_006770; Hs.67726; macrophage receptor with collagenous str; SRCR,Collagen;TM=Y;SS=M; 11.05

446619; AU076643; Hs.313; secreted phosphoprotein 1 (osteopontin, ; Osteopontin;; 11.01

438091; AW373062; Hs.351546; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 10.97 

413048; M93221; Hs.75182; mannose receptor, C type 1; fn2,lectin_c,Ricin_B_lectin,Xlink;TM=Y;SS=M; 10.93

432231; AA339977; Hs.274127; CLST 11240protein noi T" M S=M;10 i 

415402; NM_000715; Hs.1012; complement component 4-binding protein, ; sushi;TM=M;SS=M; 10.77

418156; W17056; Hs.83623; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 10.63

436553; AW407157; Hs.181125; immunoglobulin lambda locus; ig,HSP70,Ppx-GppA;TM=M;SS=N; 10.58

421071; AI311238; Hs.104476; ESTs, Weakly similar to CGHU1E collagen ; none;TM=Y;SS=M; 10.57

418007; M13509; Hs.83169; matrix metalloproteinase 1 (interstitial; hemopexin,Peptidase_M10,Astacin,PG_binding_1;; 10.33

419086; NM_000216; Hs.89591; Kallmann syndrome 1 sequence; fn3,wap;; 10.30

407786; AA687538; Hs.38972; tetraspan 1; transmembrane4;TM=Y;SS=M; 10.28 

441384; AA447849; Hs.238660; retinoic acid induced 3; 7tm_3,none; 10.26 

453914; NM_000507; Hs.574; fructose-1,6-bisphosphatase 1; FBPase;TM=M;SS=N; 10.22

435523; T62849; Hs.11090; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 10.09

1 ! J1 1 f 1 i L I r F rd r av 3 Cacre;TM=M;SS=N; 10.0 

415323; BE269352; Hs.949; neutrophil cytosolic factor 2 (65kD, chr; SH3,TPR;TM=M;SS=N; 10.02

408562; AI436323; Hs.31141; roundabout (axon guidance receptor, Dros; ig,fn3;TM=M;SS=N; 10.02

448782; AL050295; Hs.362006; KIAA0758 protein; 7tm_2,ig,GPS,SEA;TM=Y;SS=N; 9.86

419235; AW470411; Hs.288433; neurotrimin; none,none; 9.79

415992; C05837; Hs.145807; hypothetical protein FLJ13593; none;TM=Y;SS=M; 9.74

418883; BE387036; Hs.1211; acid phosphatase 5, tartrate resistant; Metallophos;TM=M;SS=M; 9.70

439018; AW300887; Hs.26638; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 9.69

442652; AI005163; Hs.201378; Homo sapiens cDNA FLJ40427 fis; none;TM=M;SS=N; 9.68

446291; BE3I 1! n r a le protein 30; GILT;TM=M;SS=Y; 9.64 

408380 AF123"Hl Hs.44532; diubiquitin J] n T» MSS N 9M 

438089; W05391; Hs.351546; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 9.52
449494; AW237014; Hs.315369; aquaporin 4; MIP,none; 9.51
456062; AI866286.H- I962 ESTs, Wea ly similar to B36298 proline-r; none.none; 9.42 
446428; AW082270; Hs.12496; ESTs, Weakly similar to ALU4 HUMAN ALU S; none,none; 9.41
421952; AA300900; Hs.98849; dynein light chain 2B (DNLC2B); none,none; 9.18

407949; W21874; Hs.247057; ESTs, Weakly similar to 2109260A B cell ; Ribosomal_S14,ank,pkinase,death,none; 9.16

456034; AW450979; ; gb:UI-H-BI3-ala-a-12-0-Ul.o1 NCI CGAP_Su; none,none; 9.15

407788; BE514982; Hs.38991; S100 calcium-binding protein A2; efhand,S_100,S_100,efhand; 9.15

416965; N26223; Hs.160436; MDAC1; none;NA;NA; 9.03

443324; R44013; Hs.164225; ESTs; none,none; 9.03

435575; AF213457; Hs.44234; triggering receptor expressed on myeloid; ig;TM=Y;SS=M; 9.00 
440273; AI805392; Hs.325335; Homo sapiens cDNA: FLJ23523 fis, clone L; none,none; 8.99
424527; AW138558; Hs.334873; ESTs, Weakly similar to I54374 gene NF2 ; Zn_carbOpept,none; 8.80
409203; AA780473; Hs.687; cytochrome P450, subfamily IVB, polypept; p450;TM=M;SS=Y; 8.76
423387; AJ012074; Hs.348500; vasoactive intestinal peptide receptor 1; 7tm_2,HRM,CSC;TM=Y;SS=M; 8.74
443907; AU076484; Hs.9963; TYRO protein tyrosine kinase binding pro; none;TM=M;SS=Y; 8.73
425367; BE271188; Hs.155975; protein tyrosine phosphatase, receptor t; none;TM=M;SS=Y; 8.60
418918; X07871; Hs.89476; CD2 antigen (p50), sheep red blood cell ; ig;TM=Y;SS=M; 8.56
421563; NM_006433; Hs.105806; granulysin; none;; 8.55

450726; AW204600; Hs.355452; HUMP3PBA Human pulmonary surfactant-asso; SAPA,Surfactant_B,none; 8.51

419693; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 8.51

424450; AL137526; Hs.374425; dynein intermediate chain 2; WD40;; 8.42

402474; ; ; NM_004079:Homo sapiens cathepsin S (CTSS; Peptidase_C1;; 8.41

458079; AI796870; Hs.54277; Homo sapiens similar to RIKEN cDNA 28100; none;TM=M;SS=N; 8.40

424779; AL046851; Hs.153053; CD37 antigen; transmembrane4;TM=Y;SS=M; 8.35

453310; X70697; Hs.553; solute carrier family 6 (neurotransmitte; SNF,5HT_transporter;TM=Y;SS=N; 8.34

448140; AF146761; Hs.20450; BCM-like membrane protein precursor; ig;TM=Y;SS=N; 8.33

404240; ; ; NM_018950:Homo sapiens major histocompat; ig,MHC_I;TM=Y;SS=M; 8.28

459702; AI204995; ; gb:an03c03.x1 Stratagene schizo brain S1 ; none,none; 8.17

449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; 7tm_1;TM=Y;SS=M; 8.17

442994; AI026718; Hs.16954; ESTs; ank,pkinase,death,Ribosomal_S14; 8.12

445998; N99013 H )66; Home apiens mRN DNA D Z " 23j2 (f; PMP22_Caudin,none; 8.07 
420137; AA306478; Hs.95327; CD3D antigen, delta polypeptide (TiT3 co; ITAM;TM=Y;SS=M; 8.01
435472; AW972330; Hs.283022; triggering receptor expressed on myeloid; ig;TM=M;SS=M; 7.99
432441; AW292425; Hs.163484; intron of hepatocyte nuclear factor-3 al; Fork_head,none; 7.99

409208; Y00093; Hs.172631; integrin, alpha X (antigen CD11C (p150),; vwa,FG-GAP,integrin_A,vwa,integrin_A,FG-GAP; 7.94
432606; NM_002104; Hs.3066; granzyme K (serine protease, granzyme 3;; trypsin;TM=Y;SS=M; 7.92
442832; AW206550; Hs.253569; ESTs; none,none; 7.90

412104; AW205197; Hs.240951; Homo sapiens, Similar to RIKEN cDNA 2210; none;TM=M;SS=N; 7.89 

427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 7.86
443951; F13272; Hs.356835; ferritin, light polypeptide; PMP22_Claudin,none; 7.84

418299; AA279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), ly; integrin_B,EGF,PSI;TM=Y;SS=M; 7.79

447131; NM_004585; Hs.17466; retinoic acid receptor responder (tazaro; none;TM=Y;SS=N; 7.78

423961; D13666; Hs.136348; perioshn ;C«F ); F n T 1 r, SS=M;7.73 

424917; AI636208; Hs.96901; hypothetical protein FLJ23049; none;TM=M;SS=N; 7.72 
438564; AA381553; Hs.198253; major histocompatibility complex, class ; ig,MHC_II_alpha,none; 7.65

456672; AK002016; Hs.1 14727; Homo sapiens, clone MGG16327, mRNA, com; none.PK.P KC.myosinJiead.RhoGAP; 7.64 

427792; M63928; Hs.180841; tumor necrosis factor receptor superfami; SRP14,TNFR_c6;; 7.63 

436954; AA740151; Hs.130425; ESTs; none,none; 7.58

429732; U20158; Hs.2488; lymphocyte cytosolic protein 2 (SH2 doma; SH2;; 7.56
407601; AC002300; Hs.37129; sodium channel, nonvoltage-gated 1, beta; ASC;TM=Y;SS=M; 7.55

417105; X60992; Hs.81226; CD6 antigen; SRCR;TM=Y;SS=M; 7.51

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity Ia, re; ig;TM=Y;SS=M; 7.46

444381; BE387335; Hs.283713; hypothetical protein BC014245; Collagen;TM=M;SS=M; 7.40

432222; AI204995; ; gb:an03c03.x1 Stratagene schizo brain S1 ; none,none; 7.38
422667; H25642; Hs.132821; ESTs; FMO-like,FMO-like; 7.37

444527; NM_005408; Hs.11383; small inducible cytokine subfamily A (Cy; IL8;; 7.36

457411; AW085961; Hs.130093; iroquois-class homeobox protein IRX2; none,none; 7.32

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 B-cell gr; Furin-like,pkinase,Recep_L_domain,YLP,none; 7.32
419231; AL045294; Hs.136245; ESTs, Weakly similar to T17227 hypotheti; none,none; 7.30
438873; AI302471; Hs.124292; Homo sapiens cDNA: FLJ23123 fis, clone L; none,none; 7.27

424027; AW337575; Hs.201591; ESTs; 7tm_2,HRM,none; 7.26 

428927; AA441837; Hs.90250; Homo sapiens hypothetical protein FLJ231; none,none; 7.24

432435; BE218886; Hs.282070; ESTs; none,none; 7.22

428467; AK002121; Hs.184465; hypothetical protein FLJ11259; none;TM=Y;SS=M; 7.21
416030; H15261; Hs.21948; ESTs; none,none; 7.20

433293; AF007835; Hs.32417; hypothetical protein MGC2742; none;TM=M;SS=N; 7.18

418741; H83265; Hs.8881; ESTs, Weakly similar to S41044 chromosom; pkinase,Activin_recp,pkinase,Activin_recp; 7.16

420656; AA279098; Hs.187636; ESTs; none,none; 7.14

427698; AW972594; Hs.335499; ESTs; none,none; 7.11
432268; BE311856; Hs.274230; 3'-phosphoadenosine 5'-phosphosulfate sy; APS_kinase,ATP-sulfurylase;TM=M;SS=N; 7.06

413859; AW992356; Hs.8354; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 7.04

430413; AW842182; Hs.241392; small inducible cytokine A5 (RANTES); IL8;TM=M;SS=Y; 7.04

452363; AI582743; Hs.94953; Homo sapiens, Similar to complement comp; C1q,Collagen;; 7.03

421481; AW391972; Hs.104696; KIAA1324 protein; none;TM=M;SS=M; 7.01
418945; BE246762; Hs.89499; arachidonate 5-lipoxygenase; lipoxygenase,PLAT;TM=M;SS=N; 6.97

452281; T93500; Hs.28792; Homo sapiens cDNA FLJ11041 fis, clone PL; TGFb_propeptide,TGF-beta,none; 6.96

458124; AW005548; Hs.124590; ESTs; none,none; 6.94

422846; DC513H 4 r n factor 1 (47kD H r TM=M;SS N 693 

411027; AF072099; Hs.67846; leukocyte immunoglobulin-like receptor, ; inositol_P,ig;TM=M;SS=N; 6.92
428820; AA436187; Hs.172631; integrin, alpha M (complement component ; vwa,integrin_A,FG-GAP;TM=Y;SS=M; 6.90

423575; C18863; Hs.163443; intron of periostin (OSF-2os); Fasciclin,none; 6.89

419490; NM_005144; Hs.90708; granzyme A (granzyme 1, cytotoxic T-lymp; trypsin;TM=M;SS=M; 6.89

450954; AI904740; Hs.25691; receptor (calcitonin) activity modifying; none;TM=Y;SS=M; 6.87

425976; C75094; Hs.334514; NG22 protein; voltage_CLC;TM=Y;SS=M; 6.84
425555; AA359291; Hs.130767; Homo sapiens cDNA: FLJ23553 fis, clone L; LRR;TM=M;SS=N; 6.81

414991; C17898; ; Homo sapiens up-regulated by BCG-CWS(LO; Zip,none; 6.80

4 1 \ 1 r ls.743; Fc fragment of Igi j !*i i ITAfv ff f < 

422163; AF027208; Hs.112360; prominin (mouse)-like 1; none;TM=Y;SS=M; 6.79

445885; AI734009 H 1276 KIAA1 r r none.none; 5.77 
55 436576, AI45C2' > I 77542 E Ts 7tm_1 ,DnaJ; 6.77 

417079; U55590; Hs.81134; interleukin 1 receptor antagonist; IL1;; 6.75

424711; NM_005795; Hs.152175; calcitonin receptor-like; 7tm_2,HRM;TM=Y;SS=M; 6.75

416847; L43821; Hs.80261; enhancer of filamentation 1 (cas-like do; SH3;TM=M;SS=N; 6.73

426251; M24283; Hs.168383; intercellular adhesion molecule 1 (CD54); ig,ICAM_N;TM=M;SS=M; 6.71
417929; R27219; Hs.74647; Human T-cell receptor active alpha-chain; ig,abhydrolase; 6.70

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 24; none,lectin_c; 6.70

428227; AA321649; Hs.2248; small inducible cytokine subfamily B (Cy; IL8;TM=M;SS=Y; 6.68

421445; AA913059; Hs.104433; Homo sapiens, clone IMAGE:4054868, mRNA; ion_trans,K_tetra,asp; 6.65

439750; AL359053; Hs.57664; Homo sapiens mRNA full length insert cDN; IMPDH_C,IMPDH_N,CBS,integrin_B,Ricin_B_lectin; 6.62
428582; BE336699; Hs.185055; BENE protein; none;TM=Y;SS=M; 6.60

453142; M033548; Hs.7473, Hcmo z iens qsp j ir tic n r ote n al ( ha conne - fM=Y SS=M 6.60 

432374; W68815; Hs.301885; Homo sapiens cDNA FLJ11346 fis, clone PL; none,none; 6.56

448569; BE382657; Hs.21486; signal transducer and activator of trans; SH2,STAT,STAT_bind,STAT_prot;TM=M;SS=N; 6.54

424321; W74048; Hs.1765; lymphocyte-specific protein tyrosine kin; SH2,SH3,pkinase;TM=M;SS=N; 6.51
446932; AA961459; Hs.125644; ESTs; none,LRR,LRRNT; 6.50

427247; AW504221; Hs.174103; integrin, alpha L (antigen CD11A (p180),; vwa,integrin_A,FG-GAP;TM=Y;SS=M; 6.48

425998; AU076629; Hs.165950; fibroblast growth factor receptor 4; ig,pkinase;TM=M;SS=M; 6.47

447232; AW499834; Hs.327; interleukin 10 receptor, alpha; none;TM=M;SS=M; 6.46

431745; AW972448; Hs.153425; Novel FGENESH predicted cadherin repeat ; none,none; 6.43
417370; T28651; Hs.374466; tryptophanyl-tRNA synthetase; WHEP-TRS,tRNA-synt_1b;; 6.41

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; kinesin,fn3,Y_phosphatase;TM=M;SS=N; 6.40

429610; AB024937; Hs.211092; LUNX protein; PLUNC (palate lung and nas; none;; 6.39

409340; BE174629; Hs.321130; hypothetical protein MGC2771; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;SS=N; 6.37
413385; M34455; Hs.840; indoleamine-pyrrole 2,3 dioxygenase; IDO;TM=M;SS=N; 6.36

451820; AW058357; Hs.199248; ESTs; 7tm_1;TM=Y;SS=M; 6.34

408369; R38438; Hs.118747; SLC15A2 Solute carrier family 15 (H+/pep; PTR2;TM=Y;SS=N; 6.32
424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lys,ig,FAD_Synth,ldh,ldh_C,pkinase;; 6.32
444090; S69115; Hs.10306; natural killer cell group 7 sequence; PMP22_Claudin;TM=Y;SS=M; 6.31

416819; U77735; Hs.80205; pim-2 oncogene; pkinase;; 6.30 

421659; NM_014459; Hs.106511; protocadherin 17; cadherin;TM=M;SS=M; 6.27

415198;AW009480, Hs- 3 net He -i| transcript 4; none;TM=M;SS=N; 6.26 

424273;W40460;Hs.144442;phospholipaseA2,groupX;phoslipTM M I 6.24 

429083; Y09397; Hs.227817; BCL2-related protein A1; Bcl-2;TM=M;SS=N; 6.23 

452194; AI694413; Hs.373599; olfactory receptor, family 2, subfamily ; none,none; 6.22

424144; AA454033; Hs.41644; AKAP-associated sperm protein; RIIa;; 6.21

414142; AW368397; Hs.334485; hemicentin (fibulin 6); EGF,ig,tsp_1,hormone4,squash,TIL,Adeno_E3_CR1;TM=M;SS=M; 6.21

442005; AW975183; Hs.372210; ESTs, Weakly similar to S72482 hypotheti; none,none; 6.20
420256; U84722; Hs.76206; cadherin 5, type 2, VE-cadherin (vascula; cadherin,Cadherin_C_term;TM=Y;SS=M; 6.19
421379; Y15221; Hs.103982; small inducible cytokine subfamily B (Cy; IL8;TM=M;SS=Y; 6.17
440452; AI925136; Hs.55150; ESTs, Weakly similar to CAYP HUMAN CALCY; none;NA;NA; 6.17
421462; AF016495; Hs.104624; aquaporin 9; MIP;TM=Y;SS=M; 6.16

452960; AK001335; Hs.31137; protein tyrosine phosphatase, receptor t; Y_phosphatase,none; 6.15
410361; BE391804; Hs.62661; guanylate binding protein 1 interfere! BP,GBH_C;T Y ob-M e 13 

415765; NM_005424; Hs.78824; tyrosine kinase with immunoglobulin and ; EGF,fn3,ig,pkinase,laminin_EGF;TM=M;SS=Y; 6.12

430478; NM_014349; Hs.241535; apolipoprotein L, 3; MotA_ExbB;TM=Y;SS=M; 6.12

413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; none;TM=Y;SS=M; 6.09

446608; N75217; Hs.175622; ESTs; Armadillo_seg,HEAT_PBS;TM=V;S3=M; 6.0C 

430378; Z29572; Hs.2556; tumor necrosis factor receptor superfami; I L2;; 6.08 

426116; AA858729; Hs.144694; ESTs; none,none; 6.06

445033; AV652402 H 72901 lin-d ) nd othnase inhibitor 2B (p1; ank;; 6.05 

426721; AA383588; Hs.288545; ESTs, Weakly similar to T29012 hypotheti; zf-C2H2;TM=M;SS=N; 6.05

429228; AI553633; Hs.356828; ESTs; none,none; 6.05

421757 Z20897 H 29 3 Aryle t-ra e 6 34 

437669; AI368105; Hs.123154; ESTs, Weakly similar to match to ESTs AA; none,pkinase,pkinase_C; 6.03

419508; AW997938; Hs.90786; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y;SS=M; 6.02

428667; AI375550; Hs.346868; nucleolar protein p40; homolog of yeast ; none,none; 6.01

432731; R31178; Hs.287820; fibronectin 1; fn1,fn2,fn3,none; 5.95

446566; H95741; Hs.17914; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 5.95

450656; AA01 0539; Hs.18912; unnamed protein proL ct zl ' 2H2 ! 94 

418460; M26315; Hs.85258; CD8 antigen, alpha polypeptide (p3?) io rM , ~ M c 14 

424054; AA334511; Hs.26638; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 5.94

408048; NM_0072C2 H !' 22 \ kino (PRKA) ai hor protein 2 Parelemmin;TM M;SS=N;5.94 

438670; AI275803; Hs.123428; ESTs; none;NA;NA; 5.91 

424238; AA337401; Hs.137635; ESTs; none;TM=M;SS=M; 5.90 

444143; AW747996; Hs.160999; ESTs, Moderately similar to A56194 throm; Bcl-2,none; 5.89
423090; AA329648; Hs.23804; ESTs, Weakly similar to PN0099 son3 prot; ion_trans,IQ,none; 5.88
409799; D11928; Hs.76845; phosphosenne phocphol - like; Hydi lose;TM=M E I; 5.81 
407239; AA076350; Hs.67846; leukocyte immunoglobulin-like receptor, ; ig;TM=Y;SS=M; 5.81
411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; none;TM=Y;SS=M; 5.80 
1 IM.000 Hs.970 i Zar q - tap ypeptidefj ' rr T f 1 ' ' 3=M 
' 16 1 00037 Hs.2675! b Iheti al protein FLJ2 1 i V1 i j 

111 ' H I J I H I- TV MBS N 5 79 

443257; AI334040; Hs.11614; HSPC065 protein; trypsin;TM=M;SS=N; 5.76 
415801; R24219; Hs.278443; Fc fragment of IgG, low affinity lib, re; ig;TM=Y 884). 5.70 
435299; AI745458; Hs.343026; ESTs, Weakly similar to T20593 hypotheti; none;NA;NA; 5.69
415995; NM_004573; Hs.355888; phospholipase C, beta 2; C2,PI-PLC-Y,PI-PLC-X;TM=M;SS=N; 5.67
436772; AW975688; Hs.348918; metallothionein 1E (functional); 7tm_2,HRM,none; 5.67
431385; BE178536; Hs.11090; membrane-spanning 4-domains, subfamily A; none,none; 5.66
419833; AA251131; Hs.220697; Homo sapiens tryptophanyl-tRNA synthetas; WHEP-TRS,tRNA-synt_1b,none; 5.66
421859; AA356620; Hs.108947; KIAA0050 gene product; ank,PH,ArfGap;; 5.64 
40775& AA116021; Hs.38260; ubiquitin specific protease 18; UCH-1.UCH-2;; 5.63 
425354; U62027; Hs.155935; complement component 3a receptor 1 ; 7tm_1;TM=Y;SS=M; 5.63 
423533; NM_014339; Hs.129751; interleukin 17 receptor; none;TM=Y;SS=M; 5.63 
419577; L36531; Hs.91296; integrin, alpha 8; integrin_A,FG-GAP;TM=Y;SS=N; 5.61
452561; AI692181; Hs.49169; KIAA1634 protein; TPR,PDZ,WW,Guanylate_kin;TM=M;SS=N; 5.61
428677; AI6571 19; Hs.351502; troponin I, cardiac; none;TI**tSS=N; 5.60 

425509; AF079363; Hs.158213; sperm associated antigen 6; Armadillo_seg,HEAT_PBS;TM=M;SS=N; 5.58

453852;AW961818;Hs.211592;MUM2protein;pkinase,DAG_PE-bind pkina :,i ne;5.i 

121924;BE51 l4;Hs.109( 1 1 act Wing protein. 1A »,D4I h Tf/ f N 

448030; N30714; Hs.325960; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 5.55

431630; NM_002204; Hs.265829; integrin, alpha 3 (antigen CD49C, alpha ; FG-GAP,Rhabd_glycop,integrin_A;TM=Y;SS=M; 5.53 

410257; BE244044; Hs.61469; hypothetical protein; none,none; 5.53

441965; AA972712; H 2b I r i e.A i i rT PN Collagen; 5.52 

413934; U03056; Hs.75619; hyaluronoglucosaminidase 1; integrin_B,Glyco_hydro_56;; 5.52

424517; AI539443; Hs.137447; Homo sapiens cDNA FLJ12169 fis, clone MA; SH2,STAT,STAT_bind,STAT_prot,none; 5.50
447357; AI375922; Hs.132821; ESTs; FMO-like,FMO-like; 5.46 

422109; S73265; Hs.1473; gastrin-releasing peptide; Bombesin,Defensin_propep;TM=M;SS=M; 5.46
447033; AI357412; Hs.157601; Predicted gene: Eos cloned; secreted w/V; none,none; 5.45
417412; X16896; Hs.82112; interleukin 1 receptor, type I; ig,TIR;TM=M;SS=M; 5.45
436057; AJ004832; Hs.5038; neuropathy target esterase; cNMP_binding,ion_trans,Patatin;TM=Y;SS=M; 5.41
417497; AW402482; Hs.82212; CD53 antigen; transmembrane4;TM=Y;SS=M; 5.41
439285; AL133916; Hs.47860; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 5.40
452698; NM_001295; Hs.301921; chemokine (C-C motif) receptor 1; 7tm_1;TM=Y;SS=M; 5.40
443623; AA345519; Hs.9641; complement component 1, q subcomponent, ; C1q,Collagen;; 5.40
146272; BE26C91 Hs.1460 hematopoietic cel^ fic Lyi ibstrat; SH3.H rep;TM=f N; 5 
437275; AW975035; Hs.292396; ESTs, Weakly similar to A47582 B-cell or none Frizzled Fz 5 37 
419660; BE280337; Hs.194693; solute carrier family 7 (cationic amino ; aa_permeases;TM=Y;SS=M; 5.37
449853; AF006823; Hs.24040; potassium channel, subfamily K, member 3; ion_trans;TM=Y;SS=M; 5.36
442434 AA995,E, Hs. 129583; E I I r ic 53b
' "6; Hs.157313; ESTs; ICE_p20,DED,ICE_p10,ICE_p20,DED; 5.36
445333; BE537641; Hs.44278; hypothetical protein FLJ12538 similar to; rasMTK; 5.33
425638; NMJ12337; Hs. 158450; nasopharyngeal epithelium spec fic pr none;TM=l 3 5 
419034; NMJ02110; Hs.89555; hemopoieticcell kinan H H3,pkina5e;TM»MS N;5 
452416; AA026115; Hs.114777; ESTs; none,Porphobil_deam; 5.29

1 354; Hs.1551 OB. receplor (calcitonin) acti itym ^ n 3 ; now rM > ) 
440475; AI807571; Hs.24040; potassium channel, subfamily K, member 3; ion_trans,none; 5.28

417355; D13168; Hs.82002; endothelin receptor type B; 7tm_1,zf-C3HC4,fn3,SPRY,KRAB,zf-C2H2,rve,zf-B_box;TM=Y;SS=M; 5.28
436120; AI24B193; Hs.119860; ESTs; hemej.none; 5.27 

418307; U70867, Hsu -4 ol le -mily 21 (prostaglandin; OATP N.OATP C;TM=Y;SS=M; 5.27 

409745;AA077391;;gb:7B14E12Chromosome7FetalBraincDNA;7r 1 C3HC4.fr 3PF RAE I ^?H2,rve,zf-B box;TM=Y;SS=M' 5.26 
421^4 AV.137676 Hs.97775; ESTs; none rone; 5.2 

408308; AL033377; Hs.44197; hypothetical protein DKFZp564D0462; none,none; 5.22

410434; AF051152; Hs.63668; toll-like receptor 2; LRR,LRRCT,TIR;TM=M;SS=M; 5.21

421585; U95626; Hs.302043; chemokine (C-C motif) receptor-like 2 (; 7tm_1;TM=Y;SS=M; 5.19

400261; ; Hs.1802; Eos Control; ig,MHC_II_beta;TM=Y;SS=M; 5.19

436856 AI463355 I 1 1 i i If 1SS N r 18 

408761: AA057264; Hs.238936; EST- \ My simila (deflir.e n _,u 7tmJ,r.one;5.17 

425023;AW956889;Hs.154210;EDG-1 (endothelial differentia r , 7tm_1;TM=Y;S M c 16 

transporter 1, ATP-binding cassette, sub; ABC.tran.ABC membrane,SRP54,Thymidylale_kin;TM=Y;SS=M- 5 16 
451220; AF124251; Hs.26054; novel SH2-containing protein 3; SH2;TM=M;SS=N; 5.15
417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 5.14
424925; NM_002432; Hs.153837; myeloid cell nuclear differentiation ant; PAAD_DAPIN,HIN:; 5.14 
451099; R52795; Hs.25954; interleukin 13 receptor, alpha 2; fn3;TM=Y;SS=M; 5.13 
427509; M62505; Hs.2161; complement component 5 receptor 1 (C5a l; 7tm_1;TM=Y;SS=M; 5.12
423196; AK001866; Hs.125139; hypothetical protein FLJ11CC4 n TI M;£ N; 5.12 
433671; AW138797; Hs.132906; 19A24 protein; ig;TM=M;SS=M; 5.11

426457;AW894667;Hs.380138;chinphi ; himaerin) I DAG PE-bind,RhoG J» ,H2Tft=M,SS=N; 5.06 
431890; X17033; Hs.271986; integrin, alpha 2 (CD49B, alpha2 subur i jrir FG P;TM=-r =M; 5.0 
418185 AW958272 7 1 ihr adhesion molecule 2 (ICAM none,TW=Y,SS=M, 5 05 
437352; AL353957; Hs.284181; hypothetical protein DKFZp434P0531; DUF221;TM=Y;SS=M; 5.03
457918; AL359590; Hs.162604; hypothetical protein DKFZp762M186; PLDc;TM=M;SS=N; 5.02

452924 AW580939 ls.971 pier nt component Clq receptor; EGF 1 lectin_c,Tissue_f3c,Xlink,TIL;TM=Y;SS=M; 5.02 
426535; AU077012; Hs.288582; ESTs, Weakly similar to ubiquitous TPR m; Kunitz_BPTI,Kunitz_BPTI,7tm_2,HRM; 4.99
432805; X94630; Hs.3107; CD97 antigen; 7tm_2,EGF,GPS Fe CC 1 1, , M;4.9! 
434883; AW381538; Hs. 19807; fiypolheii protein M 129E H , FH > ,A< RhnbAP 4.95 
14291 89619; Hs 1301 otein-coupled ptor tm If 
4 31; BE3130 H 1 ESTs. Weakl mil r j mI L HUM til rrni; 4.9 

451154 AA015879 H L T T MP rrne; 4.92 

435730; AB020635; Hs.4984; KIAA0828 protein; MoHcyase,TrkA-N,2-Hacid DH_C;TM=M;SS=N; 4.90 
413011; AW068115; Hs.821; biglycan; LRR,LRRNT;; 4.90

' » 1" H I I, in, i i r i,i i.ir-j .2 rlpl r j 1 ' 

417015; M83772; Hs.80876; flavin containing monooxygenase 3; FMO-like,pyr_redox;TM=Y;SS=M; 4.88
412773 H15785 H- 74 r nilarto vaccin n Hindlll 1L OR r Do TM tj SS-N 4.88 
4 3 r > , " 781 F 594 eucine-rich repe kor.l nine G protein; 7tm_1,Lf ^TM SS 1:4.6/ 
405102; ; ; C15001220':gi|4469558|gb|AAD21311.1| (AF; DAG PE-bind,PH,RhoGEF,DC1;; 4.86 
333109; I 1 t f M II Tf I 1 

432581; AU076455; Hs.278441; KIAA0015 gene product; PP2C;TM=M;SS=N; 4.83
414935; C14774; ; gb:C14774 Clontech human aorta polyA mRN; ank,pkinase,death,none; 4.82 
430152; AB001325; Hs.234642; aquaporin 3; MIP;TM=Y;SS=M; 4.82 
444B3B;AV651680;Hs 208553 ESI int jrin_ F J\F none; 4.81 

410423; AW402432; Hs.63489; protein tyrosine phosphatase, non-recept; SH2,Y_phosphatase,DSPc;TM=M;SS=N; 4.81

453107; NM_016113; Hs.279746; vanilloid receptor-like protein 1; ank,ion_trans;TM=Y;SS=N; 4.80

433376; AI249361;Hs.74122;caspase 4. apoptn n t^J teir.e pr; ( aI r i , | E_p2D ,4 80 

422010; AA302049;Hs,31181; Homo apien-cDN FLJ23 fe one C; nor.e.SDF uoar_tr;4.7B 

419542; AA366037; Hs.90911; solute carrier family 16 (monocarboxylic; none,none; 4.76

438899; AF085833; Hs. 135624; ESTs; none PI3 Fil r Ti F I _ PI3K_rbd,PI p65B;4.75 

427418; AA402587; Hs.356667; IAT1-3TM protein; none,none; 4.75 

431924; AK000850; Hs.272203; Homo sapiens cDNA FLJ20843 fis, clone AD; SH3,none; 4.73

424218; AF031824; Hs.143212; cystatin F (leukocystatin); cystatin;; 4.72

414888; AL039185; Hs.77558; thyroid hormone receptor interactor 7; HMG14_17,none; 4.72

416178; AI808527; Hs.192822; serologically defined breast cancer anti; none;TM=M;SS=N; 4.71

430037; BE409649; Hs.227789; mitogen-activated protein kinase-activat; pkinase;TM=M;SS=N; 4.71

451527; AF022813; Hs.26518; transmembrane 4 superfamily member 7; none,none; 4.71

453870; AW385001; Hs.8042; Homo sapiens cDN A: FLJ 23 1\' ti clone L: FG M inlegi n_A N F 471 

408113; T82427; Hs.194101; Homo sapiens cDNA: FLJ20859 fis, clone A; 7tm_3,none; 4.70

438543; AA810141; Hs.192182; ESTs; SH2,pkinase,none; 4.70 

i™???' ^ 77260; ^• 153924 ; death-associated protein kinase 1; ank,pkir,ase,death,SPRY,SAP,RibosomaLL24e,SRP54,dDENN,DENN,uDENN;TM=M;SS=N; 4.70 
438113; AI467908; Hs.8882; ESTs; 7tm_1,none; 4.70

422164; NM.014312; H- 112 77; cort c al In/moc/te receptor (X, laevis ; ig,GeminLmov;TM=Y;SS=M; 4.69 
414482; S57498; Hs.76252; endothelin receptor type A; 7tm_1;TM=Y;SS=M; 4.69
425069; AA687465; Hs.2981 84; potassium voltage-gated channel st 1 
432314; AA533447 Hs.2851 I ESTs <li I none 4 66 

453518; AW503205, Hs.27268; gb:UI-HI BNO al t-g-D3.0-UI.r1 NIH_MGC.50; SH3,FH,RhoGEF;TM=M;SS=N; 4.66 
418613; AA744529; Hs.86575; mitogen-activated protein kinase kinase ; pkinase,CNH;TM=M;SS=N; 4.66
446063; AI720140, Hs.1 51 079; EST 3 ^Channel.none; 4.65 
454034; NM_000691; Hs.575; aldehyde dehydrogenase 3 family, member ; aldedh;; 4.65
431441; U81951; Hs.2794; sodium channel, nonvoltage-gated 1 alpha; ASC;TM=Y;SS=N; 4.65
443402; U77845; Hs.9295; elastin (supravalvular aortic stenosis, ; none,PDZ,LIM,pkinase; 4.65
414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); PA;TM=Y;SS=N; 4.64
427535; R29543; Hs.2164; pro-platelet basic protein (includes pla; IL8;TM=M;SS=M; 4.64
437119; AI379921; Hs.177043; XP_171387 similar to rhotekin; none,none; 4.63
411779; AA292811; Hs.72050; non-metastatic cells 5, protein expresse; NDK;; 4.63
429784; M89796; Hs.30; membrane-spanning 4-domains, subfami y j n, r r TI ~ /
415934; NM_000928; Hs.992; phospholipase A2, group IB (pancreas); phoslip;; 4.61

408873; AL046017; Hs.356216; calmodulin 2 (phosphorylase kinase, delt; none,none; 4.61

426432; AF001601; Hs.169857; paraoxonase 2; Arylesterase;TM=M;SS=N; 4.59

444805; AB007899; Hs.12017; homolog of yeast ubiquitin-protein ligas; WW,HECT,RNA_pol_A,none; 4.59

408000; L11690; Hs.198689; bullous pemphigoid antigen 1 (230/240kD); efhand,spectrin,GAS2,SH3 Pieclin,RA,XyteeJsom,FriD,bZIP,Tropomyosin,Myc-LZ,M,ldh_C,CH,AIP3;TM=M;SS=N; 4.59
431087; H12723; Hs.290791; ESTs; ion_trans,none; 4.58

425466; L18964; Hs.1904; protein kinase C, iota; pkinase,DAG_PE-bind,pkinase_C,OPR;TM=M;SS=N; 4.58
422427; AA310514 n if t H H i , in ra_channel_B,none;4.57 

1 W19504; Hs.78 i arrierfam 1 (organic anio iATP_N,OATP_C;Tf u * 56 

416464; NM_000132; Hs.79345; coagulation factor VIII, procoagulant co; Cu-oxidase,F5_F8_type_C;; 4.55
421233; AA209534; Hs.284243; tetraspan NET-6 protein; transmembrane4;TM=Y;SS=M; 4.56
422311; AF073515; Hs.114948; cytokine receptor-like factor 1; fn3;TM=M;SS=N; 4.55
444895; AI674383; Hs.22891; solute carrier family 7 (cationic amino ; ASC,death,TNFR_c6; 4.55
428141; D50402; Hs.182611; solute carrier family 11 (proton-coupled; Nramp;TM=Y;SS=N; 4.55
410290; AA402307; Hs.322844; hypothetical protein DKFZp564Ar6 Sem PSI.TI le nn_B TM"Y,SS=M; 4.54 
426437; BE076537; Hs.169895; ubiquitin-conjugating enzyme E2L6; Armadillo_seg,UQ_con,none; 4.54
450086; AW015343; Hs.233301; ESTs; ank,death,ZU5,NMU,none; 4.54 

438209; AL120659; Hs.6111; aryl-hydrocarbon receptor nuclear transf; HLH,PAS,IL8;TM=M;SS=N; 4.54
414788; X78342; Hs.77313; cyclin-dependent kinase (CDC2-like) 10; pkinase;TM=M;SS=N; 4.53
429109; AL008637; Hs.196352; neutrophil cytosolic factor 4 (40kD); SH3,OPR,PX;TM=M;SS=N; 4.53
427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;; 4.53
411213; AA676939; Hs.69285; neuropilin 1; MAM,F5_F8_type_C,CUB,CUB,MAM,F5_F8_type_C; 4.53
434158; T86534; Hs.14372; ESTs; adenylatekinase,none; 4.52

431941; AK000106; Hs.272227; Homo sapiens cDNA FLJ20099 fis, clone CO; pkinase,Furin-like,Recep_L_domain,none; 4.52

447341; AF106941; Hs.18142; arrestin, beta 2; arrestin,arrestin_C,PX,PH,PLDc; 4.52

447656; NM_003726; Hs.19126; src kinase-associated phosphoprotein of ; SH3,PH;TM=M;SS=M; 4.51

417018;M16038;Hs.80887;v-yes-1Yamaguchisarcomaviralrelat J H Hj r i r rM-M N; 4.51 

422803 XQ8411 lis 3300^ m/o mir I v ih 1 I'M' J 4 

407202; N58172; Hs.109370; ESTs; F5_F8_type_C,pkinase,Ets,none; 4.51

447079; AA280057; Hs.105280; ESTs, Weakly similar to dJ963K23.2 (H.sa; zf-C2H2,zf-C3HC4,UIM;TM=M;SS=N; 4.51 
450747; AI064821; Hs.129953; ESTs, Highly similar to 1818357AEWSgen; rrm,zf-RanBP,GAS2; 4.50 
419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; ig,pkinase;TM=Y;SS=M; 4.50
453856; AA804789; Hs.379109; PDZ-LIM protein m, r LIM,PDZ;Tt ' N;4.49 
432744; AA988835; Hs.38654; ESTs; none,none; 4.49

419032; W81330; Hs.99877; ESTs, Highly similar to JAK3B [H.sapiens; pkinase,SH2,Insulin,pkinase,SH2; 4.48
444009; AI380792; Hs.135104; ESTs; TNFR_c6,TIL,none; 4.48 

42641 / 1 14; I 6 killer cell led e j r !c t f II 4 

412802; U41518; Hs.74602; aquaporin 1 (channel-forming integral pr; MIP;TM=Y;SS=M; 4.48

447217; BE46575-1; Hs.17773; neuropilin 2; CUB,MAM,F5_F8_type_C;TM=M;SS=M; 4.47

408771; AW732573; Hs.47584; potassium voltage-gated channel, delayed; efhand,ion_trans,K_tetra,none; 4.47

435049; AL122067; Hs.4746; hypothetical protein FLJ21324; none;TM=M;SS=N; 4.46

413278; BE563085; Hs.833; interferon-stimulated protein, 15 kDa; ubiquitin;; 4.45

423804; AW403448; Hs.1706; interferon-stimulated transciip Fact IRF,zf-C3l l(X'BR,zf-RanBP;TM=M;SS=N; 4.45 

434308; N51517; Hs.47282; ESTs; pkinase,pkinase_C,none; 4.45 

434448; W26667; Hs.184581; Homo sapiens cDNA FLJ14821 fis, clone OV; pkinase,pkinase_C;; 4.45

417426; NM_002291; Hs.82124; laminin, beta 1; laminin_EGF,laminin_Nterm,integrin_B;; 4.44

417389; BE260964; Hs.82045; midkine (neurite growth-promoting factor; PTN_MK;TM=M;SS=Y; 4.44

430259; BE550182; Hs.375142; RalGEF-like protein 3, mouse homolog; fn3,RA,RasGEF;TM=M;SS=M; 4.44

436001; AW903849; Hs.173840; HUEL (C4orf1)-interj..unn protein; ig;TM *;SS=M; 4 J4 

452355; N54926; Hs.29202; G protein-coupled receptor 34; 7tm_1,OATP_C;TM=Y;SS=N; 4.43 

418751; BE389014; Hs.372548; phosphoinositide-3-kinase, regulatory su; SH2,none; 4.43

410060; AI633888; Hs.53435; FYN-binding protein (FYB-120/130); SH3;TM=M;SS=N;4.43 

449961; AW255634; Hs.133100; ESTs; pkinase,Furin-like,Recep_L_domain,none; 4.42

451734; NM_006176; Hs.26944; neurogranin (protein kinase C substrate,; IQ,7tm_1;TM=M;SS=N; 4.42

410598; AI817130; Hs.9195; Homo sapiens cDNA FLJ13698 fis, clone PL; RasGEF,PRK; 4.42

439411; AA044876; Hs.58043; ESTs, Weakly similar to CYA2_HUMAN ADENY; guanylate_cyc;TM=Y;SS=M; 4.42

433179; AW362945; Hs.162459; ESTs; Armadillo_seg,none; 4.42

414849;AW372721,lls?9i:2 J ESTs A'eaM) >-..ir, to in named protein ; pkinase,none; 4.42 

409512; AW979187; Hs.293591; melanoma differentiation associated prot; DEAD,helicase_C,CARD;TM=M;SS=N; 4.41

445903; AI347487; Hs.132781; class I cytokine receptor; fn3;TM=Y;SS=N; 4.41 

438507; AA809052; Hs.356627; ESTs; none,none; 4.41

409524; AW402151; Hs.54673; tumor necrosis factor (ligand) superfami; TNF;TM=Y;SS=M; 4.40
453037; AA045175; Hs.17914; ESTs; none;TM=Y;SS=M; 4.40

412228; AW503785; Hs.73792; complement component (3d/Epstein Barr vi; sushi;TM=Y;SS=M; 4.40
451035; AU076785; Hs.430; plastin 1 (I isoform); efhand,CH,Adaptin_N;; 4.40
415149; X12451; Hs.78056; cathepsin L; Peptidase_C1;; 4.39

400105; AW152237 H 70977 Ts, Weakl r I 1 .0022 hypolheti; Y^phosphatase,carb_anhydrase,DSPc,none; 4,39 

423099; NM_002837; Hs.123641; protein tyrosine phosphatase, receptor t; fn3,Y_phosphatase,DSPc,COX5C;TM=M;SS=M; 4.39 

438330; AW450572; Hs.257316; ESTs; pkinase,zf-C4,ERM,CNH,none; 4.39 

433437; U20536; Hs.3280; caspase 6, apoptosis-related cysteine pr; ICE_p10,ICE_p20;; 4.39

429747; M87507; Hs.2490; caspase 1, apoptosis-related cysteine pr; CARD,ICE_p10,ICE_p20;; 4.39

426410; BE298446; Hs.305890; BCL2-like 1; Bcl-2,BH4,none; 4.38

434511; R28982; Hs.18106; ESTs; pkinase,Glyco_hydro_39; 4.38

448888; AW196663; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;SS=N; 4.37 

447827; U73727; Hs.19718; protein tyrosine phosphatase, receptor t; fn3,ig,Y_phosphatase,MAM;TM=Y;SS=M; 4.36

432583; AW023624; Hs.162282; potassium channel TASK-4; potassium char; ion trans,X;TM=Y;SS=M; 4.36 

172; BE Hs.7 I mly 1 (glial higl F ! \ 

426828; NM_fJ00020 Hs 172670, activm A receptc r type I Mike 1 f L r e ivin_recp;TM^M 
449444; AW818436; Hs.351306; solute carrier family 16 (monocarboxylic; none;TM=Y;SS=M; 4.36
437145; AF007216; Hs.5452; solute carrier family 4, sodium bicarbon; HCO3_cotransp;TM=Y;SS=N; 4.36
429670; L01087; Hs.211593; protein kinase C, theta; DAG_PE-bind,pkinase,pkinase_C,DNA_pol_viral_N,PHD,DC1;TM=M;SS=N; 4.35
421195; BE464560; Hs.133017; ESTs; none,none; 4.35
415758; BE270465; Hs.78793; protein kinase C, zeta; pkinase,DAG_PE-bind,pkinase_C,OPR;; 4.35

157001; JO ! n D(1 hyd itamin D3) re; horrnon f-C4,Metallot TM-M 1 . 

419150; T29618; Hs.89640; TEK tyrosine kinase, endothelial (venous; EGF,fn3,pkinase,ig,laminin_EGF,DSL;TM=Y;SS=M; 4.34

440675; AW005054; Hs.2797BB; ESTs, Weakly similar to kCC1_HlW.AN CALCl; pkm.asex.one; 4.34 

429657; D13626; Hs.2465; KIAA0001 gene product; putative G-protei; 7tm_1;TM=Y;SS=M; 4.34 

414509; AW161311; Hs.76294; CD63 antigen (melanoma 1 antigen); transmembrane4;TM=Y;SS=M; 4.34

125771; BE561776; Hs.159494; Bruton agammaglobulinemia tyrosine kinas; SH2,SH3,pkinase,PH,BTK;TM=M;SS=N; 4.34

452124; AA45422C H 11 [ ' la e none; 4.33 

407775; NM_004914; Hs.38772; RAB36, member RAS oncogene family; ras,arf;TM=M;SS=N; 4.33
452688; AA721140; Hs.49930; ESTs, Weakly similar to putative p150 [H; SH3,none; 4.33
434164; AW207019; Hs.148135; serine/threonine kinase 33; pkinase;TM=M;SS=N; 4.32
445330; R52656; Hs.21691; ESTs; 7tm_1,none; 4.32
437527; AI241019; Hs.145644; ESTs; PIP5K,none; 4.32 

437763; AA469369; Hs.5831; tissue inhibitor of metalloproteinase 1; TIMP,pkinase,DAG_PE-bind,RBD; 4.31
416714; AF283770; Hs.79630; CD79A antigen (immunoglobulin-associated; ig,ITAM,Zn_clus;TM=Y;SS=M; 4.31
416269; AA177138; Hs.161671; ESTs; pkinase,DAG_PE-bind,RBD,none; 4.30
425458; H89317; Hs.182889; ESTs; ion_trans,none; 4.30

424206; NM_003734; Hs.198241; amine oxidase, copper containing 3 (vasc; Cu_amine_oxid,Cu_amine_oxidN2,Cu_amine_oxidN3;TM=M;SS=M; 4.29

451876; T63141; ; gb:yb99a12.s1 Stratagene lung (937210) H; SH3,none; 4.29 

417801; AA417383; Hs.82582 mteyiir L il» 1 (™ i E( 'IifereD EGF;;4.29 

435240; AI025435; Hs.117532; ESTs; GHMP_kinases,none; 4.27

444051; N48373; Hs.10247; activated leucocyte cell adhesion molecu; none,none; 4.26 

423523; AW299828; Hs.193580; ESTs; none,none; 4.26

426274; D38122; Hs.2007; tumor necrosis factor (ligand) superfami; TNF;TM=Y;SS=N; 4.26 

425356 BE244879 H I 3; inositol polyphosphal phosphot 14 E> o >ndo_ F ho- SH2;TM=M;SS=N; 4.26 

4483B6; AB037750; Hs.21 061 klA*r2 i in; PKD.Bt Ttv i "S M 4 2" 

427274; NM_00S21 1; Hs. 174142; colony sfimulaling factor 1 rcccr f rl n Tf M;4.26 

n H j ^ i n i r i rf 

436729; BE621807; Hs.351316; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 4.25
436494; AA720997; Hs.128295; ESTs; none,CAP_GLY,HCO3_cotransp,Glyco_hydro_63,PH; 4.24
439668; AI091277; Hs.302634; frizzled (Drosophila) homolog 8; Frizzled,Fz,7tm_2,toxinJ;TM=Y;SS=M; 4.24
418255; AW135405; Hs.37251; ESTs; pkinase,none; 4.24
400328; X87344 transporter 2 ATP birding a e, sub; none;Tf Ml 4 
405121; ; ; mitogen-activated protein kinase 8 inter; Cys_knot,TGF-beta,vwa,vwc,vwd,TIL,DUFISg;; 4.24
425795; AJ000479; Hs.159543; EDG-6 (endothelial differentiation, G-p; 7tm_1;TM=Y;SS=M; 4.23 
406786; AW161678; Hs.111334; ferritin, light polypeptide; ferritin;TM=M;SS=N; 4.23
449843; R85337; Hs.24030; solute carrier family 31 (copper transpo; none;TM=Y;SS=M; 4.23
445657; AW612141; Hs.279575; Homo sapiens G-protein coupled receptor ; 7tm_1;TM=Y;SS=M; 4.23
413795, AL040178 H- 142003 EST none.pkinase.LRR.LRR I I 

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromoso; ABC_tran,M,SMC_N,SMC_C,DUF164,none; 4.22

447887; AA114C50; Hs.211610; caspase 8, apoptosis-related cysteine pr; ICE_p10,ICE_p20,DED;TM=M;SS=N; 4.22

417318; AW953937; Hs.240845; ESTs; SH3,PH,RhoGEF;;4.21 

424291; AL120051; Hs.144700; ephrin-B1; Ephrin;TM=Y;SS=M; 4.21

408279; AF216965; Hs.44095; Homo sapiens, clone MGC:12617, mRNA, com; none,none; 4.20 

432636; AA340864; Hs.278562; claudin 7; PMP22_Claudin;TM=Y;SS=M; 4.20 

424618; 129472; Hs.1802; major histocompatibility complex, class ; ig,MHC_II_beta;TM=Y;SS=M; 4.19

445633; AI453386; Hs.17287; ESTs, Weakly similar to S26689 hypotheti; IRK,none; 4.19

432882; NM_013257; Hs.279696; serum/glucocorticoid regulated kinase-li; pkinase,PX,pkinase_C;; 4.19

425481; AW978162; Hs.372811; ESTs; none,Oxysterol_BP; 4.19

429061; Y14039; Hs.195175; CASP8 and FADD-like apoptosis regulator; ICE_p20,DED;TM=M;SS=N; 4.18
401083; ; ; NM_016582':Homo sapiens peptide transpor; PTR2;TM=Y;SS=M; 4.18 
420676; AI434780; Hs.4248; vav 2 oncogene; RhoGEr r H ^ H L P H rAG_PE Lind.none; 4.18 
424377; AF081675; Hs.146322; killercell !ecUn-like receptorsubfan le tir hi i SS M 1 17 

111 IT I 4 r 1 1 i r l rr EE r n II I Tf M;4.1 

421391; AW304350; Hs.191958; immunoglobulin superfamily receptor tran; ig,none; 4.17
452100; AI668658; Hs.379032; inositol polyphosphate-5-phosphatase, 75; Exo_endo_phos,RhoGAP,none; 4.17 
413969; X14034; Hs.75648; phospholipase C, gamma 2 (phosphatidylin; SH2,SH3,C2,PH,PI-PLC-Y,PI-PLC-X,PDGF;; 4.17 
422310; AA316622; Hs.98370; cytochrome P450, subfamily IIS, polypept; none,pkinase,fn3,ig; 4.17 
444034; AL161957; Hs.10177; pleckstrin homology domain interacting p; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase,Ribosomal_S15,bromodomain,WD40;TM=M;SS=N; 4.16
450056; BE047394; Hs.502; ESTs, Weakly similar to S71512 hypotheti; ABC_tran,ABC_membrane,ig,MHC_II_beta,SRP54,proteasome,ABC_membrane,ABC_tran; 4.16
407245; X90568; Hs.172004; titin; fn3,ig,SGXXSG,pkinase;TM=M;SS=N; 4.16
418962; AA714835; Hs.271863; ESTs; RhoGAP,SH2,pkinase,POLO_box,none; 4.15
410590; BE515216; Hs.64746; chloride intracellular channel 3; none;TM=M;SS=N; 4.15
425743; BE396495; Hs.159428; BCL2-associated X protein; Bcl-2;TM=Y;SS=N; 4.15 
446957; AI699629; Hs.166781; ESTs; none,none; 4.14
432176; AW090386; Hs.112278; arrestin, beta 1; arrestin,arrestin_C,none; 4.14
452571; W31518; Hs.34665; ESTs; none;TM=M;SS=N; 4.14 

425421; L11669; Hs.157145; tetracycline transporter-like protein; sugar_tr;TM=Y;SS=M; 4.14
410687; U24389; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;; 4.14
417871; AA521368; Hs.24252; ESTs; tBB,Armadillo_seg,none; 4.13

429819; AL133011; Hs.253920; Homo sapiens mRNA; cDNA DKFZp434P201 (fr; none,none; 4.12

424522; AL134847; Hs.149957; ribosomal protein S6 kinase, 90kD, polyp; pkinase,pkinase_C;; 4.12

429623; NM_005308; Hs.211569; G protein-coupled receptor kinase 5; pkinase,RGS;TM=M;SS=N; 4.12

413019; BE281604; Hs.75140; low density lipoprotein-related protein-; none;TM=M;SS=Y; 4.12 

434071; AF116653; Hs.34192; Homo sapiens PRO0823 mRNA, complete cds; none;TM=M;SS=N; 4.11 

434779; AF153815; Hs.50151; potassium inwardly-rectifying channel, s; IRK;TM=Y;SS=N; 4.11

449656; AA002008; Hs.188633; ESTs; PIP5K,none; 4.11

406403; ;; NM_002162*;Homo apier ile.cellular ad TM-Y SS-M 410 

427732; NM_002980; Hs.2199; secretin receptor; 7tm_2,HRM;TM=M;SS=M; 4.10

437608; AA761605 Hs 292308 EST LUIJMANAU ra RI01,none; 4.10 

432885; AA595607; Hs.368129; ESTs, Weakly similar to ALU1_HUMAN ALU S; pkinase,pkinase_C,none; 4.10
411190; AA306342; Hs.69171; protein kinase C-like 2; pkinase,pkinase_C,HR1;TM=M;SS=N; 4.10

418342 BE002723 Fr i r; ICE DED,ICE_p1 K E_p20 DED;4.10 

424909; S78187 H- U lid i cycle 25B; Rhodanese;; 4.10 

435905; AW997484; Hs.5003; KIAA0456 protein; SH3,RhoGAP,FCH;TM=M;SS=N; 4.10 

422278; AF072873; Hs.114218; frizzled (Drosophila) homolog 6; Fz,Frizzled,7tm_2;TM=Y;SS=M; 4.10

437952, D63209, Hs 591 olut rier family 11 (proton jrle nor I M=Y SS=M; 4.10 

432827; Z68128, Hs 31C- Rho GTI - tualmq protein 4; FCH,RhoGAP,SH3:TM=M;SS=N; 4.09 

435140; AA6681J" Hs. 13417 ESI n-ne none; 4.09 

422627; BE336857; Hs.118787; transforming growth factor, beta-induced; Fasciclin,ABC_tran,ABC_membrane,GTP_EFTU;TM=M;SS=M; 4.08
428483; AI908539; Hs.184592; KIAA0344 gene product; none,none; 4.08
446232; AI281848; Hs.194691; retinoic acid induced 3; 7tm_3,none; 4.07
431674; AA098901; Hs.301642; G-protein coupled receptor; none,GCV_H; 4.07 

409686; AK000002; Hs.55879; Homo sapiens mRNA; cDNA DKFZp434L0627 (f; ABC_tran,ABC_membrane;TM=M;SS=M; 4.07

441518; AW161697; Hs.294150; ESTs; Y_phosphatase,DSPc,none; 4.07 

442599; AF078037; Hs.324051; RelA-associated inhibitor; SH3,ank;TM=M;SS=N; 4.06

436982; AB018305; Hs.5378; spondin 1, (f-spondin) extracellular mat; tsp_1,Reeler;; 4.05

420361; N92054 Hs.194718 zini nge t mtein 265; zf-RanBP,7tmJ; 4.05 

439549; AW937885; Hs.137314; ESTs; SH2,none;4.04 

419981; AA897581; Hs.128773; ESTs; pkinase,DAG_PE-bind,pkinase_C,OPR,none; 4.04
418836; AI655499; Hs.161712; ESTs; pkinase,Activin_recp,PDZ,ZU5,death; 4.04 

408806; AW847814; Hs.75608; Homo sapiens cDNA FU; 1532 1 o, clone (' SH3,PDZ,Guanyla'.6_kin,none; 4.04 
432106; N58323; Hs.269098; ESTs, Weakly similar to RETROVIRUS-RELAT; SH3,PDZ,Guanylate_kin,none; 4.03
426085; T94907; Hs.188572; ESTs; PH,Ets,CH,spectrin,Ca_channel_B,none; 4.03
418203; X54942; Hs.83758; CDC28 protein kinase 2; CKS;; 4.03 

412270; AC005262; Hs.73797; guanine nucleotide binding protein (G pr; G-alpha,arf;TM=M;SS=N; 4.03
416350; AF188625; Hs.189507; phospholipase A2, group IID; phoslip;TM=M;SS=Y; 4.02
434457; AF141332; Hs.200333; apolipoprotein B48 receptor; none;TM=M;SS=N; 4.02 

414271; AK000275; Hs.75871; protein kinase C binding protein 1; bromodomain,PHD,PWWP,zf-MYND;TM=M;SS=N; 4.02

425694; U51333; Hs.159237; hexokinase 3 (while If inase.hexoki If MSSN402 

449943 AF1042bb ri 4212;! op „i n 71m J,GPS,GaLLectin,OLF,Latrophiliri,HRM;TM=Y;SS=M; 4.01 

408938; AA059013; Hs.22607; ESTs; fn3,Y_phosphatase,carb_anhydrase,none; 4.01

426839; M74782; Hs.172689; interleukin 3 receptor, alpha (low affin; none;TM=M;SS=M; 4.00 

422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 4.00 

410726; AI623E59, H 1 5936; EST a r :se pra isomerase.none; 4.00 

428318; BE300110; Hs.183842; ubiquitin B; lipocalin,aldedh,ubiquitin,IRK;; 4.00

440188; AK001812; Hs.7038; N-Acetylglucosamine kinase; ROK;TM=M;SS=N; 3.99

429952; AF080158; Hs.226573; inhibitor of kappa light polypeptide gen; pkinase,ubiquitin,Enterotoxin_A,PHO4,pkinase,ubiquitin; 3.99
414700; H63202; Hs.38163; ESTs; 7tm_1;TM=Y;SS=M; 3.99

432269; NM_002447; Hs.2942; macrophage stimulating 1 receptor (c-met; pkinase,Sema,PSI,TIG,A4_EXTRA;TM=M;SS=M; 3.99
456362; AW973003; Hs.179909; hypothetical protein FLJ22995; none;TM=M;SS=N; 3.98
427541; AI798983; Hs.375835; solute carrier family 35 (CMP-sialic aci; none,none; 3.98
440248; M876138; Hs.369458; ESTs; SH2,none; 3.98 

43740Q; AB011542; Hs.5599; EGF-like-domain, multiple 5; TNFR_c6,laminin_EGF;TM=Y;SS=N; 3.98
425262; D87119; Hs.155418; GS3955 protein; pkinase;; 3.98 

420166; AW732276; Hs.95583; transmembrane 4 superfamily member (tetr; transmembrane4;TM=Y;SS=M; 3.98

437151; AA745610; Hs.380121; BANP homolog, SMAR1 homolog; none,none; 3.98

443574; U83993; Hs.321709; purinergic receptor P2X, ligand-gated io; P2X_receptor;TM=Y;SS=M; 3.97

I 11 1c i f o II ^ " E I g ! ' f 

411574; BE242842; Hs.6780; protein tyrosine kinase 9-like (A6-relat; LRR,LRRCT,TIR,cofilin_ADF;TM=M;SS=N; 3.97
432639; AW973785; ; gb:EST385886 MAGE resequences, MAGM H no;none,IRi 9; 
457675; AF119917; Hs.306574; Homo sapiens PRO3098 mRNA, complete cds; none;; 3.97
445701; AF055501; Hs.13131; lymphocyte adaptor protein; SH2,PH;TM=M;SS=N; 3.96
437157; BE048860; Hs.17287; ESTs; IRK,none; 3.96 

453641; AA444140; Hs.90960; ESTs; Cbl_N,Cbl_N2,Cbl_N3,UBA,zf-C3HC4,none; 3.96
446714; W73818; Hs.110028; ESTs; 7tm_1,7tm_1; 3.96

427648; AI376722; Hs.180062; proteasome (prosome, macropain) subunit,; proteasome;; 3.96 

453686; AL110326; Hs.304679; ESTs, Moderately similar to Z195_HUMAN Z; none,lectin_c,lig_chan; 3.96

457718; F18572; Hs.22978; ESTs, Weakly similar to ALU4_HUMAN ALU S; pkinase,pkinase; 3.95

428727; AF078847; Hs.78452; qeneral trarscriptlon factor II H, polype; PH04,LIM;TM=M S=t 

435411; AW444619; Hs.138211; ESTs; none,pkinase; 3.94

440209; H05049; Hs.247837; neurexin 3; laminin_G,EGF,none; 3.94

415536; N32536; Hs.42645; solute carrier family 16 (monocarboxylic; none,none; 3.94

435272; AA906415; Hs.110041; ESTs; none,pkinase; 3.93

402550; ; ; Target Exon; none,none; 3.93 

425233; Z17861; Hs.155218; E1B-55kDa-associated protein 5; SPRY,SAP,pkinase,fn3,ig; 3.93
410073; AW408163; Hs.58488; catenin (cadherin-associated pot in latrirnin,V;r 
453548 AL079983 Hs 116774 ml n, alpha 1; none, vwa 1 nteqnr_A 392 
417225; AW505054; Hs.4283; ESTs; pkinase,RGS,PH,myosin_head,Myosin_tail; 3.92

1675 1 I 61 54 i hreoi kinase 10; pkinase, TYA;TM~M M; 3.9 

452344; AI264357; Hs.55405; hypothetical protein MGC16212; Sulfate_transp,STAS;; 3.92

418516; NM_006218; Hs.85701; phosphoinositide-3-kinase, catalytic, al; PI3_PI4_kinase,PI3Ka,PI3K_C2,PI3K_rbd,PI3K_p85B,none; 3.91

423069; W15613; Hs.1613; adenosine A2a receptor; 7tm_1;TM=Y;SS=M; 3.91

414443; AU077268; Hs.76144; platelet-derived growth factor receptor,; ig,pkinase;TM=Y;SS=N; 3.91

434392; AW983709; Hs.250824; Homo sapiens cDNA FN ) lore H phnase,none; 3.91 

429515; AF258527; Hs.211562; ATP-binding cassette, sub-family A (ABC1; ABC_tran;TM=Y;SS=M; 3.91

414774; X02419; Hs.77274; plasminogen activator, urokinase; kringle,trypsin,plant_thionins;; 3.91

442831; AI798959; Hs.131686; ESTs; ABC_tran,PRK,ABC_tran; 3.91

441657; BE314696; Hs.7936; BAH associaTed protein H3;TM=M n 

435698; AW297855; lis. 361 171 E T VV ty similar io B0 iheti lipoxygenase,PLAT,none;3.90 

447560; AF065214; Hs.18858; phospholipase A2, group IVC (cytosolic, ; PLA2_B;TM=M;SS=N; 3.90

437897; AA770561; Hs.146170; hypothetical protein FLJ22969; zf-DHHC,none; 3.89

429379; NM_014840; Hs.200598; KIAA0537 gene product; pkinase,RIO1;TM=M;SS=N; 3.89

410179; W27723; Hs.59498; cell division cycle 2-like 5 (cholineste; pkinase;; 3.89
428713; AA432067; Hs.268551; ESTs, Moderately similar to CYA4_RAT ADE; pkinase;; 3.89
456629; AW891965; Hs.367942; histone deacetylase 3; HSP90,HATPase_c,zf-C2H2,PHD,none; 3.89
425190; AW028302; Hs.155079; protein phosphatase 2, regulatory subuni; B56;TM=M;SS=N; 3.89

j>;.7;; : h= i-;;oi nr t-. - n r;; ; •; :-Tr.'=M;ss=N.3.89 

417767 BE242241 I 1 io rj trcphil); I DSL;TM=M;SS=M; 3.88 

414029; BE297731; Hs.75709; mannose-6-phosphate receptor (cation dep; Man-6-P_recep;TM=M;SS=M; 3.88 
416140; AI918035; Hs.301198; roundabout(axon guidance receptor, Dros; none,none; 3.88 
434224; AA380731; Hs.84; interleukin 2 receptor, gamma (severe co; fn3;TM=Y;SS=M; 3.88
410011; AB020641; Hs.57856; PFTAIRE protein kinase 1; pkinase;TM=M;SS=N; 3.87
406908; Z25437; ; gb:H.sapiens protein-tyrosine kinase gen; none,none; 3.87
425289; AW139342; Hs.155530; interferon, gamma-inducible protein 16; PAAD_DAPIN,HIN;; 3.87 
4 r W19-13S4; H i AiM inducer ne-1 pi n (I mino <id r AD t i rE TM=M SS=N 7 

439975; AW328081; Hs.6817; inosine triphosphatase (nucleoside triph; Ham1p_like;TM=M;SS=N; 3.87
415392; Z44067; Hs.10957; ESTs; PIP5K,none; 3.86 

i • C M4_012201; Hs.78979; Golgi apparatus protein 1; cysjich FGFR;TM-Y;S3-rV.; 3 86 
414649; AI672727; Hs.76753; endoglin (CD105 antigen) (ENG); none;TM=Y;SS=M; 3.85
425729; L22647; Hs.189360; prostaglandin E receptor 1 (subtype EP1); 7tm_1;TM=Y;SS=M; 3.85
414496; W73853; Hs.355424; ESTs; pkinase,F5_F8_type_C,adh_short,none; 3.84
412204 At 1 25507 H 19 ESI , m rcre, 3.84 

434375; BE277910; He 3 33; phosphoa lenosine 5'-phosphosulfate sy; APS_kinase,ATP-sulfijrylase,PRK,Thymidylate„kin;; 3.84 

444981; AW855398; Hs.12210; hypothetical protein FLJ13732 similar to; SH2;TM=M;SS=N; 3.84

412309; M23892; Hs.73809; arachidonate 15-lipoxygenase; lipoxygenase,PLAT; 3.84

405545; ; ; Target Exon; ABC_tran,SRP54,ABC_membrane;TM=Y;SS=M; 3.84

407143; C14076; Hs.332329; FST; none;TM=Y;SS=M; 3.84 

420593; AA280356; Hs.187634; ESTs; B56,none; 3.84

413420; AW410235; Hs.75348; proteasome (prosome, macropain) activate; PA28_alpha,PA28_beta,biopterin_H;; 3.83
448253; H25899; Hs.201591; ESTs; 7tm_2,HRM,none; 3.83

444042; NM_004915; Hs.10237; ATP-binding cassette, sub-family G (WHIT; ABC_tran,PRK,GBP;TM=Y;SS=N; 3.83

430397; AI924533; Hs.105607; bicarbonate transporter related protein ; HCO3_cotransp;TM=Y;SS=N; 3.83

123067; A/ I 401; colony stimulating facto; 2,eceplo., be; fn.3;Tt M; 3.8 

458188; AW297226; Hs.137840; ESTs, Moderately similar to SIX4_HUMAN H; pkinase,WD40; 3.82

426486; BE178285; Hs.170056; Homosapiens mRNA ,DT Dl r „ s 6B0 20 (f inase.non 3 82 

428791; AA435661; Hs.264750; ESTs; zf-C3HC4,none; 3.82

438068; AI927209; Hs.306210; Homo sapiens cDNA: FLJ23133 fis, clone L; NusG;; 3.82

453370; AI470523; Hs.139336; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y;SS=N; 3.82

419250; AW770185; Hs.356066; U5 snRNP-specific protein, 116 kD; 7tm_1,BAH,zf-CXXC,DNA_methylase; 3.82

410017; AW952426; Hs.109438; Homo sapiens clone 24775 mRNA sequence; none,none; 3.82

1 c I 1 fibrillin 1 larin.WD40;TM=r I 

4 t r 1 i 1 fan flF DUF6JM M 

425923; NM_005026; Hs.162808; phosphoinositide-3-kinase, catalytic, de; none,none; 3.81 

417365; D50683; Hs.82028; transforming growth factor, bet eceplc mi a-' A<D4u,Ty=Y,SS=N; 3.64 

414521;D28124;Hs.76307;neuroblasloma,suppiessnrHrb,mrT;. l| DAN;TM=M - M 3,52 

422398; AI476149; Hs.334489; hypothetical protein FLJ21992; SH2.SH3;; 3.51 

418432 M141 5 H 51 in qrc h natome lnulin.,350 

459735; BE032764; Hs, 270252; ESTs, Weakly similai to androgen receplo; nonB,C2,WW,HECT; 3.48 

425009;X58288;Hs.154151;proteintyrosineptc-pu e,rece| fn3,ig,Y_ph phata r M R< / SS"M, 3 38 

415817; U88967; Hs.78867; protein tyrosine phosphatase, receptor-t; fn3,Y j>hospha'.ase,carb_anhydrase;TM=Y;SS=M; 3 37 

433336; AF017986; Hs.31386; secreted frizzled-reiated protein 2 (stn Fz.NTR;; 3.24 

426125; X87241; Hs.166994; FAT tumor suppressor (Drosophila) homolo; EGF,c3dherin,laminin_G;TM=Y;SS=M; 3.1 1 

419721; NM.001650; I Is 315369; aqjaporin 4; MlP.ncne; 2.99 

433147; AF091434; Hs.43080; platelet derived growth factor C; PDGF,CUB;; 2.91 

417976; BE565892; Hs.83077; interleukin 13 (interferon gamma-inducin; no 1" I 1 SS N 2.89 

439180; AI393742; Hs.199067; v-erb-b2 avian erythroblastic leutei ji i t , r r c u- i 1 i Fu i In phnase Recep_L_domain,Peptidase_M24; 2.59 
426158; NM_001982; Hs.199067; v-erb-b2 avian erythroblastic leukemia v; Furin4ike,pkinase,Recep_L domain,Furin-like,pkinase,Recep_Ldomain,Peptidase M24; 2.23 
411089; AA456454; Hs.355702; cei! division cycle 2-like 1 (PITSLRE pn none.none; 2.07 
428800; M57627; Hs.193717; interleukin 10; IL10;; 1.10 
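
Rows in these tables pack several fields into one semicolon-delimited line. As a minimal sketch, assuming the single-ratio layout seen in the rows above (probeset key; accession; UniGene ID; title; predicted domains; optional TM= and SS= flags; one trailing ratio), a row could be split as shown below. The function name `parse_row` and the dictionary keys are hypothetical, and rows that carry two trailing ratios, or whose truncated title itself ends in a TM=/SS=-like token, would need different handling.

```python
def parse_row(line):
    """Split one ';'-delimited table row into named fields (assumed layout)."""
    fields = [field.strip() for field in line.split(";")]
    if len(fields) < 6:
        raise ValueError("expected at least six ';'-separated fields")

    pkey, accession, unigene_id = fields[:3]
    ratio = float(fields[-1])          # single trailing ratio (assumption)
    middle = fields[3:-1]              # title, domains, optional flags

    # Peel optional TM=/SS= flags (or empty placeholders) off the right end.
    flags = {}
    while middle and (middle[-1] == "" or middle[-1].startswith(("TM=", "SS="))):
        token = middle.pop()
        if token:
            name, value = token.split("=", 1)
            flags[name] = value
    if not middle:
        raise ValueError("row has no domain field")

    # The last remaining field is the comma-separated domain list;
    # the placeholder 'none' is dropped.
    domains = [d for d in middle.pop().split(",") if d and d != "none"]

    # Whatever is left is the (possibly ';'-containing) truncated title.
    title = "; ".join(middle)

    return {
        "pkey": pkey,
        "accession": accession,
        "unigene_id": unigene_id,
        "title": title,
        "domains": domains,
        "flags": flags,
        "ratio": ratio,
    }
```

Parsing from both ends keeps a semicolon inside a truncated gene title from shifting the identifier or ratio columns.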

TABLE 40B 




459702 539529_1 BG207209 BE166299 AI204995 BG199355 AW969908 AA528756 AW440776 BI044354
432222 539529_1 BG207209 BE166299 AI204995 BG199355 AW969908 AA528756 AW440776 BI044354
414991 1785136_1 D78831 C17898 D78863
409745 MH1944.5 BI030997 AA921S74 AW18E822 BI027862 AI347618 AI361453 AI08B754 AW207491 AA077391 BG012775 BG997382 AA286833 AA150722 BlOOi
BI027864 BI009100 BI006275 BI006270 BKB1000 BI029864 BI006277 BI007627 BI006266 BI006991 BI006990 BI007763 BI007762 BG997377
AA150780 BI033518 BI027818 BG015789 BI033807 AA341445
414936 1782849_1 C14774 C17911 D79033
451876 2328579_1 T63141 AI821021 BF370092 BF370127 BF370060 T62998
432639 1237887_1 AW973785 H60163 AA557608

TABLE 40C 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
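
The nucleotide-position column lists each predicted exon as a start-end pair, with exons separated by commas. As a minimal sketch, assuming the coordinates are inclusive and that each entry is a well-formed `start-end` pair, the column could be parsed and summarized as follows. The helper names `parse_exon_ranges` and `total_exon_length` are hypothetical, and OCR-damaged entries (missing hyphens, stray spaces inside numbers) would need to be repaired first.

```python
def parse_exon_ranges(nt_position):
    """Parse '116132-116407,116653-116922' into a list of (start, end) tuples."""
    exons = []
    for part in nt_position.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"exon end precedes start: {part}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_exon_ranges("116132-116407,116653-116922")
    print(exons, total_exon_length(exons))
```

The `end < start` check is a cheap way to flag entries in which a digit was misread during extraction.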
402474
404240 
405102 
405121 
401083 
406403 
402550 
405545 



9256305 Minus 



53526-53628,55755-55920,67530-57757 

116132-116407,116653-116922 

120922-121296 

35816-36004,36587-36684 

33192-33360 

151426-151680 

80413-80673 

118677-118807,119091-119296,121625-12182



TABLE 41A: UP-REGULATED IN PANCREATIC TUMORS OR PANCREATITIS RELATIVE TO NORMAL TISSUES

556 genes up-regulated in pancreatic tumors or pancreatitis relative to normal tissues. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.

ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: 90th percentile of pancreatic cai 

Pkey ExAccn UnigeneID Unigene Title






412228 AW503785 

431462 AW583672 

444995 AJ272265 

453863 X02544 

441031 AI110684 

421344 AW631030 

416018 AW138239 

438091 AW373062 

418888 AU076801 

418969 W33191 • 

443162 T49951 

423096 AA732684 

413719 BE439580 

448243 AW369771 

421044 AF061871 
BE514982 



Hs.73792 

Hs.256311 

Hs.12230 

Hs.572 

Hs.7645 

Hs.103665 

Hs.78977 

Hs.83623 

Hs.89436 

Hs.28907 



422857 
432467 
457059 
451945 
453354 



Hs.211420 
Hs.234853 
Hs.333893 



410541 
427722 
429612 
407604 
431193 
442080 
427670 
445921 
419551 
441633 
407792 
419216 
416913 



411498 
445517 
446619 



BE298215 

X02419 

BE269047 



AW444761 
BE612888 
AB012113 
AW582256 

AI077715 

AW934714 
AW149265 
N54926 
AI879195 
AA157857 
BE019020 
D31152 
NMJD14210 
AF208855 

AF1 12213 



complement component (3d/Epstein Barr vi
granin-like neuroendocrine peptide precu
secreted phosphoprotein 2, 24kD
orosomucoid 1

fibrinogen, B beta polypeptide
villin-like

proprotein convertase subtilisin/kexin t
nuclear receptor subfamily 1, group I, m
cadherin 17, LI cadherin (liver-intestin
hypothetical protein FLJ20258
DKFZP434G032 protein
progestin induced protein
small inducible cytokine subfamily A (Cy
integrin, beta 8

Human DNA sequence from clone RP1-236D15
S100 calcium-binding protein A2
cartilage oligomeric matrix protein (COM
Human DNA sequence from clone RP1-304B14
exosome component Rrp40 



ESTs 



Hs.249163 
Hs.288968 
Hs.77274 
Hs.65234 
Hs.64179 
Hs.180479 
Hs.252587 
Hs.249239 
Hs.296770 
Hs.44565 
Hs.180224 
Hs.16530 
Hs.91011 
Hs.112242 
Hs.39384 
Hs.164021 

Hs.25130 
Hs.29202 
Hs.90606 
Hs.1 82265 
Hs.85B38 
Hs.1 79729 



LJ 1 2082 tr I 



e HE 



Homo sapiens cDNA FL 
c-Myc target JP01 
Microfibril-associated glycoprotein-2
diaphorase (NADH/NADPH) (cytochrome b-5
fatty acid hydroxylase
RAB22A, member RAS oncogene family



hypothetical protein FLJ20596
hypothetical protein FLJ20116



myosin regulatory light chain
small inducible cytokine subfamily A (Cy
anterior gradient 2 (Xenopus laevis) hom
normal mucosa of esophagus specific 1
putative secreted ligand homologous to f
small inducible cytokine subfamily B (Cy
gb:RC1-DT0001-031299-011-a11 DT0001 Homo
Homo sapiens cDNA FLJ14923 fis, clone PL
G protein-coupled receptor 34
15kDa selenoprotein
keratin 19

solute carrier family 15 (monocarboxylic
collagen, type X, alpha 1 (Schmid metaph
ecotropic viral integration site 2A
hypothetical protein

secreted phosphoprotein 1 (osteopontin,



647.30 
738.90 
CG.43 



592.10 
120.40 
117.40 




Hs.1340 

AW971155 Hs.293902 

NMJ06849 Hs.66581 

AA774575 Hs.121776 

NMJ00928 Hs.992 

D00306 Hs.181289 
#(N0CAT] 

AA367019 Hs.241395 

NMJM0699 Hs.300280 

AW873803 Hs.1 02876 



435975 
431330 
414463 
415003 
422281 



419768 
413841 
400560 
419502 
425746 
426205 
414590 
443614 



L36196 
#(NOCAT) 
L32140 
T72104 



Hs.1504 

Hs.289057 

Hs.81884 

Hs.531 

Hs.93194 

Hs.75576 

Hs.90765 
Hs.1 59440 
Hs.167877 



NM_000312 Hs.2351 



413829 
421126 
407731 
413585 
452624 
416402 
425573 
421905 
406672 



NMJ)05651 
U7726 
AI478463 
AU076607 
NM_001872 
M74587 
NMJ00066 
AI133452 
AU076606 
NMJ00715 



.183671 
Hs.1 870 
Hs.1 8443 
Hs.75285 
Hs.75572 
Hs.1 021 22 
Hs.38069 
Hs.75431 
Hs.30054 



AB006423 Hs.158308 
AI560247 Hs.32699 



AK000140 
W25005 
R37725 



Hs.261108 
Hs.43047 
Hs.1 84604 
Hs.1 271 79 



secreted frizzled-related protein 4 
Target Exon 

carboxyl ester lipase (bile salt-stimula 
homeodomain-interacting protein kinase 3 
chymotrypsin C (caldecrin)
homeodomain-interacting protein kinase 3
C2000960:gi|131432|sp|P23132|UTH_BOVIN
elastase 1, pancreatic

ESTs, Moderately similar to SYNL RAT SYN
Target Exon

estrogen regulated gene 1



rat regenerating islet-derived-like, hum
ESTs, Weakly similar to [36022 hypothet;
regenerating islet-derived 1 alpha (panc
pancreatic lipase-related protein 1
carboxypeptidase A1 (pancreatic)
carboxypeptidase A2 (pancreatic)
chymotrypsinogen B1



ESTs, Weakly similar to ISHUSS protein d
protein disulfide isomerase
testis expressed sequence 11
phospholipase A2, group IB (pancreas)
elastase 3, pancreatic (protease E)
NM_003122*:Homo sapiens serine protease



amylase, alpha 2A; pancreatic



NM_030373-:Homo sapiens cytochrome P45
fibrinogen, A alpha polypeptide
bile acid Coenzyme A: amino acid N-acylt
leukocyte cell-derived chemotaxin 2
coagulation factor II (thrombin)
fibrinogen, B beta polypeptide



insulin-like growth factor binding prote
complement component 8, beta polypeptide
fibrinogen, gamma polypeptide



complement component 4-binding protein,
serine (or cysteine) proteinase inhibito
ESTs, Weakly similar to LIV-1 protein [H
major histocompatibility complex, class
secretory leukocyte protease inhibitor (
hypothetical protein

small inducible cytokine subfamily B (Cy
ESTs

Homo sapiens cDNA FLJ13585 fis, clone PL
pancreatic polypeptide 
cryptic gene 

ENSP00000251112*:Sodium/potassium-trai 



374.00 
144.50 
266.50 



429547 
441085 
422397 
446868 
404287 



Hs.99376 
Hs.181245 
Hs.116051 
Hs.135100 



C9001188':gi|12738842MNPJ)73725 ,1| p 
ESTs 

Homo sapiens cDNA FLJ12532 fis, clone NT
Homo sapiens cDNA: FLJ22495 fis, clone H
ESTs 

C6001909:gi|704441 |dbj|BAA18909.1| (D298 
ESTs 

cryptic gene 



417801; AA417383; Hs.82532; integrin, beta-like 1 (with EGF-like rep;
414142; AW368397; Hs.150042; Homo sapiens cDNA FLJ14438 fis, clone HE; 128.70
425921; NM_007231; Hs.162211; solute carrier family 6 (neurotransmitte;
410309; BE043077; Hs.278153; ESTs; 108.80
425842; AI587490; Hs.159623; NK-2 (Drosophila) homolog B; 170.10
431938; AA933471; Hs.54431; specific granule protein (28 kDa); cyste; 75.70
449592; AI655494; Hs.195718; ESTs; 4.58
414269; W44633; Hs.301296; Homo sapiens cDNA: FLJ23131 fis, clone L; 188.50
406685; M18728; ; gb:Human nonspecific crossreacting antig; 1123.60
411573; ; Hs.70823; KIAA1077 protein; 995.60
429201; X03173; Hs.198246; group-specific component (vitamin D bind; 11.32
418318; U47732; Hs.84072; transmembrane 4 superfamily member 3; 8.38
428698; AA852773; Hs.334838; KIAA1866 protein; 662.00
444754; T83911; Hs.11881; transmembrane 4 superfamily member 4; 4.00
; AJ224741; Hs.278461; matrilin 3; 283.50
428824; W23624; ; ESTs; 4.55
444006; BE395085; Hs.10086; type I transmembrane protein Fn14;
; AA479005; Hs.154036; tumor suppressing subtransferable candid; 4.21
418394; ; Hs.84728; Kruppel-like factor 5 (intestinal); 4.80
443344; AI581519; ; ESTs; 365.80
420908; ; Hs.100261; Homo sapiens mRNA; cDNA DKFZp564B222 (fr; 112.90
423635; BE350494; Hs.49753; uveal autoantigen with coiled coil domai; 128.20
428392; H10233; Hs.2265; secretory granule, neuroendocrine protei;
429597; NM_003316; Hs.2442; a disintegrin and metalloproteinase doma; 316.00
452571; W31518; Hs.34565; ESTs; 245.50
443646; AI085193; Hs.164226; ESTs;
436032; AA150797; Hs.109276; latexin protein;
448030; N30714; Hs.325960; membrane-spanning 4-domains, subfamily A; 252.20
422109; S73265; Hs.1473; ; 278.20
430407; H23551; Hs.30974; ; 6.20
419235; AW470411; Hs.288433; ; 423.50
449048; Z45051; Hs.22920; similar to S68401 (cattle) glucose induc; 4.01
444301; AK000136; Hs.10760; asporin (LRR class 1); 499.90
427333; AF067797; Hs.176658; ; 1.05
417931; W95642; Hs.82961; trefoil factor 3 (intestinal); 4.33
407777; AA161071; Hs.71465; squalene epoxidase;
435652; N32388; Hs.334370; uncharacterized hypothalamus protein HBE; 1.47
421341; AJ243212; Hs.279611; deleted in malignant brain tumors 1;
453935; AI633770; ; ; 208
431629; ; HsJ65827; Interferon, alpha-inducible protein (clo; 184
439737; AI751438; Hs.41271; Homo sapiens mRNA full length insert cDN; 14.21
426227; U67058; Hs.168102; Human proteinase activated receptor-2 mR; 315.70
413554; ; Hs.75426; secretogranin II (chromogranin C); ass
412104; AW205197; Hs.240951; Homo sapiens, Similar to RIKEN cDNA 2210; 3.13
410310; J02931; Hs.62192; coagulation factor III (thromboplastin,; 9.33
; BE328156; Hs.150356; ESTs; 1.03
447395; ; Hs.184793; Homo sapiens cDNA: FLJ21880 fis, clone H; 1.09
440099; AL080058; Hs.6909; DKFZP564G202 protein;
434665; AA642125; ; gb:nr60c01.s1 NCI_CGAP_fym3 Homo sapiens; 0.98
452194; AI694413; Hs.332549; olfactory receptor, family 2, subfamily; 2.23



426322 
429010 
414420 



M74826 

AK001058 

J05068 

Y18198 

AA043424 



414004 AA737033 



406173 
403776 
403574 
428832 
458449 
409958 



Hs.1 70808 
Hs.1 2680 
Hs.2012 
Hs.1 94725 
Hs.76095 
Hs.1 18400 
Hs.7155 
Hs.7782 



AA578229 Hs.324239 
H04482 Hs.29019 
NM.001523 Hs.57697 




glutamate decarboxylase 2 (pancreatic is 
Homo sapiens cDNA FLJ10196 fis, clone HE
' ' I (vitamin B12 binding pr



immediate early response 3

singed (Drosophila)-like (sea urchin fas

ESTs, Moderately similar to 2115357A TYK



ENSP00000226542*:Small inducible cytokin
Target Exon

ESTs, Moderately similar to ZN91_HUMAN Z
ESTs

hyaluronan synthase 1

Homo sapiens cDNA: FLJ22314 fis, clone H

ESTs

ESTs

ESTs

gb:zd37g06.s1 Soares_fetal_heart_NbHH19W
apolipoprotein C-I

Homo sapiens mRNA; cDNA DKFZp564B2062 (f
matrix metalloproteinase 7 (MMP7; uterin
osteoblast specific factor 2 (fasciclin
cadherin 11, type 2, OB-cadherin (osteob
ESTs

Homo sapiens mRNA; cDNA DKFZp564B1264 (f
coagulation factor II (thrombin) recepto
protein tyrosine phosphatase, receptor t
ESTs, Moderately similar to I54374 gene
glycoprotein 2 (zymogen granule membrane 
455630
428786 
420726 
451253 
420923 
413881 
431930 
421379 
419354 
422237 
437145 
414386 



BE256452 
AA359252 
AV655701 
Y16577 



Hs.952 
Hs.129778 
Hs.2257 
Hs.126485 
Hs.75183 
Hs.2314 
Hs.1330 



AF097021 

L00190 

AB035301 

Y15221 

M62839 

M13149 

AF007216 

X00442 



405849 #(N0CAT) 

405281 #(N0CAT) 

419078 M93119 

422095 AI868872 

425834 NMJ01639 

452304 AA025336 

407244 M10014 

450400 AI694722 



Hs.5241 
Hs.50966 
Hs.1 94366 



serine (or cysteine) proteinase inhibito 
NM_001176*:Homo sapiens Rho GDP dissocia
Target Exon
islet amyloid polypeptide
interleukin 22 receptor
gb:Human mRNA clone with similarity to L
delta-like homolog (Drosophila)
secreted frizzled-related protein 5
' ' alpha 5 (fibronectin receptor,



(parathyroid secretory p:

apolipoprotein C-III
MSTP043 protein

solute carrier family 10 (sodium/bile ac
vasoactive intestinal peptide
vitronectin (serum spreading factor, som
hypothetical protein FLJ12604; KIAA16S2
cytochrome P450, subfamily IIE (ethanol-
mannose-binding lectin (protein C) 2, so
coagulation factor IX (plasma thrombopla



small inducible cytokine subfamily B (Cy
apolipoprotein H (beta-2-glycoprotein I)
histidine-rich glycoprotein



11 (MMP11; stro

fatty acid binding protein 1, liver
carbamoyl-phosphate synthetase 1, mitoch
transthyretin (prealbumin, amyloidosis t
Target Exon

NM_002864:Homo sapiens pregnancy-zone pr
insulinoma-associated 1
hypothetical protein FLJ22704



Hs.61311 
Hs.75431 
Hs.279744 
Hs.75615 



417366 
436961 
446319 
427899 
419092 

452340 
406319 
404235 
419183 
406293 
431912 
409327 
425200 
418738 



NMJ05518 Hs.59889 

BE185289 Hs.1076 

AW375974 Hs.156704 

AW207590 Hs.1 60711 

AA829286 Hs.332053 



AI660552 Hs.1 54903 
L41162 Hs.53563 
BE255203 Hs.1 551 01 



AA634543 
M643687 
AA526911 



Hs.79440 
Hs.149425 
Hs,82772 



428758 AA433938 Hs.98 



418478 
420001 
449038 
423184 
428505 
444783 



J05064 
AL1 33034 
NM_004428 
AL035461 
AK001468 



Hs.1 624 
Hs.2281 
Hs.62180 
Hs.1 50896 
Hs.288467 



450701 H39960 

424420 BE614743 

408660 AA525775 Hs.292523 

417940 R28205 Hs.24230 

434206 AW136973 Hs.288516 

439920 H05430 Hs.288433 
432542 



\ I nilartoF TH H IMAN - Dill 
Homo sapiens cDNA: FLJ22256 fis, clone H
3-hydroxy-3-methylglutaryl-Coenzyme A sy
small proline-rich protein 1B (cornifin)
ESTs
ESTs

serum amyloid A1



GalNAc alpha-2, 6-sialyltransferase I, I
ISL1 transcription factor, LIM/homeodoma
CX000780:gi|66791 97|ref|N P_032800.1 1 po!
C6001909:gi|704441|dbj|BAA18909.1| (D298
cytochrome P450, subfamily XXIV (vitamin
Target Exon

ESTs, Weakly similar to A56154 Abl subst
collagen, type IX, alpha 3
ATP synthase, H transporting, mitochondr
solute carrier family 7, (cationic amino
IGF-II mRNA-binding protein 3
Homo sapiens cDNA FLJ11980 fis, clone HE
collagen, type XI, alpha 1
hypothetical protein Ft



cyclin-dependent kinase inhibitor 2A (me
complement component 6
Homo sapiens mRNA; cDNA DKFZp434J1027 (f



Homo sapiens cDNA FLJ12280 fis, clone Ml

prostaglandin E synthase 

ESTs, Moderately similar to PC4259 ferri 

ESTs 

ESTs, Weakly similar to S69890 mitogen i 



68.30 
102.43 
206.30 



WO 03/042661 



PCT/US02/36810 



410418 
415989 
414987 
400024 
418067 



Hs.63325 
Hs.317584 
Hs.294022 



transmembrane protease, serine 4 



418245 
423733 
450154 
422487 
424902 
422330 
436749 
423634 
430691 
401682 
422440 



AI590371 

M464510 

M29540 

M088767 

M330281 

R15891 

AJ010901 



Hs.281587 
Hs.1 98267 
Hs.1 53637 
Hs.1 15263 
Hs.5302 
Hs.1690 
Hs.103538 



AA365042 
X72755 
M18667 
AW207555 
AA1 33749 
Z15008 
AW470125 
AA1 29547 



411558 AA102670 



Hs.285754 
Hs.70725 
NMJ03401 Hs.1 50930 
M321649 Hs.2248 



456032 AW957446 

431808 M30703 

423472 AF041260 

439759 AL359055 

442295 AI827248 



Hs.1 84727 
Hs.301711 
Hs.270833 
Hs.1 29057 
Hs.57709 
Hs.224398 
Hs.1 94657 



422026 
428664 
437935 
423575 
422956 
406722 
413278 
439750 
434377 
425428 
421298 
422424 
421582 
401430 



428450 
409041 
453331 
400288 
453160 
444015 
421308 

422426 
450737 
429504 
456553 



AA770688 

U80736 

AK001666 



H27498 

BE563085 

AL359053 

AW137148 

AL1 10261 

AW172431 

AI186431 

AI910275 



M576953 Hs.22972 
NM_001898 Hs.123114 
BE280074 Hs.23960 
AI591147 Hs.61232 
AW887604 Hs.78065 
BE003054 Hs.1 695 
NM_014791 Hs.184339 
"~ Hs.50081 



Hs.594C 

Hs.163443 

Hs.122579 

Hs.293441 

Hs.833 

Hs.57664 

Hs.306593 

Hs.157211 

Hs.1 301 2 

Hs.296638 



Hs.222024 
Hs.82772 
Hs.265829 
Hs.61635 



ESTs 
ESTs 

nic antigen-related cell ad 
le, prostate androgen induced 
gb:EST33S85 Embryo, 12 week II Homosapi 
Human (clone CTG-A4) mRNA sequence 
mucin 4, tracheobronchial 



M721325 
M861271 
AW067903 
NMJW2204 
M032279 
X76029 

AF196478 Hs.1 88401 

NM_002497 Hs.153704 

AB026264 Hs.234245 




ESTs, Weakly sin 
monokine induced by gai 
progastricsin (pepsinogen C) 
Homo sapiens cDNA: FLJ23004 fis, clone L 
FXYD domain-containing ion transport reg 

mini Jim 2 (i (100kD) ' I m 
gb:xw60c04.x1 NCI_CGAP_Pan1 Homo sapien
met proto-oncogene (hepatocyte growth fa
gamma-aminobutyric acid (GABA) A recepto
X-ray repair complementing defective rep
small inducible cytokine subfamily B (CX



ESTs

C6001909:gi|704441|dbj|BAA18909.1| (D298
H2A histone family, member L
trinucleotide rep
similar to SALL1 (sal (Di .
mucin 13, epithelial transmembrane
Homo sapiens cDNA FLJ11575 fis, clone HE
hypothetical protein FLJ10461
Homo sapiens SNC73 protein (SNC73) mRNA,
interferon-stimulated protein, 15 kDa
Homo sapiens mRNA full length insert cDN
Homo sapiens cDNA FLJ11382 fis, clone HE
DKFZP586B0621 protein
ESTs

prostate differentiation factor
trefoil factor 1(pS2)
Target Exon

hypothetical protein FLJ13352



AI240665 

X06256 Hs.149609 

AI263307 Hs.239884 

A1472865 Hs.135534 

AA687322 Hs.192843 

AJ297436 Hs.20166 




ESTs 
ESTs 

lipocalin 2 (oncogene 24p3) 

ESTs, Highly similar to Similar to a C.e 

transcription factor BMAL2

collagen, type XI, alpha 1 

integrin, alpha 3 (antigen CD49C, alpha 

six transmembrane epithelial antigen of 



104.70 
143.00 
311.80 
63.00 
141.70 
AW043921

AI35741 2 Hs.157601 

Hs.3416 
AU076442 Hs.1 17938 
NMJ02776 Hs.69423 
Hs.246005 



10(KLK10)(PRSSL1)(nes1) 

cell division cycle 2, G1 to S and G2 to
topoisomerase (DNA) II alpha (170kD)
AE-binding protein 1
G protein-coupled receptor 64
leukemia inhibitory factor (cholinergic
ESTs, Moderately sin

Homo sapiens cDNA: FLJ22667 fis, clone H
transforming growth factor, beta 2
spinal cord-derived growth factor-B
hypothetical protein AF140225
tissue factor pathway inhibitor 2
ESTs, Weakly similar to T31613 hypothet
S100 calcium-binding protein P
C22000015:gi|12741327|ref|XP_008833.2| 2

a disintegrin and metalloproteinase doma
glutamine-fructose-6-phosphate transamin
Human thrombospondin-1 gene, partial cds
wingless-type MMTV integration site fami
NM_016931:Homo sapiens NADPH oxidase 4




matrix metalloproteinase 1 (MMP1; inters
cathepsin D (lysosoma' 1
hypothetical protein FL

minin, beta 3 (i -II H. Imn
Homo sapiens cDNA: FLJ22182 fis, clone H
paired-like homeodomain transcription fa
' " alpha-induced pro

tumor necrosis factor (ligand) superfami
KDEL (Lys-Asp-Glu-Leu) endoplasmic retic
midkine (neurite growth-promoting factor
hematological and neurological expressed



interacting, kinesin-like (rabkines



DKFZP564I1922 pr(
hypothetical protein FLJ10134

' (or cysteine) proteinase inhibito
immunoglobulin kappa constant
solute carrier family 2 (facilitated glu

aficcrossreacting antig

EphA3

matrix metalloproteinase 11 (MMP11; stro
dual specificity phosphatase 4
hypothetical protein DKFZp564D0462
KIAA0551 protein
lysyl oxidase

GA733-2 antigen; epithelial glycoprotein
heparan sulfate (glucosamine) 3-O-sulfot
cysteine knot superfamily 1, BMP antagon 



342.30 
213.50 
185.10 
106.10 



288.70 
502.60 
6.13 
178.90 






453857; AL080235; Hs.35861; DKFZP586E1621 protein; 504.30
428966; AF059214; Hs.194687; cholesterol 25-hydroxylase; 242.10
413435; X51405; ; carboxypeptidase E; 7.30
436476; ; Hs.33829; bHLH protein DEC2; 247.20
406747; AI925153; Hs.217493; annexin A2; 110.00
455800; R22479; Hs.167073; Homo sapiens cDNA FLJ13047 fis, clone NT; 112.13
431211; M86849; Hs.323733; gap junction protein, beta 2, 26kD (conn; 583.90
431890; ; Hs.271986; integrin, alpha 2 (CD49B, alpha 2 subuni; 6.56
431958; X63629; Hs.2877; cadherin 3, type 1, P-cadherin (placenta; 460.90
; BE613126; Hs.47733; B aggressive lymphoma gene; 204.40
437763; AA469369; Hs.5831; tissue inhibitor of metalloproteinase 1; 7.75
418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fus; 14.61
424560; ; Hs.150555; protein predicted by clone 23733; 99.80
436396; AI683487; Hs.152213; wingless-type MMTV integration site fami; 242.20
407137; T97307; ; gb:ye53h05.s1 Soares fetal liver spleen; 17.88
410668; BE379794; Hs.65403; hypothetical protein; 4.18
427660; ; Hs.114121; Homo sapiens cDNA: FLJ23228 fis, clone C; 116.40
408826; AF216077; Hs.48376; Homo sapiens clone HB-2 mRNA sequence; 63.30
442577; AA292998; Hs.163900; ESTs; 4.18
416498; U33632; Hs.79351; potassium channel, subfamily K, member 1; 334.20
447343; AA256641; Hs.236894; ESTs, Highly similar to S02392 alpha-2-m;
451277; AK001123; Hs.26176; hypothetical protein FLJ10261; 375.30
445133; AW157646; Hs.153506; ESTs; 292.40
414799; AI752416; Hs.77326; insulin-like growth factor binding prote; 4.38
431806; AF186114; Hs.270737; tumor necrosis factor (ligand) superfami;
; AL353944; Hs.50115; Homo sapiens mRNA; cDNA DKFZp761J1112 (f; 322.10
410687; U24389; Hs.65436; lysyl oxidase-like 1; 9.10
417409; BE272506; Hs.82109; ;
426471; M22440; Hs.170009; transforming growth factor, alpha; 138.60
; AW972512; Hs.20985; sin3-associated polypeptide, 30kD; 250.50
448625; AW970786; Hs.178470; hypothetical protein FLJ22662; 4.89
450506; NM_004460; Hs.418; fibroblast activation protein, alpha; 11.76
433336; AF017986; Hs.31386; secreted frizzled-related protein 2; 4.79
408491; AI088063; Hs.7882; ESTs; 8.25
437802; AI475995; Hs.122910; ESTs; 4.54
408296; ; Hs.44155; DKFZP586G1517 protein; 175.10
421155; H87879; Hs.102267; lysyl oxidase;
451310; AW250651; Hs.26213; Human DNA sequence from clone RP3-447F3;
439867; AA847510; Hs.161292; ESTs; 261.60
417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; 723.00
410763; AF279145; Hs.8966; hypothetical protein FLJ21776; 251.70
431385; BE178536; Hs.11090; membrane-spanning 4-domains, subfamily A; 155.50
457180; R26022; Hs.194662; ; 58.00
424408; AI754813; Hs.146428; collagen, type V, alpha 1; 17.19
452679; Z42387; Hs.83883; transmembrane, prostate androgen induced; 5.32
425139; AW630488; Hs.325820; protease, serine, 23; 371 SG
432978; AF126743; Hs.279884; DNAJ domain-containing; 7.27
406850; ; Hs.172928; collagen, type I, alpha 1; 19.30
421991; NM_014918; Hs.110488; KIAA0990 protein; 190.50
421814; L12350; Hs.108623; thrombospondin 2; 15.02
409703; NM_006187; Hs.56009; 2'-5'-oligoadenylate synthetase 3 (100 k; 28.57
; T93500; Hs.28792; Homo sapiens cDNA FLJ11041 fis, clone PL; 519.20
413048; M93221; Hs.75182; mannose receptor, C type 1; 240.60
404210; #(NOCAT); ; NM_005936:Homo sapiens myeloid/lymphoid; 404.60
452862; AW378065; Hs.8687; ESTs; 364.20
447072; D61594; Hs.17279; tyrosylprotein sulfotransferase 1; 226.20
426935; NM_000088; Hs.172928; collagen, type I, alpha 1; 4.31
427390; AI432163; Hs.268231; Homo sapiens cDNA: FLJ23111 fis, clone L;
; AW903838; Hs.81800; chondroitin sulfate proteoglycan 2 (vers; 22.46
451295; ; Hs.17132; ESTs, Moderately similar to I54374 gene; 23.74
448569; BE382657; Hs.21486; signal transducer and activator of trans; 5.68
417148; AA359896; Hs.293885; hypothetical protein FLJ14S02; 190.30
432359; AA076049; ; Homo sapiens cDNA FLJ10229 fis, clone HE; 230.50
422278; AF072873; Hs.114218; frizzled (Drosophila) homolog 6; 361.90
422545; X02761; Hs.287820; fibronectin 1;
442379; NM_004613; Hs.8265; transglutaminase 2 (C polypeptide, prote; 7.30
417412; X16896; Hs.82112; interleukin 1 receptor, type I; 267.20
422110; AI376736; Hs.111779; secreted protein, acidic, cysteine-rich; 5.07
431512; ; Hs.2795; lactate dehydrogenase A; 270.10
417433; BE270266; Hs.82128; 5T4 oncofetal trophoblast glycoprotein; 504.60
426369; AF134157; Hs.169487; Kreisler (mouse) maf-related leucine zip; 10.62
; AL390147; Hs.134742; hypothetical protein DKFZp547D065;
; AU077196; Hs.82985; collagen, type V, alpha 2; 14.01
428797; AA496205; Hs.193700; Homo sapiens mRNA; cDNA DKFZp586I0324 (f; 9.15
434423; NM_00676?; Hs.3844; LIM domain only 4; 297.30
426125; X87241; Hs.166994; FAT tumor suppressor (Drosophila) homolo; 486.20
422573; AW297985; Hs.295726; integrin, alpha V (vitronectin receptor; 9.73
421552; ; Hs.105700; secreted frizzled-related protein 4; 762.90
424730; NM_003358; Hs.23703; ESTs, Moderately similar to CEGT_HUMAN C; 7.81
400133; NA; ; Eos Control; 357.00
444381; BE387335; Hs.283713; ESTs, Weakly similar to S64054 hypotheti; 1150.30
422048; NM_012445; Hs.288126; spondin 2, extracellular matrix protein; 4.50
446019; AI362520; Hs.279789; histone deacetylase 3; 11.26
aryl hydrocarbon receptor
insulin-like growth factor binding prote 
NME7 

Homo sapiens cDNA FLJ14368 fis, clone 



Pkey: Unique Eos probeset identifier number

Pkey CAT Number Accessions
415913 163001.1 AW934714 BE161007 BE162500 AW749902 AW749864 BE162498 BE161005 AA190449 AW513465 BE161006 BE162499
419570 1860604_1 W68738 W68831
423733 231476.1 AA330281 AA330232 AW962521
434565 390530.1 AA642125 AA654516
446437 763310.1 AW470125 AI734872 AI749559 AW8565C4 AI583942 AW779036 AW843429 AW844876 AI520713 AW847236

TABLE 41C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



401201 
401480 
401541 



403776 
404142 
404210 
404285 
404286 
404287 
404682 
404866 



406173 
406293 
406319 



8954179 
9743387 
7321503 

4755167 
1200312 
7630829 
8101156 



6139075 
7651817 
7705180 
7230224 
5686274 
9211730 



138534-138629,139234-139294,140121-140335,142033-142479 

166120-166347,166451-166557,169651-169832 

50018-50158 



19346-19525,19625-19708,19897-1997320357-20130,20215-20414 

89914-90033,90729-90855,91131-91198

5542-6176 

1414-1513,1624-1756 



169926-170121 
32282-32416 
51086-51301 
53134-53281 
40977-41150 
11743-11929 

34202-34351,35194-35336,45412-45475,45731-45958,47296-47457,49549-49658,49790-49904,50231-50342,53583-53667,54111-54279

17705-18287 

86985-87233 

12925-13213 

17646-17953 



TABLE 42A: 574 genes upregulated in pancreatic cancer relative to normal body tissues

Table 42A lists about 574 genes upregulated in pancreatic cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules, peptides, or antibodies. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains capable of being modulated by small molecules, peptides, or antibodies (e.g. pkinase, death-domain, 7tm, phosphatase, or ion_transporter). Certain predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar accession number, GenBank accession number
Pred.ProtDomains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280).
UniGene Title: UniGene gene title
R1: 90th percentile of pancreatic cancer AIs divided by the 50th percentile of normal tissue AIs
R2: 90th percentile of pancreatic cancer AIs divided by the 90th percentile of normal pancreas AIs, where the 15th percentile of all normal tissue AIs was subtracted from both the numerator and denominator
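
The two ratios defined in this legend can be illustrated with a short calculation. The following is a minimal sketch under stated assumptions: the percentile is computed by linear interpolation between sorted values (the text does not say which percentile method was used), and the function names `percentile`, `r1`, and `r2` as well as the input lists are hypothetical.

```python
def percentile(values, pct):
    """Return the pct-th percentile (0-100) using linear interpolation.

    The interpolation rule is an assumption; the source does not specify one.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no values supplied")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def r1(cancer_ai, normal_tissue_ai):
    """90th percentile of cancer AIs over the 50th percentile of normal tissue AIs."""
    return percentile(cancer_ai, 90) / percentile(normal_tissue_ai, 50)


def r2(cancer_ai, normal_pancreas_ai, normal_tissue_ai):
    """Background-subtracted ratio of 90th percentiles (cancer vs. normal pancreas).

    The 15th percentile of all normal tissue AIs is subtracted from both the
    numerator and the denominator, as the legend describes. A denominator of
    zero (or a negative one) is not handled here.
    """
    background = percentile(normal_tissue_ai, 15)
    numerator = percentile(cancer_ai, 90) - background
    denominator = percentile(normal_pancreas_ai, 90) - background
    return numerator / denominator
```

Because R2 compares against normal pancreas rather than against all normal tissues, a gene that is highly expressed in healthy pancreas can show a large R1 and still have an R2 near 1.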

Pkey; ExAccn; UnigeneID; Unigene Title; Pred.ProtDomains; R1; R2

426230; M36701 9; Hs.241395; protease, serine, 1 (trypsr i i; lry| n,k» n_4;SS=M; 107 2S; 1.07 

415934; NM.000928; Hs.992; phospholipase A2, group IB (pancreas); phoslip;SS=M; 83.67; 1 .05 

421996; AW533807; Hs.1460; glucagon; hormone2;SS=M; 59.35; 1.61 

406399; ; ; NM_003122*:Homo sapiens serine protease ; kazal;SS=M; 55.49; 1.08 

444381; BE387335; Hs.283713; hypothetical protein BC014245; Collagen;TM=M;SS=M; 53.65; 43.51 
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406685; M18728; ; gb:Human nonspecific crossreactmg anlig ig Tf • 1;SS=f 73; 22 
428698; AA852773; Hs.334838; KIAA1866 protein; none;NA;NA; 32.44; 13.11 
437145; AF007215; Hs.5462; solute carrier family 4, sodium bicarbon; H X 3_coti ansp;TM=Y;; 29.80; 1.46 
428874; W32133;Hs.194366; transthyretin (prealbumin, amyloida Tran thyretin;SS=M 29.42; 1.94 
444754; T83911; Hs.374341; transmembrane 4 superfamily member 4; nor,e;TM=Y;SS=M; 28.78; 3.13 
418068; AW9711 rr H Ts, Weakly sin HUE leir nc r TM ' 3S=M 28.61; 0.98 

438091; AW373062;Hs.351 546; nuclear receptor subfamily 1 guupi i horn ejw tx.one; 2! i- 
413719, BE439580, Hs 75498; small ndi c I ,^-n\i> i bS-M 24.64; 7.21 
417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 23.77; 6.74 
414998, NMJ02543, Hs 77729 oxidised low density I il id ! i e r TM j-M 22 96; 4.57 
418318; U47732; Hs.84072; transmembrane 4 superfamily member 3; transmembrane4;TM=Y;SS=M; 22.31; 5.42 
425573; AB006423, H 15830 efne(o steine) prof nase inh bilo serpin.GCV H TM _ M,SS=M; 21.91; 1.03 
433110; D56494; Hs.3191; rat regenerating Islet-derived-like hum; ledin_c;TM=M "1 21 90, 0.60 
426490; NM_001621; Hs.170087; aryl hydrocarbon receptor; PAC,PAS;TM=M;; 21.41; 19.89
453863; X02544; Hs.572; orosomucoid 1; lipocalin,aldedh,ubiquitn,IRK;SS=M; 20.80; 8.12 
121126; M74587; Hs.102122; ir i I linJ,IGFBP;S 0.6C 

451035; AU076785; Hs.430; plastin 1 (I isoform); efhand,CH,Adaptin_N;SS=M; 19.25; 3.53
413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 18.38; 2.53 
420332; NM_001756; Hs.1305; serine (or cysteine) proteinase inhibito; serpin;TM=M;SS=M; 18.19; 2.29
438089; W05391; Hs.351546; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 17.57; 4.80
417426; NM_002291; Hs.82124; laminin, beta 1; laminin_EGF,laminin_Nterm,integrin_B;SS=M; 17.08; 6.37
427509; M62505; Hs.2161; complement component 5 receptor 1 (C5a l; 7tm_1;TM=Y;SS=M; 16.89; 7.15
441031; AI110684; Hs.7645; fibrinogen, B beta polypeptide; fibrinogen_C,G-alpha,arf;TM=M;SS=M; 16.59; 7.74 
445033;AV652402;Hs.72901;cyci:n-dependent kinase in bito {p 1 an! ., 13 28,9 22 

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endoplasmic retic; ER_lumen_recept;TM=M;SS=M; 15.96; 2.38

444784; D12435; Hs.11951; ectonucleotide pyrophosphatase/phosphodi; Somatomedin_B,Endonuclease,Phosphodiest;TM=Y;SS=M; 15.65; 1.33 

408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;; 15.53; 4.34 

419355; AA428520; Hs.90061; progesterone binding protein; heme_1;TM=Y;SS=M; 15.45; 10.50 

426006; R49031; Hs.22627; ESTs; pkinase,TBC; 15.17; 0.58

433376; AI249361; Hs.74122; caspase 4, apoptosis-related cysteine pr; CARD,ICE_p10,ICE_p20;SS=M; 14.84; 3.04 
422260; AA315993; Hs.105484; regenerating gene type IV; lectin_c;SS=M; 14.71; 2.89
430280; AA361258; Hs.237868; interleukin 7 receptor; fn3,none; 14.28; 11.47

408983; NM_000492; Hs.663; cystic fibrosis transmembrane conductanc; ABC_tran,ABC_membrane,PRK,Bac_export_3;TM=Y;; 13.93; 1.18
Ml K72755; I 1 r onof rp induced by gamma interferon; IL8;TW,=M;SS=Y; 13.81; 7.69 

129170; NM 001394; I r \f hi r L P V ho phatase Rib oma!_S3_N;TM=M;; 13.59; 2.24 

425988 BE0458O7 W 5 i I v 1 e I 8022 hypolheh; r ercne 13 54 0.95 

409512; AW979187; Hs.293591; melanoma differentiation associated prot; DEAD,helicase_C,CARD;TM=M;; 13.48; 3.59

429556; AW139399; Hs.98988; ESTs; none;TM=M;; 13.20; 1.16 

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; IL1;SS=M; 12.83; 7.72

427286; AW732802; Hs.2132; epidermal growth factor receptor pathway; SH3,TonB_boxC;TM=M;; 12.72; 9.01 

431912; AI660552; Hs.356183; ESTs, We Hy nl rt I54 Abl subst; r J A i\ W Acyl-CoA_dh_N; 12.72; 6.72 

412116; AW402166; Hs.784; Epstein-Barr virus induced gene 2 (lymph; 7tm_1;TM=Y;SS=M; 12.71; 12.56

425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c;SS=M; 12.52; 4.92

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromoso; ABC_tran,M,SMC_N,SMC_C,DUF154,none; 12.38; 7.59

428157; AI738719; Hs.198427; hexokinase 2; hexokinase,hexokinase2,none; 12.34; 11.53

424273; W40460; Hs.144442; phospholipase A2, group X; phoslip;TM=M;SS=Y; 12.32; 3.02

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity Ia, re; ig;TM=Y;SS=M; 12.29; 2.21

415323; BE259352; Hs.949; neutrophil cytosolic factor 2 (65kD, chr; SH3,TPR;TM=M;; 12.16; 4.14

427283; AL119796; Hs.174185; ectonucleotide pyrophosphatase/phosphodi; Somatomedin_B,Phosphodiest,Endonuclease;TM=M;SS=Y; 11.97; 5.93

i 1 15; Hs.50151; » r r a li Hying charm l r Tf 11 t 
426227; U67058; Hs.154299; Human proteinase activated receptor-2 mR; 7tm_1;TM=Y;SS=M; 11.75; 3.56
412228; AW503785; Hs.73792; complement component (3d/Epstein Barr vi; sushi;TM=Y;SS=M; 11.67; 3.77
450737; AW007152; Hs.53325; transmembrane protease, serine 4; trypsin,ldl_recept_a,none; 11.65; 4.52

439750; AL359053; Hs.57664; Homo sapiens mRNA full length in l< DN IMPDII ( 'MPEII N.CBS,integrin_B,FUcin_BJectin; 11.56; 546 
431512; BE270734; Hs.2795; lactate dehydrogenase A; ldh,ldh_C,SH3,pkinase,llBA;TM=M;; 11.55; 5.11

429638; AI916662; Hs.211577; kinectin 1 (kinesin receptor); bZIP,Tropomyosin,spectrin,LBP_BPI_CETP,B56,M;TM=Y;SS=M; 11.47; 4.65

445133; AW157646 Hs 198689 E"T land.spedrii ,G -. I r r , r I PTr yc i My l" M ldh_C,CH,AIP3;TM=M;; 11.41; 12.62 

411352; NM_002890; Hs.758; RAS p21 protein activator (GTPase activa; SH2,SH3,C2,FH,RasGAP;TM=M;SS=M; 11.24; 9.95 

447343; AA256641; Hs.236894; ESTs, Highly similar to S02392 alpha-2-m; none,none; 11.19; 4.16

422616; BE300330; Hs.118725; selenophosphate synthetase 2; AIRS,AIRS_C;TM=M;; 11.17; 5.98

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; kinesin,fn3,Y_phosphatase;TM=M;; 11.14; 9.09

440594; AW445167; Hs.126036; ESTs; none,none; 11.05; 16.45

425289; AW139342; Hs.155530; interferon, gamma-inducible protein 15; PAAD_DAPIN,HIN;SS=M; 11.05; 10.38
427700; AA262294; Hs.180383; dual specificity phosphatase 6; Rhodanese,DSPc;TM=M;; 11.05; 4.70
448811; AI590371; Hs.199460; ESTs; none;TM=Y;; 10.85; 9.69 

424321 ;W74048; Hs.1765; lymphocyte-spec fic p ol i H I I r e TM M 10.72; 8.55 

429083; Y09397; Hs.227817; BCL2-related protein A1 b M=M;; 10.51; 1 

424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lys,ig,FAD_Synth,ldh,ldh_C,pkinase;SS=M; 10.37; 6.35

436356; AI469355; Hs.127310; ESTs; pkinase,inn;TM=M;; 10.35; 2.74

427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 10.34; 3.14

417018; M16038; Hs.80887; v-yes-1 Yamaguchi sarcoma viral related , "H. SH3 1 kina e I M-M , 10 34, 4 V 

418299; AA279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), ly; integrin_B,EGF,PSI;TM=Y;SS=M; 10.21; 4.58

451820; AW058357; Hs.199248; ESTs; 7tm_1;TM=Y;SS=M; 10.18; 2.67 

413043; M93221; Hs.75182; mannose receptor, C type 1; fn2,l n.c Ricin BJc fin li I rM=Y;SS=M; 10.17; 8.35 
429752; H52343; Hs.36636; ESTs; pkinase,pkinase; 10.13; 12.35 

452721; AJ269529; Hs.301371; solute carrier family ^ erol-3-pl M > n M ti i_ TM=Y;SS=M; 10.08; 8.74 
421462; AF016495; Hs.104624; aquaporin 9; MIP;TM=Y;SS=M; 10.05; 6.06
452698; NM_001295; Hs.301921; chemokine (C-C motif) receptor 1; 7tm_1;TM=Y;SS=M; 9.98; 3.16
416389; AA180072; Hs.149846; integrin, beta 5; integrin_B,none; 9.85; 9.59 

421044;AF061871;Hs.101302;HumanDNAsequencefromcloneRP1-238[1 fn illagei SPN;1 M;SS=M; 9.78; 5.96 

446620; AA128808 H 17MT2 tr p f ler e;TM r S-M 975, 2.64 

405102; ; ; C15001220*:gi|4469558|gb|AAD21311.1| (AF; DAG_PE-bind,PH,RhoGEF,DC1;SS=M; 9.74; 1.88

418693; AI750878; Hs.87409; thrombospondin 1; EGF,tsp_1,vwc,TSPN,tsp_3;SS=M; 9.72; 6.94
426535; AU077012; Hs.288582; ESTs, Weakly similar to ubiquitous TPR m; Kunitz_BPTI,Kunitz_BPTI,7tm_2,HRM; 9.68; 10.58

448105 AW591433,Hs 298241, Transmembrane protease serin dl e| w n;TI c H9d7 4.06 

456266; L29073; Hs.193726; cold shock domain protein A; 7tm_2,HRM,CSD;TM=Y;SS=M; 9.62; 2.36

410240; AL157424; Hs.61239; synaptojanin 2; Exo_endo_phos,Syja_N,rrm Gram-vej>omrc;TU=M;; 9.62; 3.77 

457001; J03258; Hs.2062; vitamin D (1,25- dihydroxyvitamin D3) re; hormone je zf-C4,Mel allot i o i TM=M;; 9.60; 8.05 

456373; BE247706; Hs.89751; membrane-spanning 4-domare, hjdi mily/ none;Tk> Y; 957;3.77 

416847; L43821; Hs.80261; enhance, i fh lenlation 1 s-Mke do; SH TM=M;; 9.56; 10.50 

437158; AW090198; Hs.348709; KIAA1150 protein; none;NA;NA; 9.55; 8.87 

426108; AA622037; Hs. 166468; programmed cell death 5; DUF122,TM-> ).47;5.67 

403344; ; ; NM_000341:Homo sapiens solute carrier fa; alpha-amylase;TM=Y;; 9.47; 1.42

449543; AF070632; Hs.23729; Homo sapiens clone 24405 mRNA r i ir Je )n (ran- none; 9.46; 3.1 2 

433233; AB040927; Hs.301804; KIAA1494 protein; SH3,zf-C3HC4;TM=M;; 9.42; 4.01 

444838; AV651680; Hs.208558; ESTs; integrin_A,FG-GAP,none; 9.42; 1.87

439803; AA001021; Hs.6685; thyroid hormone receptor interactor 8; none,none; 9.41; 5.55

428505; AL035461; Hs.2281; chromogranin B (secretogranin 1); Granin;SS=M; 9.40; 3.46 

411213; AA676939; Hs.69285; neuropilin 1; MAM,F5_F8_type_C,CUB,CUB,MAM,F5_F8_type_C; 9.38; 6.32

432810; AA863400; Hs.374489; ESTs; none.Skpl ,AM; 9.38; 4.36 

427581; NM_014788; Hs.179703; KIAA0129 gene product; SPRY,zf-B_box;TM=M;; 9.34; 8.26

413109; AW389345; Hs.110855; ESTs, similar to leukemia virus receptor; PHO4,none; 9.34; 4.67

428450; NM_014791; Hs.184339; KIAA0175 gene product; KA1,pkinase;TM=M;; 9.31; 4.24

408113; T82427; Hs.194101; Homo sapiens cDNA: FLJ20869 fis, clone A; 7tm_3,none; 9.24; 7.12

448030; N30714; Hs.325960; membrane-spanning 4-domains, subfamily A; none;TM=Y;SS=M; 9.23; 6.03

437672; AW748265; Hs.5741; flavohemoprotein b5?; heme_1,NAD_binding,lipoxygenase,FAD_binding_6;TM=M;; 9.22; 10.72

416498; U33632; Hs.73351; potassium channel ubfamily K, memb r rTI i SS M 9.20; 4.46 

409956; AW103364; Hs.727; inhibin, beta A (activin A, activin AB a; TGF-beta,TGFb_propeptide,Tub;SS=M; 9.19; 16.46

413095; AA494359; Hs.30715; potassium voltage-gated channel, Isk-rel; none,START; 9.15; 2.18

418838; AW385224; Hs.35198; ectonucleotide pyrophosphatase/phosphodi; Phosphodiest;TM=Y;SS=M; 9.14; 3.03

452960; AK001335; Hs.31137; protein tyrosine phosphatase, receptor t; Y_phosphatase,none; 9.14; 11.75

417821; BE245149; Hs.82643; protein tyrosine t n i M; 9 11; 4.29 

427157; U51166; Hs.173824; thymine-DNA glycosylase; UDG;TM=M;; 9.06; 9.68 

431341; AA307211; Hs.251531; proteasome (prosome, macropain) subunit,; proteasome;TM=M;; 9.05; 5.61

413367; NM_006617; Hs.75317; solute carrier family 16 (monocarboxylic; sugar_tr;TM=Y;; 9.04; 5.79

437296; AA350994; Hs.20281; KIAA1700; Rhodanese,DSPc;TM=M;; 9.02; 5.75

418888; AU076801; Hs.89436; cadherin 17, LI cadherin (live -inter li cadhe, in fM- 1 SS=M 8.94; 5.01 

446406; AI553681; Hs.348490; Arg/Abl interacting protein ArgBP2; Sorb,none; 8.91; 1.77

428820; AA436187; Hs.172631; integrin, alpha M(complementcomporent v arn < r r P TM=Y SS-M; 8.85; 4.74 
434398; AA121098; Hs.3838; serum-inducible kinase (SNK); pkinase,POLO_box;TM=M;; 8.78; 4.54
453902; BE502341; Hs.3402; ESTs; none,none; 8.72; 3.71 

433334; AI927208; Hs.231958; matrix metalloproteinase 28; Peptidase_M10,none; 8.71; 4.28

446488; AB037782; Hs.15119; KIAA1361 protein; pkinase;SS=M; 8.70; 3.71 

450247; AF123303; Hs.24713; hypothetical protein; efhand,mito_carr;TM=Y;SS=M; 8.68; 3.40

432101; AI918950; Hs.123642; EphA3; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 8.62; 5.62

410763; AF279145; Hs.8966; hypothetical protein FLJ21776; none,none; 8.61; 13.53

422278;AF072873;Hs.114218;frizzlcd(Drosophila)homolocb k rizzled,7tm_2;TM 

425465; L18964; Hs.1904; protein kinase C, iota; pkinase,DAG_PE-bind,pkinase_C,OPR;TM=M;; 8.50; 3.18

419111; AA234172; Hs.137418; ESTs; none,IRK; 8.47; 7.51

430024; AI808780; Hs.227733 nU. ji r Iph ntegni \P;TM=Y b=M; 8.45; 3.46 

447574; AF162666; Hs.18895 tojsled I ii 1 TM M 845 5 30 

447217; BE465754;Hs.17778; neuropilin 2; CUB,MAM,F5^F8_ryps_i TM . 1 3 I.' f "4 6.30 

' I34; NM_002110:Hs,89555; hemopoietic cell kinasi H2,SH3,pkinase;TM=M;; 8.4 1.87 

405555; ; ; homeodomain-interacting protein kinase 3; trypsin;TM=M;; 8.39; 0.68

417412; X16896; Hs.82112; interieukin 1 recepto , t/n- 1 ig riR if " o=M,8 35;4.74 

405556; ; ; homeodomain-interacting protein kinase 3; trypsin;TM=M;; 8.31; 0.87

407587; AK002011; H " 1 llielicai pro n FLJ 1 1 1 I ql r / ntb,ldh,ldh_C,pkinase;8S=M; 8.28; 3.12 
408051; AI623351; Hs.172148; ESTs; PH,RhoGAP,none; 8.27; 5.65

449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; 7tm_1;TM=Y;SS=M; 8.26; 5.49
429732; U20158; Hs.2488; lymphocyte cytosolic protein 1 1 1 12 run 112 SS=M; 8 24, 8,91 
405204; ; ; NM_002086*:Homo sapiens growth factor re; SH2,SH3;TM=M;; 8.23; 6.43
426806 T1 9228 HM 1 hvpoth ^1 protein Fl 009 pkin PF007 M; 8.20; 6.11 
428428; AL037544; Hs.184298; cyclin-dependent kinase 7 (homolog of Xe; pkinase;TM=M;; 8.18; 8.37

450375; AA009647; Hs.352537; a disintegrin and metalloproteinase doma; Reprolysin,Pep_M12B_propep,disintegrin,Reprolysin,Pep_M12B_propep,disintegrin; 8.17; 12.24
U67319; H 7 , i 1 r tedcj i n ( E I L r. r T" c t 15:4.61 

13132; NM_006823;Hs.75209;p in kinase (cAMP-de icr t FN M; 8.15; 11 
428513; BE220806; Hs.184697; Homo sapiens clone 23785 mRNA sequence; PSI,none; 8.13; 13.28
429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAM; 8.13; 1.15 
425838;NM 014071; Hs.159613; nuclear receptor coactivator RAF250 dl rent rtv Ii -12 1.5 
425836; AW955696, Hs 909c D ESTs Cbl_N CbLN2,Cbl_N3.UBAzf-C3HC4,none; 8.11; 7.47 
406366; ; ; secreted frizzled-related protein 4; trypsin;SS=M; 8.05; 0.69

429663; M68874; Hs.211587; phospholipase A2, group IVA (cytosolic, ; C2,PLA2_B;TM=M;; 8.04; 5.20

458946; AA009716; Hs.42311; ESTs; none,DSPc,Y_phosphatase; 8.02; 1.93

425280; U31519; Hs.1872; phosphoenolpyruvate carboxykinase 1 (sol; PEPCK;TM=M;; 7.97; 19.33

445800; AA126419; Hs.32944; inositol polyphosphate-4-phosphatase, ty; none,none; 7.90; 11.44

449444; AW818436; Hs.351306; solute earner family 16 (morrr I I rr r bS=M 7 89; 7.00 

426167; AF039023; Hs.167496; RAN binding protein 6; Armadillo_seg,HEAT_PBS;; 7.83; 11.16

400408 S7576 1 ; Hoi i r ne, partial c; 7 none 781,078 

443362;M641767;Hs.21015;hypotheticalproteinDKFZp564L38H mil u Tl'-i SS-M 7 78; 7.02 

1 ?0; AF1 19666; H 14 n j r tor tyrosine kinase sut H3;TM r,2.i 

409799; D11928; Hs.76845; phosphoserine phosphatase-like; Hydrolase;TM=M;; 7.72; 2.53

447337; M114050; Hs.19949; caspase 8, apoptosis-related cysteine pr; ICE_p10,ICE_p20,DED;TM=M;; 7.68; 2.40

421634; BE231591; Hs.106768; hypothetical pr ri I 11 radii e M 7 49,6 57 

434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis, clone HE; Nucleoside_tra2,none; 7.47; 2.53

417880;BE241595;Hs.82848;selectinL(lymphocyteadhe lonnolp-ii- - r ct i i TM=M SS=M; 7.38; 5.60 

436729; BE621307; Hs.351316; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 7.29; 5.78
428970; BE276391; Hs.194691; retinoic acid induced 3; 7tm_3;TM=Y;SS=M; 7.26; 11.00

426761; AI015709; Hs.172089; PORIMIN Pro-oncosis receptor inducir ji p i TM rSSM 7 25;7.22 

413880 AI660842, H 0915 in* lejkin 22 receptor; 1 c;TM ^S=M 7 24 0 98 

418945; BE246762; Hs.89499; arachidonate 5-lipoxygenase; lipoxygenase,PLAT;TM=M;; 7.22; 5.45
413441; AI929374; Hs.75367; Src-like adapter; SH2,SH3;TM=M;; 7.20; 5.72

426158; NM_001982; Hs.199067; v-erb-b2 avian erythroblastic leukemia v; Furin-like,pkinase,Recep_L_domain,Furin-like,pkinase,Recep_L_domain,Peptidase_M24; 7.13; 3.97

428474; AB023182; Hs.184523; KIAA0965 protein; pkinase;TM=M;; 7.13; 5.43 

421582; AI910275; Hs.350470; trefoil factor 1 (breast cancer, estro-c t foil.G 1 ii S =M 7 08 21.61 

44984 r S y 707,6.18 

10 452110;T47667;Hs.28005;HomosapienscDNAFLJ113o°!i clone PL; pkin; - „e ,\ r_r = ,p nune, 6.94; 4.82 

451295; AI557212; Hs.17132; ESTs, Moderately similar to I54374 gene ; pkinase,DAG_PE-bind,pkinase_C,OPR,none; 6.92; 15.34

430680; AW138724; Hs.168974; ESTs, Highly similar to ALU7_HUMAN ALU S; Y_phosphatase,Adaptin_N,Y_phosphatase; 6.88; 1.94

441600; AA939347; Hs.127223; Homo sapiens cysteine knot protein (ZSIG; 7tm_1,ldl_recept_a,LRR;SS=M; 6.86; 0.97

410687; U24389; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;SS=M; 6.83; 7.24
425009; X58288; Hs.154151; protein tyrosine phosphatase, receptor t; fn3,ig,Y_phosphatase,MAM;TM=Y;SS=M; 6.83; 11.43

400539; ; ; Target Exon; none;TM=M;; 6.70; 1.19

431113; AK000673; Hs.274337; hypothetical protein FLJ20666; pkinase;TM=M;; 6.65; 2.21 

445280; AW055063; Hs.343220; v-crk avian sarcoma virus CT10 oncogene ; SH2,SH3,none; 6.61; 10.66

425834; NM_001639; Hs.1957; amyloid P component, serum; pentaxin;TM-V;SS=M; 6.57; 2.20
435706; W31254; Hs.7045; GL004 protein; PDEase,GAF,none; 6.55; 11.44

41 5906; AI751 357; Hs.288741; Homo sapiens cDNA:FLJ22' r I loneH r plr in none, 6 45 5 25 

408308; AL033377; Hs.44197; hypothetical protein DKFZp564D0462; none,none; 6.42; 9.14

432336; NM_002759; Hs.274382; protein kinase, interferon-inducible dou; dsrm,pkinase;TM=M;; 6.42; 4.12

417874; BE616160; Hs.82829; protein tyrosine phosphatase, non-recept; Y_phosphatase;TM=Y;; 6.42; 2.26
446872; X97056; Hs.16362; pyrimidinergic receptor P2Y, G-protein c; 7tm_1;TM=Y;SS=M; 6.41; 4.54

444006; BE395085; Hs.10086; type I transmembrane protein Fn14; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 6.38; 3.55

412970; AB026436; Hs.177534; dual specificity phosphatase 10; Rhodanese,DSPc;SS=M; 6.35; 4.95 

422583; AA410506; Hs.27973; KIAA0874 protein; ank,G-alpha;TM=M;; 6.35; 3.56 

452355; N54926; Hs.29202; G protein-coupled recep cr 7tm ATP_C;TM 6 11 
422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 6.32; 5.15

407235; D20569; Hs.1 69407; SAC2 (suppressor of actin mulalior y; n Ribo I CJaUosyl_T,Zip,adh_short,zf-C3HC4; 6.30; 8.35 

428486; AW583497; Hs.184604; pancreatic polypeptide; hormone3;TM=M;SS=Y; 6.29; 3.61

408847; AW290997; Hs.30348; ESTs; pkinase,ig,none; 6.28; 3.63

428179; AI127772; Hs.279696; serum/glucocorticoid regulated kinase-li; pkinase,PX,pkinase_C;SS=M; 6.28; 3.50
443614; AV655386; Hs.7645; fibrinogen, B beta polypeptide; none,none; 6.26; 7.48

425354; U62027; Hs.155935; complement component 3a receptor 1; 7tm_1;TM=Y;SS=M; 6.26; 3.98

448888; AW196663; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;; 6.21; 4.10

428180; AI129767; Hs.182874; guanine nucleotide binding protein (G pr; G-alpha,arf;TM=M;; 6.18; 4.62

409245; AA361037; Hs.356436; tRNA isopentenylpyrophosphate transferas; Armadillo_seg;TM=M;; 6.17; 11.15
40 417952; AI192838;Hs.372643;dLd pmr t\ yrnsine-(Y)-ph phoi> i ii none 6 17; 3.05 

445701; AF055581; Hs.13131; lymphocyte adaptor protein; SH2,PH;TM=M;; 6.16; 11.90

425910; AA830797; Hs.184760; CCAAT-box-binding transcription factor; none;TM=M;; 6.10; 2.96

426797; AW936258; Hs.342849; ADP-ribosylation factor-like 5; arf,Ca_channel_B,SH3; 6.03; 3.17

108331; Nf 0; I dual specificity phosph DSPc;TM=f 

441384; AA447849; Hs.288660; Homo sapiens cDNA: FLJ22182 fis, clone H; 7tm_3,none; 5.97; 13.12

414217; AI309298; Hs.279898; Homo sapiens cDNA: FLJ23165 fis, clone L; none;NA;NA; 5.92; 6.47

418506; AA084248; Hs.85339; Unknown protein for MGC29543; none,none; 5.91; 1.94

436345; AA873008; Hs.121572; ESTs; CARD,BIR,zf-C3HC4,CARD,BIR,zf-C3HC4; 5.90; 1.40 

414087;W19712;;gb:zb36d03.r1 Soares_parath/rcidJ '-r _;, -in ie ,none i.8! 0.9C 
430396; D49742; Hs.241363; hyaluronan-binding protein 2; ank,death,ZU5,EGF,kringle,trypsin,Nebulin,UM;SS=M; 5.77; 1.24

431385; BE178536; Hs.11090; membrane-spanning 4-domains, subfamily A; none,none; 5.71; 4.00

427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;SS=M; 5.71; 3.83

414171; AA360328; Hs.865; RAP1A, member of RAS oncogene family; pkinase,DAG_PE L r I I ' DC1 GFP TM=M;; 5.69; 3.07 

418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fus; 7tm_1,7tm_2;TM=Y;SS=M; 5.68; 12.92
425317; AW205118; Hs.210546; interleukin 21 receptor; none;TM=Y;SS=M; 5.60; 5.45

417863; AB000450; Hs.82771; vaccinia related kinase 2; pkinase;TM=M;; 5.59; 4.19

400151; ; ; Eos Control; AT_hook,DNA_mis_repair,HATPase_c,UQ_con;TM=V;; 5.53; 3.13

450139; AK001838; Hs.355608; serum/glucocorticoid regulated kinase; none,none; 5.52; 8.61

418203; X54942; Hs.83758; CDC28 protein kinase 2; CKS;; 5.52; 10.04
433556; W56321; Hs.111460; calcium/calmodulin-dependent protein kin; pkinase,none; 5.51; 6.75

424701; NM 005923; Hs.151988; mitogen-actlvated protein U se kinase ; pkinase,TM=l ( r 1 , 

iv:;-- i . . ..,- f .. - r , ■ ■■■i-in-ym-tra:, r^FPiC:ft.'=f:i-3:e a 

408761; AA057264; Hs.238936; ESTs, Weakly similar to (defline not ava; 7tm_1,none; 5.42; 2.59

415444 BE247 J I i r He carrier family 20 (phosphate tran; PH04,LlM;TM=M;; 5.37; 8.69 
444184; T87841; Hs.282990; Human DNA sequence from clone RP1-28H20 ; pkinase,RIO1,APH,KOW;TM=M;; 5.36; 3.32

410434; AF051152; Hs.63668; toll-like receptor 2; LRR,LRRCT,TIR;TM=M;SS=M; 5.36; 3.94

1291 M_000312; Hs.2351 n C (inact f |i I F r n i " 

421559; NM 014720; Hs.105751; Ste20-related senne/threonin I nase pkinase,UVR;TM=l 5 1 t 

429922; Z97630; Hs.2261 17; H1 hislone family member 0, linkerji stone;TM=M;; 5.27 I ' 
440682; AW362152; Hs.27181; nuclear receptor binding factor-2; cyclin,bZIP;TM=M;; 5.25; 4.82

411558 AA 10267 I 1 r n f r Lt h n nemb;TM=Y;SS=M; 5.25; 11.26 

428234; U93553; Hs.183123; nuclear receptor subfamily 5, group A, m; hormone_rec,zf-C4;SS=M; 5.20; 1.11

408683; R58665; Hs.46847; TRAF and TNF receptor-associated protein; Exo_endo_phos;TM=M;; 5.19; 6.25 

40E657 * r i 1 H 1 / 3 E 1 rl none 5 18; 5.47 
438746; AI885815; Hs.184727; Human melanoma-associated antigen p97 (m; transferrin,Guanylate_kin,PDZ,SH3; 5.17; 4.02

438698; AW297855; Hs.361171; ESTs, Weakly similar to I38022 hypotheti; lipoxygenase,PLAT,none; 5.16; 2.91

442200; AW590572; Hs.235768; ESTs; none,none; 5.11; 4.22

418738; AW388633; Hs.6882; solute carrier family 7, (cationic amino; none,none; 5.08; 2.71

419088; AI538323; Hs.367688; integrin, beta 8; integrin_B,none; 5.07; 3.53
414555; N98569; Hs.76422; phospholipase A2, group IIA (platelets, ; phoslip;TM=M;SS=Y; 5.05; 3.42

408414; AI114688; Hs.193400 L I, 109260A Bcetl q V1 rSS=M 5 05; 3.41 

430407; H23551; Hs.30974; ESTs; pkinase,PBD,none; 5.03; 1.63

427127; AW802282; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 5.00; 5.14
452194; AI694413; Hs.373599; Ubiquitin-like protein FAT10??? - diubiq; none,none; 4.98; 2.65
410073; AW408163; Hs 5848S catenin (cadtienr associated pro em i * r r hn SS=M 4 97; 10.60 

409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; DSPc,Rhodanese,none; 4.95; 2.87
432841; M93425; Hs.62; protein tyrosine phosphatase, non-recept; Y_phosphatase;SS=M; 4.88; 21.69
433470; AW960564; Hs.351316; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 4.83; 4.60

418529; AW005695; Hs.250897; TRK-fused gene; Band_41,ERM,pkinase,LRR,LRRCT,MAV,Nucleoplasmin,Tropomyosin,OPR,filament,bZIP,G-gamma,M,DUF164;TM=M;; 4.79; 5.47

421425; AK001564; Hs.104222; hypothetical protein FLJ10702; efhand,kazal,arf,ras,7tm_1;TM=M;; 4.75; 5.41

414135; NM_004419; Hs.2128; dual specificity phosphatase 5; Rhodanese,DSPc,Y_phosphatase;TM=M;; 4.74; 9.76

417640; D30857; Hs.82353; protein C receptor, endothelial (CPC r rM=M M 4 73; 4.68 

430630; AW26992C H 1 a tin A ( fin / y Hi TM V 4 66; 5.61 

452239; AW379378; Hs.356289; protein tyrosine phosphatase, receptor t; none,none; 4.63; 6.62

427333; AF067797; Hs.176658; aquaporin 8; MIP;TM=Y;SS=M; 4.63; 0.80 

431890; X17033; Hs.271986; integrin, alpha 2 (CD49B,al ha 2 si i i;vi egrin_A,FG-GAP,TM SS=M; 4.58; 11.38 

428065, AI634046 H 15731 ESI c p2C DED ICE_p10,lCE_p20,DED; 4.55; 4.51 

428582; BE336699; Hs.185055; BENE protein; none;TM=Y;SS=M; 4.54; 8.76

416224; NM_002902; Hs.79088; reticulocalbin 2, EF-hand calcium bindin; efhand;SS=M; 4.54; 19.57

450056; BE047394; Hs.8208; ESTs, Weakly similar to 871512 hypotheti; ABC_tran,ABC_membrane,ig,MHC_II_beta,SRP54,proteasome,ABC_membrane,ABC_tran; 4.49; 10.47
414987; M524394; Hs.294022; hypothetical protein FLJ14950; SH2;TM=M;; 4.41; 7.27
447232; AW499834; Hs.327; interleukin 10 receptor, alpha; none;TM=M;SS=M; 4.41; 10.25
433208; AW002834; Hs.24095; ESTs; arf,Ca_channel_B,SH3; 4.39; 12.14
403208; ; ; Target Exon; lectin_c,none; 4.37; 0.76

440486; BE24J51-. H 1 h S c he cal protein PP1044; LRR,PMD_DAPIN,AAA,CARD,NB-ARC',NA;NA; 4.36; 10.34 

414278; AA330116; Hs.355877; Human glucose transporter pseudogene; none,none; 4.35; 7.95

424833; NM_003894; Hs.153405; period (Drosophila) homolog 2; PAS;SS=M; 4.34; 6.23

422573;AW297985; Hs.295726; integrin, alpha V (vitronectin r ep FG-( c r a in A rone, 4 32; 5.85 

418721; NM_002731; Hs.87773; protein kinase, cAMP-dependent, catalyti; pkinase,pkinase_C;SS=M; 4.31; 3.09

412330; NM_005100; Hs.788; A kinase (PRKA) anchor protein (gravin) ; none;TM=M;; 4.25; 12.74

421939; BE169531; Hs. 109727; TAK1-bindlng protein 2 rTA*1 33; r T JE T 1 1 

414774; X02419; Hs.77274; plasminogen activator, urokinase; kringle.trypsin.planL hionins;SS=M; 4.24; 6.91 

418526; BE019020; Hs.85838; solute carrier family 16 (monoc bo, lie rertT" i r M 4 22 5.27 

415801;R24219;Hs.278443;Fcfragmentoflg^lr> nity lib, re; ig.TM 16,7.22 

417866; AW067903; Hs.82772; collagen, type XI, alpha 1; Collagen,COLFI,TSPN,laminin_G,CorA;SS=M; 4.16; 9.27 

445496; AB007860; Hs.12802; development and differentiation enhancin; SH3,ank,PH,ArfGap;TM=M;; 4.15; 23.43 

436075; BE090176; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 4.14; 3.76

414462; BE622743; Hs.301064; arfaptin 1; none,none; 4.08; 13.43

435730; AB020635; Hs.4984; KIAA0828 protein; AdoHcyase,TrkA-N,2-Hacid_DH_C;TM=M;; 4.06; 9.12

431681; AK000378; Hs.267566; hypothetical protein FL) 1371 n 1U ( 404 1C05 

429379; NM_014840; Hs.200598; KIAA0537 gene product; pkinase,RIO1;TM=M;; 4.00; 6.35

429061; Y14039; Hs.195175; CASP8 and FADD-like apoptosis regulator; ICE_p20,DED;TM=M;; 3.98; 5.56

405203; ; ; NM_002086*:Homo sapiens growth factor re; SH2,SH3;TM=M;; 3.95; 17.71

409335; NM_001502; Hs.53985; glycoprotein 2 (zymogen granule membrane; zona_pellucida;TM=M;SS=M; 3.94; 0.58

446006; NM_004403; Hs.13530; deafness, autosomal dominant 5; none;TM=M;SS=M; 3.89; 7.59 

413899; AF083892; Hs.75608; tight jin^t i in2(zonac luden 'H i nyl JinTM~M;; 3.84; 8.89 

438000; AI825880; Hs.5985; non-kinase Cdc42 effector protein SPEC2; none;TM=M;; 3.83; 4.22 

418054 NMJ02318 H - 3 lysyl oxidase-like 2, SRCR.Lysy i TM MSS=M 3 61 6 45 

450285; AW383256; Hs.24752; spectrin SH3 domain binding protein 1; SH3;TM=M;; 3.78; 8.49

417141; U22662; Hs.347353; nuclear receptor subfamily 1, group H, m; hormone_rec,zf-C4;SS=M; 3.77; 4.22

456376; AA663904; Hs.89862; TNFRSF1A-associated via death domain; death;TM=M;; 3.68; 4.92

438113; AI467908; Hs.8882; ESTs; 7tm_1,none; 3.59; 12.12

414883; AA926960; Hs.348669; CDC28 protein kinase 1; CKS;; 3.58; 10.93 

429952; AF0801 55 H i fru M ncse,ubiquitin,Er,terotoxin_A,PH04,pkinase,ubiq'jitin; 3 57; 5.10 

415088; AI077288; Hs.374374; serum/glucocorticoid regulated kinase; none,none; 3.56; 4.60
418478; U38945; Hs.1174; cyclin-dependent kinase inhibitor 2A (me; ank;; 3.55; 4.52 

s h HI i I trans; SH2,STAT,STAT_bind.STAT.prot;l 9 

434608; AA805443; Hs.179909; hypothetical protein FLJ22995; none;TM=M;; 3.52; 9.70 
400288; X06256; Hs.149609; integrin, alpha 5 (fibronectin receptor,; integrin_A,FG-GAP;TM=Y;; 3.45; 6.44
445350; AF052112; Hs.12540; lysophospholipase I; abhydrolase_2;TM=M;; 3.41; 6.03
418255, AW135405, Hs 3 251; ESTs; i ii none, 3 41, 13.97 

408822; AW500715; Hs.57079; Homo sapiens cDNA FLJ13267 fis, clone OV; PIP5K,none; 3.40; 8.97

426432; AF001601; Hs.169857; paraoxonase 2; Arylesterase;TM=M;; 3.39; 11.24

431629; AU077025; Hs.265827; interferon, alpha-lnduclble prot n r r Tl ' r 3 r 3 39 5 10 

414291; AI289619; Hs.13040; G protein-coupled receptor 85; 7tm_1;TM=Y;SS=M; 3.38; 10.25

457329; AI634860; Hs.247043; type 1 tumor necrosis factor receptor sh; Peptidase_M1;SS=M; 3.38; 13.78 

411125; AA151647; Hs.68877; cytochrome b-245, alpha polip-ptn none;Tt» rt M 3 36 4 17 

443710; A1928136; Hs.9691; Homo sapiens cDNA FLJ ti or G-alpha.none; 3.32; 20.33 

454294; AB000734; Hs.50640; JAK binding protein; SH2;TM=M;; 3.31; 6.94

408912; AB011084; Hs.48924; KIAA0512 gene product; ALEX2; Armadillo_seg;TM=V;SS=M; 3.29; 3.07

426728; NM_007118; Hs.367689; triple functional domain (PTPRF interact; SH3,ig,pkinase,PH,spectrin,RhoGEF;TM=M;; 3.27; 14.90

427202; BE272922; Hs.173936; interleukin 10 receptor t T je T T YTMC 24 4 49 

413076; U10564; Hs.75188; wee1 (S. pombe) homolog; pkinase;TM=M;; 3.24; 12.27

425976; C75094; Hs.334514; NG22 pro er jcllag _ L( TM i " '2. 1 it. 

417534 NMJ04998 H 82251 r , I 1 '13 21:15.21 

458097; AW341135; Hs.58104; ESTs; none,SH3,PID; 3.21; 7.34

437928; NMJ05476; Hs.5920; UDP-N-acetylglucosamine-2-epimerase/N ac; hexekir e FGG i ROK,Epimerase_2;SS=M; 3.20; 8.38 
425177; AF127577; Hs.155017; nuclear receptor interacting protein 1; none;SS=M; 3.19; 5.09
416094; AW995512; Hs.225977; nuclear receptor coactivator 3; none,none; 3.18; 4.17

453489; AA300067; Hs.102000; hypothetical protein DKFZp434N185; F5_F8_type_C,pkinase,Ets,F5_F8_type_C,pkinase,Ets; 3.17; 7.88

414914, U49844 H 776 1 « telangiectasia and Rad3 related; FAT.FATC.PI3. PI4_kinase;TM=M;; 3.16; 4.71 . „ „ 

412767; AA233808; Hs.286241; protein kinase, cAMP-dependent, regulato; SH3,7tm_2,cadherin,GPS,laminin_G,EGF,laminin_EGF,Sulfate_transp,STAS,cNMP_binding,RIIa; 3.1
437175; AW968078; Hs.87773; protein kinase, cAMP-dependent ca y a l r C none; 3.14; 11.72

1 t t oy H <1 i n r l r PDZ.l i !M '< 3 1 

419591;AF090900;Hs.91393;HomosapienscDNA:FLJ21337f H K 2 TM < 

447225; R62676; Hs.17820; Rho-associaled, colled coil contain nu r F irr r s.HR1,nor HI rt 

412692; AF044288; Hs.74515; aryl hydrocarbon receptor nuclear transl; HLH,PAS,PAC;TM=M;; 2.95; 12.23

409274; NM_003930; Hs.52644; SKAP55 homologue; SH3,PH;SS=M; 2.90; 14.62

417707; AL035786; Hs.82425; actin related protein 2/3 complex, subun; none;TM=M;; 2.90; 11.00

427045; H86504; Hs.173328; protein phosphatase 2, regulatory subuni; B56;TM=M;; 2.89; 6.12

431177; NM_003304;Hs.250687; transient receptor potential ch nnel 1 ; ionjran rl rM 2.89: ( 53 

443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=M;; 2.87; 9.18

418546; AA224827; ; gb:nc32g04.s1 NCI_CGAP_Pr2 Homo sapiens vwa.infc n ./ c ~ ,aF none 2 86 9.94 

446668; W58353; Hs.285123; Homo sapiens mRNA full length insert cDN; NDK,PH,Oxysterol_BP;SS=M; 2.85; 14.25

454080; AI199711; Hs.576; fucosida e alpha L 1 i Alpha.U . T" - 1^ M 281 28.84 

1 874;7V9432 H 9651 r no hibi ivi'.y; SH3;TM=M;SS=Y; 2.30: 10.53 

433000; U26710; Hs.3144; Cas-Br-M (murine) ectropic retroviral tr; zf-C3HC4,UBA,Cbl_N,Cbl_N2,Cbl_N3;; 2.77; 10.93

444488; AW1 92879 H ncient conserved domain r r are 2 77 12 58 

417904; AI750762; Hs.82911; protein tyrosine phosphatase type IVA, m; Y_phosphatase,DSPc;TM=M;; 2.75; 12.78

425204; NM_002436; Hs.1861; membrane protein, palmitoylated 1 (55kD); SH3,PDZ,Guanylate_kin;SS=M; 2.74; 5.71

419262; AA834664; Hs.29131; nuclear receptor coactivator 2; PAS,zf-C2H2,SET; 2.73; 12.50 

410793; AW581906; Hs.66392; intersectin 1 (SH3 domain protein EH nd PH.P.i E M M 

446081; AA972412; Hs.13755; f-box and WD-40 domain protein 2; WD40,F-box,Ribosomal_L14;TM=M;; 2.71; 12.29 

414443; AU077268; Hs.761 44; platelet-derived growth factor ret er i j.pkinasefl i 2 71; 10.53 

452683; AI089575; Hs.374574; progesterone membrane binding protein; homeobox,none; 2.69; 12.53

i 1_014339;Hs.129751;interleukir r TM f 

422627; BE336857; Hs.118787; transforming growth factor, beta-induced; Fasciclin,ABC_tran,ABC_membrane,GTP_EFTU;TM=M;SS=M; 2.67; 12.22

453915 AA588721 HP 1 r mal | ~> em L44 none T box; 2.65; 6.38 

416810; AF035606; Hs.80019; programmed cell death 6; efhand;TM=M;; 2.61; 13.89

439658; AA332057; Hs.5539; hypothetical protein MGC I 5440, r.i ne,TM-M;3S4v1; 2 53, 10.19 

449924; W30681; Hs.146233; Homo sapiens cDNA: FLJ22130 fis, Cone H; SH3,none; 2.56; 19.04 

412926; AI879076; Hs.75061; macrophage myristoylated alanine rich C ; M \RCk E 3S M, 2 55; 14.99 

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 B-cell gr; Furin-like,pkinase,Recep_L_domain,YLP,none; 2.52; 14.71

409098 AA1326 7 2 I 1 rn ml trailed PH,Sec7:TM=V;; 2.51 14.51 

413040; AA193338; Hs.12321; sodium calcium exchanger; Na_Ca_Ex;TM=Y;SS=M; 2.49; 9.28 

•L'Ai/'.i.AI M'j-f.t".. II-:. I IIL'h. pil'i. 'i'il'ii r. • r -.t.ii:,, 1 n ••- , 'i. TTTF .TV !.' 1.3 ". 2 K- 1245 

452289; BE568205; Hs.28827; mitogen-activated protein kinase kinase ; pkinase;TM=M;; 2.44; 6.68 

427657; AV652249;Hs.180107; polymerase (DNA directed), bela ncni.it' f 2.43 17 

446287; BE247683; Hs.14611; dual specificity phosphatase 11 (RNA/RNP; DSPc;SS=M; 2.41; 9.51

410017; AW952426; Hs.109438; Homo sapiens clone 24775 mRNA sequence; none,none; 2.41; 14.01

424756; AW504657; Hs.152931; lamin B receptor; ERG4_ERG24,FKEP;TM=Y;; 2.40; 5.98

447580; AI95336C; Hs.88201; ESTs; none.none; 2.36; 11.63 

426276; AW881411; Hs.169078; hypothetical protein FLJ23018; hormoneje- £ * r4 Tf M 4; 13.34 
424441; X14850; Hs.147097; H2A histone family, member X; histone,CBFD_NFYB_HVF;; 2.33; 12.17 
429623; NM_005308; Hs.21 1 569; G protein-coupled receptor 1 irv e pi RGSJMM 332; 15.01 
439866; AA280717; Hs.6727; Ras-GTPase activating protein SH3 domain; rrm,MTF2;TM=M;; 2.32; 12.48 
453648; W21493; Hs.28329; hypothetical protein FLJ14005; none.none; 2.31; 13.19 
443951; F13272; Hs.356835; ferritin, light polypeptide PVP.2_n udin.ncne; 2.31 1 
453327; AW500180; Hs.356109; tryptophanyl tRNA synthetase; rrm,vwa,FG-GAP; 2.30; 13.02
439256; AA322302; Hs.183302; PCTAIRE protein kinase 2; none,none; 2.26; 10.36

424467; AI929392; Hs.350026; DnaJ (Hsp40) homolog, subfamily B, membe; DnaJ,pkinase,UBA,pkinase_C;SS=M; 2.26; 11.82 

426440; BE382756; Hs.169902; salute earner family facil ii nil 11 2 26 12.54 

456607; AI650190; Hs.105070; cyclin-dependent kinase inhibitor 1C (p5; CDI;TM=M;; 2.25; 13.11 

423960; AA164516; Hs.136309; SH3-containing protein SH3GLB1; SH3,none; 2.20; 20.05

424058; AL121516; Hs.138617; thyroid hormone receptor interactor 12; HECT,WWE;TM=M;; 2.20; 13.38

446644; NM_003272; Hs.15791; transmembrane 7 superfamily member 1 (up; none;TM=Y;SS=M; 2.18; 15.68

411218; H46440; Hs.180628; dynamin 1-like; dynamin_2,dynamin,GED,none; 2.18; 13.83

414721; X90392; Hs.77091; ribosomal protein L10: E;<oj5ndc_phos 1 Rit^somai_U0e,Acyltransferase,SCP;TM=M;SS=M; 2.14; 11.24 

421759; AA027968; Hs.107979; small membrane protein 1; none;TM=Y;SS=M; 2.14; 14.03

416240; NM_001981; Hs.79095; epidermal growth factor receptor pathway; efhand,DUF164;TM=M;; 2.13; 12.86

435521; W23814; Hs.6361; mitogen-activated protein kinase kinase ; none,none; 2.12; 11.08

409340; BE174629; Hs.321130; hypothetical protein MGC2771; aa_permease,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;; 2.12; 14.05

453064; R40334; Hs.89463; potassium large conductance calcium-acti; none,none; 2.12; 8.96

409223; AA312572; Hs.362852; phosphoinositide-3-kinase, regulatory su; SH2,SH3,RhoGAP,none; 2.08; 11.50

414482; S57498; Hs.76252; endothelin receptor type A; 7tm_1;TM=Y;SS=V; 2.06; 14.23

41443 A 853 I c It f t«>e.C.sdh none 2^ 1345 

450455; AL117424; Hs.25035; chloride intracellular channel 4; none,TNF; 2.05; 19.04

i i i if 005E Hs.2 7; synaptcl I l;sy frevi NTF2;TM-> i ' 1 <t 

42211 1 c . 4 I m~ty Li 3 I M 203; 12.60 

434935; BE561824; Hs.273369; uncharactenzed hematopoietic tern/pr i re I M ^ 02 10 52 

433427; M816449; Hs.171889; cholinephosphctroisl, " - i H "~ r >F OH P 1 r MM-M 2 02,16 37 

410850;AW362867;Hs.302738;HomosapienscDNA:FLJ21425fib lone! tl tr r , T ri < ,_hox 2 02; 9.37 

440481; AA182907; Hs.7200; Homo sapiens, done MGC 1 371 4 m UP om; pkii se CC Tl M 202 12 31 

43^645; AF255303; Hs. 1 12227; membrane-associated nucleic acid binding. z(-CCCH,gpdh,Adeno_E1B_55K,zf-C3HC4;TM=M;; 2.00; 9.15 

410113; AW996564; Hs.250824; Homo sapiens cDNA: FLJ23435lis, clone H; pkinase.none; 1.99: 10.64 

414636; AL120259; Hs.76691; slannin. none;TV^tS8»Y; 1.95; 7,72 

408176; AK001 553; Hs.43436; adenylate kinase 3 alpha like; adenyh - i h r ir 1.9 i 

422690; AU077275; Hs, 119222; suppression of tumo'i i ity 3(i w |PPTM=M 194; 10.91 

427861; BE538296; Hs.323834; cytochrome c oxidase subunit Va; none,GKAP; 1.93; 20.57

433387; L76528; Hs.3260; presenilin 1 (Alzheimer disease 3); Presenilin,7tm_3,oxidored_q5_N;TM=Y;; 1.92; 12.58

453938; AF082569; Hs.36794; D-type cyclin-interacting protein 1; B56;TM=M;; 1.90; 12.74

433592; NM_004642; Hs.3436; deleted in oral cancer (mouse h mcl r i TM 1;;1.£ 2 

447791; BE241859; Hs.19575; CGI-11 protein; V-ATPase_H,Armadillo_seg;TM=M;; 1.88; 12.82 

426359; AA376409; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone H; adenylatekinase,none; 1.88; 14.95
432650; D00850; Hs.56; phosphoribosyl pyrophosphate synthetase ; none,none; 1.88; 12.70
424250 AF073310 H 143648 in u r R TM M 1 86 19.50 

244E 68B21; Hs.149155; vottage-d dent anion channel I c t 1 II 

- 55 BE39432 I 6267 I in-like 1 -d al.arf.ra 'm 1;TM=M;:1 5; 1 
426122; NM_006925; Hs.166975; splicing factor .arginr rine-rich 5; urn M; 1 K 
451579 AW607731 H 11 lie 1 1 Y C S M 1 83 20.35 

428901; AI929568; Hs.146668; KIAA1253 protein; 7tm_2,UPF0073,TMS_TDE;TM=Y;SS=M; 1.83; 19.00
4539t3,WWG311, Hs ' "F~T p ii ase Actlvin re | none 1 82, 15 25 
417414; AA434589; Hs.367676; dUTP pyrophosphatase; dUTPase,KRAB;; 1.81; 14.20
414521; D26124; Hs.76307; neuroblastoma, suppression of tumorigeni; DAN;TM=M;SS=M; 1.81; 22.29
425356; BE244879, Hs U nositcl phosphate 5-phosphatase, 14 Exo_er.do_phos,SH2;TM=M;; 1.80; 18.30 
417733; AL048678; Hs.82503; H.sapiens mRNA for 3'UTR of unknown prot; none;NA;NA; 1.80; 6.28
424S0S AF21W4 Hs c 21 C Cbl-inli ing protein; SI TM M 1 BO 11.99 
420747; BE294407; Hs.99910; phosphofructokinc-f e -n- T f , M 1 7 9; 2525 
416819; U77735; Hs.80205; pim-2 oncogene; pkinase;SS=M; 1.78; 15.25 
437708; AB033020; Hs.5801; KIAA1194 protein; LRR,Exo_endo_phos;TM=M;; 1.77; 11.11
439877; H39685; Hs.258730; tryptase beta 1; pkinase;SS=M; 1.77; 21.91
440256; U23841; Hs.18851; hypothetical protein FLJ10875; none,UBA,UBX; 1.76; 12.95
425367; BE271188; Hs.155975; protein tyrosine phosphatase, receptor t; none;TM=M;SS=Y; 1.76; 21.01
414703; BE243877; Hs.374366; ATPase, Na? transporting, beta 3 polypep; Na_K-ATPase;TM=Y;SS=M; 1.75; 23.03
443693; AI344782; Hs.349261; DnaJ (Hsp40) homolog, subfamily C, membe; rrm,DnaJ,TPR;TM=M;; 1.75; 13.29
437412; BE069288; Hs.34744; Homo sapiens mRNA; cDNA DKFZp547C136 (fr; ABC_tran,GTP_EFTU,ABC_membrane,none; 1.75; 8.75
413796; AW408094; Hs.75545; interleukin 4 receptor; fn3,granulin;TM=M;SS=M; 1.74; 14.73
438438; AA257992; Hs.50651; Janus kinase 1 (a protein tyrosine kinas; pkinase,SH2,adenylatekinase,none; 1.73; 24.10
429655; U48959; Hs.21 1 582; myosin, light polypeptide I n 'in e.fr qrcre1 71 J50 

421456; AW579B42; Hs.1045fc7; hypothetical protein FLJ10697; zf-C2H2,DUF13.eniand,C2,Pl-PLC-Y,PI-PLC-X;TM=M;; 1.73; 16.87 

1135 lis 54985 ESTs nonenone 171 1040 
442819; BE622721; Hs.284275; Homo sapiens PAK2 mRNA, complete cds; none.pkinase.PBD; 1.69; 14.02 
447918 AM 29 20;Hs.1151 L Hig /sin IC n pkin M i ne 1 69 17 14 

429279; AB018271 Hs 198689, KIAA0728 proteir t y< i I i pectrir A ycmjail; 1.68; 14.21 
450440; AB024334 i 2S001; tyi 1 e t-morooxygenase/tryptophan 5-mo; 14-3-3;TM=M.;; 1.67; 24,67 
413423 AU076684 Hs r in n ihn none 1 65 29.28 

420972; AW8 1461 6; Hs.31431; hypothetical proteir FLJ12171; Fructosamin_kin;SS=M; 1.65; 10.75 

416884; M60484; Hs.80350; protein phosphatase 2 (formerly 2A) c ' I M; 1 4 

436719;Y11192;Hs.5299;aldehydedehydrogenase5famlly member lipmil n 1 1 J 1 ir I t n M 161; 11,20 

11 12; 111; 1 Is. 1 2 l n r ( i> bran i M ft 1 b1 14.93 

414176; BE140638; Hs.75794; EDG-2 (endothelial differentiation, lys; 7tm J ,CRCB;TM=Y;; 1 61; £.03 

431476; BE612705; Hs.256697; histidine triad nucleolide-bindirg prote; HIT;SS=M; 1.60; 24.37 

412347; AW970026; Hs.73818; ubiquinol-cytochromec redjc c r e h r i p; UCR_hinge,G-alp rf;TM=M 1 59 18 09 

804;AW4C ts.170 in r i ion fact; lRF,zf-C3IIC4,lD f-RanC TM=M;; 1.59 

12B55 L E297660; Hs.170328; moesin; Band_41 ,ERM,pkinase,LRR,LRRCT,M ft tt Nuclt pi nin.Trop ,myosin.OPR,1ilamenl,bZIP,G-gamma,M,DLJF164;TM=M;; 1.58; 
428216; M1346B II' 183 1 7 |r mhn cAMP-d ndei eg r "F_birJrq n SS M 1 56 1358 
421251; Z 8913 Hs 102948; enigma (LIM domain protein) LlM,PDZ;3S=M; 1.56; 13.51 
448581; NM_002709; Hs.21537; protein phosphatase 1 catalytic su mi nrn-p none, 1 55; 12.33 
417098; AB017365; Hs.173859; frizzled (Drosophila) homolog 7; FrizzIed.Fz,7tm_2,tcxir_2;TM=Y;SS=M; 1.55; 13.77 
437076; AA961260; Hs.5443; BCL2-associated athanogene 5; BAG I lanta nuclerj cp TM=M , 1.54; 10.93 
426653; AA530892; Hs.171695; dual specificity phosphatase 1; Rhodanese,DSPc,Y_phosphatase;TM=M;; 1.54; 1 1.83 
421143 AB024 30 II 102 1 nrj bu'in superf n i !e ig LPR LRRNT,LRRCT;TM=M;SS=M; 1.53; 23.05 
414457; AW514320; Hs.76159; ATPase H tun p r » i n jj plinase,ATP-synt_C,none; 1.53; 32.59 
414382; AW380339; Hs.8068; hematopoietic PBX-interacting protein; M;TM=M;; 1 .52; 8 66 
450998; BE387614; Hs.25797; splicing factor 3b, subunit 4, 49kD; rrm;TM~M;; 1.52; 1 1.74 
402705; ; ; activator of S phase kinase; AhpC-TSA;TM=M;SS=M; 1.51; 26.85 
426268; AF083420; Hs.168913; serine/threonine kinase 24 (Ste20, yeas' p r s SS M 1 r 0 24 1. 
414604; AU076649; Hs.76556; growth arrest and DNA-damage-inducible 3; nor.e;TM-M;; 1.50; 14.35 
445584; AF217518; Hs.8360; PTD012 protein; none;SS=M; 1.49; 12.00 
407232; XQ4526, ; gb:Human liver mRNA for beta-subunit r, g; WD40;TM=M;; 1 .49; 13.32 

424206; NM_003734;Hs.198241; amine oxidase, copper cont nng I J e 1 N3;Tt M SS M 1 48; 13.21 

458761, AFC90922 H I52738; mil londn'al ribe i proteir. L1 1; EF lei ept Rbo-orral L11,RibosomaLL11_N;TM=Y;SS-M; 1.48; 12.50 
426340; 297989; Hs.169370; FYN oncogene related to SRC, FGR, YES; BNR,SH2,SH3,pk:nase,TM=Y SS-M; 1.48; 17.75 
414166; AW888941; Hs.75789; N-myc downstream regulated; DEAD,helicase_C l rrm l Ndr l Cys_knot,TlL,vwa,vwc,vwd,lQ,Rlla,abhydrolase,TGF- 

beta,DUF139,TPR,DSPc,tspJ,RibosomaLS21 p;TM-M 1 4E 2C 17 
452516; AA058630; Hs.29759; RNA POLYMERASE I AND TRANSCRIPT RELEASE ; none;SS=M; 1.46; 12.72 
414240;AL046742; Hs.75842; dual-specificity tyrosine (Yj phosp oi cl e;SS tl 1.45 14 38 
420532; AA248016; Hs.194110; hypothetical protein PRO2730; pkinase,WD40;SS=M; 1.43; 13.92
402575; ; ; Rho GTPase activating protein 1; PAP2;TM=Y;SS=M; 1.43; 13.71
414765; X07854; Hs.77269; guanine nucleotide binding protem(C , rf f" It 1 41 2462 
448423; BE390905; Hs.21198; translocase of outer mitochondrial membr; TPR;TM=M;SS=M; 1.41; 10.70
422587; AI879352; Hs.118625; hexokinase 1; hexokinase,hexokinase2;TM=M;; 1.41; 19.31 
415995; NM_004573; Hs.355888; phospholipase C, beta 2; C2,PI-PLC-Y,PI-PLC-X;TM=M;; 1.40; 11.21 
446108 AL036W r L r f r If It 140 13.98 

427721; A1582843; Hs.180455; RAD23 (S. cerevisiae) homolog A 1 1 i IF ntegr E 3S=M; 1.39; 15.01 
417891; W79410; Hs.82887; protein phosphatase 1, regulatory (inhib; none;TM=M;; 1.39; 15.97 
427373 AB007972 H 130760 myo n ) TM M 139,14.49 

446334; U52427; Hs.14839; polymerase (RNA) II (DNA directed) polyp; 

COX8,SHMT,M.IF,GST_C,EF1G_domar 3Ci N,S i r led.ca'reti r r F V < M 1 38; 12.58 

447042; AB035863; Hs.182217; succinate-CoA ligase, ADP-forming, beta ; , igase-CoA,ATP-grasp,Zip,CPSase_L_D2,GARS_B;TM=Y;SS=M; 1.37; 113? 
427705; AI870421; Hs.180394; signal recognition particle 14kD (homolo; SRP14,TNFR_c6;SS=M; 1.37; 22.05
425969; AW576265; Hs.301763; KIAA0554 protein; SH3,FCH,HR1;TM=M;; 1.37; 13.68
433572; AL046859; Hs.3407; protein kinase (cAMP-dependent, catalyti; PKI;SS=M; 1.35; 12.43

410597; W16518; Hs.279518; amyloid beta (A4) precursor-like pro' ii L - r Pl U T h ogei )xid Tt f;S M; 1.35 ""4 
4184^4 Y13622 I n r i it I r T r j ^ ilin F_receptor;SS=M; 1.34; 12.09 

442603; AL035719 Y 091; pte n coiled/col; PITS T" M 1 34 11.40 

418043; AW377752; Hs. 33341 ; AXL receptor tyrosine kinase; fn3,ig,pkinase;TM=Y;SS=M; 1.31; 10.79
439275; AF077046; Hs.6518; ganglioside expression factor 2; MAP1_LC3,aminotran_3;TM=M;; 1.31; 15.89
425875; AU077333; Hs.160483; erythrocyte membrane protein band 7.2 (s; PBP,Band_7;TM=M;; 1.31; 17.93
407744; AB020B29; Hs.38095; ATP b rd n c c family / BC1 ran.PI TM YSS=W; 1.29; 10.95 

111 I II [ I b 

427397; AI929685; Hs.177656; calmodulin 1 (phosphorylase km se nd,R D 3S M 1 23 15.68 

424661; M29551; Hs.151531; protein phosphatase 3 (formerly 2B) ( Melallop rf M 1 28 13.39 

I", i k H c / i l n n enriched ir r tes 15; DED;TM=M;; 1 27 11.15 
440820; AL031B46; Hs.356416; plakophilin 4; none,none; 1.26; 10.65 
448153; Y10805; Hs.20621; HMT1 (hnRNP methyltransferase, S. cerevi; NusG;SS=M; 1.25; 12.07 
447386; NM_006289; Hs.375001; KIAA1027 protein; Band_41,l_LWEQ,Apolipoprolein,IRS;SS=M; 1.22; 10.65 
i E301S09; H n c fe I i r r TM-M;, 1.20; 15 

440708; AF038962; Hs.7381; voltage-dependent anion channe 3; Eui porin.En =ro1oxin_A,PH04 none; 1.20: 14.06 

417069; AA442192; Hs.374980; cytochrome c oxidase subunit VIII; COXB,SHWT,MIF l GST_C.EF1GJomair,GST_N,S1,Fz,Frizzle<l,calreticulinJtra_2,rrm,PAP_assoc;TM=Y;SS=M; 
1.18; 16.91 

402559; ; ; Rho GTPase activating protein 1; PAP2;TM=Y;SS=M; 1.16; 15.49

426636, BE2426"4 I in- ing El (/ T r Ti F UBACTTM=M 1 14; 10.99 

428773; BE266238; Hs.193163; bridging integrator 1; BAR,SH3;SS=M; 1.14; 11.38
406906; Z25424; ; gb:H.sapiens protein-serine/threonine ki; none,none; 1.13; 12.97
443932; AW888222; Hs.9973; tensin; SH2,WW,PID,none; 1.07; 15.41 
421996; AW583807; Hs.1460; glucagon; hormone2;SS=M; 59.35; 1.61 

414998; NM_002543; Hs.77729; oxidised low density lipoprot- n (i c ir ctir rM-> M; 22.9 1.: 

442 H93366;H a J h i mil transfer i s; aminotr 4 e 2 Mi' 

451035; AU076785; Hs.430; plastin 1 (I isoform); efhand,CH,Adaptin_N;SS=M; 19.25; 3.53

408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;; 15.53; 4.34

421340. F07762.. H-. I m. d«. , jutfemrq I. x'vi a ■:or.-.p 'er.-,er'. asn ;3S=M; 14.84; 19.59 

422260; AA31W9J. H-:. IU>I - I. r- ■Krv,/,' r i mt< •.*»:. •*.-«•_-•:. ■■' U'liii 

430280; AA361258; Hs.237868; interleukin 7 receptor; fn3,none; 14.28; 11.47

412116; AW402166; Hs.784; Epstein-Barr virus induced gene 2 (lymph; 7tm_1;TM=Y;SS=M; 12.71; 12.56

451820; AW058357; Hs.199248; ESTs; 7tm_1;TM=Y;SS=M; 10.18; 2.67

418B93;AI750878;Hs.87409;thrombospondin1;EGF,tor.J'.At„i'-PN.i pj 3S M ! '2 6.94 

448105; AW591433; Hs.298241; Transmembrane protease, senre 3 I c La,t ir TM Y SS=M; 9.67; 4.06 

456266; L29073; Hs.198726; cold shock domain protein A; 7tm_2,HRM,CSD;TM=Y;SS=M; 9.62; 2.36

413095; AA494359; Hs.30715; potassium voltage-gated channel, Isk-re'; none, START; 9.15; 2.18 

417933; X02308; Hs. 82962 hymdyJe ynlhelase; thymid I fl Hi f< T "I : II SS M, B 97; 5.01 

433334; AI927208; Hs.231958; matrix metalloproteinase 28; Peptidase_M10,none; 8.71; 4.28

418030; BE207573; Hs.83321; neuromedin B; Bombesin;TM=M;SS=Y; 8.38; 1.55

4 II ) r t- r EC uaspase6 apoptosis related cys einc I L l1 r/0™ M B 31; 4.23 
449523; NMJ00579; Hs.54443; chemokine (C-C motiti n - ptor 5 7 m_1 rM=Y SS=V, B 26, 5 49 
426513; BE220806; Hs.184697; Homo sapiens clone 23785 mRNA sequence; PSI,none; 8.13; 13.28
449444; AW818436; Hs.351306; solute carrier family 16 (monocarboxylic; none;TM=Y;SS=M; 7.89; 7.00
453459; BE047032; Hs.257789; ESTs; none,none; 7.40; 0.60

436729;DE621807;Hs.351316;transmembrane4superfamilymember1;noreTV V ' 3; 5.78 
426761; A1015709; Hs.172089; PORMIN Pro-oncosis receptor inducing me; r.cr.e;TM=Y;SS4/; 7.25; 7.22 

426158; NMJJ01 982; Hs.1 99067; v-erb-b2 avian erythn e 1 it r r e p rrr L r I i r 1 n.Peptid e_M24; 7.13; 3.97 

4199C8.XC4430 I - 9 r < ink leu r> interferon t 'a J L ' { 693 3 13 

457133; M54968; Hs.351221; v-Ki-ras2 Kirslen rat sarcoma 2 viral on; ras,ldh;SS=M; 5.90; 2.85 

420344; BE463721; Hs.97101; putative G protein-coupled recea rr ' i r r T I r SS M 688 3 10 

417874, BE6161CC, He 32829 pi ',11, me phosphatase, non-recept; Y_phosphatase;TM-Y;; 6.42; 2.26 

427969; NMJM963; Hs.2230; epidermal growth factor (beta-urogastron; EGF,ldLrecept_b,ED;TV=M;SS=M; 6.37; 1.07 

L H It r j r 5 Irn III 

427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;SS=M; 5.71; 3.83
418283; S79895; Hs.83942; cathepsin K (pycnodysostosis); Peptidase_C1;SS=M; 5.59; 38.68
458471; AV648609; Hs.194240; ESTs; none,none; 5.23; 1.05

433470; AW960564; Hs.351316; transmembrane 4 superfamily member 1 ; none;TM=Y;SS=M; 4.88; 4.50
433293; AF007835; Hs.32417; hypothetical protein MGC4309; none;TM=M;; 4.56; 4.96
410867; X63556; Hs,750; fibrillin 1 (Marfan syndrome); EGFTB/r ' TIL M;4.3 
417512; X76534; Hs.82226; glycoprotein (transmembrane) nmb; PKD;TM=Y;SS=M; 4.26; 9.04 

414825; X06370; Hs.77432; epidermal growth factor receptor ( v Fun ke I r I L J ft M ot M ^04 1,16 

439180; AI393742; Hs.199067; v-erb-b2 avian erythroblastic leuk rr v; F'jrin-like.pkii RecepJ m irin-lil pi nase,Recep_L_domain,Pepli'dase_M24; 3.76; 2.21 
419508; AW997938; Hs.90786; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y;SS=M; 3.47; 2.24 
419749; X73608; Hs.93029; sparc/osteonectin, cwcv and kazal-like d; kazal,thyroglobulin_1;SS=M; 3.37; 7.10
436576; AI458213; Hs.77542; ESTs; 7tm_1,DnaJ; 3.15; 3.27
428093; AW594506; Hs.104830; ESTs; none,none; 2.81; 3.40

459683; AI674906; Hs.199460; gb:wc73f02.x1 NCI_CGAP_Pan1 Homo sapiens; r.cne;TM=Y;; 2.77; 1 .36 
414443; AU077268; Hs.76144; platelet-derived growth fao'or recept - g l;inase;TM=Y 2 7! 10 53 
430451; AA836472; Hs.297939; cathepsin B; Peptidase_C1,pro_isomerase;SS=M; 2.28; 14.59

428953; AA306610; Hs.348183; tumor necrosis factor receptor superfami; 60s_ribosomal,Ribosomal_L10,TNFR_c6,DEAD;; 2.21; 6.33
435496; AW840171; Hs.265398; PAR-6 beta; none,none; 2.17; 2.00

418641; BE243136; Hs.86947; a disintegrin and metalloproteinase doma n I pr I/ r Pep_M12B_propep,EGF;TM=Y;SS=M; 1.91; 13.06 
414521; D28124; Hs.76307; neuroblastoma, suppression of tumong- 1 D N T i' M M 1 B1 22.29 
419452; U33635; Hs.90572; PTK7 protein tyrosine kina i f i SS M 112; 8.40 

•.<:,:■..■■',% H-,1357; ■>,■■■-.•■*■■.<■* -ji-kJ. :ogf:V->>.tm-m;; 149; 3.29 

432199; AI693815; Hs.127179; cryptic gene; none;TM=M;SS=M; 1.23; 1.60

453966; BE148734; Hs.63325; transmembrane protease, serine 4; trypsin,ldl_recept_a,none; 1.00; 3.92

445418; AW1393" 111' i) ,p nr-™l00!15 

451106; BE382701; Hs.25960; N-MYC oncogene; HLH,Myc_N_term;TM=M;; 1.00; 1.87

447993; AW139525; Hs.170362; ESTs; none,none; 1.00; 1.30
Pkey CAT Number Accession

406S85 0_0 M18728 

414087 1632850J W19712 BE247277 

400151 9575_21 BC006850 U07418 NMJ00249 U07343 AL574783 BI090482 BG684481 AA385302 BG196167 BI091720 BG195132 AI680106 AI457552 AA402478 

BG249688 M3471 19 BG755996 BG822578 
418546 242836J T59708 AA224827 T59843 BE1 56903 

TABLE 42C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.




1552511 Plus 



7230116 

9884830 
9864273 



120922-121296 
70823-70990 

1 53405-1 53564,154623-1 54875, 1 55272-1 5540 



147706-147903,14B667-14 



TABLE 43A: 43 genes upregulated in pancreatic cancer relative to normal body tissues 

Table 43A lists about 43 genes upregulated in pancreatic cancer relative to normal body tissues that are likely to encode proteins [...] applications. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 GeneChip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains indicative of having oncogenic function, of transducing an intracellular signal, or of being modulatable by small molecules, peptides, or antibodies (e.g., pkinase, death-domain, 7tm, phosphatase, or ion transporter). Certain predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar accession number, GenBank accession number 

UniGeneID: UniGene number

Pred.Prot.Domains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280).
UniGene Title: UniGene gene title

R1: 90th percentile of pancreatic cancer AIs divided by the 50th percentile of normal tissue AIs

R2: 90th percentile of pancreatic cancer AIs divided by the 90th percentile of normal pancreas AIs, where the 15th percentile of all normal tissue AIs was subtracted
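The R1 and R2 definitions above can be illustrated with a minimal sketch. It rests on assumptions not stated in the text: percentiles are computed with linear interpolation, and for R2 the 15th percentile of all normal tissue AIs is subtracted from both the numerator and the denominator (the source sentence is cut off after "subtract"). The function names `percentile`, `ratio_r1`, and `ratio_r2` are hypothetical.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values using linear interpolation."""
    xs = sorted(values)
    if not xs:
        raise ValueError("percentile() needs at least one value")
    pos = (len(xs) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def ratio_r1(cancer_ai, normal_ai):
    """R1: 90th percentile of cancer AIs over the 50th percentile of normal tissue AIs."""
    return percentile(cancer_ai, 90) / percentile(normal_ai, 50)


def ratio_r2(cancer_ai, normal_pancreas_ai, all_normal_ai):
    """R2: 90th percentile of cancer AIs over the 90th percentile of normal pancreas AIs.

    Assumption: the 15th percentile of all normal tissue AIs is treated as a
    background level and subtracted from both terms before dividing.
    """
    background = percentile(all_normal_ai, 15)
    numerator = percentile(cancer_ai, 90) - background
    denominator = percentile(normal_pancreas_ai, 90) - background
    return numerator / denominator
```

Under these assumptions, a large R1 indicates a probeset whose high-end expression in tumor samples exceeds the median of normal body tissues, while R2 compares tumor against normal pancreas specifically.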

Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1; R2

446619; AU076643; Hs.313; secreted phosphoprotein 1 (osteopontin, ; Osteopontin;SS=M; 44.95; 2.17
421552; AF026692; Hs.105700; secreted frizzled-related protein 4; Fz,NTR;SS=M; 35.40; 29.13
411274; NM_002776; Hs.69423; kallikrein 10; trypsin;TM=M;; 30.10; 13.59

B012 n 1 Sj r ibl cytokine itf ily (Cyril 33; 1 

413719; D0439580; L n ne subfamily A (Cy; lL8;SS=f 1 

452281; T93500; Hs.28792; Homo sapiens cDNA FLJ11041 fis, clone PL; TGFb_propeptide,TGF-beta,none; 23.81; 10.74
407811; AW190902; Hs.40098; cysteine knot superfamily 1, BMP antagon; TGF-beta,DAN;SS=Y; 22.33; 10.20
404682; ; ; C9001188*:gi|12738842|ref|NP_073725.1| p; none;TM=M;; 17.72; 1.40
413554; AA319146; Hs.75426; secretogranin II (chromogranin C); Granin;TM=M;SS=Y; 17.36; 2.01
428392; H10233; Hs.2265; secretory granule, neuroendocrine protei; none;TM=M;SS=M; 16.82; 1.70
408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;; 15.53; 4.34

ti ,i I 7671 r I 1 TI mallind cible yti ine j mily B (Cy ILE 5 M 15.40 S.70 
428242; H55709; Hs.2250; leukemia inhibitory factor (cholinergic ; LIF_OSM;SS=M; U.35; 6.58 

■j: ■■ r >".,.. I •• I . KjfrVfe-ecnf'iTtr' :j:H;£S=M, 14.84; 1959 

409757 NM.001898 H I23ll4;cy i " , ii =M; 14.51 12 75 

425071; NM_013989; Hs.154424; deiodinase, iodothyronine, type II; T4_deiodinase;TM=M;SS=Y; 14.35; 17.22
414812;X72755;Hs.77367;monokineinducedbygammain1erfr^n L TM= ' 3=Y 1381 769 
409420; Z15008; Hs.54451; lamintn, gamma 2 (niceir (100kC i f in aminii c i ,M„ EGF;SS=M; 13.05; 7.72 
432595; AJ221M1 Hs.273461; matnlx '1 12 30; 9.91 

422109; S73265; Hs.1473;gastrin-releasing peptide, Bombesin Dr'-r lh pep;TM M:SS !.' 12.79 t* 
421379; Y15221; Hs.1Q3982; small Inducib'e cytofi ^ j mil B (( H f TV, t SS f 11 35 222 
429547; AW009166; Hs.99376; FGENESH predicted novel secreted protein; none, none; 10.25; 5.62 
422424; AI186431; Hs.296638; prostate differentiation factor; TGF-beta;SS=M; 9.96; 1.88 
428505; AL035451; Hs.2281; chromogranin B (secretogranin 1); Granin;SS=M; 9.40; 3.46 

409956; AW1 03364; Hs.72v mhit beta A (activin A, activin AE rGF-be P o| T c SS=M; 9.19; 16.46 

418030; BE207573; Hs.83321; neuromedin B; Bombesin;TM=M;SS=Y; 8.38; 1.55

452401; NM_007115; Hs.29352; tumor necrosis factor, alpha-induced pro; Xlink,CUB;SS=M; 7.46; 4.36
421582; AI910275; Hs.350470; trefoil factor 1 (breast cancer, estroge; trefoil,Gastrin;SS=M; 7.08; 21.51

423634; AW959908; Hs.1690; heparin-binding growth factor binding pr; none;TM=M;SS=M; 6.78; 12.19

428486; AW583497; Hs.184604; pancreatic polypeptide; hormone3;TM=M;SS=Y; 6.29; 3.51 

443646; AI085198; Hs.164226; ESTs; EGF,tsp_1,vwc,TSPN,tsp_3,none; 6.17; 4.25

457489; AI69381 3 Hs.12 1 ) tic gen nor.e;TM 1 £ M 5 19; 2.79 

450983; AA305384; Hs.25740; ERO1 (S. cerevisiae)-like; none;SS=M; 5.01; 7.43

422867; L32137; Hs.1584; cartilage oligomeric matrix protein (p tsp_ I bS-M, 4 87, 9,40 

426322, J05068 Hs 2. 12 m i ibal - r 1 1 vitamin 612 binding pr; Ccbalamin_bind;SS=V.; 4.71; 11.74 

414774; X02419; Hs.77274; plasminogen activator, urokina e knn le t p pfertUhtorin SG=M 4 24;6.91 

428758; M433988; Hs.98502; CA125 antigen; nucin 15 SEA; M=Y 3.52; 8.43 

422048; NM_012445; Hs.288126; spondin 2, extracellular matrix protein; tsp_1;TM=M;SS=M; 3.45; 7.69

424687; J05070; Hs.151738; matrix metalloproteinase 9 (gelatinase B; fn2,hemopexin,Peptidase_M10;SS=M; 3.43; 10.37

417931; W95642; Hs.82961; trefoil factor 3 (intestinal); trefoil;SS=M; 2.98; 9.65

445417 AK001058 H H piens cDNA FLJ 10196 f r He tsp_1,Reprolysir„Pep_M12B_propep,none; 2.97; 5.74 . 

432874; W94322; Hs.279651; melanoma inhibitory activity; SH3;TM=M;SS=Y; 2.80; 10.53
431462; AW583672; Hs.256311; granin-like neuroendocrine peptide precu; none,none; 2.70; 1.99
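
The semicolon-delimited row layout used in Table 43A (Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1; R2) can be read mechanically. The sketch below is illustrative only and makes several assumptions: the first four fields and the last two fields are positional, everything in between is the domain annotation with optional `TM=` and `SS=` flags, a domain entry of `none` means no predicted domain, and the title itself contains no semicolon (titles such as "Homo sapiens mRNA; cDNA ..." would need special handling). The names `TableRow` and `parse_row` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TableRow:
    pkey: str
    accession: str
    unigene_id: str
    title: str
    domains: List[str]
    tm: Optional[str]  # "Y" (very likely), "M" (likely), or None if absent
    ss: Optional[str]
    r1: float
    r2: float


def parse_row(line: str) -> TableRow:
    """Parse one cleanly transcribed row of the semicolon-delimited table."""
    parts = [p.strip() for p in line.split(";")]
    if len(parts) < 7:
        raise ValueError("expected at least 7 semicolon-separated fields")
    pkey, accession, unigene_id, title = parts[:4]
    r1, r2 = float(parts[-2]), float(parts[-1])
    domains: List[str] = []
    tm: Optional[str] = None
    ss: Optional[str] = None
    for item in parts[4:-2]:
        if item.startswith("TM="):
            tm = item[3:]
        elif item.startswith("SS="):
            ss = item[3:]
        else:
            # Assumption: "none" marks the absence of a predicted domain.
            for name in item.split(","):
                name = name.strip()
                if name and name != "none":
                    domains.append(name)
    return TableRow(pkey, accession, unigene_id, title, domains, tm, ss, r1, r2)
```

Rows damaged in transcription (missing separators, split digits) will fail the float conversion or the field-count check, which is a reasonable way to flag them for manual review rather than silently accept them.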

TABLE 43C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

404682 9797231 Minus 40977-41150 
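
A row of Table 43C (Pkey, Ref, Strand, Nt_position) can be split into its predicted exon intervals as sketched below. This is an illustration under assumptions: fields are whitespace-separated, multiple exons are comma-separated `start-end` pairs, and coordinates are treated as inclusive when computing lengths. The function names are hypothetical.

```python
def parse_exon_ranges(nt_position):
    """Turn '153405-153564,154623-154875' into [(153405, 153564), (154623, 154875)]."""
    ranges = []
    for chunk in nt_position.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue
        start, end = chunk.split("-")
        ranges.append((int(start), int(end)))
    return ranges


def parse_table_c_row(line):
    """Split one 'Pkey Ref Strand Nt_position' row into a dictionary."""
    pkey, ref, strand, nt_position = line.split(None, 3)
    return {
        "pkey": pkey,
        "ref": ref,
        "strand": strand,
        "exons": parse_exon_ranges(nt_position),
    }


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)
```

For the single row shown in Table 43C, this yields one exon on the minus strand.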



TABLE 44A 754 GENES UP-REGULATED IN RHEUMATOID ARTHRITIS COMPARED TO NORMAL BODY 

Table 44A lists about 754 genes up-regulated in rheumatoid arthritis. These were selected from 35403 probesets on the Affymetrix/Eos Hu01 GeneChip.
Pkey: Unique Eos probeset identifier number



109086 
109279 
109779 



AA021157 
AA112307 
AA252191 



AA196625 

F10527 

R32647 



115416 
116548 
116565 



121509 
121722 
122265 
123206 
123490 
124198 
124294 
125067 
125153 
125330 
125335 
125361 
125439 



T55340 

W95414 

AA292700 

AA357993 

AA398355 

AA412092 

AA419482 

AA436838 

AA489681 



Hs.33619
Hs.105894
Hs.25199

Hs.270737 

Hs.86080 

Hs.3353 

Hs.23545 

Hs.29035 

Hs.191290 

Hs.187403 

Hs.337079 



Hs.198271
Hs.102298
Hs.111725



Hs.22215 
Hs.86022 
Hs.143376



Unigene Title 

AFFXcontro(-HUMRGE/M10098_3 
v-ros avian UR2 sarcoma virus oncogene h 
RAS p21 protein activator (GTPase activa



hypothetical protein FLJ21919 
hypothetical protein 

gb:zm02a09.s1 Stratagene corneal stroma 
tumor necrosis factor (ligand) superfami 



gb:CHR90123 Chromosome 9 exon II Homosa 

ESTs 

EST 

gb:zs59a05.s1 NCI.CGAP GCB1 Homo sapier 

ESTs 

ESTs 

ESTs 

similar to proline-rich protein 48
EST

Homo sapiens cDNA: FLJ22105 fis, clone H
gb:ag11c07.s1 Gessler Wilms tumor Homo s
NADH dehydrogenase (ubiquinone) 1 alpha
EST 
ESTs 

ESTs 

hypothetical protein FLJ23619 
125795
125858 
125865 
126039 
126143 
126177 
126219 
125221 
126262 
126277 
126292 



T98190 
H11549 
H12876 



R15866 
W87435 
AA348581 



Hs.7756 

Hs.31066 

Hs.283078 

Hs.181102 

Hs.266331 

Hs.129750 

Hs.293483 

Hs.172965 

Hs.143764 

Hs.15441 

Hs.248121 
Hs.94Q31 
Hs.112227 
Hs.170263 



p30DBC prolan 
li polh n protein MGi ) 
hypothetical protein FLJ10546 
ESTs, Weakly similar to similar to C. e! 

ESTs, Weakly similar lo unknown [H.sapi 
Crm (Cramped Drcsophila)-like 
gb:aa65d09 r1 NC!.CGAP_GCB1 Homo sap 
G protein-coupled receptor 22 



127429 
127490 
127502 
127647 
127650 
127676 
127746 
127812 
127824 
127933 
128006 

128038 
128058 
128199 



100262 
100676 
100704 
100787 
100873 
100943 
100996 



AA742428 

AA215510 

AA740146 

AA235966 

AA305023 

F20186 

AA452788 



AI087279 

AA873776 

D31237 

AI239495 

AA749094 

AI208365 

AA811102 

AA058693 

AI347067 

AA868782 

AI126617 

AIC73548 

AIC79496 

AI142639 

AA452788 

H90914 

C00810 

M194724 

S83308 

D38500 

HG3044-HT3742 
HG3242-HT3419 
HG3872-HT4142 
HG4333-HT4603 
HG880-HT880 



Hs.191650 
Hs.251946 
Hs.291811 



Hs.193326 

Hs.312110 

Hs.293751 

Hs.7278 

Hs.183502 

Hs.148410 

Hs.261957 

Hs.279938 

Hs.120189 

Hs.291434 

Hs.127811 

Hs.303581 



Hs.200332 
Hs.293981 
Hs.164915 
Hs.224137 
Hs.87224 
Hs.278468 
Hs.287820 
Hs.166110 
Hs.302063 



SEC24 (S. cerevisiae) related gene famil
gb:HSPD05873 HM3 Homo sapiens cDNA clone
gb:zv80d10.r1 Soares_total_fetus_Nb2HF8_
fibroblast growth factor receptor-like 1
ESTs, Weakly similar to I38022 hypothet
ESTs, Moderately similar to TPTE_HUMAN P
cryptochrome 2 (photolyase-like)
ESTs



gb:zv80d10.r1 Soares_total_fetus_Nb2HF8_
hypothetical protein FLJ20651
guanine nucleotide binding protein (G pr
small nuclear RNA activating complex, p
hypothetical protein

SRY (sex determining region Y)-box 5
postmeiotic segregation increased 2-like
fibronectin 1

calcium channel, voltage-dependent, alph 
immunoglobulin heavy constant mu 
zi -iir" d ) sin 79 pT7) 
gb:PMu-SN0u19-280300-001-D1 1 SN0019 Homo 
interferon, gamma-inducible protein 30

coagulation factor VII (serum prothrombi
phospholipase A2, group IIA (platelets,
gb:Human rhom-3 gene, exon.



103719 
103876 
103897 



Hs.66578 
Hs.150930 
Hs.154138 
Hs.169139 

Hs.83354 
Hs.93907 
Hs.75294 



Hs.75774 
Hs.172928 
Hs.4273 
Hs.8203 



pin pha y se me specficphospholipas 
KIAA0421 protein 

corticotropin releasing hormone receptor
X-ray repair complementing defective rep 
chitinase 3-like 2 

yi rei u 2 (L-kyiuiei ne ydr i se) 
gb:U68133 Human cell line PCI-06A Homo s 
endogenous retrovirus H proteas 



lysyiox 



ke2 



hexabrachion (tenascin C, cytotactin)
chitinase 3-like 1 (cartilage glycoprote
thrombospondin 4
collagen, type I, alpha 1
hypothetical protein FLJ13159
endomembrane protein emp70 precursor iso
EH-domain containing 4 
103906
103985 
104056 
104209 
104386 
104398 
104422 



104673 
104681 
104711 
104812 
104877 
104886 
104924 
105071 
105105 
105203 
105317 
105617 
105707 



107231 
107490 
107572 
107620 
107801 
107817 
107823 
107857 
107882 
108005 
108092 
108115 
108214 
108382 



M249437 

AA313880 

AA397529 

AB000221 

H41895 

H53555 



R81267 

AA004701 

AA007633 

AA009832 

M017254 

AA034111 

M047437 



M1 36532 
M151872 
AA1 95660 
AA233926 
M280687 
AA291012 
AA302657 
AA347964 
AA400292 
AA400490 
AA400766 
AA418046 
AA418909 

AA423987 

AA449199 

AA465657 
AA488889 
AA600152 
AA621169 
C20974 



109473 
109725 
109794 



AA019433 
AA020781 
AA021057 
M024687 
AA025630 
AA037769 

AA047291 
AA058551 
AA074885 
M075578 
AA078801 
AA101983 
AA102553 
AA127539 
AA133131 

AA151480 

M157811 

AA180352 

AA206126 

M210719 

M213774 

AA224594 

AA233151 

F10003 

F10684 

H00615 

H04794 

H05641 

H08200 

H15054 

H20402 

H23216 

H3B209 

H41280 

H48124 

H48467 



Hs.99872 
Hs.58297 
Hs.16530
Hs.144164
Hs.36790
Hs.132909
Hs.323817 
Hs.98640 
Hs.18978 
Hs.20010 
Hs.34500 
Hs.32794 
Hs.124137 



Hs.28774 
Hs.29475 
Hs.87016 
Hs.7882 



Hs.7567 

Hs.30662 

Hs.21342 

Hs.27056 

Hs.29205 

Hs.32425 

Hs.29419 

Hs.8687 

Hs.12114 

Hs.34727 

Hs.103189 

Hs.59962 

Hs.60171



Hs.61208 
Hs.231967 
Hs.194293 
Hs.184029
Hs.165216 
Hs.60764 
Hs.67726 



Hs.79658 
Hs.23687 
Hs.170044 
Hs.30489 
Hs.216701 
Hs.268770 
Hs.318773 
Hs.31746 
Hs.86905 
Hs.32728 
Hs.22586 
Hs.279454 

Hs.301062



hypothetical protein MGC2744 
fetal Alzheimer antigen
CLLL8 protein 

i 1 1 k i ui If i ill ( 
ESTs, Moderately similar to ALU8.HUMAN A 
ESTs, Weakly similar to putative p150 [H 
ESTs 

DKFZP547E1 010 protein 

hypothetical protein FLJ21069

Homo sapiens cDNA: FLJ22822 fis, clone K



Homo sapiens clone IMAGE:451939, mRNA se 



ESTs, Weakly similar to I38022 hypotheti 



integrin, beta 8 

glucocorticoid modulatory element bindin 
ESTs, Weakly similar to A53933 myosin I 
ESTs 

Homo sapiens clone IMAGE:297403, mRNA se 
disabled (Drosophila) homolog 2 (mitogen
Homo sapiens, clone MGC:17333, mRNA, com
Homo sapiens mRNA for KIAA0556 protein, 
ESTs 

hypothetical protein DKFZp761E2110 
protein tyrosine phosphatase, receptor t
H m sap.enscDNA FLJ21 62 11 ( ne H 



Homo sapiens cDNA FLJ10674 fis, cl( 



hypothetical protein DKFZp761A052 



macrophage receptor with collagenous str 
gb:zm88h03.-1 Stratagene ovarian cancer 
gb:zm94a09.s1 Stratagene colon HT29 (937 
triggering receptor expressed on myeloid 
ESTs 

hypothetical protein AD034 
gb:zm25d03.s1 Stratagene pancreas (93720 
gb:zo38d01.s1 Stratagene endothelial cel



gb:z:88e04.s1 NCI_CGAP_G 



ESTs 



Homo sapiens mRNA; cDNA DKFZp564l0816 (f
ESTs, Weakly similar to 2004399A chromos
KIAA1836 protein 

hypothetical protein DKFZp547F072 
ATPase, H+ transporting, lysosomal (vacu 
EST 



UDP-N-acetyl-alpha-D-galactosamine:polyp
110434 H49446

110553 H58934 

110750 N20522 

110827 N30077 

110829 N30198 

110917 N46363 

111100 N52522 

111112 N53281 

111179 N67239 

111185 N67551 

111223 NBB921 

111275 N70970 

111443 R01901 

111573 R10305 

111590 R11157 

111671 R19358 

111732 R25153 

111809 R33616 

111829 R36070 

111944 R40606 

112015 R42836 

112023 R43020 

112055 R43621 

112334 R56239 

112340 R56602 

112353 R58986 






112478 
112533 
112588 
112595
112676 
112744 



112902 



114371 
114428 
114466 
114625 
114862 
114908 
114973 



R66067 



R98491 
T09262 
T23699 
T51620 



T87795 



113151 
113297 
113398 
113484 

113769 U55966 
113794 W37382 
113971 W86760 
114066 Z38152 
114178 Z39053 
114206 



Z41835 
M017130 
M026970 
AA084362 
AA235174 
AA236545 
AA250845 
AA251561 
AA253005 
AA256161 
AA280805 

AA287008 
M287596 
AA405838 
AA410645 
AA418835 
AA426026 
AA427B90 
AA435577 
AA460708 

AA521080 
H77395 
H81933 



115385 

115466 

115479 

115663 

115689 

115748 

115810 

115827 

115881 

116148 

116257 
116365 
116941 
116982 
116995 H83928 
116997 H84214 
117016 H87171 
117097 H93608 
117101 H94043 
117238 N20815 
117303 N22776 
117399 N26480 
117503 N31963 
117544 N33222 
117594 N34929 



Hs.20450 
Hs.35452 
Hs.10760 



Hs.22404 
HS.3406C 
Hs.293762 
Hs.35467 
Hs.14584 
HS.12919C 

Hs^9326 

Hs.13059

Hs.87016 

Hs.187543 

Hs.22985 

Hs.11090

Hs.269172 

Hs.26920

Hs.17930 

Hs.27339 

Hs.27810 

Hs.84790 

Hs.135150 



Hs.278188 
Hs.40507 
Hs.199014



Hs.46765 
Hs.39749 
Hs.312582 

Hs.40594 
Hs.52170 
Hs.41919 
Hs.24341 
Hs.173337
Hs.264079 
Hs.43805 
Hs.44286 
Hs.44451
Hs.171984 



asporin (LRR class 1) 
EGF-like-domain, multiple 6
KIAA1866 protein 
ESTs 

gb:Homo sapiens endogenous retrovirus W 



Homo sapiens cDNA FLJ11666 fis, clone H

ESTs 

EST 

gb:Homo sapiens full length insert cDNA 



ESTs, Weakly similar to ALU6_HUMAN ALU S
Ig superfamily protein

Homo sapiens mRNA; cDNA DKFZp586F1323 (f
gb:y!16g12.s1 Soares placenta Nb2HP Homo 
ESTs 

gb:yl47f03.s1 Soares placenta Nb2HP Homo
gb:yi76h08.s1 Soares placenta Nb2HP Homo 
protease, serine, 12 (neurotrypsin, moto 



Human DNA sequence from clone RP5-1046G1 



hypothetical protein FLJ22938 



> open reading frame 11 



KIAA0225 protein 

lung type-I cell membrane-associated gly
gb:zn05b10.r1 Stratagene hNT neuron (937
Homo sapiens cDNA FLJ13410 fis, clone PL



ESTs, Moderately similar to I54374 gene 
ESTs 

ESTs, Moderately similar to ALU7_HUMAN A 



etina N2b In 

ESTs 
ESTs 
EST 

transcriptional co-activator with PDZ-b

ESTs 

ESTs 

lipoma HMGIC fusion partner-like 3
118114
118151 
118270 






N36113 
N38970 
N40953 
N40976 
N47807 
N48701 
N48872 
N50101 
N51935 
N54188 
N56875 
N58276 
N62868 
N63076 
N64017 
N64529 
N64856 
N66248 



N71781 
N91481 
R02591 
R27619 
R42357 
R61293 
T12559 
T16367 
T40427 
T63474 
T66867 
T89122 
T99544 
W02129 
W38053 
W73370 
W74257 
W78050 
W86471 
W92548 
AA169801



120420 AA235031 

120437 AA243427 

1204S1 AA251301 

120594 M282054 

120611 M284178 

120626 AA285064 

120696 AA291503 

120747 AA302976 

120749 AA303235 

120752 AA311972 



121038 
121065 
121067 
121071 
121082 
121172 
121191 
121354 

121399 
121479 
121498 
121704 
121736 
122198 



119795 
119817 
119831 



AA398662 
M398678 
AA398722 
AA400013 



AA411911 
AA412033 
M418743 
AA421131 
AA435892 
M436011 
AA436692 
AA437209 
M437259 
AA442870 
AA443311 
AA4437B9 



Hs.301405
Hs.47374
Hs.130323
Hs.143212 



Hs.49829 
Hs.49840 
Hs.50081 
Hs.54713 
Hs.284294 
Hs.231046 
Hs.91453 



Hs.90696 
Hs.187402 
Hs.249712 
Hs.173734 



Hs.58419 
Hs.151624 
Hs.94985
Hs.98710 



Hs.123122 

Hs.112885 

Hs.104311 

Hs.293369 

Hs.5094 

Hs.110637

Hs.104485

Hs.97249 

Hs.96672 



Hs.174248
Hs.291272 
Hs.98347 
Hs.308114 
Hs.97365 
Hs.97300 
Hs.97302 
Hs.139355

Hs.97750 
Hs.104447
Hs.193737
Hs.262643 
Hs.332700 
Hs.98110 
Hs.178045
Hs.98306 
Hs.148515 
Hs.97541 
Hs.98187 
Hs.98892 
Hs.234016 



gb:yy80b06.s1 Soares_multiple_sclerosis_

EST 

EST 

gb:y/77a05.s1 Scares_n I - en 
hvp helic Iprolei c F ' n 
Homo sapiens cDNA FLJ13561 fis, clone PL
Homo sapiens, clone IMAGE:3960432, mRNA



hypothetical protein DKFZp434F2322 



ESTs, Weakly similar to (defline not av

ESTs 

ESTs 

KIAA1199 protein 
ESTs 

Breakpoint cluster region protein, uteri 



gb:ya01a06.s2 Stratagene lung (937210) H 



hypocretin (orexin) receptor 2
ESTs 

hypothetical protein 

gb:zp54e11.s1 Stratagene MT2 neuronal pr
KIAA0627 protein; Drosophila multiple as 
gb:DKFZp434B1B22_r1 434 (synonym: htes3) 
FSH primary response (LRPR1, rat) homolo 
spinal cord-derived growth factor-B 
novel protein with MAM domain 



ESTs, Weakly similar to JC5308 testis-sp



Homo sapiens, clone IMAGE:4214491, mRNA,
ESTs 

CGI-30 protein
PCT/US02/36810



122746 AA458791 
122805 AA460702 
122841 AA46153B 



122933 
123005 
123142 
123153 
123168 
123188 
123276 
123305 






124196 
124197 
124229 
124230 



124655 
124706 
124848 
124882 
124898 
125086 



AA487504 
AA488349 
AA488881 



AA598913 
AA599014 
M609332 



F13673 
H52617 
H52921 
H62793 



N68168 
N69197 
N70294 
N93176 



T91161 
W38001 
W73409 



125419 
125424 
125526 
125539 



125707 
125790 
125875 
1259S9 
125970 
125975 
125985 
126018 
126032 
126059 
126107 
126154 
126199 



126373 
126378 
126383 
126403 
126525 
126527 



R14487 

R17870 

AA908225 

R48940 

C14616 



R94247 
A1400964 
AA495891 
H54857 
H548S6 

H66582 

H79155 

AI004105 

A1000492 

W77936 

N27236 

M830432 

F11606 

M347842 

M885594 

N73388 



AA460338 

AA699443 

M206993 

AA037230 

M064671 

S86382 

AA126047

M761143 

R51877 

AI311457 



Hs.334808 
Hs.105218 
Hs.177726
Hs.187946 



Hs.191956 
Hs.112911
Hs.112916



Hs.107924
Hs.179152 
Hs.271674 
Hs.102808



Hs.103185

Hs.133552

Hs.324746 

Hs.134544 

Hs.18564 

Hs.17110

Hs.248120 

Hs.15463 

Hs.108043 

Hs.284122 

Hs.99962 

Hs.7298 

Hs.88414 

Hs.177516 

Hs.152290 

Hs.35981 

Hs.167583 

Hs.269065 

Hs.308486 

Hs.93361 

Hs.190488 

Hs.125829

Hs.83583 

Hs.269034 

Hs.44701 



Hs.6298 

Hs.125976 

Hs.156432

Hs.103853 

Hs.103142 

Hs.279746 

Hs.191391 

Hs.193213 

Hs.315367 

Hs.135084 

Hs.957 



gb:aa88c02.s1 Stratagene fetal retina 93
collagen, type XI, alpha 1 
WAS protein family, member 2 
ESTs.High'ysimil A'ASF nte-adii 
chromosome 7 open reading frame 2 
integrin, beta 8
EST 

hypothetical protein MGC4189



Homo sapiens cDNA FLJ10577 fis,




gb:yq7Bc09.s1 Soares t 



gb:za11c01.s1 Soares fetal liver spleen
ESTs, Weakly similar to I38022 hypotheti
ESTs, Weakly similar to A56194 thromboxa
ESTs

ESTs, Weakly similar to ALU8_HUMAN ALU



alpha-2-HS-glycoprotein



IA;cDNADKFZp434C201B(f 
G protein-coupled receptor 21

t I e M l 3 it 

Friend leukemia virus integration 1 



BTB and CNC homology 1, basic leucine zi 
high density lipoprotein binding protein
ESTs, Highly similar to JC2463 vasoactiv 



ESTs, Highly similar to KIAA0349 [H.sapi



actin related protein 2/3 complex, subun

ESTs 

ESTs 

B cell RAG associated protein 
gb:yy62a11.s1 So 
KIAA1151 protein 
ESTs, Weakly similar to S71949 metallopr



cystatin C (amyloid angiopathy and cereb
gb:zm1 3bC4.s1 Stratagene pancreas (93720 
putative opioid receptor, neuromedin K ( 
jb:zn09d1 1 tr n hill i ir i (93 
127187
127215 
127229 
127278 
127299 

127347 
127401 
127420 
127438 
127441 
127449 
127493 
127505 
127S20 



AA641812 
AA380418 
AA297138 
A1246377 
AA316181 
AA342715 
AA360710 
AA393073 
AA428350 
AA921944 
M699582 
AI224421 
AA835684 
AI421866 
AA808081 
AA594244 
AI025699 
AA773234 



127733 
127816 
127966 
127973 

127997 
128016 
128037 
128053 
128066 
128071 
128091 
128113 



12B369 
128371 
128421 



128731 
128843 
128988 
129016 
129021 
129095 
129171 
129188 
129410 
129467 
129518 
129534 
129632 



130049 
130171 
130411 
130479 



131189 
131318 
131506 



AA743646 
AI493406 
AI336794 



AI341423 
AI498467 
AA932961 
A1143866 



F12681 
H12876 
T77876 
X02761 



AA316862 
AF005271 
AA234141 
AA411040 
W84524 
AA426406 
L12350 



AA410311 
AA369807 
R73640 
L27213 



R44163 

L32137 

U92971 

AA020942 

N92934 

Z20481 

D57622 

AA598441 

F13694 

L16782 



Hs.127639 
Hs.82171 
Hs.77100
Hs.287601 
Hs.75722 
Hs.291701 
Hs.292245 
Hs.116200
Hs.271877 
Hs.268538 

Hs.150683 
Hs.189186 
Hs.189005
Hs.120604 
Hs.292514 
Hs.129117 
Hs.132413 



Hs.134053 
Hs.166479
Hs.157775
Hs.160189 
Hs.270244 
Hs.30445 
Hs.283078 



Hs.32944 

Hs.301693 

Hs.278398 

Hs.9605 

Hs.104555 

Hs.275675 

Hs.294140 

Hs.173081
Hs.108623 
Hs.7753 



Hs.1460

Hs.245257 

Hs.169910 

Hs.12457

Hs.1584 

Hs.194351 

Hs.17200 

Hs.17409 



ESTs 

SHP2 interacting transmembrane adaptor 
ESTs, Weakly similar to S71949 metallopr 
ceroid-lipofuscinosis, neuronal 8 (epile
six transmembrane epithelial antigen of
gb:EST4830S Fetal spleen Homo sapiens cD
ESTs 
ESTs 

hypothetical protein MGC4090 
ESTs 

Homo sapiens clone 1 91 B7 placenta expres 
general transcription factor IIE, polype
Homo sapiens cDNA FLJ13830 fis, clone TH



ESTs, Weakly similar to ALU1_HUMAN ALU S



ESTs, Weakly similar to ALUD_HUMAN !!!!
ESTs 

ESTs, Weakly similar to YA02_HUMAN HYPOT



tumor rejection antigen (gp96) 1

ESTs, Weakly similar to S18968 cyritesti 

ESTs, Moderately similar to KIAA1399 pro



ESTs, Weakly similar to I38022 hypotheti 
Homo sapiens cDNA FLJ14687 fis, clone NT
hOAT4 



H . r I I IAGE 3899 mRNA 
KIAA1117 protein

cleavage and polyadenylation specific fa
neuropeptide FF-amide peptide precursor 
katanin p80 (WD40-containing) subunit B 



ESTs 

hypothetical protein FLJ11264 

solute carrier family 4, anion exchanger

collagen, type III, alpha 1 (Ehlers-Danl

hypothetical protein FLJ22301 

gb:yw75b05.s1 Soares_placenta_8to9weeks_

glucagon 

ESTs, Weakly similar to A46010 X-linked 
KIAA0173 gene product
hypothetical protein FLJ10814
cartilage oligomeric matrix protein (pse
coagulation factor II (thrombin) recepto
STAM-like protein containing SH3 and ITA
cysteine-rich protein 1 (intestinal)
Homo sapiens, Similar to Bicaudal D (Dro
sin3-associated polypeptide, 30kD
DKFZP434K2235 protein 
ESTs 

M-phase phosphoprotein 1 
bone gamma-carboxyglutamate (gla) protei
KIAA1194 protein 
131551 AA127867

131563 C20547 

131830 U33054 

131879 AA017161 

132017 W67251 

132025 U58516 



132159 
132164 
132180 



D76435 

UB4573 

M405569 

R77451 

AA453446 

F09979 

N40559 

AA481404 

C14015 

AA156897 

U22172 

U65918 

T82292 

X00588 

F10045 

AA478971 

AA122386



135348 
100547 
1C0572 



104113 
104316 
104453 



107379 
107412 
107652 
107754 






HG2149HT2219 

HG2271-HT2367 

HG3115-HT3291 

HG315-HT315 

M21305 

U35407 

X05232 

X54925 

X72755 

X78416 

AA082824 

AA427510 

D61871 

M19169 

M007312 

M056588 

AA424958 

M490107 

U93868 

W25105 

M010195 

AA017462 

AA026240 

M059473 

M083070 

AA121960 

M156576 

AA159605 

F01778 



112532 
112858 
113170 



Hs.28608 

Hs.302810 

Hs.32959 

Hs.33792 

Hs.267659 

Hs.3745 

Hs.3964 

Hs.41154 

Hs.41270 

Hs.418 

Hs.4245 

Hs.42673 

Hs.4774 

Hs.6129 

Hs.6686 

Hs.303075 

Hs.72157 

Hs.179697 

Hs.73078 

Hs.75111 

Hs.77432 

Hs.79347 

Hs.81988 

Hs.82985 

Hs.181300 

Hs.84469 

Hs.322978 



Hs.125511 
Hs.26102 

Hs.268177 

Hs.73995 

Hs.272620 



Hs.83326 
Hs.83169 
Hs.77367 
Hs.3156 

Hs.181202 
Hs.330821 
Hs.123114 

Hs.155489 
Hs.294132 
Hs.21753 
Hs.333851 
Hs.287797 
Hs.52642 
Hs.269244 

Hs.66783 



Hs.5947 
Hs.72580 
Hs.131740 



Hs.19081 
Hs.268668 
Hs.223649 

Hs.326475 



Hs.26805 

Hs.28313 

Hs.4454 

Hs.270373 

Hs.13759 



Homo sapiens cDNA: FLJ22115 fis, clone H
Novel human gene mapping to chromosome :
G protein coupled receptor kinase 2 (Dro 
ESTs 

vav 3 oncogene
milk fat globule-EGF factor 8 protein
Homo sapiens clone 24877 mr"
Zic family member 1 (odd-paired Drosophi
procollagen-lysine, 2-oxoglutarate 5-dio
fibroblast activation protein, alpha
chromosome 11 hypothetical protein ORF3
ESTs 

Homo sapiens mRNA; cDNA DKFZp761C1712 (f 
ATP-binding cassette, sub-family B (MDR/ 
hypothetical protein DKFZp56401554 



protease, serine, 11 (IGF binding) 
epidermal growth factor receptor (avian 
KIAA0211 gene product 



ESTs 
EST 

cytochrome b-245, beta polypeptide (chro
5-hydroxytryptamine (serotonin) receptor 
collagen, type I, alpha 2 

Homo sapiens mRNA; cDNA DKFZp434P1530 (f 
trichorhinophalangeal syndrome I 
hypothetical protein DKFZp564B1162 
phospholipase C, gamma 1 (formerly subty
gb:Homo sapiens mucin (mucin) mRNA, part 

gb:Human Golli-mbp gene, exon 2. 
pregnancy specific beta-1-glycoprotein 9
gb:Human alpha satellite and satellite 3 
peroxisome receptor 1 
matrix metalloproteinase 3 (stromelysin 



monokine induced by gamma interferon 



hyrulhel ilpulPinFUIOOSB 



J M5 protein 

polymerase (RNA) III (DNA directed) (32k
integrin, beta 1 (fibronectin receptor,
ESTs, Weakly similar to ALUF_HUMAN !!!!
ESTs 

gb:no77a05.s1 NCI_CGAP_AA1 Homosapit 



gb:zm85a05.r1 Stratagene ovi 
gb:zm24g09.r1 Stratagene pancreas (93720
mel transforming oncogene (derived from 

Homo sapiens cDNA: FLJ22562 fis, clone H 



ESTs, Weakly similar to 178885 serine/th 



gb:yj76d05.s1 Soares breast 2NbHBst Hon 
ESTs 

ESTs, Weakly similar to putative p150 [
AA261805

AA417297 

M424030 

M598455 

D20458 

H85422 

N23680 

N32626 

N64579 



118771 
119075 
119217 
119305 
119347 
120006 
120441 



121599 
122146 
122261 
122352 
122433 
122489 
122554 
122857 
122889 
123399 
123662 



R95778 

T26914 

T64349 

W90108 

AA243588 

M287286 

AA346854 

AA400156 

AA416770 

AA435584 

AA436830 

AA443725 

AA447417 



Hs.11087 
Hs.21948 
Hs.44021 
Hs.59609 
Hs.46627 
Hs.66817 
Hs.229071 



Hs.50421 
Hs.50547 
Hs.287820 
Hs.237309 
Hs.132785

Hs.10848 
Hs.190035 
Hs.99657 
Hs.52788 
Hs.339808 
Hs.98255 
Hs.250173



Hs.285491 
Hs.178551
Hs.154654
Hs.99606 
Hs.287687 
Hs.105516
Hs.112703



i 1 C i rt M HI 

ESTs

Homo sapiens mRNA for FLJ00065 protein, 



ESTs 

Homo sapiens cDNA: FLJ22534 fis, clone H
ESTs, Weakly similar to FV1_MOUSE FRIEND
gb:yz51d11.s1 Morton Fetal Cochlea Homo 

gb:yyB9f01.s1 Sc 

KIAA0203 gene product



ESTs 



EAP30 subunit of ELL complex 
gb:yc10d08.s1 Stratagene lung (937210) H 
KIAA0137 gene product



hypothetical protein FLJ13158



cytochrome P450, subfamily I (dioxin-ind 
L f /simil 1 " N^nn 
Homo sapiens cDNA: FLJ21960 fis, clone H



123981 
124126 
124404 
124557 
124703 



AA621223 
C20797 
H18517 
N31998 



125111 
125331 
125349 
125426 



125656 
125743 
125757 
125760 
125804 
12GS67 



126150 
126171 
126198 
126224 
126289 
126343 
126406 
126419 
125479 



126718 
126739 
126745 



T87826 
R43963 
R64472 
AI375276 
R13353 
AI038854 
AA040118 

AI274906 
W03020 
R79519 
AI341206 



AA515212 
AA322718 
AI160709 
AA057506 



Hs.95481 
Hs.164568 
Hs.164256 
Hs.141604 
Hs.300076 
Hs.168500 
Hs.137548
Hs.178658
Hs.161378
Hs.164480
Hs.184029
Hs.16131
Hs.158732



Hs.156835
Hs.40300
Hs.16899
Hs.173770 
Hs.144413 
Hs.227835 
Hs.64616 
Hs.191942 



Hs.158701

Hs.129064 
Hs.12285 
Hs.184376
Hs.72071 
Hs.339670 
Hs.309435 
Hs.289047 

Hs.116910

Hs.85962 



fibroblast growth factor 7 (keratinocyte
hypothetical protein FLJ20657
ESTs, Moderately similar to ALU1_HUMAN A
solute carrier family 22 (organic cation



ESTs, Weakly similar to T50609 hypotheti 
hypothetical protein DKFZp761A052 
hypothetical protein FLJ12876
ESTs 

gb:yf75c04.r1 Soares infant brain 1NIB H
S164 protein 

neutral sphingomyelinase (N-SMase) activ 

gb:ym37a05.r1 Soares infant brain 1NIB H

ESTs, High y simil 18U460Ap53* 

calpain 3, (p94)

ESTs 

ESTs 



gb:yv41h02.r1 Soares fetal liver spleen
Homo sapiens chromosome 19, cosmid F2216 
EH Weakly nilart 214 r, 



hypothetical protein FLJ20038 

EH lie ' r1 7/ 1 I n 

ESTs.W ,1 mil KIAA0927 proleii 

Homo sapiens cDNA FLJ14059 fis, clone HE

gb:zf49g04.r1 Soares retina N2b4HR Homo

ESTs 

gb:UI-H-BI3-ala-a-12-0-Ul.s1 NCI_CGAP_Su 
hyaluronan synthase 3 
nuclear receptor subfamily 1, group D, m
prp28, U5 snRNP 100 kd protein
gb:HSC1JA051 normalized infant brain cDN 
127116 AA278492 Hs.288304

127282 AA347547 Hs.185780 

127349 AA412108 Hs.269350 

127352 M415577 Hs.189105 

127482 AI337294 Hs.105352

127543 AI364367 Hs.157392 

127553 M282433 

127556 AA679831 Hs.190228 

127859 AA806B37 Hs.291559 

127993 M847856 Hs.124565

128277 AI018275 Hs.269791

128285 AA634569 Hs.13351

128317 AI051960 Hs.303754

128334 AI080130 Hs.134207

128428 AI185718 Hs.143900

128582 U22963 Hs.101840

128592 AA470056 Hs.113994

128751 AA442274 Hs.183176 

129105 AA224351 Hs.108681 

129161 N27334 Hs.181780 

129246 N99174 Hs.206063 

129361 X64229 Hs.110713 

129577 AA424952 Hs.82906 

129600 N78980 Hs.271599 

129989 AF005887 Hs.247433 

130024 U15197 Hs.113271 

130292 U70136 Hs.218791 

130589 AA234308 Hs.16441 

130736 T99385 

131238 R82327 Hs.24625 

131378 AA463886 Hs.203910 

131601 M31165 Hs.29352 

131605 AA256220 Hs.29383 

131676 C20785 Hs.30514 

131861 D11925 Hs.184245 

131873 H39997 Hs.166852 

132023 F01927 Hs.3743 

132770 AA425647 Hs.56406

132859 D20925 Hs.69235 

133052 R40166 Hs.106826 

133373 S72437 Hs.73946 

133446 M25322 Hs.73800 

134693 N70361 Hs.8854 

134733 U03644 Hs.89421 

134965 J05480 Hs.272458 

136327 AA477989 Hs.98800 

135377 C21382 Hs.99766 

135398 AA194075 Hs.287270 



Homo sapiens cDNA FLJ11529 fis, clone HE

ESTs 

ESTs 

r 7 Ar kl inilarbNBR13[H.sapiens 

GalNAc alpha-2, 6-sialyltransferase I, I 

Hon p n DNAFLJ20 3 fi lone C 

■ id ia6 -12 r1 NCLCGAP.GCB1 Homo sapiens 

ESTs 

ESTs 

ESTs 

ESTs 

LanC (bacterial lantibiotic synthetase c

ESTs 

ESTs 

ESTs 

major histocompatibility complex class 
Homo sapiens cDNA FLJ20796 fis, clone CO 
ESTs 

Homo sapiens brain tumor associated prot

hypothetical protein FLJ20241

ESTs 

DEK oncogene (DNA binding)

CDC20 (cell division cycle 20, S. cerevi 

hypothetical protein MGC10500

i- »ating r nsc plic r facte i 6 

ABO blood group (transferase A, alpha 1- 

proteoglycan 4, (megakaryocyte stimulati

DKFZP434H204 protein 

gb:ow8Sg07.s 1 Soares.fetaljiver.spieen. 

ESTs 

mall Qlutamin h Iricop le 
tumor necrosis factor, alpha-induced pro 
Homo sapiens mRNA; cDNA DKFZp434E2321 (f 
ESTs 

KIAA0929 protein Msx2 interacting nuclea 

matrix metalloproteinase 24 (membrane-in 

HomfsTpl L n\ 5 cDNTFLJ13549 fis, clone PL 

transportin-SR 

KIAA1696 protein

endothelial cell growth factor 1 (platel 

selectin P (granule membrane protein 140

Human transcription unit PVT gene, exons

CBF1 interacting corepressor 

protein phosphatase 3 (formerly 2B), cat 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564J0323 (f 
ret proto-oncogene (multiple endocrine



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



CAT number 
110079.2 



107897 
130736 
108710 



125439 
117697 
125515 



91776J 
611414.1 
133560 1 



327512.1 
34624.4 
244444 1 
5025.5 



AA074897 AA1 13914 M064871 AA079329 AA071309 AA084710 AA129030 AA075042 M074794 AA071453 AA078803 AA 148628 

AA122204 M074159 AA126185 M0791 17 M127089 AA070912 AA0792B0M131372 M078833 AA071087 AA076131 AA071047 

AA079401 AA08307O AA102076 AA1151S3 AA07419BAA134725 AA113B89AA121103 AA075041 AA065148 AA071310 AA101144 

AA079659 AA078931 M079209 AA070923 M068994 AA069817 AA076187 AA069053 AA131489 AA071308 M063317 AA070156 

AA071430AA076C5CAA " Ct > 00 I 0,12C283 AA126078 AA075895M079208 AA074583AA071086 AA079623 AA070627 

AA078802 M076622 AA065051 AA0791 43 AA071 1 10 AA079434 AA148748 M079230 AA085188 AA074485 AA070580 AA076151 

AA083166AA085118AA079450M085044M120938AW9200AA1O0188MO81472AA122355AA129O31 M085362 AA059220 

AA070940 AA075968 M074563 AA084027 AA1 15929 

AA604872AA026240 

AM 68326 T99385 

AA121959AA121960 

AWB64944L07517AW869606 

N68168N69188N9O450 



N40976AA902795 
R13353 R13890H11359 
N64579 

AA491328 N42312 

U88896 U88898 AA916056 T032S5 AI341594 A1359534 A1634031 U88897 

N58924AA347842 

H17151 H11956 

N76683AA034096AA034082 
127067
119243 
111443 



127278 
103751 
126636 
127331 
127357 
126745 



127701 
128410 
114625 



102649 
118456 
102860 
120715 



113889 
108258 
101046 



1534978 J 
1774795J 
31528J8 



113388J 
1576900J 
142696J 

158963J 
46636.1 
82752.1 



150431 ' 
113869.1 
tigr_HT3291 
genbank_AA210719 
genbank_M610013 
genbank_D20433 
entrez_W38001 
entrez_W38294 
genbank_H83928 
genbank_U68133
genbank_N66580 
entrez_X00368 
genbank_AA292700 
genbank_AA303235 
genbank_T95909
genbank_W72720 
genbank_AA063269 
entrez_K01160 
genbank_N30316 
genbank_AA458791 
genbank_F10243 
genbank_AA078801 
genbank_H52921 
entrez_M21305 
genbank_AA133131
entrez_M64358 



F06732 Z43705 
T12603T12604 

AF072503AF208161 AA613233H1243S N76991 D78692 BE019603 AA776439R37932T93615 AF072508 R00744 R01948 R68685 

AI128496 AA865193 A1797529 H13302 AF072506 NM 014590 AF072505 R00743 T93661 T39519 R68740 H13097 N58614 N77302 

H01372 N41878 H04136 AA42651 1 AW971553 AW900030 R76136 T52094 AI5981 35 M781423 R76086 R77278 AI393478 AA837267 

A1570707 R01901 R27412 N531 77 AI37921 0 A11 28526 AA250958 R79323 R27389 H01325 N55091 T69704 AA868777 T47345 R27591 

AA860368AA729556H04137T8^M, 3174 I - ,1 V43AA419144 

AA342715AA367634 

AA131367AA082824 

AA057531 AA001527 

F20186AA622352 

AA424107AA452788 

AA047854 M057506 M053841 

AA064613M064671 

AA083531 AA126047 AA074915 AA1 43649 
R55021 H26613 

AW450979 AA136653AA136656AW419381 AA984358 AA492073 BE1 58945 AA809054 AW238038 BE011212 BE011359 BE011367 
BE0113S8BE011362 BE011215 BE0113S5 BE01 1363 
M179656AA182626AA182603 
AF074991 R36070 

AW964385 AA00731 2 AI081 71 1 AA31 3253 AW391 655 T991 92 
AA505046AW969109 M505047 

AL042725 BE063316 AW975S10 AA457591 BE052092 A1655202AA714296AI267264 AI075321 AA223286 AA071122 AA227849 

AA216700 AI696002 AA101867 AA099426 AA1 35997 AL041698 T02815 T51824 AA207189 T59230 T51868 AA663341 BE1 65757 

AW81 81 04 AW392886 AA584918 AA099408 AW856396 AW861 859 AA053045 

AA679064AA935466 

AA424107AA452788 

AA081507 M070071 AA070840 AA084362 

AA157811 AA836869 

AA075631 AA075578 

L18862 

AA210719 

M610013 

D20433 

W38001 

W38294 

H83928 

U68133 



119154 
119241 
119310 
119347 
119529 



genbank_N48872 

genbank_R61293 

genbank_T12559 

genbank_T40427 

genbank_T64349 

entrez_W38053

genbank_R65706 

genbank_R69886 

genbank_R77302 

genbank_AA398722

genbank_AA496133 

genbank_AA496968 

tigr_HT2219

genbank_AA599723 



TABLE 45A: 90 GENES DOWN-REGULATED IN RHEUMATOID ARTHRITIS COMPARED TO NORMAL BODY 

Table 45A lists about 90 genes down-regulated in rheumatoid arthritis. These were selected from 35403 probesets on the Affymetrix/Eos Hu01 GeneChip. 

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title 
100240
100289 
100658 
100763 

101091 
101155 
102223 
102282 
102378 
102386 



HG2B55-HT2995 

HG3597-HT3800 

HG3731-HT4001 

L06132 

L13972 

U24685 

U31383 

U40369 



U41371 
U50327 
102566 U59752 
102605 U64444 
102693 U73824 
102710 U77827 
102920 X12451 
102929 X13238 
103166 X67951 



103463 
103835 
104796 
105714 
105927 
105945 
106001 
106027 
106227 



112746 
113059 
113822 
113859 
113909 
114693 
115399 
116606 



121547 
123495 
123608 
123749 
124763 
125366 
125657 
125670 
125882 
126541 
126715 



132455 
132582 
132610 



AA412119 
AA429262 
AA435664 



H99370 

R93237 

T26925 

W47350 

W67225 

W78127 

AA122158 

AA2831B2 

D80217 



M397911 
AA402441 
AA412448 



R39610 
H60192 
AA481719 



T15774 
M318547 
AA443114 
AA609201 



Hs.15071 
Hs.75416 
Hs.179774 
Hs.75452 



Hs.79126 

Hs.28491 

Hs.81728 

Hs.75916 

Hs.1432 
33649 AA479139 Hs.75393 acid phosphatase 1, soluble

33B14 M33882 Hs.76391 myxovirus (influenza) resistance 1, homo

i3437B AF006088 Hs.82425 acti n related p i r, ibu 
Table 46A lists about 714 genes upregulated in esophageal cancer relative to normal body tissues. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of
mRNA expression.

Table 47A lists about 113 genes upregulated in esophageal tumors relative to normal esophagus. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of
mRNA expression.

Table 48A lists about 162 genes downregulated in esophageal tumors relative to normal esophagus. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of
mRNA expression.

TABLE 46A:

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: 90th percentile of esophageal tumor AIs divided by the 70th percentile of normal tissue AIs, where the 15th percentile of the normal tissue AIs was subtracted from both
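
By way of illustration only, the R1 ratio defined above may be computed as in the following Python sketch. The sketch assumes that the 15th percentile of the normal tissue AIs is subtracted from both the numerator and the denominator, and that percentiles are obtained by linear interpolation between ranked values; neither assumption is stated in the legend. The function names `percentile` and `r1_ratio` are hypothetical and are not part of the described analysis.

```python
def percentile(values, pct):
    """Return the pct-th percentile (0..100) of values by linear interpolation.

    The interpolation rule is an assumption; the legend does not specify one.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must not be empty")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def r1_ratio(tumor_ai, normal_ai):
    """Background-corrected ratio of tumor to normal average intensities (AIs).

    Assumed reading of the legend:
        (P90(tumor) - P15(normal)) / (P70(normal) - P15(normal))
    """
    background = percentile(normal_ai, 15)
    numerator = percentile(tumor_ai, 90) - background
    denominator = percentile(normal_ai, 70) - background
    if denominator <= 0:
        raise ValueError("normal tissue AIs are too uniform to form a ratio")
    return numerator / denominator
```

Under these assumptions, a probeset would be scored by passing its tumor AIs and its normal tissue AIs as two lists, for example `r1_ratio(tumor_values, normal_values)`; a larger ratio indicates stronger expression in tumor relative to normal tissue.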



Pkey ExAccn UnigeneID Unigene Title

413808 J00287 Homo sapiens mRNA for caldesmon, 3' UTR

411243 AB039886 Hs.69319 CA11

422168 AA5B6894 Hs.112408 S100calc irr i 1 c A

401781 Target Exon

424098 AF077374 Hs.139322 small proline-rich protein 3

425211 M18667 Hs.1867 progastricsin (pepsinogen C)

417366 BE185289 Hs.1076 small proline-rich protein 1B (cornifin)

401780 NM_005557*:Homo sapiens keratin 16 (foca

421948 L42583 Hs.334309 keratin 6A

400289 X07820 Hs.2258 matrix metalloproteinase 10 (stromelysin

429538 BE182592 Hs.11261 small proline-rich protein 2A

430666 NM I24 ci ens n * n loprc

418007 M13509 Hs.83169 matrix metalloproteinase 1 (interstitial
Hs.43388 28kD interferon responsive protein

Hs.163443 periostin (OSF-2os)
Hs.188401 annexin A10
Target Exon
Hs.252756 ESTs
BE568452
AI739159
Hs.158159 FAT tumor suppressor (Drosophila) homolo

Hs.156346 topoisomerase (DNA) II alpha (170kD) 

Hs.44532 diubiquitin 

Hs.9914 follistatin 

Hs.72026 protease, serine, 21 (testisin) 

H 7 PAB6 ii ting, kinesin-like ( abkine 

Hs.108196 HSPC037 protein 

Hs.250822 serine/threonine kinase 15 

Hs.42824 hypothetical protein FLJ10718

Hs.198267 mucin 4, tracheobronchial 

Hs.34665 ESTs 
Hs.152812 ESTs 

Hs.77152 minichromosome maintenance deficient (S. 
Hs.20315 interferon-induced protein with tetratri 
Hs.133244 melanoma-derived leucine zipper, extra-n 
Hs.36232 KIAA0186 gene product 

HSPC150 protein similar to ubiquitin-con
CDC20 (cell division cycle 20, S. cerevi 
serine (or cysteine) proteir s r L i - lil to 
colony stimulating factor 3 (granulocyte 
RAB38, member RAS oncogene family 
Hypothetical protein, XP_051860 (KIAA119 
ras homolog gene family, member E 
guanylate binding protein 1 , interferon- 
'■" iynthetase3(100k 
gb:H.sapiens SOD-2 gene for
S100 calcium-binding protein P
hypothetical protein FLJ22041 similar to
collagen, type XI, alpha 1 
protein regulator of cytokinesis 1 
-ii, i libi Mar-associated protein 2 
DKFZP586N2124 




Hs.2962 
Hs.3849 
Hs.82772 
Hs.344037 
Hs.83551 
Hs.61898 
Hs.177164 
Hs.278613 




Hs.156827 ESTs, Weakly similar to ALU1_HUMAN ALU S

gb:ye53h05.s1 Soares fetal liver spleen

Hs.69360 kinesin-like 6 (mitotic centromere-assoc

Hs.6823 hypothetical protein FLJ10430

Hs.135150 lung type-I cell membrane-associated gly

Hs.176663 Fc fragment of IgG, low affinity IIIb, r

Hs.82962 thymidylate synthetase

Hs.274448 hypothetical protein FLJ11029

Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46

Hs.26557 plakophilin 3

Hs.93764 carboxypeptidase A4

Hs.83484 SRY (sex determining region Y)-box 4

Hs.123664 ESTs

Hs.22011 complement-c1q tumor necrosis factor-rel

Hs.236204 nuclear pore complex protein

Hs.251377 taxol resistance associated gene 3

Hs.283082 defensin, beta 3 
427557
427099 
402408 

414907 
452888 
456525 
450983 



AW955454 Hs.30942 



M305384 
AA1 33749 
NM_002692 
AF129535 
BE076358 



423961 D13666 



X77343 

BE261750 

N99013 

BE143533 

AA281219 

AW594506 

W79283 

AW886727 



441020 
447078 
437044 
417621 
424905 
426059 
419833 
451807 
422530 
452203 
414020 
414809 
442599 



Hs.99185 
Hs.272027 
Hs.77667 

Hs.270149

Hs.334334 

Hs.4747 

Hs.16762 

Hs.109309 

Hs.121296 

Hs.104830 

Hs.35962 



odd Oz/ten-m homolog 2 (Drosophila, roots 
NM_030920*:Homo sapiens hypothetical pro 
hypothetical protein C321D2.4 
polo (Drosophila)-like kinase
ephrin-B2 

S100 calcium-binding protein A8 (calgran 
ER01(S.C6 "" 
FXYDdoma 

polymerase (DNA directed), epsilon 2



lymphocyte antigen 6 complex, locus E 
C6001909:gi|704441|dbj|BAA18909.1|(D298 
periostin (OSF-2os) 

ESTs, Weakly similar to 2109260A B cell
transcription factor AP-2 alpha 



Homo sapiens mRNA; cDNA DKFZp564B2062 (f 



AV654694 

NM_002497

BE292342 

AA251131 

W52854 

AW972300 

X57522 



Hs.220697 ESTs



Hs.75703 
Is 7356 
Hs.324051 



449539 W80363 Hs.58446 ESTs 



differentially expressed in Fanconi's an 
interferon-induced, hepatitis C-associat
NIMA (never in mitosis gene a)-related k 
igulatory factor 7 



hypothetical protein FLJ23293 similar to 
bone marrow stromal cell antigen 2 
transporter 1, ATP-binding cassette, sub 
small inducible cytokine A4 (homologous 
transferrin receptor (p90, CD71) 
RelA-associated inhibitor 
NM_022170':Homosaf 



419079 
410434 
408660 
409142 



AF051152
AA525775
AL136877 
AC004770 
444006 BE395085 
426761 AI015709 



428311 NM_005651
449378 AW664026 



toll-like receptor 2

ESTs, Moderately similar to PC4259 ferri
SMC4 (structural maintenance of chromoso
flap structure specific endonuclease 1
type I transmembrane protein Fn14
Homo sapiens mRNA; cDNA DKFZp586I2022 (f
B aggressive lymphoma gene
minichromosome maintenance deficient (



tryptophan 2,3-dioxygenase 



422283 
422675 
422699 



M18728 
AW409985 
AW411307 
BE018517 



Hs.4756 
Hs.10086 
Hs.172089 
Hs.47783 
Hs.179565
Hs.195155
Hs.72157
Hs.183671
Hs.59892
Hs.133100
Hs.9280

Hs.76084
Hs.114311
Hs.119140
Hs.119257

Hs.8047 Fanconi anemia, coi 



ems1 sequence (mammary tumor and squamo
Fanconi anemia, complementation group G 
hypothetical protein DKFZp564A176 






428953 
428728 



440659 
414821 
413063 




AL035737 

X74794 

D49372 

AA534009 

AA926960 



Hs.77424 
Hs.75184 
Hs.154443 
Hs.54460 



Hs.98558 
Hs.334562 
Hs.45743 



i FLJ23468 
I family 7 (cationicarr 
extra spindle poles, S. cerevisiae, homo 
claudin 1 

Fc fragment of IgG, high affinity la, re 
chitinase 3-like 1 (cartilage glycoprote 
minichromosome maintenance delii :i t(S 
-i I r rcible ui Tf ofai il 
interferon stimulated gene (20kD) 
CDC28 protein kinase 1 



Hs.334562 cell division cycle 2, G1 to S and G2 to 



431941 AK000106 

414110 BE251752 

427857 AL133017 

419968 X04430 

430413 AW842182 

416209 AA236776 



C6001909:gi|704441|dbj|BAA18909.1|(D298
Homo sapiens cDNA FLJ20099 fis, clone CO
gb:601112444F1 NIH_MGC_16 Homo sapiens c
hypothetical protein FLJ22865
AF217513 Hs.279905 clone HQ0310 PRO0310p1

AA490346 Hs.40530 Homo sapiens, clone MGC:17624, mRNA, com

X91195 Hs.100623 phospholipase C, beta 3, neighbor pseudo

Hs.135917 hypothetical protein DKFZp761D1823 

AB037734 Hs.4993 KIAA1313 protein 




AW975531 
AF019225 
AA420687 
AF061871 
AW136771 



NM_014125 

AL359053 

AL137255 

AI467908 

AA043424

H13139 

BE315291 

AI827248 



421817 
428664 
422101 



apolipoprotein L

Homo sapiens cDNA FLJ14259 fis, clo
Human DNA sequence from clone RP1-23
epidermal differentiation complex protei
ECT2 protein (Epithelial cell transformi
hypothetical protein MGC2714 
hypothetical protein FLJ10540 
hypothetical protein DKFZp56401278 

NM_002463*:Homo sapiens myxovirus (influ
proliferating cell nuclear antigen 



ESTs 

immediate early response 3 

paired-like homeodomain transcription fa 

hypothetical protein MGC5627 

Homo sapiens cDNA: FLJ21274 fis, clone C

Homo sapiens cDNA FLJ11469 fis, clone HE

solute carrier family 2 (facilitated glu

jg ke module coi tainin mu ke 

stress-induced-phosphoprotein 1 (Hsp70/H 
hypothetical protein MGC13102 
enhancer of zeste (Drosophila) homolog 2 
ras homolog gene family, member 
epidermal growth factor receptor (avian 
i BarH-like homeobox 1 

alcohol dehydrogenase 7 (class IV), mu o 



AI824113 

BE614557 

AF053306 

M33600 

AA861271 

AA205847 

X54232 

AK001379 

AV650262 

AI128388

AF005210 

AW974476 

AF146074 

AK001666 

AW404176 

AF119666

AW248364 

U63630 

AL037672 

AA032279 

X72755 

BE159107 



Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (f

Hs.1594 centromere protein A (17kD)
Target Exon

Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK

HS.1S2843 leucine zipper protein FKSG14

Hs.82985 collagen, type V, alpha 2

Hs.82932 cyclin D1 (PRAD1: pa



Hs.115474
Hs.78281 
Hs.19574 
Hs.36708 



replication factor C (activator 1) 3 (38



hypothetical protein MGC5469 
budding uninhibited by benzimidazoles 1 

HS.JU8UZ0 major histocompatibility complex, class

Hs.222024 transcription factor BMAL2

Hs.23016 G protein-coupled receptor

Hs.2699 glypican 1

Hs.121028 hypothetical protein FLJ10549

Hs.75765 GRO2 oncogene

Hs.113222 chemokine (C-C motif) receptor 8

Hs.183601 regulator of G-protein signalling 15

Hs.108660 ATP-binding cassette, sub-family C (CFTR

Hs.189095 similar to SALL1 (sal (Drosophila)-like

Hs.111511 ribosomal protein L27 

Hs.23449 insulin receptor tyrosine kinase substra 

Hs.5409 RNA polymerase I subunit 

Hs.155637 protein kinase, DNA-activated, catalytic 

Hs.81071 extracellular matrix protein 1 

Hs.61635 six transmembrane epithelial antigen of 

Hs.77367 monokine induced by gamma interferon

Hs.159263 collagen, type VI, alpha 2 
Target Exon 

Hs.183650 cellular retinoic acid-binding protein 2 



BE001596 
D31771 
AL035588 
AL035558 



421677 H64092 



Hs.89404 
Hs.153203
Hs.73853 



AA307808 Hs.2979 



NM_021626:Homo sapiens serine carboxypep 
integrin, beta 4

msh (Drosophila) homeo box homolog 2
MyoD family inhibitor
bone morphogenetic protein 2
NM_021048:Homo sapiens melanoma antigen,
trefoil factor 2 (spasmolytic protein 1) 



AI077715 Hs.39384 putative secreted ligand homologous to f
429415
405386 
410274 
448275 
418245 



443247 
449717 
428336 



Z45051 
AW994561 
BE185499 
AA305729 






AA381807 

BE514434 

AA088767 

AF015592 

Y18845 

AK000546 



AW162943 

AI831958 

AK001239 

BE614387 

AB040935 

AA503115 

AA033813

T11832

NM_002759



Hs.61762 
Hs.20830 
Hs.83883 



similar to S68401 (cattle) glucose induc
eukaryotic translation initiation factor
KIAA0020 gene product
amino acid transporter system A1
procollagen C-endopeptidase enhancer 
Target Exon 

hypoxia-inducible protein 2 
kinesin-like 2 

transmembrane, prostate androgen induced 
CDC7 (cell division cycle 7, S. cerevisi 
small inducible cytokine subfamily A (Cy 
hypothetical protein FLJ20539 
NM_000636*:Homo sapiens superoxide dismu
gb:ym89h07.r1 Soares adult brain N2b4HB5
UL16 binding protein 2
hypothetical protein
hypothetical protein FLJ10377
c-Myc target JPO1



Homo sapiens cDNA FLJ11381 fis, clone HE



421150 
406400 
455813 



450000 
407777 
419485 



423198 
415091 
441085 



412G41 
413436 
408636 
412115 
413142 
411573 



410310 
427584 
423725 



409461 
433020 
437915 



421975 
427585 
448140 
431722 
427239 
413385 
439780



BE141577 
BE387014 
AI498957
BES15453 
AW582256 
AW248439 
AI952797 
AA161071 
AA489023 
BE076537 
NM_003878 



Hs.144531 
Hs.250618 
Hs.61053 
Hs.274263 
Hs.333893 
Hs.23954 
Hs.183752 
Hs.79018 
Hs.127797 



jcleophosmin/nucli 
Hs.189902 ESTs

kallikrein 8 (neuropsin/ovasin) (KLK8) 
gb:QV2-HT0083-071299-018-a11 HT0083HO 
Hs.166146 Homer, neuronal immediate early gene, 3 
I 70851 El i r DZ195JHUMANZIN 

Hs.346509 dedicator of cyto-kinesis 1
Hs.91011 anterior gradient 2 (Xenopus laevis) hom
Hs.2340 junction plakoglobin
Hs.10888 hypothetical protein FLJ21709



AU076640 

M81933 

AL044872 

AW136551 

AL080021 

AK001330 

AJ000512



N54706 
M16660 
AF238083 
BE294925 



Hs.169895 

Hs.78619 

Hs.74619 

Hs.339665 

Hs.74011 

Hs.15243 

Hs.1634 

Hs.77910 

Hs.181245 

Hs.8986 

Hs.48855 

Hs.296323 

Hs.61796 

Hs.78867 

Hs.303025 



ESTs, Weakly simil; 
ubiquitin-conjugating enzyme E2L6 
gamma-glutamyl hydrolase (conjugase, fol 
proteasome (prosome, macropain) 26S subu
ESTs

lymphocyte-activation gene 3
nucleolar protein 1 (120kD)
cell division cycle 25A
3-hydroxy-3-methylglutaryl-Coenzyme A sy
Home pien DN; LJ 1 2532 ns, clone IT 



hypothetical protein FLJ10468
serum/glucocorticoid regulated kinase 
transcription factor AP-2 gamma (activat 



k* 90kD protein 1,beta 



KIAA1077 protein 

leukemia inhibitory factor (cholinergic 
sine oculis homeobox (Drosophila) homolo 



Target Exon 
ESTs, Hi 1 -' 



AJ403108 
AA307703 
AA135159 
AA382169 
AI375726 



AW961017 

D31152 

AF146761 

BE270447 
M34455 
AL109688
BE244068 
U57059 



Hs.179718 
Hs.132127 
Hs.279766 
Hs.203349 
Hs.54483 
Hs.279918 
Hs.202312 
Hs.173125 
Hs.133294 
Hs.172928 
Hs.73797 

Hs.179729 
Hs.20450 
HS.26B049 
Hs.174070 
Hs.840 



51 DNA exc'is 
coagulation factor III (thromboplastin, 
v-myb avian myeloblastosis viral oncogen 
hypothetical protein LOC57822 
kinesin family member 4A
Homo sapiens cDNA FU1214S tis, clone MA 
N-myc (and STAT) interactor 
hypothetical protein 

Homo sapiens clone N11 NTera2D1 leralocs 
peptidylprolyl isomerase F (cyclophilin 
ESTs 

collagen, type I, alpha 1 

guanine nucleotide binding protein (G pr 

hypothetical protein FLJ11856

collagen, type X, alpha 1 (Schmid metaph

BCM-like membrane protein precursor

hypothetical protein

ubiquitin carrier protein

indoleamine-pyrrole 2,3 dioxygenase 

gb:Homo sapiens mRNA full length insert 

interleukin 12 receptor, beta 1 

tumor necrosis factor (ligand) superfami
404170
417370 
410006 
446291 
421155 
441224 



T28651 
AW73230B 
BE397753 
H87879 
AU076964 
NM_014479
429413 NM_014058
436251 BE515065 



Hs.82030 
Hs.57783 
Hs.14623 



NM_000636*:Homo sapiens J 
tryptophanyl-tRNA synthetase 
eukaryotic translation initiation factor
interferon, gamma-inducible protein 30 



C00138 

AA323948

AV660038 

NMJ1220* 

NM_002916 



412577 
410575 
416084 
430393 

447342 AI199268 



5.15165 



lie acid ind 




419121 
431890 
417259 
418203 
441633 
423425 
412851 



mRNA for KIAA16S3 protein, 

Hs.55407 Homo sapiens mRNA; cDNA DKFZp434K0621 (f

Hs.2056 UDP glycosyltransferase 1 family, polype

Hs.22302 general transcription factor IIIC, polyp

Hs.35120 replication factor C (activator 1) 4 (37

NM_016651 Hs.48950 hepatocellular carcinoma novel gene-3 pro

Hs.211238 interleukin-1 homolog 1

Hs.74076 CD163 antigen

Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone H 

Hs.79006 deoxythymidylate kinase (thymidylate kin 

Hs.241305 estrogen-responsive B box protein 

Hs.19322 Homo sapiens, Similar to RIKEN cDNA2010 

NM_016323 Hs.26663 cyclin-E binding protein 1

NM_006147 Hs.84981 interferon regulatory factor 6

AF157482 Hs.19400 MAD2 (mitotic arrest deficient, yeast, h 

BE561610 Hs.5809 putative transmembrane protein; homolog 

Hs.3991 PRP4/STK/WD splicing factor

Hs.200317 KIAA1404 protein 

Hs.58435 FYN-binding protein (FYB-120/130) 

Hs.2057 uridine monophosphate synthetase (orotat

Hs.50842 interferon-induced protein 35 

Hs.8127 KIAA0144 gene product 

Hs.18457 hypothetical protein FLJ20315 

Hs.25723 Sjogren's syndrome/scleroderma autoantig

Hs.136414 UDP-GlcNAc:betaGal beta-1,3-N-acetylgluc

Hs.130815 hypothetical protein FLJ21870

Hs.6351 cleavage and polyadenylation specific fa

Hs.28505 ubiquitin-conjugating enzyme E2H (homolo

Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE

Hs.322844 hypothetical protein DKFZp564A176

Hs.11090 membrane-spanning 4-domains, subfamily A

Hs.21486 signal transducer and activator of trans 

Hs.61635 six transmembrane epithelial antigen of 

NM_002788*:Homo sapiens proteasome (pros 
Target Exon 

Hs.75573 centromere protein E (312kD) 
KIAA0564 protein 

NM_031419*:Homo sapiens molecule possess

Hs.155470 hypothetical protein MGC16040

Hs.180479 hypothetical protein FLJ20116

Hs.77329 phosphatidylserine synthase 1

NM_005402 Hs.288757 v-ral simian leukemia viral oncogene hom

AW009166 Hs.99376 ESTs 
AA374372 
X17033 
AW903838 
X54942 
AW958544 
AA375756 
AI826502 



paralhyro 

Hs.271986 integrin, alpha 2 (CD49B, alpha 2 subuni
Hs.81800 chondroitin sulfate proteoglycan 2 (vers 
Hs.83758 CDC28 protein " 



454140 AB040888 



U25849 



Hs.14449 KIAA160S protein 
Hs.106149 ESTs 

NM_002425:Homo sapiens matrix metallopro 
Hs.41793 hypothetical protein FLJ10474 
Hs.283532 uncharacterized bone marrow protein BM03
Hs.101850 retinol-binding protein 1, cellular



Hs.75393



id phosphatase 1 



ESTs 



430466 
431448 
424874 
453633 
447854 
427581 
412636 



Hs.303125 p53-induced protein PIGPC1 

Hs.79368 epithelial membrane protein 1 

Hs.194630 WNT1 inducible signaling pathway protein

Hs.122552 G-2 and S-phase expressed 1

Hs.82733 nidogen 2

Hs.195870 hypothetical protein FLJ14991

Hs.241517 polymerase (DNA directed), theta

Hs.306201 hypothetical protein DKFZp56401278

Hs.326413 Homo sapiens cDNA FLJ20812 fis, clone AD

Hs.34045 hypothetical protein FLJ20754

AW138454 Hs.11594 ESTs

NM_014788 Hs.179703 KIAA0129 gene product

NM_004415 desmoplakin(DPI,DPII) 

AA297634 Hs.54925 KIAA1858 protein 



AF100779



AL137517 
AA347951 
AA357001
442932
425071 
410491 
428698 
451277 
447347 
429505 



AA457211 
NM_013989 
AA465131 
AA852773 
AK001123 
AA570056 
AW820035 



Hs.8858 
Hs.154424
Hs.54001 



AA013051 Hs.91417 

AI745185 Hs.8939 

AA435504 Hs.119286 
AK000492 

AF044197 Hs.100431 
Hs.337251 

NM_005879 Hs.21254

U25128 Hs.159489



bromodomain adjacent to zinc finger doma

deiodinase, iodothyronine, type II 

Homo sapiens clone 26218 mRNA sequence 

hypothetical protein FLJ10261 

ESTs, Moderately similar to KIAA1215 pro 



NM_000179*:Homo sapiens rr 
topoisomerase (DNA] II binding r. 
yes-associated protein 65 kDa 
ESTs 

hypothetical pre 

small inducible cytokine B subfamily (Cy 
hypothetical protein MGC2487 
TRAF interacting protein 
parathyroid hormone receptor 2 
hypothetical protein FLJ11301 
LIM and senescent cell antigen-like doma
karyopherin alpha 2 (RAG cohort 1, impor
prostaglandin-endoperoxide synthase 2 (p
pleckstrin 2 (mouse) homolog 

prostate androgen induced 

adhesion molecule 
integrin, alpha 6 
□MA segment on ' 

protein tyrosine phosphatase, non-recept

. FLJ10858 
dickkopf (Xenopus laevis) homolog 1 
ESTs, Highly sin" ' 
DKFZP586G1517 protein 
mannose receptor, C type 1 
C5002154':gi|729S015|gb|AAF54217.1|(AE0 
hypothetical protein FLJ14540

rij^ sr. ifi ny phosphatase 6 
TG-interacting factor (TALE family homeo
methylenetetrahydrofolate dehydrogenase
; 23667 and 23775 zinc finger prote 
gb:Human parathyroid hormone-related pep 
ENSP00000246632':CDNA FLJ20261 tis, don 
cysteine and glycine rich protein 2 

If If ^ r f i 

down syndrome critical region protein OS 
complement component 5 receptor 1 (C5a I 




granzyme A (granzyme 1, cytotoxic T-lymp 



NM_006212*:Homo sapiens 6-phosphofructo-
serine (or cysteine) proteinase inhibito



mitochondrial ribosomal protein

" ■ ■' to LIM1_HUMAN LINE- 

high-glucose-regulated protein 8
lymphocyte cytosolic protein 2 (SH
MAP/microtubule affinity-regulating kina
G antigen 7B 
E1A binding protein p300 
CD8 antigen, alpha polypeptide (p32) 
chemokine (C-C motif) receptor 5 
granzyme B (granzyme 2, cytotoxic T-lymp
KIAA1488 protein 
RP42 homolog 

granzyme K (serine protease, granzyme 3;
ectodysplasin 1, anhidrotic receptor
B-cell CLL/lymphoma 11A (zinc finger pro
Homo sapiens, clone MGC:12401, mRNA, com
C15001 220*:gi|4469558|gb|AAD2131 1.1 1 (AF 
KIAA1463 protein 

keratin 10 (epidermolytic hyperkeratosis 
Homo sapiens cDNA: FLJ23165 fis, clone L
myosin-binding protein H
cholinergic receptor, nicotinic, alpha p
anti-Mullerian hormone
uroplakin 1B 

C3000442':gi|1 1120696|ref|NP _0S351 8.1 1 c 
NM_005367:Homo sapiens melanoma antigen,
ESTs, Weakly similar to T18651 hypotheti

ESTs, Weakly similar to KIAA1330 protein
ESTs, Weakly similar to I78885 serine/th 
hypothetical protein DKFZp434L0718 



ubiquitin specific protease 8

C16001364;gi|1 1057373|ref[NP J67669J I C 
Homo sapiens cDNA FLJ13872 fis, clone TH

NM_030932*:Homo sapiens diaphanous (Dros 



XAGE-1 protein 
N-MYC oncogene 
Hs.348148 gb:ns44f05.s1 NCI_CGAP_Alv1 Homo sapiens 
Hs.2633 desmoglein 1 
Hs.243770 ESTs 

Hs.173084 myosin, heavy polypeptide 3, skeletal mu 
Hs.47166 HT021 

C6001378':gi|1171748|sp|P4653C|NOTCBRAR 



AI570199 AI888812 AW867550 A1921557 AW469G96 A1925581 AI679986 AW473623 BE841640 BF061525 AI445703 AI925072 AW863188 
AW863076 BE841731 AW863167 BE841390 BE841365 BF374078 BE841760 BE841694 BE841769 AA335110 BE841B92 BF374073 AA335204 
BF374079BE841713AA335167 BE841584AW868103BE841645BE841765AI07633BAW8B7433BF373B31 BE841758 AW868911 AW863155 
AW868847 BE841651 AA335145 BE841670 BF374260 BF374088 BE841661 BE841728 BI335729 BE341739 BE841663 AW3B3104 AA335201 
AA335143 BF906965 AW867493BE841505 BF374250 BE841766 BF373837 AW863191 BE841705 AW863154 AW8B3673 AWBB731 1 AA33589B 
BE841753 AW863407 BE937102 BF374252 BF374247 BF374255 BE841785 AW029590 AW131278 AI801021 AW058240 AW058400 AW029230 
AW029432 AW130609 AW029128 AW1304S9 AI570155 AI620272 AW029259 AI801389 AI888662 AI926902 AIB01799 AI610344 AI452852 
AW131 174 AI581069 AI22502B AI446689 AI923321 AI439430 AI801502 AI679707 AW028944 AI933684 AI801724 AI537779 AI354S52 AI470250 
AI536872AI891151AW868019AW0t 4 02 \ 3Jbl92AA335165AA335189 AI933725 AW044393 AI888797 BE841577 BE841681 
AA335141 AW008175AA335223A!888837AW863~22AI803901 AW005718 AI538L 2 I ^ 58067 I44 J. 11 15394 AI868168 
AA335144 AI926349 AA335210 AA334919 AA335153 M335216 AI678342 BF374135 AI932922M335214 AA335109 AI570325 AI452619 AI926109 
AI453488 AI678606 AW869289 AW86921 1 BE841580 A16793S8 AI888882 AI926170 BF508305 AW869315 AA334926 BE841712 AW026584 
M335200 BE841764 AV730339 AW474979 AI286344 AI446430 AI537B12 M33516B AW868051 AI679133 AI949520 BE841 652 AI949532 
BE9371 13 BE841 789 BE841643 AW130556 BE841761 AW86871 5 AW8B8698 BE841669 BE937108 AA3351 58 AA3351 53 AA335159 AW867404 
AW868692 BE841742 AW868711 AW8B7546 BE841699 AA335198 AA335146 AW868150 BE841BS0T99129 BE841740 BE841714 AA335154 
AW868815 BF373812 BE841657 BE841780 AI440394 AA335215 M335202 M335162 AA335160 AI801656 AI678499 BF374019 AW130236 
AI826057 AI572459 AI932773 AA335197 AI61 1752 M335224 A1452592 AA335147 AA335149 AA334928 AA3351 14 AA3351 1 1 AI567048 
AW029395 AI570326 BF373838 BE841691 BE841776 AW863485 BF374093 AW130376 BE841732 AI446393 AI44S781 AW867547 AW029012 
AA335227 AW869307 AW869350 AW868709 AW869407 AW00501 7 AI679252 AI925523 AW1 51 553 AW863109 AI445917 AI799620 AI921607 
AW008153 AI520957 AI610620 A!67982B AI858151 AI537839 A:679547 T28354 AI282567 AA335207 R83655 BF90B963 AW131160 AI925626 
AW029396 AW028445 AW008410 AW152586 AW008476 AI801040 A1453669 AI621200 AA334925 BF374069 BF374075 N53208 BF374246 



X00474 MM_003225 X52003 M12075 BI765761 AW950155 AI571948 BI7B05B9 M308400 M568312 BI761955 AA507595 AA614579 AA614409 
BF747698 BM142326 AA307578 AI925552 AA578674 AA582084 AW009769 AA51477B AA588034 BG271505 AA85827B BM142503 AW050700 
AI307407 AI202532 M524242 AI909772 AI970839 BG23651 5 AW75021 6 AA58761 3 AI909749 AI909751 AI910083 AA614539 R55292 M507418 
BG571303 AA410586 AA035018 BG572117 BG620022 AA.147247 BG005785 BG014448 R31981 H02668 H12498 R36203 BF992089 R73999 
T49904 R75732 BI057974 T53681 AA1 47933 N50695 R68583 R25671 R31935 R25110 R36105 AK055628 BE1574B7 AW663674 M190993 
H01S42 BF510304 AA626915 AA743952 AM 61014 M099554 BG572534 AI803329 AI809932 AI808765 M411449 AI378760 AA976929 AI378620 
AA909684 R75632 AI360919 AI350463 AW069127 AA41 1621 M742532H12451 BE203298 H03612 H12839 N58781 R75957 BF996484 AI240665 



BG570706 BG572749 AW606284 H04021 AA151166 AW954405M131254 BG055461 W46291 H01532 H04384 H03231 AA852875 H04410 
H59B05 BE157601 AA113758 

BC013389 BC017398 AI023543 AA191424 AI267700 AI469633 AW958465 AW953397 AA172056 BE940298 BF909208 BF909980 BF095153 
BG285837 AI720344 BF541715 AA355086 AA172236 

AK001536 AK0561 35 BM474813 BE887303 AK02291 4 AW581996 AW81 2945 BE882302 AA134256 BI043873 AA01 9433 BI862088 BM468657 
AU128438 BE384458 AL353967 BI8571 17 BF536525 BI465223 BM460132 AU129877 BI222283 BG171592 BI043544 BG496295 BG750710 
BI256542 BG108520AU150719 AK" 0: 1 AI5 14256 AL353968 AA191092 BF132635 
BE184942 BE184946 AW238414 BE144666 




WO 03/042661 



PCT/US02/36810 



BC014081 NM 000593 X57522 L21203 L21207 L21206 L21205 L21204 AL561404 AL546423 AL560492 AL656BB2 AL541576 AL650654 BI823519 
BI770023 AL554969 BI489906 AI304693 AW295947 BM146642 X57521 BG820143 BE898390 F06770 F12630 BM423610 AL561518 BM009470 
BG742981 AA279685AA847441 M313737 BF172639 BF897216 BF914190 BF903647S70277AI569694AW073296AI361433AA564644 
AA487429 BE858232 AA838610 AI53911 1 ' il B291 29 EG057675 AI423422 AIM 58860 BE300655 AW170777 AA586956 AL571889 
AL55B850 AL576404 AL582800 BI255544 BF342301 BG875994 AA054458 AA353161 AI940434 BE816522 AL577636 AI479650 AW150377 
AU154395 AWS51271 AI032220 AI81 9778 AI346733 AW771 150 AW512525 AI249904 AA279809 AI352549 AW512517 BG056280 AA521222 
BE271141 AL581932 AL541575 BI819184 AV660190 AL556475 AI620020 AW089888 AW079179 Z21518 AA687601 F04651 AI783961 T57198 
AI433367 T78652 AL554958 AA365648 AL582619 BEB74601 BF804669 AL574458 BM145502 AI266514 AI538823 AI475626 AA948210 AA884054 
AA487637 AA031844 AA535221 AW794256 AW361447 BE78B505 AI682892 AA830989 AA862356 AA653084 BM009154 AA1 35727 H05927 
H23433 R42244 N7999 /366601 i aL556474 AA135770 BE774050 BF914200H88457 AA627746 BI560216 BI753586 

AW975281 AA664986 AA525775 AA056342 AI538978 

X79449 BC01 7853 AL1 21 035 BF1 G6384 AW1 1 9044 A1028023 AW451 1 1 0 A197191 1 AW01 5069 A1079 170 AI376367 A12641 1 3 AA829646 AA737579 
AA449679 AA740864 NM 001111 U18121 AL567297 BG773801 BF973874 AV687104 AA527579 AA843525 BE706355 A1074589 A1523475 
BE890249 AW406263 BE074258 AV729485 BF809610 BG05861 9 AA677244 BE179838 AA622264 AI460106 AA740411 AI499168 A1078223 
AI682923 BE696559 AW375385 AA788739 BG984978 Z40874 T17054 F09669 AW844043 U10439 BI71 1870 AW245957 AU 158567 AA679305 
AA679316 W72510 AI346029 BG059762 AW251062 AA132373 AI925621 AI860230 AI340172 AW192891 A1707980 AI094937 AI0421 15 AI200901 
BE328452 AA644678 AA551209 BE351065 AA970761 N68609 AW002028 AA1 60826 AI422774 AW8731 1 4 AW073597 AY/664483 AI21871 0 
AW020550 AW1 90607 AI984545 AI871921 AI333970 AI452887 AI818335 AA398655 AI554424 A12741 87 BE465703 AW51 2940 AW241 366 
AI923954 AA576649 AW168294 AA813181 AA912168 AI049738 AW514073 AA548255 AI569630 BE710031 AA244182 AI341697 AA563904 
AI537990 AW517908 AW172943 Z39498 A1750294 AW150414 AI253293 BE825720 T31860 AW150775 D20310 AA150892 AU133933 BE781 148 
AL038957 BF910979 AA352297 BG988142 AY/372175 BF229106 AY/866705 BE093482 BG990396 AI499917 AA054452 H05484 AI828502 - 
BM467331 AU140570 AL135417 BF947202 AW391926 BE813418 BF998473 T92021 BI021048 BM048783 AY/501366 AY/501342AW501 549 
BE939021 BE7C7147 BE160974 BE305207 N49011 AA947119 AAS78801 BE536876 AW897428 BG329648 BG818540 BE542344 BI919250 



AF274943 BG494894 AI719075 AA908783 AI935150 AI422691 AA910644 AA583187 BM272167 AI828996 AA527373 AW972459 AI831360 
AA772418 AI033892 AA100926 AU154749 AI459432 AI423513 AI094597 AA740817 AI991988 AI090262 AI312104 BI256707 AA459522 AA416871 
AI07S239 AI339996 AA701623 AI133549 AI336880 AA633643 AI989380 AI362B35 AA399239 AI146955 BF514270 N92892 AI348243 AI278887 
AA459292 AI494230 BF507531 AI492600 AA962596 AW613002 AA293140 AA235549 BF108854 AA954344 N49682 AI457100 AW589407 
AY/300758 BE220715BE220698BE569091 BM009647 BF900351 AI537692 AI203723 AI857576 AA584410 AW371667 BM172363 



450506 1 529 1 U09278 NMJ04460 U76833 AF007822 AL550894 BG20391 9 AL575714 AI47B772 AW022667 AY/61 3820 AI435793 AI051 768 AI200109 AA43561 1 

BG208151 AI446661 BG215551 BM449645AW630055BG620125AL550932 AY/471133 AU136648 BE925603 BF828688 
455813 1515590J BE141577 BE141585 BE141587 
439780 49082.1 AL109688 R23665 R26578 

400200 3806_1 BC005265BG176720AW006027 BM352Q64 AW02631 B AIB35822 AI880584 A16937S9 AI09221 1 BI492387 AI400449AW1BB297BF939910 

AA232282 AW021 432 A1333893 AA494303 AA8548S9 AI436795 AW069256 AA682373 AID92748 AA993184 AM 26077 AI081 758 AI24068B 
AI261863 AI378423 AA465237 AI376096 AA035579 AI087306 AA448162 AA129977 AI090903 AI080686 AI28B939 N33004 AI801240 AW021546 
AI370773 AI086064 AA669528 AI250053 AI870113 AA853181 AA858014 BG055562 BG939559 AW080765 AAD32283 AW467587 H40506 D00762 
NMJJ02788AA641134/I 3 \ 25 c fc^975 AID93566 A1707743 AI2S0741 AW073417BE875418BM264076BG876884AI680535 
AW854219 BE774635 AW854212 BG952443 AY/854221 AW85420B BE156318 BEB43056 AW858991 BE9375B9 BG878291 BG87B450 AW819099 
AI908570 AA449871 AU135228 BM478404 BF126296 AA375499 AA248473 

412636 1438J M77830 NM 004415 AF139065 BG681115 BG740377 BI712964 BG000656 AA12B470 BI438324 H27408 BE931630 BE157165 AW370827 

AW370813 J0521 1 BG698865 BG740734 BG680618 BG73977B BI765807 BM353403 BM353248AW177784 AW205789 AY/951576 AW848592 
BE182164 BF149266 BE94C187 BI060445 BI060444 BF350983 BE720095 BE720069 BE715154 BE082584 BE082576 BE004047 AA85731 6 
BI039774 BE713818 BE71354B AW170253 BE1B0433 BI039775 AW886475 BM462504 BE931734 BF149264 AA340777 BF381183 BG621737 
AU127260 AW364859 BF993352 BG223489 BE819009 BF3B1184 BE715956 R58704 AAB5221 2 AW366566 BI090358 BF087707 BE819046 
BE819005 AA377127 BE073467 BE819069 BE81904B BI036306 BG990973 BI040954 BF919911 AU140155 AI951766 AI434518 AW804674 
BF752969 BE837009 BE925826 BF149265 AW995615 BE814264 BI039782 AU140407 BE144243 BE709863 BF985642 BE001923 BF933510 
AW265328BG436319BE182165AW365175AW347688BE818280AW177933BF873679AW178000BE082526BF476866 BF086994BF592276 
BE082507 BE082514 BE082505 BF373693 AV/068840 AW847678 BF804153 AY/365157 BE813930 BE002030 AW365153 BE184941 BF749421 
BE184920 BF839562 BE184933 BF842254 BE698470 BE931048 BF999889 BF363816 BE184924 BE159646 BE714632 BE184948 BG986845 
AA131128 AA099891 W39488C04715 BF096124 BE865341 AW799304 ALB03116 BE1497B0 BE705967 BE705966 BE705968AW848723 
AW376699 AW376817 AW376597 BG005097 BF751 1 1 5 BE696084 AW848371 AW376782 AW848789 AW849074 AW3B1 41 3 BF927725 BF09421 1 
AW997139 BE855474 BE185187 BE156621 BE715089 BE713297 BE713298 BE179915 AW799309 BF372345 BF088676 BE705939 AW752599 



AW377699 AY/607238 BE082519 AW377700 BF34 9 467 A1190590 A1554403 AI392926 AU158477 BI457252 AU159919 AI7S0316 BF082516 
AI439101 AA451923 AI340326 AI590975 BI791553 A1700963 AI1428B2 AA039975 AA946936 AA644381 BM314884 AA702424 At417612 
AW190555 AI220573 AI304772 AI270345 AI627383 AA552300 AI91 1702 AW166807 AI346078 Y/95070 AA149191 AA026864 At830049 AW780435 
AI078449 AI819984 AI858282 BI468588 AI860584 AI025932 AA026047 AA703232 AA6581 54 AA51 5500 AW1 92085 AA918281 T77861 AI927207 
AI205263 BF082491 AW021347 AI568096 BE939862 AA088856 D12062 AA056527 AA7B2109 W19287 W02156 AW150038 AA022701 T87181 
H44405 AI910434 BF082513 AI494069 AI270027 AI635878 AA128330 BG681 425 BE706078 R20904 BG680059 BG676647 BF764409 AA026654 
AV745530 BI762796 BG287391 AW7S8780 BE706045 BE92B470 AW7991 18 BF087996 BE002273 AW879451 AI571075 BE067786 AV721320 
AI022862 N29754 C03378 N84767 AA131077 H30146 BE714290 AI686869 AI568892 AI915596 AW105614 AI88725B AI53B577 BE926474 
BE067737 BG319486AA247685 AW793333 AW103521 BF989173 AW850B78 BE939707 BE185750 BE714064 BE713903 BE713868 BE713763 
BG950164 BE713B10 AW365151 BG955489 BE005272 BF915937 AW365148 AI905927 BF992780 AW653812 BG954443 BI770853 BGS79406 
BG740832 BG681087 BG698430 AA455100 T87267 BE696209 BEB96210 BI089483 BE006273 BE872225 AW391912 BE925515 BG677012 
BG741970 AA026480 BE705999 BG677157 BE009090BG681378 BE712291 BG961496 BG67B984 BI040941 AA337270 AW384371 AW847442 
BI058659 BE813665 W95048 W25458 AY/177786 AA025851 BE931733 BF154837 BG949393 BE714441 AW995245 BE711801 AI284090 
BE064323 BE719390 BE940148 BG99121 2 BF375714 BF349522 BG996267 T48793 BI013292 BE001925 AW365156 AY/365154 AW606653 
BF763109 BE931637 BE167181 BE71387S BF354008 BF67B726 H90899 AY/365145 W38382 AI498487 
BC015981 AJ301615 AA628980 AI126603 BF184719 



TABLE 46C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
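
As an illustration of how an Nt_position entry might be read programmatically, the following Python sketch splits one entry into (start, end) exon intervals. The function name `parse_nt_position` is hypothetical and is not part of the disclosure. The sketch assumes that entries are comma-separated `start-end` pairs and that a trailing fragment cut off by the column width (a lone number, or a range whose end is smaller than its start) should be dropped rather than guessed.

```python
def parse_nt_position(field):
    """Split an Nt_position entry into a list of (start, end) exon intervals.

    Assumption: the entry is a comma-separated list of "start-end" pairs.
    Fragments that look truncated are skipped instead of being repaired.
    """
    exons = []
    for part in field.split(","):
        start, sep, end = part.strip().partition("-")
        if not sep or not start.isdigit() or not end.isdigit():
            continue  # not a complete "start-end" pair
        start_nt, end_nt = int(start), int(end)
        if end_nt < start_nt:
            continue  # end coordinate was probably cut off by the column width
        exons.append((start_nt, end_nt))
    return exons


if __name__ == "__main__":
    # Two complete exon ranges taken from the table below.
    print(parse_nt_position("17982-18115,20297-20456"))
```

Dropping incomplete ranges keeps the output limited to coordinates that actually appear in the table; a stricter variant could raise an error instead.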

Pkey Ref Strand Nt_position

401781 7249190 Minus 






402994 
402075 
405770 
401994 
404996 
404240 
402447 
402992 
402408 



405545 
401797 
402294 
401961 



7249190 
8118496 
7249190 
9789672 
8118496 
2996643 
8117407 
2735037 
4153858 
6007890 
5002624 
9796640 
7767907 
9796239 
2326514 
5587915 
2326514 
2996643 
1054740 
6730720 
2282012 
4581193 
7528051 



17982-18115,20297-20456 
165776-165996,166189-166314,166408-16656 
118596-118815,119119-119244,119609-11976
16879-17023 
4727-4969 

121907-122035,122804-122921,1 



42904-43124,43211-43336,44607-44753,4519 

37999-38145,38662-38998,39727-39872,4055 

116132-116407,116653-116922 

47605-47729,51696-51821,52070-52257,5330

42137-42615 

110325-110491 

51086-51301 

3769-3833,5708-5895 

53134-53281 

5962-6216 

118677-118807,119091-119296,121626-12182 

6973-7118 

2576-3000 

124054-124209 

80430-81581 



173667-173783,176876-177055 
18748-19757 

1553-1712,1878-2140,4252-4385,5922-6077
20475-20734 



TABLE 47A:

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Maximum of esophageal tumor AIs divided by the 98th percentile of the normal esophagus AIs
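
The R1 ratio defined above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names `percentile` and `r1_ratio` are hypothetical, the AI values are taken to be plain numeric measurements per sample, and the percentile uses linear interpolation between ranked values, which the legend does not specify.

```python
import math


def percentile(values, pct):
    """Percentile by linear interpolation between ranked values.

    Assumption: the interpolation rule is a choice made for this sketch;
    the source does not say how its 98th percentile was computed.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty list is undefined")
    rank = (len(ordered) - 1) * pct / 100
    lower, upper = math.floor(rank), math.ceil(rank)
    if lower == upper:
        return ordered[lower]
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def r1_ratio(tumor_ais, normal_ais):
    """R1: maximum tumor AI divided by the 98th percentile of normal AIs."""
    return max(tumor_ais) / percentile(normal_ais, 98)


if __name__ == "__main__":
    normal = list(range(1, 102))   # made-up normal-tissue AIs: 1..101
    tumor = [40, 120, 198]         # made-up tumor AIs
    print(r1_ratio(tumor, normal))
```

Using a high percentile of the normal samples rather than their maximum makes the denominator less sensitive to a single outlying normal sample.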




Pkey    ExAccn     UnigeneID  Unigene Title                             R1
400289  X07820     Hs.2258    matrix metalloproteinase 10 (stromelysin  31.70
411243  AB039886   Hs.69319   CA11                                      30.12
        M13509     Hs.83169                                             18.46
444325  AW152618   Hs.16757   ESTs                                      18.22
444381  BE387335   Hs.283713  ESTs, Weakly similar to S64054 hypotheti  17.52
421379  Y15221                small inducible cytokine subfamily B (Cy  16.26
400666                        NM_002425:Homo sapiens matrix metallopro  15.59
425211  M18667     Hs.1867    progastricsin (pepsinogen C)              15.22
425679  X05997     Hs.159177  lipase, gastric                           14.60
        X81334     Hs.2935                                              13.14
431723  AW058350   Hs.16762   Homo sapiens mRNA; cDNA DKFZp564B2062 (f  12.60
446619  AU076643   Hs.313                                               12.00
453331  AI240665              ESTs                                      11.20
431620  AA126109   Hs.264981  2'-5'-oligoadenylate synthetase 2 (69-71  10.77
408380  AF123050   Hs.44532   diubiquitin                               10.32
423673  BE003054   Hs.1695    matrix metalloproteinase 12 (macrophage   10.32
412326  R07566     Hs.73817   small inducible cytokine A3 (homologous   10.22
419216  AU076718   Hs.164021  small inducible cytokine subfamily B (Cy  10.18
408243  Y00787     Hs.624     interleukin 8                             9.80
414359  M62194     Hs.75929   cadherin 11, type 2, OB-cadherin (osteob  9.75
450375  AA009647              a disintegrin and metalloproteinase doma  9.12
407366  AF026942   Hs.17518   gb:Homo sapiens cig33 mRNA, partial sequ  8.88
433447  U29195     Hs.3281    neuronal pentraxin II                     8.54
421508  NM_004833  Hs.105115  absent in melanoma 2                      8.46
452862  AW378065   Hs.8687    ESTs                                      8.34
432828  AB042326   Hs.287402                                            7.92
452281  T93500     Hs.28792   Homo sapiens cDNA FLJ11041 fis, clone PL  7.86
409757  NM_001898  Hs.123114  cystatin SN                               7.62
452838  U65011     Hs.30743   preferentially expressed antigen in mela  7.60
413670  AB000115              hypothetical protein, expressed in osteo  7.53
452410  AL133519              Homo sapiens mRNA; cDNA DKFZp434E2321 (f  7.46
437330  AL353944   Hs.50115   Homo sapiens mRNA; cDNA DKFZp761J1112 (f  7.44
406687  M31126                matrix metalloproteinase 11 (stromelysin  7.24
430280  AA361258   Hs.237868  interleukin 7 receptor                    7.18
439343  AF086161              hypothetical protein FLJ11808             7.13
429228  AI553633   Hs.326447  ESTs                                      7.04
421110  AJ250717              cathepsin E                               5.98
414004  AA737033   Hs.7155    ESTs, Moderately similar to 2115357A TYK  5.88
        W74048                lymphocyte-specific protein tyrosine kin  6.88
406573  M34996     Hs.198253  major histocompatibility complex, class   5.72
421582  AI910275              trefoil factor 1 (breast cancer, estroge  5.52







416768 
430413 
427509 

413278 



AI754813 
AA279530 
NM_005408 
BE440042 
AA363733 
AW842182 
M62505 
AW972300 



Hs.83326
Hs.1032 
Hs.241392 
Hs.2161 
Hs.118110 
BE563085 Hs.833 
AI469355 Hs.127310
426711 AA383471 Hs.343800 
421362 AK000050 Hs.103853 
452401 NM_007115 Hs.29352 
404240 

435523 T62849 
437763 AA469369 
425139 AW630488 
415989 AI267700 
408202 AA227710 
450701 H39960 
423271 W47225 
414774 X02419 
443907 AU076484 
424687 J05070 



431629 
435370 
443378 
443071 
409154 
445417 
413142 



418245 
413945 
423225 



AU077C25 Hs.265827 

AI964074 Hs.225838 

AW392550 Hs.9280 

AL080021 Hs.8986 

U72882 Hs.50842 

AK001058 Hs.12680 

M81740 Hs.75212 

M33600 Hs.308026 

AA025386 Hs.61311 

M088767 Hs.83883 

NM_000591 Hs.75627

AA852604 Hs.125359 

AA114212 Hs.9930 

X12451 Hs.78056 
NM_005940 
AL042014 
AF113676
AA236881 



CD8 antigen, alpha polypeptide (p32)
growth suppressor 1
interferon-induced protein with tetratri
Src-like-adapter

Fc fragment of IgG, low affinity IIIb, r

Homo sapiens mRNA for caldesmon, 3' UTR
NM_002425:Homo sapiens matrix metallopro 
collagen, type V, alpha 1 
integrin, beta 2 (antigen CD18 (p95), ly 
small inducible cytokine subfamily A (Cy 
matrix metalloproteinase 3 (stromelysin 
regenerating islet-derived 1 alpha (pane 
small inducible cytokine A5 (RANTES) 

bone marrow stromal cell antigen 2 

ESTs 

d gene amp 
cal protein f 
tumor necrosis factor, alpha-induced pro 
NM_018950:Homo sapiens major histocompat
membrane-spanning 4-domains, subfamily A
tissue inhibitor of metalloproteinase 1
protease, serine, 23
ESTs 



plasminogen activator, uroki 
TYRO protein ty ' " 
matrix metalloproteinase 9 (gelatinase B 
type I transmembrane protein Fn14 
myxovirus (influenza) resistance 1, homo
hypothetical protein FLJ10718 
desmoplakin (DPI, DPII) 
interferon, alpha-inducible protein (clo 



410422 
413936 
409202 

422562 AI962060 

443639 BE269042 

444652 BE513613 

412471 M63193 

449717 AB040935 

417389 BE260964 



445109 
406778 
408716 
412773 
414024 
426530 
414945 



Hs.155324
Hs.53348 
Hs.297581 
Hs.51043 



AF039916 Hs.12330 

H06273 Hs.101651 

AI567839 Hs.151714

H15785 Hs.74573 

AA134712 Hs.22410 

U24578 Hs.278625 

BE076358 Hs.77667 



ESTs 



Homo sapiens cDNA FLJ10196 fis, clone HE



ESTs, Weakly similar to S1 



CD14an . 
Thy-1 cell surface antigen 
serine (or cysteine) proteinase inhibito
cathepsin L

matrix metalloproteinase 11 (stromelysin
Homo sapiens, clone MGC:15203, mRNA, com
serine (or cysteine) proteinase inhibito
hexosaminidase B (beta polypeptide)
AE-binding protein 1

proteasome (prosome, macropain) subunit,
actin related protein 2/3 complex, subun
endothelial cell growth factor 1 (platel
cerebral cell adhesion molecule
midkine (neurite growth-promoting factor
ESTs, Weakly similar to ALU2_HUMAN ALU S
ectonucleoside triphosphate diphosphohyd
Homo sapiens mRNA; cDNA DKFZp434C107 (fr
Homo sapiens mRNA for KIAA1769 protein,
similar to vaccinia virus HindIII K4L OR
gb:zm79g08.r1 Stratage



TABLE 47B

Pkey: Unique Eos probeset Identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession

453331 16559.1 BG571303 AA410586 AA035013 BG572117 BG620022 AA147247 BG005785BG014448 R31981 H02668H12498 R36203 BF992089 R73999 

T49904 R75732 BI057974 T53681 AA147933 N50595 R68588 R25671 R31 935 R251 10 R361 05 AK055628 BE1 57467 AW663674 AA190993 






H01 642 BF51 0304 AA62691 5 AA746952 A!1 61 01 4 M0SS554 BG572534 AI803329 AI809932 AI808765 AA41 1449 AI37B750 AA976929 AI378620 
AA909684 R75B32 AI360919 AI3S0463 AW0691 27 AA41 1621 AA742532 H12451 BE208298 HQ361 2 H1 2839 N58781 R75957 BF996484AI240665 
BF989591 BI066086 BG001590 BF107035 

450375 16559.3 BG570706 BG572749 AW6062B4 H04021 AA151 166 AW954405 AA131254 BG056461 W46291 H01532 H04384 H03231 AA852876 H04410 

H59605 BE157601 AA1 13758 

452410 59661.1 AL133619 AI43S410 AA622747 AW272464 AI215594 AIS73758 AI476447 AI80412B A15B1345 A1026826 AI300820 AW513621 AA2561 B2 AI559724 

AI493388 AA614641 AI125754 AI21 4351 AI567080 AI200B13 AI476629 AI685732 AA602400 AA730140 AI565082 AI269603 AI807095 AA905453 
AA505909 AI204595 A'582930 Al 536077 AA757863 AA730 1 54 AA664048 BI831 663 AI734138 AI7341 30 AI732734 AW043563 AI741241 A1732741 
BF1 1 1 446 BE677727 AA437369 AA426284 AA433997 M425820 

406687 0 0 M31126 

421582 13358.1 X00474 NM.003225 X52003 M12075 BI765761 AW950155 AI571948BI760569 M308400 AA568312 BI761955M507595 AA614579 AA614409 

BF747698BM1 42326 AA30- h rI , 1 7 7 * 2 ' ' A W3 39769 AA514776AA588034 BG271505AA858276 BM142503 AW050700 
AI307407 AI202532 AA524242 AI909772 AI97085S BG23651 6 AW750216 AA587613 AI909749 AI909751 AI910083 AA614539 R55292 AA507418 

413808 2905.1 AI570199 AI888812 AW867550 AI921557 AW469096AI925581 AI679986 AW473623 BE841640 BF061 525 AI445703 AI925072AW863188 

AW863076 BE841731 AW863167 BE841390 BE841365BF374078 BE841760 BE841694 BE841769 AA335110 BE841692BF374073 AA335204 
BF374079 BE841713 AA335167 BE841 584 AW868103 BE841645 BE841765 AI076336 AW867433 BF373831 BE841758 AW86891 1 AW8631 55 
AW868847 BE841651 AA335145 BE841670 BF374250 BF374088 BE841661 BE841728 BI335729 BE841739 BE841663 AW863104 AA335201 
AA3351 43 BF906965 AW867493 BE841 505 BF374250 BE841 756 BF373837 AW863191 BE841705 AW863154 AW858673 AW86731 1 AA335B96 
BE841 753 AW863407 BE9371 02 BF374252 BF374247 BF374255 BE841785 AW029590 AW1 31 278 AI801 021 AW058240 AW058400 AW029230 
AW029432 AW130609 AWQ291 28 AW1 30469 AI5701 55 A'520272 AW029259 AIB01 389 AI888662 AI926902 AIB01799 AI610344 AI452852 
AW1 31 1 74 AI581069 AI225028 AI44S689 AI923321 AI439430 AI801502 AI679707 AW028944 AI933684 AI801724 AI537779 AI354652 AI470250 
AI536872 AI891 151 AW86801 9 AW006034 AI702599 AA3351 92 AA3351 55 AA3351 89 AI933725 AW044393 AI888797 BE841 677 BE8416B1 
AA3351 41 AW0081 76 AA335223 AI888837 AW868622 AI803901 AW00571 8 AI538062 AI282258 AI580678 AI445803 AI445394 A1868168 
AA335144 AI926349 AA335210 AA334919 AA3351 63 AA33521 6 AI578342 BF374135 AI932922 AA335214 AA335109 AI570325 AI452619 AI926109 
AI453488 AI678606 AW869289 AW86921 1 BE841580 AI679368 AI888882 AI926170 BF508305 AW86931 5 AA334926 BE84171 2 AW026584 
AA335200 BE841 764 AV730339 AW474979 AI286344 AI446430 AI53761 2 AA3351 66 AW868051 AI679133 AI949520 BE841 652 AI949532 
BE937113BE841789BE841643AW130556BE841761AW868716AW868698 BE841669BE937108AA335158AA335153AA335159AW867404 
AW868692BE841742AW868711 AW867546 BE841 599 AA33519B AA3351 46 AW868150 BE841660 T99129 BE841740 BE841714 AA335154 
AW868815 BF373812 BE841657 BE841780 AI440394 AA335215 AA335202 M335162 M335160 AI801656 AI678499 BF374019 AW130236 
A1826057 AI572459 AI932773 AA335197 AI61 1 752 AA335224 AI452592 AA335147 AA335149 AA334928 AA335114 AA3351 11 AI567048 
AW029395 A1570326 BF373838 BE841 691 BE841776 AW863485 BF374093 AW130376 BE841 732 AI446393 AI446781 AW867547 AW029012 
AA335227 AW869307 AW869350 AW858709 AW869407 AW005017 AI679252 AI925523 AW1 51553 AW863109 AI445917 AI799620 AI921607 
AW008153 AI520957 AI610620 AI679828 AI8681 51 AI537839 AI679547 T28354 AI282567 AA335207 R83655 BF906963 AW131160 A1925525 
AW029396 AW028445 AW00841 0 AW 152586 AW008476 AI801 040 AI453669 AI621200 AA334925 BF374069 BF374075 N53208 BF374246 
AW868723 BE93715Q AA955002 AW863338 BE841767 

415989 10194.1 BC013389 BC017398 AI023543 AA191424 AI267700 A'469633 AW958465 AW953397 AA172056BE940298 BF909208 BF909980 BF095153 

BG285837 AI720344 BF541715 AA355086M172236 

TABLE 47C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



400666 8118495 Plus 

400665 8118495 

404240 5002624 

402992 7767907 



UnigeneID: Unigene number
Unigene Title: Unigene gene title 

R1: 90th percentile of normal esophagus AIs divided by the 90th percentile of esophageal tumor AIs
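
By way of illustration only, the R1 value defined above can be sketched as a ratio of two percentiles. The function names `percentile` and `r1_ratio`, the linear-interpolation rule used between ranked values, and the sample intensities are assumptions introduced for this sketch; the specification does not state how the percentiles were interpolated.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest ranked values (an assumed convention)."""
    data = sorted(float(v) for v in values)
    if not data:
        raise ValueError("at least one intensity value is required")
    rank = (len(data) - 1) * q / 100.0
    lower = int(rank)                      # rank is non-negative, so int() floors
    upper = min(lower + 1, len(data) - 1)
    fraction = rank - lower
    return data[lower] + (data[upper] - data[lower]) * fraction


def r1_ratio(normal_ai, tumor_ai, q=90.0):
    """Ratio of the q-th percentile of normal-tissue average intensities
    to the q-th percentile of tumor average intensities."""
    tumor_p = percentile(tumor_ai, q)
    if tumor_p <= 0:
        raise ValueError("tumor percentile must be positive to form a ratio")
    return percentile(normal_ai, q) / tumor_p


# Hypothetical intensities, not taken from the tables.
normal_samples = list(range(0, 110, 10))   # 0, 10, ..., 100
tumor_samples = list(range(11))            # 0, 1, ..., 10
ratio = r1_ratio(normal_samples, tumor_samples)
```

A ratio well above 1 under this definition would indicate a probeset whose signal is higher in normal esophagus than in esophageal tumor.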

Pkey ExAccn UnigeneID Unigene Title R1

426752 X69490 Hs.172004 titin 30.23

425545 N98529 Hs.158295 Homo sapiens, clone MGC:12401, mRNA, com

407013 U35637 gb:Human nebulin mRNA, partial cds



428087 
417070 
406707 
405001 
418391 
418205 
422633 
400499 
418390 
412519 
417435 
413778 



AA100573

Z19077 

S73840 



Hs.929 
Hs.182421 
Hs.172004



NM_005181 Hs.82129

AA090235 Hs.75535 

BE206854 Hs.46039 

AA195845 Hs.73680 

N53097 Hs.193579 

AA194412 Hs.50550 



myosin, heavy polypeptide 2, skeletal mu 
interleukin enhancer binding factor 1 
troponin I, skeletal, slow 
troponin I, skeletal, fast 
enolase 3, (beta, muscle) 
C10001858:gi|6679124|ref|NP_032759.1| ne
titin immunoglobulin domain protein (myo 
troponin T1, skeletal, slow 
carbonic anhydrase III, muscle specific 
myosin, lightpolypeptirtp ? i julatory 
phosphoglycerate mutase 2 (muscle)
ESTs, Weakly similar to S12558 cysteine- 
431360
416982 
426429 



454229 
424734 
428221 
431204 
443727 

413132 



418532 
419711 
422640 



AW975934 

NM_002666 

M21984 

AF129505 

BE379727 

AF015224 

AA725362 

AW9S9675 

AW957744 

AI217685 

U96781 

F28841 



Hs.251680 loricrin 

Hs.80691 creatine kinase, mitochondrial 2 (sarcom 

Hs.169849 myosin-binding protein C, slow-type 

Hs.343603 titin-cap (telethonin)

Hs.49998 Z-band alternatively spliced PDZ-motif 

Hs.283382 ESTs, Weakly similar to I38344 titin, ca 

Hs.103253 perilipin

Hs.73454 troponin T3, skeletal, fast 

Hs.86492 small muscle protein, X-linked 

Hs.83213 fatty acid binding protein 4, adipocyte

Hs.46452 mammaglobin 1 

Hs.120456 ESTs 

Hs.291232 ESTs 

Hs.278469 lacrimal proline rich protein 

Hs.96844 ESTs 

Hs.183075 ATPase, Ca tra 
Hs.250760 

Hs.18459 ESTs 

Hs.47438 SH3 domain binding glutamic acid-rich pr 

Hs.75209 protein kinase (cAMP-dependent, catalyti 

Hs.272556 peptidylarginine deiminase type I 
Target Exon 



451621 
446962 
411102 
411862 
454059 
451957 



414657 
410621 
429134 
436519 



AI87S148 

AI351421 

AA401295 

AA528140 

NM_003154

A1796320 

AW015415 

X51501 

AA253314 

AA216287 

NM_003063 

AA773866 

U94777 

AA424C74 



AW444530 

AI908165 

AI074502 

NM_003803

F00055 

X69089 

D63216 



AW772298 

Y18207 

AB014524 

AB019391 

Z97171 

W05433 



Hs.118845
Hs.146070 
Hs.173084 
Hs.26770 
Hs.279709 
Hs.23926 
Hs.107515 
Hs.37048 
Hs.10299 
Hs.127780
Hs.99949 
Hs.154103 



Hs.244569 

Hs.76780 

Hs.172004 

Hs.99004

Hs.238756 

Hs.17109 

Hs.105362

Hs.169946

Hs.134292

Hs.2504 

Hs.172004 

Hs.79227 

Hs.153684

Hs.74455 

Hs.173059

Hs.87383 

Hs.21103 



ESTs, Weakly similar to ALU1_HUMAN ALU S
myosin, heavy polypeptide 3, skeletal mu
fatty acid binding protein 7, brain
muscle specific ring finger protein 1

ESTs, Weakly similar to T00329 hypotheti
statherin 

Homo sapiens cDNA FLJ13545 fis, clone PL 
ESTs 

prolactin-induced protein 

LIU protein (similar to rat protein kina 

myosin, light polypeptide 3, alkali; ven 

Target Exon 

esophagus cancer-related gene-2 
phosphorylase, glycogen; muscle (McArdle
protein phosphatase 1, regulatory (inhib 



myozenin

integral membrane protein 2A 

Homo sapiens, clone MGC18257, mRNA, com 

TA-binding p ri n ) II epto 
hypothetical protein MGC12921 
myomesin 1 (skelemin) (185kD)



hypothetical protein 

Homo sapiens mRNA; cDNA DKFZp564B076 (fr 
protein phosphatase 1, regulatory (inhib



433122 
415447 
415655 
442376 
452308 
418072 
429413 
423725 



413391 AI223328 

430699 AW969847 

419050 NM_000036 

422313 AF045941 

417045 F01180 

426158 NM_001982

435101 AI743156 



N39127 
AW872527 
AI580809
AA458648 



Hs.61297 
Hs.86507 
Hs.201877 
Hs.132127 
Hs.32825 
Hs.75335 
Hs.292718 
Hs.89570 
Hs.115166
Hs.332030
Hs.199067
Hs.131064 

Hs.59761 
Hs.99569 
Hs.99476 
Hs.202949 
Hs.201619 



myocilin, trabecular meshwork inducible 
ESTs 

Homo sapiens cDNA FLJ12228 fis, clone MA
ESTs 

Human DNA sequence from clone RP3-353C17 
DESC1 protein 

hypothetical protein LOC57822 



ESTs, Weakly similar to 1313184B alphal
KIAA1102 protein

ESTs, Weakly similar to S38383 SEB4B pro
Hs.225948
Hs.98669 
Hs.32450 



m mRNA 

ESTs, Weakly similar to I38022 hypotheti
hydroxyprostaglandin dehydrogenase 15-(N
skin-specific protein

cytochrome c oxidase subunit VIIa polype




ESTs, Moderately similar to T17372 plasm 
heat shock 27kD protein 2 
eukaryotic translation elongation factor 



U35637 AA1 92323 AA194508 BG01 1 583 F2571 2 AL598820 BE18537B 

AK057547 BG181248 M883756 F25670 AA778128 F27557 F18914 F25171 AA178844 F21556 F25872 F20457 F27617 F36059 F34817 F2S967 
F25922 F31278 F34666 F01176 F36333 F01226 F27406 F2713C F28742 F24126 F29891 M1 95955 AA086351 W69291 F25880 F32791 F31311 
F32380 F25216 F19679 F18556 F29730 F24954 F32741 F30404 F35470 F33989 F33141 F36382 F34118 F17714 AA176345 F24700 AA550940 
F18617 F16859 F15633 F34675 F16528 F17281 AA086388 F30859 F21852 C02644 F29425 F25286 C03553 F35259 W80691 F16457 F24094 
F16783 AA180319 F28443 F17753 F17448 F00542 AA197179 AA193D12 

AJ276240 N70563 F37502 F2920D F27 !03 F1E577 F19( R F20867 Z2385 7 F30994 F31752 F17375 F15601 F17543 F17411 

AV724258 AA2471 53 BF73621 9 BF51 3744 AW058048 AI082691 AA865520 N391 27 AV724549 F20776 AA249747 AW970392 AA535433 F36964 

AK056951 AK02S458 BI4391 20 BM021106 F30243 BM055214 BM054962 BM069667 F37401 AA563621 AI752243 AI720773 A193301 4 F18964 
F35317 F35258 F27772 H39537 AW445222 F19408 H28557 F30608 F31797 F30960 BF837737 BF837688 AL551046 BI758668 BI76503B BI837440 
BE392882 BI438601 A'093511 AI752244 AI784111 BG490221 BF338840 BF338974 BG895472AL57S843AW966769 F25388 F37436H28558 
AI025548 AA782333 F30929 F36002 F21229 AI720539 AA719449 F21231 F18924 AA626886 F30774 F27704 F31411 F31127 F33381 F36153 
F31793 F31138F31355F'^<1 W2982 I BI7 '34 - n AI692843 F29441 H51409 F21804 AW973249F18440 F17572 F32499 AA327152 
AA534140 AI188088 F18893 F23362 AA010888 F1E143 Z28500 H27651 AI720790 F22425 H13178 H28677 F21098 F37777 F21466 F16598 
F23420AL574723R75610F34C F1 r M 60! 11 I 117 F35534 F15713 AI612800 F16563 F15645F33609 F29995 BG939623 F17385 
F17384 F1B660 F17922 F15523 AI093253 F18359 F31452 F00232 A1583430 BM021353 AA284108 H27650 H29935 BE708208 AA010737 H51451 
TABLE 48C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

405001 6015406 Minus 104646-104819 

400499 9796071 Minus 148495-148806 

403805 8140491 Minus 51483-51742,53429-53511

402270 3108020 Plus 117656-117822 

404270 9828129 Minus 3649-3750,4161-4306,5962-6049,6849-6965 
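
As a hedged illustration of how the Nt_position column may be read, the following Python sketch splits a position field into exon intervals and sums their lengths. The function names `parse_nt_position` and `total_exon_length` are hypothetical, and the treatment of each range as inclusive at both ends is an assumption; the table itself does not state the coordinate convention.

```python
def parse_nt_position(field):
    """Split a field such as '51483-51742,53429-53511' into a list of
    (start, end) integer pairs, one pair per predicted exon."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"exon end precedes start in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming both coordinates are inclusive."""
    return sum(end - start + 1 for start, end in exons)


# Position field copied from the row for Pkey 403805 above.
exons_403805 = parse_nt_position("51483-51742,53429-53511")
length_403805 = total_exon_length(exons_403805)
```

Under the inclusive-coordinate assumption, the two exons listed for Pkey 403805 span 260 and 83 nucleotides respectively.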



Table 49A. 1552 genes upregulated in lung cancer relative to normal body tissues 

Table 49A shows 1562 genes upregulated in lung cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules, peptides or
antibodies. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was
expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains
indicative of having oncogenic function, of transducing intracellular signals, or of being modulatable by small molecules, peptides, or antibodies (e.g. pkinase, death-domain, 7tm,
phosphatase, or transporter). Certain predicted protein domains are noted.
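
The records of Table 49A are semicolon-delimited, with the predicted-domain field followed by optional TM= and SS= annotations and a final R1 value. A minimal Python sketch of reading one such record is given below. The function name `parse_record`, the returned dictionary keys, and the rule that the domain field begins after the last "; " separator are assumptions made for illustration, and the sketch presumes a record free of OCR damage. The TM and SS values are returned as found, without interpretation.

```python
import re

# Matches annotations such as 'TM=Y' or 'SS=M' that follow the domain list.
FLAG_RE = re.compile(r"^(TM|SS)=(\w)$")


def parse_record(line):
    """Parse 'Pkey; ExAccn; UnigeneID; Title; Domains[;TM=x][;SS=x]; R1'."""
    body, r1_text = line.rsplit(";", 1)
    pkey, accession, unigene, rest = [p.strip() for p in body.split(";", 3)]
    # Titles may themselves contain '; ', so the domain field is taken to
    # start after the LAST '; ' (an assumption of this sketch).
    title, sep, annotation = rest.rpartition("; ")
    if not sep:
        raise ValueError("record has no domain field")
    domains, flags = [], {}
    for piece in (p.strip() for p in annotation.split(";")):
        if not piece:
            continue
        match = FLAG_RE.match(piece)
        if match:
            flags[match.group(1)] = match.group(2)
        else:
            domains.extend(d for d in piece.split(",") if d and d != "none")
    return {
        "pkey": pkey,
        "accession": accession,
        "unigene": unigene,
        "title": title.strip(),
        "domains": domains,
        "flags": flags,
        "r1": float(r1_text.strip()),
    }


# First record of the table, shown with OCR damage repaired.
record = parse_record(
    "421502; AF111856; Hs.105039; solute carrier family 34 (sodium phosphate), "
    "member 2; Ribosomal_L20,Na_Pi_cotrans;TM=Y;; 24.06"
)
```

Records with empty accession or UniGene fields, such as the Target Exon entries, pass through the same function with those fields returned as empty strings.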



R1: [...] percentile of lung tumor AIs divided by the 50th percentile of normal tissue AIs, where the 15th percentile of normal tissue AIs was subtracted from the



Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1

421502; AF111856; Hs.105039; solute carrier family 34 (sodium phosphate), member 2; Ribosomal_L20,Na_Pi_cotrans;TM=Y;; 24.06
,J ' 697: Hsi ESTs.Weakl lark) 03 r h protein 1 I [K |; none;SS=M; 21 

10 <57009; Hs.1C1 12 nui pglol ilinlambd ras; ig,HSP70,Pp» n P , TM=M I % 
421341; AJ243212; Hs.279611; deleted in malignant brain tumors 1; zona_pellucida,CUB,SRCR;SS=M; 16.99
452304; AA025385 Hs.61 11; E f U akly imil lo 13 »i ie f eir e [H (i i ] none,none; 16.67 
''29259; AW20450: 1 -ls.292911; ESTs. Highly similar la S6071 2 band-6-prctein [H.sapiens]; none.none; 16.50 
454034; NM.000691;Hs.575; aldehyde dehydrogenase 3 family rremiH i ;a id hi, 1 

408000; L11590; Hs.620; bullousjjemphigoid antigen 1 (230f240kD); emand 1 spectrin,GAS2,SH3 1 Pleclin,RA,XybseJsDm,FliD,bZIP,Tropomyosin,Myc-LZ,M,ldh.C,CH^IP3;TM=M;; 

421798; N74880; Hs.29877; N-acylsphingosine amidohydrolase (acid ceramidase)-like; SAPA,Surfactant_B,none; 14.18

439706; AW872527; Hs.59751; ESTs, Weakly similar to DAP1_HUMAN DEATH-ASSOCIATED PROTEIN 1 [H.sapiens]; none,none; 13.94

431846; BE019924; Hs.271580; uroplakin 1B; transmembrane4;TM=Y;SS=M; 13.54 

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; IL1;SS=M; 12.97 

444381; BE387335; Hs.283713; ESTs, Weakly similar to S64054 hypothetical protein YGL050w - yeast (Saccharomyces cerevisiae) [S.cerevisiae]; Collagen;TM=M;SS=M; 12.92

C0707; I -ls.624; interleukin 8; HLH,PAS,IL8;TM=M;; 12.76 
448133; AA723157; Hs.73769; folate receptor 1 (adult); Folate_rec,MIH;TM=M;SS=M; 12.50
414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); PA;TM=Y;; 12.12
436553; AW407157;H 18112; mm jtobjli rlr ' -0 Ppx-GppA;TM=M;: 12.00 

418738; AW388633; Hs.6682; solute carrier family 7 (cationic amino acid transporter, y system) member 11; none,none; 11.99
419693; AA133749; Hs.301350; FXYD domain-containing ion transport regulator 3; ATP1G1J>UlUWVT8,TW-\ S f 
417866; AW067903; Hs.82772; collagen, type XI. alpha 1; Cottager TIP rSF minin_G,CorA -"11 , 
414998; NM_002543; Hs.77729; oxidised low density lipoprotein (lectin-like) receptor 1; lectin_c;TM=Y;SS=M; 11.21
428970; BE276891; Hs.194691, retinoi i ind 3; 7tm 3;TM f1108 
418004; U37519; Hs.87539; aldehyde dehydrogenase 3 family, member B2; aldedh;TM=M;SS=M; 11.01
425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c;SS=M; 10.69
418478; U38945; Hs.1174; cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4); ank;; 10.65
439223; AW238299; Hs.250618; UL16 binding protein 2; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 10.52

441835; AB036432; Hs.184; advanced glycosylation end product-specific receptor; homeobox,Acyltransferase,notch,EGF,ank,Acyltransferase; 10.47

451558; NM.001089; Hs.25530; ATP-binding cassette, sub-family A (ABC1), memt-r AB l, e IM= SS=M iO 33 

443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=M; 10.21

452747; BE153855; Hs.61460; Ig superfamily receptor LNIR ig P t i p;TW=> 3 M 10.14 

417389; BE260964; Hs.82045; midkine (neurite growth-promoting factor 2); PTN.MK;TM=M;SS=Y; 10.13 

433091; Y12642; Hs.3185; lymphocyte antigen 6 complex, locus D; UPAR_LY6,toxin,Activin_recp;TM=M;SS=Y; 10.12

454098;W27953;Hs2^2 II C 1 Hi mi 1 r p rH k incrcre10 05 

414812; X72755; Hs.77367; monokine induced by gamma interferon; IL6;TM=M;SS=Y; 9.98

430832; AI073913; Hs. 100686; ESTs, Weakly simi a I in / adient-2 [H iPns]; none,none; 9.79 

422310; AA316622; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; none,pkinase,fn3,ig; 9.60 

414987; AA524394; Hs. 294022; hypothetca! protein FLJ i 5 31 T, f ?^ 

439453; BE264974; Hs.6566; thyroid hormone recept r mir , r,r \BC_tra, E;TM=M;;9 

430280; AA361258; Hs 237868, interleu' in pt, in o, e 9 43 
423217; NM_000094; Hs.1640; collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive); Kunitz_BPTI,fn3,vwa,Collagen,beta-lactamase;TM=M;SS=M;

418882; NM_004996; Hs.89433; ATP-binding cassette, sub-family C (CFTR/MRP), member 1; ABC_membrane,ABC_tran;TM=Y;SS=M; 9.32
435472; AW972330; Hs.283022; triggering receptor expressed on myeloid cells 1; ig;TM=M;SS=M; 9.26

447343; AA256B41; Hs.236894; ESTs, Highly simil 02392 alp '-macrogl lin recept p ir or [H sapiens], none,none, 9 18 
419508; AW997938; Hs.90786; ATP-binding cassette, sub-family C (CFTR/MRP), member 3; ABC_tran,ABC_membrane;TM=Y;SS=M; 9.06
441384; AA447849; Hs.288660; Homo sapiens cDNA; FLJ22182 fis, clone HRC00S53; 7tm_3,none; 8.86
446292; AF081497; Hs.279682; Rh type C glycoprotein; Ammonium n p.FecCC r" > SS=M;8.74 
436972; AA284679; Hs.25640; claudin 3; PMP22_Claudin;TM=Y;SS=M; 8.71 

421817; AF146074;Hs.108660; ATP-binding cassette, sub-family C ( F le r n Iran ABC membrane,GTP_EFTU;TM=M;SS=M; 8.71 

423354; AB01 1130; H 127'-36; call iumchannel, voltage dependent, alpha 2/delta subunit 2; vwa,Cache;TM=M;; 8.66 

439606; W79123; Hs.58561; G protein-coupled receptor 87; 7tm_1;TM=Y;SS=M; 8.63

438091; AW373062; ; nuclear receptor subfamily 1, group I, member 3; hormone_rec,zf-C4,none; 8.60

421506; BE302796 H 10 thymidii ki , soiub TKTM"M 8 57 

413278; BE563085; Hs.833; interferon-stimulated protein, 15 kDa; ubiquitin;SS=M; 8.56

408908; BE296227; Hs.250822; serine/threonine kinase 15; pkinase;SS=M; 8.52

414774; X02419; Hs.77274; plasminogen activator, urokinase; kringle,trypsin,plant_thionins;SS=M; 8.49

430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin;TM=M;; 8.42

413011; AW068115; Hs.821; biglycan; LRR,LRRNT;SS=M; 8.40 

446291; BE397753; Hs.14623; interferon, gamma-inducible protein 30; GILT;TM=M;SS=Y; 8.39

411089; AA456454; ; cell division cycle 2-like 1 (PITSLRE proteins); none,none; 8.37 

422765; AW409701; Hs.1578; baculoviral IAP repeat-containing 5 (survivin); BIR;TM=M;; 8.34

453922; AF053306; Hs.36708; budding uninhibited by benzimidazoles 1 (yeast homolog), beta; none;SS=M; 8.25 

449019; AI949095; Hs.67776; ESTs, Weakly similar to T22341 hypothetical protein F47B8.5 - Caenorhabditis elegans [C.elegans]; none,none; 8.24
409799; D11928; Hs.75845; phosphoserine phosphatase-like; Hydrolase;TM=M;; 8.22
416819; U77735; Hs.80205; pim-2 oncogene; pkinase;SS=M; 8.19 
451541; BE279383; Hs.26557; plakophilin 3; Armadillo_seg;TM=M;; 8.16 

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromosomes 4, yeast)-like 1; ABC_tran,M,SMC_N,SMC_C,DUF164,none; 8.16
429002; AW248439; Hs.2340; junction plakoglobin; Armadillo_seg;TM=M;; 8.14
445033; AV652402; Hs.72901; mucin 13, epithelial transmembrane; ank;; 8.14 
421757; Z20897; Hs.296259; paraoxonase 3; Arylesterase;SS=Y; 8.10 

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity la, receptor for (CD64); ig;TM=Y;SS=M; 8.03 
439285; AL133916; ; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 7.97

439738; BE246502; Hs.9598; sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic domain, (semaphorin) 4B; Sema,PSI,integrin_B;TM=Y;; 7.86

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a)-related kinase 2; pkinase;TM=M;; 7.85 
424779; AL046851; Hs.153053; CD37 antigen; tr i -rn, m 1 - f4 TM=Y SS=M, 7 85 

409340; BE174629; Hs.321130; hypothetical protein MGC2771; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;; 7.84

415323; BE269352; Hs.949; neutrophil ryl.,-r ir ,-^-tn 1 ~ c D hroni g nulomalou di: ,e, djlcsomal 2); SH3,TPR;TM=M;; 7.73 

427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, receptor for (CD16); ig;TM=Y;SS=M; 7.72

430378; Z29G72; Hs.2556; tumor necrosis factor receptor superfamily, member 17; IL2;SS=M; 7.71 

451253; H48299; Hs.26126; claudin 1C; PMP22_Claudin,Peptida^e_N let TUtet M 7t 

435575; AF213457; Hs.44234; triggering receptor expressed on myeloid cells 2; ig;TM=Y;SS=M; 7.70

427747, AW411 425, H 1f"" r t ine/tt - nine tinase 12; pkinase;TM=M;; 7.70 

425251; M24283; Hs.168383; intercellular adhesion molecule 1 (CD54), human rhinovirus receptor; ig,ICAM_N;TM=M;SS=M; 7.67
422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 7.64

413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kinase 4 mRNA, 3' untranslated region, partial sequence; SAM_PNT,none; 7.54 
424008; R02740; Hs.137555; putative chemokine receptor; GTP-binding protein; 7tm_1;TM=Y;SS=M; 7.52

418322; M284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase); Y_phosphatase,DSPc;TM=M;; 7.46 

421071; AI311238; Hs.104476; ESTs, Weakly similar to CGHU1E collagen alpha 1 (XI) chain precursor [H.sapiens]; none;TM=Y;SS=M; 7.40

421481; AW391972; Hs.104696; KIAA1324 protein; none;TM=M;SS=M; 7.39 

438089; W05391; ; nuclear receptor subfamily 1, group I, member 3; hormone_rec,zf-C4,none; 7.38

428484; AF104032; Hs.184601; solute carrier family 7 (cationic amino acid transporter, y system), member 5; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;; 7.36
448988; Y09763; Hs.22785; gamma-aminobutyric acid (GABA) A receptor, epsilon; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 7.35
416178; AI808527; Hs.192822; serologically defined breast cancer antigen NY-BR-81; none;TM=M;; 7.31
418506; AA084248; Hs.85339; G protein-coupled receptor 39; none,none; 7.25 
441553; AA281219; Hs.121295; ESTs; none,FG-GAP,integrin_A; 7.25 
422311; AF073515; Hs.114948; cytokine receptor-like factor 1; fn3;TM=M;; 7.21 

415817; U88967; Hs.78867; protein tyrosine phosphatase, receptor-type, Z polypeptide 1; fn3,Y_phosphatase,carb_anhydrase;TM=Y;SS=M; 7.20

438746; AI885815; Hs.184727; Human melanoma-associated antigen p97 (melanotransferrin) mRNA, 3' flank; transferrin,Guanylate_kin,PDZ,SH3; 7.20

412723; AA648459; Hs.335951; hypothetical protein AF301222; none;TM=M;; 7.14

418203; X54942; Hs.83758; CDC28 protein kinase 2; CKS;; 7.14 

428582; BE336699; Hs.185055; BENE protein; none;TM=Y;SS=M; 7.12 

418462; BE001596; Hs.85266; integrin, beta 4; fn3,integrin_B,Cal< b-t -( r T>'--' =M ) 

420344; BE463721; Hs.97101; putative G protein-coupled receptor; Methyltransf_5;TM=Y;SS=M; 7.02

.>••:•: i.i ■ t ;.■ ■ •.. •• -vi-': -i- k x>. K--i.pt c i: -? Tf.t*t 7.oo 

449230; BE613348; Hs.211579; melanoma cell adhesion molecule; ig,isodh,Ribosomal_L6,F-box;TM=Y;SS=M; 6.98

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens]; Furin-like,pkinase,Recep_L_domain,YLP,none; 6.97

421508; NM_004833; Hs.105115; absent in melanoma 2; PAAD_DAPIN,HIN;TM=M;; 6.96

11 F 1 II f i I p f polypeptide; ITAMJf f 

428479; Y00272; Hs.334562; cell division cycle 2. G1 toSandG2'.oM pkina . ICE i iO, i ',E_ P 20,TM=M,SS=M; 6.93 

1 532; AW1 07; Hs. 1461 i /pc | III 3;Armadilio_seg,HI T;TM M I 
451035; AU076785; Hs.430; plastin 1 (I isoform); efhand,CH,Adaptin_N;SS=M; 6.86

432407; AA221036; ; gb:zr03f1 2.r1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone S similar to SW:POL_BAEVM P10272 POL POLYPROTEIN ;, mRNA 

sequence; DEAD,helicase_C rm M r.CysJ I L,vw ,vwc vwd,IQ,Rlla,abhydrolase,TGF- 

beta,DUF139,TPR.DSPc,tsp 1,RibosomaLS21,rvp;TM=M;; 6.84 
442599; AF078037; Hs.324051; RelA-associated inhibitor; SH3,ank;TM=M;; 6.77
443243 AW 369771 H £ it in, bet teg B ncre 6 76 

427557; NM_002659; Hs.179657; plasminogen activator, urokinase receptor; UPAR_LY6,ET,PLA2_inh;SS=M; 6.75
418054; NM_002318; Hs.83354; lysyl oxidase-like 2; SRCR,Lysyl_oxidase;TM=M;SS=M; 6.74

426440; BE382756; Hs.159902; solute carrier family 2 (facilitated glucose transporter), member 1; sugar_tr;TM=Y;SS=M; 6.73
430397; AI924533; Hs.105607; bicarbonate transporter related protein 1; HCO3_cotransp;TM=Y;; 6.71
449523; NM 000679; Hs.54443; chemokine (C-C motif) receptcr 5, 7ir, 1 fM SS M ''71 

431630; NM_0022C - . integrin Iph int'gen CD alp n ' s t nit fVLA-3 receptor); FGGAP,Rhabd_glycop,integrin_A;TM=Y;SS=M; 6.70 
410434, AF051 1 2 H 6366 [ol lit leceplor 2 LI P LRRCTTIR TM-M SS-tvt; 6.69 
5 192 MOO h j mv I! nucl diff tiation antigen; PAAD DAPI ri'i I 

431890; X17033; Hs.271988; integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor); vwa,integrin_A,FG-GAP;TM=Y;SS=M; 6.65

57 I <" I Hs.1 * hexokinase 2; hexokinase.hexokinase2,none; 5.64 
430770; AA765694; Hs.123296; ESTs; none,none; 6.63

412270; AC005262; Hs.73797; guanine nucleotide binding protein (G protein), alpha 15 (Gq class), G alpha crfiTM-Ms 6.59 
439750; AL359053; Hs.57664; Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2335735; IMPDH_C,IMPDH_N,CBS,integrin_B,Ricin_B_lectin; 6.59
427700; AA262294; Hs.180383; dual specificity phosphatase 6; Rhodanese,DSPc;TM=M;; 6.59
413048;M93221;Hs.75182:mannosereceptor,Ctypei fn2,lectin_c,Ricin BJectin linl TM=Y,SS=M; 6.58 
429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAM; 6.58

416110; Z42262; Hs.322844; hypothetical protein DKFZp564A176; Sema,PSI,TIG,integrin_B;TM=Y;SS=M; 6.58
418883; BE387036; Hs.1211; acid phosphatase 5, tartrate resistant; Metallophos;TM=M;SS=M; 6.57

426746; J03625 H 157 iridine monophosphate synthetase (orolale phosphoribosy! transferase and o'otidine-5'-decarboxylase); Pribosyllran,OMPdecase;TM=M;; 6.57 
402260; ; ; NM_001436*:Homo sapiens fibrillarin (FBL), mRNA. transciipti (F8A) mRNA.; pkinase.Fibrillarin.none; 6.56 

456373; BE247706; Hs.89751; membrane-spanning 4-domains, subfamily A member 2 (Fc fragment of IgE, high affinity I, receptor for; beta polypeptide); none;TM=Y;; 6.53

444006; BE395085; Hs.10086; type I transmembrane protein Fn14; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 6.53
411027; AF072099; Hs.67846; leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 4; inositol_P,ig;TM=M;; 6.52

435523; T62849; Hs.11090; membrane-spanning 4-domains, subfamily A, member 7; none;TM=Y;SS=M; 5.52 

432920; U37689; Hs.3128; polymerase (RNA) II (DNA directed) polypeptide H; none;TM=M;; 6.43
HI I- rrl in Hi ' I r rM > 1 

409208; Y00093; Hs.51077; integrin, alpha X (antigen CD11C(p150), alpha polypeptide); vwa,FG-GAP,integrin_A,vwa,integrin_A,FG-GAP; 6.43 
424441; X14850; Hs.147097; H2A histone family, member X; histone,CBFD_NFYB_HMF;; 6.43

418918; X07871; Hs.89476; CD2 antigen (p50), sheep red blood cell receptor; ig;TM=Y;SS=M; 6.41

413219; AA878200; Hs.118727; Homo sapiens cDNA FLJ13692 fis, clone PLACE2000103; HLH,death,TNFR_c6,Acyl-CoA_hydro; 6.41

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; Rhodanese,DSPc,Y_phosphatase,Ribosomal_S3_N;TM=M;; 6.39

453914; NM_000507; Hs.574; fructose-1,6-bisphosphatase 1; FBPase;TM=M;; 6.37
424273; W40460; Hs.144442; phospholipase A2, group X; ph(»1c;TM=l*SG-Y; 6.37

428385; AF112213;Hs.184062 putaive Ratine ing p-otei I H3.SS=M u 

432636; AA340864; Hs.278562; claudin 7; PMP22_Claudin;TM=Y;SS=M; 6.34

409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; DSPc,Rhodanese,none; 6.34

451734; NM 006176; Hs. 26944; neurogranin (protein kinase Csub Irate I 1); IC,7tm_ TM-M 6 3 
443907; AU076484; Hs.9963; TYRO protein tyrosine kinase binding protein; none;TM=M;SS=\ 6.34

401027; ; ; Target Exon; none,none; 6.26 

418299; AA279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), lymphocyte function-associated antigen 1; macrophage antigen 1 (mac-1) beta subunit);
integrin_B.EG rM=Y;SS=M; 5.2 

429732; U20158; Hs.2488; lymphocyte cytosolic protein 2 (SH2 domain-containing leukocyte protein of 76kD); SH2;SS=M; 6.21 
408113; T82427; Hs.194101; Homo sapiens cDNA: FLJ20869 fis, clone ADKA02377; 7tm_2,none; 6.20

408771; AW732573; Hs.47584; potassium voltage-gated channel, delayed-rectifier, subfamily S, member 3; efhand,ion_trans,K_tetra,none; 6.19

456534; X91195; Hs.100623; phospholipase C, beta 3, neighbor pseudogene; LIM,PDZ,pkinase;SS=M; 6.18 

408482; NM_000676; Hs.45743; adenosine A2b receptor; 7tm_1;TM=Y;SS=M; 6.17

426427; M86699; Hs.169840; TTK protein kinase; pkinase;; 6.17 
445019; AI20554C; Hs.281295; ESTs; none,none; 6.15

438552; AJ245820; Hs.6314; type I transmembrane receptor (seizure-related protein); none,none; 6.16

414907; X90725; Hs.77597; polo (Drosophila)-like kinase; Ribosomal_L37ae,pkinase,POLO_box,tRNA-synt_1b,dynamin,dynamin_2,GED,bZIP,M;; 6.14
425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic polypeptide; PI3_PI4_kinase,FAT,FATC;TM=M;; 6.13
417421; AL138201; Hs.82120; nuclear receptor subfamily 4, group A, member 2; hormone_rec,zf-C4;SS=M; 6.13 
425776; U25128; Hs.159499; parathyroid hormone receptor 2; 7tm_2,HRM;TM=Y;SS=M; 6.12

422278; AF072873; Hs.114218; frizzled (Drosophila) homolog 6; Fz,Frizzled,7tm_2;TM=Y;SS=M; 6.12

427490; Z95152; Hs.178695; mitogen-activated protein kinase 13; pkinase;TM=M; 6.12

421445; AA913059; Hs.104433; Homo sapiens, clone IMAGE:4054868, mRNA; ion_trans,K_tetra,asp; 6.11

444143; AW747996; Hs.160999; ESTs, Moderately similar to A56194 thromboxane A-2 receptor, endothelial [H.sapiens]; Bcl-2,none; 6.10
423887; AL080207; Hs.134585; DKFZP434G232 protein; ABC_tran;TM=Y;; 6.10

409636; AA305729; Hs.18272; amino acid transporter system A1; Aa_trans;TM=Y;; 6.09

411020; NM_006770; Hs.67726; macrophage receptor with collagenous structure; SRCR,Collagen;TM=Y;SS=M; 6.09

425354; U62027; Hs.155935; complement component 3a receptor 1; 7tm_1;TM=Y;SS=M; 6.08 

439963; AW247529; Hs.6793; platelet-activating factor acetylhydrolase, isoform Ib, gamma subunit (29kD); PAF-AH_Ib,Lipase_GDSL;TM=M;; 6.07
421753; BE314828; Hs.107911; ATP-binding cassette, sub-family B (MDR/TAP), member 6; ABC_tran,ABC_membrane;TM=Y;SS=M; 6.07
405908; Z25437; ; gb:H.sapiens protein-tyrosine kinase gene, complete CDS.; none,none; 6.07
425849; AJ000512; Hs.296323; serum/glucocorticoid regulated kinase; pkinase,pkinase_C;TM=M;SS=M; 5.06

452363; AI582743; Hs.94953; Homo sapiens, Similar to complement component 1, q subcomponent, c polypeptide, clone MGC:17279, mRNA, complete cds; C1q,Collagen;SS=M;

414883; AA926960; ; CDC28 protein kinase 1; CKS;; 6.05

414166; AW888941; Hs.75789; N-myc downstream regulated; DEAD,hclicasc_C,rrm,Ndr,Cys.knol,TIL,vwa,vwc,vwd,IQ,Rlla,abhydrolase,TGF- 
beta,DUF139,TPKDSPc,tspJ,R ; bosomaLS21,ivp,TV4.!„5Q3 

452888; AW955454; Hs.30942; ephrin-B2; Ephrin,fn2;TM=Y;SS=M; 6.03 

448782; AL050295; Hs.22039; KIAA0758 protein; 7tm_2,ig,GPS,SEA;TM=Y;; 6.03
449101; AA205847; Hs.23016; G protein-coupled receptor; 7tm_1;TM=Y;SS=M; 6.01

445462; AA378776; Hs.288649; hypothetical protein MGC3077; none;; 6.00 

424381; AA285249; Hs.146329; protein kinase Chk2; pkinase,FHA,DnaJ;TM=M;; 6.00 

420162; BE378432; Hs.95577; cyclin-dependent kinase 4; pkinase;TM=M;; 5.99 

439310; AF086120; Hs.102793; ESTs; casein_kappa,pkinase,ig,none; 5.97
414972; BE263782; Hs.77695; KIAA0008 gene product; GKAP;TM=M;; 5.97

425976; C75094; Hs.334514; NG22 protein; voltage_CLC;TM=Y;SS=M; 5.94 

444946; AW1 3920= H I5645 I, n l al protein FU22408; abhydrolase,abhydrolase_2;TM=Y;SS=M; 5.93 
411263; BE297802; Hs.69360; kinesin-like 6 (mitotic centromere-associated kinesin); kinesin;TM=M;; 5.93 
421462; AF016495; Hs.104624; aquaporin 9; MIP;TM=Y;SS=M; 5.92
426761; AI015709; Hs.172089; Homo sapiens mRNA; cDNA DKFZp586I2022 (from clone DKFZp586I2022); none;TM=Y;SS=M; 5.92
t r t K l5;Hs.39384; putative seci g nd homoog istofjxt cr TM-f - )1 
428771; AB028992; Hs.193143; KIAA1059 protein; C2,PI-PLC-Y,PI-PLC-X;TM=M;; 5.91
438564; AA381553; Hs.198253; major histocompatibility complex, class II, DQ alpha 1; ig,MHC_II_alpha,none; 5.91
440006; AK000517; Hs.6844; hypothetical protein FLJ20510; AAA,NB-ARC,PAAD_DAPIN;NA;NA; 5.90
449027; AJ271216; Hs.22830; dipeptidylpeplidase III; Peptidase_M49,EGF ig Kpur jii TM=" 
408790; AW580227; Hs.47860; neurotrophic tyrosine kinase, receplor, type 2; ig.pknaseiRRLRRNT.LRRCTJM SS=M 
413186; AU077141 H 75 i le carrier family 16 (i a u i an porters), member 1;sugar_tr;TM=Y;SS=M; 5.89 

430696; AA531276; Hs.59509; ESTs; pkinase,PP2C,none; 5.88

422609; Z46023; Hs.118721; sialidase 1 (lysosomal sialidase); BNR,SH2,SH3,pkinase;TM=Y;SS=M; 5.88

425367; BE271 188 H 15 J prolein jiosine phosphatase, receptor type, C-assooiated protein; none;TM=M;SS=Y; 5.88 

429619; AL120751 ; Hs.21 1568; eukaryotic translation initiation factor 4 gamma l;i r non 

437429; H79981; Hs.5613; Homo sapiens mRNA; cDNA DKFZp564E2222 (from clone DKFZp564E2222); SH2,SH3,BTB; 5.86 
436576; AI458213; Hs.77542; ESTs; 7tm_1,DnaJ; 5.85

429663; M68874; Hs.211587; phospholipase A2, group IVA (cytosolic, calcium-dependent); C2,PLA2_B;TM=M;; 5.85
419981; AA897581; Hs.128773; ESTs; pkinase,DAG_PE-bind,pkinase_C,OPR,none; 5.83

428953; AA306610; Hs.348183; tumor necrosis factor receptor superfamily, member 6b, decoy; 60s_ribosomal,Ribosomal_L1D,TNFR_c6,DEAD;; 5.83 
414806; D14694; Hs.77329; phosphatidylserine synthase 1; PSS;TM=Y;SS=M; 5.82 
451320; AW118072; ; diacylglycerol kinase, zeta (104kD); none;TM=M;; 5.82
400991; ; ; Target Exon; Armadillo_seg,lectin_c,none; 5.81 

456906; AF117646; Hs.156637; Cas-Br-M (murine) ecotropic retroviral transforming sequence c; zf-C3HC4,Cbl_N,Cbl_N2,Cbl_N3;TM=M;; 5.81
434263; N34895; Hs.44648; ESTs; ig,none; 5.81

428293; BE250944; Hs.183556; solute carrier family 1 (neutral amino acid transporter), member 5; eIF6,SDF;TM=M;; 5.78
421959; AW751497; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; p450;TM=Y;SS=M; 5.78
449539; W80363; Hs.58446; ESTs; pkinase,Furin-like,Recep_L_domain,none; 5.77 
109012; Al Ik.49725; D 11 i I I M i\ FM f ' -i f e 

412276; BE262621; Hs.73798; macrophage migration inhibitory factor (glycosylation-inhibiting factor); MIF,sugar_tr,none; 5.75
409533; AW969543; Hs.21291; mitogen-activated protein kinase kinase kinase 13; Peptidase_C4B,none; 5.73
457001; J03258; Hs.2062; vitamin D (1,25-dihydroxyvitamin D3) receptor; hormone_rec,zf-C4,Metallothio_5;TM=M;; 5.73
416084; L16991; Hs.79006; deoxythymidylate kinase (thymidylate kinase); none,none; 5.72

448569; BE382657; Hs.21486; signal transducer and activator of transcription 1, 91kD; SH2,STAT,STAT_bind,STAT_prot;TM=M;; 5.72 
452295; BE379936; Hs.28866; programmed cell death 10; serpin,none; 5.72 

448775; AB025237; Hs.388; nudix (nucleoside diphosphate linked moiety X)-type motif 1; NUDIX;TM=M;SS=M; 5.72
448733; NM_005629; Hs.187958; solute carrier family 6 (neurotransmitter transporter, creatine), member 8; SNF;TM=Y;; 5.71

417015 M83772 Hs 80876: fl.wn containing monoojcyger.^e 3. Ff.C-lKi .-. . - ■ • ,"M=Y.SS=M; 5.69 

453323; AF034102; Hs.32951; solute carrier family 29 (nucleoside transporters), member 2; Nucleoside_tran;TM=Y;SS=M; 5.69

410290; AA402307; Hs.322844; hypothetical protein DKFZp564A176; Sema,PSI,TIG,integrin_B;TM=Y;SS=M; 5.69 

412182; AA205588; Hs 155 M plicing r < r re 4 r r "1 ugarjr;; 5.69 

418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxylic acid transporters), member 3; none;TM=Y;SS=M; 5.66

447250; AI878909; Hs.17883; protein phosphatase 1G (formerly 2C), magnesium-dependent, gamma isoform; PP2C;TM=M;; 5.65

438113; AI467908; Hs.8882; ESTs; 7tm_1,none; 5.65 

421391; AW304350; Hs.191958; immunoglobulin superfamily receptor translocation associated 2; ig,none; 5.54

417115; AW952792; Hs.334612; small nuclear ribonucleoprotein polypeptide E; Sm,pkinase;; 5.64
406137; ; ; NM_000179*:Homo sapiens mutS (E.coli) homolog 6 (MSH6), mRNA. VERSION NM_000178.1 GI; MutS_C,PWWP,MutS_N;TM=M;; 5.63

421917; AB028943; Hs.109445; KIAA1020 protein; BTB,zf-C2H2,PI3_PI4_kinase,PI3Ka;TM=M;; 5.52 

445873; AA250970; Hs.251946; poly(A)-binding protein, cytoplasmic 1-like; PABP,rrm,pkinase,14-3-3; 5.52

447365; BE383676; Hs.334; Rho guanine nucleotide exchange factor (GEF) 5; SH3,PH,RhoGEF;TM=M; 5.61

446872; X97058; Hs.16362; pyrimidinergic receptor P2Y, G-protein coupled, 6; 7tm_1;TM=Y;SS=M; 5.59
45 \> 662:W07162;Hs.15082 CAT> 3p rtein .a \DC I n ail TM-M,SS=M, 5.59 

449029; N28989; Hs.22891; solute carrier family 7 (cationic amino acid transporter, y+ system), member 8; aa_permeases;TM=Y;SS=M; 5.58

431236; AV656840; Hs.285115; interleukin 13 receptor, alpha 1; fn3;TM=Y;SS=M; 5.57

430508; AI015435; Hs.104637; ESTs; SDF;TM=Y;SS=M; 5.56

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mRNA, 3'UTR; 7tm_1;TM=Y;SS=M; 5.55 
421677; H64092; Hs.38282; ESTs; Alpp,Armadillo_seg,IBB; 5.54

429083; Y09397; Hs.227817; BCL2-related protein A1; Bcl-2;TM=M;; 5.54 

429563; BE619413; Hs.2437; eukaryotic translation initiation factor 2B, subunit 5 (epsilon, 82kD); hexapep,W2,hormone2,DUF29;TM=M;; 5.52

412817; AL037159; Hs.74619; proteasome (prosome, macropain) 26S subunit, non-ATPase, 2; PC_rep;TM=M;; 5.51 

452291; AF015592; Hs.28853; CDC7 (cell division cycle 7, S. cerevisiae, homolog)-like 1; pkinase;TM=M;; 5.51
437412; BE069288; Hs.34744; Homo sapiens mRNA; cDNA DKFZp547C136 (from clone DKFZp547C136); ABC_tran,GTP_EFTU,ABC_membrane,none; 5.50

423778; Y09267; Hs.132821; flavin containing monooxygenase 2; FMO-like,pyr_redox;TM=Y;SS=M; 5.48

422846; BE513934; Hs.1583; neutrophil cytosolic factor 1 (47kD, chronic granulomatous disease, autosomal 1); SH3,PX;TM=M;; 5.48 

434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis, clone HEMBB1001304; Nucleoside_tra2,none; 5.48

426691; NM_006201; Hs.171834; PCTAIRE protein kinase 1; pkinase;TM=M;; 5.48
453905; NM_002314; Hs.36566; LIM domain kinase 1; pkinase,LIM,PDZ,zf-PARP;TM=M;; 5.48

412939; AW411491; Hs.75069; eukaryotic translation elongation factor 1 gamma; none,none; 5.44

430486; BE062109; Hs.241551; chloride channel, calcium activated, family member 2; none;TM=Y;SS=M; 5.43 

430065; AI929659; Hs.237825; signal recognition particle 72kD; TPR,AIRC,SAICAR_synt; 5.40

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor type, C; kinesin,fn3,Y_phosphatase;TM=M;; 5.40
411825; AK000334; Hs.72289; hypothetical protein FLJ20327; SNF,Zip;TM=Y;; 5.36

400205; ; ; NM_006265*;Homo sapiens RAD21 (S. pombe) homolog (RAD21), mRNA.(APO-1/CD95 (Fas)-associated phosphatase) (PTPN13), mRNA.; DUF173;SS=M; 5.35 

410687; U24389; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;SS=M; 5.34

407786; AA687538; Hs.38972; tetraspan 1; transmembrane4;TM=Y;SS=M; 5.34 

425118; AU076611; Hs.154672; methylene tetrahydrofolate dehydrogenase (NAD dependent), methenyltetrahydrofolate cyclohydrolase; myb_DNA-binding,THF_DHG_CYH,THF_DHG_CYH_C,CAP_GLY,AAA,LON,Peptidase_C9,bZIP,M,xan_ur_permease,HCO3_cotransp;TM=M;; 5.32

400210; ; ; Eos Control; Adap_comp_sub,Clat_adaptor_s;TM=M;; 5.32

414825; X06370; Hs.77432; epidermal growth factor receptor (avian erythroblastic leukemia viral (v-erb-b) oncogene homolog); Furin-like,pkinase,Recep_L_domain;TM=M;SS=M; 
5.31 

414035; Y00630; Hs.75716; serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 2; serpin;SS=M; 5.30
416000; R82342; Hs.79856; ESTs, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens]; none,sugar_tr; 5.30

414368; W70171; Hs.75939; uridine monophosphate kinase; PRK,CoaE;; 5.29 

424321; W74048; Hs.1765; lymphocyte-specific protein tyrosine kinase; SH2,SH3,pkinase;TM=M;; 5.29 

450296; AL041949; Hs.24756; hepatocyte growth factor-regulated tyrosine kinase substrate; none,none; 5.29

456672; AK002016; Hs.114727; Homo sapiens, clone MGC:16327, mRNA, complete cds; none,PK,PK_C,myosin_head,RhoGAP; 5.28
410068; AI633888; Hs.58435; FYN-binding protein (FYB-120/130); SH3;TM=M;; 52fJ

456629; AW891965; Hs.279739; histone deacetylase 3; HSP90,HATPase_c,zf-C2H2,PHD,none; 5.27 

417218; AA005247; Hs.285754; met proto-oncogene (hepatocyte growth factor receptor); pkinase,Sema,PSI,TIG,integrin_B;TM=Y;SS=M; 5.26
444051; N48373; Hs.10247; activated leucocyte cell adhesion molecule; none,none; 5.26
404083; ; ; C6002159*:gi|628027|pir||A53593 protein-tyrosine-phosphatase (EC 3.1.3.48). nonreceptor ty; none;SS=M; 5.26
422051; AW327546; Hs.111024; solute carrier family 25 (mitochondrial carrier; citrate transporter), member 1; mito_carr;TM=M;; 5.26
419034; NM_002110; Hs.89555; hemopoietic cell kinase; SH2,SH3,pkinase;TM=M;; 5.26
427732; NM_002980; Hs.2199; secretin receptor; 7tm_2,HRM;TM=M;SS=M; 5.25

5921; NM I ; He. 1 11 I family 6 ( ti u ans| ) t 14 c I Tt« , 

448030; N30714; Hs.325960; membrane-spanning 4-domains, subfamily A, member 4A; none;TM=Y;SS=M; 5.24 
441607; NM_005010; Hs.7912; neuronal cell adhesion molecule; WD40,fn3,ig;TM=M;; 5.23
446620; AA128808; Hs.179902; transporter-like proter i rM=Y M; 5,23 
422616; BE300330; Hs.118725; selenophosphate synthetase 2; AIRS,AIRS_C;TM=M;; 6.23
447131; NM_004585; Hs.17466; retinoic acid receptor responder (tazarotene induced) 3; none;TM=Y;; 5.21
446272; BE268912; Hs.14601; hematopoietic cell-specific Lyn substrate 1; SH3,HS1_rep;TM=M;; 5.20
450447; AF212223; Hs.25010; hypothetical protein P15-2; NTF2;TM=M;; 5.19 

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease (APEX nuclease)-like 2 protein; Troponin,Exo_endo_phos,IQ;TM=M;; 5.19
446636; AC002563; Hs.1 576:'; citron (rho-interacting, serine/threonine kinase 21); CNH,DAG_PE-bind,PH,Involucrin,M;TM=M;; 5.19
434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 5.19 
447081; Y13896; Hs.17287; potassium inwardly-rectifying channel, subfamily J, member 15; IRK;TM=Y;; 5.19

407949; W21874; Hs.247057; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; Ribosomal_S14,ank,pkinase,death,none; 5.18
142200 AW590572 H 235768 ESTs none.none 5 II 

446566; H95741; Hs.17914; membrane-spanning 4-domains, subfamily A, member 6A; none;TM=Y;SS=M; 5.18
452690; AI536070; Hs.15085; ESTs; pou,homeobox,lig_chan,ANF_receptor; 5.18

419138; 048508; Hs.89631; ryanodine receptor 1 (skeletal); ion_trans,SPRY,RYDR_ITPR,RyR,MIR;TM=Y;; 5.17

431441; 081961; Hs.2794; sodium channel, nonvoltage-gated 1 alpha; ASC;TM=Y;; 5.16 

418945; BE246762; Hs.89499; arachidonate 5-lipoxygenase; lipoxygenase,PLAT;TM=M;; 5.15

407601; AC002300; Hs.37129; sodium channel, nonvoltage-gated 1, beta (Liddle syndrome); ASC;TM=Y;SS=M; 5.15 

429500; X78565; Hs.289114; hexabrachion (tenascin C, cytotactin); EGF,fn3,fibrinogen_C,toxin_2,Keratin_B2;TM=M;SS=Y; 5.15

411984; NM_005419; Hs. 72988; signal transducer and activator oP r a r r n3kD i r \T„bind 1 prot;Tf M l 

433470; AW960564; ; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 5.14 

452817; AA322859; Hs.284275; Homo sapiens PAK2 mRNA, complete cds; pkinase,PBD;TM=M;; 5.14

453102; NM_007197; Hs.31664; frizzled (Drosophila) homolog 10; Fz,Frizzled,7tm_2;TM=Y;SS=M; 5.14

427792; M63928; Hs.180841; tumor necrosis factor receptor superfamily, member 7; SRP14,TNFR_c6;SS=M; 5.14

430563; AA481269; ; ATP-binding cassette, sub-family C (CFTR/MRP), member 5; ABC_tran,GTP_EFTU,ABC_membrane,none; 5.13

431681; AK000378; Hs.267566; hypothetical protein FLJ20371; sugar_tr;TM=Y;; 5.12

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endoplasmic reticulum protein retention receptor 3; ER_lumen_recept;TM=M;SS=M; 5.12

417771; AA804698; Hs.82547; retinoic acid receptor responder (tazarotene induced) 1; none,none; 5.11

418613; AA744529;Hs.86575;mitogen-activated protein kma-ekim e t r kinase 1 kina CMH;TM-M li 

409524; AW402151; Hs.54673; tumor necrosis factor (ligand) superfamily, member 13; TNF;TM=Y;SS=M; 5.11

6856; AI469355; Hs.1 0 EST pkina rrm;TM=M 09 
411296; BE207307; Hs.10114; growth suppressor 1; 2OG-FeII_Oxy;TM=M;SS=M; 5.09
410082; AA081594; Hs.158311; Musashi (Drosophila) homolog 1; rrm;TM=M;;5.09 

404440; ; ; NM_021048:Homo sapiens melanoma antigen, family A, 10 (MAGEA10), mRNA. VERSION NM_021049.1 GI; MAGE;TM=M;; 5.08

424977; AA349289; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT05531; none,none; 5.08

422100; AI096988; Hs.111554; ADP-ribosylation factor-like 7; arf,ras;TM=M;; 5.07

452222; AW806287; Hs.21432; SEX gene; Sema,TIG,PSI,GDI; 5.07

430300; O6080S; Hs.238648; oncostatin M receptor; fn3;TM=Y;SS=M; 5.07

408369 R38438 H 1825 , ariiei famil r H ar purttr) member 2; PTR2;TM-Y;; 5.07 

422112; BE540240; Hs.111783; Lsm1 protein; Sm,BAG;SS=M; 5.05

449961 AV265"34 Hs. 133100; ESI i Furin-like.F cep_L domain.none; 5.06 

430024; AI808780; Hs.227730; integrin, alpha 5; integrin_A,FG-GAP;TM=Y;SS=M; 5.06

412641; M16660; Hs.74335; heat shock 90kD protein 1, beta; HSP90,HATPase_c;TM=M;; 5.05 

437608, AA761 605 Hs.292308 I I Weakly i <l LU1 .HUMAN i I SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY IH.sapiens]; pkinase,RI01,none; 
5.05 

400296; AA305627; Hs.139336; ATP-binding cassette, sub-family C (CFTR/MRP), member 4; ABC_tran,ABC_membrane;TM=Y;; 5.04
446232; AI281848; Hs.194691; retinoic acid induced 3; 7tm_3,none; 5.04
425262; 087119; Hs.155418; GS3955 protein; pkinase;SS=M; 5.04

414703; BE243877; Hs.75941; ATPase, Na+/K+ transporting, beta 3 polypeptide; Na_K-ATPase;TM=Y;SS=M; 5.03

434808; AF155108; Hs.256150; Homo sapiens, Similar to RIKEN cDNA 281 002701 9 gene, clone MGC:14827, mRNA, complete cds; none;TM=M;; 5.03 
425852; AK001504; Hs.159651; death receptor 6, TNF superfamily member 21; death,TNFR_c6;TM=Y;SS=M; 5.03
449437; AI702038; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT05581; none,none; 5.03

448913; AA194422 H 2 1; m ' firth' C SAM PNT,ABC2Jr,embrane,myosinJlead,IQ,Myosin_N,bZIP,zf- 

C2H2,PHD,BTB,TFIIS,ATJlook,SAM■,TM=M;;5.02 
413441; AI929374; Hs.75367; Src-like-adapter; SH2,SH3;TM=M;; 5.02

427618; NM 000760; Hs.2175; colony stimulating factor3receptor(granulo-:,te) in TM d M )2 

417666; AI345001; Hs.82380; menage a trois 1 (CAK assembly factor); zf-C3HC4;TM=M;; 5.02 

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, regulatory subunit 1 (p35); CDK5_activator,none; 5.01 

445333; BE537641; Hs.44278; hypothetical protein I I] imil Ir rel iroten RAB17; ras,arf,TK;SS=M; 5.01 

426285; U20620; Hs.343581; karyopherin alpha 1 (importin alpha 5); Armadillo_seg,IBB;TM=M;; 5.01

421233; AA209534; Hs.284243; tetraspan NET-6 protein; transmembrane4;TM=Y;SS=M; 5.01

424517; AI539443; Hs.137447; Homo sapiens cDNA FLJ12169 fis, clone MAMMA1000643; SH2,STAT,STAT_bind,STAT_prot,none; 5.00

425345; AU077297; Hs.155894; protein tyrosine phosphatase, non-receptor type 1; Y_phosphatase,DSPc;TM=M;SS=M; 5.00

446946; AI878932; Hs.317; topoisomerase (DNA) I; Topoisomerase_I,Topoisomer_I_N,RrnaAD,Hanta_nucleocap;TM=M;; 4.99

413900; AW409747; Hs.75612; stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein); TPR,PDZ,WW,Guanylate_kin;TM=M;; 4.93

412116; AW402166; Hs.784; Epslein-Barr virus induced gene2 (lymphocyte-speoinc G protein-coupled receptor) 7tm l;TM 1 M 

400792;AA635062;;HomosapiensmRNA;cDNADKFZp434O051 r i'-ri ir-tr r 4 051 1 z Z HC4 CARD,BIR;TM=M;; 4.98 

417018; M16038; Hs.80887; v-yes-1 Yamaguchi sarcoma viral related oncogene homolog; SH2,SH3,pkinase;TM=M;; 4.98 

427247; AW504221; Hs.174103; integrin, alpha L (antigen CD11A (p180), lymphocyte function-associated antigen 1; alpha polypeptide); vwa,integrin_A,FG-GAP;TM=Y;SS=M; 4.98

A 4 61;Hs.4 65; EST rt 1.9 
454042; H22570; ; hypothetical protein FLJ20093; ig,pkinase,LRR,LRRNT,LRRCT,none; 4.97
452698; NM_001295; Hs.301921; chemokine (C-C motif) receptor 1; 7tm_1;TM=Y;SS=M; 4.96
416276; U41060; Hs.79136; LIV-1 protein, estrogen regulated; Peptidase.C4,Osteopontin,Zip;TM=Y;SS=M; 4.96 
408847; AW290997; Hs.30348; ESTs; pkinase,ig,none; 456 

419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; ig,pkinase;TM=Y;SS=M; 4.95 
450737; AW007152; Hs.203330; ESTs; trypsin,ldl_recept_a,none; 4.95

443354; AW970672; Hs.9247; protein kinase, AMP-activated, alpha 1 catalytic subunit; pkinase,RI01;TM=M;; 4.94 
414135; NM_004419; Hs.2128; dual specificity phosphatase 5; Rhodanese,DSPc,Y_phosphatase;TM=M;; 4.94
424247; X14008; Hs.234734,lysozyme (renal amy oid ); lys,ig,FAD_Syr,th,ld c ^tinase SS=M; 4.94 

1 l HI E r ml 698 j r r c n H r HIM 

418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fusin); 7tm_1,7tm_2;TM=Y;SS=M; 4.93
5 408716;AI567839;Hs.151714;HomosapiensmRNAforKIAA1769 proteiri partial. J ■ " . i ri hel[case,RNB,Runl;TM=M;;4.93 
426437; BE0 7 G5 i ri 1 j i iitin-conjugat : ng nzyme E2L 6; Armadi!lo_seg,UQ.con,none; 4.92 

424241; AW995948; Hs.182339; Homo sapiens pyruvate dehydrogenase kinase 4 mRNA. 3' untranslated region, partial sequence; Ets,SAM_PNT;TM=M;; 4.92 
1115 7 0Y00 H I ike g in r 2 receptor; t I 6 r T " M SS=M;4.92 

407239; AA076350. Hs G 7 11 teik 1 immum lob li Ike re toi bfa ly B (wit TM ind I IMd uns) member 4; ig;TM=Y;SS=M; 4.92 
409512; AW979187; Hs.293591; melanoma differentiation associated protein-5; DEAD,helicase_C,CARD;TM=M;; 4.91
416714, AF283 7 /0 I- 79 » D 3 antigen (immunoglobulin-associated alpha); ig,ITAM,Zn_clus;TM=Y;SS=M; 4.91 

404289; ; ; NM_002944*:Homo sapiens v-ros avian UR2 sarcoma virus oncogene homolog 1 (ROS1), mRNA.; fn3,pkinase,DUF139;TM=Y;SS=M; 4.90 

428141; D50402; Hs.182611; solute carrier family 11 (proton-coupled divalent metal ion transporters), member 1; Nramp;TM=Y;; 4.90
.■■■■./.■■.-■-.-. h- ..[I. •■■:.,:.[ J I V;. ■ i - r .- -■ 1 l if.--", =i J ■ 

432179; X75208; Hs.2913; EphB3; EPH_lbd,fn3,pkinase,SAM;TM=Y;SS=M; 4.89

401083; ; ; NM_016582*:Homo sapiens peptide transporter 3 (LOC51296), mRNA. VERSION NM_016579.1 GI; PTR2;TM=Y;SS=M; 4.89
402211; AA811738; ; KIAA0430 gene product; ion_trans,K_tetra;TM=Y;; 4.88

421541; If tCl ' r ' ,. * mal prote'n S6k'r,ase, 90kD, polypeptide 4; pkinase,pkinase_C;TM=M;; 4,87 

431810; X67155; Hs.270845; kinesin-like 5 (mitotic kinesin-like protein 1); kinesin;TM=M;; 4.86 
425295; AA431366; Hs.37251; ESTs; pkinase,none; 4.86

424439; AA57953I Hs.17 70. liga: e I, DNA, ATP-dependent; DNAJigase;; 4,86 

419168; AI336132; Hs.33718; Homo sapiens cDNA FLJ12641 fis, clone NT2RM4001953; none,none; 4.86

442875; BE623003; Hs.23625; Homo sapiens clone TCCCTA00142 mRNA sequence; K_tetra,DUF51,none; 4.86

425465; L18964; Hs.1904; protein kinase C, iota; pkinase,DAG_PE-bind,pkinase_C,OPR;TM=M;;4.86 
410293; AK000047; Hs.61960; hypothetical protein; K_tetra;TM=M;; 4.86

443623; AA345519; Hs.9641; complement component 1, q subcomponent, alpha polypeptide; C1q,Collagen;SS=M; 4.85

445903; AI347487; Hs.132781; class I cytokine receptor; fn3;TM=Y;; 4.85 

427509; M62505; Hs.2161; complement component 5 receptor 1 (C5a ligand); 7tm_1;TM=Y;SS=M; 4.85

428820; AA436187; Hs.172631; integrin, alpha M (complement component receptor 3, alpha; also known as CD11b (p170), macrophage antigen alpha polypeptide);
vwa,integrin_A,FG-GAP;TM=Y;SS=M; 4.84

445143; U29171; Hs.75852; casein kinase 1, delta; zf-C3HC4,Filamin,zf-B_box,NHL,pkinase,zf-MIZ;TM=M;; 4.82
427157; U51166; Hs.173824; thymine-DNA glycosylase; UDG;TM=M;; 4.81

427857; AL133017; Hs.2210; hypothetical protein FLJ22865; myosin_head,IQ,zf-MYND;TM=M;SS=M; 4.81

422293; X94453; Hs.114366; pyrroline-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase); aldedh,aakinase;TM=M;; 4.81
414280; BE410769; Hs.75873; zyxin; LIM,ig,pkinase;TM=M;SS=M; 4.81
424570; AA343306; Hs.133511; ESTs; SH3,ank,none; 4.80

451144; AW956103; Hs.61712; pyruvate dehydrogenase kinase, isoenzyme 1; HATPase_c,none; 4.80
402705; AA214618; ; activator of S phase kinase; AhpC-TSA;TM=M;SS=M; 4.80
410024; AW191024; Hs.55016; hypothetical protein FLJ21935; SH3;TM=M;; 4.80 
419972; AL041465; Hs.182982; golgin-67; none,none; 4.80

427127; AW802282; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 4.80 
413476; U25849; Hs.75393; acid phosphatase 1, soluble; LMWPc;TM=M;SS=M; 4.80 
415801;R24219;Hs.278443;FcfragmentoflgG,lowaffin'lyll receptor fo 0D3 ig;TM=> 1.7 

402233; ; ; NM_030750*:Homo sapiens endothelial differentiation, sphingolipid G-protein-coupled receptor, 8 (EDG8), mRNA; 7tm_1;TM=Y;SS=M; 4.79
45 448153; Y10805; lls.20521; HMT1 (hnRNP melhyltransferase, S cerevis se) like 2; NusC SS=M 4 73 
407722; BE252241; Hs.38041; pyridoxal (pyridoxine, vitamin B6) kinase; pfkB;TM=M;; 4.79

405370; ; ; NM_005569*:Homo sapiens LIM domain kinase 2 (LIMK2), transcript variant 2a, mRNA.; pkinase,LIM,PDZ;SS=M; 4.79

41G498 U33632 Hs 7935i pola r < m i 'j' 'an m mber 1 (TV/II 1, ionjrar TV=Y,SS=M,478 

429921; AA52591 1; Hs.82772; collagen, type Xl.alpha 1; Collagen.COl Fl.TSPNJam rjii.G CciA;SS=M; 4.73 
424415; NM_001975; Hs.146680; enolase 2, (gamma, neuronal); enolase;TM=M;; 4.78

433133; AB027249; Hs.104741; PDZ-binding kinase; T-cell originated protein kinase; pkinase;TM=M;; 4.78

431629; AU077025; Hs.265827; interferon, alpha-inducible protein (clone IFI-6-16); none;TM=M;SS=Y; 4.78

417929; R27219; Hs.74647; Human T-cell receptor active alpha-chain mRNA from JM cell line, complete cds; ig,abhydrolase; 4.78

450334; AF035959; Hs.24879; phosphatidic acid phosphatase type 2C; PAP2;TM=Y;SS=M; 4.78 
447674; BE270640; Hs.19192; cyclin-dependent kinase 2; pkinase;SS=M; 4.77

409744; AW675258; Hs.55265; Homo sapiens mRNA; cDNA DKFZp586P2321 (from clone DKFZp586P2321); none;NA;NA; 4.77 

446196; AI744888; Hs.149470; ESTs; zf-C3HC4,Sulfate_transp,STAS; 4.77

429305; AF095727; Hs.287832; myelin protein zero-like 1; ig,transmembrane4;TM=Y;SS=M; 4.77

426812; AF105365; Hs.172613; solute carrier family 12 (potassium/chloride transporters), member 7; none;TM=Y;; 4.77
425811; AL039104; Hs.159557; karyopherin alpha 2 (RAG cohort 1, importin alpha 1); Armadillo_seg,IBB,DEAD,helicase_C,Sec63,DDT,PHD,bromodomain;TM=M;; 4.77
+-> N ip kinase phosphalase-like f I i'X; DSPc;TM=M;; 4.7 

452256; AK000933; Hs.28661; Homo sapiens cDNA FLJ10071 fis, clone HEMBA1001702; GDI,7tm_1,none; 4.76

447207; AA442233; Hs.17731; hypothetical protein FLJ12892; none;TM=M;; 4.76

400846; ; ; sortilin-related receptor, L(DLR class) A repeats-containing (SORL1); EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 4.76
452355; N54926; Hs.29202; G protein-coupled receptor 34; 7tm_1,QATP_C;TM=Y;; 4.75

406809; AF000574; Hs.22405; leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 2; ig,Gemini_mov;TM=M;SS=M; 4.75
427378; BE515037; Hs.177556; melanoma antigen, family D, 1; MAGE;TM=M;; 4.75

444042; NM_004915; Hs.10237; ATP-binding cassette, sub-family G (WHITE), member 1; ABC_tran,PRK,GBP;TM=Y;; 4.74
410406; AI969703; Hs.1466; glycerol kinase; FGGY,FGGY_C;TM=M;; 4.73
70 41 1653; AF070578; Hs.71168; Homo sapiens clone 24674 mRNA seq i N I 

437667; BE616412; Hs.286218; junctional adhesion molecule 1; none,HLH; 4.73

417781; BE279380; Hs.82563; KIAA0153 protein; TTL,Acyl_transf; 4.73

453966; BE148734; Hs.63325; transmembrane protease, serine 4; trypsin,ldl_recept_a,none; 4.73

412228; AW503785; Hs.73792; complement component (3d/Epstein Barr virus) receptor 2; sush;TM=Y;SS=M; 4.73 
418255; AW135405; Hs.37251; ESTs; pkinase,none; 4.73

413472; BE242870; Hs.75379; solute carrier family 1 (glial high affinity glutamate transporter), member 3; SDF;TM=Y;SS=M; 4.73

40690b Z25421 qb:H 3 piei ole rii /three i t nasp gene, complete CDS.; none.none; 4.73 

432065 AA401 039 H ) i tein phosphata 4((om I ). catalyti i tunf M nllophos;TM=M;;4.72 
424909; S78187; Hs.153752; cell division cycle 25B; Rhodanese;SS=M; 4.72

422599; BE387202; Hs.118638; non-metastatic cells 1, protein (NM23A) expressed in; NDK,PH,Oxysterol_BP;SS=M; 4.71
426136; AW957239; ; gb:EST369309 MAGE resequences, MAGD Homo sapiens cDNA, mRNA sequence; PP2C,none; 4.71
446203; Z47553; Hs.14286; flavin containing monooxygenase 5, Ft.!') like [ rr_ , to TM-Y SS=M 4 71 

451295; AI557212; Hs.17132; ESTs, Moderately similar to 154374 gene NF2 protein [H.sapiens]; pkinase,DAG_PE-bind,pkinase_C,OPR,none; 4.71
424099; AF071202; Hs.139336; ATP-binding cassette, sub-family C (CFTR/MRP), member 4; ABC_tran,ABC_membrane;TM=Y;; 4.70
424959; NM_005781; Hs.153937; activated p21cdc42Hs kinase; ldh,ldh_C,SH3,pkinase,UBA;TM=M;; 4.70
i 20 if PI ii 173965, nboscmal protein S6 kinase, 90kD, polypeptide 3; none.none; 4,70 

1662; NM_D1414 1i 1 - lie l ' c F t/F ril il ate_transp,STAS,7tm_3,xan_ur_permease;TM=Y;SS=M;4.70 

413431; AW24642 j H 75 ibiquit n-conjugating enzyme E horn agou- to yeast UBC1 3); UQ_con;TM=M;; 4.70 
405484; ; ; C3002124*:gi|12737280|ref|XP_006682.2| keratin 18 [Homo sapiens]||5S33; none;SS=M; 4.70 
401345; M83738; ; protein tyrosine phosphatase, non-receptor type 9; none;TM=M;; 4.70
416602; NM_006159; Hs.79389; nel (chicken)-like 2; EGF,vwc,TSPN;SS=Y; 4.69
412507; L36645; 1 64; Epl 3, ( e.S M.EPH M T '' 4 
437897; AA770561; Hs.146170; hypothetical protein FLJ22969; zf-DHHC,none; 4.69
432886; BE159028; Hs.279704; chromatin accessibility complex 1; none;TM=M;; 4.69

400843; ; ; NM_003105*:Homo sapiens sortilin-related receptor, L(DLR class) A repeats-containing (SORL1), mRNA.; 

EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 4.68
433409; AI278802; Hs.25661; ESTs; pkinase,pkinase; 4.68
413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; none;TM=Y;SS=M; 4.68 
430259; BE550182; Hs.127826; RalGEF-like protein 3, mouse homolog; <ji3,RA,RasGEF;TM=M;SS=M; 4.68 
425761; AW664214; Hs.196729; ESTs; SH3,Ribosomal_S3Ae; 4.68 

431941; AK000106; Hs.272227; Homo sapiens cDNA FLJ20099 fis, clone COL04544; pkinase,Furin-like,Recep_L_domain,none; 4.68
419493; AF001212; Hs.90744; proteasome (prosome, macropain) 26S subunit, non-ATPase, 11; CDK5_activator,PCI,none; 4.67
425966 NM CC17C1 I I 1 i CTM M 4 b"? 

408056; AA312329; Hs.42331; ephrin-A4; Ephrin;TM=M;SS=M; 4.67

453476; Al 640500; Hs.24633; SAM domain, SH3 domain and nuclear localisal on ignal I rsMSS=M4 67 

412926; AI879076; Hs.75061; macrophage myristoylated alanine-rich C kinase substrate; MARCKS;SS=M; 4.67 

424635; AA420687 H 11 455 Homo -piens cDNA FLJ14259 fis, clone PLACE1001076; pkinase.Funn-like.Recep.LJomain.none; 4.66 

446051; BE048061,Hs." 54 ephri A , Epl rn,A_deamin,dsrm,z-alpha;4.66 

436729; BE621807; ; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 4.66 

408204; AA454501; Hs.43666; protein tyrosine phosphatase type IVA, member 3; Y_phosphatase;TM=M;; 4.66 

435542; AA687376; ; ESTs; SH3,ig,pkinase,PH,spectrin,RhoGEF,none; 4.66 

429682; NM_006306; Hs.211602; SMC1 (structural maintenance of chromosomes 1, yeast)-like 1; ABC_tran,SMC_N,SMC_C,KID;TM=M;; 4.66

11749 4« H E "1 3 anligen; lransmembrane4;TM M; 4.6C 

418736; T18979; Hs.87908; Snf2-related CBP activator protein; helicase_C,AT_hook,SNF2_N;TM=M;; 4.65

41 51 1 7; AF120499; Hs.78016; polynucleotide kinase 3'-pho-phnt e i i case TM-M.4 65 

418629; BE247550; Hs.86859; growth factor receptor-bound protein 7; SH2,PH,RA;SS=M; 4.65

426108; AA622037; Hs.166468; programmed cell death 5; DUF122;TM=M;; 4.64 

429263 AA0190C4 II I rP-bindinq casse any t 1) m rrbpr 1 BC_tran,SRP54;TM=Y;SS=M;4.64 

131886; L77S64; H C t 1 6; pkinase;TM=M;; 4.6 

435049; AL122067; Hs.4746; hypothetical protein FLJ21324; none;TM=M;; 4.63

437763; AA469369; Hs.5031; tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, collagenase inhibitor); TIMP,pkinase,DAG_PE-bind,RBD; 4.63

413- J i 8083; H 301 pi c na 1 f I* =U - 

421846; AA017707; Hs.1432; protein kinase C substrate 80K-H; efhand,ldl_recept_a;SS=M; 4.62 

442590; AI002686; Hs.130313; ESTs; none,Y_phosphatase,Band_41,connexin; 4.62

416224; NM_002902; Hs.79088; reticulocalbin 2, EF-hand calcium binding domain; efhand;SS=M; 4.62

423740; Y07701; Hs.293007; aminopeptidase puromycin sensitive; Peptidase_M1,Armadillo_seg; 4.61

429300; AB011108; Hs.198891; serine/threonine-protein kinase PRP4 homolog; pkinase;TM=M;; 4.60

447232; AW499834; Hs.327; interleukin 10 receploi apha lunrTf <" ,-" M 

412942; AL120344; Hs.75074; mitogen-activated protein kinase-activated protein kinase 2; pkinase;TM=M;; 4.60
419596; DE379320; Hs.91448; MKP-1 like protein tyrosine phosphatase; DSPc;; 4.59 

417880; BE241595; Hs.82848; selectin L (lymphocyte adhesion molecule 1); EGF,lectin_c,sushi;TM=M;SS=M; 4.59
411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; none;TM=Y;SS=M; 4.59
434883; AW3815.E I 19807; I pothet c 1 ot ir M C12%1 SH3,PH,WW,RhoGAP;SS=M; 4.58 

447312; AI434345; Hs.36908; activating transcription factor 1; rrm,zf-RanBP,pkinase,GST_C,Ets,SAM_PNT,ABC2_membrane,myosin_head,IQ,Myosin_N,bZIP,zf-C2H2,PHD,BTB,TFIIS,AT_hook,SAM;TM=M;; 4.58
435254; AW194689; Hs.30778; ESTs; pkinase,Bacterial_PQQ,none; 4.58 

426925; NM_001196; Hs.315689; Homo sapiens cDNA: FLJ22373 fis, clone HRC06741; Esterase,enolase,Peptidase_S9;TM=M;; 4.58

421685, AF189723 H 1 7 it- e, Ca transporting, type 2C, member 1; Cation_ATPase_C,Cation_ATPase.N,E1-E2_ATPase,Hydrolase,XPG_N;TM=Y;; 4.53 
14702 U '27 Hs 19718 protein tyrosine phosphatase, receptor type, U; fn3,ig,Y_phosphatase,MAM;TM=Y;SS=M; 4.58

427640; AF058293 H 18CC15 D i pact ran e lautcn r J 1 MT 1ir G3T CEF1G_domain,GST_N,S1,Fz,Frizzled,calreticulin,7tm_2,rrm,PAP_assoc;TM=Y;SS=M; 4.57 
441085; AW136551; Hs.181245; Homo sapiens cDNA FLJ12532 fis, clone NT2RM4000200; none,none; 4.57
409581; U66243; Hs.55039; mitogen-activated protein kinase 12; pkinase;SS=M; 4.57 
423184; NM_004428; Hs.1624; ephrin-A1; Ephrin;TM=M;SS=M; 4.56

443920; AL037764; Hs.35304; Homo sapiens cDNA FLJ13655 fis, clone PLACE1011503; none,FMO-like; 4.56

422627; BE336857; Hs.118787; transforming growth factor, beta-induced, 68kD; Fasciclin,ABC_tran,ABC_membrane,GTP_EFTU;TM=M;SS=M; 4.56

418869; AW515565; ; gb:xq01d05.x1 Soares_NHCeC_cervical_tumor Homo sapiens cDNA clone 3' similar to contains Alu repetitive element contains element MER11 repetitive element;, mRNA sequence; none,RasGAP,WW,IQ; 4.56
430016; NM_004736; Hs.227655; xenotropic and polytropic retrovirus receptor; SPX,EXS;TM=Y;; 4.56
437157; BE048860; Hs.120655; ESTs; IRK,none; 4.55 

422769; AA938905; Hs.120017; olfactory receptor, family 7, subfamily E, member 38 pseudogene; none,none; 4.55

457918; AL359590; Hs. 162604; hypothetical protein DKFZp762M1 3= PLDc;TM r - 55 

434467; BE552368; Hs.231853; Homo sapiens cDNA FLJ13445 fis, clone PLACE1002952; 7tm_1,none; 4.55

421140; AA298741; Hs.102135; signal sequence receptor, delta (translocon-associated protein delta); none;TM=Y;SS=M; 4.55

406364; ; ; Target Exon; hexapep;TM=M;; 4.55 

434682; AA827165; Hs.191958; immunoglobulin superfamily receptor translocation associated 2; ig,none; 4.54

438939; H21012; Hs.287657; Homo sapiens cDNA; FU21291 fis, clone COL01963; F5.F8.type.C,pkT,ase,Hs,none; 4.54 

i 34 E54527 ll 34091 I; Ts trar lion el ngat i faoloi mite bond, il t r _Ts Mb* 1 

411165; NM_000169; Hs.69089; galactosidase, alpha; Melibiase;SS=M; 4.54

408956; AK001858; Hs.49344; hypothetical protein FLJ11006; ion_trans;TM=Y;; 4.54

416847; L43821; Hs.80261; enhancer of filamentation 1 ( docking Crk associated substrate related); SH3;TM=M;; 4.53 

410226; AI831958; Hs.61053; hypothetical protein; SH3,TPR;TM=M;; 4.53 

422753; AI928995; Hs.1575; small nuclear ribonucleoprotein D3 polypeptide (18kD); Sm;SS=M; 4.52 

418355; L42563; Hs.1165; ATPase, H+/K+ transporting, nongastric, alpha polypeptide; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase;TM=Y;; 4.52
0( (1 Eo -if ri if'H !!_ el T- ,';S M;4.52 
444633; AF111713; Hs.286218; junctional adhesion molecule 1; ig;TM=Y;SS=M; 4.52

422940; BE077458; ; gb:RC1-BT0606-090500-015-b04 BT0606 Homo sapiens cDNA, mRNA sequence; Sec7,PH,ANF_receptor,lig_chan,WD40,IRK; 4.52 
400303; AA242758; Hs.79136; LIV-1 protein, estrogen regulated; none,none; 4.51
412604; AW978324 H- I rotein kir ( 1 r e C E-bW.pl* 
448633; AA311426; Hs.21635; tubulin, gamma 1; tubulin;TM=M;; 4.51 
457906; AW975939; Hs.153290; Homo sapiens cDNA FLJ14318 fis, clone PLACE3000402; none,pkinase; 4.51
456362; AW973003; Hs.179909; hypothetical protein FLJ22995; none;TM=M;; 4.51 
429690; AW956329; Hs.23721; ESTs; none,sugar_tr,Ribosomal_S25; 4.50

424618; L29472; Hs.1802; major hsto - ! I omple ID ta g MHC_ll_beta;TM=Y;SS=M;4.50 

444823; BE262989; Hs.12045; putative protein; Mra1,MBOAT;TM=M;SS=Y; 4.50

405490; ; ; NM_031414:Homo sapiens serine/threonine kinase 31 (STK31), transcript variant 1, mRNA.; pkinase,TUDOR;TM=M;; 4.50
424494 U78575 H- 1^ 55 Dsphat'i nositol-4-phosphate 5-kinase, type I, alpha; PIP5K;SS=M; 4.50 
•11-1 1 , ■ v.u.r.v,*-. ..;«Uf, .m>n '" ■i- r -' ■ - ■ 'T =-'.TV=f.( . i=f.( 4 ft 

443951; F13272 H 111334 ferrilii Ightp lypepl le PMP 2 C1 1 1 Jin rone; 4.50 

410423; AW402432; Hs.63489; protein tyrosine phosphatase, non-receptor type 6; SH2,Y_phosphatase,DSPc;TM=M;; 4.50

429556; AW139399; Hs.98988; ESTs; none;TM=M;; 4.50

458791; BE615453; Hs.346509; dedicator of cyto-kinesis 1; none;TM=Y;; 4.49 

425209; AL049761; Hs.155140; casein kinase 2, alpha 1 polypeptide; pkinase,ABC1;TM=M;; 4.49

425695; NM_005401; Hs.159238; protein tyrosine phosphatase, non-receptor type 14; Y_phosphatase,Band_41,DSPc;TM=M;; 4.49

42494^ AUO zCO H ated p c a IV AP Rbo omal_L24e,SRP54,dDENN,DENN,uDENN;TM=M;;4.49 

412Q7C ^B026436 H<= 1 / 1; dual S| 1 1 ly pho-pMase 10; Rhcdanese,DSPc;SS=V;4.48 

400755; AA635062; ; Homo sapiens mRNA; cDNA DKFZp43 U0 r I from clone K p434O051 f C3HC4,CARD,BIR;TM=M;; 4.48 
425566 AW1 234 1 r 2 )510; UL ^ nding protein 2; ldl_recepLa,PKD,MHCJ;TM=M;SS=Y; 4.48 

410151; X15723; Hs.59242; paired basic amino acid cleaving enzyme (furin, membrane associated receptor protein); Peptidase_S8,P;TM=Y;SS=M; 4.48

!53 I 666 guanine nucleotide binding protein (G protein), alpha 13; UCR_hinge,G-alpha,arf;TM=M;; 4.48

424711; NM_005795; Hs.152175; calcitonin receptor-like; 7tm_2,HRM;TM=Y;SS=M; 4.48
427878; C05766; Hs.181022; CGI-07 protein; none,zf-C2H2; 4.48 

443991; NM_002250; Hs.10082; potassium intermediate/small conductance calcium-activated channel, subfamily N, member 4; CaMBD,SK_channel,ion_trans;TM=Y;SS=M; 4.48
422605; H16646; Hs.118666; hypothetical protein PP591; PAPS_reduct,MoCF_biosynth;; 4.47

410583; AW770280; Hs.36258; ESTs, Moderately similar to JC5238 galactosylceramide-like protein, GCP [H.sapiens]; SH3,PDZ,Guanylate_kin,none; 4.47
434419; AL040606; Hs.296938; dual specificity phosphatase 7; DSPc;TM=M;; 4.47 

410032; BE065985; ; gb:RC3-BT0319-120200-014-a09 BT0319 Homo sapiens cDNA, mRNA sequence; abhydrolase_2,none; 4.46 

423078; M35198; Hs.123125; integrin, beta 6; integrin_B,EGF,pp-binding;TM=Y;SS=M; 4.46

400263; ; ; Eos Control; GTP_EFTU,EFG_C,GTP_EFTU_D2,serpin;TM=M;; 4.46 

44 !■)•>.;. :4Wf.;. I b.::?!- r I ■:■ rl.. : . r ,..i ; , ,M|.j;.ml ■icr.-;c,-|ii A ,-,.;.:rl r.3oc;TM=M;:445 

434551; BE387162; Hs.280858; ESTs, Highly similar to A35661 DNA excision repair cross-complementing protein ERCC3 [H.sapiens]; none;TM=M;; 4.45
413227; M79082; ; ESTs; none,none; 4.45 

441321; 1117182; Hs.7771; B-cell associated protein; Band_7;TM=M;; 4.45 

457194 H20 b9 H 54 T ghly s nilar to urramed protein product [Rsapiens]; none,pkir,ase,PBD; 4,45 
414745; AA160511; Hs.5326; amino acid system N transporter 2; porcupine; none,none; 4.45

404276; ; ; NM_002944*:Homo sapiens v-ros avian UR2 sarcoma virus oncogene homolog 1 (ROS1), mRNA.; fn3,pkinase,DUF139;TM=Y;SS=M; 4.45
426966 AI491134 i I i lM M SS U 4 45 

408873; AL046017; Hs.182278; calmodulin 2 (phosphorylase kinase, delta); none,none; 4.44

426486; BE178285; Hs.170056; Homo sapiens mRNA; cDNA DKFZp586B0220 (from clone DKFZp586B0220); pkinase,none; 4.44
432798; AA565309; Hs.194015; ESTs; integrin_B,Sema,PSI,TIG,none; 4.44

439668; AI091277; Hs.302634; frizzled (Drosophila) homolog 8; Frizzled,Fz,7tm_2,toxin_2;TM=Y;SS=M; 4.44
417886; AA214584; ; ESTs; SPRY,7tm_3,ANF_receptor,none; 4.43

452098; AI858183; ; gb:wl46a12.x1 NCI_CGAP_Ut1 Homo sapiens cDNA clone 3' similar to contains Alu repetitive element;, mRNA sequence; SH3,none; 4.43

426874; N67325; Hs.347487; ESTs; SH3,TonB_boxC,none; 4.43 

422714; AB018335; Hs.119387; KIAA0792 gene product; DUF221;TM=Y;SS=M; 4.42

410741; Z11695; Hs.324473; mitogen-activated protein kinase 1; pkinase,none; 4.42

432193; AA372264; Hs.273193; hypothetical protein FLJ10705; pkinase;TM=M;; 4.41

409506; NM_006153; Hs.54589; NCK adaptor protein 1; SH2,SH3;TM=M;; 4.41

429390; AB040942; Hs.201500; KIAA1509 protein; none;TM=M;; 4.41

421859; AA356620; Hs.108947; KIAA0050 gene product; ank,PH,ArfGap;SS=M; 4.41 

451527; AF022813; Hs.26518; transmembrane 4 superfamily member 7; none,none; 4.41

421748; NM_014718; Hs.107809; KIAA0726 gene product; cadherin;TM=Y;; 4.40 

410416; BE410072; Hs.63304; protein phosphatase methylesterase-1; none;TM=M;; 4.40 

450457; AA367701; Hs.6639; KIAA1624 protein; none;TM=M;SS=M; 4.40

433029; NM_014322; Hs.279926; opsin 3 (encephalopsin); 7tm_1,Monooxygenase;TM=Y;SS=M; 4.40

408805; H69912; Hs.48269; vaccinia related kinase 1; pkinase;TM=M;; 4.40 

421585; U95626; Hs.302043; chemokine (C-C motif) receptor-like 2; 7tm_1;TM=Y;SS=M; 4.40

440014; AW960782; Hs.6856; ash2 (absent, small, or homeotic, Drosophila, homolog)-like; SPRY,BAG,UPF0001; 4.40

451154; AA015879; Hs.33536; ESTs; TIMP,none; 4.40 

433895; AI287912; Hs.3628; mitogen-activated protein kinase kinase kinase kinase 4; pkinase,zf-C4,CNH,ERM;TM=M;; 4.40 
422034; AC006486; Hs.333069; Ets2 repressor factor; Ets;TM=M;; 4.39 
444009; AI380792; Hs.135104; ESTs; TNFR_c6,TIL,none; 4.39
420020; BE295866; Hs.94382; adenosine kinase; pfkB;SS=M; 4.39 

416207; NM_014745; Hs.79077; Homo sapiens, clone MGC:2908, mRNA, complete cds; none;TM=Y;SS=M; 4.39

417655; AA780791; Hs.14014; hypothetical protein FLJ14813; pkinase,pkinase_C;TM=M;; 4.39

402915; ; ; ENSP00000202587*:Bicarbonate transporter-related protein BTR1.; HCO3_cotransp;TM=Y;; 4.39

453199; AI336266; Hs.32353; mitogen-activated protein kinase kinase kinase 4; pkinase;TM=M;; 4.38 

416033; NM_012201; Hs.78979; Golgi apparatus protein 1; cys_rich_FGFR;TM=Y;SS=M; 4.38

453672; U73531; Hs.34526; G protein-coupled receptor; 7tm_1;TM=Y;SS=M; 4.38 

437852; BE001836; Hs.256897; ESTs, Weakly similar to dJ365O12.1 [H.sapiens]; GPS,7tm_2;TM=Y;; 4.38

420039; NM_004605; Hs.94581; sulfotransferase family, cytosolic, 2B, member 1; Sulfotransfer;SS=M; 4.38

12834; R77123; 14 I pienscDN J J t c n 1 C ,14 i 

452203; X57522; ; transporter 1, ATP-binding cassette, sub-family B (MDR/TAP); ABC_tran,ABC_membrane,SRP54,Thymidylate_kin;TM=Y;SS=M; 4.37
4IE317.AW205118.H- 210.16 nterl ul r, 21 i eptoi none;TM-Y;SS=M; 4,37 

i 34 L04 I 61 73 I , Ih I pi le i FLJ1 1 > n rr T> =M;S! ? J 

424028; AF055084; Hs.153692; Homo sapiens cDNA FLJ14354 fis, clone Y79AA1001384, highly similar to Homo sapiens very large G-protein coupled receptor-1 (VLGR1) mRNA; none,none; 4.37

jiens PRO 

440270; NM_015986; Hs.7120; cytokine receptor-like molecule 9; fn3;SS=M; 4.36
432987; AIBB4771; Hs.27954; CD86 antigen (CD28 antigen ligand 2, B7-2 antigen); none;TM=Y;SS=M; 4.35

435943; AA773838; Hs.5353; caspase 10, apoptosis-related cysteine protease; ICE_p10,ICE_p20,DED;TM=M;; 4.36

457897; AI355125; Hs.345158; ESTs, Weakly similar to HXA2_HUMAN HOMEOBOX PROTEIN HOX-A2 [H.sapiens]; homeobox;NA;NA; 4.36

406671; AA129547; Hs.285754; met proto-oncogene (hepatocyte growth factor receptor); Sema,pkinase,TIG,PSI,none; 4.36

413969; X14034; Hs.75648; phospholipase C, gamma 2 (phosphatidylinositol-specific); SH2,SH3,C2,PH,PI-PLC-Y,PI-PLC-X,PDGF;SS=M; 4.35

408101; AW968504; Hs.123073; CDC2-related protein kinase 7; none,none; 4.35

414029; BE297731; Hs.75709; mannose-6-phosphate receptor (cation dependent); Man-6-P_recep;TM=M;SS=M; 4.35

425069; AA687465; Hs.298184; potassium voltage-gated channel, shaker-related subfamily, beta member 2; aldo_ket_red,none; 4.35

438937; AW962654; Hs.244624; ESTs; EPH_lbd,pkinase,fn3,SAM,none; 4.35

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 2489 expressed sequence; none,lectin_c; 4.35
436540; BE397032; Hs.14468; hypothetical protein MGC14226; rrm,7tm_1,SNF;TM=M;; 4.34
435267; N23797; Hs.110114; ESTs; none,Syja_N,Exo_endo_phos; 4.34
405615; ; ; Target Exon; none,SH3,BAR; 4.34

432141; BE410964; Hs.272735; nuclear receptor binding protein; pkinase;TM=M;; 4.33 
417927; R73095; Hs.24122; ESTs; none,pkinase; 4.33

429849; U33053; Hs.2499; protein kinase C-like 1; pkinase,pkinase_C,HR1;TM=M;; 4.33
425743; BE395495; Hs.159428; BCL2-associated X protein; Bcl-2;TM=Y;; 4.33
453863; X02544; Hs.572; orosomucoid 1; lipocalin,aldedh,ubiquitin,IRK;SS=M; 4.33

400847; ; ; NM_003105*:Homo sapiens sortilin-related receptor, L(DLR class) A repeats-containing (SORL1), mRNA.; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 4.33
414914; U49344; Hs.77613; ataxia telangiectasia and Rad3 related; FAT,FATC,PI3_PI4_kinase;TM=M;; 4.33
413858; NM_001610; Hs.75589; acid phosphatase 2, lysosomal; acid_phosphat;TM=Y;SS=M; 4.33

442539; AL119506; Hs.58220; Homo sapiens cDNA: FLJ23005 fis, clone LNG00396, highly similar to AF055023 Homo sapiens clone 24723 mRNA sequence; RasGAP,adenylatekinase; 4.33
419607; R52557; Hs.91579; Homo sapiens clone 23783 mRNA sequence; !MP4;TM=M;; 4.32 
436703; AW880614; Hs.146381; RNA binding motif protein, X chromosome; rrm,SH3,PH,CH,RhoGEF; 4.32
414899; AW975433; Hs.36288; ESTs; pkinase,SH3,none; 4.32

444895; AI674383; Hs.22891 ; solute carrier family 7 (cationic amino acid transporter, y system), member B; ASC,death,TNFR_c6; 4.31 
415135; AW673559; Hs.78040; KDEL (Lys-Asp-Glu-Leu) endoplasmic reticulum protein retention receptor 1; ER_lumen_recept,none; 4.31 
444070; NM_015367; Hs.10267; MIL1 protein; Bcl-2;TM=Y;; 4.31

422611; AA158177; Hs.118722; fucosyltransferase 8 (alpha (1,6) fucosyltransferase); SH3,K-box;TM=M;SS=Y; 4.31
437162; AW005505; Hs.5464; thyroid hormone receptor coactivating protein; bromodomain;TM=M;; 4.30
440983; M20681; Hs.7594; solute carrier family 2 (facilitated glucose transporter), member 3; sugar_tr;TM=Y;SS=M; 4.30
414080; AA135257; Hs.47783; B aggressive lymphoma gene; A1pp;TM=M;; 4.30 

415072; BE253687; Hs.77876; Homo sapiens, clone IMAGE3461982, mRNA, partial cds; Metallophos,Armadillo_seg;TM=M;; 4.30
442994; A1026^1E H 1 I- T a I e 1 mat S14;4.30 

432328; AI572739; Hs.195471; 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 3; PGAM,6PF2K;TM=M;; 4.29
439490; AW249197; Hs.100043; ESTs, Weakly similar to A45302 PTB-associated splicing factor, long form [H.sapiens]; none;TM=M;; 4.29
422005; BE266556; Hs.110702; Homo sapiens mRNA; cDNA DKFZp761E212 (from clone DKFZp761E212); none,Na_H_Exchanger; 4.29
415214; AI445236; Hs.125124; EphB2; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 4.29

430316; NM_000875 H' 23 I I i lii ke growth fact I receptor; fn Furr lie pkinase,Receo_L_doma:n;TM=M;SS=M; 4.29 
429099; BE439952; Hs.196177; phosphorylase kinase, gamma 2 (testis); pkinase,Bac_DNA_binding;TM=M;; 4.29
425843; BE313280; Hs.159627; death associated protein 3; myb_DNA-binding,PAH,BAH,bromodomain,PHD,SET;TM=M;; 4.28
437603; AW979259; Hs.293673; ESTs; death,none; 4.28

439975; AW328081; Hs.6817; inosine triphosphatase (nucleoside triphosphate pyrophosphatase); Ham1p_like;TM=M;; 4.28
424512; X53002; Hs.149846; integrin, beta 5; integrin_B,EGF;TM=Y;SS=M; 4.28
442980; AA857025; Hs.8878; kinesin-like 1; kinesin,Luteo_ORF3,DUF164;TM=M;; 4.28

420166; AW732276; Hs.95583; transmembrane 4 superfamily member (tetraspan NET-7); transmembrane4;TM=Y;SS=M; 4.27
409582; R27430; Hs.271565; ESTs; none,Neur_chan_LBD,Neur_chan_memb; 4.27
439096; AA830185; ; ESTs; ras,none; 4.27

414561; AI064813; Hs.195155; Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds; Aa_trans;TM=Y;; 4.27

411835; U29343; Hs.72550; hyaluronan-mediated motility receptor (RHAMM); bZIP;SS=M; 4.27

428781; AF164799; Hs.193384; putative 28 kDa protein; pkinase,DAG_PE-bind,pkinase_C,OPR;SS=M; 4.27

430603; AA148164; Hs.247280; I IDV assoc'ated factor; zf-C3HC4,zf-RanBP,pkinase; 4.27 

415149; X12451; Hs.78056; cathepsin L; Peptidase_C1;SS=M; 4.26

444338; AV651680; Hs.208558; ESTs; integrin_A,FG-GAP,none; 4.26

402328; ; ; Target Exon; pkinase;TM=M;; 4.26 

416094; AW995512; Hs.225977; nuclear receptor coactivator 3; none,none; 4.26
420942; H03514; Hs.15589; ESTs; none,pkinase; 4.26
453902; BE502341; Hs.3402; ESTs; none,none; 4.26 

425505; AL036458; ; gb:DKFZp564D2062_r1 564 (synonym: hfbr2) Homo sapiens cDNA clone DKFZp564D2062 5', mRNA sequence; arf,G-alpha,none; 4.26

427344; NM_000869; Hs.2142; 5-hydroxytryptamine (serotonin) receptor 3A; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 4.26

432269; NM_002447; Hs.2942; macrophage stimulating 1 receptor (c-met-related tyrosine kinase); pkinase,Sema,PSI,TIG,A4_EXTRA;TM=M;SS=M; 4.26

417007; AF224741; Hs.80768; chloride channel 7; CBS,voltage_CLC;TM=Y;; 4.26

447960; AW954377; Hs.26412; ring finger protein 26; zf-C3HC4;TM=Y;SS=M; 4.26

442300; AI765908; Hs.129166; ESTs; none;SS=M; 4.25 

421856; NM_016447; Hs.108931; MAGUK protein p55T; Protein Associated with Lin 2; SH3,PDZ,Guanylate_kin,L27;TM=M;; 4.25
452110; T47667; Hs.28005; Homo sapiens cDNA FLJ11309 fis, clone PLACE1010076; pkinase,Activin_recp,none; 4.25

422451; AA310753; Hs.42491; ESTs, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens]; PDZ,SH2,STAT,STAT_bind,STAT_prot,none; 4.25
453955; AW579207; Hs.304666; ESTs, Weakly similar to I78885 serine/threonine-specific protein kinase [H.sapiens]; fn3,ig,MAM,none; 4.25
457670; AF119666; Hs.23449; insulin receptor tyrosine kinase substrate; SH3;TM=M;; 4.25

419133; U46116; Hs.89627; protein tyrosine phosphatase, receptor type, G; fn3,Y_phosphatase,carb_anhydrase,DSPc;TM=Y;SS=M; 4.25 
419660; BE280337; Hs.194693; solute carrier family 7 (cationic amino acid transporter, y system), member 7; aa_permeases;TM=Y;SS=M; 4.25
415198; AW009480; Hs.943; natural killer cell transcript 4; none;TM=M;; 4.24

416440; AI823912; Hs.79335; Homo sapiens, Similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily d, member 1, clone MGC15280, mRNA, complete cds; SWIB;TM=M;; 4.24
419169; AW851980; Hs.262346; ESTs, Weakly similar to S72482 hypothetical protein [H.sapiens]; none,spectrin,SH3,PH,CH; 4.24

4 4 1 I 2 j n I boxvlic acid e t a TM ft t 

433848; AF095719; Hs.93764; carboxypeptidase A4; Zn_carbOpept,Propep_M14;SS=M; 4.24
442213; N36110; Hs.305971; solute carrier family 2 (facilitated glucose transporter), member 10; sugar_tr;TM=Y;SS=M; 4.24
412681; AW983655; Hs.172004 In; fn3 SGpkrase "424 

-I I 1 If I l d (If I It 

421066; AU076725; Hs.101408; branched chain aminotransferase 2, mitochondrial; aminotran_4;; 4.23






428338; AF147765; Hs.232093; ESTs; fn2,CIMR;TM=M;SS=M; 4.23 

443329; BE262943; Hs.9234; hypothetical protein MGC1936; none;TM=Y;SS=M; 4.23 

432314; AA533447; Hs.312989; ESTs; Xlink,none; 4.23

434608; AA805443; Hs.179909; hypothetical protein FLJ22995; none;TM=M;; 4.23

454166; AW99335B; Hs.285814; sprouty (Drosophila) homology SH2,SH3;TM=M;SS=M: 4.23 

442596; AI457102; Hs.347970; Human glucose transporter pseudogene; none,none; 4.23

442549; AI751601; Hs.8375; TNF receptor-associated factor 4; MATH,zf-TRAF,zf-C3HC4;SS=M; 4.22 

424154; AF026004; Hs.141660; chloride channel 2; voltage_CLC,CBS,EPO_TPO,PC_rep; 4.22

433419; AI830342; Hs.211272; ESTs; transmembrane4,none; 4.22 

421921; H83363; Hs.6820; translocase of inner mitochondrial membrane 10 (yeast) homolog; zf-Tim10_DDP,efhand,CH,spectrin,serpin;TM=M;; 4.22

445633; AI453386; Hs.17287; ESTs, Weakly similar to S26689 hypothetical protein hd - mouse [M.musculus]; IRK,none; 4.22

424812; AF059252; Hs.153299; DOM-3 (C. elegans) homolog Z; none;TM=M;; 4.22

410668; BE379794; Hs.65403; hypothetical protein; death,TNFR_c6;TM=Y;SS=M; 4.22 

416636; N32536; Hs.42645; solute carrier family 16 (monocarboxylic acid transporters), member 6; none,none; 4.22

418969; W33191; Hs.28907; hypothetical protein FLJ20258; SH3;TM=M;; 4.21 

447200; BE543146; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; none,none; 4.21

400208; ; ; Eos Control; FCH,RhoGAP,SH3;TM=M;; 4.21

405369; ; ; NM_005569*:Homo sapiens LIM domain kinase 2 (LIMK2), transcript variant 2a, mRNA.; pkinase,LIM,PDZ;SS=M; 4.21

445350; AF052112; Hs.12540; lysophospholipase I; abhydrolase_2;TM=M;; 4.21

441208 AICC9704 H WML >v klycimlar - LU1 HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [Rsapiens]; 

ionJrans,RYDRJTPR.MIR,ncne;4.21 
427217; AA399272; Hs.144341; ESTs; ANP,GHMP_kinases,none; 4.21

400845 NM CC1 ) r n i L(DLRciass) A repeats-containing (SORL1), mRNA.; 

EGF 1 fn3.ldl_recepLa.ldLrecepLb,granulin,BNR;TM=Y;SS=M; 4.21 
422667; H25642; ; ESTs; FMO-like,FMO-like; 4.21

450056; BE047394; Hs.8208; ESTs, Weakly similar to S71512 hypothetical protein T2 - mouse [M.musculus]; ABC_tran,ABC_membrane,ig,MHC_II_beta,SRP54,proteasome,ABC_membrane,ABC_tran; 4.20
448950; AF288687; Hs.9275; CGI-152 protein; E1-E2_ATPase,Hydrolase;TM=Y;; 4.20 
408634, AW 407254, H° i 2 i \m r 2 (phasphorylase kinase, delta), none.none; 4.20 

422335; AA375957; Hs.6682; solute carrier family 7, (cationic amino acid transporter, y system) member 11; none,none; 4.20

426754; NM_014264; Hs.172052; serine/threonine kinase 18; pkinase;TM=M;; 4.20 

435810; BE349853; Hs.2785; keratin 17; zf-Tim10_DDP,SH2,SH3,pkinase,PH,BTK,Ribosomal_L44; 4.20

446143; BE24C342 H° 3 r 07 ec61 homolog NUC c\ E1 dehyd g.l r el_py 1M Y,SS=M;4.20 

426626; AI124572; Hs.323879; inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase gamma; zf-C2H2;TM=M;; 4.20

403716; ; ; Target Exon; Adap_comp_sub,PDZ,DEP,DIX,Dishevelled,hexapep,W2,ABC_tran; 4.19

415749; BE262529; Hs.78771; phosphoglycerate kinase 1; PGK,none; 4.19 

434599; AB002313; Hs.3989; plexin B2; PSI,Sema,TIG;NA;NA; 4.19

412600 L28824 H 11 el ek'r 2 pi n 'e 1 19 

416738; N29218; Hs.40290; ESTs; ABC_tran,ABC_membrane,none; 4.19

410639; BE269047; Hs.65234; hypothetical protein FLJ20596; DEAD,helicase_C,PRK,AIP3;TM=M;; 4.19
431385; BE178536; Hs.11090; membrane spanning 4-domains, subfamily A, member 7; none,none; 4.19

407305; AA715284; ; gb:nv35f03.r1 NCI_CGAP_Br5 Homo sapiens cDNA clone similar to contains Alu repetitive element;, mRNA sequence; pkinase,integrin_B,Sema,PSI,TIG,none; 4.18

452880; AA029332; Hs.87549; ESTs; none,integrin_B; 4.18 

428245; AF151048; Hs.183180; anaphase promoting complex subunit 11 (yeast APC11 homolog); none;SS=M; 4.18
421964; X73079; Hs.288579; polymeric immunoglobulin receplrr r ( tl t JiMMSS-M.418 
409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; SH2,SH3,pkinase;TM=M;; 4.18

421790; AW896201; Hs.22654; sodium channel, voltage-gated, type I, alpha polypeptide; ion_trans,IQ,PEP-utilizers_C;TM=Y;; 4.18
429668; AA625142; Hs.179991; ESTs, Weakly similar to S28942 protein kinase C [H.sapiens]; none,none; 4.18
443068; AI188710; ; ESTs; Endonuclease,pkinase,Activin_recp,none; 4.18
418827; BE327311; Hs.47166; HT021; none;TM=M;;4.18 

447887; AA114050; Hs.19949; caspase 8, apoptosis-related cysteine protease; ICE_p10,ICE_p20,DED;TM=M;; 4.18 

429109; AL008637; Hs.196352; neutrophil cytosolic factor 4 (40kD); SH3,OPR,PX;TM=M;; 4.18

422083; NM 001141 H r rachidonate 1 | jlo c i ' tv h e none n ne 4 13 

438974; AF089816; Hs.6454; chromosome 19 open reading frame 3; PDZ;SS=M; 4.18 

413407; AI356293; Hs.75339; inositol polyphosphate phosphatase-like 1; SH2,SAM,Exo_endo_phos;SS=M; 4.18

424954; NM_000546; Hs.1846; tumor protein p53 (Li-Fraumeni syndrome); P53,WD40,IRK;TM=M;; 4.17

421836; AF109219; Hs.108787; phosphatidylinositol glycan, class N; none,none; 4.17

431544; AK000770; Hs.299329; Homo sapiens cDNA FLJ20763 fis, clone COL09911; none,none; 4.17

413781; J05272; Hs.850; IMP (inosine monophosphate) dehydrogenase 1; CBS,IMPDH_C,IMPDH_N,NPD;TM=M;; 4.17

452012; AA307703; Hs.279766; kinesin family member 4A; kinesin,DNA_topoisoIV,K-box;TM=M;; 4.17

425606; U52112; Hs.156331; renin-binding protein; none;; 4.16 

416817; AA398045; Hs.104679; ESTs; Furin-like,pkinase,Recep_L_domain,fn3,none; 4.16

402447; ; ; C1000201;gi|204416|gb|AAA02627.1| (L05195) fructose transporter [Rattus norvegicus] gi|44; none;TM=Y;SS=M; 4.16 
452875; BE275760; Hs.30928; DNA segment on chromosome 19 (unique) 1177 expressed sequence; Euk_porin;TM=M;SS=M; 4.15
426395; BE151985; Hs.5722; hypothetical protein FLJ23316; pkinase,none; 4.15

404140; ; ; NM_006510:Homo sapiens ret finger protein (RFP), transcript variant alpha, mRNA.; zf-C3HC4,SPRY,zf-B_box;SS=M; 4.15

432268; BE311856; Hs.274230; 3'-phosphoadenosine 5'-phosphosulfate synthase 2; APS_kinase,ATP-sulfurylase;TM=M;; 4.15
405516; ; ; ENSP00000200457*:Thyroid receptor interacting protein 6 (TRIP6) (OPA-interacting protein 1) (Zyxin related protein 1) (ZRP-1).; LIM;TM=M;; 4.15

448390; AL035414; Hs.21068; hypothetical protein; FGGY_C;TM=M;; 4.15 

435732; AF229178; Hs.123136; leucine rich repeat and death domain containing protein; none,none; 4.15
414108; AI267592; Hs.75761; SFRS protein kinase 1; ank,PH,Oxysterol_BP,pkinase;TM=M;; 4.15

411558; AA102670; Hs.70725; gamma-aminobutyric acid (GABA) A receptor, pi; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 4.14
424339 BE25^1 18 Hs.145416; end n; no IM , SS M 414 

427274; NM_005211; Hs.174142; colony stimulating factor 1 receptor, formerly McDonough feline sarcoma viral (v-fms) oncogene homolog; ig,pkinase;TM=Y;SS=M;4.14 
440524, R712b4 H 1 3 F 1 3H j.pkir KH -r tr I EFnone,414 
436115; AW512033; Hs.102004; ESTs; pkinase,none; 4.14

I NM_016314:H I 1 ir l r H ITAMdc i I HM M H 

418529;AW005695;Hs.250897;TRK-fused gene; Band_41,ERM,pkinase,LRR l LRRCT,MAM Nucleoplasmin,Tropomyosin,OPR,filament,bZIP,&ganima l M,DUF164;TM=M;;4.14 
420727; H75701; Hs.99886; complement component 4-binding protein beta; sushi;SS=M; 4.14
433075; NM_002959; ; sortilin 1; Exo_endo_phos,Atrophin-1,BNR,Kelch;TM=M;; 4.14
422783; AA598956; Hs.120439; ethanolamine kinase; Choline_kinase;TM=Y;; 4.14
410726; AI623859; Hs.15936; ESTs; pkinase,pro_isomerase,none; 4.14
417903; NM_002342; Hs.1116; lymphotoxin beta receptor (TNFR superfamily, member 3); TNFR_c6;TM=M;SS=M; 4.14

428307; W27393; Hs.183648; protein tyrosine phosphatase, receptor type, f polypeptide (PTPRF), interacting protein (liprin), alpha 1; SAM,SH3,HS1_rep; 4.14
442434; AA995787 i 12958 l! ^Kncne 4 13 

438361; AA805666; Hs.146217; Homo sapiens cDNA: FLJ23077 fis, clone LNG05840; pkinase,pkinase_C,none; 4.13

445580; AF167572; Hs.12912; skb1 (S. pombe) homolog; none;SS=M; 4.13

421425; AK001564; Hs.104222; hypothetical protein FLJ10702; efhand,kazal,arf,ras,7tm_1;TM=M;; 4.13

400252; ; ; NM_004651*:Homo sapiens ubiquitin specific protease 11 (USP11), mRNA. substrate 1 (PTPNS1), mRNA.; UCH-1,UCH-2;TM=M;; 4.13

446641; AL049229; Hs.15787; Homo sapiens mRNA;cDNADKFZp564O1016 (from clore DKFZps >1 i n =,p nasePBD,413 

400209; ;;NMJ)01666:Homo sapiens RhoGTPase /atinq r n4 HGAP4), mRN JZ - m M _ 0 06083.2 Gl; FCH,RhoGAP,SH3;TM=M;; 4.13 

429012; AW629596; Hs.194726; BCL2-associated athanogene 4; Sm,BAG;SS=M; 4.13

411826; AW947946; Hs.26706; CGI-121 protein; none,DSPc; 4.13

423189; M59371; Hs.171596; EphA2; fn3,pkinase,SAM,EPH_lbd;TM=Y;SS=M; 4.12

413934; U03056; Hs.75619; hyaluronoglucosaminidase 1; integrin_B,Glyco_hydro_56;SS=M; 4.12

414874; D26351; Hs.77515; inositol 1,4,5-triphosphate receptor, type 3; ion_trans,MIR,RYDR_ITPR;TM=Y;; 4.12

432047; NMJ)16247;Hs.272380;interphotoreceptor matrix proteoglyi E I E,' IM i'SS=M;4.12 

451820; AW058357; Hs.199248; ESTs; 7tm_1;TM=Y;SS=M; 4.12

445515; BE388665; Hs.179999; Homo sapiens, clone IMAGE3457003, mRNA; zf-C2H2,BTB,K_tetra,WD40,Syntaxin; 4.12
424539; L02911; Hs.150402; Activin A receptor, type I (ACVR1) (ALK-2); pkinase,Activin_recp;TM=M;SS=M; 4.12

405110,. C70001Q9q|1 4 )|sp|Q9Y5T7|KC HUM DIA( I ,LrfEROL KINASE, BETA (DIGLYCERIDE KINASE :; none.none; 4.12 
441026; AW179058; Hs.99858; ribosomal protein L7a; pkinase,LRR,LRRCT,Ribosomal_L7Ae,none; 4.11

443142; AI696513; Hs.108705; protein phosphatase 2 (formerly 2A), regulatory subunit A (PR 65), beta isoform; HEAT,Vitellogenin_N,HEAT_PBS;SS=M; 4.11
450505; NM 004572 H 25051 la! 3 .Kin 2, Armadillo_seg;TM=M;; 4.1 1 

459601; AL044470; Hs.270604; ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; none,SH3,PGAM,UBA; 4.10

417300; AI765227; Hs.55610; solute carrier family 30 (zinc transporter), member 1; Cation_efflux;TM=Y;SS=M; 4.10

427315; AA179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp564N0763 (from clone DKFZp564N0763); none,spectrin,SH3,PH,CH; 4.10

416239; AL038450; Hs.48948; ESTs; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase,none; 4.10

429311; AF080157; Hs.198998; conserved helix-loop-helix ubiquitous kinase; pkinase,none; 4.10

412146; M92444; Hs.73722; APEX nuclease (multifunctional DNA repair enzyme); Exo_endo_phos,Atrophin-1,BNR,Kelch;TM=M;; 4.10

418420; AW604405; Hs.324874; hypothetical protein MGC3079; Phosphodiest;TM=Y;; 4.10

434396; AA632270; Hs.162851; Homo sapiens cDNA FLJ14317 fis, clone PLACE3000401; pkinase,none; 4.10

454438 AA224053 H 1724 I: I J. on cycle 27; SPRY,7tm_3,ANF_receptor; 4.10 

439578; AW263124; Hs.315111; nuclear receptor co-repressor/HDAC3 complex subunit; WD40;TM=M;; 4.10

451995; AI827431; Hs.224645; ESTs, Weakly similar to IF16_HUMAN GAMMA-INTERFERON-INDUCIBLE PROTEIN IFI-16 [H.sapiens]; none,PAAD_DAPIN,HIN; 4.10

420340; NM_000734; Hs.97087; CD3Z antigen, zeta polypeptide (TiT3 complex); ITAM;TM=M;SS=M; 4.10

442942; AW167087; Hs.131562; ESTs; pkinase,none; 4.09

428187; AI687303; Hs.285529; G protein-coupled receptor 49; 7tm_1,none; 4.09 

418838; AW385224; Hs.35198; ectonucleotide pyrophosphatase/phosphodiesterase 5 (putative function); Phosphodiest;TM=Y;SS=M; 4.09
416445; AL043004' Hs.79337; KIAA0135 protein, pkin se.PAS T r,M,1 ' X 

427001; NM_006482; Hs.173135; dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 2; pkinase;TM=M;; 4.08

403608; ; ; C3001199:gi|7494834|pir||T15308 hypothetical protein B0286.2 - Caenorhabditis elegans||41; 7tm_1,7tm_2,GPS,WIF;TM=Y;SS=M; 4.08

427177; AB006537; Hs.173880; interleukin 1 receptor accessory protein; ig,TIR;TM=Y;SS=M; 4.06

401241; AB028989; ; mitogen-activated protein kma e 8 inte adlni proteii knot.T I v c mi TIL DUF139 SS-M 4 07 

444805;AB007899;Hs.12017;homologofyeaslubqui'r p nlig e potentia pi Lelia sodum channel regulator; WW,HECT,RNA_pol_A,none; 4.07 
448888; AW196663; Hs.200242; caspase recruitment domain protein 5; CARD;TM=M;; 4.06
426006; R49031; Hs.22627; ESTs; pkinase,TBC; 4.06

434521; NM_002267; Hs.3886; karyopherin alpha 3 (importin alpha 4); Armadillo_seg,IBB;TM=M;; 4.08

408761; AA057264; Hs.238936; ESTs, Weakly similar to (define not available 7496841) [C.elegans]; 7tm_1,none; 4.05

425289; AW139342; Hs.155530; interferon, gamma-inducible protein 16; PAAD_DAPIN,HIN;SS=M; 4.05

413109; AW389845; Hs.110855; ESTs; PH04,none; 4.05 

426457; AW894667; Hs.169965; chimerin (chimaerin) 1; DAG_PE-bind,RhoGAP,SH2;TM=M;; 4.05
435730; AB020635; Hs.4984; KIAA0828 protein; AdoHcyase,TrkA-N,2-Hacid_DH_C;TM=M;; 4.04

429747; M87507; Hs.2490; caspase 1, apoptosis-related cysteine protease (interleukin 1, beta, convertase); CARD,ICE_p10,ICE_p20;SS=M; 4.04
444378; R41339; Hs.12569; ESTs; ig,pkinase,LRR,LRRNT,LRRCT,none; 4.04

449843; R85337; Hs.24030; solute carrier family 31 (copper transporters), member 2; none;TM=Y;SS=M; 4.04 
427359; AW020782; Hs.79881; Homo sapiens cDNA: FLJ23006 fis, clone LNG00414; 7tm_1,none; 4.04
413095; AA494359; Hs.30715; potassium voltage-gated channel, Isk-related family, member 3; none,START; 4.04

418540; AI821597; Hs.90877; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ank,CAP_GLY,7tm_1; 4.03

442007; AA301116; Hs.142838; nucleolar phosphoprotein Nopp34; rrm,IRK;SS=M; 4.02

448659; AF191338; Hs.21712; TANK-binding kinase 1; pkinase;TM=M;; 4.02

412935; BE267045; Hs.75064; tubulin-specific chaperone c; none;; 4.02

414844; AA296874; Hs.77494; deoxyguanosine kinase; dNK;; 4.02 

445817; NM_003642; Hs.13340; histone acetyltransferase 1; none;TM=M;; 4.02

426728; NM_007118; Hs.171957; triple functional domain (PTPRF interacting); SH3,ig,pkinase,PH,spectrin,RhoGEF;TM=M;; 4.02
420676; AI434780; Hs.4248; vav 2 oncogene; RhoGEF,PH,CH,SH2,SH3,DAG_PE-bind,none; 4.02 

405102; ; ; C15001220*:gi|4469558|gb|AAD21311.1| (AF126008) breast cancer nuclear receptor-binding auxi; DAG_PE-bind,PH,RhoGEF,DC1;SS=M; 4.02
439964; AI732902; Hs.124652; Homo sapiens cDNA FLJ12376 fis, clone MAMMA1002494; pkinase,none; 4.01
429680; AL035754; Hs.2474; toll-like receptor 1; LRR,LRRCT,TIR;TM=M;SS=M; 4.01

453891; AB037751; Hs.36353; Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1035904; none,none; 4.01 
426535; AU077012; Hs.288582; EST \ I ir la jbiquitou I i if r f 1 1 (H apiens]; Kunitz_BPTI l Kunitz_BPTI,7tm_2,HRM; 3.99 
1 'E 1 r F 1460; p n kir nu; p DAG PE-bind,PH;TM 11 1 
408308; AL033377; Hs.44197; hypothetical protein DKFZp564D0<62; none,none; 3.98 
19517; AW500106; H r MASK; pkinase;TM=M 

404185; ; ; Target Exon; sugar Jr;TM=Y;SS=M; 3.98 

441226; BE563042; Hs. 1 1 8820; Homo sapiens, Similar to RIKEN cDNA 0S10012G03 genp Hn M if iplet T" ii 

4-^9638 AI91 bb° 1 f I c PLCE1 1 1 i S M 3 97 

417386; AL037228; Hs.82043; D123 gene product; NUDIX,secY,E1_dehydrog,transket_pyr;TM=Y;SS=M; 3.97

452721; AJ269529; Hs.301871; solute carrier family 37 (glycerol-3-phosphate transporter), member 1; MORN,sugar_tr;TM=Y;SS=M; 3.96

417183; R52089; Hs.172717; ESTs; pkinase,LRRCT,ig,LRR,LRRNT,none; 3.95

439176 AI446444 Hs 1 ESTs. Weak L I i en ] none;TM=M;; 3.94 

424490; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pkinase;TM=M;; 3.94 

422610; AF153820; Hs.1547; potassium inwardly-rectifying channel, subfamily J, member 2; IRK;TM=Y;; 3.94
450746; D82673; Hs.278589; general transcription factor II, i; none,SH3,PX; 3.94

418516; NM_006218; Hs.85701; phosphoinositide-3-kinase, catalytic, alpha polypeptide; PI3_PI4_kinase,PI3Ka,PI3K_C2,PI3K_rbd,PI3K_p85B,none; 3.94

414217; AI309298; Hs 279898 Hcrm pie, DMA: FLJ231I t done Jo^Jl nel>ANA,393 

416537; T99086; Hs.144904; nuclear receptor co-repressor 1; myb_DNA-binding,RNA_pol_A,none; 3.93

450747; AI064821; Hs.318635; ESTs, Highly similar to 1818357A EWS gene [H.sapiens]; rrm,zf-RanBP,GAS2; 3.93 

444825 AW167613, mil ;en-a< , led v. Ism Hnase kinase kinase 8; pkinase;TM=M;; 3.93 

408354 AU82PH I r ' > I i ne.none; 3.93 

453945; NM 005171; Hs.36908; activating ran.cnp nf 1; rrm,zf-RanBP,pkin e GST_C,Ets,SAM_PNT,ABC2_merabrane,myosinJead,IQ,Myosin_N,bZlP,zf- 

C2H2,PHD,BTB,TFIIS AT Jot SAM M=M !.9 
I 532; AF1 2 Hs. 184735; TBP-inte ng p-oteii m£dillo_seg.VHS,HEAT;TM--M;; 3.9: 

413967; AW204431; Hs.117853; ESTs, Weakly 'irril 3 I 8022 t-ypothet'cal p ein [H.s [en] Arm dilb_seg,IBB,PHD,DOT,none; 3.91 
415906; AI751357; Hs.288741; Homo sapiens cDNA: FLJ22256 fis, clone HRC02860; Ephrin,none; 3.91
450139; AK001838; Hs.296323; serum/glucocorticoid regulated kinase; none,none; 3.91
440255; AI932285; Hs.160569; ESTs; none,pkinase; 3.90

421077; AK000061; Hs.101590; hypothetical protein; ank,pkinase,death,SPRY,SAP,Ribosomal_L24e,SRP54,dDENN,DENN,uDENN;TM=M;; 3.90

4 211; H11850; Hs.12808; MAR pi nasp UBA,KA1;SS=M, 3 90 

433233; AB040927; Hs.301804; KIAA1494 protein; SH3,zf-C3HC4;TM=M;; 3.90 

419609; U46415; Hs.270379; gb:HSU46415 Human pancreatic cancer cell line Patu 8988t Homo sapiens cDNA clone xs476, mRNA sequence; PWWP,none; 3.90
433198; AA992841; Hs.27263; KIAA1458 protein; none,none; 3.89

407721; Y12735; Hs.38018; dual-specificity tyro (Y)-r. r I c n n TM M 3 89 

427657; AV652249; Hs.180107; polymerase (DNA directed), beta; none;TM=M;; 3.89
453035; AW581943; Hs.334; Rho guanine nucleotide exchange factor (GEF) 5; none,none; 3.89

446329; NM_013272; Hs.14805; solute carrier family 21 (organic anion transporter), member 11; kazal,OATP_N,OATP_C;TM=Y;SS=M; 3.89

429922; Z97630; Hs.226117; H1 histone family, member 0; linker_histone;TM=M;; 3.88

432074; AA525248; Hs.149723; ESTs; Y_phosphatase,none; 3.88 

435143; R12375; Hs.194600; ESTs; SH3,ig,pkinase,PH,spectrin,RhoGEF,none; 3.87

423198; M81933; Hs.1634; cell division cycle 25A; Rhodanese,none; 3.87

428474; AB023182; Hs.184523; KIAA0965 protein; pkinase;TM=M;; 3.87 

419073; AW372170; Hs.183918; Homo sapiens cDNA FLJ12797 fis, clone NT2RP2002066, highly similar to Rattus norvegicus transmembrane receptor Unc5H2 mRNA; death,ZU5;SS=M; 3.86

415457; AW081710; Hs.7369; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; MORN,sugar_tr;TM=Y;SS=M; 3.86
447061; D86964; Hs.17211; dedicator of cyto-kinesis 2; SH3;TM=M;; 3.86
425490; NM_001621; Hs.170087; aryl hydrocarbon receptor; PAC,PAS;TM=M;; 3.86

451961; NM_003800; Hs.27345; RNA guanylyltransferase and 5'-phosphatase; mRNA_cap_enzyme,DSPc,DNA_ligase,mRNA_cap_C;TM=M;; 3.86
417874; BE616160; Hs.82829; protein tyrosine phosphatase, non-receptor type 2; Y_phosphatase;TM=Y;; 3.86
446874; AW968304; Hs.56156; ESTs; none,RGS; 3.85

418630; AI351311; Hs.251946; poly(A)-binding protein, cytoplasmic 1-like; pkinase,none; 3.85

416140; AI918035; Hs.301198; roundabout (axon guidance receptor, Drosophila) homolog 1; none,none; 3.85

425474; Z48054; Hs.158084; peroxisome receptor 1; TPR;TM=M;; 3.85

n J 1 \L0 .IS" II 1 87; Iranslocase of outer mitochondrial membrane 20 (yeast) homolog; MAS20,zf-A20,VPS9;TM=M;SS=M; 3.85 
411770; NM_014278; Hs.71992; heat shock protein (hsp110 family); HSP70;TM=M;; 3.84
128782; X12830; Hs.193400; interleukin 6 receptor; fn3,ig;TM=Y;SS=M; 3.84
f 1 372505; H i leukin 1 recep p j IV M 
433376; AI249361; Hs.74122; caspase 4, apoptosis-related cysteine protease; CARD,ICE_p10,ICE_p20;SS=M; 3.83
440332; AI218517; Hs.188051; ESTs; fn3,pkinase,SAM,EPH_lbd,none; 3.83
445803; AV655264; Hs.4283; ESTs; pkinase,RGS,PH,myosin_head,Myosin_tail; 3.83
435905; AW997404 II 50C KIA f n dI B.RhoGAP.FCI l;TM=M;; 3.83 

414991; C17898; ; gb:C17898 Human placenta cDNA (TFujiwara) Homo sapiens cDNA clone GEN-554E10 5', mRNA sequence; Zip,none; 3.83

423067; AA321355; Hs.285401; colony stimulating factor 2 receptor, beta, low-affinity (granulocyte-macrophage); fn3;TM=Y;SS=M; 3.82

419088; AI538323; Hs.52620; integrin, beta 8; integrin_B,none; 3.82

411704; AI499220; Hs.71573; hypothetical protein FLJ10074; pkinase;TM=M;; 3.82

459346; AW510557; Hs.258016; EST; none;TM=M;; 3.82 

445330; R52656; Hs.21691; ESTs; 7tm_1,none; 3.82

451452; BE560065; Hs.26433; dolichyl-phosphate (UDP-N-acetylglucosamine) N-acetylglucosaminephosphotransferase 1 (GlcNAc-1-P transferase); Glycos_transf_4;TM=Y;SS=M; 3.81

405545; ; ; Target Exon; ABC_tran,SRP54,ABC_membrane;TM=Y;SS=M; 3.81

448165; NM_005591; Hs.20555; meiotic recombination (S. cerevisiae) 11 homolog B; Metallophos,Ribosomal_L15e;SS=M; 3.81
416305; AU076628; Hs.79187; coxsackievirus and adenovirus receptor; ig;TM=Y;SS=M; 3.80
415444; BE247295; Hs.78452; solute carrier family 20 (phosphate transporter), member 1; PHO4,LIM;TM=M;; 3.80
421684; BE281591; Hs.106768; hypothetical protein FLJ10511; Armadillo_seg;SS=M; 3.80

438581; AW977766; Hs.292133; ESTs, Moderately similar to I78885 serine/threonine-specific protein kinase [H.sapiens]; pkinase,RIO1,none; 3.79
439199; R40373; Hs.26299; ESTs; ion_trans,none; 3.78

450931; N25156; Hs.25648; tumor necrosis factor receptor superfamily, member 5; TNFR_c6;TM=Y;SS=M; 3.78
417691; AU076610; Hs.82399; low density lipoprotein receptor defect C complementing; none;SS=M; 3.78

430355; NM_006219; Hs.239818; phosphoinositide-3-kinase, catalytic, beta polypeptide; PI3_PI4_kinase,PI3Ka,PI3K_C2,PI3K_rbd,PI3K_p85B;TM=M;; 3.78
448119 H38587 Hs 346509 dedi air I i 1 inneTM=Y 3 78 

442013; AA506476; Hs.10600; Human DNA sequence from clone RP11-353C18 on chromosome 20 Contains ESTs, STSs, GSSs and CpG islands. Contains the NIFS gene for cysteine desulfurase, two genes for novel proteins and the gene for the splicing factor CC1.3 with a second isoform (CC1.; none,none; 3.77
425481; AW978162; Hs.18571; ESTs; none,Oxysterol_BP; 3.77

411411; AA345241; Hs.55950; ESTs, Weakly similar to KIAA1330 protein [H.sapiens]; RNA_pol_A,ig,MHCK_EF2_kinase;SS=M; 3.77

426866; U02330; Hs.172816; neuregulin 1; Peptidase_M49,EGF,ig,Neuregulin;TM=M;; 3.77

430396; D49742; Hs.241363; hyaluronan-binding protein 2; ank,death,ZU5,EGF,kringle,trypsin,Nebulin,LIM;SS=M; 3.77

434398; AA121098; Hs.3838; serum-inducible kinase; pkinase,POLO_box;TM=M;; 3.77

415485; AW272990; Hs.18571; ESTs; none,Oxysterol_BP; 3.76

453226; AA641926; Hs.61712; pyruvate dehydrogenase kinase, isoenzyme 1; HATPase_c,none; 3.76
118753; AW959311; I 172012; hypothetical protein Dl Zp134J0S pkirase,RI01;TM=M;; 3.7t 

424842; AA034127 H I534 iq nsducinq a-Japlor n lie (SH3 d i i I ' M motif) 1; SH3,VHS,UIM;TM=M;; 3.75 
426500; NM_014638; Hs.170156; KIAA0450 gene product; C2,PI-PLC-Y;TM=M;; 3.75
419952; AK000967; Hs.93872; KIAA1682 protein; none;TM=M;; 3.75 
25424; NM.004954; I ELl i i kinase,UBA rM=M;; 3.75 

431696; AA259068; Hs.267819; protein phosphatase 1, regulatory (inhibitor) subunit 2; none;SS=M; 3.75
444184; T87841; Hs.282990; Human DNA sequence from clone RP1-2SH20 on chromosome 20q13.1. Contains pai
proteins, the 5' end of a novel q«r EST T SS ndthre p3 I lands; pkir 
405411; ; ; ENSP00000252213:SODIUM BICARBONATE COTRANSPORTER.; none;TM=Y;SS=M; 3.75
405602; ; ; Target Exon; pkinase;SS=M; 3.75 

429355; AW973253; Hs.292689; ESTs; pkinase,bZIP,Armadillo_seg,none; 3.75 
430153; AW968128; Hs.336679; ESTs; pkinase,none; 3.74

414180 AI863304, i £ Hor os en" cDNAFLJ11448fis, clone HEMBA1001391; P!3_PI4Jinase,PI3Ka,PI3teC2,PI3K_rbd,H3K_p85B,none; 3.74 
432235' AA531 132 • jb n 17h06 1 NCLCGAP Pt9Horoo£ pier cDNA clone i iRNA sequence | , "ase rone; 3.74 

433390 M58B950 Hs.2601BO mRNA cDNA DKFZp761G18121 (from clone DKFZp761G18121); completecds; none,spectrin,SH3,PH,CH; 3.74 

426485; NM_006207; Hs.170040; platelet-derived growth factor receptor-like; ig;SS=M; 3.74
408414; AI1T4688; Hs.17998; ESTs, Weakly similar to 2109260A B cell groVh u >i I 1 - piens] 3 iTh r t' ' 
40S793; Al£ 163; Hs.1 1799 piol kinase linked: pi t nase 2TM M;; 3.73 
412456; T32689; Hs.7859; ESTs; BAG.none; 3.73 
l | in i 1l fic phosphol pcse C beta 1: C2,PI-PLC-Y,FI-I MM 

442229; AI885776; Hs.8164; Mulibrey nanism; MATH,DENN,GRAM,zf-B_box,dDENN,uDENN;SS=M; 3.73
450151; AI088196; Hs.22968; Homo sapiens clone IMAGE:451939, mRNA sequence; ig,pkinase,none; 3.72
408331; NM_007240; Hs.44229; dual specificity phosphatase 12; DSPc;TM=M;; 3.72
417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofilin_ADF;SS=M; 3.72

403391; ; ; C3001164*:gi|1730196|sp|P50573|GAR3_RAT GAMMA-AMINOBUTYRIC-ACID RECEPTOR RHO-3 SUBUNIT PRE; none;TM=Y;; 3.72

41752^20^4 h c 1 n I n 1NFLS_S1 Homo sap'ens cDNA clone 5', mRNA sequence; SH3;SS=M; 3.71 

428428; AL037544; Hs.164298; cyclin-dependent kinase 7 (homolog of Xenopus MO15 cdk-activating kinase); pkinase;TM=M;; 3.71

428180; AI129767; Hs.182874; guanine nucleotide binding protein (G protein) alpha 12; G-alpha,arf;TM=M;; 3.71

42212 AW5D42 r H II 0 T binding f J NN,DENN,GRAM,PH;SS=M; 3.70 

430 570, AI417881. Hi ?9? 1 = 1 " *1 . "n.. : T cd.none; :.70 

452561; AI692181; Hs.49169; KIAA1634 protein; TPR,PDZ,WW,Guanylate_kin;TM=M;; 3.69

432336; NM_002759; Hs.274382; protein kinase, interferon-inducible double stranded RNA dependent; dsrm,pkinase;TM=M;; 3.69

419945; AW290975; Hs.118923; ESTs; SH3,PDZ,Guanylate_kin,transferrin; 3.69 

425539; AB011155; Hs.170290; discs, large (Drosophila) homolog 5; SH3,PDZ,Guanylate_kin;TM=M;; 3.68

436534 AA721 628 M U1 immun bbulrn'uperfamrly receptor translocation associated 2, :g;TM=Y;SS=M; 3.68 

1 N581 H 11^ J r I F5_F8_I i (r r e Et none; 3.67 

420297; AI628272; Hs.88323; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; pkinase,TUDOR,none; 3.67

417863; AB000450; Hs.82771; vaccinia related kinase 2; pkinase;TM=M;; 3.67

425304; AA463844; Hs.31339; fibroblast growth factor 11; FGF,Neur_chan_LBD,Neur_chan_memb,none; 3.67 
418318; U47732; Hs.84072; transmembrane 4 superfamily member 3; transmembrane4;TM=Y;SS=M; 3.67 
419511; AA429750;Hs.75113; general tran.cnptrr fa I!'. 3lypic <n none; 3.66 
424315; AW614850; Hs.193384; putative 28 kDa protein; none,none; 3.66
413076; U10564; Hs.75188; wee1 (S.pombe) homolog; pkinase;TM=M;; 3.66

425838; NM_014071; Hs.159613; nuclear receptor coactivator RAP250; peroxisome proliferator-activated receptor interacting protein; thyroid hormone receptor binding protein; none;TM=M;; 3.65

446983; AA157484; Hs.97199; complement component C1q receptor; EGF,lectin_c,Tissue_fac,Xlink,TIL;TM=Y;SS=M; 3.65
434350; AL042940; Hs.93872; KIAA1682 protein; none,none; 3.65

457317; AA683016; Hs.12210; hypothetical protein FLJ13732 similar to tensin; SH2;TM=M;; 3.65

434416; AA805903 Hs594"C celMi in cycle 2-I rt 11 i J i on controller); pkinase.none; 3.55 

410174; AA306007; Hs.59461; DKFZP434C245 protein; none,DSPc; 3.65 

423598; BE247600; Hs.155538; ESTs; 7tm_1;TM=Y;SS=M; 3.65 

440861; BE244115; Hs.7482; KIAA0682 gene product; rrm,Guanylate_kin;TM=M;; 3.64

454954; AW993013; Hs.49169; KIAA1634 protein; TPR,PDZ,WW,Guanylate_kin;TM=M;; 3.64

430250; NM_016929; Hs.283021; chloride intracellular channel 5; none;TM=M;; 3.64

450587; AI828854; Hs.258538; striatin, calmodulin-binding protein; pkinase,WD40;TM=Y;; 3.54

424893 AW29511? H 15364 Hem pisn cDNA FLJ13303 lis, clone OVARC1 001 372, highly similar to Homo sapiens Iiprin-alpha4 mRNA; SAM;SS=M; 3.64 

425645; AA361027; ; gb:EST70242 T cell lymphoma Homo sapiens cDNA 5' end, mRNA sequence; HMG_box,DNA_mis_repair,HATPase_c,none; 3.64

417426 NM 002211 1 1 32124 mi n eta 1 laminir L L minir_Nterm,integrin_B;SS=M; 3,63 

451292; AB037716; Hs.25204; KIAA1295 protein; SH3;TM=M;; 3.63 

412314; AA825247; Hs.250899; heat shock factor binding protein 1; 7tmJ TM f;SS=M i F 

418303; AA215701; Hs.136541; ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]; eIF5_eIF2B,W2,pkinase,UBA,KA1; 3.63
452716 AI914926 H 2 j E 1 - i SH2 , STAT,STAT_bind,STAT_prot,none; 3.63 

403869; ; ; NM_004520*:Homo sapiens kinesin heavy chain member 2 (KIF2), mRNA member 3 (KCNQ3), mRNA.; kinesin;TM=M;; 3.63

450377; AB033091; Hs.74313; KIAA1265 protein; Zip;TM=M;SS=M; 3.63

417793; AW405434; Hs.82575; small nuclear ribonucleoprotein polypeptide B"; rrm;TM=M;; 3.63

404942; U30825; ; splicing factor a qimn erii I TM=Y;SS=M; 3.63 

429554; NM_01227 20722 mi eukinl.dell II 1 TM M, 3 63 

417871; AA521368; Hs.24252; ESTs; IBB,Armadillo_seg,none; 3.62 

437672; AW748265; Hs.5741; flavohemoprotein b5+b5R; heme_1,NAD_binding,lipoxygenase,FAD_binding_6;TM=M;; 3.62

I. II I I 1" protein [H.sapiens] IT 

447217 BE465754 H 1 pil I '1 F F i MtrSM3b1 

407961; AW672939; Hs.41694; origin recognition complex, subunit 2 (yeast homolog)-like; none,pkinase,pro_isomerase; 3.61
428840; M15990;Hs.194148;v-yes-1 Yamaguchi j rcg ii Ir-g H_ H t < la e SS=M; 3.61 

455608; BE011437; ; gb:CM4-BN0220-080500-170-f03 BN0220 Homo sapiens cDNA, mRNA sequence; none,CDK5_activator; 3.61
407748; AL079409; Hs.38176; KIAA0606 protein; SCN Circadian Oscillatory Protein (SCOP); PP2C,LRR,PH;TM=M;; 3.60
421474; U76362; Hs.104b 17; " nluteca i e. fa nily 1 (gluta n te transpo ler) n ember 7; Dl ,TM=Y;SS=M; 3.60 

449987; AW079749; Hs.184719; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ABC_tran,ABC_membrane,ion_trans; 3.60
403142; ; ; NM_002706*:Homo sapiens protein phosphatase 1B (formerly 2C), magnesium-dependent, beta isoform (PPM1B), mRNA.; PP2C;TM=M;; 3.60
400844 NM 00310 H r II M CF )A repeats-containing (SORL1), mRNA; 

EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 3.59
450152; AI138635; Hs.22968; Homo sapiens clone IMAGE:451939, mRNA sequence; ig,pkinase,none; 3.59
429782; NM_005754; Hs.220689; Ras-GTPase-activating protein SH3-domain-binding protein; rrm,NTF2;TM=M;; 3.59
436469; AK001455; Hs.5198; Down syndrome critical region gene 2; none;SS=M; 3.59
1 B01 I 3 L I rj r I r » I r r 1" 

426797; AW936258; Hs.342849; ADP-ribosylation factor-like 5; arf,Ca_channel_B,SH3; 3.58
431170; AW971246; Hs.291022; ESTs; LRR,CARD,none; 3.58 

434542; AA769310; Hs.61260; hypothetical protein FLJ13164; PH,Oxysterol_BP;TM=M;SS=M; 3.58
420181; AI380089; Hs.158951; ESTs; none,ig,pkinase,LRR,LRRCT; 3.57

450572; AI700863; Hs.202494; Homo sapiens cDNA FLJ13245 fis, clone OVARC1000631; Na_sulph_symp,none; 3.57
433618; AA602539; Hs.345494; ESTs; G-alpha,A_deaminase; 3.57 

452695; AW780199; Hs.30327; milogen-activated p nt r i e-ac led prolein kinase 5; pi e.noi i 

418512; AW498974; ; diacylglycerol kinase, zeta (104kD); ras,none; 3.57

451752; AB032997; Hs.26966; KIAA1171 protein; ATP-synt_C,TBC;TM=Y;SS=M; 3.57

417129; AI381800; Hs.300684; calcitonin gene-related peptide-receptor component protein; none,none; 3.57

449474; AA019344 He 055 jd qu I r sctivEting enzyme E1 (A1S9T and BN75 temperature sensitivity complementing); ThiF,UBACT,pkinase,UCH-2,UCH-1,rrm,zf-C2H2,zf- 
RanBP.G-patch; 3.57 

412124; H43378; Hs.288550; Homo sapiens cDNA: FLJ23156 fis, clone LNG09609; none,none; 3.56
435021; AA922192; Hs.54709; ESTs; EPH_lbd,pkinase,fn3,SAM,none; 3.56 

431341; AA307211; Hs.251531; proteasome (prosome, macropain) subunit, alpha type, 4; proteasome;TM=M;; 3.56

437387; AI198874; Hs.28847; AD026 protein; none,7tm_1,WD40; 3.56

422583; AA410506; Hs.27973; KIAA0874 protein; ank,G-alpha;TM=M;; 3.56

452102; U04343; Hs.27954; CD86 antigen (CD28 antigen I ganJ 2 7 r one;TM h 

420112; NM_005109; Hs.95220; oxidative-stress responsive 1; pkinase;TM=M;; 3.55

437639; AA827712; Hs.291880; ESTs; SH3,none; 3.55 

457500; NM_002759; Hs.274382; protein kinase, interferon-inducible double stranded RNA dependent; dsrm,pkinase;TM=M;; 3.56
415660; AI909007;Hs 7 i.i J ubiquitin c onjug at ng enzyme E2G 1 (homologous to C. elegans UBC7J' UQ_con;TM=M;; 3.55 
'is Hs.2142C;p21-acfvated protein kin FBI rM M r 

428727; AF078847; Hs.191356; general transcription factor IIH, polypeptide 2 (44kD subunit); PHO4,LIM;TM=M;; 3.55
411190 AA306342 H< J i 1 p eir in t i p HR1 TM=M 355 

408683; R58665; Hs.46847; TRAF and TNF receptor-associated protein; Exo_endo_phos;TM=M;; 3.55

412350; AI659306; Hs.73826; protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte); Y_phosphatase,Band_41,PDZ;TM=M;; 3.55
446742; AA232119; Hs.16085; putative G-protein coupled receptor; none;TM=Y;SS=M; 3.55

427283; AL119796; Hs.174185; ectonucleotide pyrophosphatase/phosphodiesterase 2 (autotaxin); Sulfatase,Somatomedin_B,Phosphodiest,Endonuclease;TM=M;SS=Y; 3.55
414888; AL03918 C Hs.7755f d lurmone i ceptoi ntera;lor 7; HMG14_17,none; 3.55 
424848; AI263231; Hs.327090; EST; SH3,PDZ,Guanylate_kin,none; 3.54 

402704 ,C1001099'gi|6005896|ref|NP_009101 1|te p p in nas 2 [Homo sapiens) gi|4; none.none; 3.54 

444099 D87432 Hill I c t n) member 6; aa_permeases;TM=Y;SS=M; 3.54 

429687; AI675749; Hs.211608; nucleoporin 153kD; zf-RanBP,integrin_B;TM=M;; 3.53 

413879; AA132961; Hs.212533; Homo sapiens cDNA: FLJ22572 fis, clone HSI02313; none,none; 3.53

431045; AW968560; Hs.301957; nudix (nucleoside diphosphate linked moiety X)-type motif 5; NUDIX,secY,E1_dehydrog,transket_pyr;TM=Y;SS=M; 3.53

423855; AA331761; Hs.254859; ESTs; none,pkinase,UQ_con,vwa,FG-GAP,integrin_A; 3.53

440682; AW362152; Hs.27181; nuclear receptor binding factor-2; cyclin,bZIP;TM=M;; 3.52

410686; AI733735; Hs.1 14905; IRE1, S. cerevisiae, homolog of; pkinase, Bacterial PQQ;TM M S M ' 2 

449810; AB008681 ; Hs.23994; activin A receptor pellB;pl 1 t 1 M SS-M 352 

418755; Y14443; Hs.88219; zinc finger protein 200; zf-C2H2,zf-BED;TM=M;; 3.52

448804; AW512213; Hs.342849; ADP-ribosylation factor-like 5; arf,Ca_channel_B,SH3; 3.52

438507; AA809052; Hs.182018; ESTs; none,none; 3.52

r L 3 I I 1C glucocorticoid receptor DNA binding factor 1 ; none, PAS; 3.51 

410054; AL120050; Hs.58220; Homo sapiens cDNA: FLJ23005 fis, clone LNG00396, highly similar to AF055023 Homo sapiens clone 24723 mRNA sequence; RasGAP,adenylatekinase; 3.51
422321; AA906427; Hs.181035; hypothetical protein MGC11236; none;TM=M;; 3.51
445701; AF055581; Hs.13131; lymphocyte adaptor protein; SH2,PH;TM=M;; 3.50

407393; AB038237; ; gb:Homo sapiens mRNA for G protein-coupled receptor C5L2, complete cds.; 7tm_1;TM=Y;SS=M; 3.50

443303; U67319; Hs.9216; caspase 7, apoptosis-related cysteine protease; pkinase,ICE_p10,ICE_p20;TM=M;SS=M; 3.50

420673; AB008112; Hs.99847; peroxisome biogenesis factor 1; AAA,APS_kinase;TM=M;SS=M; 3.49

424663; NM_002351; Hs.151544; SH2 domain protein 1A, Duncan's disease (lymphoproliferative syndrome); SH2;TM=M;; 3.49

12932 -.111 p i andinE receptor 4 (subtype EP4); 7tmJ;TM=Y;SS=M: 3.49 

400178; ; ; Eos Control; none,Somatomedin_B; 3.49 

439549; AW937885; Hs.137314; ESTs; SH2,none; 3.49 

436345; AA873008; Hs.121572; ESTs; CARD,BIR,zf-C3HC4,CARD,BIR,zf-C3HC4; 3.49
427658; H61387; Hs.30868; nogo receptor; LRR,LRRNT,LRRCT;SS=M; 3.48 

402833; ; ; C1002508:gi|6691937|emb|CAB65797.1| (AL096770) bA150A6.2 (novel 7 transmembrane receptor; none,none; 3.48

442363; AI337304; Hs.23120; PIST; fn3,pkinase,PDZ,DUF139;TM=Y;SS=M; 3.48

409132 A12245 8 H 50732; prol /r I 1 catalytic sut tnneTM=M347 

417971; Y08991; Hs.83050; phosphoinositide-3-kinase, regulatory subunit 4, p150; pkinase,WD40,HEAT;SS=M; 3.47 

432159; Y00971; Hs.2910; phosphoribosyl pyrophosphate synthetase 2; Pribosyltran;; 3.47 

447425; AI963747; Hs.18573; acylphosphatase 1, erythrocyte (common) type; Acylphosphatase;SS=M; 3.47

427231; AW851989; Hs.285814; sprouty (Drosophila) homolog 4; SH2,SH3;TM=M;SS=M; 3.46 

401851; ; ; NM_002401*:Homo sapiens mitogen-activated protein kinase kinase kinase 3 (MAP3K3), mRNA.; pkinase;SS=M; 3.46 

407877; AW016811; Hs.234478; Homo sapiens cDNA: FLJ22648 fis, clone HSI07329; pkinase,pkinase_C,none; 3.45
432279; N95104; Hs.274260; ATP-binding cassette, sub-family C (CFTR/MRP), member 5; ABC_tran,ABC_membrane,none; 3.45
437103; AW139408; Hs.152940; ESTs; Choline_kinase,none; 3.45

420338; AA825595; Hs.88269; Homo sapiens, clone MGC17339, mRNA complete cds; 7tm_1;TM=Y;SS=M; 3.44 
422209; AF005210; Hs.113222; chemokine (C-C motif) receptor 8; 7tm_1,7tm_2;TM=Y;SS=M; 3.44
410781; AI375672; I ls.165C26; ESTs; pkirase.laminin^Nlerm.laminin^EGF.cyclir.F-box.cycin^C.oerpin/iTP-synLCo.M 
437296; AA350994; Hs.20281; KIAA1700; Rhodanese,DSPc;TM=M;; 3.43 

19S55; AI935 Hs.144139; E zf-C3Ht UB bl it ' 12 3l I3,zf-C M UBA,CbLN,CblJI2,Cbt_N3; 3.43 
433336; AF017986; Hs.31386; secreted frizzled-related protein 2; Fz,NTR;SS=M; 3.43
428483; AI908539; Hs.184592; KIAA0344 gene product; none,none; 3.43

4451 19; AF035121; Hs.12337; kinase insert domain receptor (a type Ik <-pl tyri ne nase j r> i a eTM=Y;SS=M; 3.42 
1 I 1 H 1 1 r 1 or initiat'on f li l J I L I 

410386; W26187; Hs.3327; Homo sapiens cDNA: FLJ22219 fis, clone HRC01537; pkinase,Guanylate_kin,PDZ,SH3,L27,none; 3.42
422907; AI879263; Hs.77273; Human glucose transporter pseudogene; none,none; 3.42 
449816; AI701457; Hs.38694; ESTs; SET,BAH,PK,PK_C; 3.42 

440074; AA863045; Hs.10669; ESTs, Weakly similar to T00050 hypothetical protein KIAA0400 [H.sapiens]; SH3,ank,tubulin-binding,ArfGap,PH;TM=M;SS=M; 3.42
125175, W5b339 H 1 17057 I pi re.none 3 42 

401242; AB028989;; mitogen-activated protein kinase 8 interactir protein 3; Cys_knot TGF-beta,v awe ™d,TIL,DUF139;SS=M; 3.41 
429276; AF056085; Hs.198612; G protein-coupled receptor 51; 7tm_3,ANF_receptor,bZIP;TM=Y;; 3.41
445800; AA126419; Hs.32944; inositol polyphosphate-4-phosphatase, type 1, 107kD; none,none; 3.41
410908; AA121686; Hs.10592; ESTs; GTP_EFTU,GTP_EFTU_D3,GTP_EFTU_D2,none; 3.41
452960; AK001335; Hs.31137; protein tyrosine phosphatase, receptor type, E; Y_phosphatase,none; 3.40
447898; AW969638; Hs.112313; 6.2 kd protein; none,none; 3.40
450402; BE218027; Hs.89969; ESTs; SH3,none; 3.40
441466; AW673081; Hs.54828; ESTs; pkinase,zf-C2H2,KRAB,none; 3.40
408546; W49512; Hs.46348; bradykinin receptor B1; 7tm_1;TM=Y;SS=M; 3.40

410927; T77635; ; gb:yc91h06.r1 Soares infant brain 1NIB Homo sapiens cDNA clone 5', mRNA sequence; none,none; 3.40
409646; AVU61391 5.709; deoxycyl'dii IN SS=M; 3,39 

417165; R80137; Hs.302738; Homo sapiens cDNA: FLJ21425 fis, clone COL04162; Sulfate_transp,STAS,HMG_box; 3.39

449343 / 1 i protein phosphatase 3 (fori c 3 u r ph i oform (calcineurin A alpha); none.none; 3.39 

450511; R37423; Hs.85392; thyroid hormone receptor interactor 11; Myosin_tail,EGF; 3.39

414271, AK0DIL ' Ho.75371; pn tein kinase C binding protein 1; bromodomain,PHD,PWWP,zf-MYND;TM=M;; 3.38 

418428; Y12490; Hs.85092; thyroid hormonereceptor rteracl r1l tZ!P> - n,GTP_cyc M l,M;TM=M;; 3.37 

422369; AF005216; Hs.115541; Janus kinase 2 (a protein tyrosine kinase); SH2,pkinase;TM=M;; 3.37 

456451; AI761180; Hs.94211; rcd1 (required for cell differentiation, S.pombe) homolog 1; none;TM=M;; 3.37
438543; AA810141; Hs.192182; ESTs; SH2,pkinase,none; 3.37

401943; NM_012434; ; solute carrier family 17 (anion/sugar transporter), member 5; none;TM=M;; 3.36

415276; U88666; Hs.78353; SFRS protein kinase 2; pkinase;TM=M;; 3.36 

447881; BE620886; Hs.75354; GCN1 (general control of amino-acid synthesis 1, yeast)-like 1; pkinase,pkinase; 3.35
434533; AA639257; Hs.292549; ESTs; SH3,PDZ,Guanylate_kin,none; 3.35
432639; AW973785; ; gb:EST385886 MAGE resequences, MAGM Homo sapiens cDNA, mRNA sequence; none,IRK; 3.35
410678; BE540516; Hs.293732; hypothetical protein MGC3195; Armadillo_seg;TM=M;SS=Y; 3.35
402807; ; ; ENSP00000235229;SEMB.; integrin_B,Sema,PSI;TM=Y;; 3.35 
420189; AW296380;Hs.95821 o te ci il t 1bH3ank3 34 

437389; AL359587; Hs.271586; hypothetical protein DKFZp762M115; secY,E1_dehydrog,transket_pyr,none; 3.34 

453423; NM_002647; Hs.32971; phosphoinositide-3-kinase, class 3; PI3_PI4_kinase,PI3Ka,PI3K_C2;TM=M;; 3.34

414270; L2Q352; Hs.247527; solute earner family 20 (phosphate transporter) membe 2 Cm i to r \PH04;TM > , h 
417479; AI057052; ; ESTs, Weakly similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens]; LRR,CARD,none; 3.33
424946, M64. - . Hs.1 393; pr 'em tyrosine phosphatase, non-receptor type 3; Band_41,PDZ,Y_phosphatase,none; 3.33 
452681; AF153330; Hs.30246; solute carrier family 19 (thiamine transporter), member 2; Folate_carrier;TM=Y;SS=M; 3.33

426477; AA379464; ; gb:EST92386 Skin tumor I Homo sapiens cDNA 5' end, mRNA sequence; DUF6,MATH,BTB; 3.33

438283; AI458931; Hs.37282; ESTs; none,transmembrane4; 3.33 
421327; AA837295; Hs.188802; ESTs; none,IMP4,Y_phosphatase; 3.33

432481; AW451645; Hs.151504; Homo sapiens cDNA FLJ11973 fis, clone HEMBB1001221; laminin_G,Collagen,COLFI,CorA,TSPN,none; 3.33

452682; AA456193; Hs.9071; progesterone membrane binding protein; homeobox,none; 3.32
3 5 428997; AF065391; Hs.194718; zincfinger protein 25b zf-Ra b IM f ) 2 

432211; BE274530; Hs.273333; hypothetical protein FLJ10986; FGGY_C;TM=M;; 3.31

443601; AI07-. 1 Hs 1563 E Ts, ank,pkinase,death,RlbosomaLS14: 3.31 

430597; AF062006; Hs.285529; G protein-coupled receptor 13 tm_1,L LRRM1;TM-Y;S3-M 1 

419912; AF249745; Hs.6066; Rho guanine nucleotide exchange factor (GEF) 4; SH3,PH,RhoGEF;TM=M;; 3.31 
400380; NM_018485; Hs.283079; G protein-coupled receptor C5L2; 7tm_1;TM=Y;SS=M; 3.31

415983; AI436798; Hs.117078; Homo sapiens cDNA: FLJ23028 fis, clone LNG01852, highly similar to HSU08023 Human cellular proto-oncogene (c-mer) mRNA; fn3,ig,pkinase;TM=Y;SS=M; 3.31

441054; AA913591; Hs.126480; ESTs; none,7tm_1; 3.31

418342; BE002723; ; leptin receptor; ICE_p20,DED,ICE_p10,ICE_p20,DED; 3.31
446128; AW836779; Hs.113029; ribosomal protein S25; none,7tm_1; 3.31

1 Hs H piers cC M r T ° e r J L =f r 1 yalej 14 1 

425725; NM_012243; Hs.159322; solute carrier family 35 (UDP-N-acetylglucosamine (UDP-GlcNAc) transporter), member 3; DUF6;TM=Y;SS=M; 3.30

422608; AW160644; Hs.118695; potassium voltage-gated channel, subfamily G, member 1; ion_trans,K_tetra;TM=Y;; 3.30

429061; Y14039; Hs.195175; CASP8 and FADD-like apoptosis regulator; ICE_p20,DED;TM=M;; 3.30
433656; AW974941; Hs.292385; ESTs, Weakly similar to I78885 serine/threonine-specific protein kinase [H.sapiens]; pkinase,ABC1,none; 3.30

413132; NM_006823; Hs.75209; protein kinase (cAMP-dependent, catalytic) inhibitor alpha; PKI;SS=M; 3.30

402603; ; ; ENSP00000251206*:KIAA0778 PROTEIN (FRAGMENT).; none;TM=Y;; 3.30

418801; AA228366; Hs.115122; ESTs; integrin_A,FG-GAP,none; 3.30

400275; ; ; NM_006513*:Homo sapiens seryl-tRNA synthetase (SARS), mRNA. (SAM6B), mRNA.; tRNA-synt_2b,Seryl_tRNA_N;TM=M;; 3.29
55 440286 U* -■ l; Hs.7138; cholinergic i ptor, mi artr 7tm_1 ;TM=Y;; 3.29 

409101; NM_004297; Hs.50612; guanine nucleotide binding protein (G protein) alpha 11; G-alpha,none; 3.29
432736; AA788898; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 3.29
408738; NM_014785; Hs.47313; KIAA0258 gene product; none;TM=M;; 3.29

443195; BE148235; Hs.193063; Homo sapiens cDNA FLJ14201 fis, clone NT2RP3002955; Aa_trans,none; 3.29
405328; ; ; NM_005391:Homo sapiens pyruvate dehydrogenase kinase, isoenzyme 3 (PDK3), mRNA.; HATPase_c;SS=M; 3.28
418764; N30531; Hs.42215; protein phosphatase 1, regulatory subunit 6; none,none; 3.28

408756; AA524743; Hs.44883; ESTs; Armadillo_seg,IBB,DEAD,helicase_C,Sec63,DDT,PHD,bramodomain;TM=M;; 3.28 

415474; NM_014252; Hs.78457; solute carrier family 25 (mitochondrial carrier; ornithine transporter) member 15; mito_carr;TM=M;; 3.28

417805; U38545; Hs.32587; phospholipase D1, phosphatidylcholine-specific; PH,PLDc,PX;TM=M;; 3.28
410254; BE004131; Hs.318510; Homo sapiens cDNA FLJ13682 fis, clone PLACE2000015, weakly similar to EPIDERMAL GROWTH FACTOR RECEPTOR SUBSTRATE 15; efhand,none; 3.28

443968; AA287702; Hs.10031; KIAA0955 protein; CARD;TM=M;SS=M; 3.28

438899; AF085833; Hs.135624; ESTs; none,PI3_PI4_kinase,PI3Ka,PI3K_C2,PI3K_rbd,PI3K_p85B; 3.27

415663; AW296841; Hs.313332; ESTs; UQ_con,Neur_chan_LBD,Neur_chan_memb; 3.27
414087; W19712; ; gb:zb36d03.r1 Soares_parathyroid_tumor_NbHPA Homo sapiens cDNA clone 5', mRNA sequence; pkinase,none; 3.27

442833; AA328153; Hs.88201; ESTs, Weakly similar to A Chain A, Crystal Structure Of The Human Acyl Protein Thioesterase 1 At 1.5 A Resolution [H.sapiens]; abhydrolase_2;TM=M;; 3.27

444754; T83911; Hs.11881; transmembrane 4 superfamily member 4; none;TM=Y;SS=M; 3.25

432579; AF043244; Hs.278439; nucleolar protein 3 (apoptosis repressor with CARD domain); CARD;TM=M;; 3.26
458943; AW249181; Hs.19954; ESTs, Weakly similar to T19873 hypothetical protein C41C4.2 - Caenorhabditis elegans [C.elegans]; none,pkinase,RGS; 3.25

411974; AW880414; Hs.84264; acidic protein rich in leucines; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase,asp; 3.26

437145 AF007">1 rierfamil u b 1 I ansporter, member -IC03_cotransp;TM=Y;; 3.26 

423387; AJ012074; ; vasoactive intestinal peptide receptor 1; 7tm_2,HRM,CSD;TM=Y;SS=M; 3.25 

442643; U82756; Hs.3991; PRP4/STK/WD splicing factor; WD40;SS=M; 3.25
417525; R93355; Hs.192991; ESTs, Weakly similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!! [H.sapiens]; SH3,ig,pkinase,PH,spectrin,RhoGEF;SS=M; 3.25

412283; BE069084; ; gb:QV3-BT0379-140100-058^12 BT0379 Homo sapiens cDNA, mRNA sequence; ion_trans,RYDR_ITPR,MIR,none; 3.25

411213; AA676939; Hs.69285; neuropilin 1; MAM,F5_F8_type_C,CUB,CUB,MAM,F5_F8_type_C; 3.25

400352; AF227133; ; taste receptor, type 2, member 7; none;TM=Y;SS=M; 3.25
402974; ; ; Target Exon; Y_phosphatase,GnRH,hormone5,hormone4; 3.25

407644; D16815; Hs.37288; nuclear receptor subfamily 1, group D, member 2; hormone_rec,zf-C4;TM=M;SS=M; 3.25
421654 AW163267 k 106 u r 1 ( L i» li icase bS=V, 3 25 

438022; AW517E24 35201 I )D2 ; i item, LRR,CARD,GTP_CDC,ViraLhelicase1;TM=M;; 3.24 
449964; AW001741; Hs.24243; hypothetical protein FLJ10706; pkinase;TM=M;; 3.24

428816; AA004986; Hs.193852; ATP-binding cassette, sub-family C (CFTR/MRP), member 2; EGF,sushi,trypsin,CUB,ABC_tran,ABC_membrane;SS=M; 3.24
427319; AW631495; Hs.27135; B-cell receptor-associated protein BAP29; filament;TM=Y;SS=M; 3.24
421970; AF227156; Hs.110103; RNA polymerase I transcription factor RRN3;

aajemieases.pyrtdoxalJeCMm^^ 
411887; AW182924; Hs.128790; ESTs; pkinase;TM=M;; 3.24 
430180; AA331406; Hs.75456; A kinase (PRKA) anchor protein 10; RGS;SS=M; 3.24
410267; AW978005; Hs.12600; N-ethylmaleimide-sensitive factor attachment protein, beta; none,NTF2; 3.23
410240; AL157424; Hs.61289; synaptojanin 2; Exo_endo_phos,Syja_N,rrm,Gram-ve_porins;TM=M;; 3.23
434510; AF143885; Hs.18190; EST; SH3,FCH,none; 3.22 

422592; BE081857; Hs.94211; rcd1 (required for cell differentiation, S.pombe) homolog 1; none,PI-PLC-X,PH,PI-PLC-Y,C2; 3.22

439803; AA001021; Hs.6685; thyroid hormone receptor interactor 8; none,none; 3.22

448520; AB002367; Hs.21355; doublecortin and CaM kinase-like 1; pkinase,DCX;TM=M;; 3.22 

409245; AA361037; Hs.288036; tRNA isopentenylpyrophosphate transferase; Armadillo_seg;TM=M;; 3.22 

458946; AA009716; Hs.42311; ESTs; none,DSPc,Y_phosphatase; 3.22

409048; H59990; Hs.37699; ESTs; Armadillo_seg,IBB,none; 3.22 

420357; U94333; Hs.97199; complement component C1q receptor; EGF,lectin_c,Tissue_fac,Xlink,TIL;TM=Y;SS=M; 3.22
426230; M367019; Hs.241395; protease, serine, 1 (trypsin 1); trypsin,toxin_4;SS=M; 3.21

411352; NM_002890; Hs.758; RAS p21 protein activator (GTPase activating protein) 1; SH2,SH3,C2,PH,RasGAP;TM=M;SS=M; 3.21

438333; R39382; Hs.25283; cyclin-dependent kinase 8; pkinase,none; 3.20

414202; BE275653; Hs.270379; transmembrane 6 superfamily member 1; 7tm_5,none; 3.20

429651; D79248; Hs.279870; ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens]; MgtE,none; 3.20

400987; ; ; C11000939:gi|11464993|ref|NP_065260.1| gene for odorant receptor MOR83 [Mus musculus] gi|6; none;TM=Y;SS=M; 3.20

413760; Z25101; Hs.25127; Homo sapiens mRNA for KIAA1725 protein, partial cds; none,ank,ArfGap; 3.20

408468; AI909712; Hs.93837; phosphatidylinositol transfer protein, membrane-associated; PX,PH,PLDc,PH,PLDc,PX; 3.20

409463; AI458165; Hs.17296; hypothetical protein MGC2376; K_tetra;TM=M;; 3.20 

425910; AA830797; Hs.184760; CCAAT-box-binding transcription factor; none;TM=M;; 3.19

423798; AF047033; Hs.132904; solute carrier family 4, sodium bicarbonate cotransporter, member 7; HCO3_cotransp;TM=Y;SS=M; 3.19
407753; AL045916; Hs.293419; ESTs; Ephrin,none; 3.19

419355; AA428520; Hs.90061; progesterone binding protein; heme_1;TM=Y;SS=M; 3.19
454128; AL031259; Hs.41639; programmed cell death 2; zf-MYND;TM=M;; 3.19 

421202; AF193339; Hs.102506; eukaryotic translation initiation factor 2-alpha kinase 3; pkinase;TM=Y;SS=M; 3.19

446360; N42553; Hs.267914; homolog of mouse transient receptor potential-phospholipase C-interacting kinase ChaK; hypothetical protein FLJ20117; ion_trans,MHCK_EF2_kinase;TM=M;; 3.18
458882; R34993; Hs.226666; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; CRAL_TRIO,PKI; 3.18
424124; AA335609; Hs.7589; ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens]; pkinase,TBC; 3.13
444745; AF117754; Hs.11861; thyroid hormone receptor-associated protein, 240 kDa subunit; none;TM=M;; 3.18
426399; AA652588; Hs.301348; Homo sapiens cDNA FLJ13271 fis, clone OVARC1001000; SH3,HS1_rep,none; 3.18 
425836; AW955696; Hs.90960; ESTs; Cbl_N,Cbl_N2,Cbl_N3,UBA,zf-C3HC4,none; 3.18

403335; ; ; NM_021815:Homo sapiens solute carrier family 5 (choline transporter), member 7 (SLC5A7), mRNA.; SSF;TM=Y;SS=M; 3.17 
428788; AF082283; Hs.193516; B-cell CLL/lymphoma 10; CARD;TM=M;; 3.17

429558; AI391454; Hs.207251; nucleolar autoantigen (55kD) similar to rat synaptonemal complex protein; none;SS=M; 3.17
440248; AA876138; Hs.153136; ESTs; SH2,none; 3.17 

423706; U95218; Hs.131924; G protein-coupled receptor 65; 7tm_1;TM=Y;SS=M; 3.17
429752; H52348; Hs.36636; ESTs; pkinase,pkinase; 3.17

446163; AA026880; Hs.25252; Homo sapiens cDNA FLJ13603 fis, clone PLACE1010270; none;NA;NA; 3.17 

456773; AI038192; Hs.129764; EGF-like repeats and discoidin I-like domains 3; myosin_head,IQ,MyTH4,EGF,F5_F8_type_C,Band_41;TM=M;; 3.17
434392; AW983709; Hs.250824; Homo sapiens cDNA: FLJ23435 fis, clone HRC12531; pkinase,none; 3.16
435972; W95088; Hs.114198; ESTs; pkinase,OPR,none; 3.16 
441401;AIB24 3 38 H 12 891 E i ue I Tf 

410497; AL157648; Hs.157078; Homo sapiens cDNA FLJ12793 fis, clone NT2RP2002033; none,none; 3.16
401113; H25530; ; solute carrier family 22 (organic cation transporter), member 1-like; none;SS=M; 3.16
424833; NM_003894; Hs.153405; period (Drosophila) homolog 2; PAS;SS=M; 3.15

453880; AI803166; Hs.28452; ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]; HSP70,none; 3.15 

435391; AA704588; Hs.58934; ESTs; PIP5K,none; 3.15 

428065; AI634046; Hs.157313; ESTs; ICE_p20,DED,ICE_p10,ICE_p20,DED; 3.15

452688; AA721140; Hs.49930; ESTs, Weakly similar to putative p150 [H.sapiens]; SH3,none; 3.15

426839; M74782; Hs.172689; interleukin 3 receptor, alpha (low affinity); none;TM=Y;SS=M; 3.15

421247; BE391727; Hs.102910; general transcription factor IIH, polypeptide 4 (52kD subunit); none;TM=M;; 3.14

440249; AI246590; Hs.249175; ESTs; TatD_DNase,pkinase,death,none; 3.14

409519; AK001015; Hs.55220; BCL2-associated athanogene 2; BAG;TM=M;; 3.13

446135; AW130288; Hs.170318; hypothetical protein FLJ10147; hormone_rec,zf-C4;SS=M; 3.13

400440; X83957; Hs.83870; nebulin; SH3,Nebulin;; 3.12 

409099; AK000725; Hs.50579; hypothetical protein FLJ20718; Armadillo_seg;TM=M;; 3.12
434237; AF119908; Hs.235516; hypothetical protein PRO2955; none;SS=M; 3.12

428179 Al 127772 H 79696; ESI Weakly similai lo I33I > hyp. ie i! i n [r apers] pkinase,PX,pkinase_C;SS--M; 3.12 
422824; NM_012108; Hs.121128; BCR downstream signaling 1; SH2,PH;TM=M;; 3.11

409745; AA077391; ; gb:7B14E12 Chromosome 7 Fetal Brain cDNA Library Homo sapiens cDNA clone 7B14E12, mRNA sequence; 7tm_1,zf-C3HC4,fn3,SPRY,KRAB,zf-C2H2,rve,zf-B_box;TM=Y;SS=M; 3.11
435411; AW444619; Hs.138211; ESTs; none,pkinase; 3.11
424852; AI222779; Hs.144848; ESTs; adenylatekinase,SH2,pkinase,none; 3.11
441970; AW959918; Hs.155160; ESTs; rrm,zf-C2H2; 3.11 

453370; AI470523; Hs.139336; ATP-binding cassette, sub-family C (CFTR/MRP), member 4; ABC_tran,ABC_membrane;TM=Y;; 3.11
413285; BE078405; ; gb:QV2-BT0617-080300-071-g03 BT0617 Homo sapiens cDNA, mRNA sequence; GCV_T;SS=M; 3.10
429458; BE161832; Hs.292689; ESTs; pkinase,bZIP,Armadillo_seg,none; 3.10 

401185; ; ; NM_021625:Homo sapiens vanilloid receptor-related osmotically activated channel; OTRPC4 protein (OTRPC4), mRNA.; ank,ion_trans;TM=Y;; 3.1

404537; Z25884; ; chloride channel 1, skeletal muscle (Thomsen disease, autosomal dominant); none;TM=Y;; 3.10
417089; H52280; Hs.18612; Homo sapiens cDNA: FLJ21909 fis, clone HEP03834; voltage_CLC,CBS,none; 3.09
450792; AA400323; Hs.183041; ESTs; none,ABC_tran; 3.09
420361; N92054; Hs.194718; zinc finger protein 265; zf-RanBP,7tm_1; 3.09
444040; AF204231; Hs.182982; golgin-67; SH3,C2,PH,RhoGEF,efhand;TM=M;; 3.09
416 1 r F b i I receptor; zM I r TM 

442215; AI703172; Hs.129005; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; none,none; 3.09
5 424187; AA336561 Hs.1 t E T eok!> nmila tc S26689 hypothetical protein hd - mouse [M.musculus]; IRK,none; 3.09 
426623; AA382826; Hs.132793; ESTs; none;TM=M;; 3.08
419577; L36531; Hs.91296; integrin, alpha 8; integrin_A,FG-GAP;TM=Y;; 3.08
426618; AL036456; Hs.171374; smg GDS-ASSOCIATED PROTEIN; Armadillo_seg;TM=M;; 3.08

445133- AW157645- Hs 153505; ESTs; eniar,d,spectrin,GAS2,SH3Plectin,RA,XyloseJsom 1 RiD,bZIPJropomyosin,Myc-LZ,M,ldh_C 1 CH,AIP3;TM-M;; 3.03 
10 423b81 AB023 r! 1)i Hon 1 1 n mPtIA; cDNA DKFZp434E199 (from clone DKFZp434E199); partial cds; TTL;TM=M;; 3.08 
428730, AA62504- ri, . EST HECT,none;3.08 

427976; AW977808; Hs.80545; mitogen-activated protein kinase 3 interacting protein 2; Ribosomal_L37e,pkinase; 3.08
412448; L12964; Hs.73895; tumor necrosis factor receptor superfamily, member 9; TNFR_c6;TM=Y;SS=M; 3.03 
416814 AM 1 r s i jl J P h hi linn fer e Alg6_Alg8,7tm_1;TM=Y;SS=M; 3.08 

427395; AW298741; Hs.97861; ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens]; none,aldedh,aakinase; 3.08
436267; AW450938; Hs.180115; ESTs; none,PFK; 3.07

422309; U79745; Hs.114924; solute carrier family 16 (monocarboxylic acid transporters), member 6; sugar_tr;TM=Y;SS=M; 3.07
439238; N47305; Hs.46668; ESTs; 7tm_1;TM=Y;SS=M; 3.07

458760; AI498631; Hs.111334; ferritin, light polypeptide; cystatin,ferritin,histone,HCO3_cotransp,SH3,RhoGAP,xan_ur_permease,FCH;SS=M; 3.07
424236; AW058114; Hs.7837; phosphoprotein regulated by mitogenic pathways; pkinase;TM=M;; 3.06

427286; AW732802; Hs.2132; epidermal growth factor receptor pathway substrate 8; SH3,TonB_boxC;TM=M;; 3.06

423878; AI907090; Hs.52891; hypothetical protein PRO1853; cystatin,ferritin,histone,HCO3_cotransp,SH3,RhoGAP,xan_ur_permease,FCH;SS=M; 3.06
1 2 0 i JM 00 I u r Epl 1 ii v l ' "3 MFPH Id I" It M 3.05 

450407; NM_000810; Hs.24969; gamma-aminobutyric acid (GABA) A receptor, alpha 5; Neur_chan_LBD,Neur_chan_memb;TM=Y;; 3.06
456249; AI2U61 14; Hs.82508; HRIHFB2206 protein; none;SS=M; 3.06

441560; F13386; Hs.7888; Homo sapiens clone 23736 mRNA sequence; pkinase,Recep_L_domain,Furin-like,YLP,none; 3.05 
446488; ABC37782 H 15119 1361 p I pkii eSSM305 

447495; AW401864; Hs.18720; programmed cell death 8 (apoptosis-inducing factor); pyr_redox;TM=M;; 3.05
425390; AI092634; Hs.156114; protein tyrosine phosphatase, non-receptor type substrate 1; ig;TM=Y;SS=M; 3.04
409705; M37762; Hs.56023; brain-derived neurotrophic factor; NGF;SS=M; 3.04
413962; AA331563; Hs.24678; sphingosine-1-phosphatase; PAP2;TM=Y;; 3.04

426578; R23027; ; gb:yh27e07.r1 Soares placenta Nb2HP Homo sapiens cDNA clone 5', mRNA sequence; pkinase,none; 3.04
438005; BE151746; ; gb:PM1-HT0305-061299-003-a06 HT0305 Homo sapiens cDNA, mRNA sequence; pkinase,UBA,KA1,none; 3.04
438316; AA789249; Hs.80042; gb:aj27g08.s1 Soares_testis_NHT Homo sapiens cDNA clone 1391582 3', mRNA sequence; none,none; 3.04
35 4 r 2B c 0 J _J 1 EST M derately-imlar to A46010 X-linked retinopathy protein [H.sapiens]; CBS,voltage_CLC,none; 3.03 

405266; ; ; Target Exon; arf,G-alpha;SS=M; 3.03

402615; ; ; C1003844*:gi|6912550|ref|NP_036483.1| olfactory receptor, family 10, subfamily J, member 1; none;TM=Y;SS=M; 3.03 
422803; W28669; Hs.139041; ESTs; transmembrane4,none; 3.02

439325; AF086139; Hs.150423; cyclin-dependent kinase 9 (CDC2-related kinase); pkinase,Mur_ligase,Mur_ligase_C; 3.02
416389; AA180072; Hs.149846; integrin, beta 5; integrin_B,none; 3.02

418836; AI655499; Hs. 161712 ES t j kina Activin p,F Zl deal 

438996; AW748336; Hs.110613; KIAA0421 protein; none;TM=M;; 3.02

422676; D28481; Hs.1570; histamine receptor H1; 7tm_1;TM=Y;SS=M; 3.02

450267; AW505538; Hs.243620; ESTs; pkinase,none; 3.01
400555; ; ; Target Exon; none;TM=Y;; 3.01

«reiftAW50U.n"CiV ar ^i^e-promoting complex 1; meiotlccheckpoip.l regulaic; pi-PLC-X,C2,SH2,PH,SH3,PI.PLC-Y,PAN,none;3.01 

429673; AA884407; Hs.211595; protein tyrosine phosphatase, non-receptor type 13 (APO-1/CD95 (Fas)-associated phosphatase); Y_phosphatase,Band_41,PDZ;SS=M; 3.01

417067 AHOUI H 81 " m Mi i euronal monoamine transporter), member 3; sugar Jr;TM=Y;SS=M; 3.00 

403212; ; ; NM_019595:Homo sapiens intersectin 2 (ITSN2), mRNA. (CHRNA9), mRNA.; SH3,efhand,C2,PH,RhoGEF;TM=M;; 3.00
50 410141 R>"""j H 17557 Homo sapienscDNA: FU21291 fs, clone COL01963; F5_F8_type_C,pkinase,EU,none; 3.00 

421059; AI654133; Hs.30212; thyroid receptor interacting protein 15; none,none; 3.00

452335; AW188944; Hs.61272; ESTs; none,IRK; 2.99

437644; AA748575; Hs.136748; lectin-like NK cell receptor; lectin_c;TM=Y;SS=M; 2.99 
435876; AW612586; Hs.160271; G protein-coupled receptor 48; 7tm_1,LRR,LRRNT;TM=Y;SS=M; 2.99
429177; AA447527; Hs.207429; ESTs; 7tm_1,none; 2.99

449289; BE466067; Hs.225660; ESTs; 3Beta_HSD,pkinase; 2.99 

454701; AW854930; ; gb:PMO-CT0253-201099-003-f06 CT0263 Homo sapiens cDNA, mRNA sequence; SH2,STAT,STAT_bind,STAT_prot,none; 2.99 
409995; AW960597; Hs.129206; ESTs; pkinase,none; 2.98
446860; AV660685; Hs.282953; ESTs; none,PP2C; 2.98
438684; AA830105; Hs.194976; SH2 domain-containing phosphatase anchor protein 1; ig;TM=Y;SS=M; 2.98
434164; AW207019; Hs.148135; serine/threonine kinase 33; pkinase;TM=M;; 2.98

403290; ; ; C10001011*:gi|4758212|ref|NP_004411.1| dual specificity phosphatase 8 [Homo sapiens] gi|601; none;TM=M;; 2.97
433556 \A5 3^1 H 11 } i a i i n i e d i tina^e CaM kinase) II delta; pkinase.none; 2.97 
421990; T31811; Hs.110480; DC12 protein; GKAP,DUF159;TM=M;; 2.97
428315; AA688152; Hs.98505; ESTs; pkinase,none; 2.97

411140; AW819453; ; gb:RC5-ST0293-061299-031-C07 ST0293 Homo sapiens cDNA, mRNA sequence; Choline_kinase,Carn_acyltransf,Sulfatase,Carn_acyltransf; 2.97
453998; H47802; Hs.755; FK506-binding protein 5; none,none; 2.97
401342; ; ; Target Exon; none,none; 2.97

453020; AL162039; Hs.31422; Homo sapiens mRNA; cDNA DKFZp434M229 (from clone DKFZp434M229); dNK,none; 2.96 
410976; R36207; Hs.25092; hypothetical protein MGC10744; none;TM=M;SS=M; 2.96

431074 BE372?" V I532 i E r T^ Mod ill , iihr to A16010 X-linked retinopathy protein [H.sapiens]; none.serpin; 2.96 

443829; AI087954; Hs.23348; S-phase kinase-associated protein 2 (p45); F-box,none; 2.96 

400356; AF227137; ; taste receptor, type 2, member 13; none;TM=Y;SS=M; 2.95

422559; AW247696; Hs.155839; hypothetical protein MGC12934; adh_zinc,PGK,Semialdhyde_dh;SS=M; 2.95
423482; BE280172; Hs.129228; galactokinase 2; GHMP_kinases;TM=M;; 2.95

438330; AW450572; Hs.257316; ESTs; pkinase,zf-C4,ERM,CNH,none; 2.95

414581; AA2S6213; Hs.72010; ESTs; none,Carn_acyltransf,Choline_kinase,SCO1-SenC,Glycos_transf_3,Glycos_trans_3N; 2.95
453058; AW612293; Hs.288684; Homo sapiens cDNA FLJ11750 fis, clone HEMBA1005558; SH2,SH3,C2,PH,RasGAP,none; 2.95
430556; AW9b bCr F- E ^ETnore2 94 
400471; ; ; Target Exon; none;TM=M;; 2.94

419459; AW291128; Hs.278422; DKFZP586G1122 protein; Metallophos,7tm_1; 2.94
107013; U35637; ; gb:Human nebulin mRNA, partial cds; SH3,Nebulin; 2.94
421476; AW953805; Hs.21887; ESTs; Piwi,PAZ,Piwi; 2.94
426806; T19228; Hs.172572; hypothetical protein FLJ20093; ank,pkinase,UPF0073;SS=M; 2.94

405588; ; ; NM_000299*:Homo sapiens plakophilin 1 (ectodermal dysplasia/skin fragility syndrome) (PKP1), mRNA.; Armadillo_seg;TM=M;; 2.94

443614; AV655386; Hs.7645; fibrinogen, B beta polypeptide; none,none; 2.94

416737; AF154335; Hs.79591; LIM domain protein; LIM,PDZ;TM=M;; 2.93
428522; R10184; Hs.191987; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; none,ArfGap,PH,TNFR_c6; 2.93
447818; W79940; Hs.21906; Homo sapiens clone 24670 mRNA sequence; none,pkinase; 2.93
432925; AA878324; ; ESTs; none,none; 2.93

443670; AW178935; Hs.238707; ESTs; RmaAD,DENN,dDENN,uDENN;TM=M;; 2.93 

447555; AI391662; Hs.160963; Homo sapiens, clone MGC:12318, mRNA, complete cds; none;TM=M;; 2.93

435092;AL137310;Hs.4749;HomosapiensmRNA;cDNADKFZp76IE' riirriHl m F MM n ilcd r,one;TM=f 93 

417670, R07785, , gb yt1 c -nr ,i , aresl lliversp! n INI L L mo sop is 1' l c s E similar to contains Atu repetitive element;contains MSR1 repetitive element ;, mRNA 

sequence; XYPPX,ABC_membrane,ABC_tran; 2.93
424148; BE242274; Hs.1741; integrin, beta 7; integrin_B,EGF,metalthio,PSI;TM=Y;SS=M; 2.92

439090; H65724; Hs.347158; gb:yr75a11.r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 5', mRNA sequence; pkinase,none; 2.92
408048; NM_007203; Hs.42322; A kinase (PRKA) anchor protein 2; Paralemmin;TM=M;; 2.92

428796; AU076734; Hs.193665; solute carrier family 28 (sodium-coupled nucleoside transporter), member 2; Nucleoside_tra2,BPD_transp_2;TM=Y;; 2.92

415272; AA164215; Hs.203186; ESTs; none,Exo_endo_phos,BNR,Atrophin-1,B56,pkinase,ig,TPR; 2.92

424775;AB014540;Hs.153026;SWAP-70prolein;efhand,PMleu gulinJM I. 2.92 

439569; AW60216B; Hs.222399; CEGP1 protein; EGF,TNFR_c6,granulin,CUB,Keratin_B2,TIL;TM=M;SS=M; 2.92

441680; AW444598; Hs.7940; RAP1, GTP-GDP dissociation stimulator 1; Armadillo_seg;TM=M;; 2.91

444784; D12485; Hs.11951; ectonucleotide pyrophosphatase/phosphodiesterase 1; Somatomedin_B,Endonuclease,Phosphodiest;TM=Y;SS=M; 2.91
400398, AF1 37396 h 338 ubiquili m „iiHYSS-M291 
435592. AI83D490 II 14- V, rol kin e; FGG FG Ji _C TM M ; 2.90 
400539; ; ; Target Exon; none;TM=M;; 2.90 

403743; ; ; C1002604 gi|8393668|ref|NP_05893d 1 1 r il * r * th leukemic aiio lated gene (st; none;TM=M;; 2.90 

418913; BE046745; Hs.91579; Homo sapiens clone 23783 mRNA sequence; Y_phosphatase,IMP4,none; 2.90 

428169; AI928984; Hs.182793; golgi phosphoprotein 2; photoRC,UPF0118;TM=Y;; 2.90

403912; ; ; C5000394*:gi|12737280|ref|XP_006682.2| keratin 18 [Homo sapiens] gi|6633; none;TM=M;; 2.89

431868; BE246400; Hs.285176; acetyl-Coenzyme A transporter; none;TM=Y;; 2.89 

421558; AB011125; Hs.105749; KIAA0553 protein; none;TM=M;; 2.89 

444100; AA3B3343; Hs.22116; CDC14 (cell division cycle 14, S. cerevisiae) homolog B; Y_phosphatase,DSPc;TM=M;; 2.89 

447437; U07225; Hs.339; purinergic receptor P2Y, G-protein coupled, 2; 7tm_1,SH2;TM=Y;SS=M; 2.89

431512; BE270734; Hs.2795; lactate dehydrogenase A; ldh,ldh_C,SH3,pkinase,UBA;TM=M;; 2.89

446601; AI312783; Hs.155772; Homo sapiens thymic stromal co-transporter mRNA, complete cds; sugar_tr;TM=Y;; 2.89

420747; BE294407; Hs.99910; phosphofructokinase, platelet; PFK;TM=M;; 2.88 

449459; BE546846; Hs.195048; ESTs; ras,PH,ArfGap,HCO3_cotransp; 2.88

405099; ; ; Target Exon; C2,PI-PLC-Y,PI-PLC-X;TM=M;; 2.88 

445890; AF055019; Hs.21906; Homo sapiens clone 24670 mRNA sequence; pkinase,pkinase; 2.88

401445; ; ; NM_021161*:Homo sapiens potassium channel, subfamily K, member 10 (KCNK10), mRNA.; ion_trans;TM=Y;SS=M; 2.87

105 i rarqet Exon; no none; 2.8 

400189; ; ; Eos Control; LRR,PPTA;TM=M;; 2.87

450125; AA005418; Hs.158186; ESTs; CIDE-N,7tm_1,none; 2.87

432056; AB040973; Hs.272385; G protein-coupled receptor 72; 7tm_1;TM=Y;SS=M; 2.86

423619; T40691; Hs.249159; adrenergic, alpha-2A-, receptor; 7tm_1,7tm_2;TM=Y;SS=M; 2.86

417381; AF164142; Hs.82042; solute carrier family 23 (nucleobase transporters), member 1; xan_ur_permease,RA; 2.03

) 187908; E 1111 h ^Inrpr^ur or [H.sapiens]; HATPase.c,MOZ.SAS,zf.C2H2; 2.86 

425480; AB023198; Hs.158135; KIAA0981 protein; PIP5K;SS=M; 2.86 

446700; AW205257; Hs.156325; Human DNA sequence from clone RP11-145L22 on chromosome 6p21.32-22.2. Contains the gene for myelin/oligodendrocyte glycoprotein MOG, (part of) the gene for a novel KRAB box containing C2H2 type zinc finger protein, ESTs, STSs, GSSs and a putative CpG; none;TM=M;; 2.86
444595; AL121094; Hs.83572; hypothetical protein MGC14433; Y_phosphatase,SH2,Y_phosphatase,SH2; 2.85
411331; AW837178; ; gb:QV1-LT0037-070300-100-d11 LT0037 Homo sapiens cDNA, mRNA sequence; SH2,none; 2.85
410763; AF279145; Hs.8966; hypothetical protein FLJ21776; none,none; 2.85
440517; AA894880; Hs.181181; ESTs; none,none; 2.85
454071; AI041793; Hs.42502; ESTs; 7tm_1,none; 2.85 

411040; AF007393; Hs.177574; protein-kinase, interferon-inducible double stranded RNA dependent inhibitor, repressor of (P58 repressor); HLH;TM=M;; 2.85
402183; ; ; NM_004491*:Homo sapiens glucocorticoid receptor DNA binding factor 1 (GRLF1), mRNA.; none;SS=M; 2.85
428753; AW939252; Hs.192927; hypothetical protein FLJ20251; none;TM=M;; 2.84
417070 Z1907/ H- I 'Of ill n SGXXSt P kinase;TM=M 284 
458456; AI122709; Hs.153609; ESTs; bZIP,Armadillo_seg,rrm,NTF2,none; 2.84

421226; AL096748; Hs.1 02708; DKFZP434A043 protein Amiadi lu r - r MG TM=M;SS=M; 2.84 

436733; BE327477; Hs.166941; ESTs; 7tm_3,oxidored_q5_N,Presenilin,Piwi; 2.84

427161; AI024595; Hs.97508; a disintegrin and metalloproteinase domain 5; ig;TM=Y;SS=M; 2.84

419462; AF071076; Hs.112255; nucleoporin 98kD; DEAD,helicase_C,Nucleoporin_FG,homeobox;SS=M; 2.83

413658; AA055369; Hs.75456; A kinase (PRKA) anchor protein 10; none,none; 2.83

400749; ; ; NM_003105*:Homo sapiens sortilin-related receptor, L(DLR class) A repeats containing (SORL1), mRNA.; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 2.83
447388; AW630534; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; TB2_DP1_HVA22;TM=Y;SS=M; 2.83
413243; AA769265; Hs.193657; ESTs; pkinase,zf-C4,ERM,CNH,none; 2.83

423690; AA329648; Hs.23804; ESTs, Weakly similar to PN0099 son3 protein [H.sapiens]; ion_trans,IQ,none; 2.82

447993, AW1 395^5 u 170 E r one none, 2 82 

423061; AI290473; Hs.44807; ESTs; integrin_B,Sema,PSI,TIG,none; 2.82 

440619; AW408586; Hs.91052; ESTs, Moderately similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; abhydrolase_2,none; 2.82
423497; U92642; Hs.129701; G protein-coupled receptor 45; 7tm_1;TM=Y;SS=M; 2.81 
446126;AW085909;Hs.10177;pleckstrinhomologydomainintera<li i i leii none.nor "I 
452488; N74921; Hs.184385; ESTs; none;TM=M;; 2.80
449515; AI653378; Hs.302012; ESTs; ion_trans;TM=Y;SS=M; 2.79 
443881; R64512; Hs.237146; hypothetical protein FLJ12752; none,none; 2.79

449636; AI656608; Hs.281328; ESTs, Weakly similar to T00378 KIAA0641 protein [H.sapiens]; pkinase,hormone3;TM=Y;SS=M; 2.78 

424348; AB020523; Hs.266258; endonuclease G-like 1; Endonuclease;TM=M;SS=M; 2.78

418844; M62982; Hs.1200; arachidonate 12-lipoxygenase; lipoxygenase,PLAT;TM=M;; 2.78

442233; AW967149; Hs.28439; ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]; MIF,sugar_tr,none; 2.78






450010; AW293801; Hs.255052; ESTs; ARID,7tm_1; 2.78

452813; U54727; Hs.191445; ESTs; pkinase,Activin_recp,none; 2.78

418177; N44967; ; ESTs; pkinase,none; 2.78

403014; AA723782. '1 3; protein ki c 5MF f ^ <* t/F I M k I ™ a.pkina eSS=M 2" 
35 17 H in hypol i! in DKF r II la HIAT1 r.tr.TM V, 

423994; X01057; Hs.1724; interleukin 2 receptor, alpha; sushi;TM=Y;SS=M; 2.77

427342; AL110150; Hs.176680; Homo sapiens mRNA; cDNA DKFZp586D0724 (from clone DKFZp586D0724); none;NA;NA; 2.76 
447574 AF1 2666 H I tousled-like ki 1 ^ r ^TM M 2 76 

442681; AI809182, Hs I3i t ESTs; I bketclase,E1.dehydrog,transket_pyr,lransketolase_C,pkinase; 2.75 
433637, AW024214 H i . if L r Ha_sulph_symp,aa_ i s M-Y SS=M; 2.75 
458997; AW937420; Hs.69662; ESTs; SH3,RhoGAP,FCH;TM=M;; 2.75 
432284; AA532807; Hs.105822; ESTs; pkinase,none; 2.74

406139; ; ; Target Exon; ig,Tub;TM=Y;SS=M; 2.74

439518; W76326; ; gb:zd60d04.r1 Soares_fetal_heart_NbHH19W Homo sapiens cDNA clone 5' similar to contains Alu repetitive element;, mRNA sequence; Armadillo_seg,none;



402129; ; ; Target Exon; SH2,Peptidase_C9;TM=M;; 2.73 
424238 AA3374C1 H T i~ e TM M SS-M 2 73 

433834; AA620742; Hs.130786; ESTs; SPX,EXS;TM=Y;; 2.73 

409339; AB020686; Hs.54037; ectonucleotide pyrophosphatase/phosphodiesterase 4 (putative function); Sulfatase,Phosphodiest;TM=M;SS=M; 2.73

408163.AW779B42 H 258217 ESTs 7ti l.zf-E )ox.zf-C3HC4,7tm 1,zf-B b ■ f-C3HC4 273 

422358; AL133030; Hs.115429; Homo sapiens mRNA for KIAA1666 protein, partial cds; SH3;TM=M;; 2.73

426409; AA594207; ; gb:nn29e01.s1 NCI_CGAP_Gas1 Homo sapiens cDNA clone 3', mRNA sequence; pkinase,Fibrillarin,none; 2.72

400645; ; ; Target Exon; lig_chan,SBP_bac_3,ANF_receptor,none; 2.72 

443661; AA336609; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; adenylatekinase,none; 2.71
442572; AI001922; Hs.135121; hypothetical protein FLJ22415; none,HSP70; 2.71

j • J £ i H 3250 bone morphogenetic protein receptor, type It (BMPR2); pkinase,Activm_recp;TM=M;SS=M; 2.71 

459357; AWail^lT?' gb" L^CTO 21 4-1 50200-075-B1 1 CT0214 Homo sapiens cDNA, mRNA sequence; ABC_tran,ABC_membrane,ion_trans; 2.70 

439935; S75105; Hs.8358; glutamate receptor, ionotropic, kainate 2; ANF_receptor,lig_chan,none; 2.70

414924; C06267; Hs.44247; ESTs; none,none; 2.69

421008; BE259378; Hs.103147; hypothetical protein FLJ21347; DUF255;; 2.69

449951; AA004982; Hs.120904; ESTs; DED,Calsequestrin; 2.69

411226; AW833022; ; gb:RC3-TT0005-191099-012-d04TT0335 Hcmo sapiens cDNA, mRNA sequence; pkinase.none, 2.o3 

417625 U59305 Hs 71 jer-Thrpr in kinase related to the myotonic dystrophy protein kinase; pkinase,bZIP,G-gamma,K-box,pkinase_C;SS=M; 2.68 
408051; AI623351; Hs.172148; ESTs; PH,RhoGAP,none; 2.68 

1 Hs.29402 et ein FLJ1495C r Hz TM M 2 68 

413922; AI535895; Hs.221024; ESTs; ion_trans,RYDR_ITPR,MIR,UDPGT; 2.68

2188;/ Hs.2 t t rr f r (cationican i t po ' " ' 1 ' 1 ^ 

, Hi, ]b :H i I r 1 i cDNA Homo sapiens cDNA clone c-2wf0 r FN rence; ion_trans,none; 2.67 

419749;X73608;Hs.93029;sparc/osteonectin,cwcvandkazal-lil<edomalnsproteoGlycan(testiLi kaz Ik ruiiiL in 1 SS-M 2 66 

, 6I V014 Hs.221 1 Ts, Weakly similar to I38022 hypothet protein [H.sapiens]; i <H(^ Cbl_N,CbLN2 1 Cbl_N3,none; 2.66 
403609; ;;C3001199:gi|7494834|pir||T! 31 iy, II, temB02G62 Cccnorhabditis -egans||41; 7tni_1.7tm_2,GPS,WIF;TM=Y;SS=M; 2.66 

158213 1047521; Hs.12210;hy| thetical protein FU13732 similar to tensin; pkina none; 2.6E 

4261 58' NM_001982; Hs.199067; v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 3; Funn-like,pkinase,Recep_L_domam,Furin- 

like,pkinase,Recep_L_d jiiidin P d M2 
435410; AL135067; Hs.117182; ESTs; none,pkinase,RBD,DAG_PE-bind; 2.66

437838; AI307229; Hs.184304; ESTs; CARD,ICE_p20,ICE_p10,HIT,voltage_CLC,CBS,HCCA_isomerase; 2.66
430293; AI416988; Hs.238272; inositol 1,4,5-triphosphate receptor, type 2; ion_trans,RYDR_ITPR,MIR,none; 2.65
433090; AI720050; ; immortalization-upregulated protein; none;SS=M; 2.65 

432103; T15803; Hs.272458; protein phosphatase 3 (formerly 2B), catalytic subunit, alpha isoform (calcineurin A alpha); Metallophos;TM=M;; 2.65
435852 H72333 H.lilll [ T r p li i.r.oi '4 
433327; AI674779; Hs.126744; ESTs; none,7tm_1; 2.64

438459; T49300; Hs.35304; HomosapienscDNAFU13655fis, clone PLACE101 i r i FMi Ik' 2^ 
432251; AW972983; Hs.232165; polycythemia rubra vera 1; cell surface receptor; none;TM=M;SS=M; 2.63 
446963; AI862668; Hs.176333; ESTs; OMPdecase,Pribosyltran,pkinase,RhoGEF,PH; 2.63
444821; AA053564; Hs.12040; STE20-like kinase; pkinase;TM=M;; 2.63
436206; AK001451; ; CD2-associated protein; none,none; 2.63

434370; AF130988; Hs.58346; ectodysplasin 1, anhidrotic receptor; death,Kunitz_BPTI;TM=Y;SS=M; 2.63

439039; AI656707; Hs.48713; ESTs; pkinase,none; 2.63 

449656; AA002008; Hs.188633; ESTs; PIP5K,none; 2.63 

429341; X73874; Hs.2393; phosphorylase kinase, alpha 1 (muscle); none;TM=M;; 2.62 

445174; AV652850; Hs.172004; titin; fn3,ig,SGXXSG,none; 2.62

424950; AA602917; Hs.156974; ESTs; none,CDP-OH_P_transf; 2.62

438141; AW946871; ; gb:RC2-ET0022-080500-012-d02 ET0022 Homo sapiens cDNA, mRNA sequence; SH2,STAT,STAT_bind,STAT_prot,none; 2.61
434938; AW500718; Hs.8115; Homo sapiens, clone MGC:16169, mRNA, complete cds; pkinase,TBC,Rhodanese;TM=M;; 2.61
409264;NM_014937;Hs.52463;KIAA0966proteii,,H;-i N;TM-M;; 2.6C 

458438; AI141520; Hs.151454; ESTs, Weakly similar to ALUC_HUMAN !!!! ALU CLASS C WARNING ENTRY !!! [H.sapiens]; pkinase,none; 2.60
400719; ; ; NM_004055*:Homo sapiens calpam 5 (C Hi mRN r UN NM.004335.2 Gl; C2,Peptidase.C2,CalpainJII;TM=M;; 2.60 
427318 AF136081 rl 17 Ir 1 Zip;TM SS=M; 2.59 

426086; T94907; Hs.188572; ESTs; PH,Ets,CH,spectrin,Ca_channel_B,none; 2.59

43013 \70>9 lls.2540;c t polypeotid Neur_chan.LBD,Neur.chan.memb,pkinase;TM=Y;SS=M; 2.58 

411495; AP000693; Hs.70359; KIAA0136 protein; HATPase_c,bZIP;TM=M;; 2.58

438167; R28363; Hs.24285; ESTs; none;TM=Y;SS=M; 2.58 

418749; N75147; Hs.22488; ESTs; none,zf-C2H2,KRAB,pkinase; 2.58 

454289, AL1375SJ H )9 prol in kinase NYD-SP15; dCMP_cyLdeam;TM=M;; 2.53 

443605; H06865; Hs.134131; ESTs; efhand,ion_trans,none; 2.57 

429429; AA829725; Hs.334437; hypothetical protein MGC4248; none,transmembrane4; 2.57 

403088; ; ; NM_003319*:Homo sapiens titin (TTN), mRNA.; fn3,ig,SGXXSG;TM=M;; 2.57

409190; AU076536; Hs.50984; sarcoma amplified sequence; transmembrane4;TM=Y;SS=M; 2.57 

4266S6; AW3S3332; Hs.171044 Hn ipien Ut FLJ22296fis, clone HRC04468; ig;TM=Y;SS=M; 2.56 

403328; ; ; Target Exon; Glyco_hydro_35;TM=M;; 2.56
426167; AF039023; Hs.167496; RAN binding protein 6; Armadillo_seg,HEAT_PBS;; 2.56
428695; AI355647; Hs.139999; purinergic receptor (family A group 5); 7tm_1;TM=Y;SS=M; 2.54
419285; D31887; Hs.89868; KIAA0062 protein; Zip;TM=Y;SS=M; 2.54 

415740; N80486; Hs.39911; Homo sapiens mRNA for FLJ00039 protein, partial cds; CBM_21;TM=M;; 2.53

403305; NM_006825; ; transmembrane protein (63kD), endoplasmic reticulum/Golgi intermediate compartment; pkinase;TM=Y;SS=M; 2.53

443804; AL135352; Hs.255883; ESTs, Wealiyim n d I38022 Ihelicatpr i |H ap ens]; Peptidase.M18,Peptidase_M18,Y.phosphatase; 2.53 

450425; H06607; Hs.6099; ESTs; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase,none; 2.51

401702; ;; NMJ01 171*:Homo sapien Alt- te.sub-f TRJI Ler (ABCC6), mRNA.; ABC_tran,ABC_membrane;TM=Y;SS=M; 2.50 

439463; W69304 qr _ 4 I, I n fetal_hearLNbHH19W Homo sapiens cDNA clone 5', mRNA sequence; fn3.Y_phosphatase,none; 2.50 
i; ' ' »•; r- I"-"' ■■' i ■ ■■ .. x/> i JM=Y::2.50 

443259 AW090601 H t T1 a pt I t R1,none; 2.50 

400777; ; ; NM_007325*:Homo sapiens glutamate receptor, ionotropic, AMPA 3 (GRIA3), transcript variant flip, mRNA.; lig_chan,SBP_bac_3,ANF_receptor;TM=M;SS=Y; 2.49

426044; AA502490; Hs.170290; ESTs; none,none; 2.48

454564; AW807573; ; gb:MR1-ST0088-021299-Q04-g01 ST0088 Homo sapiens cDNA, mRNA sequence; pkinase,none; 2.48
415938; BE383507; Hs.78921; A kinase (PRKA) anchor protein 1; KH-domain,TUDOR;TM=M;SS=M; 2.47 

426481; AW963941; ; gb:EST376014 MAGE resequences, MAGH Homo sapiens cDNA, mRNA sequence; Y_phosphatase,Band_41,DSPc,none; 2.46
426005; AA377499; ; gb:EST90341 Synovial sarcoma Homo sapiens cDNA 5' end, mRNA sequence; tubulin,FKBP,COX6B,7tm_1,tubulin_C;SS=M; 2.46
424879 AA348013, H- 2 33 c f irf.i ,4plia,none; 2.46 
415156; X84908; Hs.78060; phosphorylase kinase, beta; none;TM=M;; 2.46 

416508; R39769; ; ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; SH3,PDZ,Guanylate_kin,ZU5,none; 2.46
408087; AW150645; ; gb:xg54f07.x1 NCI_CGAP_Ut4 Homo sapiens cDNA clone 3', mRNA sequence; XYPPX,ABC_membrane,ABC_tran; 2.46
433434 AA'88429 gb:no2 1N I C jAF Fr22 1 omo apien cCNA clone 3', mRNA sequence; pkinase,DNA_mis_repair,HATPase_c; 2.45 
446768; AV660305; Hs.110286; ESTs; ICE_p20,DED,ICE_p10,ICE_p20,DED; 2.45
437158; AW090198; ; KIAA1150 protein; none;NA;NA; 2.45 
430177; AW969233; Hs.302746; MSTP028 protein; K_tetra,none; 2.45 

AF1144 1 1 r r | t i i I r i ol r ember a; none;TM 

430680; AW138724; Hs.168974; ESTs, Highly similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; Y_phosphatase,Adaptin_N,Y_phosphatase; 2.44
446569; AW248031 ; Hs.155839 hypnt J ,r ( ; rl nMGC1 4 dh 7 i c PGK,Semialdhyde_dh;SS=M; 2.44 
411902; AW875344; ; gb:RC1-PT0009-220300-013-f06 PT0009 Homo sapiens cDNA, mRNA sequence; none,pkinase,ank; 2.43
430057; AW450303; Hs.2534; bone morphogenetic protein receptor, type IA (BMPR1A) (ALK-3); Activin_recp,pkinase;TM=Y;SS=M; 2.43
446338 AI239M 2l ' E T n ne «H3 2 42 
426221; AB007881; Hs.110613; KIAA0421 protein; none,Ribosomal_S8; 2.42 
446796; AI652497; Hs.110103; RNA polymerase I transcription factor RRN3; none,none; 2.41
428360; H10291; Hs.30974; EST i * FBDmneM] 

428379; X06026 H" 2 ,r D3 an n,gamma| r phi ( 1 °" F ITV1 i'< 1 S= M 2.40 
432488; AA551010; Hs.216640; ESTs; Na_sulph_symp,none; 2.40

TO72 D20569- 1 1691 SAC2 (su| essor of aclin inutaHoi 2. ye I homolag ike rore Ribcsomal_S13,Galactosyl_T,Zip,adh_short,zf-C3HC4;2.40 
448595; AB014544; Hs.21572; KIAA0644 gene product; LRR,LRRCT;TM=Y;SS=M; 2.40

428283; AI439096; Hs.323079; Homo sapiens mRNA; cDNA DKFZp564P116 (from clone DKFZp564P116); Y_phosphatase,fn3,ig,none; 2.39

432460; H12912; Hs.274691 adenvlai I e 3; adenyl na,e 1 01 e 238 

429549; AI333013; Hs.250505; retinoic acid receptor, alpha; none,zf-C3HC4,BRCT,lig_chan; 2.38 

429303; AW137635; Hs.44238; ESTs, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens]; Phosphodiest,Somatomedin_B,Endonuclease,none; 2.36

r C1 a a & e ABC1 TM-M 235 

453186;AK001708;Hs.32271;hyootheticalproteinFLiin 16 II DUF300;TV=Y;SS=M; 2.33 
447276; AL049795; Hs. 17987; hypothetical protein MGC1 203 n r r T V I.' 1 i 
445310; AI242490; Hs.153290; Homo sapiens cDNA FLJ14318 fis, clone PLACE3000402; none,pkinase; 2.31
432942; AF083955; Hs.279852; G protein-coupled receptor; 7tm_1,globin;TM=Y;SS=M; 2.30
434693; AW976001; Hs.337603; ESTs; none,none; 2.26

452034; F12234;Hs.75893;ankyrin 3, node of Fanvrm Lynn G "IMbiih none; 2.25 

423732; AT0S80S6; Hs.132183; solute comer family 16 (monocarboxylic acid fransporters), member 7 sugarjr TM V,S 5=N 2 2 1 

404956; ; ; C1003210':gi|5912582|ref[NPJ)36524.1 1 peflin [Homo sapiens] gi|5009487|dbj|BAAB4922.1| (AB; none,PI3_PI4_kinase,PI3rCC2,PI3K_rbd,PX,PI3Ka,C2; 2.24 
452183; NM_006594; Hs.28298; adaptor-related protein complex 4, beta 1 subunit; Adaptin_N,Y_phosphatase; 2.23

420520; D25259; Hs.319844; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; pkinase,DAG_PE-bind,RBD,ras,DC1,GFP;TM=M;; 2.21

408808; BE074219; Hs.17230; hypothetical protein FLJ22087; Armadillo_seg;TM=M;SS=M; 2.21

451932; AA360954; Hs.27268; Homo sapiens cDNA: FLJ21933 fis, clone HEP04337; SH3,PH,RhoGEF;TM=M;; 2.21

432008; AW296791; Hs.193170; hypothetical protein FLJ21687; LIM,Synaptophysin,ion_trans,KOW; 2.20

455840; BE145897; ; gb:MR0-HT0208-221299-204-b07 HT0208 Homo sapiens cDNA, mRNA sequence; PI3_PI4_kinase,PI3Ka,PI3_PI4_kinase,PI3Ka; 2.19
429238; NM_002849; Hs.198288; protein tyrosine phosphatase, receptor type, R; Y_phosphatase;TM=Y;SS=M; 2.19

430975;AA490055;;gb:ab05b09.s1Stratagenefetalretina937202HomosapienscDNAclone3',rnRNA eouei.L id 1 te ma Tr mdylate_kin;TM=M;; 2.17 
407174' T79938;Hs.77062;leukOLjt'ir nui bulin f , pic ubfan /B| ith TV and ITIt main") member 5; ig,none; 2.16 
450921; AA098790 Hs 14624a E ! Moderaiel iniilai c T17242 hypothetical protein DKFZp586B1417.1 [Hsapiens]; none;NA;NA; 2.15 
427209; H06509; Hs.92423; KIAA1566 protein; pkinase;TM=M;; 2.14 

401917; AL050149; ; RAN binding protein 3; Orexin 1 SH2,STAT,STAT_bind,STAT_prot,ion_trans,PAC,PAS,none; 2.12
426359; AA375409; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; adenylatekinase,none; 2.07

439520; W76548; Hs.336621; ESTs, Moderately similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ion_trans,none; 2.06

410439; R35943; Hs.63758; transferrin receptor 2; PA;TM=Y;; 2.05 

44EE96 AI564 6 C L 1 73070; EST, Wea i hn I ZINC F NGER PROTEIN 42 (MYELOID ZINC FINGER 1) (MZF-1) [H.sapiens]; none,zf-C2H2; 2.04 

449543; AF070632; Hs.23729; Homo sapiens clone 24405 mRNA sequence; K_tetra,ion_trans,none; 2.04 

453496; AA442103 Hs 3 4 If facilitated g! fn se t arsporter), member 5; sugar_tr,TM=Y;SS=M; 2.02 

443952; AM49106; Hs.143530; ESTs; pkinase,none; 2.02

437589; AA761322; Hs.269662; ESTs; SH2,SH3,C2,PH,RasGAP,none; 2.02

422637; AA399024; Hs.118836; myoglobin; globin;TM=M;; 2.01

450253; AL133047; Hs.24715; Homo sapiens mRNA; cDNA DKFZp434D0215 (from clone DKFZp434D0215); partial cds; SH3;TM=M;; 1.97
401984; ; ; C17000146':gi|2143629|pir||A57156 Ca2+/calmodulin-dependent protein kinase (EC 2.7.1.123) I; pkinase;; 1.96 
453464; AI884911; Hs.32989; receptor (calcitonin) activity modifying protein 1; none;TM=Y;; 1.95 
41 7733; AL048678; Hs.82503; H.sapiens mRNA for 3'UTR of unknown protein; none; NA;NA; 1 Si 
411450; H4961S; Hs. 127301; ESTs; pkinase.none; 1.32 

40 0 C160CCO2>q 03|plr||T20903 hyp F i a ttl I an gi; ABC_tran,GTP.EFTU,PRK,ABC.membrane;TM=Y;; 1.80 

425009; X58288; Hs.154151; protein tyrosine phosphatase, receptor type, M; fn3,ig,Y_phosphatase,MAM;TM=Y;SS=M; 1.74
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4252B0; U31519;Hs.1872;ptiosphoenolwnivalecarboxykinase1 (soluble); PEPCKTW i M „„_.„ ., 

42595B; AW163271; Hs 3 h 39; intrac llul * ntigend led by mono il antibody Ki inl i ellul h> luron n binding protein; Y_phosphatase,DSPc;TM=M„ l.t 
432563; NM_013261; Hs.198468; peroxisome proliferative activated receptor, gamma, coactivator 1; rrm;TM=M;; 1.51 



AK054860 AV652198 AV652192 AV65213B AV6521 27 AVS521 94 BE935919 AV652017 AV651995 AV651 548 AV646063 AV651985 AV6461B4 
AV646179 AW880409 AA345002 BF1551 B9 BE068S31 X561 97 AL603Q14 AW953629 BM263546 BE550772 AA701084 AI681352 AA358689 
AW938841 BF438147 W05391 H7531 3 BF326185 AV646335 AV651 589 AV646340 AV651992 AV646384 AV646364 AV687497 BF155183 
AV646370 AW797876 AI906821 X56196 BE833835 M628440 BE833808 BF224205 AA709126 BE673807 AI923886 AA947932 AI276125 AI185720 
AW510698 AA987230 BE467708 AW898628 AW898544 Al 1 46984 AW043642 AI238245 AI1 86932 AI635262 AI139455 AI298739 AI813854 
A1024768 BE699445 BE699444 AI707807 D52654 AI214518 AI004723 AI6980B5 AW087420 AI565133 AA845571 AW898622 BF1 10144 AW513280 
AI061126 BF362770 AI268939 AI435818 BF475318 AI024767 BE174213 AA757598 AA513019 AA902959 AI860794 AI334784 BF10841 1 
BM310532 AW513771 AI951391 AI337671 BF095606 BF095601 BF09546B AW890091 BF095753 AW243400 AW898607 AW898616 BF352762 
AI9222C4 AW898625 BE699468 BE1 74196 AW1 02923 D5271 5 BE699456 D52477 D55017BF955933 BG623563 AV646254 AA463522 BI003244 
AI299190 W40186 BE174210 BF939091 BF434180 AW579001 T55662 H01811 T52522 BF945037 BF955938 D54679 D53933 R67100 BG925552 
BF999056 R83430 Z29922 T35791 W03942 H63289 AI09 1 537 BF085583 AA345570 H48870 H80720 T83523 BI039626 BI037700 R00353 
BF155184 N98343 N79072 H01812T55581 

BI0C9308 BI009893 BF922023 BF922909 BF922913 BF922096 BF957733BE701791 AA456454 AA579876 BF933710 AA091294 BI007291 
AW905577 AW975593 AA71 3730 AW836781 AA666384 AA551 106 BF594606 A1082382 AI955308 AI679895 A1679335 BF435555 AA586369 
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T32623 BG01 5679 AL518518 AL5171 18 AL538396 A1049851 AL581976 AV752041 W26586 BE181609 AI963016 BG057603 AI720256 AA844560 
AA055570BE619606 C17428AI3421" f 394 5 6 97 BR 95208 AW057940BI091399AW975179AA909936H28712W65445AL515439 
W37117 H66514T85737W37369 

W19712BE247277 

L13288 AA928785 AI608912 AW872978 AA555655 AI022915 AI304920 AI564366 AI668793 AI094557 T60038 R72302 H45409 AA508805 R45356 
AA418798 BM129553BM129126 BM129292 BM128865 AI808418 AI689932 AI806573BF431808AW872985 AW166269 H73241 T16182 AI264547 
R73391 R72085 R72840 T83751 X75299 3F754348 R94105 AW449839 R73300 NM.004624 AI797007 BE045543BF1 10021 BF754250 T83923 
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30 



413285 12794.9 

426578 358276.1 

438005 694209.2 

454701 352355.1 

411140 1071177.1 

407013 2073_7 

432925 225876J 

417670 2139687J 






1875286.1 

7504_2 

31207.1 



1974151.1 
633688_1 
194362.1 



1141058.1 
1518844.1 
56593.2 



BI030997 AA921874 AW188822 BI027862 AI347518 A1361453 AI0887E4 AW207491 AA077391 BG012775 BG997382 AA286833 AA150722 
BI007625 BI027864 BI009100 BI006275 BI006270 BI031000 BI029864 BI006277 BI007627 BI006265 BI005991 BI006990 BI007763 BI007762 
BG997377 AA150780 BI033518 BI027818 BG015789 BI033807 AA341445 



AV742719 R23027 R63874 AA381749 



BF697879 BG984482 AW854930 AW854941 AW814115 AW814431 AW814190 BF325887 BF325890 BF985536 

AW819463 AW819514 AW819617 AW819618 AW819609 

U35637 AA192323 AA194508 BG01 1583 F25712 AL595320 BE185376 

AA878324 AI619686 AI014377 

T85948 R07785T86972 

Y08200 NM.004581 BC003093 BE733834 BI753321 BG773890 BF091906 BI917541 AI023762 AA587230 BF435086 AI254262 AI687392 AI810536 
AW589886 AI244419 AA749261 AA535435 AW205689 AI765770 AI765431 C02465 AW305347 AI818456 AA322111 AW381845 AW381829 
AV749407 AA811636 AU159893 AA603065 AA652542 AI468678 R49616 AW381863 BE389867 BE182387 BF087771 AA527551 AA134051 
AA831504 AA134052 AI871759 AW089048 BI913532 AA367709 BG828155 BF093014 
AW837178 T77002F13038 

AK056654 AJ420421 AI127111 AA705921 AA749298AA776967AI343758AW070583AA766587 M804876AA460658 AA394137 W72279 
AW071467 AI343843 AA393817 AW769379 AA851873 AA715043 AW512448 AI452856 AI819873T173S4 AW779778 BF477620 AI783605 
AI624523 AA261906 AA514931 BI954124 AW576481 AI864544 AA490863 AA860972 BI963076 AI632879 AA291985 AA255873 BI906876 BI963833 
Z38970 BI495302 BI495301 AI784395 AU185472 AA652150 AA652026 D20449 BI088167 BI260S36 BE869946 AI935271 BI792882 AI762915 
AI809275 AI813351 BF447139 A1052069 AI057 1 27 AA398950 AA291 984 AA292934 AA262543 BF760287 R54455 R72980 H90786 BE698016 
AW959314 BI031449 AL574617 AA776284 AA393770 BM455517 BI602104 BI793150 N36710 H59529 BI005937 BIG00748 BF085914 BF085907 
BF835429 BF835210 BF085926 AA226136 BF836829 BF836606 BM007373 AI369807 BF085930 W251 19 BI252884 BI001270 BE549079 



AF086341 W76326W72300 

NM.003899 D63476 BM456434 AA778936 AA452871 AI052466 AW014138 AA448725 BE673088 AW028198 BI856378 BM150466 BM150674 
BM148451 AW500880 AA180228 BE243507 DM144903 AA333656 AW503767 AA305470 AW504819 AA978194 AW500776 BE872488 AI032663 
AA704686 AA652189 AA179463 AL53592S BE275744 BE277708 BE275715 AW504259 AA354483 BE244197 BE24G232 D17055 AW013876 
AW014877 T09464 T08407 AA830246 AW897881 BE501 192 BE501195 AL044534 AA258853 BI037915 AA448037 BM461769 BI825965 BE763352 
AW167531 Z45588 AV721881 AA527273 AI573219 AA457036 AW439651 AW264418 AA577618 AI802954 AA902292 AA468752 AI380374 
AA722690 AI867708 AA916982 AI29157B AW190427 AI338089 A1653744 AI306B65 AW513541 AW440077 AI370014 AA904269 AW188378 
AI671644 AW193386 AI261832 AA775336 BF43581 1 AI582703 AI278635 BE440186 AA617898 AA648948 BI491837 BF590311 AA448633 F27048 
F37022AW770819AA258808AI369C04^' 0. - 7 i r 4 BC531048 BF222087AA042973 AI868087 AA911460Z41274 AI919082 T16746 
AA447634 AI282427 F22456 T15901 AA825298 AW007436 BE934303 BG981939 AW373814 BM151B38 AW956921 BM150080 BM153173 
BM147451 BF953992 AA916696 AW444935 M78398 AW581 147 AW608258 AA651910 AA132152 AW806295 T30326 D20054 AA310837 T06543 
BM194508 BM193225 BM469348 AW964920 AA325930 BI833627 AW952193 AA738189 AA321051 BG987199 BF953967 T08890 BE869543 
BG742857 BG988685 AA456880 BG001842 BF809452 AW892083 BF944342 T49551 W69981 BF764519 T15869 AA132030 
AW964027AA377709 
AW848421 T71427 



H20760 R15237 Z43915 BF372479 F11411 

AB038318 BC008888 BE905346 BE301941 AA705936 AW014954 BE378742 AI720050 BE395327 BG951204 
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AA883540 AA169387 AW771571 AI1 30803 BF438773 AA088710 AI972591 AI972638 AI762358 AI473907 AI925905 AA502277 BG943806 
BG218468 AA194853 AU128875 AA306025 BG986896 
AA778849 AW946871 AW946782 AW946955 



AW807573AW807572 

AW963941 AA379825 AW963944 AA379564 
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AI146747 AW296894 H85337 AA017692 AA354519 AA018512 D20081 R02704 AA825671 AA017651 AL135600 R02585 AA018849 BG749616 
BF689840 R85326 AA677955 AA702354 AI076645 A1057359 H531 78 W86484 H53074 BG988909 AW962455 AA367326 AA377499 
R39769 T53143 H60012 
AW150645AW811024AW811148AW811068 



AL050068 AA160485 AW173544 AW296506 AW439860 AI521553 AI702529 AI393606 AW138323 AA570109 H19504 BM021968 BF063327 
BF593552 AA630766 AI597717 AI807128 AA523012 AI356250 AW451857 AA974203 AI762577 BF512552 AW007307 BE675285 AW450602 
AA962057 AW516069 AI582546 BF221924 BF222543 AI801808 AW468599 AW000736 AI866625 AW235356BM021837 AA911956 A1680606 
W86516 T03370 AW61 1 634 H41653 AI468349 H19588 AW090198 AW043993 R39847 

AW875344 AW875287 AW875285 AW875286 BF361 295 AW875402 AW875400 

BE145816 BE145897 BF349721 BE145885 

AK057266 BI767614 BI828586 AW069362 B1829572 AI826091 BI819382 AL040402 
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Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
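The nucleotide-position entries in this table are written as comma-separated `start-end` ranges, one range per predicted exon (for example `15049-15286,30267-30457`). The following is a minimal sketch of how such an entry might be read programmatically. The function names are hypothetical, and the sketch assumes that both coordinates are inclusive, which the table legend does not state. Entries that were cut off in the printed table cannot be detected or repaired this way.

```python
from typing import List, Tuple


def parse_exon_positions(field: str) -> List[Tuple[int, int]]:
    """Parse 'start-end,start-end,...' into a list of (start, end) tuples."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma left by a truncated cell
        start_text, end_text = part.split("-", 1)
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"exon end precedes start: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum of exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_exon_positions("15049-15286,30267-30457")
print(exons)                     # [(15049, 15286), (30267, 30457)]
print(total_exon_length(exons))  # 429
```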



Pkey 



NLposition 



401027 
400991 
406137 
404083 



400843 
406364 
405490 
400755 
404276 
402915 
405616 

402328 
405369 
400845 
403716 



405516 
405110 
403608 
401241 
405102 
404185 



404942 
403142 
400844 
402704 
402833 
401851 
401242 
401943 



402974 
400987 
403335 



9166422 
9944029 
7528051 



9188605 Plus 



5922025 
9926424 
9188605 
9256114 



9796640 Plus 
9843520 Plus 
9454624 Plus 



7280046 Minus 

7382153 Plus 

9444521 Plus 

9188605 Plus 



403912 
405099 
401445 
405480 
402183 
400749 



16650-17082 
80430-81581 
134339-134593 
15049-15286,30267-30457 



67414-68229 

89961-90114,90773-90895,91131-91261 

90281-91477 

38980-39111 

39310-39474 

199214-199579,199672-199920,200252-20049 
148042-148392 

5863-5970,7653-7784,8892-9023,9673-9807, 

50715-50833 

20683-20850 



44643-44835 

13758-13922,14558-14752 
34183-34357,35686-35751 
34428-34612 
86899-87122 

47605-47729,51696-51821,52070-52257,5330 
37761-38147 

112707-112876,113676-113854 

118940-119100 

121321-121476 

30503-30844,31056-31248 

120922-121296 

129171-129327 

118677-118807,119091-119296,121626-12182 

17503-17778,18021-18290 

44647-44778 

42410-42544,83317-83540,86840-86922,8797 



24746-24872,25035-25204 

37368-37493 

26987-27778 

'"64,147794-147971,14 




-101660,103476-103656 
141663-141852 
21399-21583 
124035-124321 
22052-22185 

112307-112524,114074-114703 
19419-19959 
177393-177691 
188775-189573 
',3337-63552 
131390-132157 
64486-64714 
156037-156210 
19288-20076 
3096-3242 
105629-105760 
46180-46366 
8559-8721 
136463-136646 

72000-72290,72431-72700,72929-73199 
114365-114514,128635-128831 
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402129 
400645 
403201 
403609 



166156-166365 

58471-58716 

109782-109934 



7387343 
9502466 
4454511 



114632-114805 

68182-68325 

70745-71121 



Table 50A lists about 414 genes up-regulated in non-seminomatous mixed germ cell testicular cancer compared to normal adult tissues. These were selected from 59680 probesets
on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" non-seminomatous mixed germ cell testicular cancer to "average" normal adult tissues was greater than
or equal to 2. The "average" non-seminomatous mixed germ cell testicular cancer level was set to the 85th percentile amongst non-seminomatous mixed germ cell testicular cancers.
The "average" normal adult tissue level was set to the 95th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific 
hybridization, the 10th percentile value amongst the non-malignant normal body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 
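The selection rule described above can be illustrated with a short calculation for a single probeset. This is a minimal sketch under stated assumptions, not the procedure actually used to build the table: the function names are hypothetical, the percentile uses linear interpolation between ranks (the text does not say which convention was used), and returning `None` when the background-corrected denominator is not positive is a choice made here for illustration.

```python
from typing import Optional, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile with linear interpolation between ranks (assumed convention)."""
    if not values:
        raise ValueError("percentile of an empty sample is undefined")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def expression_ratio(
    cancer: Sequence[float],
    normal: Sequence[float],
    cancer_pct: float = 85.0,      # "average" cancer level (Table 50A value)
    normal_pct: float = 95.0,      # "average" normal adult tissue level
    background_pct: float = 10.0,  # gene-specific background from normal tissues
) -> Optional[float]:
    """Background-corrected ratio of "average" cancer to "average" normal."""
    background = percentile(normal, background_pct)
    numerator = percentile(cancer, cancer_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        return None  # ratio undefined; this handling is an assumption
    return numerator / denominator


# Made-up expression levels for one probeset (illustrative only).
cancer_levels = [10.0 * i for i in range(21)]  # 0, 10, ..., 200
normal_levels = [float(i) for i in range(21)]  # 0, 1, ..., 20

ratio = expression_ratio(cancer_levels, normal_levels)
print(round(ratio, 2))  # 9.88
print(ratio >= 2)       # True: this probeset would pass the Table 50A cutoff
```

The other tables described in this section differ only in the percentile chosen for the cancer level, the set of normal samples, and the ratio cutoff, so the same sketch applies with different arguments.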

Table 51A lists about 518 genes up-regulated in seminomatous testicular cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos 
Hu03 GeneChip array such that the ratio of "average" seminomatous testicular cancer to "average" normal adult tissues was greater than or equal to 2. The "average" seminomatous 
testicular cancer level was set to the 85th percentile amongst seminomatous testicular cancers. The "average" normal adult tissue level was set to the 95th percentile amongst
non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant normal body tissues was
subtracted from both the numerator and the denominator before the ratio was evaluated. 

Table 52A lists about 673 genes up-regulated in testicular cancer (non-seminomatous and seminomatous) compared to normal adult testicular tissues. These were selected from 
59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" testicular cancer to "average" normal testicular adult tissues was greater than or equal to 
6. The "average" testicular cancer level was set to the 75th percentile amongst testicular cancers. The "average" normal adult testicular tissue level was set to the 95th percentile 
amongst non-malignant testicular tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant normal
body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

Table 53A lists about 735 genes up-regulated in testicular cancer (non-seminomatous and seminomatous) compared to normal adult tissues. These were selected from 59680 
probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" testicular cancer to "average" normal adult tissues was greater than or equal to 3. The 
"average" testicular cancer level was set to the 95th percentile amongst testicular cancers. The "average" normal adult tissue level was set to the 95th percentile amongst non- 
malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant normal body tissues was 
subtracted from both the numerator and the denominator before the ratio was evaluated. 

:er (non-seminomatous and seminomatous). These were selected from 59680 probesets on the 
al adult tissues was greater than or equal to 3. R1 was calculated as the mean number of
ing normal testicular samples. The ratio (R2) of "average" normal testis to "average" testicular
ial testis level was set to the 50th percentile amongst normal testis. The "average" normal testicular
In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile 
value amongst non-malignant normal body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

Table 55A lists about 586 genes up-regulated in non-seminomatous mixed germ cell testicular cancer compared to normal adult testicular tissues. These were selected from 59680 
probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" non-seminomatous mixed germ cell testicular cancer to "average" normal adult testicular 
tissues was greater than or equal to 4. The "average" non-seminomatous mixed germ cell testicular cancer level was set to the 95th percentile amongst non-seminomatous mixed
germ cell testicular cancers. The "average" normal adult testicular tissue level was set to the 95th percentile amongst non-malignant testicular tissues. In order to remove
gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant normal testicular tissues was subtracted from both the numerator and the




ial adult testicular tissues. These were selected from 59680 probesets on the 
:er to "average" normal adult testicular tissues was greater than or equal to 4. The 
s. The "average" normal adult tissue lev ' 



ve gene-specific background levels of non-specific hybridization, the 10th percentile value amongst no 
al testicular tissues was subtracted from both the num ' 1 ' ' 

TABLE 50A: 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of non-seminomatous mixed germ cell testicular cancer to normal adult tissues

Pkey ExAccn UniGene Unigene Title 

432666 AW204069 ESTs, Weakly similar to ' 

432730 AI066520 Hs.131358 ESTs

450581 AF081513 Hs.25195 TGF-beta4

418696 AW959433 Hs.326290 hypothetical pr

423458 AI204212 ESTs

428564 AK001666 Hs.189095 similar to SALL1 (sal (Drosophila)-like

448981 AI968719 Hs.195387 ESTs

407710 AW022727 Hs.23616 ESTs

429486 AF155827 Hs.203963 hypothetical protein FLJ10339

451106 BE382701 Hs.25960 N-MYC oncogene

417407 AA923278 Hs.290905 ESTs, Weakly similar to protease [H.sapi

420759 T11832 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE



rUI2561 
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s.1 50890 hypothetical protein 



AI360455 

AI654443 

AA280174 

U65011 

AI638616 

M13509 

BE302454 

AW207084 

AW248508 

H65423 



Hs.37776 ESTs 



ESTs 



matrix metalloproteinase 1 (Interstitial 
MRS2 (S. cerevisiae)-like, magnesium horn 
hypothetical protein MGC14801 
ESTs; homologue of PEM-3 [Ciona sa/ign/ 
hypothetical protein DKFZp434E2l35 
Target Exon 

ESTs, Highly similar to A40350 transcrip 
small inducible cytokine subfamily B (Cy 



428479 Y00272 



425266 J00077 I 

408465 AW196940 I 

444971 AI651116 I 

413318 AU076607 I 

425769 U72513 I 

409731 AA125985 I 

424905 NMJ02497 I 

412265 AA101325 I 

407340 M810168 I 

453B84 AA355925 1 

422956 BE545072 

432239 X81334 

440119 AA865455 

431840 AA534908 

435918 AF263538 

412537 AL031778 

416658 U03272 

428916 AF003001 

424085 NM_002914 

453392 U23752 

437052 AA861697 



M31659 

AW004683 

AI970276 

AA628980 

AA866115 



CDC7 (cell division cycle 7, S. cerevi 
fibroblast growth factor 4 (heparin se 
alpha-fetopratein 



inter-alpha (globulin) inhibitor, H2 pol 
Human RPL13-2 pseudogene mRNA, comple' 
thymosin, beta, identified in neuroblast 

hypothetical protein FU12457 
vitiligo-associated protein VIT-1 
KIAA01 86 gene product 
ECT2 protein (Epithelial cell transform! 
matrix metalloproteinase 13 (collagenase 
ESTs, Moderately similar to unknown [H.s 
POU domain, class 5, transcription facto 
growth differentiation factor 3 
nuclear transcription factor Y, alpha 
fibrillin 2 (congenital contractural ara 
telomeric repeat binding factor (NIMA-in 
replication factor C (activator 1) 2 (40 
SRY (sex determining region Y)-box 1 1 
ESTs 

i branched chain aminotransferase 1 , cytos 

hypothetical protein FU10713 
I DNA replication factor 
I ESTs 



AI188710 



mutS (E. coli) homolog 2 (colon cancer, 
KIAA1676 

down syndrome critical region protein DS 
Homo sapiens cDNA FU11381 fis, clone HE 
hepatocyte growth factor (hepapoielin A; 



ESTs 



441287 
425572 
416747 
436902 
441627 
440304 
432407 



ie, homolog 



Hs.58086 
Hs.1 25395 
Hs.13273 



gb:zr03f12.r1 Stratagene NT2 neuronal pr 
gb:UI-H-BWO-ajp-g-09-0-lll.s1 NCI_CGAP_Su 
matrix metalloproteinase 12 (macror, 
DNA (cytosine-5-)-methyltransferas( 



Homo sapiens cDNA FU1041 7 fis, clone NT 
chromatin assembly factor 1, subunit A ( 
origin recognition complex, subunit 1 (y 
ESTs, Highly similar to A35661 DNA exci; 
apolipoproteinA-ll 
ESTs 

putatatlve 28 kDa protein 
ESTs, Weakly similar to ALUB JHUMAN !!!! 
gb:df46h12.y1 Morton Fetal Cochlea Homo 
gb:EST3741 54 MAGE resequences, MAGG He 



WO 03/042661 PCT/US02/36810 



Hs.323822 Homo sapiens mRNA for K1AA1 551 protein, 



431494 
416661 
423642 






445537 AJ245671 Hs.12844 EGF-lito 



Hs.298312 hypothetical protein DKFZp434A1315 

Hs.79440 IGF-tl mRNA-binding protein 3 

Hs.1 57148 hypothetical protein MGC13204 

Hs.195718 ESTs 

Hs.120769 gb:zn43e07.s1 Stratagene HeLa cell s3 93 

Hs.97084 lymphocyte antigen 94 (mouse) homolog (a 

Hs.1 56326 Human DNA sequence from clone RP11-145L2 



phosphoinositol 3-| 









ESTs, Weakly similar to ALU' HUMAN ALU S 


4.50 




AK000517 




NALP2p t in i N-Con ini 3 Al F1 1 
















BE294069 




hypothetical protein FLJ10512 






AI188161 






4.45 




AI638774 




testes development-related NYD-SP20 


4.40 




AW856665 




gb:RC3-CT0297-290100-013-d03 CT0297 Homo 








Hs.1 69840 


TTK protein kinase 


4^30 




AI478658 




1 1 Idin ' inhibi pd guanine nn leotide 


4.20 




AW182459 


Hs.125759 


1 STs Wea milai U hlf t'lLU 1 - 






NM 005033 






4^15 




AA236776 




MAD2 (mitotic arrest deficient, yeasl, h 


4.10 






Hs.1 28859 




4.10 










4.10 










4.05 




BE182082 


Hs.246973 






445140 




Hs.197913 


ESTs, Weakly similar to SCP3 MOUSE SYNAP 


4.00 


448038 




Hs.232026 


~ ST Wea y simil R 52 HUI f -V r 


4.00 




AI498957 


Hs.170861 


1 F ly similar tc HUMAN ZIN 


3.95 








KIAA0007 protein 


3.95 




AV6571 17 




ESTs, Moderately similar to S65657 alpha 


3.95 




AA281219 






3.95 






Hs.289014 


ESTs, Weakly similar to A43932 mucin 2 p 


3.91 




NMJ01898 




cystatin SN 






AK001239 


Hs.274263 


hypothetical protein FLU 0377 






BE5472C7 




hypothetical protein MGC13183 


3.85 


403780 






C4001759:gi|133250|sp|P19474|RO52_HUMAN 


3.84 




AB028943 


Hs.1 09445 


KIAA1 020 protein 


3.84 






Hs.81343 




3.84 




AK001673 




poll 'l prolpin FLJ10811 


3.82 














AF053306 


Hs.36708 








AW450198 










AI683036 


Hs.124135 


Homo sapiens cDNA FU13051 fis, clone NT 






AL1 09688 










NM_001B09 




centromere protein A (17kD) 


3.68 














NMJ12247 




SELENOPHOSPHATE SYNTHETASE; Human selen 


3^65 










3.65 






Hs.170623 


hypothetical protein FLJ11'183 


3.65 






Hs.170114 


KIAA0061 protein 


3.60 






Hs.108106 




3.60 






Hs.137516 








H93366 




branched chain aminotransferase 1, cytos 


3^48 






Hs.98440 


FT . j I, imilai I 1 hypo'.Leti 






AA490866 






3.44 




AW006303 




ESTs, Weakly similar to (defline not ava 


3.43 




AI420213 






3.41 


422094 AF129535 Hs.272027 F-box only protein 5 3.40
418661 NM_001949 E2F transcription factor 3 3.40
423198 M81933 Hs.1634 cell division cycle 25A 3.39
424153 MAGE-like 2 3.33
417705 AW134952 Hs.175220 hypothetical protein FLJ14541 3.37
443715 AI583187 Hs.9700 3.34
420281 AI623693 Hs.323494 Predicted cation efflux pump 3.34
449571 AW016812 Hs.200266 ESTs 3.34
424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B
452807 AA028933 Hs.162434 ESTs 3.31
422756 AA441787 glycoprotein hormones, alpha polypeptide 3.30
421650 AA781795 Hs.122587 ESTs 3.30
418355 L42563 ATPase, H+ transporting, nongastric, alp 3.28
438494 AA908678 Hs.130183 ESTs 3.23
424568 AF005418 Hs.150595 cytochrome P450, subfamily XXVIA, polype 3.22


433764 


AW753676 


Hs.39982 


zinc finger protein R'NZF(NM 023929) 
ESTs 


3.20 


427642 


R40761 


Hs.9834 


3.20 


414747 U30872 Hs.77204 centromere protein F (350/400kD, mitosin 3.18
403432 NM_001622:Homo sapiens alpha-2-HS-glycop 3.16
442618 R56222 Hs.26514 ESTs 3.17
415799 AA653718 Hs.225841 DKFZP434D193 protein
416000 R82342 Hs.79856 ESTs, Weakly similar to S65657 alpha-1C- 3.15
450431 AW136797 Hs.266041 ESTs 3.13
433800 AI034361 Hs.135150 lung type-I cell membrane-associated gly 3.12








AI240006 




ESTs 


3.12 




NM_007019 




ubiquitin carrier protein E2-C 


3.10 




AW965339 




ESTs 


3.10 




R76593 




gbyi60c11.r1 Soares placenta Nb2HP Homo 


3.05 


430253 


AK001514 


Hs.236844 


hypothetical protein FLJ 10652 


3.04 


411975 


AI916058 




ESTs 


3.01 






Hs.241559 


Homo sapiens raRNAfull length insert cDN 


3.00 


413943 


AW294416 




Homo sapiens cDNA FLJ12981 lis, clone NT 


2.99 


440207 


AI371978 


Hs.1 28326 


ESTs 


2.98 










2.97 


432840 


AK001403 


Hs.279521 


hypothetical protein FLJ20530 


2.97 


450149 


AW969781 


Hs.1 32863 


Zic family member 2 (odd-paired Drosophi 


2.95 




AW665538 


Hs.1 17689 


ESTs 


2.93 










' 2.93 




AW813110 




gb:CM4-ST0189-051099-021-f05 ST0';89 Homo 


2.91 




BE263782 




KIM0008 gene product 


2.90 




AA452378 


Hs.1 46668 


Homo sapiens mRNA; cDNA DKFZp547J125 (fr 


2.90 




AA814043 


Hs.88045 




2.89 




AW970529 


Hs.86434 


hypothetical protein FLJ21 81 6 


Z89 




AI798376 




3b:tr34b0 1 NCI_CGAP_0/23 Homo sapiens 


2.87 




NM_004484 






Z87 






Hs.263478 




2.86 




BE540274 




forkhead box M1 


2.86 








troponin I, cardiac 


2.85 






Hs.305985 


early development regulator 1 (homolog o 


2.84 




A1038687 


Hs.1 33338 


ESTs 


2.84 






Hs.339665 


ESTs 


2.84 




AI368236 




ESTs, Moderately similar to ALU 1. HUMAN A 


2J34 


406687 






matrix metalloproteinase 1 1 (stromeiysin 


2 33 


416201 


AA467752 


Hs.195161 


ESTs 


2.83 




AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines 


2.83 








Friend leukemia virus integration 1 


2.82 


410704 


BE076754 




gb:CM1 -BT0601 -1 80200-1 21 -b1 0 BT0601 Homo 


2.81 




BE155042 




gb:PMO-HT0349-101299-002-E04 HT0349 Homo 


2.30 




BE539976 




Homo sapiens mRNA; cDNA DKFZp434B0425 (f 


2.75 




AI393122 


Hs.1 34726 


ESTs 


2.75 




AF2 16751 


Hs.26813 


CDA14 


2.75 




MM 000506 




coagulation factor II (thrombin) 


2.74 


442032 AW016786 ESTs 2.73
437123 AL049285 Hs.302053 Homo sapiens mRNA; cDNA DKFZp554M193 (fr 2.72
446528 AU076640 nucleolar protein 1 (120kD)
442007 AA301116 nucleolar phosphoprotein Nopp34 2.71
438180 Hs.272151 ESTs 2.70
453900 AW003582 Hs.226414 ESTs, Weakly similar to ALU8_HUMAN ALU S 2.70
423765 R23858 Hs.143375 Homo sapiens, clone IMAGE:3840937, mRNA, 2.69


420949 


AA934063 


Hs.1 3836 


ESI , similar to I3802 II 


2.69 


413813 M96956 Hs.75561 teratocarcinoma-derived growth factor 1 2.68
433914 AF108138 Hs.112160 Homo sapiens DNA helicase homolog (PIF1) 2.67
445413 AA151342 Hs.12677 CGI-147 protein 2.66
448769 N66037 Hs.38173 ESTs 2.66
411022 AW936378 gb:QV4-DT0021-301299-071-f05 DT0021 Homo 2.65
423600 AI633559 Hs.310359 ESTs 2.65



447175 
414151 
448877 



453775 
429228 
410929 

446142 



420218 
407275 
414312 



Hs.257245 
Hs.253313 
Hs.179718 
Hs.132799 
Hs.143375 
Hs.35120 



426075 
435096 
422468 



AI754693 
AI207197 
AI469213 
AF157482 
AW958037 
AI364186 
AA155694 
AB002359 
AW600291 



Hs.145968 ESTs 



riboscmal protein L4 
gb:qw34h07.x1 NCI_CGAP_UI 
Hs.191060 ESTs 
Hs.1 05478 phosphoribosylfoi 



hypothetical protein FU 10430 
Hs.270149 ESTs, Weakly similar to 2'09250ABcell 

gb:nu73b07.s1 NCI_CGAP_Alv1 Homo sapiens 
gb:EST63589 Jurkat T-cells V Homo sapien 



452226 AA024898 Hs.1 57103 ESTs 

421451 AA291377 Hs.50831 ESTs 

424308 AW975531 Hs.1 54443 i 

418203 X54942 

453941 U39817 Hs.36820 

413762 AW411479 Hs.848 FK506-binding protein 4 

449655 AI021987 






AW138190 
AF116846
AK001581 
H81213 



Hs.242183 HOM-TES-85 tumor antigen 

Hs.18616 hypothetical protein FLJ20311

Hs.96978 hypothetical protein MGC10764 

Hs.12570 tubulin-specific chaperone d

Hs.212709 ESTs 

Hs.12M5 putative protein 

Hs.180248 zinc finger protein 124 (HZF-16) 

Hs.10431 dead ringer (Drosophila)-like 2 (bright 

Hs.334828 hypothetical protein FLJ10719; KIAA1794

Hs.14825 ESTs, Weakly similar to KIAA1503 protein 

Hs.222024 transcription factor BMAL2 



407325 
410276 
444670 
419029 

414812 
425202 
425212 
423787 



AA291180 
AI554545 
H58373 
AA233397 



AW629485 
AW380579 
NM_004477 



417833 
451999 
407910 
418866 
410060 
449138 
425159 



410275 
407818 
417777 
401704 
449670 
424081 



441031 
419594 
407289 



414890 
409014 
418140 



AW176401 
M650274 
T65754 
NM001448 

NM_004341 
AF155108 
M379597 



Hs.77367 
Hs.152049
Hs.171618 
Hs.236204 
Hs.140720 
Hs.209657 
Hs.203772 
Hs.243991 
Hs.86264 



AA428211 
AU076799 
BE251328 
T49300 



41296 



FLJ14549

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
fibronectin leucine rich transmembrane p
gb:yo11c07.s1 Stratagene lung (937210) H
glypican 4 
ESTs 



NM_006413
AF039850 



AI110684 
AA013051 
M135159 
AA830797 



Hs.139120 
Hs.121028 
Hs.198515 
Hs.152484 



Hs.251865 
Hs.77573 
Hs.49760 
Hs.83551 



Homo sapiens, Similar to RIKEN cDNA 2810 

HSPC150 protein similar to ubiquitin-con

transcription factor AP-2 gamma (activat 

jumonji (mouse) homolog 

ESTs, Weakly similar to l7B385serine/th 

NM_021195*:Homo sapiens claudin 6 (CLDN6

Homo sapiens mRNA; cDNA DKFZp434K2172 (f 

ribonuclease P (30kD) 

hypothetical protein FLJ10549

dead ringer (Drosophila)-like 1 

ESTs, Weakly similar to I38022 hypotheti 

small inducible cytokine subfamily B (Cy 

hypothetical protein FLJ10514

NMJB n ! _h gl F 

fibrinogens beta polypeptide 

Dme (DN ) II bindii j pr in 
Homo sapiens cDNA FLJ12149 fis, clone MA
CCAAT-box-binding transcription factor
' ;e1,cyKK 



Hs.35304 
Hs.81831 
Hs.23960 
Hs.131973 



ESTs 

uridine phosphorylase 

origin recognition complex, subunit 6 (y

microfibrillar-associated protein 2 

hypothetical protein FLJ14033 similar to

apolipoprotein A-IV

hypothetical protein FLJ10881

Homo sapiens cDNA FLJ13655 fis, clone P

Homo sapiens cDNA FLJ11347 fis, clone P 

ESTs 



410855 
448757
450254 
418973 



400195 
431805 



AA926960 CDC28 protein kinase 1 

AI288274 Hs.345792 ESTs

X97795 Hs.66718 RAD54 (S.cerevisiae)-like 

AI365784 Hs.48820 TATA box binding protein (TBP)-associate

NM_004885 Hs.99231 neuropeptide G protein-coupled receptor; 

AA233056 Hs.191518 ESTs 

AW295647 Hs.71331 hypothetical protein MGC5350 

AA912476 Hs.116750 Homo sapiens cDNA FLJ13221 fis, clone NT

AW206447 gb:UI-H-BI1-afg-g-02-0-UI.s1 NCI_CGAP_Su

AI267700 ESTs 

NM_007057*:Homo sapiens ZW10 interactor

AA436884 Hs.48926 ESTs

NM_014053 Hs.270594 RVCR protein

BE091926 Hs.16244 mitotic spindle coiled-coil related prot

AA285249 Hs.146329 protein kinase Chk2 (CHEK2)

BE250964 Hs.82045 midkine (neurite growth-promoting factor






423905 
413992 
412722 



KIAA0203 gene product 
ESTs, Moderately similar to I38022 hypot
C1900072B*:gi|12586552|sp|Q9Y2Q1|Z257_HU
iis laclor receptor superfami 
ESTs, Weakly similar to I38022 hypotheti




429166 
432446 
417066



Hs.158084
Hs.31442 
Hs.164115 

Hs.197668
Hs.294088 
Hs.82772 
Hs.301704 
AA251393 Hs.289052 
AI656634 Hs.195389 
AI761902 Hs.99597 
AK000492 Hs.98806 hypothetical protein 



37 kDa leucine-rich repeat (LRR) protein 
Homo sapiens mRNA; cDNA DKFZp667N064 (fr 
IGF-II mRNA-binding protein 1 
KIAA0974 protein
peroxisome receptor 1 

ESTs 

thyroid hormone receptor coactivating pr 



collagen, type XI, alpha 1 
eomesodermin (Xenopus laevis) homolog
Homo sapiens, Similar to RIKEN cDNA 5430 



Z15005 

AA009647 

AA062980 



Hs.347310 hypothetical protein FLJ14627

NM_000179*:Homo sapiens mutS (E. coli) h
Hs.75573 centromere protein E (312kD)



448275 
432731 
405157 

423739 
421310 
457107 
437257 
407259 



BE243834 

NM_002318

AW450669 

AW952792 

AA459699 

AW452871 

AI272141 

BE514434 

R31178 



Hs.159251 forkhead box H1

Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 

Hs.50441 CGI-04 protein 

Hs.83354 lysyl oxidase-like 2

Hs.45068 hypothetical protein DKFZp434l143 



AA418245
L02255 



Hs.99496 ESTs 
Hs.56043 CGI-115 protein 
Hs.83484 SRY (sex determining region Y)-box 4 
Hs.20830 kinesin-like 2 
Hs.287820 fibronectin 1 

NM_003213*:Homo sapiens TEA domain famil 
Hs.155462 minichromosome maintenance deficient (mi 
Hs.97600 ESTs 

Hs.103315 trinucleotide repeat containing 1 
Hs.185796 ESTs, Weakly similar to Z184_HUMAN ZINC
Hs.290931 ESTs, Weakly similar to YFJ7_YEAST HYPOT 
gb:Human Fab fragment binding syncytial



TABLE 50B: 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 

Pkey CAT Number Accession 

432665 144 7 AA558585 AA565499 AI360576 AW204069 AA991648 AA864939

423458 30480 J BC018070 BG702493 AI204212 AA460929 AA993606 BF926635 AA226938 BG190705 BG186496 AW291865 BG183340 BG195301 BG214539

BG215094 BG198867 BG196332 BG208220 BG212418






430676 
427521 



60836.2 
513212J 



41837B 1227421J 
422689 874209.1 
436514 132288.1 



413646 
442032 
411022 



420218 191547.1 

435096 125215.1 

422468 216674.1 

415684 18695.18 

447350 2267324 1 

410275 641443.1 



13268.1- 
245947.1 



400195 16B94.2 



450375 16559.3 



BG433950 BE061583 T05808 BE1 4431 3 AW812038 BE144812 AWB12040 AW812041 AU124350 BE061602BE061604 BF922595 BE061603 
AI352469 BE061601 BI062752 AW818205 BF887722 

AW973352 BF222929 AW016853 BF059130 AI651829 BE551767 AA55B414 AI339359 BF059501 AI961162 AI341422 AI206248 AI206165 

AA548736AA768578AI539031 AW (I r c A 1357AA480892 

BF51 071 5 BE673055 BE4641 1 1 AW590620 AI637939 AA404324 AW236441 AI650952 BF056796 AA974433 

AK025201 AA425472 AI694282 BG057305 AA907787 AI286170 A;684577 AJ42C494 AI809865 BF058095 AI478773 Al 1 60445 AL0441 14 
AW665529 Al 129239 AW2971 52 AI268215 AI469807 AI969353 BE552356 N66509 AA736741 AA382555 AW075B11 AV759188 BI259364 
BF4451 42 BG232066 Al 141 758 AI631 202 AI1 67566 AI208445 AA889823 BF982682 N90322 BI090882 BF20B005 AW95391 8 AL0441 1 3 AI01 6793 
AA382555 AW235763 AA927051 AI862075 BE886691 BE619282 
AV752763 AI032142 N30308 N221 81 H95390 AW575632 
AW978773 AW298067 AA810101 AW194180 AA731645 AI690673 

BC022538 AI990847 BF478249BG217996 BG212702 BG1 82057 AW589883 BF000085 AA993969 BG479023 BG220014 BG679466 BE907092 

AI623855 AA223956 AA223917 AW022983 AW090580 AW573219 BF514491 BF445397 AA884705 AI910424 

AA218925AW962081 M354237 

AW954733 AA315006 AW856665 

AA683356 AW592804 AI1 50287 

AL109688 R23665 R26578 

AF147390 R76593 R76594 

AWB1 31 1 0 BF771 370 BF771371 AW8131 1 3 AW003381 

AF134164BF809407 AA21 B567 BF8428S3 AI267168 BF875178 BG999253 AW861851 AW858362 AI817548 BF771300 AA1 13928 AA223422 
M055556 BF773400 BF99B869 BE081333 BE073424 BE142245 H59571 H59570 BF871558 BF871064 BE001132BF826831 AW754298 
AA223267 BG997895 BG997897 AW991957 AA534354 BG31 9501 BF736309 AI694265 AA045564 BG950256 AI829309 BG987850 BE093175 
BF854337 

NM.000363X54163M64247AI265781 AI760600 AI367238 BE140258AW207185 AI657074 C03333 AI193911 C05024 C03193 AI950215C05070 
C05613 W17389 C05351 AA311399 C04180 C04896 C05502 C05482 C04456 C04543 C04558 C04551 C03114 C03103 AI369979 AI652255 
T12391 T12073 W19390 C02994 C02730 C04434 W07135 R57607 C03339 
M31126 

AI21 6469 AI354789 AA446136 H24336 AA446443 AI37622B R48940 

AW87745BAW877524 BE076922BE166912AW840534BE076754AW797829 BE166905BE166926AW877462BE165927BE166932AW877523 
BE166917 AW877529 BE166928 BF351394 AWB77522 AW877528 BE1B6861 BE16686B BE166913 BE166919 AW877456 AW877537 BE076866 
AW840571 



BF223060 BF222818 AI9SQ472 AW01 6786 A1207136 AI969730 BF222890 AI633857 AI968711 AA974235 A1352637 
A W936378 AW936544 AW81 351 3 

BG676155BM009S91 AI479075 AI025794AI017967 M443270 BE466812 AA853422 AI392649 BG952034 AA513384 BF840124 BE714620 



AI207197 BF773544 AW196462

AW958037 R42557 AI337047 AA948360 AI638005 AA459950 AI624915 AI638047 AI467856 AI521826 M860305 AI932315 AW003092 AW271756

AW779380 AA609879 AI634791 AI493770 AI56521 1 Z41 1 45 AI627952 AA303734 BE349457 AW196765 AA256527 BE089727 

H30075 AA664977 AW975278

AW962701 AA310998 AW962699 

BF666746 D59356 BG678312 N56640AA1 66861 

AI375572 AI480404 BF430912 T06882 

AA003514 AI554545 AW169852 A1363822 AI533826 AI656026 A1765624 M147545 AA147552 
AI740586 M771806 BE50C996 AW2D4531 AI082424 AI033B79 BF093176 AA771764 D3B575 
T65754 AA229658 AA229857

AK021881 AU145974 AU145787 C16964 AA42821 1 AU119598 AA993264 BF999192 AW903017 AA346559 AU119446 AW581679 AA991577 
AW8981 65 AW386878 AW890957 Z1 8340 

AF274943 BG494894 AI719075 M908783 A1935150 AI422691 AA910644 AA583187 BM272167 AI828996 AA527373 AW972459 AI831360 
AA772418A1033892 AA100326 AU154749 A'459432 AI42351 3 A1094597 AA740817 AI991988 AI090262 A1312104 BI256707 AA459522 AA416871 
AI075239 AI339996 AA701623 AM 39549 AI335880 AA633648 AI9B9380 AI362835 AA399239 AI146955 BF514270 N92892 AI348243 AI2788B7 
AA459292 AI494230 BF507531 AI492500 AA962595 AW51 3002 AA293140 AA235549 BF108854 AA954344 N49682 AI457100 AW589407 
AW300758 BE220715 BE220698 BE559091 BM0C9647 BF900351 AI537692 AI203723 AI857576 AA584410 AW371667 BM172353 
BM467830 AI084433 AW206447 AI400976 AI248530 R1 6553 

BC013389 BC017398AI023543AA191424A1267700A1469633AW958455AW953397AA172055BE940298BF909208 BF909980 BF095153 
BG285837 AI720344 BF541715 M355086 AA172236 

BM477554 BM423967 BC020979 AF067656 NM.007057 BI869291 BG468263 BG760599 BI251788 AA8550B0 BE257094 BF212452 BE888249 
BI259219AW409765 BE089556AL564377BI258884 AW440401 AL578460 AL578434AL555136BG036804AL531381 AW371767 BG610641 
BF1 02552 BE294929 BF792282 BG1 21 657 BG502285 BG777493 AL564510 AW770358 AA573448 AA5B4001 AA969560 AW078946 AW750065 
AL573860AA1 43778 H99221 AA969210AW103401 AW750073 

BG570706 BG572749 AW606284 H04021 AA1 51 1 66 AW954405 AA131254 BG056451 W46291 H01532 H04384 H03231 AA852876 H04410 
H59605 BE1 57501 AA1 13758 



TABLE 50C 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey 



403432 
401704 
403433 
401220 



Ref Strand 
6007890 Plus 
7711513 Minus 



7711499 
8117697 
9166422 



Nt_position

37999-38145,38652-38998,39727-39872,4055 

172780-174358 

113086-114800 

93160-93409 



72225-72437 
48079-48279 
117335-118473
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WO 03/042661 
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405157 9966228 Plus 156363-156502,157573-157746 



ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title

R1: Ratio of seminomatous testicular cancer compared to normal adult tissues 



418696 AW959433 



428664 AK001666 

420367 AA259090 

420347 AL033539 

437052 AA861697

407710 AW022727 

420528 AF130728

424578 AK001973 

420759 T11832 

417407 AA923278 

429486 AF155827

434649 AA738254 

430252 AI638774 

423458 AI204212 

438915 AA280174 

427711 M31659 

427667 AK001279

426427 M86699 

420401 AK001907 

406937 U14622

430521 NM_016383

425769 U72513 

418477 AW022983 

434551 BE387162 

436812 AW298067 

437789 AI581344 

433800 AI034361 

421241 X91817 

410102 AW248508

418134 AA397769 

433159 AB035898 

433975 AA971953 

422956 BE545072 

410561 BE540255 

431494 AA991355 

436899 AA764852 

426083 AW962712 

424905 NM_002497

408908 BE296227 

413627 BE182082

425572 AB011076 

415857 AA856115 

408728 AL137379
406647 

424153 AA451737 

434699 AA643587 

437421 AA917062 

409731 AA125985

419423 D26488 

428227 AA321549 

431840 AA534908 

430676 AF084866

436608 AA628980 

435206 AI432364 

414972 BE263782

407340 AA810168 

426518 Z43039 

436513 AJ278110 

427521 AW973352 

423673 BE003054 

422232 D43945 

420047 AI478658 

431041 AA490967 

427335 AA448542 

422797 AB033064 

418379 AA218940 



UnigeneID Unigene Title

Hs.326290 hypothetical protein FLJ12581 

ESTs, Weakly similar to unnamed 
Hs.131358 ESTs 

Hs.2051 testis specific protein, Y-linked 



Hs.23616 ESTs 

Hs.98586 doublesex and mab-3 related transcriptio 

Hs.150890 hypothetical protein 

Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE

3090 ESTs, Weakly similar to protease [H

Hs.203963 hypothetical protein FLJ10339 

Hs.165390 ESTs, Highly similar to A40350 transcrip

Hs.105328 testes development-related NYD-SP20 
ESTs 

Hs.285681 Williams-Beuren syndrome chromosome regi 

Hs.180408 solute carrier family 25 (mitochondrial

Hs.180171 Homo sapiens cDNA FLJ10417 fis, clone NT

Hs.97464 hypothetical protein 

gb:Human transketolase-like protein gene 
Hs.242183 HOM-TES-85 tumor antigen 
Hs.159486 Human RPL13-2 pseudogene mRNA, complete 

gb:df46h12.y1 Morton Fetal Cochlea Homo 
Hs.280858 ESTs, Highly similar to A35661 DNA excis 

gb:UI-H-BW0-ajp-g-09-0-UI.s1 NCI_CGAP_Su
Hs.127812 ESTs, Weakly similar to T17330 hypotheti
Hs.135150 lung type-I cell membrane-associated gly
Hs.102866 transketolase-like 1 
Hs.279727 ESTs; homologue of PEM-3 [Ciona savignyi 
Hs.86617 ESTs 
Hs.150587 kinesin-like protein 2
Hs.122055 ESTs 

Hs.122579 ECT2 protein (Epithelial cell transformi
Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone H
Hs.298312 hypothetical protein DKFZp434A1315 
ESTs 

Hs.126712 ESTs, Weakly similar to AF191020 1 E2IG5
Hs.153704 NIMA (never in mitosis gene a)-related k
Hs.250822 serine/threonine kinase 15 
Hs.246973 Intron of Bicaudal D homolog 1 
Hs.158307 undifferentiated embryonic cell transcri 
Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE
Hs.47125 hypothetical protein FLJ13912 

Target Exon 
Hs.141496 MAGE-like 2 

Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HE

Hs.56145 thymosin, beta, identified in neuroblast 
Hs.90315 KIAA0007 protein 

small inducible cytokine subfamily B (Cy
POU domain, class 5, transcription facto
gb:Homo sapiens envelope protein RIC-3 (
Hs.192371 down syndrome critical region protein DS
Hs.160594 ESTs
Hs.77695 KIAA0008 gene product
Hs.284289 vitiligo-associated protein VIT-1
Hs.170198 KIAA0009 gene product 
Hs.125507 DEAD-box protein 



Hs.197955 KIAA0704 protein

Hs.251677 G antigen 7B

Hs.235463 KIAA1238 protein 

Hs.137516 fidgetin-like 1 



Hs.2248 
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423905 AW579960 

433764 AW753675 

422665 AJ011812 

433701 AW445023 

436909 AA907120 

423728 AW891294 

429228 AI553633 

419384 AA490866 

435514 AW592804 

434334 AA912476 

430835 AI240006 

438188 AA779975 

429120 AK001673 

408758 NM_003685

424081 NM_006413

432359 AA076049 

428153 AW513143 



427510 Z47542 

412265 AA101325 

413623 AA825721 
402145 

414136 AA812434 

428479 Y00272

428949 AA442153

408460 AA054726

415947 U04045 

420900 AL045633 

426496 D31765 



422938 NM_001809 
402199 

409103 AF251237 Hs.112208 

416659 H43437 Hs.80305 

410165 AK001376 

410929 H47233 

417885 AA214684

426223 AW977812 

409421 AA199883 

428249 AA130914

429999 AI761902

431721 AB032996 



Hs.135150 lung type-Ice 
Hs.39982 zinc finger protein RINZF (NM_023929~) 
Hs.119018 transcription factor NRF 
Hs.15155 ESTs 
ESTs 

Hs.132136 solute carrier family 4, sodium bicarbon
ESTs 

Hs.39429 ESTs 



hypothetical protein FLJ10811
seP(30kD) 

Homo sapiens cDNA FLJ10229 fis, clone HE
SRY (sex determining region Y)-box 17 (S 
gb:RC3-CT0297-290100-01M03 CT0297 Homo 
ESTs, Weakly similar to I38022 hypotheti
chitinase 1 (chitotriosidase)
ESTs 

gb:EST14192 Testis tumor Homo sapiens cD 
small nuclear RNA activating complex, po 
hypothetical protein FLJ12457
intron of Bicaudal D homolog 1 
Target Exon 

I if oi yd 2 G1 o S and G2 lo 
hypothetical protein DKFZp434J0617
ESTs 

mutS (E. coli) homolog 2 (colon cancer, 
ESTs 

KIAA0061 protein 

centromere protein A (17kD) 
Target Exon 
XAGE-1 protein 

hypothetical protein MGC14258
hypothetical protein FLJ10514



Hs.179312 
Hs.86154 
Hs.246973 



Hs.334562 

Hs.285574 
Hs.78934 
Hs.44269 
Hs.170114 
Hs.31742 
Hs.1594 



Hs.30643 ESTs 
ESTs 



413197 N48921 

428329 AA426091 

418235 BE072634 

427119 AW880562 

414812 X72755 

414034 U89277 

409066 AA062980 

416201 AA467752 

433330 AW207084 

429629 BE501732 

437099 N77793 

415799 AA653718 

412530 AA766268 

418221 Z45514 

418971 AA360392 

416111 AA033813 

423175 W27595 

415717 AA167270 

423198 M81933 

433849 BE455884 

436211 AK001581 

432840 AK001403 

421307 BE539976 

414725 AA769791 

408291 AB023191 

408332 H91230 

416773 AK000340

427584 BE410293 

421917 AB028943

430647 AC003682 

430287 AW182459 

436360 AI962796 

438624 AA889055 

434609 R76593 

411945 AL033527 

408065 AW954272 



Hs.258044
Hs.44205
Hs.27441
Hs.98453

Hs.272525
Hs.77367
Hs.305985
Hs.66960 ESTs 
Hs.195161 
Hs.132816 
Hs.30622 
Hs.48659 
Hs.225841 
Hs.266273 
Hs.83775 
Hs.87113 
Hs.79018 
Hs.347310 
Hs.130435 
Hs.1634 
Hs.280728 
Hs.334828 
Hs.279521 



zinc finger protein 268 
ESTs 

KIAA1170 protein
cortistatin
KIAA1615 protein

Ef f'r er milartoF 1 " [H 
gb:PM4-BT0548-171299-001-h08 BT0548 Homo



ESTs 

hypothetical protein MGC14801 
Homo sapiens cDNA FLJ13010 fis, clone NT
ESTs, Highly similar to S14458 laminin a
DKFZP434D193 protein

protein FLJ13346



ESTs 



cell division cycle 25A 
ESTs 



Hs.44131 
Hs.234794 
Hs.79828 
Hs.179718 
Hs.109445
Hs.127988
Hs.125759
Hs.156100
Hs.123468



protein FLJ10719; KIAA1794
hypothetical protein FLJ20530 
Homo sapiens mRNA; cDNA DKFZp434B0425 (f 
ring finger protein 21, interferon-respo 
KIAA0974 protein 

Homo sapiens mRNA; cDNA DKFZp564B083 (fr 
hypothetical protein FLJ20333
v-myb avian myeloblastosis viral oncogen 
KIAA1020 protein 

ESTs, Weakly similar to Z211_HUMAN ZINC
ESTs, Weakly similar to LEU5_HUMAN LEUKE
ESTs

gb:yi60c11.r1 Soares placenta Nb2HP Homo
L-myc-2 protein (MYCL2)

gb:EST366342 MAGE resequences, MAGC Homo 






413833 Z15005 

421010 AW974553 

438456 AA913381

424687 J05070 

412537 AL031778 

418661 NM_001949

408750 BE294069

422094 AF129535 

419839 U24577 

416350 AF188625 

426054 U12431 
401435 



426067 AW554691 

415684 D59356 

429469 M64590 

424590 AW965399 

427761 AA412205 

433641 AF080229 

418216 AA662240 

438180 AA808189 

424281 AA766243 

421379 Y15221 

428878 AA436884 

438885 AI886558 

416445 AL043004 

424381 AA285249 

432415 T16971 

427298 AA400495 

420218 AW958037 

407300 M102616 

410420 AA224053 

432809 AA555509 

424085 NM_002914

421373 AA808229

423354 AB011130 

418830 BE513731 

431077 AI669133 

418049 AA211467 

420949 AA934063 

432407 AA221036 

434288 AW189075

418295 AW970043 

429714 BE661801 

421350 AW301608 

420161 AI683069 

414518 AI204600 

430253 AK001514 

414747 U30872 

423419 R55336 

410275 U85658 

408092 NM_007057

423685 BE350494

438378 AW970529

415912 H08859 

412140 AA219691 

420552 AK000492 



415829 AW450198 



429830 AI537278

420524 AB010575 

433023 AW864793 

421633 AF121860 

420507 AF093408 

432938 T27013 

414598 AI094221 

419635 NM_005033 

425312 AA354940 

425474 Z48054 

411027 AF072099 

432446 AA542845 

424513 BE385864 

436902 AW247145 

422789 AK001113 

430056 X97548



Hs.272027 
Hs.93304 
Hs.189507 



Hs.97053 ESTs 



matrix metalloproteinase 9 (gelatinase B
nuclear transcription factor Y, alpha 
E2F transcription factor 3 
hypothetical protein FLJ10512
F-box only protein 5 

phospholipase A2, group VII (platelet-ac 
phospholipase A2, group IID 
ELAV (embryonic lethal, abnormal vision, 
C14000397-:gi|7499696|pir||T33295 hypoth
ESTs, Moderately similar to neuronal thr 
hypothetical protein FLJ10688 
"""3 gene product 



sorbitol dehydrogenase 



hypothetical protein FLJ20086




jbroal H-ll ZG I B1Hom( pier 



KIAA0135 protein

protein kinase Chk2(CHEK2) 
ESTs, Weakly similar to A43932 mucin 2 p 



Hs.139226 
Hs.46677 
Hs.127436
Hs.88959 



Ho.13 



replication factor C (activator 1) 2 (40 
ESTs 

calcium channel, voltage-dependent alph 
r f i il -rolPin MGC4816 
hypothetical protein FLJ12810 
Homo sapiens, Similar to nuclear localiz
ESTs, Weakly similar to I38022 hypotheti
gb:zr03f12.r1 Stratagene NT2 neuronal pr 



Hs.238039 hypothetical protein FLJ11090

Hs.2484 T-cell leukemia/lymphoma 1A 

Hs.278188 ESTs, Moderately similar to I54374 gene

Hs.120817 ESTs 

Hs.96978 hypothetical protein MGC10764

Hs.236844 hypothetical protein FLJ10652 

Hs.77204 centromere protein F (350/400kD, mitosin 

Hs.23539 ESTs 

Hs.61796 transcription factor AP-2 gamma (activat 

Hs.42650 



Hs.86434 
Hs.206459 
Hs.73625 
Hs.98806 

Hs.274263 
Hs.163742 
Hs.97600 
Hs.268814 
Hs.1454 
Hs.290931 
Hs.225841 
Hs.98547 



hypothetical protein FLJ21816
ESTs, Weakly similar to ALU6_HUMAN ALU S
RAB6 interacting, kinesin-like (rabkines
hypothetical protein

NM_030920*:Homo sapiens hypothetical pro
hypothetical protein FLJ10377
ESTs 
ESTs 
ESTs 

gastric inhibitory polypeptide 
ESTs, Weakly similar to YFJ7_YEAST HYPOT 
DKFZP434D193 protein 
amiloride-sensitive cation channel 3, te 
thrombospondin 1 
Hs.106260 sorting nexin 10
Hs.98397 A kinase (PRKA) anchor protein 3 
Hs.3132 steroidogenic acute regulatory protein 
Hs.135150 lung type-I cell membrane-associated gly
Hs.91728 polymyositis/scleroderma autoantigen 1 ( 
Hs.145958 ESTs 
Hs.158084 peroxisome receptor 1 
Hs.67846 leukocyte immunoglobulin-like receptor, 
Hs.294088 GAJ protein 
Hs.149894 mitochondrial translational initiation f
Hs.192729 ESTs 

Hs.120842 hypothetical pro em r^J m M 
Hs.228059 KRAB-associated protein 1 
Hs.199179 RAN binding protein 2 
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30 



406367 

418866 T65754 

435918 AF263538 

436511 AA721252 
402680 

414161 AA136106 

427239 BE270447 

433683 AI817723 

417576 AA339449 

420697 AA827705 
427719 AI393122 
419131 AA406293 

427314 AB033024 
424315 AW614850 
430335 D80007 
410361 BE391804 
413686 AI469213 
429183 AB014604 
430292 AK000634 
422726 U11690 
437834 M769294 
435159 AA668879 
428361 NM_015905
430388 AA356923
434070 AF116652
429323 NM_001649
433247 AB040948 
415884 H22966 
427668 M298760 
437162 AW005505 
401091 

425601 AW629485 
428597 AK000147 
417705 AW134952 
438243 AI581311 
418203 X54942 
410704 BE076754
429063 AW363845 
427147 AA398587 
430552 AA176374
437660 W31708 
425237 U07695 
419335 AW960146 
426386 AA748850 
423123 NM_012247
430968 AW972830
420596 NM_002692
419741 NM_007019
401464 

411855 H67899 
411355 M76477 
419029 AA233397 
421654 AW153267
421535 AB002359 
423453 AW450737 
412673 AL042957 
410006 AW732308 
434159 AW135214 
427260 AA663848 
439053 BE244588 
414706 AW340125 



Hs.291502 ESTs 

Target Exon 
Hs.184852 KIAA1553 protein 



Target Exon 
Hs.26605 ESTs 
Hs.134726 ESTs 



proline oxidase homolog 
KIAA1198 protein
putative 28 kDa protein
KIAA0185 protein

guanylate binding protein 1, interferon-



Hs.71404 ESTs 



Hs.197955
Hs.238270
Hs.1572 



Hs.163858

Hs.240770 

Hs.270087 

Hs.2391 

Hs.14285 

Hs.13471

Hs.180191 

Hs.5464 

Hs.140720 
Hs.295909 
Hs.175220 

Hs.83758 

Hs.322903 
Hs.97414 
Hs.243886 
Hs.55304 
Hs.155227
Hs.284137
Hs.125830
Hs.124027 



KIAA0704 protein 
hypothetical protein FLJ20627 
faciogenital dysplasia (Aarskog-Scott sy
gb:rt236g03.s1 NCI_CGAP_GCB1 Homo sapiens
ESTs

transcriptional intermediary factor 1
nuclear cap binding protein subunit 2, 2 
protein PRO0813 



GSK-3 binding protein FRAT2 
hypothetical protein FLJ10700
hypothetical protein FLJ14541
ESTs 

CDC28 protein kinase 2
gb:CM1-BT0601-180200-121-b10 BT0601 Ho
ESTs, Weakly similar to A46010 X-linked
ESTs

nuclear autoantigenic sperm protein (his



SELENOPHOSPHATE SYNTHETASE ; Human seler 
gb:EST384925 MAGE resequences, MAGL Homo
polymerase (DNA directed), epsilon 2
ubiquitin carrier protein E2-C
histone deacetylase 5

Homo sapiens cDNA: FLJ23259 fis, clone C
GM2 ganglioside activator protein 
hypothetical protein FLJ12581 



Hs.105478 
Hs.128791 CGI-09 protein
Hs.31845 — 
Hs.57783 
Hs.191828 



Hs.6455 chaperonin containing TCP 1 , sul 
Hs.76989 KIAA0097 gene product

gb:ag03a08.s1 Soares_testis_NHT Homo sap



420582 BE047878 

418355 L42563 

411127 AA668995 

437205 AL110232

412123 BE251328 

436481 AA379597 

408446 AW450669 

437033 AW248364 

418592 X99226 

415585 R59946 

424800 AL035588 

426470 AA528794 

426919 AL041228 

421209 AJ010230 

437496 AA452378 
401837 

428743 AL080060

422809 AK001379 



Homo sapiens chromosome 19, cosmid R2837 
ATPase, H7 transporting, nongastric, alp 
hypothetical protein 

Homo sapiens mRNA; cDNA DKFZp564D2071 (f 
hypothetical protein FLJ10881 
HSPC150 protein similar to ubiquitin-con 
hypothetical protein DKFZp434H43 
RNA polymerase I subunit 
Fanconi anemia, complementation group A 
KIAA1553 protein 
MyoD family Inhibitor 



Hs.128644 ESTs 



ELAV (embryonic lethal, abnormal vision, 
ret finger protein-like 1 antisense 
Homo sapiens mRNA; cDNA DKFZp547J125 (fr
NM_025109:Homo sapiens hypothetical prot
Homo sapiens mRNA; cDNA DKFZp564H172 (fr
hypothetical protein FLJ10549



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418648 AW979223 

423020 AA383092 

430345 AK000282 

424075 AI807320 

428728 NM_016625 

423755 AB037735 

424051 AL110203

416734 H81213

422406 AF025441 

433228 F28212 

411943 BE502436 

426181 AA371422 

423642 AW452650 

411571 M122393 

419750 AL079741 

408209 NM_004454

435726 BE535787
404068 
403137 

434276 AF123659 

422283 AW411307 

429652 AA766810 

416204 AW972270 

414713 BE465243 

425910 AA830797 

408875 NM_015434

435244 N77221 
402679 

413943 AW294416 

433914 AF108138 

437812 AI582291 

410855 X97795 

423232 BE244625 

427578 AI591305 



434981 AW182577 

417911 M333387 

409210 AA251812 

424425 AB031480 

411885 M452636 

421567 AJ272137 

425159 NM_004341 

418678 NM_001327

431197 AL038596 

411630 U42349 

410968 AA199907 

421305 BE397354 

417153 X57010 

412389 AW947655 

419359 AL043202 
403780 

437681 AI207958 
400205 

433160 AW207002 

432606 NM_002104 

425331 AW962128 

430506 BE266026 

424308 AW975531 

418821 AA436002 

437437 AA226869 

413437 BE313164 

425848 BE242709

435532 AW291488 

430183 BE010038 

409342 AU077058 

430504 H52761 

427726 AI359144 

417115 AW952792 

412721 AW183155 
404071 

413762 AW411479 

425811 AL039104 



414732 AW410976 



Hs.292478 ESTs 

Hs.1608 replication protein A3 (14kD)

Hs.239681 hypothetical protein FLJ20275 

Hs.227630 RE1-silencing transcription factor

Hs.191381 hypothetical protein 

Hs.132560 hypothetical protein FLJ10312

Hs.138411 Homo sapiens mRNA; cDNA DKFZp586J1922 (f

Hs.14825 ESTs, Weakly similar to KIAA1503 protein 

Hs.116206 Opa-interacting protein 5 

Hs.14953 KIAA1491 protein 

Hs.7962 ESTs, Weakly similar to S44608 C02F5.6 p 

Hs.334371 hypothetical protein MGC13096 

Hs.157148 hypothetical protein MGC13204 

Hs.70811 hypothetical protein FLJ20516

Hs.183114 Homo sapiens cDNA FLJ14236 fis, clone NT

Hs.43697 ets variant gene 5 (ets-related molecule

Hs.113170 ESTs



Hs.114311 CDC45 (cell division cycle 45, S.cerevis 
Hs.259290 ESTs 
Hs.195161 ESTs 
Hs.12664 ESTs

Hs.184760 CCAAT-box-binding transcription factor
Hs.48604 DKFZP434B168 protein
Hs.187824 ESTs

NM_000478:Homo sapiens alkaline phosphat 
Hs.144687 Homo sapiens cDNA FLJ12981 fis, clone NT 
Hs.112160 Homo sapiens DNA helicase homolog (PIF1)
Hs.16846 ESTs, Weakly similar to 04HUD1 debrisocu
Hs.66718 RAD54 (S.cerevisiae)-like
Hs.125742 leucine-rich neuronal protein
Hs.169084 ESTs, Highly similar to TUL3_HUMAN TUBBY
Hs.190466 hypothetical protein FLJ22584
Hs.236204 nuclear pore complex protein
Hs.172589 nuclear phosphoprotein similar to S. cer

gb:Homo sapiens full length insert cDNA
Hs.293077 ESTs 

Hs.82916 chaperonin containing TCP1, subunit 6A ( 
Hs.51120 cathelicidin antimicrobial peptide
Hs.146824 SPR1 protein 

Hs.131057 ESTs, Moderately similar to CRGD_HUMAN G 

Hs.198265 matrix metalloproteinase 25 

Hs.154868 carbamoyl-phosphate synthetase 2, aspart

Hs.157379 cancer/testis antigen (NY-ESO-1)

Hs.250745 polymerase (RNA) III (DNA directed) (62k

Hs.71119 Putative prostate cancer tumor suppresso

Hs.67397 homeo box A1

Hs.324830 diptheria toxin resistance protein requi 

Hs.81343 collagen, type II, alpha 1 (primary oste 

gb:RCO-MT0003-140300-031-b07 MT0003 Homo 
Hs.90073 chromosome segregation 1 (yeast homolog) 

C4001759gl|133250|sp|P19474|RO52_HUMAN 
Hs.166556 Homo sapiens, Similar to TEA domain fami 

NM_006265*:Homo sapiens RAD21 (S. pombe)
Hs.134342 TASP for testis-specific adriamycin sens
Hs.3066 granzyme K (serine protease, granzyme 3; 

gb:EST374201 MAGE resequences, MAGG Homo 
Hs.31476 Homo sapiens cDNA FLJ13872 fis, clone TH
Hs.154443 minichromosome maintenance deficient (S.
Hs.183161 ESTs 

hypothetical protein DKFZp762L0311 
Hs.75361 gene from NF2/meningioma region of 22q12 
Hs.159637 valyl-tRNA synthetase 2 
Hs.117305 Homo sapiens, clone IMAGE:3682908, mRNA

gb:PM3-BN0176-100400-001-g04 BN0176 Homo
Hs.54039 BRCA1 associated RING domain 1

Homo sapiens, clone MGC:12617, mRNA, com
Hs.143688 Homo sapiens cDNA: FLJ23031 fis, clone L
Hs.334612 small nuclear ribonucleoprotein polypept
Hs.95600 ESTs

C12000514*:gi|7302471|gb|AAF57553.1|(AE
Hs.848 FK506-binding protein 4 (59kD) 
Hs.159557 karyopherin alpha 2 (RAG cohort 1, impor 
Hs.120363 hypothetical protein MGC15634 
Hs.78853 uracil-DNA glycosylase 
Hs.246787 ESTs 
Hs.153797 ESTs 

Hs.14468 hypothetical protein MGC14226 
Hs.83484 SRY (sex determining region Y)-box 4 
Target Exon 

Hs.77152 minichromosome maintenance deficient (S. 






421002 AF116030 

438833 BE612940 

420333 AJ001383 

433844 AA610175 

427528 AU077143 

430289 AK001952 

421016 AA504583 

436251 BE515065 

418826 AK000375 

428612 AA770001 

433220 AI076192 

422225 BE245652 

437549 AA759149 

409299 AA045650 

408665 T88845 

408116 AA251393 

420062 AW411096 

432820 AI554057 

430255 AK000703 

420337 AW295840 

407275 AI364186 

416209 AA236775 

423675 AI990509 



Hs.97084 
Hs.179647
Hs.179565 
Hs.238039 
Hs.101047 
Hs.296585 
Hs.88820 



Hs.128757
Hs.53125
Hs.112200
Hs.289052 
Hs.94785 
Hs.152477 
Hs.323822 
Hs.14555 

Hs.79078 
Hs.131342 
Hs.247423 



Hs.221318 ESTs 



lymphocyte antigen 94 (mouse) homolog (a 
Homo sapiens cDNA FLJ12195 fis, clone MA
minichromosome maintenance deficient (S.
hypothetical protein FLJ11090
transcription factor 3 (E2A immunoglobul
nucleolar protein (KKE/D repeat)
HDCMC28P protein 
ESTs 
ESTs 

zinc finger protein 266
gb:ah70e03.s1 Soares_testis_NHT Homo
small nuclear ribonucleoprotein D2 polyp
ESTs, Weakly similar to ALU? HUMAN ALU
Homo sapiens, Similar to RIKEN cDNA 5430 
TGF(beta)-induced transcription factor 2 



Homo sapiens mRNA for KIAA1551 protein,
Homo sapiens cDNA: FLJ21513 fis, clone C
gb:qw34h07.x1 NCI CGAP Ut4 Homo sapiens 
MAI m h r ficient, yeast, h 
small inducible cytokine subfamily A (Cy 
adducin 2 (beta)



ein(Gpr 



400587 

407832 AW976516 

427159 1180735 
405770 

412722 AI343300 

414334 AA824298 

438192 AI859065 

417420 T85150 

421308 AA687322 

412851 AI826502 

414702 L22005 

409670 AI368109 

419926 AW900992 

417863 AB000450 

434750 BE019254 

410252 AW821182 

418574 N28754 

409019 AW385412 

416608 R11499 

436027 AI864053 

408161 AW952912 

422805 AA436989 

410284 U50939 

434274 AA628539 

430935 AW072916 

433252 AB040957 

415819 U77735 

437218 AL117497

407239 AA076350 

433947 AA927996 

424727 AW590378 

435703 AW630133 

420297 AI628272 

422192 AA305159 

407961 AW672939 

410193 AJ132592 

414151 AW976468 

434789 AW292515 
424196 AL133660
408831 AF090114 
414733 BE514535 
434523 AA703709 
409637 AA323948 

432141 BE410964 
409014 H83115 
410575 BE207480 
415071 AK002197 
418755 Y14443 
406137 

409893 AW247090 
421413 AI82612B 



NM_001436*:Homo sapiens fibrillarin (FBL
ESTs

C10000649-:gi|7296574|gb|AAF51857.1|(AE
Homo sapiens cDNA: FLJ21354 fis, clone C
PAX transcription activation domain inte



Hs.15091 ESTs

Hs.21331 hypothetical protein FLJ10036 

Hs.293807 Homo sapiens AFG3L1 isoform 1 mRNA, part

Hs.268814 ESTs 

Hs.192843 leucine zipper protein FKSG14

Hs.97269 ESTs 

Hs.76932 cell division cycle 34 

KIAA1856 protein 

Hs.93796 DKFZP586D2223 protein

vaccinia related kinase 2



Hs.82771 
Hs.4112 
Hs.61418 



Hs.189716 
Hs.39972 
Hs.300383 
Hs.121017 
Hs.61828 



myosin regulatory light chain 2, smooth



amyloid beta f 



Hs.58185 
Hs.67846 
Hs.112876 
Hs.152519



zinc finger protein 131 (clone pHZ-10) 
KIAA1524 protein 
pim-2 oncogene 

ESTs, Weakly similar to T42727 prolifera 
leukocyte immunoglobulin-like receptor, 
ESTs, Weakly similar to AF129535 1 F-box
hypothetical protein FLJ20674
GK003 protein

ESTs, Weakly similar to ALU1_HUMAN ALU S
fls485 

origin recognition complex, subunit 2 (y 
zinc finger protein 281 
ESTs 

ESTs, Weakly similar to T08680 hypotheti 
Homo sapiens beta cysteine string protei
endocrine regulator 

minichromosome maintenance deficient (S.



Hs.59757 

Hs.194317
Hs.142926 
Hs.48433 
Hs.77171 
Hs.23410 
Hs.55407 

Hs.272735 
Hs.49760 
Hs.6994 
Hs.284270 
Hs.88219 

Hs.57101 

Hs.55209 ESTs, Weakly similar to A49364 59 protei 



r in re gni r mplex ubuni j( 
Homi pi i Ui LJ22044I cl i H 
Homo sapiens cDNA FLJ11335 fis, clone PL
zinc finger protein 200
NM_000179*:Homo sapiens mutS (E. coli) h
■ icientlS. 
425955 NM_001761

431393 AW971493 

407162 N63855 

422382 D79988
402677 

433017 Y15067 

424677 U09414 

418883 BE387036 

424959 NM_005781 
402678 

408146 R45621 

420027 AF009746 

427447 T65414 

433219 AB040916 

431126 AF085243 

407136 T64896 

419669 AJ007041 

419594 AA013051



437108 AA434054 

408636 BE294925 

420005 AW271105 

412783 BE276738 

415319 AA659823 

425358 AL079658 

409617 BE003760 



Hs.24106 
Hs.283619 
Hs.287420 
Hs.92236 
Hs.91417 
Hs.168249
Hs.272838
Hs.80624
Hs.46680
Hs.133294
Hs.74578 



Hs.265827 
Hs.153880
Hs.344037 
Hs.97689 



unc119 (C. elegans) homolog

ESTs, Highly similar to cytokine recepto 
zinc finger protein 
KIAA0166 gene product 
NM_000478:Homo sapiens alkaline phosphat
zinc finger protein 232
zinc finger protein 137 (clone pHZ-30)
acid phosphatase 5, tartrate resistant 
activated p21cdc42Hs kinase 
Target Exon 

hypothetical protein MGC2718
ATP-binding cassette, sub-family D (ALD)
Homo sapiens cDNA FLJ13088 fis, clone NT
KIAA1483 protein
zinc finger protein 236
Homo sapiens cDNA FLJ11533 fis, clone HE
KIAA0304 gene product 



Homo sapiens mRNA; cDNA DKFZp434B104 (fr 

hypothetical protein FLJ10494

hypothetical protein MGC2560 

CGI-12 protein 

ESTs 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
Homo sapiens cDNA FLJ13485 fis, clone PL
FK506 binding protein 12-rapamycin assoc 
Homo sapiens mRNA; cDNA DKFZp434K0514 (f 
homolog 



polymerase (RNA) mitochondrial (DNA dire 
protein regulator of cytokinesis 1 
VASA protein 



CAT number: Gene cluster number



CAT Number Accession 



430676 
427521 



43651 '1 
422689 
421974 



4172_1

1000797_1
978554_1
60836_2

513212_1

596835_1
215430_1

132288_1
874209_1
864120_1



417886 
418235 
414725 



14739_1
101881_1
14066_1



BC022538 AI990847 BF478249 BG217996 BG212702 BG182057 AW589883 BF000085 AA993969 BG479023 BG220014 BG679466 BE907092

AI623855 AA223956 AA223917 AW022983 AW090580 AW573219 BF514491 BF445397 AA884705 AI910424

AW978773 AW298067 AA810101 AW194180 AA731645 AI69Q673

AA764852 AA736937

AA917062 AA757369 AW592218

BG433950 BE051583T0580B BE144813 AW812038 BE144B12 AW812040 AWB12041 AU124350 BE061602 BE061604 BF922595 BE061603 



AW973352 BF222929 AW01 6B53 BF059130 AI651829 BE551757 AA558414 AI339359 BF0596O1 AI961 1 52 AI341 422 AI20624B AI2051 65 
AA548736 AA788578 AI539081 AW025957 AA736837 N79575 AW594357 AA4B0892 
AW102570 AA90715O AA907120 M7371 B8 AI248890 AW977353 

BG576155 BM009591 AI479075AI025794 AI017967 AA448270 BE455812 AA853422 AI392649 BG952034AA513384 BF840124 BE714620 
AW969505A1553633 



AW954733 AA315006 AW856665 
AA301270 AA301379 AA301366 

AJ420453 AL526740 AW968449 M459140 AA843893 AI566516 AW971760M430089 AI753216 AA854268 AA743075 AI864957 M458920 
AI566634 AA211796 BG615512 BE169275 BF9B3253 BF969462 AA755261 AI769894 AA135833 AI831542 NB3376 AA214392 AU154486 
AW605017 AW450072AA446459 BEB81B75 AI061423 AA598549 AW439151 AA426273 Z40087 AA812434 AA135965 H04812 
AA210987 D57294 AA214584 M207006 D56572 
BE072634 BE072653 AA830615 AA214736 AA33171B 

NM_0581 66 AF220030 AL043894 AW974257 AA625445 AU153502 AI650537 AW6121 16 AI672377 AW772451 BE892241 BE50174O AA71 8936 
AI650276 AI654206 BE503226 AI651327 AW373562 AW271269 AW2715B5 AI873518 AI207150 A1338825 AI550258 AI628362 AA2271 17 AI207149 
AW052076AI470775M588100AW235852M'697D1AI701B53AK027664AI984770AU153469 BE222316AA609539BE220093AA609112 
BI05431 B 

AF147390 R76593 R76594

BI603077 AW954272 BI598724 AI003154 AA059300 AA046911 BI669907 BI600966 BI669987 

AK025201 AA425472 AI694282 BG057305 AA937787 AI2B6170 AI6B4577 AJ420494 AI809865 BFO58095 AI478773 Al 160445 AL0441 14 
AW665529 AI129239 AW297152 AI26821 5 A1469807 AI969353 BE552356 N66509 AA736741 AA382555 AW07581 1 AV75918B BI259364 
BF445142 BG232065 AI141758 AI531202 Al 1 67566 Al 208445 AA8B9823 BF982682 N90322 BI090882 BF208005 AW953918 AL0441 13 AI016793 
AA382556AW235763AA927051 AIB62075 BE686691 BE619282 
BF665746 D59356 BG67831 2 N56640AA1 66861 

AF0B0229AF080232 U87593UB7592 U875S1 U37590 AI636743 AI633818 AW206802 AI583718 AF080231 AF080234 AF080233 AL535594 
AI818326 AF080230 S46404 AI970376 AA463992 AW665466 BF512210 U87595 U87589 BE550533 AI672574 BE467547 AI680833 AW614951 
N299B6N25695H69001 U87596 BE673974 A1797496 AI701526 AA703396AW1 39734 H92278 N66048 BE219539 BE671665AI524817 BE466611 
AI206344 AA574397 BF593413 BG231 27 r rii 1 7 U87594 BF062180 BE466420 AI887798 BF674385 AA204735 AW496808 AA204833 
AA207155BII 
AA338252 AA338213
410704 1054673_1

430968 1237115_1

427260 11272_50

433979 2076469_1

426919 347372_1



437437 6087_1



430183 17316_1
430504 5477_6
428612 1383189_1



424677 2518_37



AA93371 7 BF061 B97 AW628327 AA641 78B AA400495 

AW958037 R42557 AI337047 AA948360 AI638005 AA459S50 AI62491 5 AI638047 AI467856 AI521826 AA860305 A1932315 AW003092 AW271756 
AW779380AA609879AI634791 AI493770 AI565211 241145 AI627952AA303734 BE349457AW19B765AA256527BE089727 
AJ31 4647 NM_052888 B1494693 AA E >34477AI .C678 AI807696 BF477887 AI701147Z39187 R38979 F02234 AA984711 BI222234 

AV731417 R42406 H04996 T98498 R12489 R12577 R42405 

BE999967 BF438599 AW864793 AI802899 BE815132 AW4688E8 AI672189 At052004 BF112024 AA772335 AW275054 AA573845 AI144148 

AI968683 AA846676 AA927355 H80424 AW973295 R88209 F29868 BE928871 

T65754AA22965BAA229857 

AL532360 BE794750 AA582906 AI01 5067 AW271 034 BG27 1636 AW0751 77 AW071 374 AI345565 AI307208 BE1 38953 BE049086 AI334881 
AW075006 AW075181 AA46401 9 AW302733 AW0751 00 AW073433 AI802854 AI334909 AI802853 AI345036 AI348921 AI340734 AI307478 
AI251289AW302327AW0725 It 12145 A M .r2513 AW071289 AI307559 AA8761 86 T29587 AI307493 AI255068 AI25286B AI252839 

AW074809 AI252926 AI252160 AI251 662 AS251 252 AI51 09 1 3 AI270787 AI2701 56 AI252075 AW073469 AW072901 AW072496 AW071 420 
AI305762AI254764AI80283^AI2 r £ 7 ! » 171311 AI34064C BE138965 BE138502 AW073456 AI334733AI054335 BE139260 
AI054302 AI054060 AI054057 AI053722 A128971 1 BE1 39228 AW470478 AW271039 AW302085 BE041 872 AI254494 AI271496 AI252427 
BF71 8773 BF71 8645 AW074866 BE857822 

BG1 10129 AW749287 BE535498 AW749299 AW749293 AW749302 AW749296 AW749291 AW749294 AW749289 AW749288 AW749296 

AA769294 AW749297 AW749295 AW749292 BE002573 

AI581311 AA781682 AA781678

AW877458 AW877524 BE076922 BE1 6691 2 AW840534 BE076754 AW797829 BE1 66905 BE1 66926 AW877462 BE1 66927 BE1 66932 AW877523 
BE166917 AW877529 BE166928 BF35 1 394 WV8 '7! 22 AW877528 BE166861 BE166866 BE166913 BE166919 AW877456 AW877537 BE076866 
AW840571 

AW972830 AA489820 AA527647 AA570362 
AA401424 M400100AA663848 
N50454 AA620999 T16375

BI91 7595 AI203314 AL041228 AV727959 D61361 D82004 B1753157 AA961066 AI990307 BF439651 A1453076 AI376075 AI014836 AI018308 
AW1 83530 AA393346 AA935601 AA628633 AI1 50282 A1028574 AI2171 82 AA431 478 AW087473 AW900295 H50055 AL041 229 BI917726 
AF076009 R63109 R63068
AW947655 AW984020

NM_006265 D38551 X98294 BM477931 BM461 566 AU1 23557 AU133303 AU134649 AW500421 BM172439 AW500587 AW503665 AW504355 
AW503640 BM1 52454 AW505260 AI81 5984 AW504075 AW50071 6 AL59731 0 BC001 229 BM474371 AA984202 AU135205 BE090841 AW1 63750 
BF747730 BF898637 AI206506 AV660B70 AV5921 1 0 AW3B6B30 AV656831 N64710 AW993470 BF086802 BF758454 BG960772 BF757769 
BI870853 BE018627 C75436 AW148744 BF757753 BG622067 BE909924 AA708208 BG530266 BF968015 AW992930 BF888862 BG536628 
M143164 AW748953 BG498922 BFB8519DE FB89005 BF7547B1 BF800003 BM476529 AI627668 AW0281 26 AL04601 1 BF59066B AI01 7447 
AA579936 AI367597 AA699622 BE2B0597 AI124620 AI08254B AW274985 AA677870 AI056767 BE551689 AA287642 H94499 AI752427 AI652365 
AW002374 AW062651 AA360834 N68822 AU135442 AU125960 Z78334 BE545813 AI092115 BF312771 BF242B59 BG533616 BG533761 
BG164745 BG492433 BM473183 AA172Q43 AA172069 AU157092 AU151353 AU155318 BE302211 AI375022 AA085641 AU157923 H88858 
M132730AA115113AA909781AI475255M424205AW5723B3AW084296AI184820AI46917BAA7B2432H921B4AA340562 BF195818 
AA852821 AW576342 AA827107 AA173317 AW190014 AI918514 AA729372 AA72971B Ai055958 AA331424 BE328601 AA515690 BI0188S6 
AW628277 M748368 AA625222 BG492636 AW3B0520 BFB00058 AW370955 M290909 R25857 BG952995 BF801437 AA1 72077 AU155890 
AU1 49783 AI720904 AA902936 AA865727 AI470830 AV740577 AA142982 AA4B24B5 AU145485 AW576399 AU156042 R63448 BF246427 
BE928472 D25910 BF758439 BF968785 BE565238 AA3559B1 AI905607 BG291148 BG533096 BG532888 BF030886 BG613756 BE928471 
BG574501 AA187596 M361196 T95557 BG531445 BG527242 BG527513 BG611106 AA085995 BF847252 BG024608 BE540261 BG531236 
AL579993 BG108733 BG483503 BG571032 BG492505 
M427363 AW9621 28 AA355353 

BC009352BC014630AU131857AL527140 AU13176B B17693B2 BI753220 AU129886 AU128771 AA314135AU126B19 AI333799 AA479336 
AA258503 AL597351 AL359619 BG59721B B12542B3 AI743B46 AA236444 AA397533 AA247450 AI051464 AI224533AU 153442 AU151001 
AU1 52621 AU1 51 829 AU1 53069 AW269958 AU1 541 95 AI862754 AI589780 AW273839 AI3381 55 Al 1 26632 BE046048 AA976930 AI289304 
AI625961AI222288AI280054 AA973329 AI524262 A1242371 AA296517 AI5678B5 AI590681 AJ34B516AW247913 AI422051 AI475352 AI689531 
AW469308 AW198034 AA936939 AU 151 059 AU 1481 34 AA486419 AU151953 AI830968 BI493265 BI493264 AU1 49861 BE268763 AV763495 
AW962827 BM480300 AA226869 AL529368 BM451 957 AU 132714 BI871 31 9 AA380739 BI91 1351 BF795906 BE548853 AW579751 
AK055745 AA039909 BE183282 W60721 AA464B67 AA39B986 T672B0 BF995651 AI675055 BG001051 BF7B4727 BF766707 BF764717 BF764852 
BF173139 BE010038

BE219720 BF475241 AI571723 BE219848 BI789268 A1224899 AA724864 AW771467 AA4B0255 AWB45616 AI440295 H52800 BE218790 AI681575 
AW300064 AW262133 H21568 AI363015 AI884914 H86948 
AA770001 AA431112 AA432126

AI625045 AW504152 AI469086 M905873 AW504662 AW1 361 14 AI927270 BE041 754 AI648386 AA652655 AA400052 AI143501 AI744934 
AI400147 AI381657 AA676551 AA974367 AW117437 AI570383 AI242456 AI2745B1 AA678138 R49939 AI393926AA345B54 AW605850 AI869780 
AW391171 R77044 

AW955043 AI990326 AA776406 AIO16250 AW451882 AA843678 BF916900 AW945895 AI979339 N23129 W70051 AA322672 N23137 
BM480413 N28908 H39792 BE240826 BE882093 BE240B27 AWB58637 BF739795 AA700834 AA769597 AA489668 AW968806 AW085196 
AI093280 AI218457 AA063138 AI632958 AW51 5005 AI570530 Z41 724 AA748789 AI596584 AA062544 AA773643 AA490285 
BC01 7923 AA789302 AW466994 BF513878 AI819642 AI184913 AW469044 AI220572 AW072916 AI280239 AI47361 1 AW841 126 D60937 
M489195 N59350 AA693435 BG531204 AA484243 AW514092 

U09414 NM_003438 AA503545 AI022449 AA04345B M766074 AA765442 AAB05052 AI02B21 1 AW609708 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

406547 7711513 Minus 172780-174358
402145 6018280 Plus 113086-114800
402199 8576116 Minus 84187-84744
401435 8217934 Minus
402408 9796239 Minus
406367 9256126 Minus
402680 8113438 Plus 137634-137768,139702-139393,140475-14059
402299 6693370 Plus 23367-25175
401091 9958240 Plus 94760-94898
404071
403242 
402260 
400587 
405770 
403532 
406137 
402677 
402678 



2735037 
8076842 
9166422 
8113438 
8113438 



170688-170834 
31237-31375,32. 
120993-121095,121660-121729 
18123-18766 
92349-92 
132079-132216 

167354-167859,168810-168920,169000-16910 
11297-12511 

113765-113910,115653-115765,116B08-11694 



61057-62075 
81750-81901 
30487-31058 
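
As a purely illustrative sketch, and not part of the original disclosure, rows of the form "Pkey Ref Strand position" as listed above could be read into a simple structure as follows. The function names `parse_exon_ranges` and `parse_row` are hypothetical. Pieces of the position column that are not two plain integers joined by a hyphen (for example, ranges cut off in the listing) are skipped rather than guessed.

```python
from typing import Dict, List, Tuple


def parse_exon_ranges(field: str) -> List[Tuple[int, int]]:
    """Parse 'start-end,start-end' into a list of (start, end) integer pairs."""
    ranges: List[Tuple[int, int]] = []
    for piece in field.split(","):
        piece = piece.strip()
        if not piece:
            continue
        start_text, sep, end_text = piece.partition("-")
        # Skip anything that is not two plain integers joined by a hyphen.
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            continue
        ranges.append((int(start_text), int(end_text)))
    return ranges


def parse_row(line: str) -> Dict[str, object]:
    """Split one table row into Pkey, Ref, Strand and predicted exon ranges."""
    parts = line.split()
    if len(parts) < 3:
        raise ValueError("expected at least Pkey, Ref and Strand")
    # The position column may be absent, as in some rows of the listing.
    field = parts[3] if len(parts) > 3 else ""
    return {
        "pkey": parts[0],
        "ref": parts[1],
        "strand": parts[2],
        "exons": parse_exon_ranges(field),
    }
```

The values are kept exactly as listed; no attempt is made to reorder or validate coordinates, since the listing itself is the only source for them.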
TABLE 52A:

Pkey: Unique Eos p
Exemplar Accession number,
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of testicular cancer (no

UnigeneID






414438 AI879277 Hs.76135
424247 Hs.234734
416680 AW245540
412948 BE243313 Hs.334851
438091 AW373062
406658 AI920965 Hs.77961
418174 Hs.83656 Rho GDP dissociation inhibitor (GDI e
409038 Hs.50002
413063 AL035737 Hs.75184
430542 AI557486 Hs.119122 ribosomal protein L13a
428928 BE409838
432730 AI066520 Hs.131358
444562 AA186715 Hs.336429
446525 AW967069 Hs.211556
417088 M54915 Hs.81170
418870 AF147204 Hs.89414
AI034361 Hs.135150
426295 AW367283
406856 AW515336 Hs.29797
417139 M69043 Hs.81328
440207 AI371978 Hs.128326
422578 AF239666 Hs.1545 caudal type homeo box transcription fact 26.95
432359 AA076049 Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 26.90
420367 AA259090 Hs.257028 ESTs 26.50
429978 AA249027 ribosomal protein S6 26.43
Z28925 HsJ188 sema domain, immunoglobulin domain (Ig), 26.36
423673 BE003054 Hs.1695 matrix metalloproteinase 12 (macrophage 26.23
412636 NM_004415 desmoplakin (DPI, DPII) 26.15
435538 AB011540 Hs.4930 low density lipoprotein receptor-related 25.25
446899 NM_005397 Hs.16426 podocalyxin-like 25.25
442562 BE379584 dolichyl-diphosphooligosaccharide-protei 25.15
406656 Hs.89643 major histocompatibility complex, class 25.13
446619 Hs.313 secreted phosphoprotein 1 (osteopontin, 24.78
423961 D13666 43 Hs.136348 periostin (OSF-2os) 24.48
425543 R23313 Hs.334895 ribosomal protein L10a 24.38
420676 AI434780 Hs.4248 vav 2 oncogene 24.18
406820 AI223958 Hs.108124 ribosomal protein S4, X-linked 23.96
440869 NM_014297 Hs.7486 protein expressed in thyroid 23.80
AL048753 Hs.303649 small inducible cytokine A2 (monocyte ch 23.56
414587 NM_004862 Hs.76507 LPS-induced TNF-alpha factor 23.22
446627 AI973016 Hs.15725 hypothetical protein SBBI48
449571 AW016812 Hs.200266 ESTs 22.83
413787 AI352558 tyrosine 3-monooxygenase/tryptophan 5-mo 22.81
410315 AI638871 Hs.17625 Homo sapiens cDNA: FLJ22524 fis, clone H 22.68
414092 Z14244 Hs.75752 cytochrome c oxidase subunit VIIb 22.45
422714 AB018335 Hs.119387 KIAA0792 gene product 22.45
439180 AI393742 Hs.199067 v-erb-b2 avian erythroblastic leukemia v 22.30
444784 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 21.59
406648 AA563730 Hs.277477 major histocompatibility complex, class 21.53
448588 AI970276 Hs.156905 KIAA1676 21.23
433423 BE407127 Hs.8997 heat shock 70kD protein 1A 21.19
429490 AI971131 Hs.23889 ESTs, Weakly similar to ALU7_HUMAN ALU 20.70
432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 20.60
407862 BE548267 Hs.337986 Homo sapiens cDNA FLJ10934 fis, clone OV 20.57
420754
425769 
424800 
412915 
452322 
410143 
420759 



452874 
408669 
421379 
426083 
429183 



U72513 

AL035588 

AW087727 

BE566343 

AA188159 

T11832 

AK001514 

AB007937 

AB029000 

AK001061 

AI493591 

Y15221 

AW962712 



450000 AI952797 



ribosomal protein L22
Human RP L1 3-2 pseudr
MyoD family inhibitor
NM_004541:Homo sapiens NADH de
glutaredoxin (thioltransferase)
KIAA1191 protein

Homo sapiens cDNA FLJ11381 fis, clone HE
hypothetical protein FLJ10652
KIAA0468 gene product
KIAA1077 protein
hypothetical protein FLJ10199
platelet/endothelial cell adhesion molec



415221 
429614 
418707 
412025 



AB014680 
AL021154 
X12830 
W07413 
AI371172 
U97502 
AI827451 
AA923278 
NM_014479 



Hs.126712 
Hs.197955
Hs.10888 

Hs.323822 
Hs.7252 
Hs.283713 
Hs.8786 
Hs.75884 
Hs.193400 
Hs.78225 
Hs.211539 
Hs.87497 
Hs.24143 
Hs.290905 
HS.14529E 
Hs.16530 



B(Cy 



ESTs, Weakly similar to AF191020 1 E2IG5
KIAA0704 protein
hypothetical protein FLJ21709
KIAA1265 protein

Homo sapiens mRNA for KIAA1551 protein,
KIAA1224 protein

ESTs, Weakly similar to S64054 hypotheti
carbohydrate (N-acetylglucosamine-6-O) s
inhibitor of DNA binding 3, dominant neg
interleukin 6 receptor

hypothetical protein MGC4248 
butyrophilin, subfamily 3, member A2 
Wiskott-Aldrich syndrome protein interac 
ESTs, Weakly similar to protease [H.sapi 
ADAM-like disintegrin protease, decysin 
small inducible cytokine subfamily A (Cy 
hypothetical protein AL110115
NM_004079:Homo sapiens cathepsin S (C 
ATP-dependantintj 




small inducible cytokine A4 (homologous
Homo sapiens cDNA: FLJ23523 fis, clone L
protein C receptor, endothelial (EPCR) 
A kinase (PRKA) anchor protein 2 






Hs.111680 endosulfine alpha



428227 
427820 
426552 
422241 



butyrate response factor 2 (EGF-response 
small inducible cytokine subfamily B (Cy 
inhibitor of DNA binding 2, dominant neg 



AW361666 
AW152225
AA382104 
AB023154 



447211 
417426 
414420 
444051 
454413 



AW968934 Hs.173108 

AL161961 Hs.17767 

NM_002291 Hs.82124

AA043424 Hs.76095 

N48373 Hs.10247 

AI653672 Hs.40092 

AI218918 Hs.30209 

AF081513 Hs.25195 

NM_005904 Hs.100602

AA070801 Hs.51615 

AI817130 Hs.9195 

AK001666 Hs.189095



ESTs, Weakly similar to I38022 hypothe 
lectin, galactoside-binding, soluble, 9
KIAA0937 protein 
A kinase (PRKA) anchor protein 1 
CGI-44 protein; sulfide dehydrogenase I 
actin related protein 2/3 complex, subun 
Homo sapiens cDNA: FLJ21897 fis, clon
KIAA1554 protein



TGF-beta4 

MAD (mothers against decapentaplegic, Dr
ESTs, Weakly similar to ALU7_HUMAN ALU S
Homo sapiens cDNA FLJ13698 fis, clone PL
similar to SALL1 (sal (Drosophila)-like
ESTs, Moderately similar to ALU8_HUMAN A






445055 
447534 
408822 



AW953935 
AW500715 
AI634046 
AW139342



Hs.288655 ESTs 



435398 H87136 



AA804789 

BE077546 

NM_003642

AW957744 

N64214 

AA866115 

AA911568 

W77762 



418299 AA279530 



445817 
408437 
435522 



Hs.57079 
Hs.157313 
Hs.155530 



Hs.31447 

Hs.13340 

Hs.278469 

Hs.9774 

Hs.127797 

Hs.279860 

Hs.79015 

Hs.155111 

Hs.83968 

Hs.2621 



412247 
413497 
436876 
432409 
453020 
419384 
410275 
432805 
416975 
450719 
423753 
418460 
402145 
407179 



AA133590

AF022375 

BE177S61 

AI124756

AA806538 

AL162039

AA490866 

U85658 

X94630 

NM_004131



Hs.107513 
Hs.250857 
Hs.73793 

Hs.5337 
Hs.130732 
Hs.31422 
Hs.39429 
Hs.61796 
Hs.3107 



Homo sapiens cDNA FLJ13267 fis, clone
ESTs 

interferon, gamma-inducible protein 16 

ribosomal protein S17 

PDZ-LIM protein mystique 

ESTs, Moderately similar to A46Q10 X-lin 



lacrimal proline rich protein 
synovial sarcoma translocation gene on c 
Homo sapiens cDNA FLJ11381 fis, clone HE 
tumor protein, translationally-controlle 
antigen identified by monoclonal antibod 
hypothetical protein FLJ14428 
integrin, beta 2 (antigen CD18 (p95), ly
cystatin A (stefin A)

integrin, alpha X (antigen CD11C (p150),
collagen, type XVII, alpha 1
Williams-Beuren syndrome chromosome regi
ninein (GSK3B interacting protein)
ESTs, Moderately similar to KIAA1058 pro
calcium/calmodulin-dependent protein kin



gb:RC1-HT0598-020300-011-h02 HT0598 Homo
isocitrate dehydrogenase 2 (NADP), mitoc
KIAA1575 protein 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 



AA206465 
AW002834 
AA775268 
AB011084 



Hs.24095 



413686 
406701 
424687 
407252 
445929 
451864 
429307 
434280 
447519 
417365 
418945 
406776 
437103 
449961 
441244 



AI469213 
AA780613 
J05070 



BE246762 

T16206 

AW139408

AW265534 

BE612935 

AK001838 

BE272922 

AF290512 



AA502917 
AA737415 
W05391 
AW452948 



.6127 
Hs.48924 
Hs.104520
Hs.252748
Hs.140996
Hs.23363
Hs.179902
Hs.153934
Hs.246885
Hs.1244
Hs.238954 
Hs.145199 
Hs.279930 
Hs.238730 
Hs.71404 

Hs.151738
Hs.163780
Hs.323401



CD97 antigen 
granzyme B (granzyme 2, cytotoxic T-lymp 
ESTs, Weakly similar to RB8B_HUMAN RAS-R
phosphoinositide-3-kinase, class 2, beta 
CD8 antigen, alpha polypeptide (p32) 
Target Exon 

ESTs 

Homo sapiens cDNA: FLJ23020 fis, clone L
KIAA0512 gene product; ALEX2
Homo sapiens cDNA FLJ13594 fis, clone PL
Homo sapiens cDNA FLJ20394 fis, clone KA



hypothetical protein FLJ10983



hypothetical protein FLJ20783
CD9 antigen (p24) 

ESTs, Weakly similar to KIAA1204 protein 



Hs.152940
Hs.133100
Hs.184052



Hs.58215 
Hs.211198 
Hs.103832 



transforming growth factor, beta recepto 
arachidonate 5-lipoxygenase
- similar to LDHH_HUMAN L-LAC 



Homo sapiens, Similar to rhotekin, clone



Hs.156974 ESTs



Hs.257531 
AW250089 Hs.75807 
AA738254 Hs.165390
W74048 Hs.1755
AA805534 Hs.300870
NM_006144 Hs.90708



ESTs, Highly similar to A40350 transcrip 
lymphocyte-specific protein tyrosine kin 
Homo sapiens mRNA; cDNA DKFZp547M072 (fr 
granzyme A (granzyme 1, cytotoxic T-lymp



LCHN protein
KIAA1361 protein
KIAA1628 protein

erythrocyte membrane protein band 7.2 (s
HT018 protein

; mRNA for KIAA1650 protein,
cytoplasmic polyadenylation element bind

s cDNA FLJ14934 fis, clone PL



Prader-Willi/Angelman syndrome-5 
nebulin 
pleckstrin 

solute carrier family 25 (mitochondrial 



hypothetical protein FLJ22995



420059 AF161486 

410730 AW368860 

431958 X63629 

417315 AI080042 



433156 
425246 
440268 
414821 
407254 



Hs.117077
Hs.181301 
Hs.94769 



transforming growth factor, beta-induced 



neutrophil cytosolic factor 2 (65kD, chr
platelet/endothelial cell adhesion molec
hypothetical protein FLJ20534
ESTs, Moderately similar to ALU5_HUMAN A

hypothetical protein MGC3260
protein tyrosine phosphatase, receptor I
DKFZp564J157 protein
hematopoietically expressed homeobox



m inwardly-rectifying channel, s
hypothetical protein MGC3032 
' " in regulated transporter-like 
nucleolar protein family A, member 3 (H/ 
ESTs, Weakly similar to I38022 hypotheti
cathepsin S

RAB23, member RAS oncogene family 
DnaJ (Hsp40) homolog, subfamily D, membe 
cadherin 3, type 1, P-cadherin (placenta



Hs.192425 ESTs 



450147 
442806 
431187 
449971 



Hs.77424 
Hs.181165 
Hs.171825 
Hs.95140 
Hs.152983
Hs.185980
Hs.195641 
Hs.31314 
Hs.13797 
Hs.146324 



gb:RC3-CT0297-290100-013-d03 CT0297 Homo 
Homo sapiens cDNA: FLJ22539 fis, clone H
serum response factor (c-fos serum respo 
Homo sapiens, clone IMAGE:3677185, mRNA 
Fc fragment of IgG, high affinity la, re 
eukaryotic translation elongation factor 
basic helix-loop-helix domain containing 
KIAA0189 gene product
HUS1 (S. pombe) checkpoint homolog 
ESTs 



BE264111 
AW967807 
AW373713 

AW294522 Hs.149991 ESTs



AA533447 
AI888490 
Y18024 



Hs.80887 
Hs.42491 
Hs.93304 
Hs.193482
Hs.312989 
Hs.55902 
Hs.78877 
Hs.289114 



ESTs 

Homo sapiens cDNA FLJ14296 fis, clone FL
v-yes-1 Yamaguchi sarcoma viral related 
ESTs, Weakly similar to S65657 alpha-1C- 
phospholipase A2, group VII (platelet-ac 
Homo sapiens cDNA FLJ11903 fis, clone HE
ESTs 



417228 
425593 
422616 



AW295840 Hs.14555 
Z42387 Hs.83883 
AA434054 Hs.80624 
Hs.7312 
Hs.1908 
Hs.118725



Homo sapiens cDNA: FLJ21513 fis, clone C
transmembrane, prostate androgen induced 
hypothetical protein MGC2560 



444933 NM_016245 Hs.12150 



gb:UI-HF-BR0p-akaf-12-0-Ul.r1NIH_MGC_5 
neutrophil cytosolic factor 4 (40kD) 
retinal short-chain dehydrogenase/reduct 

430592
445612 
427254 
428970 
425190 
430152 
421684 



AJ224864 
N94126 
AL121523 
BE276891 



Hs.155079
Hs.346348
Hs.106768



hypothetical protein FLJ10511



442832 AW206560 Hs.253569 ESTs 



AI184717 
428467 AK002121 
449924 W30681 
447674 BE270640 



hypothetical protein FLJ11259

Homo sapiens cDNA: FLJ22130 fis, clone H

cyclin-dependent kinase 2



412093 
407833 
411979 
437134 
430333 
408996 
425284 
441623 
442622 
441021 
446630 
417621 
433655 






BE242691 Hs.14947 

AW955632 Hs.66666 

X85134 Hs.72984 

AA349944 Hs.42915 

S70114 Hs.239489 

AI979168 Hs.344096 



AW150717 
BE246446 
AA731491 
AL036058 
AV660804 
106505 
N20S17 



440327 R12581 



427523 
407151 
448094 
421395 
431574 



D90084 

AW572559 

AW444433 



426215 AW963419 

435748 AA699756 

443351 AW016783 

447500 AI381900 

407949 W21874 

428728 NM_016625 

434511 R28982 

411852 AA528140 



Hs.334477 
Hs.76807 
Hs.301417 
Hs.182979 
Hs.194397 
Hs.8752 
HSJ149 
Hs.111577
Hs.191146
141162 Hs.53563
NM_001290 Hs.4980
W28673 Hs.106747
Hs.179526
Hs.301527
Hs.32061
Hs.1023
Hs.261373
Hs.136061
Hs.193580
Hs.21213
Hs.172199
Hs.155223
Hs.117335
Hs.30799 
Hs.159212 



ESTs, Weakly similar to S19560 proline-r
retinoblastoma-binding protein 5 
ARP2 (actin-related protein 2, yeast) ho 
TIA1 cytotoxic granule-associated RNA-bi 
glycoprotein (transmembrane) nmb 



I histone family, member 2 



ribosomal protein S23

STAT induced STAT inhibitor 3 

v. j r ct ng enzyme EMikc 

hypothetical protein MGC14879

major histocompatibility complex, class 

AHNAK nucleoprotein (desmoyokin) 

ribosomal protein L12

leptin receptor 

transmembrane protein 4 

Homo sapiens cDNA: FLJ21950 fis, clone H

integral membrane protein 3 



424875 
419378 
449523 



443303 
407013 
429999 



432666 AW204069 



AA355799 

BE302464 Hs.30057 

AI583207 Hs.99029 

AW997484 Hs.5003 
U24683 

AW298741 Hs.97861 



;n,type [X, alpha 3 



id by 1,25-dihydroxyvitamin D-3
ESTs, Moderately similar to unknown [H.s 
ESTs, Weakly similar to I38022 hypotheti 
' ' de) alpha 



Hs.191381 
Hs.18106 
Hs.107515 
Hs.199310 



AI187945 

R24922 Hs.90078 

NM_000579 Hs.54443

NM_001241 Hs.155478

NM_001674 Hs.460

U67319 Hs.9216 

U35637 Hs.83870 

AI761902 Hs.99597 
AI915771 

AW410235 Hs.75348 

NM_005908 Hs.115945

BE620712 Hs.33026 



Homo sapiens cDNA FLJ13471 fis, clo



ESTs, Weakly similar to T00329 hypotheti 
ESTs 

nucleotide-sugar transporter similar to 
chemokine (C-C motif) receptor 5
cyclin T2

activating transcription factor 3 
caspase 7, apoptosis-related cysteine pr 
gb:Human nebulin mRNA, partial cds 
ESTs 

metallothionein 1E (functional)



434159 AW135214 Hs.191828 ESTs 



hypothetical protein PP2447



ESTs, Weakly similar to unnamed protei
gb:aa55e05.r1 NCI_CGAP_GCB1 Ho
SEC22 vesicle trafficking protein (S. 



ESTs, Moderately similar to I38022 hypot



BE268912






7.75 




438962 


BE046594 




gb:hn41d 1.x1 NCI_CGAP_RDF2 Homo sapiens 


7.75 




434963 


AW974957 


Hs.288719 


Homo sapiens cDNA FLJ12142 fis, clone MA








AA641201 


Hs.222051 




7.73






AI341227






7.72 






AI638616 


Hs.196566 




7.71 








Hs.80712 


KIAA0202 protein 


7.67 




410300 


BE280421 


Hs.94499 




7.67 










sal (Drosophila)-like 2


7.67 








ESTs, Moderately similar to similar to N


7.65 










gb:ye53h05.s1 Soares fetal liver spleen 


7.65 






























AW015759 


Hs.235709


Homo sapiens mRNA; cDNA DKFZp667B0711 (f 


7.63 






AA811987












AA908472 




gb:og82a10.s1 NCI_CGAP_Ov8 Homo sapiens


7.62 








Hs.113660


Homo sapiens cDNA FLJ11631 fis, clone HE














7.61






Hs.119769














gb:yh88b01.s1 Soares placenta Nb2HP Homo 










Hs.44021 


Homo sapiens mRNA for FLJ00065 protein,










Hs.81988 




















X81889 






7.55






AW970026 






7.54 




429083 


Y09397 


Hs.227817




7.54










ESTs, Moderately similar to 2115357A TYK








AW579960 


Hs.135150




















BE244076 


Hs.159578


AT-hook transcription factor AKNA 










Hs.85201 










T49300 


Hs.35304 


Homo sapiens cDNA FLJ13655 fis, clone PL








AI360167






7.48 




AA830856 




Homo sapiens cDNA: FLJ21122 fis, clone C












ESTs, Weakly similar to ALU1_HUMAN ALU S


7.45 










Homo sapiens cDNA: FLJ22539 fis, clone H 








BE409442 






7.44
















AW972542 


Hs.289008


Homo sapiens cDNA: FLJ21814 fis, clone H


7.43






AF138863




hypothetical protein FLB6421 


7.43 












7.43 










ESTs, Weakly similar to I38022 hypotheti


7.43 
















AA093668 




muscleblind (Drosophila)-like 


7.39 








Hs.263478 




7.30 




















Hs.292566 


hypothetical protein FLJ14697 


7.36








Hs.221318








AW006303 




ESTs, Weakly similar to (defline not ava


7.35 








Hs.342307


hypothetical protein FLJ10330




















BE179882 






7.35
















BE148235 




Homo sapiens cDNA FLJ14201 fis, clone MT


7.35 






AV651680 


Hs.208558 




7.33 






















zinc finger protein 137 (clone pHZ-30)


7.30


























408418 


AW Q 638 Q 7 


Hs.44743 


KIAA1435 protein


7.28 












7.26 






AA528489 






7.25 






AW392551 




ESTs, Weakly similar to A56194 thromboxa


7.25 




AK001972 


Hs.30822 


hypothetical protein FLJ11110


7.25 






AW043782 


Hs.293616 




7.25 












7.25 




444670 




Hs.332938 




7.25 






AI922013 












Hs.288660


Homo sapiens cDNA: FLJ22182 fis, clone H
goliath protein 






434817 


AA082118 


Hs.102737


7.23




419970 


AW612022 




ESTs 


7.23 




432290 


AK001099 


Hs.274273 


Homo sapiens cDNA FLJ10237 fis, clone HE 


7.23 


pre-B-cell leukemia transcription factor
platelet-derived growth factor alpha pol
c-Myc target JP01
hypothetical protein FLJ12581

Homo sapiens cDNA FLJ11980 fis, clone HE

sorting nexin 10 

hypothetical protein 

ESTs 

EGF-containing fibulin-like extracellula

dual specificity phosphatase 12 

Homo sapiens cDNA: FLJ21425 fis, clone C
phospholipase C, beta 2
monokine induced by gamma interferon
Homo sapiens cDNA FLJ12807 fis, clone NT
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
POU domain, class 5, transcription facto
D-dopachrome tautomerase
fibrinogen, gamma polypeptide
cytochrome b-245, beta polypeptide (chro 
H-ccll associated protein 
ESTs
histone deacetylase 3
v-erb-b2 avian erythroblastic leukemia v
ESTs, Weakly similar to C55663 oligodend
MFH-amplified sequences with leucine-ric



WO 03/042661 PCT/US02/36810 
























Homo sapiens mRNA; cDNA DKFZp586F1822 (f
hypothetical protein FLJ10983
ribosomal protein L10
collagen, type V, alpha 2
hypothetical protein DKFZp762P2111
Homo sapiens cDNA FLJ14364 fis, clone HE
activin A receptor, type IIB
ribosomal protein L6
mitochondrial ribosomal protein L42
protein (peptidyl-prolyl cis/trans isome


409945 


AW015935 


Hs.122642


ESTs 


419641 


BE170548


Hs.118190
nuclear cap binding protein subunit 2, 2

TABLE 52C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402474 7547175 Minus 

402145 8018280 Plus 113086-114800 

401091 9958240 Plus 94760-94898 

401466 6682292 Plus 28748-29023 

406542 7711499 Plus 117335-118473 

401113 9966541 Minus 19419-19959 

405086 8072509 Plus 73664-73841,74081-74217,74610-74779,7492 



TABLE 53A: 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of testicular cancer (non-seminomatous and seminomatous) compared to normal adult tissues

Pkey ExAccn UnigeneID Unigene Title R1

432688 AW204069 ESTs, Weakly similar to unnamed protein 78.20

452838 U65011 Hs.30743 preferentially expressed antigen in mela 75.45 

418696 AW959433 Hs.326290 hypothetical protein FLJ12581 53.80

432730 AI066520 Hs.131358 ESTs 51.80 

450581 AF081513 Hs.25195 TGF-beta4 48.40
420367
451106 
437052 
417407 
420759 
420347 
407710 



AI204212 
AK001666 
AA259090 
BE382701 



AL033539 
AW022727 
AI968719 



Hs.257028 
Hs.25960 
Hs.120591 
Hs.290905 

Hs.97124
Hs.23616 
Hs.195387 



AF130728 Hs.98586 
Hs.159486 
Hs.105328
Hs.37062 
Hs.2248 



testis specific protein, Y-linked
ESTs 

to SALL1 (sal (Drosophila)-like 



430252 AI638774 

454077 AC005952 

428227 AA321649 

434649 AA738254 

424578 AK001973 

427335 AA448542 

438915 AA280174 

432938 T27013 

449322 AI638616 

430691 C14187 

430676 AF084866 

440119 AA865455 

418756 AA252254 

410102 AW248508 

447534 AW953935 

407122 H20276 

446979 AI654443 



456847 
448776 
452291 
408908 
418007 
422828 
433330 
410561 
427667 
418134 
454438 
449032 
426427 
437789 
419384 
418477 



Hs.150890 

Hs.251677 

Hs.285681 

Hs.3132 

Hs.196566 

Hs.157208 

Hs.125331 
Hs.226949 
Hs.279727 
Hs.288655 
Hs.31742 
Hs.197633 

Hs.180408 



ESTs, Weakly similar to protease [H.sapi 
Homo sapiens cDNA FLJ11381 fis, clone HE
Human DNA sequence from clone RP1-309H15 



hypothetical protein FLJ10339

doublesex and mab-3 related transcriptio 

Human RPL13-2 pseudogene mRNA, complete 

testes development-related NYD-SP20 

insulin-like 3 (Leydig cell)

small inducible cytokine subfamily B (Cy 

ESTs, Highly similar to A40350 transcrip

G antigen 7B 

Williams-Beuren syndrome chromosome regi 
steroidogenic acute regulatory protein 



ESTs; homologue of PEM-3 [Ciona savignyi



AF015592 Hs.28853 
BE296227 Hs.250822 
M13509 Hs.83169 
AL133396 
AW207084 



MRS2 (S. cerevisiae)-like, magnesium hom
CDC7 (cell division cycle 7, S. cerevisi
serine/threonine kinase 15 



AK001279 
AA397769 
AA224053
AA045573 
M86699 
AI581344 
AA490866 
AW022983 



Hs.132816 hypothetical protein MGC14801 
Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone H
Hs.180171 Homo sapiens cDNA FLJ10417 fis, clone NT
Hs.86617 ESTs
Hs.172405
Hs.22900 
Hs.169840 
Hs.127812 
Hs.39429 



AF053306 Hs.36708 

H65423 Hs.17631 

NM_016383 Hs.242183
AI188710 



ESTs, Weakly similar to T17330 hypotheti
ESTs 

gb:df46h12.y1 Morton Fetal Cochlea Homo 



AA991355 

U14622 

AF086270 



Hs.97464 
Hs.62661 
Hs.298312 



ESTs, Highly similar to S14458 laminin a
hypothetical protein
guanylate binding protein 1, interferon-
hypothetical protein DKFZp434A1315
gb:Hu 



424905 
444619 
434551 
421241 



BE387162 

X91317 

BE263732 



426866 U02330 



433800 
440207 
407276 



AI034361 
AI371978 
AI951118 



Hs.77695 
Hs.172816 
Hs.150587 
Hs.135150 
Hs.128326 
Hs.326736
Hs.24485 



Target Exon 

NIMA (never in mitosis gene a)-related k 

ESTs, Moderately similar to A46010 X-lin

ESTs, Highly similar to A35661 DNA excis

transketolase-like 1

KIAA0008 gene product 

neuregulin 1 

kinesin-like protein 2 

lung type-I cell membrane-associated gly

ESTs 



AW014631 Hs.225068 ESTs 



436812 AW298067 



VASA protein 
ECT2 protein (Epithelial cell transform: 
gb:UI-H-BW0-ajp-g-09-0-Ul.s1NCI_CGAP_Si 



AL137379 Hs.47125 hypothetical protein FLJ13912 

AW206560 Hs.253569 ESTs 

AA764852 ESTs 

AA442153 Hs.104744 hypothetical protein DKFZp434J0617

AA125985 Hs.56145 thymosin, beta, identified in neuroblast 

AI432364 Hs.160594 ESTs 

AA971953 Hs.122055 ESTs
427469
442618 
419423 



444971 
136613 
427486 
415857 






AA912476 
BE003054 
AA779975 
AA233056 



N36327 
AI291804 
AA403084 



AW513143 
AW600291 
AI651116 
AJ278110
AA974433 
AA866115 
AI954833 
AW196940 
AK001575 



transcription factor EC
brefeldin A-inhibited guanine nucleotide
Homo sapiens cDNA FLJ13221 fis, clone NT
matrix metalloproteinase 12 (macrophage 



439570 T79925 



413318 AU076607 



407340 AA810168 



425572 
410420 
453878 



down syndrome critical region protein DS 
Homo sapiens cDNA FLJ11S80 fis, clone HE 
KIAA0009 gene product 
gb:yx82b06.r1 Soares melanocyte 2NbHM Ho 



Hs.118101 ESTs



iar to 2109260A B cell 
ESTs 

KIAA0007 protein 

SRY (sex determining region Y)-box 17 (S 

hypothetical protein FLJ10430

ESTs 

DEAD-box protein 

fibroblast growth factor 4 (heparin secr
Homo sapiens cDNA FLJ11381 fis, clone HE
ESTs 
ESTs 

hypothetical protein FLJ10713 



AI581519 
AA534908 
Y00272 
AW962712 
AD011076 
AA224053 
AW964440 
AW182459
AW004683 
AA301270 
AK001403 
AW292317 
AA101325 Hs.86154 



ESTs 

Homo sapiens mRNA for KIAA1551 protein,
FGENESH predicted novel cell surface pr 
POU domain, class 5, transcription facto 
cell division cycle 2, G1 to S and G2 to 
ESTs, Weakly similar to AF191020 1 E2IG5 
undifferentiated embryonic cell transcr 
cell division cycle 27



DC32 

ESTs, Weakly similar to LEU5_HUMAN LEUKE 
ir i 3(E homolog 7 ( n ai i 
gb:EST14192 Testis tumor Homo sapiens cD 
hypothetical protein FLJ20530 






449433 
458570 
441287 
434609 
432239 
441425 
446293 
414136 



X81334 



422665 

428916 
408045 
448588 
433764 
439780 



Hs.2936 



AI500417 



AI420213 Hs.149722 
AA812434 

NM_014781 Hs.50421

NM_001809 Hs.1594 

AA356792 Hs.334824 

AA024898 Hs.157103 

AF263538 Hs.86232 

NM_001949 Hs.1189



LIM domain transcription factor LIM-1 (h 
SMC2 (structural maintenance of chromoso 
KIAA0203 gene product 
centromere protein A (17kD) 
hypothetical protein FLJ14825
ESTs 



E2F transcription factor 3 



Hs.46764 ESTs 



R53848 Hs.44976 

AW086185 Hs.223856 

NM_004849 Hs.11171

AJ011812 Hs.119018 
AA917062 

AF003001 Hs.194562 

AW138959 Hs.245123

AI970276 Hs.156905 

AW753676 Hs.39982 
AL109688 



telomeric repeat binding factor (NIMA-in 

ESTs 

KIAA1676 

zinc finger protein RINZF (NM_023929)
gb:Homo sapiens mRNA full length insert 






AI262106 Hs.12653 

AW965339 Hs.111471 

NM_002914 Hs.139226
AI263307 
U23752 



replication factor C (activator 1) 2 (40
H2B histone family, member L
SRY (sex determining region Y)-box 11
branched chain am' .......... 




442717 
433247 
430647 



412537 
426614 
457465 
430253 
440801 
453116 



424081 
441878 
429120 
418221 






Hs.81343 
Hs.301549 
Hs.180591 



Hs.301960 
Hs.122908 
Hs.236844 



AB040948 

AC003682 

AA214584 

Y00971 

AL031778 

AA411925

AW301344 

AK001514 

AA906366 

AI276680 

AA907120 



U29615 Hs.91093 

NM_000102 Hs.1363

AA908678 Hs.130183 

X99226 Hs.284153 

NM_003686 Hs.47504 

AI005668 Hs.130673

AK001239 Hs.274263

Z15005 Hs.75573

NM_006413 Hs.139120

AI801869 Hs.127982 

AK001673 Hs.196530 



245514 
AK001376 
AA781795 
AW006303 
AB023191 



Hs.83775
Hs.59346
Hs.122587
Hs.329296



AA033813 Hs.79018 



407300 AA102616 



AI476318 
AB037783 
BE045641 
AA076049 
AW8S1294 



412642 BE244598 
438450 AI050866 
428301 AW628666 
408750 BE294069 
415947 U04045 
408460 AA054726 
AW062564 
AW876523 



428249 
419635 
402145 
447178 
458814 
442980 

450254 
441627 
440304 



NM_005033

AW594641 
AI498957 
AA857025 
AA406293 



gb:Homo sapiens cig33 mRNA, partial sequ
hypothetical protein MGC10764
collagen, type II, alpha 1 (primary oste
Homo sapiens mRNA; cDNA DKFZp554H172 (fr 
ESTs, Weakly similar to T23976 hypotheti 
KIAA1515 protein 

ESTs, Weakly similar to Z211_HUMAN ZINC



Fanconi anemia, complementation group A 



hypothetical protein FLJ10377
centromere protein E (312kD)
ribonuclease P (30kD)
ESTs 

hypothetical protein FLJ10811
DiGeorge syndrome gene D
hypothetical protein FLJ10514



ESTs 

testis-specific ankyrin motif containing
small nuclear RNA activating complex, po 
hypothetical protein MGC13204 
chromatin assembly factor 1, subunit A ( 
gb:zn43e07.s1 Stratagene HeLa cell s3 93 



Hs.130391 ESTs



Hs.192729 ESTs 



on factor C (activator 1)4(37 



Hs.192480 

Hs.170623 

Hs.137573

Hs.274415

Hs.132133

Hs.137078 

Hs.280728 

Hs.809 

Hs.65853 

Hs.98440 



hypothetical protein FLJ11183



413523 AA825721 



Hs.192417 

Hs.170861 

Hs.8878 

Hs.109526

Hs.99231 

Hs.58086 

Hs.125395

Hs.344043 

Hs.129302 

Hs.36820 

Hs.225841 

Hs.246973 



ESTs, Weakly similar to I38022 hypotheti 

hypothetical protein FLJ10512
mutS (E. coli) homolog 2 (colon cancer, 
ESTs 

Homo sapiens cDNA: FLJ22135 fis, clone H
hypothetical protein FLJ12910 
zinc finger protein 268 
polymyositis/scleroderma autoantigen 1 ( 
Target Exon 
ESTs 

ESTs, Weakly similar to Z195_HUMAN ZINC



neuropeptide G protein-coupled receptor; 



Homo sapiens cDNA FLJ14459 fis, clone HE



DKFZP434D193 protein 






Hs.97414 ESTs 



450113 
418678 
437812 
431354 



440842 
432407 
401837 



U03272 

AA464510 

AI571514 

U29700 

AF251237 

AL045633 

AI283085 

AI004650 

AA648459 

H56777 

AW976463 

AI023707 

H42169 

AW445023 

NM_003428 

AA907288 

AA221036 

AA398155 
AW614850 
AW003582 



418235 
427961 
441553 
429999 
426496 
410929 
448757 



440138 
407568 
409798 
431215 
416350 



Hs.200866 
Hs.167379
Hs.16846
Hs.251673
Hs.195718
Hs.12830
Hs.79432
Hs.152812
Hs.133022
Hs.123014
Hs.112208
Hs.44269 
Hs.290931 
Hs.225674 
Hs.335951 
Hs.121084 
Hs.257245 
Hs.134273



AL036195 
AW273055 
AA301116 
AB033064 



445413 
449670 
436211 
429629 
424235 
448038 
430272 



AA151342 

F07693 

AK001581 

BE501732 

NM_003181



AF129535 
AB033036 
AI817226 
BE218919 
AL036710 
AW970529 
BE072634 
AW293165 
AA281219 



AA418246 

H91230 

AB033023 

AA740964 

AA248587 

AA496078 

AF188625 



403730 
418378 
418894 



W73921 

AA382826 

D13305 

X72755 

AA037675



U39277 

AA052980 

AA452378 

AA634543 

AA009547

AA467752 

AL137385



ESTs, Moderately similar to ALU7_HUMAN A 
cancer/testis antigen (NY-ESO-1) 
ESTs, Weakly similar to 04HUD1 debrisoqu 
DNA (cytosine-5-)-methyltransferase 3 be 



fibrillin 2 (congenital contractural ar



ESTs, Weakly similar to YFJ7_YEAST HYPOT 
hypothetical protein AF301222 



hypothetical protein FLJ14627



zinc finger protein 84 (HPF2) 



Hs.15155 ESTs



Hs.97600 
Hs.193384 
Hs.226414 
Hs.130435 ESTs 



Hs.93453 
Hs.272525 
Hs.2909 
Hs.156593
Hs.142838
Hs.236463 
Hs.263478 ESTs 



ESTs, Moderately similar to R27328 2 [H.



Hs.334828 
Hs.30622 
Hs.143507 
Hs.232026 
Hs.237658 
Hs.272027 
Hs.97594 
Hs.313413 
Hs.118793
Hs.209527 
Hs.86434 

Hs.143134 
Hs.121296 
Hs.99597 
Hs.170114 
Hs.30643 
Hs.48820 
Hs.185796
Hs.234794 
Hs.318127 
Hs.62699 
Hs.30237 
Hs.121554 



cytoplasmic polyadenylation element bind 
CGI-147 protein 

Homo sapiens mRNA; cDNA DKFZp434K2172 (f 
hypothetical protein FLJ10719; KIAA1794
Homo sapiens cDNA FLJ13010 fis, clone NT
T brachyury (mouse) homolog
ESTs, Weakly similar to RO52_HUMAN 52 KD



F-box only protein 5
KIAA1210 protein

ESTs, Weakly similar to T23110 hypotheti



Hs.232048 ESTs 



TATA box binding protein (TBP)-associate
ESTs, Weakly similar to Z184_HUMAN ZINC
Homo sapiens mRNA; cDNA DKFZp564B083 (fr 
hypothetical protein FLJ10201 
ESTs 

ESTs, Weakly similar to ALUB.HUMAN I.!. 
Human DNA sequence from clone RP11-21BC1 
phospholipase A2, group IID



lymphocyte antigen 94 (mouse) homolog (a
C4001759;gi|133250|sp|P19474|RO52_HUMAN 
gb:EST374154 MAGE resequences, MAGG Homo 



Hs.132793 
Hs.203 
Hs.77367 
Hs.152675 



Target Exon 

L-myc-2 protein (MYCL2) 

early development regulator 1 (homolog o



ns mRNA; cDNA DKFZp434M1126 (f 







423198 
424687 
418971 
411571 
409517 



442562 
412530 
443715 
423123 
451105 
444431 
440591 
424281 
447175 
408101 
430183 
416445 
429652 
426054 
418618 
445537 
427298 
412863 






437915 
421830 
440006 
450719 
431721 



414725 
424153 
414466 






AA360392 

AA122393 

X90780 

AL157491 

AI038687 



Hs.1634 
Hs.151738 
Hs.87113 
Hs.70811 

Hs.145211 
Hs.133338 
Hs.123468



AW513324 
AA431599 
AA766243 
AI365208 
AW968504 
BE010038 
AL043004 
AA766810 
U12431 

AJ245671 
AA400495 
AA121673
AW205257 
U24577 
AA766998 



Hs.59757 
Hs.156326
Hs.93304
Hs.79126
NM_000246 Hs.3076



433183 
439314 
425312 
427584 
430444 
416773 
421010 
418216 
450351 
454073 
417006 
417576 
448877 
421379 



442010 
452807 
401435 
447519 
421307 



AW235341 
AI637993 
AA789269 
AK000517 



W27595 

AA036729

AA236776 

BE065183 

AL037824 

AI653154 

AA769791 

AA451737 

AA349211

AW372914 

AV647010 

AI188161

AF080229 

N72847 



H81213 

AI791883

AB028943 

AA934063 

AW380579 

AF231338 

AA382413 

AA354940 



Hs.53715 
Hs.202312 
Hs.122509
Hs.6344 
Hs.21349 
Hs.268044 
Hs.347310 
Hs.335639 
Hs.79078 



Hs.141496 
Hs.76205 
Hs.86149 



Y15221 

U42349 

AK000634 

AA412323 

T39310 

AA625304 

AI032680 



Hs.71119 
Hs.238270 
Hs.105323



hypothetical protein FLJ20516 
troponin I, cardiac 

Homo sapiens mRNA; cDNA DKFZp434K1111 (f



hypothetical protein FLJ13346
cyclin E1 

SELENOPHOSPHATE SYNTHETASE; Human selen
gb:wi60b1 1.x1 NCI_CGAP_Co1 5 Homo sapiens
Homo sapiens, clone MGC:9010, mRNA, comp
hypothetical protein FLJ23451
gb:oa13b11.s1 NCI_CGAP_GCB1 Homo sapiens



Hs.293606 ESTs



CDC2-related protein kinase 7 
gb:PM3-BN0176-100400-001-g04 BN0175 Homo 
KIAA0135 protein
ESTs 

ELAV (embryonic lethal, abnormal vision,
GTP cyclohydrolase 1 (dopa-responsive dy
EGF-like-domain, multiple 6
ESTs 

zinc finger protein 281 

Human DNA sequence from clone RP11-145L2
phospholipase A2, group VII (platelet-ac
Human DNA sequence from clone RP11-16L21
MHC class II transactivator 



Homo sapiens clone N11 NTers2D1 leralrxa 
ESTs, Weakly similar to dJ1018D12.3 [H.s
NALP2 protein; PYRIN-Containing APAF1-H
ESTs, Weakly similar to RB8B_HUMAN RAS-R 
KIAA1170 protein 
hypothetical protein FLJ14627



ring finger protein 21, interferon-respo
MAGE-like 2

cytochrome P450, subfamily XIA (choleste
phosphoinositol 3-phosphate-binding prot
glycine dehydrogenase (decarboxylating; 



ESTs 



Hs.292719 
Hs.109445 
Hs.13836
Hs.209657
Hs.222024 
Hs.178144 
Hs.145958 
Hs.179718 
Hs.121035 
Hs.79828 
Hs.267124 
Hs.283099 
Hs.59791 
Hs.116727 



zinc finger protein 75 (D8C6)
nuclear cap binding protein subu

ESTs, Weakly similar to KIAA1503 protein 

ESTs 

ESTs, Weakly similar to I38022 hypotheti 
ESTs 

transcription factor BMAL2 



hypothetical protein FLJ20333

ESTs, Weakly similar to ALU6_HUMAN ALU S

AF15q14 protein 

hypothetical protein MGC13183 



small inducible cytokine subfamily B (Cy 
Putative prostate cancer tumor suppresso 
hypothetical protein FLJ20627 
ESTs 

gb:ya04a09.r2 Stratagene lung (937210) H



' 14000397 j| 499898|| ||! 295 hypotli 



AW004045 Hs.203365 






431126 


AF085243 


Hs.283619 


zinc finger protein 236 


4.20 


429628 


H09604 


Hs.13268 


ESTs 


4.20 


415989 


AI267700 




ESTs 


4.20 


421373 


AA808229 


Hs.46677 


ESTs 


4.20 


433979 


AA620999 




I r o08.g1 "jit g g NH1 Home [ 


4.20 


408321 


AW405882 


Hs.44205 




4.19 


410193 


AJ132592 


Hs.59757 


zinc finger protein 281 


4.17 


430335 


D80007 


Hs.239499 


KIAA0185 protein


4.17 




AA081395 


Hs.42173 


Homo sapiens cDNA FLJ10366 fis, clone NT


4.16 


438885 


AI886558


Hs.184987


ESTs 


4.15 


451578 


NM_016323


Hs.26663 


cyclin-E binding protein 1 


4.15 


432446 


AA542845


Hs.294088 


GAJ protein 


4.13 


445076 


AI206888 


Hs.154131 


ESTs 


4.11 


420218 


AW958037 




ribosomal protein L4 


4.10 


453628 


AW243307 


Hs.83937 


hypothetical protein 




418459 


R85436 


Hs.268814 






418866 


T65754 




gb:yd 1c07.s1 Stratagene lung (937210) H 


4.08 


440404 


AI015881 


Hs.324527 


mitochondrial ribosomal protein S5 


4.06 


426300 


U15979 


Hs.169228


delta-like homolog (Drosophila) 


4.06 


446223 


BE300091 


Hs.119699


hypothetical protein FLJ12969


4.05 




AL050102 


Hs.227209 


hypothetical protein FLJ21617 


4.05 


449687 


W68520 




intermediate filament protein syncoilin


4.05 


452109 


AI525873 


Hs.61164 


hypothetical protein FLJ14909
histone deacetylase 5 


4.05 
4.05


444670 


H58373 


Hs.332938 


hypothetical protein MGC5370


4.05 


415884 


H22966 


Hs.13471


ESTs 


4.05 


442066 
402098 
404287 


BE502147 


Hs.128418 


ESTs 

ENSP00000217725*:Laminin alpha-1 chain p
FGENESH predicted novel CUB-domain conta 


4.04 
4.01 


422756 


AA441787 


Hs.119689 


glycoprotein hormones, alpha polypeptide 


4.01 


449704 


AK000733 


Hs.23900 


GTPase activating protein 


4.00 


445685 


AW779829 




gb:hn88a05.x1 NCI_CGAP_Kid11 Homo sapien


4.00 


444379 


N99035 


Hs.30352 




4.00 


435373 


AW665538 


Hs.117689


ESTs 


4.00 


424557 
413646 


AA343057 
BE155042


Hs.164588 


i 31 M V mil c n ironal 1 
gb:PM0-HT0349-101299-002-E04 HT0349 Homo 


4.00 
4.00 


413648 


AW979223 


Hs.292478 


ESTs 


4.00 


445074 


AA079799 


Hs.343103 


hypothetical protein FLJ11896


4.00 


447353 


AI375701 


Hs.25884 


ESTs 


4.00 


410100 


AA081636


Hs.271916 


ESTs, Weakly similar to S41044 chromosom 


4.00 


428856 


AA436735 




hypothetical protein FLJ22002


4.00 


445140 
406367 
437834 


AI650599 
AA769294


Hs.197913


i i r p j ti 

NM_022357:Homo sapiens putative metallop
gb:nz36g03.s1 NCI_CGAP_GCB1 Homo sapiens


4.00 
3.99 
3.99 


453985 


N44545 


Hs.251865 


ESTs 


3.96 


408446 


AW450669 


Hs.45068 


hypothetical protein DKFZp434H43


3.97 


408562 


AI436323 


Hs.31141 


roundabout (axon guidance receptor, Dros 


3.97 


414713 


BE465243 


Hs.12664


ESTs 


3.96 


426067 


AW664691 


Hs.97053 


ESTs 


3.96 


456497 


AW967956 


Hs.123648


ESTs, Weakly similar to AF108460 1 ubinu


3.96 


454679 


AW813110 




gb:CM4-ST0189-051099-021-f05 ST0189 Homo 


3.95 


451865 
403137 


H43737 


Hs.33186 


NM_005381*:Homo sapiens nucleolin (NCL),


3.95 
3.S6 


445730 


AI624342 


Hs.179082 


ESTs 


3.95 


451993 


AA765776 


Hs.122983 


ESTs 


3.95 


428819 


AL135623 


Hs.193914 


KIAA0575 gene product 


3.92 


433683 


AI817723 


Hs.22678 


hypothetical protein FLJ21832


3.91 


420812 
423806 


AA715303 
AA331247 


Hs.107369 
Hs.86617 


ESTs 
ESTs 


3.90 
3.90 


437205 


AL110232 


Hs.279243 


Homo sapiens mRNA; cDNA DKFZp554D2071 (f 


3.90 


449211 


AI922972 


Hs.196073 


ESTs 


3.90 


409757 


NM_001898


Hs.123114


cystatin SN 


3.90 




AI864053 


Hs.39972 


ESTs, Weakly similar to 138588 reverse 1


3.89 


432512 


NM_003284


Hs.3017 




3.89 


440840 


AW629666 




ESTs, Weakly similar to SS4D54 hypotheti


3.88 


449099 






ESTs 


3.88 


408092 


NM_007057


Hs.42650




3.85 


423909 


AJ223183 


Hs.135194 


immunoglobulin superfamily, member 6 


3.85 


437162 


AW005505 


Hs.5464 


thyroid hormone receptor coactivating pr 


3.84 


424381 


AA285249 


Hs.146329 


protein kinase Chk2 (CHEK2)


3.83


433023 


AW864793 




thrombospondin 1 


3.82




W31518 


Hs.34665 


ESTs 


3.81


421413 


AI826128 


Hs.55209 


ESTs, Weakly similar to A49364 59 protei


3.80 


440953 
420697 


AI683036 
AA827705 


Hs.124135
Hs.26605 


Homo sapiens cDNA FLJ13051 fis, clone NT 
ESTs 


3.80 
3.80 


407275 


AI364186 




gb:qw34h07.x1 NCI_CGAP_Ut4 Homo sapiens


3.80 


422789


AK001113 


Hs.120842


hypothetical protein FLJ10251


3.80 


411856 


H67899 


Hs.4190 


Homo sapiens cDNA FLJ23269 fis, clone C


3.80 


449529 


AI990559 


Hs.232033 


ESTs 


3.80 




AK000318 


Hs.18616


hypothetical protein FLJ20311


3.78 


444656 


AI277924 


Hs.145199 


ESTs 


3.77 


448674 


W31178 


Hs.154140 


ovary-specific acidic protein 


3.77 


415829 


AW450198 


Hs.163742 


ESTs 


3.76 









AK001049 
X51630 



437440 
427366 
438456 

417918 
415912 
423020 



452631 
427719 
431869 
429830 
420297 
421972 
403433 



AA885108 Hs.223806 



ESTs, Weakly similar to A46010 X-linked
ESTs

TATA box binding protein (TBP)-associate



AA383092 

AL137163 

AW276887 

AA412205 

NM_016625

AI188658 

AI393122 

AA521136 

AI537278 

AI628272 

M18185 



hypothetical protein FLJ12606
ESTs, Weakly similar to ALU6_HUMAN ALU S
replication protein A3 (14kD)
hypothetical protein dJ473B4 



Hs.1608 
Hs.57549 
Hs.46609 
Hs.140996 ESTs 
Hs.191381 hypothetical pro



Hs.87496 
Hs.134726 
Hs.190176 
Hs.225841 
Hs.88323 



419926 AW900992 

420161 AI683069 

414747 U30872 

449571 AW016812 

424727 AW590378 

441820 AA969119 



Hs.152519 

AA969119 Hs.143502 

BE350494 Hs.49753 

AA766060 Hs.301209

AW138970 Hs.122113 

AW207538 Hs.61603 

AB002350 Hs.17262

X97548 Hs.228059 
AA211467 

AW189075 Hs.116265 

AI446444 Hs.190394

AW301608 Hs.278188 



429782 
411027 
412140 
429183 



451065 
419741 
406542 
422406 



NM_005754 Hs.220689

AF072099 Hs.67846 

AA219691 Hs.73625 

AB014604 Hs.197955 

AA436884 Hs.48926 

X54942 Hs.83758 

H16262 Hs.31415 

H93366 Hs.7567 

AW295132 Hs.222231 

NM_007019 Hs.93002

AF025441 Hs.116206 

AK000375 Hs.88820 



400268 
443596 
442875 



ESTs 
ESTs 

DKFZP434D193 protein
E I We ,l n to ALU UMAN U 3 
gastric inhibitory polypeptide
NM_001622:Homo sapiens alpha-2-HS-glycop
KIAA1553 protein
NM_030920*:B 



Hs.189716 ESTs 



408065 AW954272 

431077 AI669133 Hs.115660 

452461 N78223 Hs.108106 



Hs.98806 
Hs.93796 
Hs.120817 
Hs.77204
iin F (350/400kD, mi



ESTs 

hypothetical protein FLJ20674
ESTs, Weakly similar to envelope protein
uveal autoantigen with coiled coil domai
hypothetical protein DKFZp434J0617
myeloid/lymphoid or mixed-lineage leukem 



Hs.197042 ESTs 



ESTs 
KIAA1677 

KIAA0352 gene product 

KRAB-associated protein 1

Homo sapiens, Similar to nuclear localiz 

fibrillin 3

ESTs, Weakly similar to B28096 line-1 pr 
ESTs, Moderately similar to I54374 gene
Homo sapiens cDNA FLJ12981 fis, clone NT
hypothetical protein FLJ10881
gb:EST384925 MAGE resequences, MAGL Homo 



Target Exon 

Ras-GTPase-activating protein SH3-domain
leukocyte immunoglobulin-like receptor,
RAB6 interacting, kinesin-like (rabkines
KIAA0704 protein 
ESTs 

CDC28 protein kinase 2 



AW026048 Hs.134124 

BE623003 Hs.23625 

T30290 Hs.107515 

N77221 Hs.187824

AB011130 Hs.127436 

AI368236 Hs.283732 

AI950339 Hs.40782 

NM_004477 Hs.203772

N66037 Hs.38173 

AW450737 Hs.128791 



\H binding protein I 
transcriptional intermediary factor 1 
NM_003292:Homo sapiens I 



FSHD region gene 1 
ESTs 

CGI-09 protein
Hs.175220


hypothetical protein FLJ14541
410252


AW821182 


Hs.61418 




3.41 


404068 






Target Exon 


3.40 


401644 








3.40 


422364 


AF067800 


Hs.115515 


C-type (calcium dependent, carbohydrate- 


3.40


452907 


BE256966 


Hs.31652 


ESTs, Moderately similar to I54374 gene


3.40 


420281 


AI623693 


Hs.323494 


Predicted cation efflux pump 


3.39 


452404 


AW450675 


Hs.212709 


ESTs 


3.39 


452256 


AK000933 


Hs.28661 


Homo sapiens cDNA FLJ10071 fis, clone HE 


3.39 




AW975076 


Hs.172589


nuclear phosphoprotein similar to S. cer 


3.38


440606 


AI828751 




ESTs, Weakly similar to I38022 hypotheti 


3.38 


425474 


Z48054 


Hs.158084


peroxisome receptor 1 


3.37 


429714 


BE561801 


Hs.2484 


T-cell leukemia/lymphoma 1A 


3.37 


446214 


AK001322 


Hs.14347 


hypothetical protein FLJ10460 


3.36 


434808 


AF155108 


Hs.256150 


Homo sapiens, Similar to RIKEN cDNA 2810


3.36 


448789 


BE539108 


Hs.22051 


hypothetical protein MGC15548


3.36 


421633 


AF121860 


Hs.106260 


sorting nexin 10 


3.36


438192 


AI859065 


Hs.293807 


Homo sapiens AFG3L1 isoform 1 mRNA, part 


3.36 


436511 


AA721252 


Hs.291502 


ESTs 


3.35 








Target Exon 


3.35 


414598 


AI094221 


Hs.135150 


lung type-I cell membrane-associated gly




449477 


AI652602 


Hs.197043 


ESTs 


3.36 


413686 




Hs.71404 


ESTs 


3.35 


401091 






decay accelerating factor for complement 


3.35 


418295 


AW970043 


Hs.238039 


hypothetical protein FLJ11090 


3.35 


433220 


AI076192 


Hs.131933 


ESTs 


3.34 


453200 


AA033832 


Hs.212433 


ESTs 


3.33 


427239 


BE270447 




ubiquitin carrier protein 


3.33 


418355 


L42563 


Hs.1165 


ATPase, H+ transporting, nongastric, alp


3.33 


421535 


AB002359 


Hs.105478 




3.31 


441243 


AI767056 


Hs.193002 


ESTs 


3.30 


440716 


AW105245 


Hs.146509 


ESTs 


3.30 


400587 






C10000649':gi|7296574|gb|AAF51857.1|(AE 


3.30 


401148 






Target Exon


3.30 


411752 




Hs.126497 


ESTs 


3.30 


433252 


AB040957 


Hs.151343 


KIAA1524 protein
ESTs 


3.30 


434008 


AA740878 


Hs.112982


3.30 


414665


BE613126 


Hs.47783 




3.30 


458067 


AA393603 


Hs.36752 


protein kinase anchoring protein GKAP42


3.30 


410340 


AW182833 


Hs.112188


hypothetical protein FLJ13149


3.29 




BE244742 


Hs.30532 


CGI-77 protein


3.29 


451418 


BE387790 


Hs.26369 


hypothetical protein FLJ20287


3.29 




NMJM649 




apical protein, Xenopus laevis-like






AA565509 


Hs.131703 


ESTs 


3.27 


449426 


T92251 


Hs.198882 


ESTs 


3.27 


425174 




Hs.154978




3.25 


435159 


AA668879 


Hs.116649


ESTs 


3.25 


446597 




Hs.15470


putative ring zinc finger protein NY-REN 


3.25 


411554 


W22895 


Hs.112360


prominin (mouse)-like 1 


3.25 




AI391662 


Hs.160963 


Homo sapiens, clone MGC:12318, mRNA, com 


3.25 


426931 


NM_003416


Hs.2076 


zinc finger protein 7 (KOX 4, clone HF.1 


3.25 




AI207197 




ESTs 




410275 


U85658 


Hs.61796 


transcription factor AP-2 gamma (activat 


3.24 


424568 


AF005418 




cytochrome P450, subfamily XXVIA, polype 


3.24 


453293 


AA382267 


Hs.10653 


ESTs 


3.24 


421654 


AW163267 


Hs.106469 


suppressor of var1 (S. cerevisiae) 3-like




430552 


AA176374 


Hs.243886 


nuclear autoantigenic sperm protein (his


3.22


411975 


AI916058 


Hs.144583


ESTs 


3.22 


448140 


AF146761


Hs.20450 


BCM-like membrane protein precursor 


3.22 


403432 


AJ278111 




NM_001622:Homo sapiens alpha-2-HS-glycop


3.21 


436515 


Hs.195292 


putative tumor antigen 


3.21 


456505 


AA504595 




ESTs 


3.21 


427668 


AA298760 


Hs.180191 


hypothetical protein FLJ14904 


3.21 


452794 


AM 92444 


Hs.25892 


ESTs, Weakly similar to I37356 epithelia


3.20 


427314 


AB033024 


Hs.175475 


KIAA1198 protein 


3.20


424051 


AL110203


Hs.138411


Homo sapiens mRNA; cDNA DKFZp586J1922 (f 


3.20 


452028 


AK001859 


Hs.27595 


hypothetical protein FLJ21142


3.20 




AF116030


Hs.100932


transcription factor 17 


3.20 


422225 


BE245652 


Hs.118281


zinc finger protein 266 


3.20 


437549 


AA759149 


Hs.128757


gb:ah70e03.s1 Soares_testis_NHT Homo sap


3.20 


418524 


AA300576 


Hs.85769


acidic 82 kDa protein mRNA 


3.20 








ESTs 


3.20 


442765 


BE567353 


HSJ9480 


ESTs 


3.20 


410048 


W76467 


Hs.343874 


proline oxidase homolog 
cannabinoid receptor 2 (macrophage) 




412008 


NM_001841 


Hs.73037 


3.20


423675 


AI990509 


Hs.131342 


small inducible cytokine subfamily A (Cy 


3.20


453895 


AA039843 


Hs.61948 


H 1.1 piens, clone M 1:164 ml M cm 


3.20 


424144 


AA454033 


Hs.41644 


AKAP-associated sperm protein 


3.19 


428612 


AA770001 




ESTs 


3.19 


422805 


AA436989 


Hs.121017 


H2A histone family, member A


3.19 


444371 


BE540274 


Hs.239 


forkhead box M1 


3.18 


427528 


AU077143 


Hs.179565 




3.17 


451684 


AF216751 


Hs.26813 


CDA14 


3.17
Hs.79856
Hs.12045 
Hs.15243 
Hs.240770 



RAP1A, member of RAS oncogene family

hypothetical protein FLJ20272 

ESTs, Weakly similar to S55657 alpha-1 C- 

putative protein 

nucleolar protein 1 (120kD) 



Hs.279637 ESTs 



AB037735 
M648884 
NM_000579
BE076754 



R55336 

AW242474 

AA622037 

AA284267 

AW294215 

N58115 



Hs.291955 ESTs 



Hs.266041 
Hs.132560
Hs.134278
Hs.54443 



Hs.31845 
Hs.98397 
Hs.23539 
Hs.98960 



403610
421281
429274
438243
424800
416443
414706
411703
AI379772
AI581311
AW958439
AW340125
AW960146
AW821113
NM_003616
NM_004131
421305
448048
BE281291

AA648988 

AI801235 

AL109791 

AA594915 

N45202 

NM_003037 



hypothetical protein FLJ10312

Homo sapiens cDNA FLJ12676 fis, clone NT

chemokine (C-C motif) receptor 5 

gb:CM1-BT0601-180200-121-b10 BT0601 Homo 

ESTs 

A kinase (PRKA) anchor protein 3 



Hs.166468 programmed cell death 5 



Hs.221504 
Hs.195631 
Hs.21137 
Hs.76252 



Hs.194225 
Hs.105633 
Hs.166520 
Hs.76989 
Hs.192843 
Hs.301059 
Hs.116832
Hs.14953 
Hs.284137 
Hs.72402 



AD024 protein 

endothelin receptor type A

C3001 199:gi|7494834|pir||T15308 hypothet 



KIAA0097 gene product 
leucine zipper protein FKSG14 
hypothetical protein FLJ12488



FU12E88 



.112160 
Hs.289082 
Hs.113170 
Hs.125830 
Hs.99185 
Hs.44331 
Hs.61188 



ESTs 

survival of motor neuron protein interac 
gb:Homo sapiens full length insert cDNA 
faciogenital dysplasia (Aarskog-Scott sy
eukaryotic translation initiation factor
granzyme B (granzyme 2, cytotoxic T-lymp
Homo sapiens DNA helicase homolog (PIF1)
GM2 ganglioside activator protein 
ESTs 



Homo sapiens Ku70-binding protein (KUB3) 
diptheria toxin resistance protein requi 
ESTs, Moderately similar to A47582 B-cel 
gb:ns41(11.s1 NCI_CGAP_GCB1 Homo sapiens
Hs.32970
AL532360 BE794750 AA582906 AI015067 AW271034 BG271536 AW075177 AW071374 AI345565 AI307208 BE138953 BE049086 AI334881
AW075006 AW075181 AA464019 AW302733 AW075100 AW073433 AI802854 AI334909 AI802853 AI345036 AI348921 AI340734 AI307478
AI251289 AW302327 AW072520 AI312145 AW073555 AW072513 AW071289 AI307559 AA876186 T29587 AI307493 AI255068 AI252868 AI252839
AW074809 AI252926 AI25216C AI251662 AI251262 AI610913 AI270787 AI270156 AI252075 AW073469 AW072901 AW072496 AW071420
AI305762 AI254764 AI802837 AI251254 AW073049 AW071311 AI340643 BE138965 BE138502 AW073456 AI334733 AI054335 BE139260
AI054302 AI054060 AI054057 AS053722 AI239711 BE139228 AW470478 AW271039 AW302085 BE041872 AI254494 AI271496 AI252427
BF718773 BF718645 AW074866 BE857822
AI2C7197 BF773544 AW196462

BC017965 AW969075 AA279982 AA50451 1 AI219979 AA504595 AI245579 AA278181 BG485019 BI049312 

AA770001 AA431112AA432126 

AL079309AA281819 

Al 732997 AA977633 AA86581 8 

AW877458 AW877524 BE076922 BE166912 AW840534 BE076754 AW797829 BE166905 BE166926 AW877462 BE166927 BE166932 AW877523 
BE166917AW877529 BE166928BF351394AW877522AW877528 BE166861 BE166866 BE166913 BE166919 AW877456 AW877537 BE075866 
AW840571 

AI581311 AA781682AA781678 
AF075009 R63109R63068 
AW974599AA648988 R98760 



et al. (1999) Nature 402:489-495.



The 7 digit numbers in this column are Genbank identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of



402145 
401837 
403780 
402299 
401435 
401464 



406367 
403137 
402178 
403433 



8576116 
8018280 
7630990 



8575912 
9719511 
9796239 
6758795 
7711499 
8117697 



84187-84744 
113086-114800 

120993-121095,121660-121729 



170588-170834 
44186-44330 
53134-53281 
58313-58489 

92349-92572,92958-93084,93579-93712,9394 

391133-391711 

72225-72437 

110325-110491 



117335-118473 
121553-12174 
18123-18766 



Pkey: Unique Eos pre

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

Ratio of normal testis to normal adult tissues
Ratio of "average" normal testis to "average" tes

ExAccn UnigeneID



AW378993 
AI613480
AA431576 
AA398866 
AL137538
M338648 
AA412605 
AW294544 
AA431698 
AA705799 



C1 7000767:gi| 1 1 990770|emb|CAC1955 1 .1 1 (A 
Hs.183747 ESTs, Moderately similar to CALB_HUMAN C
Hs.46039 phosphoglycerate mutase 2 (muscle)
Hs.169797 Homo sapiens BOULE (BOULE) mRNA, complet
Hs.28472 hypothetical protein FLJ11181
Hs.155986 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
Hs.87224 SRY (sex determining region Y)-box 5
Hs.98328 hypothetical protein MGC13040
Hs.97633 A kinase (PRKA) anchor protein 4
Hs.304020 ESTs, Weakly similar to CRTC_HUMAN CALRE
Hs.120869 ESTs, Weakly similar to P 107_HUMAN H-REV
Target Exon 



ESTs 

Homo sapiens testis-development related
Homo sapiens mRNA; cDNA DKFZp434N2019 (f 
testes development-related NYD-SP22 
SPANX family, member C 
ESTs, Weakly similar to CORB MOUSE CORNI 



an DMA 



ESTs 



ESTs, Weakly similar to AF132972_1 CGI-3



426604 
442373 
427455 



Hs.250182 ESTs 



429877 
413114 
412026 
411844 



AL133030

AW118604

BE044562 

AW949856 

AI126471 

NM_006687 

NM_003296 

AA3CC900 

W37337 

AI825838 

AA383618 

AI807681 

AA974253 

AW183574 

AI073430 

AA393442 

AA416642 

Y12735 

AJC05371 

BE047007 

AK001113 

H64205 

AI827749 

AA860341 

AF053356 

T19204 

AA382787 

NM_016611 



hypothetical protein PRTD-NY3 

Homo sapiens, clone MGC:17333, mRNA, com

Homo sapiens mRNA for KIAA1666 protein, 



dynein light chain 2B (DNLC2B) 
ESTs 

protein phosphatase 3 (formerly 2B), cat 
testis-specific ankyrin motif containing 
ESTs, Weakly similar to T17257 hypotheti



s.146775 ESTs, Weakly sir 



hypothetical protein FLJ10251
mitochondrial capsule selenoprotein 
serine/threonine protein kinase SSTK 



hypothetical protein DKFZp434H0311 



426736 AA431615 Hs.130722 

427843 AC005622 Hs.180943

426639 AI799059 Hs.112807 

438637 BE500941 Hs.126730 

433795 AI216683 Hs.122599 

441232 AI656050 Hs.7086 

433943 AA992805 Hs.44865 



hypothetical protein R30953_1 
ESTs 

ESTs, Weakly similar to KIAA1214 protein 
ESTs, Weakly similar to ALU7_HUMAN ALU
hypothetical protein MGC12435 
lymphoid enhancer-binding factor 1



Hs.98257 - ESTs 



431153 
419350 
427107 
429461 
432512 
434451 
420348 
427214 
458658 
457034 
423120 
438983 
426619 
440822 
416205 
426712 
427840 
439314 
426943 
409209 
441710 



424426 
437387 
420718 



AI188219 
NM_003284
AW445179
AL137385

AI301117 
AA398061 
AW160551 
AF085884 
AI357194 
AI554897 
AA176396 
AW173177 
AI216654 



441856 
427789 
418967 
430232 
421850 
449436 
426699 
426627 
427285 
423693 
457019 



436661 
427749 
441830 



424144 
426710 
428710 
438641 
420614 
422705 
421805 
448963 
426738 
440403 



Hs.121438
Hs.97140
Hs.178213
Hs.122055
Hs.296587
Hs.124021
Hs.20029
Hs.119284

Hs.169624
Hs.197755 
Hs.98251 
Hs.178144 



Hs.73217 ESTs 



Homo sapiens mRNA; cDNA DKFZp434M1126 (f 
ESTs 

Homo sapiens chromosome 21 segment HS21C
soggy-1 gene

proacrosin binding protein sp32 precurso
ESTs 

Homo sapiens clone 191B7 placenta expres 
ESTs 

hypothetical protein UGC5356 



AI187883 Hs.127510



428563 AA431616 



AI674774 

AA412428 

NM_001725

AA469940 

AW274576 

AA86C329 

AA383337 

AF012359 

AA401664 

AL133633 

AA421844 



Hs.98952
Hs.98660
Hs.335499
Hs.128014
Hs.48642
Hs.89535
Hs.105324
Hs.121021
Hs.279307
Hs.121269
Hs.195685
Hs.97784

Hs.131779



hypothetical protein FLJ23093

ESTs, Moderately similar' i-TF Hi W I, mi 
ESTs 

hypothetical protein DKFZp434l2117 



Hs.1 2 



Homo sapiens mRNA; cDNA DKFZp434E2118 (f
hypothetical protein
NM_030813':H 



450606 AI668605 Hs.60380 ESTs, Moderately similar to ALU6_HUMAN A 



AC005764 
AL035195 
AI476416 
AI198874 
NM_002301 



Human DNA sequence from clone RP11-218C1
exonuclease NEF-sp 
hypothetical protein DKFZp434L1717 
germ cell specific Y-box binding protein 
Homosar ' 



420768 AI468780 Hs.292503 ESTs tteai larto 2 hypoth 



AI125270 
BE045979 
AA383104 



417592 AA204664 



AA454033 
BE041517 
AI890919 
AW138484



Hs.42954 
Hs.178067 
Hs.132793 



Hs.41644 
Hs.143893 
Hs.126780



AW665135 
AI184560 
AI245026 
AI190714 



Hs.130947
Hs.331247 
Hs.291902 



ESTs, Weakly similar to T19142 hypotheti
Homo sapiens cDNA: FLJ23052 fis, clone L
hypothetical protein DKFZp564D0372 



Homo sapiens cDNA FLJ11789 fis, clone HE
interleukin 13 receptor, alpha 2 
ESTs, Weakly similar to I54333 chromosom 
gb:zu66c01.r1 Soares_testis_NHT Homo sap 



putative transmembrane protein 
actin-like 7B

hypothetical protein DKFZp434N1415
Homo sapiens, clone IMAGE:3610712, mRNA,



Hs.130531 ESTs



Hs.98945 ESTs 



426735 
438653 
443038 
428677 



427757 
426721 
401692 
403783 



420419 
435897 
428516 



421610 
411982 
427288 
414439 
401798 
433395 
438064 
426658 
431986 

437896 
420431 
409467 
422770 



AW027571 
AF049615 
AL0354S0 
M397796 
AF269223 
R38137 



AI240184 
AA757626 
AI024442 
AW083791 
AA149285
AI026844
N93466 



AW172570
AA759233 
AA393168 



ESTs, Weakly similar to S38782 actin be




Hs.97309 ESTs 



Hs.7973 
Hs.122959 
Hs.124009
Hs.11614
Hs.128322
Hs.156469
Hs.97543 
Hs.99344 
Hs.23830 



PDZ-binding kinase; T-cell originated pr 
ESTs 

ESTs, Weakly similar to T29012 hypotheti
C16000122":gi|5S89527|db'|BM83047.1|(A 
NM_031956:Homo sapiens NYD-SP14 protein



ENSP00000244792*:Phosphoglycerate kinase
hypothetical protein DKFZp434G156; KIAA1
Huntingtin interacting protein M
Human DNA sequence from clone RP5-860F19 
HSPC065 protein 

t-complex 11 (a murine tcp homolog)
ESTs, Moderately similar to KIAA0940 pro 



ESTs 



PRTD-NY3 



Hs.88051 ESTs 

Hs.130767 Homo sapiens cDNA: FLJ23553 fis, clone L
Hs.99330 hypothetical protein FLJ23588
Hs.91885 ESTs, Weakly similar to I55214 salivary
Hs.188781 ESTs
Hs.187018
Hs.343487
Hs.10941
Hs.346385
Hs.21263
Hs.115659
Hs.98843 
Hs.121764 



ESTs, Weakly similar to IPP1_HUMAN PROTE
ESTs 

suppressor of potassium transport defect 



hypothetical protein DKFZp434t099 



Hs.99052 

Hs.62595 chromosome 9 open reading fram

Hs.135763 ESTs 

Hs.14600 ESTs 
ESTs 

Hs.90034 hypothetical protein FLJ21916

Hs.48802 ESTs 
Hs.97792 

Hs.100007 : 



ESTs, Weakly similar to KIAA1205 protein 
Homo sapiens cDNA FLJ12825 fis, clone HI
cylicin, basic protein of sperm head cyl 
DKFZP4341092 protein 




AC005261 
AI198850 
X57655 
AI208684 
AW628609 
AA448011 
R28215 



serine/threonine kinase 13 (aurora/IPL1-
DMRT-like family B with proline-rich C-t
serine protease inhibitor, Kazal type, 2 
Homo sapiens heat shock transcription fa 



Hs.131918 
Hs.143878 
Hs.120369



427572 
427541 
429404 
415014 



AJ010230 Hs.102576 ret finger protein-like 1 antisense 

U06632 Hs.966 coilin 

AF215924 Hs.97899 putative allantoicase 

AA417291 Hs.97978 hypothetical protein MGC4766 similar to

AI798983 Hs.82921 solute carrier family 35 (CMP-sialic aci

NM_005738 Hs.201672 ADP-ribosylation factor-like 4

AW954064 Hs.24951 ESTs

AF155140 Hs.98738 gonadotropin-regulated testicular RNA he



453017 
424466 
442084 
423196 
434183 



R84301 

AL040420 

H81173 

AK001866 

AW104257 



433982 AA724720 



Hs.12: 
Hs.127194 
Hs.99430 
Hs.155976
Hs.158450 
Hs.226377 
Hs.31387 
Hs.148250 
Hs.34596 
Hs.125139 
Hs.123426 
Hs.104830 

Hs.225433 

Hs.38041 



ESTs, Weakly similar to I38022 hypotheti
ESTs

testis zinc finger protein

nasopharyngeal epithelium specific prote

phosphate cytidylyltransferase 2, ethano

DKFZP564J0123 protein

Homo sapiens mRNA; cDNA DKFZp434N1535 (f

ESTs 



ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434G153 (fr
ESTs, Weakly similar to S43569 R01H10.6
pyridoxal (pyridoxine, vitamin B6) kinas
Target Exon 

hypothetical protein FLJ20519 



ESTs 

Homo sapiens organic cation transporter 
ESTs 

testis-specific protein TSP-NY 
Homo sapiens mRNA; cDNA DKFZp434F0919 (f

Homo sapiens cDNA: FLJ23553 fis, clone L




427255 
440713 
418499 
423218 



NM_015896 
AW070634 
AI638774 
AI188225 



hypothetical protein MGC10771



Homo sapiens cDNA FLJ13794 fis, clone TH
ESTs, Weakly similar to Lysozyme [H.sapi
hypothetical protein DKFZp434J0617
' intermediate chain 2 

Homo sapiens mRNA; cDNA DKFZp434P0810 (f 
ESTs 

Hs.99841 ESTs, Weakly similar to dJ1106D1 1.1 [H.s 

Hs.126838
Hs.140646

Hs.22142
Hs.189160
Hs.122042
Hs.99214 

Hs.343593 ESTs, Weakly similar to TD54_HUMAN TUMOR

Hs.126368 ESTs 

Hs.302023 hypothetical protein FKSG25 

Hs.167380 BLu protein

Hs.144794 ESTs 

Hs.105328 testes development-related NYD-SP20 



b5 reductase b5R.2 



Human DNA sequence from do 



hypothetical protein MGC3048 



427306 
418725 
456748 
443162 
433836 
439680 
437426 
405528 
442977 



AL133057 
AA609661 
AI476743 
AL117637
AW137749



Hs.229275 
Hs.306094 
Hs.125902 



DKFZP434I225 protein 
ubiquitin specific protease 2 
DKFZP434G032 protein 



Hs.144090 
Hs.132816 
Hs.144505 
Hs.121192 
Hs.60177 
Hs.98330 
Hs.97757 



C2002647;gi|4507721)ref |NP_00331 0. 1 1 tit 
ESTs 

hypothetical protein MGC14801 



AW140126 
AF078164 
AA993337 
AI457958 
AI028794 
AI817226 
AI221893 



429861 
426622 
427256 
408407 



435274 
426612 
435110 
433792 
423278 



AI989571 
AL044400 
AL042436 
AF214680 



Hs.80464 

Hs.313413 
Hs.121549 
Hs.181451 
Hs.180471
Hs.187134 
Hs.30325 
Hs.60288 ESTs 
Hs.190413 

Hs.99510 



409364 
419224 
428915 
427181 
442122 
424812 



Z94277 

Z46967 

T18864 

AI480252 

NM_012189

AI041278 

AI183653 



410380 
452537 
401712 



433760 
431219 
411773 
453468 
445158 
440860 
452420 
433281 



NM_014433
AA398161 
AA429709 
AA364109 



Homo sapiens Ku70-binding protein (KUB3)
ESTs

hepatitis B virus x-interacting protein
ESTs

ESTs, Weakly similar to T23110 hypothet



Hs.25371 ESTs, Weakly similar to A37232 mucin, tr 



AW296167 
AA887547 
AA922067 



Hs.120014 
Hs.104682 
Hs.98104 



ESTs, Weakly similar to mitochondrial ci 



Hs.127689
Hs.115460
Hs.144924
Hs.137368
Hs.314452 
Hs.87908 
Hs.27888 



Homo sapiens mRNA; cDNA DKFZp434B115 (fr
ESTs 

type 1 protein phosphatase inhibitor 



serine/threonine protein kinase SSTK 



Hs.2706 
Hs.104675 
Hs.123004
Hs.120397



DKFZP434B204 protein 



Hs.23459 ESTs 



AI190773 

NM_006799 

W00712 

AI992108 

R10482 

BE564871 

N48673 

AI269514 

AI674766 

AA298760 

AA393379 

AW274294 

AA290712 

AA393673 

AA447209 

U92992 

T19219 



Hs.112095 
Hs.122045
Hs.122511
Hs.61809
Hs.18566
Hs.133230

Hs.127204
Hs.72025
Hs.32990
Hs.127206
Hs.132876
Hs.29453
Hs.146037
Hs.129802



novel protein similar to mouse MOV10 
hypothetical protein FLJ11618
Target Exon 

hypothetical protein DKFZp434F1819 
ESTs 

centrin, EF-hand protein, 1 
hypothetical protein FLJ14117



ESTs 

ribosomal protein S15 
ESTs 

ESTs, Weakly similar to similar to CR16,
protease, serine, 21 (testisin) 
DKFZP566F084 protein 
ESTs 
ESTs 
centrin, I 
hypotl ei 
ESTs 
ESTs 



Hs.180191 hypothetical protein FLJ14904
Hs.97415 ESTs, Weakly similar to T33068 hypotheti
Hs.14409 ESTs, Woakly simlarl ii lacch
Hs.82407 CXC chemokine ligand 16

ESTs, Weakly similar to ALU5_HUMAN ALU S
Homo sapiens NYD-SP28 mRNA, complete cds 



Hs.98834 ESTs 



hypothetical protein MGC4766 similar to 
phosphorylase kinase, gamma 2 (testis) 
G protein-coupled receptor kinase 2 (Dro 
tubulin, alpha 2 
hypothetical protein 

rhabdoid tumor deletion region protein 1
ESTs 



Hs.71791 
Hs.35984 
Hs.97602 

Hs.99336 ESTs, Weakly similar to T15446 hy



427054 AA421240 Hs.97570 



Hs.177990 ESTs 

Hs.126413 ESTs

Hs.70202 WD repeat domain 10 

1 oxysterol binding protein-like 1 



438633 AI653327 

427199 AW015836 

440182 AA868919 

435517 AA928626 

446309 BE044261 

420338 AA825595 



434298 
440720 
427554 
414251 



437418 
437522 
422808 



Hs.123501 ESTs 

Hs.292919 ESTs 

Hs.250110 ESTs 

Hs.130177 ESTs 

Hs.149774 ESTs 

Hs.88269 Homo sapiens, clone MGC:17339, mRNA, com 

Hs.102021 ESTs 

Hs.99338 ESTs 

Hs.111591 ESTs 

Hs.112772 ESTs 

Hs.8179 hypothetical protein, clone 27

Hs.116290 ESTs 

Hs.336943 Human DNA sequence from clone RP11-60H7

Hs.179615 hypothetical protein FLJ10058 

Hs.97689 VASA protein 



gene from PAC106H8, similar I 




13086J 

532606_1 
863689_1 



BC017998 BI826643 BG71 5794 BG722697 BI460787 BG773459 H52859 AI652B53 AI990773 AW665193 AW340601 AA913806 AI337099 
BE045942 AW572790 AW515652 H15004 AA9C9115 BI455310 BI462024 BI56157B BI4B3075 BG722527 R86003 BG6232BS H15QQ3 BI562131 
BG435272 

BE736800 BM471423 AL557221 BG763302 BF7421S6 BF991016 BG200112 BF920027 BG576409 BG332214 BI830957 AI827504 AW183574 
AI805171 AI126491 AA448257 AI090641 AW183329 AA994B73 AI203663 BE04151 3 AA382260 AA382261 AL554887 BE273483 
A1809889 AA393442 AI150574 AI200886 AI221692 AA608977 AA813213 

AI209094 AI377740 AW117382 AV/1 32289 A1674095 AW18S01 9 AA897352 AA931314 AA923336 AW665317 AA629314 AA776691 AA906846 
AA974625AA884357AI808590 

AL042392 AI147451 M758821 AW450252 AA399310 AI656343 AI636668 AW515660 AI190733 AI025812 AA723645 AA709253 AA 725709 
AA398244 AA382463 AI139837 

AI218809 BC014509 BG724383 AI024359 AA904573 AI138595 M868685 AI768931 BI828436 BG717350 BG719800 AW182303 AA448181 
BI826670 BI827131 BI830254 BI824155 BI831745 T19190 BI830415 

BG207562 BG1921 13 AA977616 AW274024 AI554897 AJ221379 AA9691 58 AA906B67 AI873494 AI015039 
AA412383 AA300675 BG773248 AA412243 AA405951
427748
440115 
428524 
440933 
427829 
438735 
449333 



1372622J 
34 3 

1382184J 
980517.1 
1373537.1 
10316.1 
36378.1 



2684549.1 
584982.1 
657337 1 
32896.1 



NM.004157 X14968 BG480488 BE409869 BG723898 BG476313 AU1 21626 BE386516 AI969297 AW172340 AA889668 R23436 AI01 5037 
BE250558 BF590945 AW385993 BF933000 AA070235 AL556082 M78388 AW504473 AW370139 BG91 3597 BE899096 BG827945 BE741233 
AI015465 AW370169 BE297350 AA093249 
AI1 22828 AA909991 AA759233 

AL591713 BF197609 AI9850S4 AW448916 AI243277 AL419630 AL449S29 

AW241405AW205071 AI6715%^lf 3 5 I A )90359 AW662771 AW594067 BE502532 A1218894BE466416 BF056295 AI247366 
AI990484 AI917746 AW665925 AI216456 AW1 82169 AA969884 AA723888 AI018419 

BI602176 BI503138 BI459895 B1755030 AW418788 AA883999 AA724858 AI480311 AW195355 AI004813 AI651117 BE814363 AW589856 
AA448124 AA447982 BI461166 AA405629 
AI208966 AA421041 AA815377 AA411954 



AI208080AA442862AA429772 

AI125404AW593312AI247364AI2Q8217AA910021 A!S15307 
AI190292AI188225AA416673AA416596AA952888AA972172AA906874 

M76676 NM.022571 BG772522 BF51644S AI5374S5 AW517245 BF762535 AA634446 AW196331 AI203035 BG722281 

AK056320 AL522040 BI793043 AW071691 AI433682 AA865414 AA702684 BI792794 H96879 R52351 AA211126 AA442875 N25725 AA482563 

N33446 N25222 Z41 1 10 N26507 N73447 N24077 N20492 AW275550 H9961 9 AL51 8306 AL522041 AW959849 AL51 8307 AA725907 A16551 1 3 

AI309906 F10184 BM451081 BE257595 BG721625 BI82SS09 BG700470 F12568 Z45396 BI829288 AA364618 AA364851 AA421448 T74231 

R52350 AA482415 AI203021 T88948 AI565842 

AI024286 AA769898 AA778661 AA868972 AA609524 

AI932330 AI190707 AI376782 AA976847 

AA6091 79 AW592321 AA758282 AI2 14437 AW072537 AA78 1 937 

BI831486 AW190479 AI472793 AA460217 AA459937 BF082576 AA393673 AA398702 

NM.006001 L1164S A12Q5604 AI207994AI187362AA709190 BI462421 BG772170BG722772AA436991 BG771S55 BI553260 BF126025 
BF125857 BI462670 BG724164 BI562424 BG721652 BI559662 BG722455 BI596415 BG717561 BG72213B BG773507 BG720572 C03867 AI016802 
AL042663 AA770436 AA435720 T19365 AA626S98 M759057 AI20B021 BF507844 AI208058 AA412719 AA426374 AI208775 AA977217 AA758055 
F34585 AA180062 

BG70Q885 AA86801 7 AW341719 AA971332 AI688794 Z20462 AI808145 AW665263 AA884952 AA906136 
AW663704 AI1 50368 A1216464 

AI125340 AI125684 AI377949 AI126470 W21 8351 AW665355 AI243952 AW663454 AI240B03 AI187742 AA884214 AA723933 
AA857437 AI968733 A1968938 AA992784 AA397520 AW235244 

BM479185 AL552795 AL577722 BF038B38 BM1 27617 BF510346 AW450652 AA865478AW449519 BM127314AI806539 AW449522 AA993634 
AI827626AA904788 



ler corresponding to an Eos | 
urce. The 7 digit numbers in this column are Genbank identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of
et al. (1999)



: Indicates nucleotide positions of predicted exons. 



406378 
405264 
401692 
403783 



9256142 
7329374 
3540172 
8081824 
9828129 
6730720 
9801539 
9581957 
8469086 



26365-26510 

128412-128635 

56392-57645 



TABLE 55A: 

Pkey: Unique Eos pre 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title 
R1: Ratio of non-seminomatous mi 



Pkey 



UnigeneID
Hs.79516 
Hs.136348 
Hs.334851 
Hs.194657 



411573 
432730 
418870 
444381 



BE409838 

X14008 Hs.234734 

AI879277 Hs.76136 

AI920965 Hs.77961 

NM_005397 Hs.16426



Hs.131358 ESTs 



AU076643 
AL048753 
L20688 



LIM and SH3 protein 1

cadherin 1, type 1, E-cadherin (epitheli

lysozyme (renal amyloidosis) 

major histocompatibility complex, class 
podocalyxin-like 
KIAA1077 protein 



412636 
414092 
414587 
412915 
406648 
412247 
446108 
424800 
413063 
415314 
406656 
426295 
406820 
444562 



425543 
452679 
420754 



Hs.29797 
Hs.76884 
Hs.7188 
Hs.81328 



AI493591 Hs.78146 
NM_004415

Z14244 Hs.75752 

NM_004862 Hs.76507

AW087727 Hs.74823 

M563730 Hs.277477 

AF022375 Hs.73793 

AL036596 Hs.42322 

AL035588 Hs.153203 

AL035737 Hs.75184 

N88802 Hs.5422 
Hs.89643 



AW367283 

AI223958 

M186715 

M54915 

H73444 

D12485 

M043424 

R23313 



Hs.108124 
Hs.336429 
Hs.81170 



Hs.83883 
Hs.346911 
Hs.63042 



415899 X78992 

406786 AW161678 

422105 AI929700 

422714 AB018335 

444051 N48373 



T97490 
AW954134 
AI970276 
AF179897 



454413 AI653672 
415221 W07418 
425535 AB007937 
450000 AI952797 
418151 AA864238 
414799 AI752416 
412025 AI827451 
445055 BE512856 
412490 AW803564 
408437 
430542 
424670 
432409 
425945 

449961 
435522 
407179 



Hs.111680

Hs.119387 

Hs.10247 

Hs.173108

Hs.13396 

Hs.211539 

Hs.737 

Hs.50002 

Hs.20924 

Hs.156905

Hs.104105 



Hs.40092 
Hs.78225 
Hs.158287
Hs.10888
Hs.83583 
Hs.77326 
Hs.24143 

Hs.288850 
Hs.278469 
Hs.119122 



nuclear factor of kappa light polypeptid
laminin, beta 1 

Homo sapiens cDNA FLJ10934 fis, clone OV 
v-erb-b2 avian erythroblastic leukemia v 



desmoplakin (DPI, DPII) 
cytochrome c oxidase subunit VII b 
LPS-induced TNF-alpha factor 
NM_004541:Homo sapiens NADH dehydrogenas



| anchor protein 2 
MyoD family inhibitor 
chitinase 3-like 1 (cartilage glycoprote 
glycoprotein M6B 



zinc finger protein 6 (CMPX1) 
ribosomal protein S4,X-linked 
RIKEN cDNA 9130422N19 gene 



immediate early response 3 

ribosomal protein L10a 

transmembrane, prostate androgen induced 

DKFZp564J157 protein 

protein C receptor, endothelial (EPCR)

butyrate response factor 2 (EGF-response 

ferritin, light polypeptide 

endosulfine alpha

KIAA0792 gene product 

activated leucocyte cell adhesion molecu 

Homo sapiens cDNA: FLJ21897 fis, clone H

Homo sapiens clone 25028 mRNA sequence 

hypothetical protein MGC4248 



iilyA( C/ 



Meis (mouse) homolog 2 

MAD (mothers against decapentaplegic, Dr

ESTs, Weakly similar to I38022 hypotheti 

glutaredoxin (thioltransferase)

PNAS-123 

KIAA0468 gene product 
hypothetical protein FLJ21709
actin related protein 2/3 complex, subun
insulin-like growth factor binding prote
Wiskott-Aldrich syndrome protein interac
SH3 domain binding glutamic acid-rich pr
Homo sapiens cDNA: FLJ22528 fis, clone H
lacrimal proline rich protein 
ribosomal protein L13a 



AA806538 Hs.130732 

AW410669 Hs.164280 
W67330 

AW265634 Hs.133100 

N64214 Hs.9774 
AA206465 



KIAA1575 protein 
solute carrier famil
hypothetical protein AL110115



418509 
440869 
417144 
451106 



BE297660 Hs.170328 moesin



407951 
449944 
423673 
435056 



420676 AI434780 



Hs.5422 
Hs.279860 
Hs.118787
Hs.4248 



heat shock 70kD protein 1A 
Notch (Drosophila) homolog 3 
antigen identified by monoclonal antibod
Homo sapiens, Similar to rhotekin, clone
matrix metalloproteinase 12 (macrophage



transforming growth factor, beta-induced 



436075 
450139 
427691 



423184 
450847 
417407 



413813 
434280 
417944 
409963 
428664 



421379 
422241 



447471 
426215 
446627 



448776 
453856 
410143 
414591 



AA340277 

AB014680 Hs.8786 

BE090176 Hs.179902
AK001838 

AW194426 Hs.20726 

L33075 Hs.1742 

AI219083 Hs.42532 

BE313555 Hs.7252 

NM_004428 Hs.1624

NM_003155 Hs.25590 

AA923278 Hs.290905 

AI124756 Hs.5337 

D90084 Hs.1023

AF263538 Hs.86232 

R19774 Hs.22835 

AI220684 Hs.347939 

AW195237 Hs.7734 

L11144 Hs.1907 
Hs.282076 
Hs.75561 



serum/glucocorticoid regulated kinase



ephrin-A1 
stanniocalcin 1 

ESTs, Weakly similar to protease [H.sapi 
isocitrate dehydrogenase 2 (NADP), mitoc
pyruvate dehydrogenase (lipoamide) alpha
growth differentiation factor 3 
HHGP protein 

■■ ,alpha2 

al protein FLJ22174



AI634046 
AJ271671 
AB023154 



Hs.189095 
Hs.263395 
Hs.82280 



Hs.37040 
Hs.300870 
Hs.17767
Hs.104520 



Y00062 

AW958264 
AF039843 
AW963419 
AI973016 
AK000680 
AV653729 
AB017169 
AB032973 
BE302464 
AA804789 
AA188169 



Hs.57929 
Hs.12461
Hs.30057 
Hs.19447 



Homo sapiens mRNA for KIAA1650 protein,
teratocarcinoma-derived growth factor 1
gb:CM1-BN0116-150400-189-h02 BN0116 Ho
collagen, type V, alpha 2
calcium/calmodulin-dependent protein kin
similar to SALL1 (sal (Drosophila)-like



zinc/iron regulated transporter-like
KIAA0937 protein 

platelet/endothelial cell adhesion molec
platelet-derived growth factor alpha pol 
Homo sapiens mRNA; cDNA DKFZp547M072 (fr 
KIAA1554 protein 

Homo sapiens cDNA FLJ13694 fis, clone PL 
integrin, beta 2 (antigen CD18 (p95), ly 
small inducible cytokine subfamily B (Cy 
ESTs, Weakly similar to ALU7_HUMAN ALU S
protein tyrosine phosphatase, receptor t
STAT induced STAT inhibitor 3
similar to yeast Upf3, variant B
sprouty (Drosophila) homolog 2
stanniocalcin 2
hypothetical protein SBBI48 
phosphoprotein associated with GEMs 
CGI-44 protein; sulfide dehydrogenase li 
slit (Drosophila) homolog 3 
LCHN protein 

MRS2(S. cerevisiae)-like, magnesium hom 
PDZ-LIM protein mystique



AI888490 Hs.55902 ESTs, Weakly similar to ALU8_HUMAN ALU S 



AF081513 
AW961576 
AI193356
X06256 
AK001514 
AF161486 
AW512022 



AW967069 
NM_001674
AW139408



Hs.257631 ESTs 
Hs.25195 
Hs.60178 
Hs.160316 



phosphoinositide-3-kinase, class 2, beta 



Hs.152940
Hs.278562 
Hs.151738 




integrin, alpha 5 (fibronectin receptor, 
hypothetical protein FLJ10552
RAB23, member RAS oncogene family 



matrix metalloproteinase 9 (gelatinase B
paired-like homeodomain transcription fa
gb:ye53h05.s1 Soares fetal liver spleen 
CGI-145 protein 

hypothetical protein DKFZp434E2135 



Homo sapiens cDNA: FLJ22524 fis, clone H 
hypothetical protein FLJ10199 
glypican 3 

small inducible cytokine A4 (homologous 



447674 BE270640 Hs.19192 
426689 BE245550 Hs.171825
444933 NM_016245 Hs.12150 



cyclin-dependent kinase 2 

basic helix-loop-helix domain containing 

retinal short-chain dehydrogenase/reduct

serum response factor (c-fos serum respo 
CD9 antigen (p24) 

ESTs, Highly similar to LDHH_HUMAN L-LAC



AI469213 Hs.71404 

AF025374 Hs.46465 

AW015759 Hs.235709 

AV651680 Hs.203558 
Y00093 

BE222494 Hs.130919 

AW580939 Hs.97199 

H87136 Hs.5174 

BE148235 Hs.193063 

T85104 Hs.222779 

AA775268 Hs.6127 

NM_002205 Hs.149609



M528489 
D50683 
BE077546 
NM_006855 



AF245505 
AW953575 
BE270030 



hypothetical protein FLJ14428
ESTs 

T-cell, immune regulator 1 

Homo sapiens mRNA; cDNA DKFZp667B0711 (f

ESTs 

integrin, alpha X (antigen CD11C (p150),
inhibitor of DNA binding 2, dominant neg
complement component C1q receptor
ribosomal protein S17

Homo sapiens cDNA FLJ14201 fis, clone NT
ESTs, Moderately similar to similar to N
Homo sapiens cDNA: FLJ23020 fis, clone L
Target Exon

integrin, alpha 5 (fibronectin receptor,
NM_004079:Homo sapiens cathepsin S (CTSS 
ESTs 

ESTs, Moderately similar to I38022 hypot
fibroblast activation protein, alpha 
gb:zx80f10.s1 Soares ovary tumor NbHOT H 
Homo sapiens cDNA FLJ20394 fis, clone KA 

trite c i erfamil 2(1 ililated glu 
ribosomal protein L23 
transforming growth factor, beta recepto 
ESTs, Moderately similar to A46010 X-lin 
KDEL (Lys-Asp-Glu-Leu) endoplasmic retic
LIM domain binding 2 
KIAA1191 protein 

inhibitor of DNA binding 1, dominant neg 
Adlican 

p53-induced protein PIGPC1 
Homo sapiens, clone IMAGE:36771B5, mRNA 
cathepsin D (lysosomal aspartyl protease
KIAA0512 gene product; ALEX2
hexabrachion (tenascin C, cytotactin)
glycine dehydrogenase (decarboxylating;
Williams-Beuren syndrome chromosome -eai
Human clone 23933 mRNA sequence
gb:RC1-HT0598-020300-011-h02 HT0598 Homo
nucleolar protein family A, member 3 {HI 
Homo sapiens cDNA FLJ13471 fis, clone PL 



ESTs, Weakly sir 
enolase 2, (gam 



Hs.119769 ESTs



cDNA FLJ12807 fis, clone NT
transglutaminase 2 (C polypeptide, prote 
ESTs, Weakly similar to KIAA1204 protein 
hypothetical protein MGC10823 
disabled (Drosophila) homolog 2 (mitogen 
butyrophilin, subfamily 3, member A2 

major histocompatibility complex, class
PR01914 protein 

gb:hn41c11.x1 NCI_CGAP_RDF2 Homosapiens 



AU077333 Hs.160483



io sapiens, Similar to hypothetical pr 
copine III 

erythrocyte membrane protein band 7.2 (s 



443523 AK001575 Hs.9536 

428311 NM_005651 Hs.183671 

447519 U46258 Hs.339665 

445817 NM_003642 Hs.13340

434423 NM_006769 Hs.3844 



441970 
433675 
437374 

407284 
426158 
447029 
417315 



Hs.42915 
AI539227 Hs.214039 
NM_001982 Hs.199067



ribonucleotide reductase M2 polypeptide

ninein (GSK3B interacting protein)

ARP2 (actin-related protein 2, yeast) ho 

hypothetical protein FLJ23556 

v-erb-b2 avian erythroblastic leukemia v 

Homo sapiens mRNA; cDNA DKFZp434C2016 (f







Hs.185831 


ESTs 


6.20 




BE379794 


Hs.159651


hypothetical protein 


6.16 






Hs.159578


AT-hook transcription factor AKNA 






AI301558




RNA binding motif protein, X chromosome


6.13 






Hs.193482 


Homo sapiens cDNA FLJ11903 fis, clone HE


6.12 




AA554989 




sudD (suppressor of bimD6, Aspergillus n 


6.10 








inositol 1,4,5-trisphosphate 3-kinase B 










low density lipoprotein receptor-related


6.10 








neutrophil cytosolic factor 4 (40kD) 










CD8 antigen, alpha polypeptide (p32)






AI979168 






6.10 




AW204069 




ESTs, Weakly similar to unnamed protein 
ESTs 


6.09 




AA700439 


Hs.188490 


6.06 








CD97 antigen 


6.06 




AA927670 


Hs.131704 


6.06 








glycoprotein M6B 


6.00 




AW997484 


Hs.5003 


KIAA0456 protein 


6.00 




AA278921 




proteoglycan 1, secretory granule 


5.99 






Hs.227817 


BCL2-related protein A1 


5.35 








Homo sapiens cDNA FLJ13698 fis, clone PL 


5.95 




AA354572 




gb:EST62857 Jurkat T-cells V Homo sapien 


5.95 




AA807346 


Hs.288581 


Homo sapiens cDNA FLJ14296 fis, clone PL


5.03 




AA281504 




Homo sapiens cDNA: FLJ22165 fis, clone H
neuropilin 2 


5.90 




BE465754 




5.90 




AL134324






5.86 








Homo sapiens cDNA FLJ13618 fis, clone PL 


5.86 




N38902 


Hs.21153S 


hypothetical protein MGC4248 


5.84 








serine/threonine kinase 12 


5.84 










5.82 


















matrix Gla protein 


5.80 




















5.80 




AI739168 




Homo sapiens cDNA FLJ13458 fis, clone PL


5.30 








ESTs, Moderately similar to Ibd1 (H.sapi 


5.30 








v-yes-1 Yamaguchi sarcoma viral related 




442821 












R28982 


Hs.18106


ESTs 


5.75 




AW511970 




tumor protein, translationally-controlle


5.70 








ESTs, Highly similar to S94541 1 clone 4 






AW772041 




ESTs, Moderately similar to JC5238 galac 


5.70 




AA613726 




ribosomal protein L10 


5 53 




AA605038 




Homo sapiens cDNA: FLJ21950 fis, clone H 


5.53 














AA315805 




desmoglein 2 


5.63 




AI433074 


Hs.86682 


Homo sapiens cDNA: FLJ21678 fis, clone C


5.63 




AL080058 




DKFZP564G202 protein 


5.50 




AW662958 




pleomorphic adenoma gene-like 1 


5.50 


AV654694 




interferon-induced, hepatitis C-associat


5.60 










5.60 




BE409248 




hypothetical protein FLJ22357 similar to


5.53 








cathepsin B 


5.57 

5.53 






Hs.182979 


ribosomal protein L12 




AA010319 






5.50 








leukocyte membrane antigen(IRCI) 










ribosomal protein S23 


5.50 






Hs.184510


stratifin 


5.42 








Homo sapiens cDNA FLJ13267 fis, clone OV


5.40 




AW291587 






5.40 




AW361666 


Hs.49500 




5.40 
5.40 




AK002121


Hs.184465


hypothetical protein FLJ11259






Hs.301746


RAP2A, member of RAS oncogene family 


5.40 






Hs.146233


Homo sapiens cDNA: FLJ22130 fis, clone H










gb:og82a10.s1 NCI_CGAP_Ov8 Homo sapiens 


5.39 




AW968613 


Hs.79428 


BCL2/adenovirus E1B 19kD-interacting pro 


5.36 








meningioma expressed antigen 5 (hyaluron 


5.34 








collagen, type IX, alpha 3 


















jun B proto-oncogene 








Hs.156974 




5.30 




BE512730 








424408 


AI754813 


Hs.146428


collagen, type V, alpha 1 


5.27


425430 


BE185921


Hs.98073 


ESTs, Mod I ilor to Z1 HUMAN Z 


5.27


411165 


NM_000169


Hs.69089 


galactosidase, alpha 


5.26 


406701 


AA7B0613 


Hs.62954 


ferritin, heavy polypeptide 1 




435631 


BE254086 


Hs.29647 


uncharacterized hematopoietic stem/proge
actinin, alpha 4 


5.24 


418905 


BE539674




5.23 


449303 


AK001495 


Hs.23467 


hypothetical protein FLJ10633 


5.22 






Hs.7378 


Homo sapiens mRNA; cDNA DKFZp434G227 (fr 


5.20 


430314 


AA369601 


Hs.239138 


pre-B-cell colony-enhancing factor 


5.20 


444930 


BE185536


Hs.3011B3 


molecule possessing ankyrin repeats indu


5.20 


430598 


AK001764 


Hs.247112 


hypothetical protein FLJ10902


5.20 


407254 


AW129401


Hs.181165 


eukaryotic translation elongation factor 


5.20 






Hs.49124 
Hs.278441 
Hs.13797 



425190 AW028302 

454227 AW963B97 

429357 AB007867 

452191 AU076408 

419073 AW372170 



421878 
424875 
427641 
442806 
442495 
439941 
444981 
452472 
418117 
431824 
417558 
410570 
431805 
430333 
428494 
419641 



414821 
429558 
432588 

406797 

422392 
447197 
447832 
420932 
413593 
451658 
441224 
408067 

417308 

418113 



AI187945 
AI270591 
AW294522 
AI184717 



AW957300 

AI922013 

AW972842 

AF045229 

AI133096

NM_014053 

S70114 

AA233439 

BE170548 

H73972 

AK001972 

AA298065 



Hs.82280 
Hs.64593 
Hs.270594 



KIAA0015 gene product 
ESTs 

Homo sapiens cDNA; 
protein phosphatase 2 
KIAA1435 protein 



UDP-glucose dehydrogenase 

Homo sapiens cDNA FLJ12797 fis, clone NT

Homo sapiens cDNA FLJ11469 fis, clone HE

hypothetical protein FLJ23516

Homo sapiens cDNA FLJ11643 fis, clone HE



ESTs 
ESTs 

amino acid transporter system A1 
hypothetical protein FLJ13732 similar to 
ESTs, Weakly similar to C55663 oligodend 
linker for activation of T cells 
gb:EST384937 MAGE resequences, MAGL Hor 
regulator of G-protein signalling 10
ATP synthase, H+ transporting, mitochondr
FLVCR protein 

TIA1 cytotoxic granule-associated RNA-bi 

hypothetical protein FLJ20005 

Homo sapiens cDNA: FLJ21081 fis, clone C 



AI391454 

X92715 

AI025842 

AI432224 

AW970C26 

NM_005908

R36075 

AI433357 



Hs.30822 
Hs.11465
Hs.347969 
Hs.78948 
Hs.157211 
Hs.32366 
Hs.77424 
Hs.207251 
Hs.3057 



DKFZP586B0621 protein 

ESTs, Weakly similar to TWST HUMAN TWIST 

Fc fragment of IgG, high affinity Ia, re

nucleolar autoantigen (55kD) similar to

zinc finger protein 74 (Cos52) 

ESTs 

ubiquinol-cytochrome c reductase hinge p
mannosidase, beta A, lysosomal 
gb:yh88b01.s1 Soares placenta Nb2HP Homo 
ESTs 

ESTs, Weakly similar to T21597 hypotheti
gb:zq78c12.r1 Stratagene hNT neuron (937 
ESTs, Moderately similar to I38022 hypot 



BE244580 Hs.342307 hypothetical protein FLJ10330



AA479005 

H60720 

AI583207 



AA974433 
M77640 
AA157857



418945 BE246762 

448966 AW372914 

426711 AA383471 

433681 AI004377 

441321 H17182 

448896 AL157484

447232 AW499834 

449317 AW293413 

436372 AW972301 

422082 AA01618B 

432465 D56165 

417924 AU077231 

446659 AI335361 

414829 AA321568 

418036 Z37976 



Hs.1757 

Hs.182265

Hs.59892 

Hs.173936

Hs.89499 

Hs.86149 

Hs.343800 

Hs.200360

Hs.7771 

Hs.22483 

Hs.327 

Hs.132906 

Hs.310286 

Hs.111244 

Hs.275163 



Hs.83337 



KIAA0725 gene product 

fibroblast growth factor 4 (heparin secr

L1 cell adhesion molecule (hydrocephalus 



Homo sapiens cDNA FLJ13027 fis, clone NT

B-cell associated protein 

Homo sapiens mRNA; cDNA DKFZp752M127 (fr 

interleukin 10 receptor, alpha 

19A24 protein 

ESTs 

hypothetical protein 

non-metastatic cells 2, protein (NM23B)
cyclin D1 (PRAD1 : parathyroid ad- 



016055 Hs.82389 

AI076765 Hs.269899 

AA303799 Hs.300141 

AA221036 Hs.13273 

BE620712 Hs.33026 

H88409 Hs.40527 

AA159248 Hs.180909 
AI890243 
AI34B027 



Hs.108557 hypothetical protein PP1057






425410 
407874 
417535 
421818 
408491 



U51166 
M310974 
A1766311 
M203559 



thymine-DNA glycosylase
Homo sapiens cDNA FLJ10522 fis, clone NT

1 1 api it FLJ140GSI L 

ESTs 

Hs.50098 NM_002489:Homo sapiens NADH dehydrogenas






417353 
426141 
412922 
434449 

418883 
416581 
440983 
450988 
413663 
410342 
AI423041 AW071181 A'8S983B AW1291 12 BG925339 AI017633 AA568964 BF72S590 AI004210 AI809799 BE083097 BG89S220 AW997681
BF668788BE083134AW6312B1 BG193052 6G183095 BE440088BG185728 AI499579M188162AA864282 BI493352 M155854 AA836749 
AA836844 AA985478 AW082299 A1816747 AA450221 AA971294 BE327509 AI719562 BG576669 AI479382 BF824747 AI741800 BG982962 
AI088473 AA91 61 51 AW473324 BG901 1 77 BE439998AW023269 BE813871 AW999947 BE83910BAV707983AA369722AW796627 AW890608 
AI341 771 AA302459 BI493353 AA366332 AA371 104 AA357277 AL547972 EG928011 AI678903 AI699886 AI956165 AA4B4893 AA643953 
AW591063 BG203275 BG21 1 093 A1334791 AA91 6589 AW058266 AI362370 AI143352 AA508721 AI928079 D57214 BE045265 AA5417B5 
BG21951 0 BG201686 BG195572 AW01 9904 AW089242 AA953322 A1686698 F27562 AA614749 D56645 F20774 F30660 F25646 AW023542 
M827300 AA582214 AI7012BS AA228293 A1906950 AA2301 56 AA384572 AW43B988 AA742516 BI490938 M731082 BF665869 BG190518 
AV704158 BE439643AA910665AA155913 AA923097 AA975721 AA985555 BG927032 AA94B389 AA451625AA916141 AL572719 AV707258 
AWD33733 AA128053 AI9537B9 AI91 1993 AA421793 BG429150 AIS15305 Z3013Q AA126929 BG926630AA081013 M553696AA916094 
BG924321 AI039722 A1954958 AI372839 AI401406 A538215 

AL047596 AA393792 AI670731 A1037957 AW874364 A10381 37 N52286 AI241 379 BE501 096 AW090696 AI927369 AI669226 AI369437 AI371075 
AW612409AI686711 AI183289AA477717 AI076122 AA535190 AA700984 AA781508 D81020 BF575223 AI356183 D79312AI375558 H61111 



2176805J 
1036795J 
2949482.1 



AK056315 AI015524 AA724079 BI713619 AI377728 AW293682 AI928140 AI092404 AI085630 AA731340 BM469629 AW968804 AA425658 
AA769094 BF446026 AW1 18719 AI332765 AW5QQ888 AW576556 AI859571 AW499664 AW614573 AW629495 AW505314 W74704 AI356361 
AI923640AW070509AI521500AL042095AA609309 M761319AI381489 H45700 AA761333 AW265424 AA909524 AA635311 AA649040 
AI392620 Z40708 AI985564 AW26351 3 AA913692 A1693486 AW263502 AI8061 64 AW291 1 37 BI061872 BI05949B AA1 34476 AW084883 AA036967 
AW370B23 T55263 BI002756 AA489664 BF827261 W74741 BF963166 

BC022413 BE395396 BF754175 AA506621 BE706555 BE706678 AA723159 BE153169 BE706729 BE706558 BE153312 BE706706 AW371853 
AW371849 BE153241 BC01 7410 AI337912 AI090244 AW090300 BE219837 AI623661 BE501576 BE501734 AI742232 AI023964 AI458424 
AA976373 AI288S04 AI9B4583 AA890325 N32562 AI3S81Q2 AW241694 AI038448 AI672071 AI018389 AA576391 M977874AW1 89392 W37448 
AA612894 AI277548 H89551 AI699774 H89365 AA315805 AW579186 BC014584 BC014581 AW780125 A1672414 BE328145 AW600919 BF031306 
AW172758 BE708322 AA345675 BES75779 H28241 H25318 BF540913 BG179688 BF110202 AA528775 W37573 BE041644 AW366504 BM129522 
BM129822 AI122760 BE718200 AW887496 AA149420 BE706307 BE539395 BE7487B5 AI373653 R75904 BF979185 BF691393 BG495595 
BI094458 BE706702 BG496559 BF248373 BG494800 
M908472 

BM469076 M533027 AI127512 AI368802 AA533141 AA700560 AW57602B AI510851 AI435361 BM129172 AW474544 BM128899 AI814292 
AW502039 M531 243 BF941 858 AW502037 AA702337 AI419S54 AA652755 M934364 AI30051 0 A1291 1 36 AA505263 AI144527 AI07691 9 
AI633534 AI242473 AA938561 BG055372 AA512894 A1671356 AA962403 BF808010 AA66391 1 AA847056 AA513301 AA369069 AA377265 
BG2912O6M4O2298AA885766 AW801002AA302290AI305842AW80O873AA302492 M47B427AI817291AW8011O4AW8O1028AA865744 
BF155979 AI374743 AA478431 Al 1 59646 AI369757 AI300672 BF435788 AA255451 A1937707 AW0061 98 AI280363 BF062434 AW801 1 15 
AI919181 F28413 W04214AW152380 AW901567 AW901570 AA336371 AA384251 AI302846 BE701902 AA931606 H42673 R33703 AW901556 
AA009816 AW901568 AW352200 AA256558 H15928 BI037170 AW800530 AA369068 N98562 H28652 N34644 H97650 H00956 W70039 AI142831 
AA009817 F37136 N70289 AA531347 R72374 H27488 R66605 
AI184717 AW518883 AF121 173 
AW972842 AA516061 M630205 
AI74231 1 AI025842 AA57S843 

AI432224 AW276890 AI499346 AA93701 4 AA653573 A!31 8525 AI246219 AA961 591 AI270640 
R3S075 R36167 AI366546 
AI628543 AI433357 AW772732 
AA205248 AA204737 AA1 30658 

BF51 071 5 BE673055 BE4641 1 1 AW590620 AI637939 AA404324 AW236441 A1650952 BF056795 AA974433 
AI890243 

BG402852 BG545066 AA150252AL036760 AA452480 AI033256W68776 W93372 N31248 A1052219 AI367635W69374 N88610 R58194 BI524854 
BI4971 1 1 BF940043 A! 1 29263 AI35979S AI056480 AA121 421 AI042150 AW449003 AI4181B0 AI41 9420 AI356058 BF832243 AI349330 AI359448 
W76647 BF477170 AA099163 BF994549 AW608255 AA045418 H03770 AL574791 AW069455 BE302148 AW022281 AW960273 M121268 
A1336371 AI989381 AI131425 A1147483 A:31 1537 AW338538 AI141 649 AA709414 AI187177 AA780B84 AI333805 AA045312 A1623918 AI349421
W63753 W70299 M557276 AA299007 N982 1 2 W74064 N24823 T54892 AA054724 W73059 AI869152 N93462 N71889 AI537432 R71 628 
AA303089 AI498550 T60941 AV706417 AWQ67848 AI150677 AW3381 18 A1336313 AA826256 AI139518 AA662948 AA902723 AI970175 W68682 
AI089380AI148372H99951 AW183001 AI270317AA532767AA044727AA931652R82469AA150261 W67788 H67495 R8071 5 AW149812 N78914 
AI862034 W611 22 AW0231 18 W69375 T88917 I 47984 N21 531 R35646 AA055544 HI 5534 AA68B295 AA090586 AA044764 BF994641 R79547 
N21313BF674610 H02874AW975323 R16904M328030AA054671 R79546 BF832310 AI249109 

413703 376077J BG015794 BE158357 BE158353 BE158358 BE158360 BE153352 EE158351 BE158355 

406870 0_0 AA075144 

456629 207.22 AW891 965 AW604749 BE080872 R1 5559 BE1 77623 AW883520 AW945343 AI2461 67 T07082 AW805679 W96278 AA1 35796 W3261 5 AW99541 B 

AW801 688 BE003837 AW801621 AW385721 AW385742 AW38571 4 AW604757 W87409 AW604738 AW385757 AW580796 AW801 247 BE003239 
BE003133 AA847112 AW5S0975 AW6O4760 AW385727 BE164590 BE003090 AW362791 AW604759 AW866589AW604758 N44337 AI378548 
AW890438AA077172AI288683AA223539AA091945AW945454AA063629AA702504AW861938AW894816AW580841 AA094372 T06399 
AW885686 BE244086 BE005035 AW861913 AA551 773 AW858460 AW370926 AW754352 AW889695 AW384408 AI907428 BE067491 AW861939 
AA248197 AW381373AW1 77325 AWo 1 tC67493 BE083742 BE067470 AW894935 BE082529AI248811 BE179917 BE002200 

AW607506 AW392889 AW894560 AW381 360 AI904206 AW863533 C00609 AW381372 BE082530 AW8981 20 BE075323 AW392799 AW601 420 
AI695314 BE083790 AW85856S AW945550 BE ! 771 53 AW970506 BE350419 AI906919 AW360794 AI90691 7 AW885979 AW794240 AW945566 
AI688683 AI688694AW009660AW601421 AW360793 BE065524 BE083901 AW369847 AW381871 AW935435AW664582AW877775 AW838449 
BE180456AW858501 BE13Q464 AI371 1 S3 AA778231 AI174991 BE011720AW877776AW877800AW877795T19900AW866365AW898099 
BE011715BE167842BE011718BE011724AW363639AW878658AW878662AW894887BE082356AW389211AW804286AW610312AI904717 
AW610318AW996909AW610296AW901923AW880003AI762171 AW062582AW368713AW062593AW176663AW842064AW842089 
AW842095 AI243049 AW902074 AW052592 AW1 76664 AW751 692 BE037703 AI907439 BE009686 BE1721 1 5 BE077030 AW608556 AW835577 
AI909628 BE077029 AW1 76241 BE077552 BE160370 BE1 60288 AW835656 AW606765 AW606770 AW835678 AW606758 AW606778 AI907484 
BE172821 AW606768 AW999517 AW844165 BE171738AW751683 AW610493 BE177484 BE177487 AA090510 AW844117BE173367 AW999878 
AI124870BE163472AW841823AW379762AW8S3297AI290296BE089132AA610287AW176676AW607622BE172639AW893232AA329629 
BEQ89Q08BE178350BE178214BE063291 AW820235 AW999653 BE089436 BE173126 BE171775 BE185737 AA558280 AI174840 AW9991 12 
BE218391 BE1 72734 BE1 78021 BE172738 BE173324 AW603494 AL036722 R38192 R60905 H53721 H41052AL037917 R37795AW998972 
AA767139 AW044272 H50689 AA768399 AA767764 AI087888 H44202 BE222792 N90597 W81 396 N9061 5 AI935353 BE501 1 68 F1 0945 
AW118215 AI970480 AI627541 AW236081 AA574090AI527652 AI681913 A1759983 N69591 N69276 BE467722 AW392780 BE172467 H92861 
AI524921 F02989 Z39328 F02705 FC1414 T88678 AI215165 H87220 AW374781 

437386 5541.2 BM476605 BI545004 BI834636 BG1 12453 BI199049 BG1 12759 M149846 H97925 AA306121 AA313204 W52451 AI734997 M931168 AA429766 

N4791 3 AA584321 BF940241 AI083648 A'08941 0 AI347705 AI343661 Al 186232 AI839031 N98464 AI820039 AI459034 AV652512 M622990 
BE857200 M932998 AA740573 AI826264 AABB5583 AI344550 A'C'27349 AI056087 AA442777 AA603724 AA873347 AI056717 AI092185 AI032895 
AA535639 BF806025 BF806061 BF305985 EF746099 BF746097 AI309259 AI597603 BF806066 A1090653 AI129205 AI248410 H72993 AW615341 
BF805990 BF805982 M993819 T34373 T35604 H56242 AA648145 T35907 BF808691 N94015 AV703438 BG774276 H82341 R76371 

439403 4937.10 AK056692 AF086220 AI375065 AA284293 W32566 AW797961 AA960897AA504145 

409245 3199.2 AF030234BC017465 BGOOB526AW505550 BM460141 N47324AA361037AA321632 N45606 AV752793 AV6571 16 AA296632 AU1 37357 

AW467027 A1742080 AI624350 H58206 AA47851 8 AW439SS7 AW393555 AW393523 AI559753 AI808732 R66856 H01374 B1257369 BI259030 
AW960845 BM466252 AW956813 BE768647 AVB53853 BM055248 BF3 

DFC93291 AW021929 H22650AA459715 BG496341 BE697763 BI254239 BG499543 H4294o Elm" 8i H08f 1 H87 t H37599 BF691752 
BE768511 BG940948 W37195 BF372041 BE883795 BF372082 BF367329 BF909744 AW966003 AV714014 BI492868 BI495144 AA921845 
A1693426 A1652117 AI435449 NM7325 AI434429 AA573137 AI133429 AI829962 AI332526 BF513937 A11B9G61 AI221962 AI37BQ34AW1 18897 
AW665247 AW340077 N41605 AA478519 AA463875 AI658260 AA46337S A!292305 BE045947 AA971089 AI125820 BG940947 AI080245 
AA884954 A11 25702 AI382934 AA931 835 AI358631 AW439905A!027833AI 399648 AI014533 AA347851 AA73B261 N67374N69081 AI768667 
M948472AIB19214M293133AI13S725M88S214AI222635BI495U3N23505N48812AA769041 AI492769 D56771 AA095911 BE222062 
D56772AW372255EM054985 D12455 BG534562 AW003511 H87486 H42B80AW190293BF594697BF377611 H22043 BI255749 BI492848 
H16217H21980 H22651 H88179 H87354 H 44052 H25165 H44128 

448782 34980.1 AB018301 AL050295 BF513128 AW385060 AL551708 AI352542 AI829703 A1819388 AW629019 AW073189 AW273857 AW1 18768 A1453845 

AI452494 AA885341 AI057144 AA904347 AI423547 AW263913 AI094774 AI434419 AI039546 AI002491 AI240412 Z25099 AA9951 78 AW050649 
AW026140 AI796309 AI58401 2 BE1 66666 A1767991 AI309041 AA724059 AI695284 AI245095 T63971 Z40627 BE166681 BG570071 BF921915 
BI562702 BG506502 AV658066 R48378 AA121543 AI096938 AA618131 H40993 R48277 AI352281 BG540253 BG538901 N95226 AI356752 
AI221152Z28777 R16574 AW965449 AA044116 AW797518 BI010405 AA044288 AI093508 BE140159T64039BG433106 AW130367 AW130361 
N73937 AA1 27680 AW044037 AI096437 AA384077 BF941 499 T93764 BG003285 

422960 11862.2 BF090249AW954947AW890487AI305236 D50845 D60537AA825429W36294AW890410AW088235 BF740240AA448709AI350279 AA879119 

M319510 BE702077 BE699015 BE702046 AW901293T99319 D81708 BF475488 D60383 D81751 BE599260 



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey    Ref      Strand  Nt_position
401466  6682292  Plus
401192  9719502  Minus
402474  7547175  Minus   53525-53628,55755-55920,57530-57757
402145  8018280  Plus    113086-114800
406230  4760409  Plus    71716-72515
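
The Nt_position column above lists predicted exons as comma-separated start-end ranges. As an illustration only, the following minimal Python sketch shows one way such a field might be split into exon intervals. The function names `parse_nt_position` and `total_exon_length` are hypothetical, and treating both coordinates as inclusive is an assumption that the table itself does not state.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split an Nt_position field into (start, end) integer pairs.

    Hypothetical helper: the field is assumed to be comma-separated
    'start-end' ranges, as in the rows of the table above.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate empty fields (rows with no listed positions)
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)


# Example using the Nt_position value listed for Pkey 402474
exons = parse_nt_position("53525-53628,55755-55920,57530-57757")
print(exons)
print(total_exon_length(exons))
```

Rows with no listed positions, such as Pkey 401466, yield an empty list under this sketch.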



ExAccn: Exemplar 
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1 : Ratio of seminomatous testicular . 
424247


X14008 


Hs.234734 


lysozyme (renal amyloidosis)


44.80 


438091 


AW373062 




nuclear receptor subfamily 1, group I, m




412948 


BE243313 


Hs.334851 


LIM and SH3 protein 1 


34.90


417088 


M54915 


Hs.81170 




31.10 


430542 


AI557486 


Hs.119122 


ribosomal protein L13a


29.60 


412915 


AW087727 


Hs.74823 


NM_004541:Homo sapiens NADH dehydrogenas 


29.10 








Rho GDP dissociation Inhibitor (GDI) bet 




406820 


AI223958 


Hs.108124 


ribosomal protein S4, X-linked


28.13


433800 


AI034361 


Hs.135150 


lung type-I cell membrane-associated gly


28.10 




AI920965 


Hs.77961 


major histocompatibility complex, class 


27.85 


416680 


AW245540 


Hs.79516 


brain abundant, membrane attached signal 


27.70 


446525 


AW967069 


Hs.211556 


hypothetical protein MGC5487 


27.20 


432359 


AA076049 


Hs.274415 


Homo sapiens cDNA FLJ10229 fis, clone HE


26.30 


422578 


AF239666 


Hs.1545 


caudal type homeo box transcription fact 


25.80 


429978 


AA249027 




ribosomal protein S6 


26.40 


418870 


AF147204 




chemokine (C-X-C motif), receptor 4 (fus 


24.60 


440207 


AI371978 


Hs.128326 


ESTs 


24.50 


425543 
442562 


R23313 
BE379584 


Hs.334895 


ribosomal protein L10a 


24.30 
24.10 


444562 


AA186715 


Hs.336429 


dolichyl-diphosphooligosaccharide-protei
RIKEN cDNA 9130422N19 gene 


24.05 


413063 


AL035737 




chitinase 3-like 1 (cartilage glycoprote 


24.00 


409038 


T97490 


Hs.50002 


small inducible cytokine subfamily A (Cy 


22.90 


420367 


AA259090 


Hs.257028 


ESTs 


22.90 


40685S 


AW515336 


Hs.29797 


ribosomal protein L10 


22.77 




M69043 


Hs.81328 


nuclear factor of kappa light polypeptid


22.75 


412636 


NM_004415




desmoplakin (DPI, DPII)


22.40 


420676 


AI434780 


Hs.4248 


vav 2 oncogene 


22.10 


440869 


NM_014297 


Hs.7486 




21.40 


446627 


AI973016 


Hs.15725 


hypothetical protein SBBI48 


21.20 


410315 


AI638871 


Hs.17625 


Homo sapiens cDNA: FLJ22524 fis, clone H


21.10 


420754 


W79431 


Hs.346911 


ribosomal protein L22 


20.98 


435538 


AB011540 


Hs.4930 


low density lipoprotein receptor-related 


20.90 


440440 


Z28925 


Hs.7188 


sema domain, immunoglobulin domain (Ig),


20.80 


429490 


AI971131 


Hs.23889 


ESTs, Weakly similar to ALU7_HUMAN ALU S 


20.40 


425769 


U72513 


Hs.159486 


Human RPL13-2 pseudogene mRNA, complete 


19.50 


422714 


AB018335 


Hs.119387


KIAA0792 gene product


19.15 


430253 


AK001514 


Hs.236844 


hypothetical protein FLJ10652 


13.00 


413787 


AI352558 




tyrosine 3-monooxygenase/tryptophan 5-mo 


18.50 


452874 


AK001061 


Hs.30925 


hypothetical protein FLJ10199


18.50 


430255 


AK000703 


Hs.323822 


Homo sapiens mRNA for KIAA1551 protein, 


18.00 


432606 


NM_002104


Hs.3066 


granzyme K (serine protease, granzyme 3; 


17.90 


448588 


AI970276 


Hs.156905 


KIAA1676 


17.70 


444784 


012485 


Hs.11951 




17.50 


428782 


X12830 


Hs.193400 


interleukin 6 receptor




414092 


Z14244 


Hs.75752 


cytochrome c oxidase subunit VIIb


17.20


425945 


AW410669 


Hs.164280 


solute carrier family 25 (mitochondrial 


17.15 


420759 


T11832 


Hs.127797 


Homo sapiens cDNA FLJ11381 fis, clone HE




440528 


BE313555 


Hs.7252 


KIAA1224 protein 


17.06


410143 


AA188169 




KIAA1191 protein 


17.05 


421181 


NM_005574


Hs.184585 


LIM domain only 2 (rhombotin-like 1) 


16.90 


426033 


AW952712 


Hs.126712 


ESTs, Weakly similar to AF191020 1 E2IG5 


16.70 


429183 


AB014604 


Hs.197955 


KIAA0704 protein 


16.70 


450937 


R49131 




ATP-dependant interferon response protei


16.60 


449571 


AW016312 


Hs.200266


ESTs 


16.50 


432730 


AI056520 


Hs.131358 


ESTs 


16.20 




zinc finger protein 6 (CMPX1) 
v-erb-b2 avian erythroblastic leukemia v
carbohydrate (N-acetylglucosamine-6-O) s
small inducible cytokine subfamily B (Cy 
regulator of G-protein signalling 10 
glycine dehydrogenase (decarboxylating; 
cadherin 1, type 1, E-cadherin (epitheli
glutaredoxin (thioltransferase) 
major histocompatibility complex, class 
small inducible cytokine A4 (homologous 
KIAA1265 protein 
hypothetical protein AL110115
Human clone 23933 mRNA sequence 
ESTs 

hypothetical protein MGC4248 
N-MYC oncogene 

protein tyrosine phosphatase, receptor t 



Hs.303649 small inducible cytokine A2 (monocyte ch 
ESTs 

Hs.9195 Homo sapiens cDNA FLJ13698 fis, clone PL 

Hs.189095 similar to SALL1 (sal (Drosophila)-like

Hs.57079 Homo sapiens cDNA FLJ13267 fis, clone OV

Hs.155530 interferon, gamma-inducible protein 16

Hs.170328 moesin 
418151
453020 
410275 



447735 
410325 
445817 
418299 
445863 
454413 
418460 






AA866115 

H87136 

AA864238 

U85658 

NM_004862 

AU077333 



Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE



Hs.83583 
Hs.31422 
Hs.61796 
Hs.76507 



AA490855 Hs.39429 ESTs 



S17 

actin related protein 2/3 complex, subun
Homo sapiens mRNA; cDNA DKFZp434M229 (fr
transcription factor AP-2 gamma (activat 
LPS-induced TNF-alpha factor 
erythrocyte membrane protein band 7.2 (s 
' " i(PRKA)am - 



AW975959 
AA911568 
AW361666 
AK000737 
AA775268 



M26315 
AI634046
X94630 

AA340277 
BE077546 
AA133590 



Hs.737 
Hs.337986 
Hs.107513 
Hs.279860 
Hs.49500 
Hs.237480 
Hs.6127 
Hs.62264 
Hs.13340 
Hs.83968 
Hs.13396 
Hs.40092 
Hs.85258 
Hs.157313 
Hs.3107 



431639 
444381 
418509 
437374 



AL359571 

X78992 

AI096837

AL035588

AW205632 

AL161961 

AW152225

AI929700

M382104 

AI807519 

AI218918 

U97502 

AW958934 

M412205 

AK001575 



449246 
424321 
407179 
413497 
430068 
446795 
435522 
451864 
419490 
419904 



KIAA0746 protein 
hypothetical protein FLJ20730
Homo sapiens cDNA: FLJ23020 fis, cli 
KIAA0937 protein 



2 (antigen CD18(p95],ly 
Homo sapiens clone 25028 mRNA sequence 
PNAS-123 

CD8 antigen, alpha polypeptide (p32) 
ESTs 

CD97 antigen 
collagen, type XVII, alpha 1 
Homo sapiens cDNA FLJ20167 fis, clone CO
Hs.31447 ESTs, Moderately similar to A46010 X-lin
Hs.250857 calcium/calmodulin-dependent protein kin 

NM_004079:Homo sapiens cathepsin S (CTSS 
Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S 
Hs.111334 ferritin, light polypeptide
Hs.2248^ small inducible cytokine subfamily B {Cy 

PDZ-LIM protein mystique 
ESTs 

platelet/endothelial cell adhesion molec 
phosphoprotein associated with GEMs 
ESTs, Weakly similar to S64054 hypotheti
ATP synthase, H+ transporting, mitochondr
ninein (GSK3B interacting protein)
butyrate response factor 2 (EGF-response
ESTs, Weakly similar to RB8B_HUMAN RAS-R
MyoD family inhibitor
ESTs 

KIAA1554 protein 

ESTs, Weakly similar to I38022 hypotheti 
endosulfine alpha 

lectin, galactoside-binding, soluble, 9 
Homo sapiens cDNA FLJ13694 fis, clone PL
KIAA0854 protein 

butyrophilin, subfamily 3, member A2 
Homo sapiens cDNA: FLJ21897 fis, clone H
ESTs 



W74048 

AA206465 

BE177661

AA464964 

AI797713 

N64214 

N20370 

NM_006144 



Hs.153934
Hs.78146 
Hs.266175 
Hs.283713 
Hs.85539 
Hs.44054 
Hs.78909 
Hs.21349 
Hs.153203
Hs.211198
Hs.17767
Hs.165909
Hs.111680 
Hs.81337 
Hs.104520 
Hs.30209 
Hs.87497 
Hs.173108 
Hs.140996 
Hs.9536 

Hs.71404 
Hs.15119 
Hs.23363 



ESTs 
KIAA1361 protein 
hypothetical protein FLJ10983
lymphocyte-specific protein tyrosine kin 
thymosin, beta 4, X chromosome 
gb:RC1-HT0598-02030O-011-h02 HT0598 Homo 
gb:zx80f10.s1 Soares ovary tumor NbHOTH 
ESTs 

synovial sarcoma translocation gene on c 
ESTs 

granzyme A (granzyme 1, cytotoxic T-lymp



447534 
419223 
423573 
422960 
412025 
408784 
417407 
432409 
440273 
407110 
408161 

434280 BE00539E 
434524 AA635931 



X60111 Hs.1244 CD9 antigen (p24) 

BE003054 Hs.1695 matrix metalloproteinase 12 (macrophage 

AW890487 cadherin 13, H-cadherin (heart) 

AI827451 Hs.24143 Wiskott-Aldrich syndrome protein interac

AW971350 Hs.63386 ESTs 

AA923278 Hs.290905 ESTs, Weakly similar to protease [H.sapi 

AA806538 Hs.130732 KIAA1575 protein

AI805392 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L

AA018042 Hs.252085 Prader-Willi/Angelman syndrome-5

AW952912 Hs.300383 hypothetical protein MGQ032 

BE005398 gb:CM1-BN0116-150400-189-h02 BN0115Homo 

Hs.249716 ESTs 
450294 H42587



434442 
440268 
447519 
433156 
410730 
436823 
442806 
433271 
445577 
408437 



414191 
411979 
414829 



AW129401 
AA737415 
BE270030 



AW368860 
AW749865 
AW294522 



AW957744 

NM_005904

AW971146 

AI697901

AW073971 

AL036596 

AV653729 
W05391 
X63629 
AA805634 



Homo sapiens, clone IMAGE:3677185, mRNA
ESTs 

Homo sapiens cDNA: FLJ22539 fis, clone H 
DnaJ (Hsp40) homolog, subfamily B, membe 
ESTs, Weakly similar to I38022 hypotheti
ESTs 

nucleolar protein family A, member 3 (HI 
cytoplasmic polyadenylation element bind 
lacrimal proline rich protein 
MAD (mothers against decapentaplegic, Dr



ESTs, Weakly similar to KIAA1204 pre 
A kinase (PRKA) anchor protein 2 
decay accelerating factor for co 
CGI-44 1 



ie dehydrog 



AI219083 

Y11312 

AW955632 

NM_004131 

AW002834 



ir subfamily 1, group I, m 
cadherin 3, type 1, P-cadherin (placenta
Homo sapiens mRNA; cDNA DKFZp547M072 (fr
PDZ and LIM domain 1 (elfin)
retinoblastoma-binding protein 5
pleckstrin 
ESTs 

ESTs, Moderately similar to ALU8_HUMAN A
phosphoinositide-3-kinase, class 2, beta
E T \ea I, -I 51 60 p-olire r 
granzyme B (granzyme 2, cytotoxic T-lymp
ESTs 

retinoic acid induced 3 



408360 
426827 
453716 
418840 
434649 



Hs.80624 hypothetical protein MGC2560 

Hs.44344 hypothetical protein FLJ20534 

Hs.172665 methylenetetrahydrofolate dehydrogenase

Hs.152675 ESTs



AA002008 
AB007937 
AA386192 



Hs.165390 ESTs, Highly sir



436797 
420099 
417640 



409327 
411960 
434159 
447500 



Hs.257631 

Hs.334477 

Hs.95140 

Hs.82353 

Hs.14555 

Hs.3844 

Hs.31314 

Hs.7149 

Hs.3463 

Hs.111577 

Hs.18106 

Hs.193580 



Hs.18103 



ESTs 



nscDNAFU11903fis,c!< 



e HE 



hypothetical protein MGC14879 
KIAA0189 gene product
protein C receptor, endothelial (EPCR)
Homo sapiens cDNA: FLJ21513 fis, clone C
LIM domain only 4
retinoblastoma-binding protein 7
Homo sapiens cDNA: FLJ21950 fis, clone H
ribosomal protein S ~ 



collagen, type IX, alpha 3 




447674 
413420 
422451 
437134 
408912 
419839 
431427 
437469 



AI362533 
AW298741 
AI761902 
AW137772 
AW861622 
AI718702 
BE270640 
AW410235 
AA310753 
AA349944 
AB011084 
U24577 
AK000401 
AW753112 
AI341227



immunoglobulin heavy constant mu 
transmembrane protein 4 
ubiquinol-cytochrome c reductase hing 



Hs.185980 ESTs

Hs.108646 Homo sapiens cDNA FLJ14934 fis, clone PL

Hs.279930 major histocompatibility complex, class 

Hs.19192 cyclin-dependent kinase 2 

Hs.75348 proteasome (prosome, macropain) activato 

Hs.42491 ESTs, Weakly similar to S65657 alpha-1C-

Hs.45915 ARP2 (actin-related protein 2, yeast) ho

Hs.48924 KIAA0512 gene product; ALEX2

Hs.93304 phospholipase A2, group VII (platelet-ac 

Hs.252748 Homo sapiens cDNA FLJ20394 fis, clone KA

Hs.15514 hypothetical protein MGC3260 

Hs.157106 ESTs 
AA464839


Hs.292566 


hypothetical protein FLJ14697


441612 


AI802629 


Hs.113660


Homo sapiens cDNA FLJ11631 fis, clone HE


408067 


BE244580 


Hs.342307 


hypothetical protein FLJ10330 


434963 


AW974957 


Hs.288719 


Homo sapiens cDNA FLJ12142 fis, clone MA


437103 


AW139408 


Hs.152940


ESTs 




AI184717 




ESTs 




AI089660 


Hs.323401 


dpy-30-like protein 




AI360167


H I 2774 


ESTs 


436075 


BE090176 


Hs.179902


transporter-like protein 


450139 


AK001838 




serum/glucocorticoid regulated kinase


423905 


AW579960 


Hs.135150


lung type-I cell membrane-associated gly


406819 


AA908472 




gb:og82a10.s1 NCI_CGAP_Ov8 Homo sapiens 


407719 


AW963866 


Hs.44021 


Homo sapiens mRNA for FLJ00065 protein. 




AA278921 


Hs.1908


proteoglycan 1, secretory granule 




AW958264 


Hs.103832 


similar to yeast Upf3, variant B


422616 


BE300330 


Hs.118725


selenophosphate synthetase 2 








zinc finger protein 137 (clone pHZ-30) 


427254 


AL121523 


Hs.97774 


ESTs 




AF117947


Hs.174795


PDZ domain-containing guanine nucleotide 




AW502384 




gb:UI-HF-BR0p-aka-M2-0-Ul.r1 NIH_MGC_5 




BE072881 




gb:RC2-BT0548-200300-012-e09 BT0548 Homo




AI801869 


Hs.127982


ESTs 




BE614387 


Hs.333893 


c-Myc target JP01 


412645 


AW444433 


Hs.136061 


Homo sapiens, Similar to hypothetical pr




AI080042 


Hs.180460


ribosomal protein S24 






Hs.35254 


hypothetical protein FLB6421 




AA830856 


Hs. 29808 


Homo sapiens cDNA: FLJ21122 fis, clone C




AB032973 


Hs.12461 


LCHN protein 




X72755 


Hs.77367 


monokine induced by gamma interferon 


445055 


BE512856 


Hs.109051 


SH3 domain binding glutamic acid-rich pr 






Hs.63042 


DKFZp564J157 protein


418596 


AW959433 


Hs.326290 


hypothetical protein FLJ12581 






Hs.146233


Homo sapiens cDNA: FLJ22130 fis, clone H


418134 


AA397769 


Hs.86517 


ESTs 


424768 


AA353895 


Hs.152983


HUS1 (S. pombe) checkpoint homolog 


443303 




Hs.9216 


caspase7, apoplosis te'.A j r pr 


411852 


M528140 


Hs.107515 


ESTs, Weakly similar to T00329 hypotheti


451838 


AW005866 


Hs.193969 


ESTs 


425367 


BE271188 


Hs.155975


protein tyrosine phosphatase, receptor t 




BE620712 


Hs.33026 


hypothetical protein PP2447 


401466 






vesicle-associated membrane protein 4 


457073 


AA233210 


Hs.179943 


ribosomal protein L11 


412093 


BE242691 


Hs.14947 


ESTs 


442492 


AA528489 


Hs.234518 


ribosomal protein L23 


449971 


AA807346 


Hs.288581 


Homo sapiens cDNA FLJ14296 fis, clone PL


431773 


BE409442 


Hs.268557 


pleckstrin homology-like domain, family 


416401 


N80139 


Hs.268916 


ESTs 


426501 


AW043782 




ESTs 



417535 
418117 
417558 
424541 
447341 
407949 



Hs.155111 hypothetical protein FLJ14428

Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 

Hs.54443 chemokine (C-C motif) receptor 5 

ESTs, Weakly similar to unnamed protein

Hs.1742 IQ motif containing GTPase activating pr

NM_001241 Hs.155478 cyclin T2

AI085561 Hs.155321 serum response factor (c-fos serum respo

NM_015625 Hs.191381 hypothetical protein

W28673 Hs.105747 serine carboxypeptidase 1 precursor prot

AA608955 Hs.109653 ESTs 

Hs.13797 ESTs 

Hs.191482 ESTs 

Hs.83496 linker for activation of T cells 

Hs.82280 regulator of G-protein signalling 10 

Hs.180559 ESTs, Weakly similar to A56194 thromboxa 

Hs.18142 arrestin, beta 2

Hs.247057 ESTs, Weakly similar to 2109260A B cell



NM_000579
AW204069 
L33075 



AA203569 
AI922013 
AF045229 
AW392551 



W21874 

442460 NMJ14135 Hs.8345 
428818 AI131291 
453932 AW006303 
415221 W07418 



Hs.78225



441384 

421684 BE281591 
441224 AU076964 



Hs.24724 MFH-amplified sequences with leucine-ric
Hs.288660 Homo sapiens cDNA: FLJ22182 fis, clone H
Hs.106768 hypothetical protein FLJ10511



422693 BE300073 Hs.279860 



hypothetical protein 

ESTs, Moderately similar to HERC2 [H.sap 
Homo sapiens cDNA FLJ11533 fis, clone HE
tumor protein, translationally-controlle 
M082118
M301867 
AA310974 



Hs.102737
Hs.76224
Hs.156828






414662 
418113 
406870 
416003 
445493 



EGF-containing fibulin-like ex 
Homo sapiens cDNAFL""" 
POU domain, d " 

family 22 (organic cation 

in helicase DNA binding protei 
adenylate cyclase 7 
interleukin 10 receptor, beta 
Homo sapiens cDNA FLJ11980 fis, clone HE 
caspase 8, apoptosis-related cysteine pr




AW291196 
AW503116 
AI432224 



Hs.151738
Hs.178391 
Hs.19631 
Hs.92195 
Hs.301819 



Hs.47094 ESTs 



matrix metalloproteinase 9 (gelatinase B
ribosomal protein L44 
ESTs, Weakly similar to I38022 hypotheti 
ESTs 

zinc finger protein 146 
ribosomal protein L6 
ribosomal protein L10 



BE244076 
AA226392 
AW139585



451658 
441623 
416926 
425190 
441244 
421305 
430504 
422310 
408605 
433891 
406542 
406858 
408331 
432729 
439451 
455263 
441321 
429083 
406806 
416987 



457752 
435511 
451589 
439979 
412528 
424875 
426981 

449961 
416759 
415082 



AF161486 
AI418466
AW195351 
AA315805 



Hs.159578 
Hs.179943 
Hs.12708 
Hs.79172 
Hs.14601 
Hs.71913 
Hs.94769 



Hs.250520 ESTs, 



KIAA1702 protein 

Homo sapiens cDNA: FLJ21814 fis, clone H
AT-hook transcription factor AKNA
ribosomal protein L11



Hs.155079
Hs.184052 
Hs.324830 



RAB23, member RAS oncogene family 



. lilar to I38022 hypol 

desmoglein 2 
HT018 protein

protein phosphatase 2, regulatory subuni 



AF086270 

H17182 

Y09397 

AW088535 

D86957 

BE618571 



AA424791 
AW600291 
AI123478 
AI187945
AL044675 
AI459554 
AW265634 
AK000978 
AA160000 
AB028962 
AJ271671 



Hs.29797 
Hs.44229 
Hs.130732 
Hs.278554 



eria toxin resistance protein requi 
Homo sapiens, clone MGC:12617, mRNA, com
cytochrome P450, subfamily IIS, polypept
T-cell, immune regulator 1 
gb:no97h03.s1 NCI_CGAP_Pr2 Homo sapiens 
C19000728':gi|12585552|sp|Q9Y2Q1|Z257_HU 
ribosomal protein L10 
ual itj phosphatase 12 
hypothetical protein FLJ20285 
heterochromatin-like protein 1
Homo sapiens cDNA FLJ14028 fis, clone HE
Hs.7771 B-cell associated protein 
Hs.227817 BCL2-related protein A1 

ribosomal protein, large, P0 
Hs.80712 KIAA0202 protein 
Hs.429 ATP synthase, H transporting, mitochondr
Hs.193163 bridging integrator 1 
ribosomal protein L6 
Hs.285643 Homo sapiens cDNA FLJ14364 fis, clone HE 
Hs.189046 ESTs 

Hs.5734 meningioma expressed antigen 5 (hyaluron
Hs.6823 hypothetical protein FLJ10430
Hs.32112 ESTs 
Hs.199310 ESTs 



in FU10116 

Hs.137396 ESTs, Weakly similar to JC5238 galactosy
Hs.301552 KIAA1039 protein 
Hs.77274

Hs.132906 19A24 protein 

Hs.155029 ESTs

Hs.77573 uridine phosphorylase 

Hs.169470 dystrophin (muscular dystrophy, Duchenne

Hs.283722 GTT1 protein 

Hs.90078 nucleotide-sugar

gb:EST383301 MAGE resequences, MAGL Homo 

Hs.270823 ESTs, Weakly similar to S65657 alpha-1C- 




AI796870 Hs.54277 DNA segment 



Homo sapiens cDNA FLJ13485 fis, clone PL
DNA segment on chromosome X (unique) 992
NM_006662*:Homo sapiens Snf2-related CBP



650 



WO 03/042661 



PCT/US02/36810 



430968 AW972830 



Hs.118659 ESTs
Hs.268814 ESTs

Hs.112110 mitochondrial ribosomal protein L42
Hs.170114 KIAA0061 protein 

Hs.306292 Homo sapiens mRNA; cDNA DKFZp564F133 (fr 
gb:np86c01.s1 NCI_CGAP_Thy1 Homo sapiens 
gb:UI-H-BW0-ajp-g-09-0-Ul.s1 NCI_CGAP_Su 

Hs.131175 ESTs

Hs.119172 ESTs 

Hs.294110 ESTs 

Hs.89751 membrane-spanning 4-domains, subfamily A
Hs.260523 synovial sarcoma, translocated to X chro
Hs.137003 ESTs 

Hs.63489 protein tyrosine phosphatase, non-recept 

gb:og83g12.s1 NCI_CGAP_Ov8 Homo sapiens 
Hs.75643 nuclear factor (erythroid derived 2), 45 
1 14366 pyrroline-5-carboxylate synthetase (glut 
• 1, protein (NM23B) 
Hs.259855 elongation factor-2 kinase
Hs.327 interleukin 10 receptor, alpha
Hs.241535 apolipoprotein L, 3 
Hs.186704 ESTs 

Hs.116252 ESTs, Moderately similar to ALU1_HUMAN A
Hs.172318 ESTs 

Hs.287727 hypothetical protein FLJ23132
Hs.326159 leucine rich repeat (in FLII) interactin

gb:EST384925 MAGE resequences, MAGL Homo 
Hs.267923 uncharacterized hypothalamus protein HTO 
Hs.133512 ESTs 
Hs.128619 ESTs 

Hs.21893 hypothetical protein DKFZp761N0624 
Hs.79428 BCL2/adenovirus E1B 19kD-interacting pro 
Hs.7312 ESTs 

actinin, alpha 4 
Hs.143688 Homo sapiens cDNA: FLJ23031 fis, clone L



Hs.13137



UV radiation resistance associated gene
sin L13a 



Hs.41639 programmed cell death 2 

Hs.130350 ESTs 

Hs.310286 ESTs 

Hs.14158 copine III 

Hs.342389 peptidylprolyl isomerase A (cyclophilin

gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 

Hs.44672 hypothetical protein FLJ10470

Hs.54593 ATP synthase, H transporting, mitochondr 

Hs.94499 ESTs 

Hs.192013 ESTs 

Hs.173734 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.6947 HSPC069 protein 

Hs.114104 ESTs 

Hs.126705 ESTs 

Hs.35304 Homo sapiens cDNA FLJ13655 fis, clone PL

Hs.337460 ESTs, Weakly similar to A47582 B-cell gr

Hs.8594 KIAA1191 protein

Hs.11183 sorting nexin 2 

Hs.332938 hypothetical protein MGC5370 

gb:yh88b01.s1 Soares placenta Nb2HF Homo

Hs.200866 ESTs, Moderately similar to ALU7_HUMAN A 

Hs.85092 thyroid hormone receptor interactor 11

Hs.271778 ESTs, Weakly similar to I38022 hypotheti

Hs.101150 Homo sapiens, clone IMAGE:4054156, mRNA,



Hs.5662 
Hs.267912



Hs.22941
Hs.178470
Hs.118078



guanine nucleotide binding protein (G pr
ESTs, Weakly similar to A47582 B-cell gr
ESTs, Weakly similar to ALU7_HUMAN ALU S
gb:te64e10.x1 Soares_NFL_T_GBC_S1 Homo s
Eos Control 

immediate early response 3 
DKFZP434D193 protein 
T cell receptor gamma locus 
KIAA1363 protein 
hypothetical protein FLJ22662
ESTs 



Hs.85226 
Hs.118787
Hs.5472 
Hs.78877 



myeloid cell leukemia sequence 1 (BCL2-r 
lipase A, lysosomal acid, cholesterol es 
transforming growth factor, beta-induced
hypothetical protein FLJ20173 
inositol 1,4,5-trisphosphate 3-kinase B 
AA639388




gb:nq88b06.s1 NCI_CGAP_Co9 Homo sapiens 






AB008681 




activin A receptor, type IIB




410323 






Homo sapiens cDNA: FLJ22844 fis, clone K 










actin related protein 2/3 complex, subun 


'..31 




AW296219




RAB7, member RAS oncogene family-like 1


4.31 








zinc finger protein 304 










KIAA1077 protein 






BE144608 




',30 






Hs.112497


Homo sapiens cDNA: FLJ22743 fis, clone H








Hs.240770 


nuclear cap binding protein subunit 2, 2 






AA811465 


Hs.123375




4.30 




BE048433 










AI990790 


Hs.188614 




4.80




AA082650 




Homo sapiens cDNA FLJ12521 fis, clone NT 


4.30 


AA865818 




ESTs, Weakly similar to S71886 Ste20-lik 


4.78 






Hs.122910 




4.77 




AA382169 




N-myc (and STAT) interactor 


4.77 








kangai 1 (suppression of tumorigenicity


4.74 




AB011110 




GTPase activating protein-like 


4.74 




AI929374 




Src-like-adapter 


4.74 




AK001567 




Homo sapiens cDNA FLJ10705 fis, clone NT


4.73 




AA687361 






4.71 




AA085803 




ESTs, Moderately similar to I78885 serin 


4.70 




BE544475 






4.70 








ESTs, Weakly similar to ALU1_HUMAN ALU S


4.70 




AA233056 






4.70 








ESTs 






AA878324 






4.70 




AF075009 




gb:Homo sapiens full length insert cDNA 


4.70 








ESTs, Weakly similar to I38022 hypotheti 












4.70 








hypothetical protein FLJ00052 




444601 






chromosome 6 open reading frame 9 






AI094221 


Hs.135150 


lung type I cell membrane-associated gly 


4.65 








Homo sapiens cDNA FLJ14387 fis, clone HE 






AA724811 


Hs.334791


Homo sapiens cDNA FLJ14609 fis, clone NT


4.55 






Hs.172550


polypyrimidine tract binding protein (he 


4.64 




AF026849 


Hs.150922


BCS1 (yeast homolog)-like 


4.64 




NM_000169 


Hs.69089 


4.63 








KIAA0456 protein 










cAMP responsive element binding protein- 






NM_003358 




ESTs, Moderately similar to CEGTHUVAN C 










centromere protein F (350/400kD, mitosin 






BE379794 


Hs.159651


hypothetical protein 




AW513333




ribosomal protein, large, P1 


4.60 








major histocompatibility complex, class 


4.60 








antigen identified by monoclonal antibod 






AI347128






4.60 








i |j on induced, hopati C-ess I 






AW612022 






4.60 




AW957965 




Homo sapiens, clone IMAGE:3532168, mRNA


4.60 




AW572659 




hypothetical protein dJ434014.3 


4.60 




AA568548 










AL359620 




hypothetical protein DKFZp762P2111


4.60 
















Hs.146907


ESTs, Weakly similar to 2004399A chromos 






AA010319 




4.60 








major histocompatibility complex, class 


4.60 




NM_002462 




myxovirus (influenza) resistance 1, homo










Fc fragment of IgG, high affinity Ia, re


4.59 








ribosomal protein L39 






AW058261 




ESTs, Weakly similar to ALU1_HUMAN ALU S 


4.56 




AA885699 




CGI-26 protein 














433339 






glioblastoma overexpressed 










hypothetical protein FLJ23316 








Hs.86682 


Homo sapiens cDNA: FLJ21578 fis, clone C


4.53 


423799 


AW026300 


Hs.132906 


19A24 protein 


4.53 


445093 


AI207197 




ESTs 


4.52 


428044 




Hs.301404 


RNA binding motif protein 3 


4.52 








High mobility group (nonhistone chromoso




414194 


BE175494 


Hs.75811 


N-acylsphingosine amidohydrolase (acid c 


4.50


427747 


AW411425 


Hs.1 80655 




150 


406745 


AW511970 


Hs.279860


tumor protein, translationally-controlle 


4.50 


407013 


U35637 


Hs.83870 


gb:Human nebulin mRNA, partial cds


4.50


407198 
414646 






gb:yv04a07.s1 Soares fetal liver spleen


4.50 


AA353776 


Hs.901 


CD48 antigen (B-cell membrane protein) 


4.50 


429687 


AI675749 


Hs.211608 


nucleoporin 153kD 


4.50 


436566 


BE545586 
AW293046 


Hs.278712 
Hs.255158 


Homo sapiens cDNA FLJ11074 fis, clone PL
ESTs 


4 60 


439971 
442485 


W32474 


Hs.301745 


RAP2A, member of RAS oncogene family 


4.50 


BE092285 


Hs.29724 


hypothetical protein FLJ13187 


4.50 


445873 


AA250970 


Hs.251946 


poly(A)-binding protein, cytoplasmic 1-1 


4.50 
450497
417497 
447667 
413856 



406742 
425095 
410342 
442333 



409604 
418866 



H64159 Hs.15328 ESTs
AW402482 Hs.82212 CD53 antigen

AL117611 Hs.19150 Homo sapiens mRNA; cDNA DKFZp564A2164 (f



U29615 
X89984 
U51166 



401846 
422303 
442358 



416754 
436671 
410079 
420150 
424723 
428931 
429109 
430280 
438330 
438962 
444794 
445100 



436138 
438915 
449217 



H07145 

AW137159 

U94362 

AA648712 

BE409813 

AA994979 

AL008637 

AA361258 

AW450572 



400440 
415049 
418304 
423180 
TABLE 56C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
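As a hedged illustration of how an entry in the nucleotide-position column can be read, the following sketch parses a "start-end" string into integer coordinates. The function names `parse_nt_position` and `exon_length` are hypothetical, and treating both endpoints as inclusive is an assumption; the legend does not state the coordinate convention.

```python
def parse_nt_position(text):
    """Split a 'start-end' string into a pair of integer coordinates."""
    start_text, end_text = text.strip().split("-")
    return int(start_text), int(end_text)


def exon_length(start, end):
    """Length of the span, assuming inclusive endpoints (an assumption)."""
    return abs(end - start) + 1


# Example using one of the position ranges listed in the table.
start, end = parse_nt_position("94760-94898")
print(start, end, exon_length(start, end))
```

A range whose end is smaller than its start may indicate a truncated entry or a reverse-strand feature; the sketch does not try to distinguish the two.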



Pkey 



Strand 



Nt_position

53526-5362 

113086-114800 

94760-94898 

28748-29023 



7712190 
7143420 
4156171 



TABLE 57 A: 703 genes upregulated in testicular cancer relative to normal I 

i normal body tissues that are likely to encode proteins amenable to module 
o, Ec Wfymeti - ' .irech p array. Gene expression data for ea 

?es'Fe"gTpkinase, , death-dornain, 



3 genes upregulated in teslicul; 

^^"Tnten^ niRt-J It .r I prodj ^^i^f^^^^^ 
Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar accession

UniGeneID: UniGene number

Pred.Prot.Domains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280).

UniGene Title: UniGene gene title

R1: 95th percentile of testicular cancer AIs divided by the 50th percentile of normal tissues AIs, where the 10th percentile of all normal tissue AIs was subtracted
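The R1 ratio defined in the legend can be sketched in code as follows. This is an illustrative reading only: the legend breaks off after "was subtracted", so the sketch assumes the 10th-percentile background is subtracted from both the numerator and the denominator, and it assumes linear interpolation between ranked values when computing percentiles. The names `percentile` and `r1_ratio` are hypothetical and do not come from the source, and the intensity values are made up.

```python
import math


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def r1_ratio(tumor_ais, normal_ais, all_normal_ais):
    """95th percentile of tumor AIs over 50th percentile of normal AIs.

    Assumption: the 10th percentile of all normal-tissue AIs is subtracted
    from both the numerator and the denominator as a background estimate.
    """
    background = percentile(all_normal_ais, 10)
    numerator = percentile(tumor_ais, 95) - background
    denominator = percentile(normal_ais, 50) - background
    if denominator <= 0:
        raise ValueError("background-corrected normal median must be positive")
    return numerator / denominator


# Made-up average intensities (AIs), for illustration only.
tumor = [100, 200, 300, 400, 500]
normal = [10, 20, 30, 40, 50]
print(round(r1_ratio(tumor, normal, normal), 3))
```

Under this reading, a large R1 means that the high end of the tumor intensities sits well above the typical normal-tissue intensity once a common background level has been removed.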



Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1

424687; J05070; Hs.151738; matrix metalloproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase); matrix metalloproteinase 9 (gelatinase B; 31.23
440119; AA865455; Hs.125331; ESTs, Moderately similar to unknown [H.sapiens]; ESTs, Moderately similar to unknown [H; 2-37
421241; X91817; Hs.102866; transketolase-like 1; transketolase-like 1; 26.89

431840; AA534908; Hs.2860; POU domain, class 5, transcription factor 1; POU domain, class 5, transcription facto; 25.03 
435918; AF263538; Hs.86232; growth differentiation factor 3; growth differentiation factor 3; 19.88 

432666; AW204069; Hs.351118; ESTs, Weakly similar to unnamed protein product [H.sapiens]; ESTs, Weakly similar to unnamed protein; 17.74
419556; U29615; Hs.91093; chitinase 1 (chitotriosidase); chitinase 1 (chitotriosidase); 17.64

452838; U65011; Hs.30743; preferentially expressed antigen in melanoma; preferentially expressed antigen in mela; 17.35
117886; AA214584; ; ESTs; ESTs; 15 

412265; AA101325; Hs.86154; hypothetical protein FLJ12457; hypothetical protein FLJ12457; 15.93 

425572; AB011076; Hs.158307; undifferentiated embryonic cell transcription factor 1; undifferentiated embryonic cell transcri; 15.82
423905; AW579960; Hs.135150; lung type-I cell membrane-associated glycoprotein; lung type-I cell membrane-associated gly; 15.11
419741; NM_007019; Hs.93002; ubiquitin carrier protein E2-C; ubiquitin carrier protein E2-C; 15.08

427584; BE410293; Hs.179718; v-myb avian myeloblastosis viral oncogene homolog-like 2; v-myb avian myeloblastosis viral oncogen; 14.17
418696; AW959433; Hs.326290; hypothetical protein FLJ12581; hypothetical protein FLJ12581; 13.58
416819; U77735; Hs.80205; pim-2 oncogene; pim-2 oncogene; 13.20 

414034; U89277; Hs.305985; early development regulator 1 (homolog of polyhomeotic 1); early development regulator 1 (homolog o; 12.93
1 ii 001 52 H 37062 - ulin k J(Le> iq cell); insuli.n-like 3 (Leydiq cell); 12.90 
432730; AI056520 II 1 1 3 58 ESTs ESTs 12.B4 

446293; AI420213; Hs.149722; LIM domain transcription factor LIM-1 (hLIM-1) mRNA; LIM domain transcription factor LIM-1 (h; 12.74
423354; AB011130; Hs.127436; calcium channel, voltage-dependent, alpha 2/delta subunit 2; calcium channel, voltage-dependent, alph; 12.46
450581;AF0815l3;Hs.251S5;TGr-beta 4;TGF-beta4; 12.42 

f,S ' ,9; ESTs ' Weakly ' mlar 10 RBaB - HUMAN RAS-FtELATED PROTEIN RAB-8B [H.sapiens]; ESTs, Weakly similar to RB8B_HUMAN RAS-R; 12.26 
431462; AW583D72 H 5 ji1;gramr l;e neuroendocrine peptide precursor; granin-likeneuroercocine peptide precu' 11 96 
431354; BE046956;Hs.251673;DNA(cytosine-5-) i letl Ir i Lela: DMA ( >)-nelhyltransfer e 3 be, 11.91 

402199; ; ; Target Exon; Target Exon; 11.85

424578; AK001973; Hs.150890; hypothetical protein; hypothetical protein; 11.81 
416350; AF188625; Hs.189507; phospholipase A2, group IID; phospholipase A2, group IID; 11.67
439979; AW600291; Hs.6823; hypothetical protein FLJ10430; hypothetical protein FLJ10430; 11.57
410048; W76467; Hs.343374; proline oxidase homolog; proline oxidase homolog; 11.42 

442573; H93366; Hs.7567; branched chain aminotransferase 1, cytosolic; branched chain aminotransferase 1, cytos; 11.42
•11',: 2756; H 77 367; monokine induced by gamma interferon; monokine induced by gamma interferon; 1 1.38 
421917; AB028943; Hs.109445; KIAA1020 protein; KIAA1020 protein; 11.15

440006; AK000517; Hs.6844; NALP2 protein; PYRIN-Containing APAF1-like; NALP2 protein; PYRIN-Containing APAF1-li; 10.92
414683; S78296; Hs.76888; hypothetical protein MGC12702; hypothetical protein MGC12702; 10.91 

423673; BE003054; Hs.1695; matrix metalloproteinase 12 (macrophage elastase); matrix metalloproteinase 12 (macrophage ; 10.74

433800; AI034361; Hs.135150; lung type-I cell membrane-associated glycoprotein; lung type-I cell membrane-associated gly; 10.68
429120; AK001673; Hs.196530; hypothetical protein FLJ10811; hypothetical protein FLJ10811; 10.48
444371; BE540274; Hs.239; forkhead box M1; forkhead box M1; 10.46
441553; AA281219; Hs.121296; ESTs; ESTs; 10.37

426534; U58096; Hs.2051; testis specific protein, Y-linked; testis specific protein, Y-linked; 10.28 
441878; AI801869; Hs.127982; ESTs; ESTs; 10.05 
432117; AL036195 H 2909 prota e1 ! 111 

425427; AI652662; Hs.317432; branched chain aminotransferase 1, cytosolic; branched chain aminotransferase 1, cytos; 9.97
416201; AA467752; Hs.195161; ESTs; ESTs; 9.97
410929; H47233; Hs.30643; ESTs; ESTs; 9.91

427486 AA9-44 H fr r . h Kto 4 (heparin secre:ory transicrrning protein 1, Kaposi sarcoma oncogene); fibroblast growth factor 4 (heparin seer- 9 81 

427239; BE270447; Hs.356512; ubiquitin carrier protein; ubiquitin carrier protein; 9.68
402680; ; ; Target Exon; Target Exon; 9.68 

409208; Y00093; Hs.172631; integrin, alpha X (antigen CD11C (p150), alpha polypeptide); integrin, alpha X (antigen CD11C (p150),; 9.46
443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; chromosome 20 open reading frame 1; 9.42
440207; AI371978; Hs.128326; ESTs; ESTs; 9.41

433001; AF217513; Hs.279905; clone HQ0310 PRO0310p1; clone HQ0310 PRO0310p1; 9.41
447534; AW953935; Hs.288655; ESTs; ESTs; 9.33 
442333; AI650877; Hs.129302; ESTs; ESTs; 9.28 

421307; BE539976; Hs.103305; Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425); Homo sapiens mRNA; cDNA DKFZp434B0425 (f; 9.24
423458; AI204212; Hs.351113; ESTs; ESTs; 9.23
431958; X63629; Hs.2877; cadherin 3, type 1, P-cadherin (placental); cadherin 3, type 1, P-cadherin (placenta; 9.23
422938; NM_001809; Hs.1594; centromere protein A (17kD); centromere protein A (17kD); 9.21

411027; AF072099; Hs.67846; leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 4; leukocyte immunoglobulin-like receptor ; 9.21
425397;J04088;Hs.156346;topoisomerase(DNA), 17 I I c C i lit 1-0)918 
428664; AK001666; Hs.189095; similar to SALL1 (sal (Drosophila)-like; similar to SALL1 (sal (Drosophila)-like; 9.17

428970; BE276891; Hs.194691; retinoic acid induced 3 (RAIG1); metabolropic qlut n famil I r i mcic acid induced 3(RAIG1)- meiabo- 9 11 
'1 i r Hs.1 9400; MAD2 (mitotic arrest del /east, homoloq)-lik 1AD2 (mild nl,y 911 

422310;AA316622;Hs.98370;cytochromeP450,suLl 1 i iK h » i , chrome 150 hrmy IIS, polypepf9 10 
449722; BE280074; Hs.23960; cyclin B1; cyclin B1 ; 8.86 

441560; F13386; Hs.7888; v-erb-a avian erythroblastic leukemia viral oncogene homolog-like 4 (HER4); v-erb-a avian erythroblastic leukemia vi; 8.86
440983; M20681; Hs.7594; solute carrier family 2 (facilitated glucose transporter), member 3; solute carrier family 2 (facilitated glu; 8.86
409342; AU077058; Hs.54089; BRCA1 associated RING domain 1; BRCA1 associated RING domain 1; 8.83
420367; AA259090; Hs.257028; ESTs; ESTs; 8.82 

415947; U04045; Hs.78934; mutS (E. coli) homolog 2 (colon cancer, nonpolyposis type 1); mutS (E. coli) homolog 2 (colon cancer, ; 8.73
418613; AA744529; Hs.86575; mitogen-activated protein kinase kinase kinase kinase 1 ; mitogen-activated protein kinase kinase ; 8.71 
417407; AA923278; Hs.290905; ESTs, Weakly similar to protease [H.sapiens]; ESTs, Weakly similar to protease [H.sapi; 8.64
407239; AA076350; Hs 67846 leuk/ , e imr inogl jn like ec pt i mi'y I (with TM id !M torn ii ) n mber 4; leukocyte imi 
' 481;M378597; Ks.5169; HSPC150 [ oin similar to ubiquitii njugalir en yme H ' ISO pto'efn mila i il , i(i n ' i 
412140; AA218691; Hs.73625; RAB6 interacting, kinesin-like (rabkinesin 6); RAB6 interacting, kinesin-like (rabkines; 8.52
438091; AW373062; Hs.351546; nuclear receptor subfamily 1, group I, member 3; nuclear receptor subfamily 1, group I, m; 8.51

.1 c 1532 MyoDI i bit 1yoD family inhibit J 
447188; H65423; Hs.17631; hypothetical protein DKFZp434E2135; hypothetical protein DKFZp434E2135; 8.45
430056; X97548; Hs.228059; KRAB-associated protein 1; KRAB-associated protein 1; 8.42

417389; BE260964; Hs.82045; midkine (neurite growth-promoting factor 2); midkine (neurite growth-promoting factor; 8.40
420759 TSrH 9b;Homosapiensenvelo P e prole,n Rlr 1 " ] ' e 6 " 1 capers envelope protein RIC-3(; 8.38 

406621; X57809; Hs 



453914; NM_000507; Hs.574; fructose-1,6-! 

423198; M8193 HO 6 II divi , _a i ion c,cie 25<\ i 19 
418299^M279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), lymphocyte function-associated antigen 1; macrophage antigen 1 (mac-1) bela subunit); integrin, beta 2 (antigen 

453968; AA847843; Hs.62711; High mobility group (nonhistone chromosomal) protein 4; High mobility group (nonhistone chromoso; 8.16 

453985; N44545; Hs.251865; ESTs; ESTs; 8.14 

451106; BE382701; Hs.25960; N-MYC oncogene; N-MYC oncogene; 8.10

420347; AL033539; Hs.97124; Human DNA sequence from clone RP1-309H15 on chromosome 6p22.1-22.3 Contains a gene similar to HDGF (hepatoma-derived growth factor (high-mobility group protein 1-like)), ESTs, STSs, GSSs and a CpG isl; Human DNA sequence from clone RP1-309H15; 8.03
415857; AA866115; Hs.127797; Homo sapiens cDNA FLJ11381 fis, clone HEMBA1000501; Homo sapiens cDNA FLJ11381 fis, clone HE; 8.02 
425601; AW629485; Hs.140720; GSK-3 binding protein FRAT2; GSK-3 binding protein FRAT2; 7.90 

421016; M504583; Hs.101047; transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47); transcription factor 3 (E2A immunoglobul; 7.89

432407; M221036; ; gb:zr03f12.r1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 5' similar to SW:POL_BAEVM P10272 POL POLYPROTEIN ;, mRNA sequence; gb:zr03f12.r1 Stratagene NT2 neuronal pr; 7.83
422846; BE513934, Hs.15 neutrophil cylosolic factor 1 (47kD, chronic granulomatous disease, autosomal 1); neutrophil cytosolic factor 1 (47kD, chr; 7.80 
I33228;F28212;I I495 , 1491 protein; KIAA1491 protein; 7.73 
446528; AU076640; Hs.15243; nucleolar protein 1 (120kD); nucleolar protein 1 (120kD); 7.71 

447350; AI375572; Hs.172634; v-erb-a avian erythroblastic leukemia viral oncogene homolog-like 4 (HER4); v-erb-a avian erythroblastic leukemia vi; 7.71
448775; AB025237; Hs.388; nudix (nucleoside diphosphate linked moiety X)-type motif 1 ; nudix (nucleoside diphosphate linked moi; 7.71 
430253; AK001514; Hs.236844; hypothetical protein FLJ10652; hypothetical protein FLJ10652; 7.70

444784; D12485; Hs.11951; ectonucleotide pyrophosphatase/phosphodiesterase 1 (Plasma-cell membrane glycoprotein PC-1); ectonucleotide pyrophosphatase/phosphodi; 7.62
443 Li 6;Hs.20 chotecystokinm B receptor, cholecystokinin B receptor, 7.57 

446291; BC397753; Hs.14623; interferon, gamma-inducible protein 30; interferon, gamma-inducible protein 30; 7.55
410006;AW732308;Hs.57783;eukaryoclr n o I n 11 t in t on factor; 7.53 

430255 AKOOOiO Homo sap n Phi! teir, partial cd rr pie'ns mRNA for Kl i I 

411975; AI916058; Hs.144583; 3'UTR of: dead ringer (Drosophila)-like 1; 3'UTR of: dead ringer (Drosophila)-like ; 7.50

439864; AI720078; Hs.291997; ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens]; ESTs, Weakly similar to A47582 B-cell gr; 7.47

* 1 Homo sapiens, Similar to RIKEM cDMA 270 re MGC4669, mRNA. complete cds; H r I L' t 

417705; AW134952; Hs.175220; hypothetical protein FLJ14541; hypothetical protein FLJ14541; 7.47

425367; BE271188; Hs.155975; protein tyrosine phosphatase, receptor type, C-associated protein; protein tyrosine phosphatase, receptor t; 7.47
407710; AW022727; Hs.23615: E 15 
445093; AI207197; Hs.374149; ESTs; ESTs; 7.41

418113; AI272141; Hs.83484; SRY (sex determining region Y)-box 4; SRY (sex determining region Y)-box 4; 7.39

117900; BE250127; Hs.82906; CDC20 (cell division cycle 20, S. cerevisiae, homolog); CDC20 (cell division cycle 20, S. cerevi; 7.37

429469; M54590;Hs,1 11 801; glycine dehydrogena e(decarboxjla u f n I Me n P) glycine dehydrogena e (decarboxylase; ; 7.33 

422726 U11690 Hs 15 j Id kog-Scott synd aciogenita 1 q ; ott-y 733 

430504; H52761; Hs.44095; Homo sapiens, clone MGC:12617, mRNA, complete cds; Homo sapiens, clone MGC:12617, mRNA, com; 7.32

448981 AI96B710 lls.1953 F 1 T ^ 28 

413762; AW411479; Hs.848; FK506-binding protein 4 (59kD); FK506-binding protein 4 (59kD); 7.26

435092; AL137310; Hs.4749; Homo sapiens mRNA; cDNA DKFZp761E13121 (from clone DKFZp761E13121); partial cds; Homo sapiens mRNA; cDNA DKFZp761E13121 (; 7.25
434414;AI798376;;gb;lr34b07.x1NCLCGAP_Ov23HomosapienscDUr(^" nil loTI C 175 01 175 UNNAMED HERV-H PROTEI -il >' , -squence, qb tr34b0^ x1 
NCI_CGAP_Ov23 Homo sapiens; 7.24 

11)11 I i 

434274; AA62853aHs.577M^ ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [Rsapiens]; ESTs, Moderately 

446700; AW206257; Hs.156326; Human DNA sequence from clone RP11-145L22 on chromosome 6p21.32-22.2. Contains the gene for myelin/oligodendrocyte glycoprotein MOG, (part of) the gene for a novel KRAB box containing C2H2 type zinc f; Human DNA sequence from clone RP11-145L2; 7.15
420 24 AB0105 5 H "I (enchant I e al on channel 3, te- 7.15 

439053;BE244588;Hs.6456;chapaonincontainingTCP1,stibumt t i 1 ting T suumt2(b714 

I450 AI206B I 1541 I E 1 71) 
448588; AI970276; Hs.156905; KIAA1676; KIAA1576; 7.13 

429486; AF155827; Hs.203963; hypothetical protein FLJ10339; hypothetical protein FLJ10339; 7.10

441362; BE614410; Hs.23044; RAD51 (S. cerevisiae) homolog (E coli RecA homolog); RAD51 (S. cerevisiae) homolog (E coli Re; 7.04

433914; AF108138; Hs.112160; Homo sapiens DNA helicase homolog (PIF1) mRNA, partial cds; Homo sapiens DNA helicase homolog (PIF1); 7.02

413278; BE563085; Hs.833; interferon-stimulated protein, 15 kDa; interferon-stimulated protein, 15 kDa; 7.02

423765; R23858; Hs.143375; Homo sapiens, clone IMAGE:3840937, mRNA, partial cds; Homo sapiens, clone IMAGE:3840937, mRNA,; 6.96
5 1 Hs.79432; ficril tractur i- I v in r i lr i I 

438450; AI050866; Hs.65853; nodal, mouse, homolog; nodal, mouse, homolog; 6.90 

11 BE2 ' ^ 11 '9 neutrophil cytosolic factor 2 (65kD, chronic granulomatous disease, autosomal 2); neutrophil cytosolic factor 2 (65kDchr6 90 
444381; BE387335; Hs.283713; hypothetical protein BC014245; hypothetical protein BC014245; 6.89 
447582; BE293520; Hs.18910; prostate cancer overexpressed gene 1; prostate cancer overexpressed gene 1; 6.89
424779; AL046851; Hs.153053; CD37 antigen; CD37 antigen; 6.89 

443907; AU076484; Hs.9963; TYRO protein tyrosine kinase binding protein; TYRO protein tyrosine kinase binding pro; 6.84 
427298; AA400495; ; ESTs; ESTs; 6.32 

414732; AW410973 u i linict u no-one maintenance deficient (S. cerevisiae) 7; minichromosome maintenance deficient (S.; 6.81 

424959; NM_005781; Hs.153937; activated p21cdc42Hs kinase; activated p21cdc42Hs kinase; 6.81

426866; U02330; Hs.172816; neuregulin 1; neuregulin 1; 5.80

418203; X54942; Hs.83758; CDC28 protein kinase 2; CDC28 protein kinase 2; 5.80

427521; AW973352; ; ESTs; ESTs; 5.75 

430397; AI924533; Hs.105607; bicarbonate transporter related protein 1 ; bicarbonate transporter related protein ; 6.75 
427719; AI393122; Hs.134726; ESTs; ESTs; 5.74 






439453; BE264974; Hs.6566; thyroid hormone receptor interactor 13; thyroid hormone receptor interactor 13; 6.72
453348; BE272318; Hs.8595; hypothetical protein FLJ12438; hypothetical protein FLJ12438; 6.71

446113; AW967553; Hs.323518; Homo sapiens mRNA for FLJ00083 protein, partial cds; Homo sapiens mRNA for FLJ00083 protein, ; 6.70

446363; NM_005903 1 E u „parone d, 6 70 

433701; AW445023; Hs.15155; ESTs; ESTs; 6.69 

418054; NM_002318; Hs.83354; lysyl oxidase-like 2; lysyl oxidase-like 2; 6.63 

441031; AI110684; Hs.7645; fibrinogen, B beta polypeptide; fibrinogen, B beta polypeptide; 6.62

439237; AW408158 Hs 31 88S3 E c l I I 682 B-cell h faster prec II a en J ESTs, Weakly similar to A47582 B-cell or; 6.59 

424381; M2B5249; Hs.146329; protein kinase Chk2 (CHEK2); protein kinase Chk2 (CHEK2);6.58 

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity Ia, receptor for (CD64); Fc fragment of IgG, high affinity Ia, re; 6.58

418883; BE387036; Hs.1211; acid phosphatase 5, tartrate resistant; acid phosphatase 5, tartrate resistant; 6.57 

414161; AA136106; Hs.184852; KIM1653 protein; KIM1553 protein; 6.57 

437437; AA226869; Hs.351623; hypothetical protein DKFZp762L0311; hypothetical protein DKFZp762L0311; 6.55

425769; U72513; Hs.159486; Human RPL13-2 pseudogene mRNA, complete cds; Human RPL13-2 pseudogene mRNA, complete ; 6.55 

452363; AI582743; Hs.94953; Homo sapiens, Similar to complement cornpo™ j i r ' 1 mRNA complete cds; Homo sapiens, Similar 

to complement comp; 6.55 

422765; AW409701; Hs.1578; baculoviral IAP repeat-containing 5 (survivin); baculoviral IAP repeat-containing 5 (sun 5,51 

409893; AW247090; Hs.57101;minichromosome maintenance delice e 2 tir): mi orno=ome maintenance deficient (S.; 6.50 

418913; X07871; Hs.89476; CD2 antigen (p50), sheep red blood e!l I Cjr n -d tlood cell 6.49 

417911; AA333337; Hs.82915; chaperonin containing TCP1, subunit 6A (zeta 1); chaperonin containing TCP1, subunit 6A (; 6.44

427747; AW411425; Hs.180656; serine/threonine kinase 12; serine/threonine kinase 12; 6.43 

409142; AL136877; Hs.507S8; SMC4 (structural maintenance of chromosomes 4, yeast)-like 1; SMC4 (structural maintenance of chromoso; 6.42
430280; AA361258; Hs.237868; interleukin 7 receptor; interleukin 7 receptor; 6.42
432938; T27013; Hs.3132; steroidogenic acute regulatory protein; steroidogenic acute regulatory protein; 6.42
i J 1 H 11 j potheticel pro! I I Iproie FLJ 10 19 

427578; AI591305; Hs.169084; ESTs, Highly similar to TUL3_HUMAN TUBBY RELATED PROTEIN 3 [H.sapiens]; ESTs, Highly similar to TUL3_HUMAN TUBBY; 6.40

417115; AW952792; Hs.334612; small nuclear ribonucleoprotein polypeptide E; small nuclear ribonucleoprotein polypept; 6.39

428227; AA321 649; Hs.2248 -m hrd ibl 1 j m f X-Cys), member 10; small induct ul family B (Cy; 6.39 

441384; AA447849;H 5 .288G"D ret acid induced 3, te rid induced 3; 5.38 

438915; AA280174; Hs.285681; Williams-Beuren syndrome chromosome region 14; Williams-Beuren syndrome chromosome regi; 6.34
416773; AK000340; Hs.79828; hypothetical protein FLJ20333; hypothetical protein FLJ20333; 6.34

451999; AW176401; Hs.27424; DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 11 (S.cerevisiae CHL1-like helicase); DEAD/H (Asp-Glu-Ala-Asp/His) box polypep; 6.31

444159; AF116846; Hs.10431; dead ringer (Drosophila)-like 2 (bright and dead ringer); dead ringer (Drosophila)-like 2 (bright ; 6.31

425274; BE281191; Hs.155462; minichromosome maintenance deficient (mis5, S. pombe) 6; minichromosome maintenance deficient (mi; 6.31

K ' ); ESTs, I I 10350 [ f f f | u ( | I T r / n / 1 

420507; AF093408; Hs.98397; A kinase (PRKA) anchor protein 3; A kinase (PRKA) anchor protein 3; 6.30 
415829; AW450198; Hs.163742; ESTs; ESTs; 6.28 

410342; R31350; Hs.743; Fc fragment of IgE, high affinity I, receptor for; gamma polypeptide; Fc fragment of IgE, high affinity I, rec; 6.26
446839; BE091926; Hs.16244; mitotic spindle coiled-coil related protein; mitotic spindle coiled-coil related prot; 6.25

457465; AW301344; Hs.122908; DNA replication factor; DNA replication factor; 6.25
428918; AL036967; Hs.2324; protamine 2; protamine 2; 6.24

443523; AK001 57 r H 36 pot feat prolan FU1 13 hypothetical protein FLJ1D713; 6.19 
424415; NM_001975; Hs.146580; enolase 2, (gamma, neuronal); enolase 2, (gamma, neuronal); 6.19
407245; X90558; Hs.172004; titin; titin; 6.18 

458627; AW088642; Hs.97984; SRY (sex determining region Y)-box 17 (SOX17); SRY (sex determining region Y)-box 17 (S; 6.18 

421379; Y15221; Hs.103982; small inducible cytokine subfamily B (Cys-X-Cys), member 11; small inducible cytokine subfamily B (Cy; 6.16

435099; AC004770; Hs.4756; flap structure-specific endonuclease 1; flap structure-specific endonuclease 1; 6.13

424308 AW975531 H 11 a if I; minichrom mp ma ntenance deficient (S.; 6.12 

430521; NM_016383; Hs.242183; HOM-TES-85 tumor antigen; HOM-TES-85 tumor antigen; 6.10 

444823; BE262989; Hs.12045; putative protein; putative protein; 6.10

428484; AF104032; Hs.184501; solute carrier family 7 (cationic amino acid transporter, y+ system), member 5; solute carrier family 7 (cationic amino ; 6.09
402260; ; ; NM_001436*:Homo sapiens fibrillarin (FBL), mRNA. transcript) (F8A), mRNA.; NM_001436*:Homo sapiens fibrillarin (FBL; 6.09
441321; H17182; Hs.7771; B-cell associated protein; B-cell associated protein; 6.05

422283; AW411307; Hs.114311; CDC45 (cell division cycle 45, S.cerevisiae, homolog)-like; CDC45 (cell division cycle 45, S.cerevis; 6.04

431629; AU077025; Hs.265827 ntr- f» id t I cp 16); in on, alph bio, (cio 6 04 

402678; ; ; Target Exon; Target Exon; 6.03 

453884; AA355925; Hs.36232; KIAA0186 gene product; KIAA0186 gene product; 6.01 

439753; BE262233; Hs.7423; hypothetical protein from EUROIMAGE 2168212; hypothetical protein from EUROIMAGE 2168; 6.01
420596; NM_002692; Hs.99185; polymerase (DNA directed), epsilon 2; polymerase (DNA directed), epsilon 2; 6.01
420676; AI434780; Hs.4248; vav 2 oncogene; vav 2 oncogene; 6.00 
418756; AA252254; Hs.226949; ESTs; ESTs; 5.99 

454438; AA224053; Hs.172405; cell division cycle 27; cell division cycle 27; 5.98
407818; AL021938; Hs.40154; jumonji (mouse) homolog; jumonji (mouse) homolog; 5.98
413313; NM_002047; Hs.293885; glycyl-tRNA synthetase; glycyl-tRNA synthetase; 5.96
424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lysozyme (renal amyloidosis); 5.95

417777; AI823763 H ESTs, Weakly sirn ilic | te n kina e |H apien ] ESTs, Weakly similar to I78885 serine/th; 5.94 

449569; AI656634; Hs.195389; ESTs; ESTs; 5.92 
436576; AI458213; Hs.77542; ESTs; ESTs; 5.90 

438746; AI885815; Hs.184727; Human melanoma-associated antigen p97 (melanotransferrin) mRNA, 3' flank; Human melanoma-associated antigen p97 (m; 5.89

417208; S67773; Hs.81665; v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog; v-kit Hardy-Zuckerman 4 feline sarcoma v; 5.88

;;C2001472*:gi|5809678|gb|AAB41848.2|(U64675) sperm membrane protein BS-63 [Homo sap n ] II 001472 MS ) icb nAB41848.2| (U64- 5.87 
4nn ««*u„«« 0 ,< ui 1i r p if r 1 I -a 1 157 protein; 5,87 

r iensmutS(E.coli) homolog 6 ' 

423787; AJ295745; Hs.236204; nuclear pore complex protein; nuc.._ r . 

425126; N32759; Hs.172944; chorionic gonadotropin, beta polypeptide; chorionic gonadotropin, beta polypeptide; 5.84 
452796; AB011100; Hs.30656; KIAA0528 gene product; KIAA0528 gene product; 5.84
418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase); cyclin-dependent kinase inhibitor 3 (CDK; 5.80
447359; NM_012093; Hs.18268; adenylate kinase 5; adenylate kinase 5; 5.79

420344; BE463721; Hs U 101 puta e^pio'ei coupled racepto f ' I _ G protein-coupled receptor; 5.78 

420297; AI628272; Hs.128757; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to
lie (Drosophila) homolog 2; enhancer of zeste (Drosophila) homolog 2; 5.75
430287; AW182459; Hs.125759; ESTs, Weakly similar to LEU5_HUMAN LEUKEMIA ASSOCIATED PROTEIN 5 [H.sapiens]; ESTs, Weakly similar to LEU5_HUMAN LEUKE; 5.74
436251; BE515065; Hs.296585; nucleolar protein (KKE/D repeat); nucleolar protein (KKE/D repeat); 5.73 

421535; AB002359; Hs.105478; phosphoribosylformylglycinamidine synthase (FGAR amidotransferase); phosphoribosylformylglycinamidine syntha; 5.71
414883; AA926980; Hs.348669; CDC28 protein kinase 1; CDC28 protein kinase 1; 5.69

425159; NM_004341; Hs.154868; carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase; carbamoyl-phosphate synthetase 2, aspart; 5.69
401704; ; ; NM_021195*:Homo sapiens claudin 6 (CLDN6), mRNA. VEFC "t IMJ2C 32 31; NM_021195*:Homo sapiens claudin 6 (CLDN6; 5.66
425358; AL079658; Hs.338207; FK506 binding protein 12-rapamycin associated protein 1; FK506 binding protein 12-rapamycin assoc; 5.65

402677; ; ; NM_000478:Homo sapiens alkaline phosphatase, liver/bone/kidney (ALPL), mRNA. alpha-hydroxylase), polypeptide 1 (CYP7B1), mRNA.; NM_000478:Homo sapiens alkaline phosphat; 5.64
409264; NM_014937; Hs.52463; KIAA0965 protein; KIAA0956 protein; 5.63 
432185; AA221032; Hs.272838; hypothetical protein FLJ10494; hypothetical protein FLJ10494; 5.63
499012; AL117435; Hs.49725; DKFZP434I216 protein; DKFZP434I216 protein; 5.63

430252; AI638774; Hs.105328; testes development-related NYD-SP20; testes development-related NYD-SP20; 5.61

" m 59; AL043202; Hs.90073; chromosome segregation 1 (yeast homolog)-like; chromosome segregation 1 (yeast homolog); 5.61 




414291; AI289619; Hs.13040; G protein-coupled receptor 86; G protein-coupled receptor 86; 5.58
453028; AB006532; Hs.31442; RecQ protein-like 4; RecQ protein-like 4; 5.58 
453905; NM_002314; Hs.36566; LIM domain kinase 1; LIM domain kinase 1; 5.56

411263; BE297802; Hs.69360; kinesin-like 6 (mitotic centromere-associated kinesin); kinesin-like 6 (mitotic centromere-assoc; 5.55
419660;BE280337;Hs.194693;solutecarrierfamily7(catiomcaminoacidtran'f i ), membe lu ir f ri a ionic amino; 5.55 

446979; AI654443; Hs.197683; ESTs; ESTs; 5.54

424321; W74048; Hs.1765; lymphocyte-specific protein tyrosine kinase; lymphocyte-specific protein tyrosine kin; 5.53 
418962 AA71483'" H SI C T 5 53 

447388; AW630534; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; Homo sapiens, clone MGC:9381, mRNA, comp; 5.52 

427247; AW504221; Hs.174103; integrin, alpha L (antigen CD11A (p180), lymphocyte function-associated antigen 1; alpha polypeptide); integrin, alpha L (antigen CD11A (p180),; 5.52

449322; AI638616; Hs.196566; ESTs; ESTs; 5.51 

428450; NM_014791; Hs.184339; KIAA0175 gene product; KIAA0175 gene product; 5.51

415141; AA189099; Hs.268171; ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU7_HUMAN ALU S; 5.48
454048; H05626; Hs.6921; ESTs; ESTs; 5.46 

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; interleukin 1 receptor antagonist; 5.45

434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis, clone HEMBB1001304; Homo sapiens cDNA FLJ11980 fis, clone HE; 5.44
414334 AA82<'z Q P F 1 i r FLJ100 hyfc 'p t i rL 10C36; 5.44 

F016 C7 (cell d I erevi lae, homolog)-like 1: CDC7 (cell division cycle 7, S. cerevisi; 5.44 

1 a I 1 | M , I r i a r n I 

427668; AA298760; Hs.180191; hypothetical protein FLO I r < > p ' ' ' P <- 21 Fl I ' '04 5.43 

1 19437; AI702038;Hs.100057; Homo sapiens cDNA:aj22902!i « I u 1 r, n M A; FLJ22902 fis, clone K; 5.41 
453633; AA357001; Hs.34045; hypothetical protein FLJ20764; hypothetical protein FLJ20764; 5.40
450746; D82673; Hs.278589; general transcription factor II, i; general transcription factor II, i; 5.40 
425966; NM_001761; Hs.1973; cyclin F; cyclin F; 5.39
413134; AA397769; Hs.86617; ESTs; ESTs; 5.38

432141; BE410964; Hs.272736; nuclear receptor binding protein; nuclear receptor binding protein; 5.37 

417141; U22662; Hs.347353; nuclear receptor subfamily 1, group H, member 3; nuclear receptor subfamily 1, group H, m; 5.36

428329; AA426091; Hs. 98453; ESTs, Moderately similar to R27328 2 [H appnsi M ierately mi rtoF , E 2 [H 535 

406811;U82979;Hs.67846;leukocyteimmunoglobulin-likereceptni ul mil) (with TM and ITIf-, omoir -net ibe 4 i i « le mrrunoglobulin-like receptor ; 5.34 
415819 AU07 7 33C I 3791; t ptione t ( Fa ongati n factor A (Sll),; 5.33 

448133; AA723157; Hs.73769; folate receptor 1 (adult); folate receptor 1 (adult); 5.33
424762; AL119442; Hs.183684; eukaryotic translation initiation factor 4 gamma, 2; eu

421959; AW751497; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; cytochrome P450, su™ , , , - . 

429271; AF039850; Hs.198515; dead ringer (Drosophila)-like 1; dead ringer (Drosophila)-like 1; 5.32

455373; BE247706; Hs.86693; membrane-spanning 4-domains, subfamily A, member 2 (CD20 antigen); membrane-spanning 4-domains, subfamily A; 5.30 
414907; X90725; Hs.77597; polo (Drosophia)-like kinase; polo (Drosophia)-like kinase; 5.30 
422997; BE018212; Hs.122908; DNA replication factor; DNA replication factor; 5.29

440014; AW960782; Hs.6856; ash2 (absent, small, or homeotic, Drosophila, homolog)-like; ash2 (absent, small, or homeotic, Drosop; 5.28
118399; AF131781; H 34 53 t ,P the I protein Fl 1124 ^r lie I pr I r 11 12 r 9, 

416178; AI808527; Hs.192822; serologically defined breast cancer antigen NY-BR-81 ; serologically defined breast cancer anti; 5.21 

450377; AB033091; Hs.355925; KIAA1265 protein; KIAA1265 protein; 5.20 

409670; AI368109; Hs.375604; KIAA1856 protein; KIAA1856 protein; 5.20 

429083; Y09397; Hs.227817; BCL2-related protein A1; BCL2-related protein A1; 5.20

449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; chemokine (C-C motif) receptor 5; 5.20 

408908; BE296227; Hs.250822; serine/threonine kinase 15; serine/threonine kinase 15; 5.19

429732; U20158; Hs.2488; lymphocyte cytosolic protein 2 (SH2 domain-containing leukocyte protein of 76kD); lymphocyte cytosolic protein 2 (SH2 doma; 5.19

438089; W05391; Hs.351546; nuclear receptor subfamily 1, group I, member 3; nuclear receptor subfamily 1, group I, m; 5.16

437623; D63880; Hs.5719; chromosome condensation-related SMC-associated protein 1; chromosome condensation-related SMC-asso; 5.15

448181; AF272833; Hs.279763; hypothetical protein FLJ10504; hypothetical protein FLJ10504; 5.15

436540; BE397032; Hs.14468; hypothetical protein MGC14226; hypothetical protein MGC14226; 5.14 

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor type, C; protein tyrosine phosphatase, receptor t; 5.14

126752: X59490;Hs.1 72004; titin; t 1 

415007; BE244332; Hs.77770; adaptor-related protein complex 3, mu 2 subunit; adaptor-related protein complex 3, mu 2 ; 5.13
400263; ; Hs.75309; Eos Control; Eos Control; 5.13 

437099; N77793; Hs.48659; ESTs, Highly similar to S14458 laminin alpha-1 chain precursor [H.sapiens]; ESTs, Highly similar to S14458 laminin a; 5.12
427209; H06509; Hs.92423; KIAA1566 protein; KIAA1566 protein; 5.10 

407347; AA829847; ; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:1370413 3' similar to contains Alu repetitive element;, mRNA sequence.; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens; 5.10
458933; AI638429; Hs.24763; RAN binding protein 1; RAN binding protein 1; 5.10 
450431 AW13670 F 1 E ESTs 5.09 

434608; AA805443; Hs.179909; hypothetical protein FLJ22995; hypothetical protein FLJ22995; 5.03

410423; AW402432; Hs.63489; protein tyrosine phosphatase, non-receptor type 6; protein tyrosine phosphatase, non-recept; 5.08

417929; R27219; Hs.74647; Human T-cell receptor active alpha-chain mRNA from JM cell line, complete cds; Human T-cell receptor active alpha-chain; 5.05

412723; AA648459; Hs.335951; hypothetical protein AF301222; hypothetical protein AF301222; 5.05
447321; AW271217; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; Homo sapiens cDNA FLJ14028 fis, clone HE; 5.03

417866; AW067903; Hs.82772; collagen, type XI, alpha 1; collagen, type XI, alpha 1; 5.03 

410082; AA081594; Hs.158311; Musashi (Drosophila) homolog 1; Musashi (Drosophila) homolog 1; 5.02

433592; NM_004542; Hs.3436; deleted in oral cancer (mouse, homolog) 1; deleted in oral cancer (mouse, homolog) ; 5.02

425811; AL039104; Hs.159557; karyopherin alpha 2 (RAG cohort 1, importin alpha 1); karyopherin alpha 2 (RAG cohort 1, impor; 5.00

425237; U07695; Hs.155227; EphB4; EphB4; 5.00 

414809; AI434899; Hs.77355; transferrin receptor (p90, CD71); transferrin receptor (p90, CD71); 4.99
402145; ; ; Target Exon; Target Exon; 4.99 
432126; AA865239; Hs.37196; ESTs; ESTs; 4.99 

408279; AF216965; Hs.44095; Homo sapiens, clone MGC:12617, mRNA, complete cds; Homo sapiens, clone MGC:12617, mRNA, com; 4.98
419525; T79257; Hs.1259; asialoglycoprotein receptor 2; asialoglycoprotein receptor 2; 4.97
424439; AA579635; Hs.1770; ligase I, DNA, ATP-dependent; ligase I, DNA, ATP-dependent; 4.97

427667; AK001279; Hs.180171; Homo sapiens cDNA FLJ10417 fis, clone NT2RP1000112; Homo sapiens cDNA FLJ10417 fis, clone NT; 4.96 
' II 102; Hs.241 520; transcriptional cc 3 > 'I 

448569; BE382657; Hs.21486; signal transducer and activator of transcription 1, 91kD; signal transducer and activator of trans; 4.94
426427; M86699; Hs.169840; TTK protein kinase; TTK protein kinase; 4.91

(1 Veakly sim 86 Ste20-like [ [H.sapier.s]; ESTs, Weakl II I 

453922 AF053306 H W buddin u i nbited o/ benzimidazoles 1 (yea-1 homolcg), beta; budding uninhibited by benzimidazoles 1 ; 4.90 
435602; AF217515; Hs.283532; uncharacterized bone marrow protein BM039; uncharacterized bone marrow protein BM03; 4.89
452698; NM_001295; Hs.301921; chemokine (C-C motif) receptor 1; chemokine (C-C motif) receptor 1; 4.88

445333; BE537641; Hs.44278; hypothetical protein FLJ12538 similar to ras-related protein RAB17; hypothetical protein FLJ12538 similar to; 4.88
437162; AW005505; Hs.5464; thyroid hormone receptor coactivating protein; thyroid hormone receptor coactivating pr; 4.87
441600, AA939347 Hs.127 2 Homosapi cysteine knot p Imp "K q) mRNA, complete cds; Homo sapiens cysteine knot prolein (2SIG; 4.87 
421350; AW301608; Hs.278188; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; ESTs, Moderately similar to I54374 gene ; 4.87 
409093; BE243834; Hs.50441; CGI-04 protein; CGI-04 protein; 4.86

424304; NM_001395; Hs.144879; dual specificity phosphatase 9; dual specificity phosphatase 9; 4.86 
437696; Z83844; Hs.5790; hypothetical protein dJ37E15.5; hypothetical protein dJ37E15.5; 4.86

417381; AF164142; Hs.82042; solute carrier family 23 (nucleobase transporters), member 1; solute carrier family 23 (nucleobase tra; 4.84

416445; AL043004; Hs.79337; KIAA0135 protein; KIAA0135 protein; 4.83

448939; BE267795; Hs.22595; hypothetical protein FLJ10637; hypothetical protein FLJ10637; 4.82

413566; AW604451; Hs.285814; sprouty (Drosophila) homolog 4; sprouty (Drosophila) homolog 4; 4.82

424081; NM_006413; Hs.139120; ribonuclease P (30kD), ribon rlr - ;F kD) . J l 

425474; Z48054; Hs.158084; peroxisome receptor 1; peroxisome receptor 1; 4.81

412276; BE262621; Hs.73798; macrophage migration inhibitory factor (glycosylation-inhibiting factor); macrophage migration inhibitory factor (; 4.81

422689; AW856665; Hs.299797; gb:RC3-CT0297-290100-013-d03 CT0297 Homo sapiens cDNA, mRNA sequence; gb:RC3-CT0297-290100-013-d03 CT0297 Homo; 4.80

409101; NMJ04237 H 0 12 ju nin ruccotide binding protein (G protein), alpha 14; guanine nucleotide binding protein (G pr; 4.79 

412760; AW379030; Hs.41324; ESTs; ESTs; 4.79 

447250; AI878909; Hs.17883; protein phosphatase 1G (formerly 2C), magnesium-dependent, gamma isoform; protein phosphatase 1G (formerly 2C), ma; 4.79 
429345; R11141; Hs.199695; hypothetical protein; hypothetical protein; 4.78
448950; AF288687; Hs.9275; CGI-152 protein; CGI-152 protein; 4.78

412926; AI879076; Hs.75061; macrophage myristoylated alanine-rich C kinase substrate; macrophage myristoylated alanine-rich C ; 4.78
412641; M16660; Hs.74335; heat shock 90kD protein 1, beta; heat shock 90kD protein 1, beta; 4.76

420261; AW206093; Hs.748; fibroblast growth factor receptor 1 (fms-related tyrosine kinase 2, Pfeiffer syndrome); fibroblast growth factor receptor 1 (fms; 4.76
421905; AI660247; Hs.32699; ESTs, Weakly similar to LIV-1 protein [H.sapiens]; ESTs, Weakly similar to LIV-1 protein [H; 4.75
413880; AI660842; Hs.110915; interleukin 22 receptor; interleukin 22 receptor; 4.75

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a)-related kinase 2 (NEK2); NIMA (never in mitosis gene a)-related k; 4.74
418355; L42563; Hs.1165; ATPase, H+ transporting, nongastric, alpha polypeptide; ATPase, H+ transporting, nongastric, alp; 4.74
435905; AW997484; Hs.5003; KIAA0456 protein; KIAA0456 protein; 4.74

428024; Z29067; Hs.2236; NIMA (never in mitosis gene a)-related kinase 3; NIMA (never in mitosis gene a)-related k; 4.74
421846; AA017707; Hs.1432; protein kinase C substrate 80K-H; protein kinase C substrate 80K-H; 4.72 
419138; U48508; Hs.89631; ryanodine receptor 1 (skeletal); ryanodine receptor 1 (skeletal); 4.72 
437296; AA350994; Hs.20281; KIAA1700; KIAA1700; 4.70 

450142; AW207469; Hs.24485; chondroitin sulfate proteoglycan 6 (bamacan); chondroitin sulfate proteoglycan 6 (bama; 4.70

121 L 1 1 9671 I I 0; fit bla rowth facto : eplor 3 (achondroplasia, thanatophoric dwarfism); fibroblast growth factor receptor 3 (ach; 4.69 

449475; AI348027; Hs.129826; hypothetical protein PP1057; hypothetical protein PP1057; 4.69 

420062; AW411096; Hs.94785; TGF(beta)-induced transcription factor 2; TGF(beta)-induced transcription factor 2; 4.69 

429336; AB005038; Hs.199270; cytochrome P450, subfamily XXVIIB (25-hydroxyvitamin D-1-alpha-hydroxylase), polypeptide 1; cytochrome P450, subfamily XXVIIB (25-hy; 4.69 
436856; AI469355; Hs.127310; ESTs; ESTs; 4.68

406937; U14622; ; gb:Human transketolase-like protein gene, partial cds.; gb:Human transketolase-like protein gene; 4.67
411296; BE207307; Hs.1 01 1'.; growth suppressor 1; growth suppressor 1; 4.67 
426726; AA488915; Hs.171955; trophinin associated protein (tastin); trophinin associated protein (tastin); 4.67
409132; AJ224538; Hs.50732; protein kinase, AMP-activated, beta 2 non-catalytic subunit; protein kinase, AMP-activated, beta 2 no; 4.67
449230; BE613348; Hs.356392; melanoma cell adhesion molecule; melanoma cell adhesion molecule; 4.66
431681; AK000376 ri J r u , hetk al protein FLJ20371; hypothetical prolein FLJ20371; 4.65 
443623;AA345519;Hs.9641;complemenlcomponenl1,. ' 
441595; AW206035; Hs.356457; ESTs; ESTs; 4.64 
423419; R55336; Hs.23539; ESTs; ESTs; 4.64 

415724; NM_003580; Hs.78687; neutral sphingomyelinase (N-SMase) activation associated factor; neutral sphingomyelinase (N-SMase) activ; 4.63
435045; BE297155; Hs.143698; ESTs; ESTs; 4.62 

424441; X14850; Hs.147097; H2A histone family, member X; H2A histone family, member X; 4.62 
414972; BE263782; Hs.77595; KIAA0008 gene product; KIAA0008 gene product; 4.62 

436685; W28661; Hs.5288; Homo sapiens mRNA; cDNA DKFZp434M245 (from clone DKFZp434M245); Homo sapiens mRNA; cDNA DKFZp434M245 (fr; 4.62
449515; AI653378; Hs.302012; ESTs; ESTs; 4.61

425998; AU076629; Hs.165950; fibroblast growth factor receptor 4; fibroblast growth factor receptor 4; 4.61
420027; AF009746; Hs.94395; ATP-binding cassette, sub-family D (ALD), member 4; ATP-binding cassette, sub-family D (ALD); 4.61
436469; AK001455; Hs.5198; Down syndrome critical region gene 2; Down syndrome critical region gene 2; 4.61
413441; AI929374; Hs.75367; Src-like-adapter; Src-like-adapter; 4.60
456847; AI360456; Hs.86088; ESTs; ESTs; 4.58 

421506; BE302796; Hs.105097; thymidine kinase 1, soluble; thymidine kinase 1, soluble; 4.57
426935; NM_000088; Hs.172928; collagen, type I, alpha 1; collagen, type I, alpha 1; 4.57
428782; X12830; Hs.193400; interleukin 6 receptor; interleukin 6 receptor; 4.56 
409430; R21945; Hs.346735; splicing f t ginii in ' r * o rq i -erne nch 5, 4.56 
412773; H15785; Hs.74573; similar to vaccinia virus HindIII K4L ORF; similar to vaccinia virus HindIII K4L OR; 4.55
443068; AI188710; Hs.374480; ESTs; ESTs; 4.55

441607; NM_005010; Hs.7912; neuronal cell adhesion molecule; neuronal cell adhesion molecule; 4.54
453227; AW135862; Hs.243991; ESTs; ESTs; 4.52

438459; T49300; Hs.35304; Homo sapiens cDNA FLJ13655 fis, clone PLACE1011503; Homo sapiens cDNA FLJ13655 fis, clone PL; 4.51
422565; BE259035; Hs.118400; singed (Drosophila)-like (sea urchin fascin homolog like); singed (Drosophila)-like (sea urchin fas; 4.51
453613; F06838; Hs.374476; ESTs; ESTs; 4.50 

413900; AW409747; Hs.75612; stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein); stress-induced-phosphoprotein 1 (Hsp70/H; 4.50
412507; L36645; Hs.73964; EphA4; EphA4; 4.50

419034,NM_002110 h 3 lien i >■ nop li kinase; 4.49 

413431; AW246428; Hs.75355; ubiquitin-conjugating enzyme E2N (homologous to yeast UBC13); ubiquitin-conjugating enzyme E2N (homolo; 4.49
406547; ; ; Target Exon; Target Exon; 4.49 

443216; W80487; Hs.324521; hypothetical protein DC50; hypothetical protein DC50; 4.48 

417497; AW402482; Hs.82212; CD53 antigen; CD53 antigen; 4.47

448595; AB014544; Hs.21572; KIAA0644 gene product; KIAA0644 gene product; 4.47

445350; AF052112; Hs.12540; lysophospholipase I; lysophospholipase I; 4.46 

446236; NM_006293; Hs.301; TYRO3 protein tyrosine kinase; TYRO3 protein tyrosine kinase; 4.46

429150; AF120103; Hs.197366; smoothened (Drosophila) homolog; smoothened (Drosophila) homolog; 4.46

420340; NM_000734; Hs.97087; CD3Z antigen, zeta polypeptide (TiT3 complex); CD3Z antigen, zeta polypeptide (TiT3 com; 4.46

413426; U88837; Hs.75354; GCN1 (general control of amino-acid synthesis 1, yeast)-like 1; GCN1 (general control of amino-acid synt; 4.44

121319; NMJ1 H iP 3: zi dii ir.edin: 4.44 

409512; AW979187; Hs.293591; melanoma differentiation associated protein-5; melanoma differentiation associated prot; 4.44

428995; AW004975; Hs.194716; MAD (mothers against decapentaplegic, Drosophila) homolog interacting protein, receptor activation anchor; MAD (mothers against decapentaplegic, Dr; 4.43

434551; BE387152; Hs.280858; ESTs, Highly similar to A35661 DNA excision repair cross-complementing protein ERCC3 [H.sapiens]; ESTs, Highly similar to A35661 DNA excis; 

418295; AW970043; Hs.238039; hypothetical protein FLJ11090; hypothetical protein FLJ11090; 4.42 
409243; AB037761; Hs.51743; KIAA1340 protein; KIAA1340 protein; 4.42 
437103; AW139408; Hs.152940; ESTs; ESTs; 4.42 

413186; AU077141; Hs.374548; solute carrier family 16 (monocarboxylic acid transporters), member 1; solute carrier family 16 (monocarboxylic; 4.42
456362 AW9"30C" - 799 hyp ica 1 protein Fl [ H i ( t or ii LP2115 4.40 
424078; AB006625; Hs.139033; paternally expressed 3; paternally expressed 3; 4.39 

448153; Y10805; Hs.20521; HMT1 (hnRNP methyltransferase, S. cerevisiae)-like 2; HMT1 (hnRNP methyltransferase, S. cerevi; 4.37
410134; U68140; Hs.58927; nuclear VCP-like; nuclear VCP-like; 4.36 

435523; T62849; Hs.11090; membrane-spanning 4-domains, subfamily A, member 7; membrane-spanning 4-domains, subfamily A; 4.35
400440; X83957; Hs.83870; nebulin; nebulin; 4.35

437218; AL117497; Hs.58185; ESTs, Weakly similar to T42727 proliferation potential-related protein - mouse [M.musculus]; ESTs, Weakly similar to T42727 prolifera; 4.34
430478; NM_014349; Hs.241535; apolipoprotein L, 3; apolipoprotein L, 3; 4.34 
432744; AA988835; Hs.38664; ESTs; ESTs; 4.33 

423173; AA442655; Hs.124942; protein phosphatase 2A 48 kDa regulatory subunit; protein phosphatase 2A 48 kDa regulatory; 4.33
415995; NM_004573; Hs.355888; phospholipase C, beta 2; phospholipase C, beta 2; 4.33
408728; AL137379; Hs.47125; hypothetical protein FLJ13912; hypothetical protein FLJ13912; 4.30
131222 567 Hs.273 9 ' zj .a pellucida glycoprotein 3A (sperm receptor); zona peilucida glycoprotein 3A (sperm re; 4.30 
427792; M63928; Hs.180841; tumor necrosis factor receptor superfamily, member 7; tumor necrosis factor receptor superfami; 4.29 
426227; U67058; Hs.154299; Human proteinase activated receptor-2 mRNA, 3'UTR; Human proteinase activated receptor-2 mR; 4.29 
427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, receptor for (CD16); Fc fragment of IgG, low affinity IIIb, r; 4.29
434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; pyruvate dehydrogenase phosphatase; 4.29

412314; AA825247; Hs.356084; downstream of G protein-coupled receptor 27 (GPR27) (SREB1); downstream of G protein-coupled recepto; 4.28
447827; U73727; Hs.19718; protein tyrosine phosphatase, receptor type, U; protein tyrosine phosphatase, receptor t; 4.28
426108; AA622037; Hs.166468; programmed cell death 5; programmed cell death 5; 4.28

428820; AA436187; Hs.172631; integrin, alpha M (complement component receptor 3, alpha; also known as CD11b (p170), macrophage antigen alpha polypeptide); integrin, alpha M (complement component ; 4.27
437908; AI082424; Hs.351043; ESTs; ESTs; 4.27 

444664; N26362; Hs.11615; map kinase phosphatase-like protein MK-STYX; map kinase phosphatase-like protein MK-S; 4.27 
429002; AW248439; Hs.2340; junction plakoglobin; junction plakoglobin; 4.26 
439334; AI148976; Hs.112062; ESTs; ESTs; 4.26 

425308; M97639; Hs.155585; receptor tyrosine kinase-like orphan receptor 2; receptor tyrosine kinase-like orphan rec; 4.25 
413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; interleukin 2 receptor, beta; 4.25
453648; W21493; Hs.380100; hypothetical protein FLJ14005; hypothetical protein FLJ14005; 4.24

447200; BE543146; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; Homo sapiens cDNA FLJ14028 fis, clone HE; 4.24
447528; AI612027; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; Homo sapiens, clone MGC:9381, mRNA, comp; 4.23
452721; AJ269529; Hs.301871; solute carrier family 37 (glycerol-3-phosphate transporter), member 1; solute carrier family 37 (glycerol-3-pho; 4.22
449810; AB008681; Hs.23994; activin A receptor, type IIB; activin A receptor, type IIB; 4.22
447198; D61523; Hs.283435; ESTs; ESTs; 4.22 

416714; AF283770; Hs.79630; CD79A antigen (immunoglobulin-associated alpha); CD79A antigen (immunoglobulin-associated; 4.22
425356; BE244879 H 1 tol polyphocpr t l n 1c p ohosphale-5-phosphalase, 14; 4.22 

422605; H16646; Hs.118666; hypothetical protein PP591; hypothetical protein PP591; 4.21

444535; AF011466; Hs.122575; EDG-4 (endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 4); EDG-4 (endothelial differentiation, lys; 4.21
417088; M54915; Hs.81170; pim-1 oncogene; pim-1 oncogene; 4.20 
421707; NM_014921; Hs.107054; lectomedin-2; lectomedin-2; 4.20

408717; AF045458; Hs.47061; unc-51 (C. elegans)-like kinase 1; unc-51 (C. elegans)-like kinase 1; 4.20
438485; W57578; Hs.378718; RAB7, member RAS oncogene family; RAB7, member RAS oncogene family; 4.19 
419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; PTK7 protein tyrosine kinase 7; 4.18
418755; Y14443; Hs.88219; zinc finger protein 200; zinc finger protein 200; 4.18
417212; AW952823; Hs.351547; NS1-binding protein; NS1-binding protein; 4.17
413686; AI469213; Hs.71404; ESTs; ESTs; 4.17 

419344; U94905; Hs.277445; diacylglycerol kinase, zeta (104kD); diacylglycerol kinase, zeta (104kD); 4.16

418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fusin); chemokine (C-X-C motif), receptor 4 (fus; 4.16

408482; NM_000676; Hs.45743; adenosine A2b receptor; adenosine A2b receptor; 4.16

422391; D63479; Hs.115907; diacylglycerol kinase, delta (130kD); diacylglycerol kinase, delta (130kD); 4.15

409421; AA199883; Hs.67624; ESTs; ESTs; 4.15 

415938; BE383507; Hs.78921; A kinase (PRKA) anchor protein 1; A kinase (PRKA) anchor protein 1; 4.14 

415198; AW009480; Hs.943; natural killer cell transcript 4; natural killer cell transcript 4; 4.14

424685; W21223; Hs.151734; nuclear transport factor 2 (placental protein 15); nuclear transport factor 2 (placental pr; 4.13






428479; Y00272; Hs.334562; cell division cycle 2, G1 to S and G2 to M; cell division cycle 2, G1 to S and G2 to; 4.13

423728; AW891294; Hs.132136; solute carrier family 4, sodium bicarbonate cotransporter, member 8; solute carrier family 4, sodium bicarbon; 4.13

i E54 i u t * T I i tion elongation I mi hondri Mi lali i 1 n , u i ' niitoch; 4.13 
420253; AI656055; Hs.96200; neighbor of A-kinase anchoring protein 95; neighbor of A-kinase anchoring protein 9; 4.12 

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 2489 expressed sequence; DNA segment on chromosome 12 (unique) 24; 4.12

400205; ; Hs.81848; NM_006265*:Homo sapiens RAD21 (S. pombe) homolog (RAD21), mRNA. (APO-1/CD95 (Fas)-associated phosphatase) (PTPN13), mRNA.; NM_006265*:Homo sapiens RAD21 (S. pombe); 4.12
425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic polypeptide; protein kinase, DNA-activated, catalytic; 4.11
419607; R52557; Hs.91579; Homo sapiens clone 23783 mRNA sequence; Homo sapiens clone 23783 mRNA sequence; 4.10 

459035; AW291109; Hs.332563; ESTs, Weakly similar to T31611 hypothetical protein Y50E8A.g - Caenorhabditis elegans [C.elegans]; ESTs, Weakly similar to T31611 hypotheti; 4.10

432512; NM_003284; Hs.3017; transition protein 1 (during histone to protamine replacement); transition protein 1 (during histone to ; 4.10

452875; BE275760; Hs.30928; DNA segment on chromosome 19 (unique) 1177 expressed sequence; DNA segment on chromosome 19 (unique) 11; 4.09

423804; AW403448; Hs.1706; interferon-stimulated transcription factor 3, gamma (48kD); interferon-stimulated transcription fact; 4.09

413745; AW247252; Hs.75514; nucleoside phosphorylase; nucleoside phosphorylase; 4.09

442091; AW770493; Hs.182874; guanine nucleotide binding protein (G protein) alpha 12; guanine nucleotide binding protein (G pr; 4.09

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease(APEX nuclease)-like 2 protein; apurinic/apyrimidinic endonuclease(APEX ; 4.08

421859; AA356620; Hs.108947; KIAA0050 gene product; KIAA0050 gene product; 4.08

432841; M93425; Hs.62; protein tyrosine phosphatase, non-receptor type 12; protein tyrosine phosphatase, non-recept; 4.08 
452069; AB028949; Hs.1 83994; KIAA1 025 protein; KIAA1026 protein; 4.08 

425069; AA687465; Hs.298184; potassium voltage-gated channel, shaker-related subfamily, beta member 2; potassium voltage-gated channel, shaker-; 4.07
418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxylic acid transporters), member 3; solute carrier family 16 (monocarboxylic; 4.07
424517; AI539443; Hs.131147; Homo sapiens cDNA FLJ12169 fis, clone MAMMA1000643; Homo sapiens cDNA FLJ12169 fis, clone MA; 4.07
434224; AA380731; Hs.84; interleukin 2 receptor, gamma (severe combined immunodeficiency); interleukin 2 receptor, gamma (severe co; 4.06

I 1 95922; ESTs; ESTs: -1.06 

432065; AA401039; Hs.2903; protein phosphatase 4 (formerly X), catalytic subunit; protein phosphatase 4 (formerly X), cata; 4.06

431194; D43704 H " 7 i. l Ii iui i c hannel, voltage-dependent, bela 3 suLuini i u el ultaye dependent, beta, 4 06 

418751, BE389014 1- r 7 | h<- r n< j nr= r u i ! ci ibjnit polypeptide 3 (p55, gamma); phosphoinositide-3-kinase, regulatory su; 4.06 
425923; NM_005026; Hs.162808; phosphoinositide-3-kinase, catalytic, delta polypeptide; phosphoinositide-3-kinase, catalytic, de; 4.05
408692; AL040127; Hs.34074; dipeptidylpeptidase VI; dipeptidylpeptidase VI; 4.04

446272; BE268912; Hs.14601; hematopoietic cell-specific Lyn substrate 1; hematopoietic cell-specific Lyn substrat; 4.04 

439176; AI446444; Hs.190394; ESTs, Weakly similar to B28096 line-1 protein ORF2 [H.sapiens]; ESTs, Weakly similar to B28096 line-1 pr; 4.04

417880; BE241595; Hs.82848; selectin L (lymphocyte adhesion molecule 1); selectin L (lymphocyte adhesion molecule; 4.04

410068; AI633888; Hs.58435; FYN-binding protein (FYB-120/130); FYN-binding protein (FYB-120/130); 4.03 

410639; BE269047; Hs.65234; hypothetical protein FLJ20596; hypothetical protein FLJ20595; 4.03 

427716; L38951; Hs.180446; karyopherin (importin) beta 1; karyopherin (importin) beta 1; 4.03

451050; AW937420; Hs.351869; ESTs; ESTs; 4.02 

449667; AB023227; Hs.23860; KIAA1010 protein; KIAA1010 protein; 4.02

448499; BE613280; Hs.77550; p53-regulated DDA3; p53-regulated DDA3; 4.01

437527; AI241019; Hs.145644; ESTs; ESTs; 4.01 

425118; AU076611; Hs.154672; methylene tetrahydrofolate dehydrogenase (NAD dependent), methenyltetrahydrofolate cyclohydrolase; methylene tetrahydrofolate dehydrogenase; 4.00

451931; AK000208; Hs.27267; Homo sapiens cDNA FLJ20201 fis, clone COLF1210; Homo sapiens cDNA FLJ20201 fis, clone CO; 4.00
412939; AW411491; Hs.75069; eukaryotic translation elongation factor 1 gamma; eukaryotic translation elongation factor; 4.00
409581; U66243; Hs.55039; mitogen-activated protein kinase 12; mitogen-activated protein kinase 12; 3.99
433577; AW007080; Hs.284192; ESTs; ESTs; 3.99 

439963; AW247529; Hs.6793; platelet-activating factor acetylhydrolase, isoform Ib, gamma subunit (29kD); platelet-activating factor acetylhydrola; 3.99
418629; BE247550; Hs.86859; growth factor receptor-bound protein 7; growth factor receptor-bound protein 7; 3.99
448633; AA311426; Hs.21635; tubulin, gamma 1; tubulin, gamma 1; 3.98 

402398; ; ; C19000263:gi|3108023|gb|AAC15755.1| (AC004659) BC62940 2 [Homo sapien5 ; |]66335; C19000263:gi|310B023|gb|AAC15765.1| (AC0; 3.97 
408414; AI114688; Hs.193400; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; ESTs, Weakly similar to 2109260A B cell ; 3.97
415012; NM_004383; Hs.77793; c-src tyrosine kinase; c-src tyrosine kinase; 3.97

416084; L16991; Hs.79006; deoxythymidylate kinase (thymidylate kinase); deoxythymidylate kinase (thymidylate kin; 3.95

422051; AW327546; Hs.111024; solute carrier family 25 (mitochondrial carrier; citrate transporter), member 1; solute carrier family 25 (mitochondrial ; 3.95
447887; AA114050; Hs.211610; caspase 8, apoptosis-related cysteine protease; caspase 8, apoptosis-related cysteine pr; 3.95

65694; H i <-T 14 

442994; AI026718; Hs.16954; ESTs; ESTs; 3.94 

420333; AJ001383; Hs.97084; lymphocyte antigen 94 (mouse) homolog (activating NK-receptor; NK-p46); lymphocyte antigen 94 (mouse) homolog (a; 3.94 
438456; AA913381; Hs.279763; ESTs; ESTs; 3.94 

422599; BE387202; Hs.118638; non-metastatic cells 1, protein (NM23A) expressed in; non-metastatic cells 1, protein (NM23A) ; 3.93
420162; BE378432; Hs.95577; cyclin-dependent kinase 4; cyclin-dependent kinase 4; 3.93

424829; NM_002507; Hs.1827; nerve growth factor receptor (TNFR superfamily, member 16); nerve growth factor receptor (TNFR super; 3.93
4475/4 AF1fc«rfabb H I 1 1 kt, k na e 1 393 

425797; AF002986; Hs.159545; platelet activating receptor homolog; platelet activating receptor homolog; 3.93

421910; NM_014586; Hs.109437; hormonally upregulated neu tumor-associated kinase; hormonally upregulated neu tumor-associa; 3.92

413781; J05272; Hs.850; IMP (inosine monophosphate) dehydrogenase 1; IMP (inosine monophosphate) dehydrogenas; 3.92

434334; AA912476; Hs.116750; Homo sapiens cDNA FLJ13221 fis, clone NT2RP4002075; Homo sapiens cDNA FLJ13221 fis, clone NT; 3.92

400262; ; Hs.75309; Eos Control; Eos Control; 3.90 

424977; AA349289; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT05581; Homo sapiens cDNA: FLJ22902 fis, clone K; 3.90
409799; D1 1928; Hs.76845; phosphoserine phosphatase-like; phosphoserine phosphatase-like; 3.90 
435206; AI432364; Hs.160594; ESTs; ESTs; 3.90 

439863; BE547830; Hs.375208; paired immunoglobulin-like receptor beta; paired immunoglobulin-like receptor beta; 3.90
413627; BE182082; Hs.246973; intron of Bicaudal D homolog 1; intron of Bicaudal D homolog 1; 3.90 
426265; AA421069; Hs.97896; ESTs; ESTs; 3.89 

451063; AW163702; Hs.25911; HLA-B associated transcript-2; HLA-B associated transcript-2; 3.89
407013; U35637; Hs.83870; gb:Human nebulin mRNA, partial cds; gb:Human nebulin mRNA, partial cds; 3.89 
437239; AW503395; Hs.5541; ATPase, Ca transporting, ubiquitous; ATPase, Ca transporting, ubiquitous; 3.88
400261; ; Hs.1802; Eos Control; Eos Control; 3.88

450447; AF212223; Hs.25010; hypothetical protein P15-2; hypothetical protein P15-2; 3.88 

422293; X94453; Hs.114366; pyrroline-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase); pyrroline-5-carboxylate synthetase (glut; 3.87
414251; AL042306; Hs.97689; VASA protein; VASA protein; 3.87 

417767; BE242241; Hs.82542; acyloxyacyl hydrolase (neutrophil); acyloxyacyl hydrolase (neutrophil); 3.87

414443; AU077268; Hs.76144; platelet-derived growth factor receptor, beta polypeptide; platelet-derived growth factor receptor,; 3.87
444099; D87432; Hs 1031 > solute carrier family 7 (cationic amino acid transporter, y+ system), member 6; solute carrier family 7 (cationic amino ; 3.86
419596; BE379320; Hs.91448; MKP-1 like protein tyrosine phosphatase Mf n i il intyrosir i ,nta-e386 
v-245839; He.1 i rotein polypeptide A; small nuclear ri prol pep 

443661; AA336609; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; Homo sapiens cDNA: FLJ23313 fis, clone H; 3.86
408056; AA312329; Hs.42331; ephrin-A4; ephrin-A4; 3.86 

410552; X66945; Hs.748; fibroblast growth factor receptor 1 (fms-related tyrosine kinase 2, Pfeiffer syndrome); fibroblast growth factor receptor 1 (fms; 3.85
450778; U01375; Hs.25450; solute carrier family 29 (nucleoside transporters), member 1; solute carrier family 29 (nucleoside tra; 3.85
418978; T85295; Hs.268606; ESTs; ESTs; 3.84 

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, regulatory subunit 1 (p35); cyclin-dependent kinase 5, regulatory su; 3.84
442980; AA857025; Hs.8878; kinesin-like 1 ; kinesin-like 1 ; 3.84 

44723? AW4998 ( H' Ic-ukii 10 recept pha nterleukin 10 receptor, alpha; 3.84 

434689; AF154115; Hs.4076; CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) phosphatase, subunit 1; CTD (carboxy-terminal domain, RNA polyme; 3.83
432539; AL138169; Hs.278378; karyopherin beta 2b, transportin; karyopherin beta 2b, transportin; 3.83
415684; D59356; Hs.374480; sorbitol dehydrogenase; sorbitol dehydrogenase; 3.63 
451598; N29102; Hs.79658; ESTs; ESTs; 3.82 

449433; AI672096; Hs.9012; ESTs, Weakly similar to S26650 DNA-binding protein 5 [H.sapiens]; ESTs, Weakly similar to S26650 DNA-bindi; 3.82
426359; AA376409; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; Homo sapiens cDNA: FLJ23313 fis, clone H; 3.81
435160; AB002374; Hs.4791; KIAA0376 protein; KIAA0376 protein; 3.80 

443402; U77846; Hs.9295; elastin (supravalvular aortic stenosis, Williams-Beuren syndrome); elastin (supravalvular aortic stenosis, ; 3.80
422753; AI928995; Hs.1575; small nuclear ribonucleoprotein D3 polypeptide (18kD); small nuclear ribonucleoprotein D3 polyp; 3.79 
421508; NM_004833; Hs.105115; absent in melanoma 2; absent in melanoma 2; 3.79 
414E06; D14694; Hs.77329; phosphatidylserine synthase 1; phosphatidylserine synthase 1; 3.79

428023; AL038843; Hs.374530; Homo sapiens cDNA: FLJ23bU> t - ne LN 15 i Homo sapiens cDNA: FU23602 fis, clone L; 3.79 

421654; AW163267; Hs.106469; suppressor of var1 (S.cerevisiae) 3-like 1; suppressor of var1 (S.cerevisiae) 3-like; 3.79

439668; AI091277; Hs.302634; frizzled (Drosophila) homolog 8, fn led (Drosophi | homolo R3 70 

411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; cytochrome b-245, alpha polypeptide; 3.78

I7C I* K 1 359cual-p ifi itv pho pHa e 4 dual city pi phi»e4 78 
425354; U62027; Hs.155935; complement component 3a receptor 1; complement component 3a receptor 1 ; 3.78 
423909; AJ223183; Hs.135194; Immunoglobulin superfamily, member 6; immunoglobulin superfamily, member 6; 3.78 
429687; AI675749; Hs.211608; nucleoporin 153kD; nucleoporin 153kD; 3.77

414177; AI351355; Hs.356303; uncharacterized hypothalamus protein HARP11; uncharacterized hypothalamus protein HAR; 3.77 
414135; NM_004419; Hs.2128; dual specificity phosphatase 5; dual specificity phosphatase 5; 3.77 
445817; NM_003642; Hs.13340; histone acetyltransferase 1; histone acetyltransferase 1; 3.77

410422; AL042014; Hs.63348; Homo sapiens, clone MGC:15203, mRNA, complete cds; Homo sapiens, clone MGC:15203, mRNA, com; 3.76
412146; M92444; Hs.73722; APEX nuclease (multifunctional DNA repair enzyme); APEX nuclease (multifunctional DNA repai; 3.76
413011; AW068115; Hs.821; biglycan; biglycan; 3.76
428157; AI738719; Hs.198427; hexokinase 2; hexokinase 2; 3.76

400288; X06256; Hs.149609; integ n ( or c r polypep p erect n receptor,; 3.75 

427378; BE515037; Hs.177556; melanoma antigen, family D, 1; melanoma antigen, family D, 1; 3.75 

405484;;;C3002124*:gi|12737280|reflXPJ06682.2|keralin18[Homosa r .en=]ihi 13 i )0i r'4" t H2737280|ref|XP_006682.2| k; 3.75 
450998; BE387614; Hs.25797; splicing factor 3b, subunil 4, 49kD; splicing factor 3b, subunit 4, 49kD; 3.75 
4 324 60; H12912; Hs.274691; adenylate kinase 2 adeny ute kinase 3 ,75 

* II T j I I r it/ f TFTR/MRP), member 2; ATP-binding cassette, sub-family C(CFTR' 3 74 

431M to AUJ8 2 mMAN AU?S Tn *° ALU8 - HUMAN ALU SUBFAMILY sx SEQUENCE CONTAMINATION WARNING ENTRY [H.saplens]; ESTs, Weakly similar 

453329; T97205; Hs.193400; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; ESTs, Weakly similar to 2109260A B cell ; 3.74
426440; BE382756; Hs.169902; solute carrier family 2 (facilitated glucose transporter), member 1; solute carrier family 2 (facilitated glu; 3.74

8330; AW150572; Hs.25731 6; ESTs; ESTs; 3.74 
419911; L15301; Hs.1276; BN51 (BHK21) temperature sensitivity complementing; BN51 (BHK21) temperature sensitivity com; 3.74
452695; AW780199; Hs.30327; mitogen-activated protein kinase-activated protein kinase 5; mitogen-activated protein kinase-activat; 3.73
442233; AW967149; Hs.28439; ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]; ESTs, Weakly similar to I38022 hypotheti; 3.73
451295; AI557212; Hs.17132; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; ESTs, Moderately similar to I54374 gene ; 3.73
410772; BE275297; Hs.194685; Homo sapiens clone 24675 mRNA sequence; Homo sapiens clone 24675 mRNA sequence; 3.73

426251; M24283; Hs.168383; intercellular adhesion molecule 1 (CD54), human rhinovirus receptor; intercellular adhesion molecule 1 (CD54); 3.72

449843; R85337; Hs.24030; solute carrier family 31 (copper transporters), member 2; solute carrier family 31 (copper transpo; 3.71
423523; AW299828; Hs.193580; ESTs; ESTs; 3.71 

413407; AI356293; Hs.75339; inositol polyphosphate phosphatase-like 1; inositol polyphosphate phosphatase-like ; 3.71
448336; R53848; Hs.44976; ESTs; ESTs; 3.70 

422083; NM_001141; Hs.111256; arachidonate 15-lipoxygenase, second type; arachidonate 15-lipoxygenase, second typ; 3.70
416087; AF045184; Hs.79008; SKI-INTERACTING PROTEIN; SKI-INTERACTING PROTEIN; 3.70 
442200; AW590572; Hs.235768; ESTs; ESTs; 3.70 
414280; BE410769; Hs.75873; zyxin; zyxin; 3.69 

409354; N68188; Hs.159472; Homo sapiens cDNA: FLJ22224 fis, clone HRC01703; Homo sapiens cDNA: FLJ22224 fis, clone H; 3.69
415276; U88666; Hs.78353; SFRS protein kinase 2; SFRS protein kinase 2; 3.69

439659; AW970780; Hs.59483; leucine-rich repeat-containing G protein-coupled receptor 5 (LGR6) mRNA; leucine-rich repeat-containing G protein; 3.69

446522; NM_003876; Hs.15196; putative receptor protein; putative receptor protein; 3.69

422785; AI824114; Hs.289088; heat shock 90kD protein 1, alpha; heat shock 90kD protein 1, alpha; 3.68

401083; ; ; NM_016582':Homo sapiens peptide transporter 3 (LOC51 296), mRNA, VERSION NM 01 6579,1 Gl; HU 016582*:Homo sapiens peptide transport 3.68 
413048; M93221; Hs.751 82; mannose receptor, C type 1; mannose receptor, C type 1; 3,66 
452690; AI536070; Hs.15085; ESTs; ESTs; 3.68 

428981; BE313077; Hs.93135; ESTs, Weakly similar to ALU2_HUMAN ALU SUBFAMILY SB SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU2_HUMAN ALU S; 3.68

415010; NM_004203; Hs.77783; membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase; membrane-associated tyrosine- and threon; 3.68

428579; NM_005756; Hs.184942; G protein-coupled receptor 64; G protein-coupled receptor 64; 3.68 

446430; AA346837; Hs.15075; hypothetical protein DKFZp434E2216; hypothetical protein DKFZp434E2216; 3.66

442013; AA506476; Hs.375009; Human DNA sequence from clone RP11-353C18 on chromosome 20 Contains ESTs, STSs, GSSs and CpG islands. Contains the NIFS gene for cysteine desulfurase, two genes for novel proteins and the gene for the; Human DNA sequence from clone RP11-353C1; 3.66
416602; NM_006159; Hs.367895; Protein kinase C-binding protein NELL2; Protein kinase C-binding protein NELL2; 3.65

44122 3 65 563042: Hs - 118820; Homosa P iens ' sil * r to RIKEN cDNA 0610012G03gene, clone MGC14132. mRNA, completecds; Homo sapiens, Similarto RIKEN cDNA 0610; 
413076; U10564; Hs.75188; wee1 (S. pombe) homolog; wee1 (S. pombe) homolog; 3.65

' ' „ \ I " r \ >■ I If i nerg plice form 2 [H.sapie ESTs, Weakly similar to S65657 alpha-1C-; 3.65 

452060; W26980; Hs.349089; ATP-binding cassette, sub-family F (GCN20), member 2; ATP-binding cassette, sub-family F (GCN2; 3.65
451558; NM_001089; Hs.26630; ATP-binding cassette, sub-family A (ABC1), member 3; ATP-binding cassette, sub-family A (ABC1; 3.65
414774; X02419; Hs.77274; plasminogen activator, urokinase; plasminogen activator, urokinase; 3.65
417426; NM_002291; Hs.82124; laminin, beta 1; laminin, beta 1; 3.65

450296; AL041949; Hs.24756; hepatocyte growth factor-regulated tyrosine kinase substrate; hepatocyte growth factor-regulated tyros; 3.64 
437669; AI358105; Hs.123164; ESTs, Weakly similar to match to ESTs AA667999 [H.sapiens]; ESTs, Weakly similar to match to ESTs AA; 3.64
414029; BE297731; Hs.75709; mannose-6-phosphate receptor (cation dependent); mannose-6-phosphate receptor (cation dep; 3.64
444388; AB033058; Hs 1 1 i i1 (IAA1232 protein; KIAA1 232 protein; 3.64 

425910; AAB30797; Hs.184760; CCAAT-box-binding transcription factor; CCAAT-box-binding transcription factor; 3.63 

451484; AV648896; Hs.283771; hypothetical protein; hypothetical protein; 3.63 

444513; H29627; Hs.79092; hypothetical protein FLJ14427; hypothetical protein FLJ14427; 3.63 

447495; AW401864; Hs.18720; programmed cell death 8 (apoptosis-inducing factor); programmed cell death 8 (apoptosis-induc; 3.62
408101;AW968504;Hs.278346;CDC2relatedoct , C I protein kinase 7; 3.62 

424732; D80001; Hs.152629; KIAA0179 protein; KIAA0179 protein; 3.62 
411165; NM_000169; Hs.69089; galactosidase, alpha; galactosidase, alpha; 3.62
422112; BE540240; Hs.111783; Lsm1 protein; Lsm1 protein; 3.62 

453020; AL162039; Hs.31422; Homo sapiens mRNA; cDNA DKFZp434M229 (from clone DKFZp434M229); Homo sapiens mRNA; cDNA DKFZp434M229 (fr; 3.61
438795; AAB25792; Hs.3771 19; gb:cd84b1 1 .s1 NCI_CGAP_Ov2 Homo sapiens cDNA clone, mRNA sequence; gb:od84b1 1 .s1 NCI_CGAP_Ov2 Homo sapiens ; 3 61 
445515; BE388665; Hs.179999; Homo sapiens, clone IMAGE:3457003, mRNA; Homo sapiens, clone IMAGE:3457003, mRNA; 3.61
407797; AK000524; Hs.39850; hypothetical protein FLJ20517; hypothetical protein FLJ20517; 3.60 

423217; NM_000094; Hs.1640; collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive); collagen, type VII, alpha 1 (epidermolys; 3.60 

444985; AI677737; Hs.380100; hypothetical protein FLJ14005; hypothetical protein FLJ14005; 3.60

433764 AW753676 i n | FN (NM_0 j f I It i 1 n?3929); 3.59 

427857; AL133017; Hs.288679; hypothetical protein FLJ22865; hypothetical protein FLJ22865; 3.59

415020; BE249915; Hs.293533; Human DNA sequence from clone RP11-127I2C on chromosome 10. Contains ESTs, STSs, GSSs and CpG islands. Contains the gene for a novel glutathione-S-transferase and five novel genes; Human DNA sequence from clone RP11-127L2; 3.59
115149 12451; Hs.7805 I n L; cathepsin L 

458715; AK000973; Hs.1706; hypothetical protein FLJ10111; hypothetical protein FLJ10111; 3.57

423576; NM_000383; Hs.129829; autoimmune regulator (autoimmune polyendocrinopathy candidiasis ectodermal dystrophy); autoimmune regulator (autoimmune polyen; 3.57
440270; NM_015986;Hs.712O; cytokine ecentorlikp rr I tokin pto =cul-9 3 57 ' 

434883; AW381538; Hs.19807; hypothetical protein MGC12959; hypothetical protein MGC12959; 3.57

W J n 1 n i nc finger protein 278 (ZNF278), transcript variant 1, nRNA.; NM_014323':Homo sapiens zinc finger prot; 3.57 

449656; AA002008; Hs.188633; ESTs; ESTs; 3.56

413795; AL040178; Hs.142003; ESTs; ESTs; 3.56 

406859; AI581134; Hs.181357; laminin receptor 1 (67kD, ribosomal protein SA); laminin receptor 1 (67kD, ribosomal prot; 3.56
411030; BE387193; Hs.67896; 7-60 protein; 7-60 protein; 3.56

447079; AA280057; Hs. 105280; ESTs, Weakly simi'ar to dJ953K23.l [i . mK ? ESTs Weaklysim rr to dJ963K23.2 [H.sa; 3.56 

424263; M77640; Hs.1757; L1 cell adhesion molecule (hydrocephalus, stenosis of aqueduct of Sylvius 1, MASA (mental retardation, aphasia, shuffling gait and adducted thumbs) syndrome, spastic paraplegia 1); L1 cell adhesion molecule (hydrocephalus; 3.55
413472; BE242870; Hs.75379; solute carrier family 1 (glial high affinity glutamate transporter), member 3; solute carrier family 1 (glial high affi; 3.55

55; BE243123; Hs.321045; II I - iducible IkappaB kinase: II kinase n I 

426746; J03626; Hs.2057; uridine monophosphate synthetase (orotate phosphoribosyl transferase and orotidine-5'-decarboxylase); uridine monophosphate synthetase (orotat; 3.55
450931; N25156; Hs.25648; tumor necrosis factor receptor superfamily, member 5; tumor necrosis factor receptor superfami; 3.55
425836; AW955696; Hs.90960; ESTs; ESTs; 3.54 
441054; AA913591; Hs.126480; ESTs; ESTs; 3.54
440592; AL137268; Hs.7285; KIAA0759 protein; KIAA0759 protein; 3.54
15! IE a, 1Q71S 12311 ESTs ESTs 3.K 

449027; AJ271216; Hs.22880; dipeptidylpeptidase III; dipeptidylpeptidase III; 3.53 
421662; NM_014141; Hs.106552; cell recognition molecule Caspr2; cell recognition molecule Caspr2; 3.53
422732; AA577455; Hs.24937; transformer-2 alpha (htra-2 alpha); transformer-2 alpha (htra-2 alpha); 3.53
424870; T15545; Hs.244624; ESTs; ESTs; 3.52

442794; AI744130; Hs.356753; hypothetical protein MGC2975; hypothetical protein MGC2975; 3.52

417640; D30857; Hs.82353; protein C receptor, endothelial (EPCR); protein C receptor, endothelial (EPCR); 3.51 

419971; AA400027; Hs.296234; ESTs, Weakly similar to T31613 hypothetical protein Y50E8A.i - Caenorhabditis elegans [C.elegans]; ESTs, Weakly similar to T31613 hypotheti; 3.51
410257; BE244044; Hs.61469; hypothetical protein; hypothetical protein; 3.51 

424837 BE2761 13 1 J t r horn f ARD1 ; N-acetyltransferase, homolog of S. cerev 3 51 

421921; H83363; Hs.355993; translocase of inner mitochondrial membrane 10 (yeast) homolog; translocase of inner mitochondrial membr; 3.50

454128; AL031259; Hs.367900; programmed cell death 2; programmed cell death 2; 3.50 

434049; AA501430; Hs.5771; amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 2; amyotrophic lateral sclerosis 2 (juvenil; 3.50

453641; AA444140; Hs.90960; ESTs; ESTs; 3.50 

429592; AB029041; Hs.209646; KIAA1118 protein; KIAA1118 protein; 3.49

430647; AC003682; Hs.127988; ESTs, Weakly similar to Z211_HUMAN ZINC FINGER PROTEIN 211 [H.sapiens]; ESTs, Weakly similar to Z211_HUMAN ZINC ; 3.47

410855; X97795; Hs.66718; RAD54 (S.cerevisiae)-like; RAD54 (S.cerevisiae)-like; 3.44 

413372; H55532; Hs.349695; tubulin, alpha 2; tubulin, alpha 2; 3.07

437224; AL117628; Hs.97808; ESTs; ESTs; 2.77

430439; AL133561; ; DKFZP434B061 protein; DKFZP434B061 protein; 2.76 

435897; AF269223; Hs.128322; t-complex 11 (a murine tcp homolog); t-complex 11 (a murine tcp homolog); 2.53

417592; AA204664; Hs.182437; ESTs, Weakly similar to I54383 chromosome segregation prote d [H i | ESTs, Weakly similar to I54383 chromosom; 2.46 

412026; AA383618; Hs.73073; testis-specific ankyrin motif containing protein; testis-specific ankyrin motif containing; 2.35
422789; AK001113; Hs.120842; hypothetical protein FU13251 by thet col lein FU10251 3 
426627; AF012359; Hs.195685; ESTs; ESTs; 2.12 

438983; AF085884; Hs.20029; proacrosin binding protein sp32 precursor; proacrosin binding protein sp32 precurso; 2.07
425709; AA383076; Hs.159274; outer dense fibre of sperm tails 1; outer dense fibre of sperm tails 1; 1.99 
433724; AI827749; Hs.144924; serine/threonine protein kinase SSTK; serine/threonine protein kinase SSTK; 1.68

420710; NM_007009; Hs.99875; zona pellucida binding protein; zona pellucida binding protein; 1.54
AA210987 D57294 AA214584 AA207006 D56572

BG036675BF772005BF771866 B3961^t BG 0 -1 M'.<_ 005712 AF110315 BE074534 BE182776 BE158000 BE157999 BE714315 AWB18104 
AW847519AA099426AW817981 AV\r t 96 B( ,1 - -224493 AA308542 AW821833 BF902155 AI732411 BG778834 BG283641 BE748279 
BE748870 BG319540BE748864 BF73 4 BG 155 Al 5 '283 BI861466 AA563341 AA457591 BG949294 AW392886AA071122 AA227849 
AA584918 BG959570 BF773486 AL041 698 BF959013 RS717Q C16859 BF770411 BF771298 A1075321 L13823AA216700BF771864AW861859 
BE537068 C18935 M155719 BF771172 BF769107 BF804964 AW81B172AWB18143AW392930 AW817057AW858044BF746211 M179928 
AW861687AW821826BI055726 BF242S43 AA207189 BF770412 BF771157 BG430030 AA055592 

A»l 3 il«^ 809 ,i° 7AA218567 BF8 B63AI2671 BF! i7,P " H 9 "AW8S1851AW858362AI817548BF771300AA113928AA223422 
AA055556 BF773400 BF998869 BE081333 BE073424 BE142245 H59571 H59570 BF87155B BF871064 BE001132 BF826831 AW754298 
AA223267 BG997895 BG997897 AW991957 AA534354 BG319501 BF736309 AI694265 AA045564 BG950256 AI829309 BG987850 BE093175 
BF854337 

AA933717 BF061897AW628327AA6417B8M400495 

AW973352BF222929AW016853BF^ 1 -I l c LT551767 AA558414 AI339359 BF059601 AI961162 AI341422 A1206248 AI206165 
AA548736 AA768578 AI539081 AW025957 AA735837 N79575 AW594357 AA480892 



AL133561 AL117481 AL122069 AW439292 AI96882S AL041090 



Unique number corresponding to an Eos 
Sequence source. The 7 digit numbers ir 
of human chromosome 22.' Dunham I. eta 
Indicates DNA strand from which exons v 



51) numbers. "Dunham I. et al." refers to the publication entitled The DNA 



402678 
403171 
406137 
401704 
402677 
402679 
402145 



405484 
401083 
404976 



84187-84744 

137634-1 37768, 1 39702-1 39893, 1 40475-1 4059 
1 13765-1 1 3910, 1 1 5653-1 1 5765, 1 1 6808-1 1 594 
3739537514,37866-37981 



30437-31058 
24712-25374 

22135-22309,23063-23238 
132079-132216 



172780-174358 
24019-24973 

199214-199579,199672-199920,200262-20049 



TABLE 58A: 434 genes upregulated in bladder cancer relative to normal body tissues

Table 58A lists about 434 genes upregulated in bladder cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules, peptides,
or antibodies. These genes were selected from 59580 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis
was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The predicted protein sequences contain structural domains
indicative of having oncogenic function, of transducing intracellular signals, or of being modulatable by small molecules, peptides, or antibodies (e.g., pkinase,
phosphatase, or ion_transporter). Certain predicted protein domains are noted.
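
By way of illustration only, the domain-based part of the selection described above can be sketched in a few lines of Python. The set of target domain names and the function name are hypothetical and are not taken from the specification; the three sample rows reuse Pkey values and domain annotations that appear in Table 58A.

```python
# Hypothetical domain classes of interest; these names are illustrative
# assumptions, not a list given in the specification.
TARGET_DOMAINS = {"pkinase", "Y_phosphatase", "ion_trans"}


def has_target_domain(domain_field, targets=TARGET_DOMAINS):
    """Return True if any comma-separated domain name is in `targets`."""
    names = {name.strip() for name in domain_field.split(",") if name.strip()}
    return bool(names & targets)


# Pkey -> predicted domain names, copied from rows of Table 58A.
genes = {
    "424490": "ank,pkinase",
    "430630": "cystatin",
    "413441": "SH2,SH3",
}

selected = sorted(pkey for pkey, doms in genes.items() if has_target_domain(doms))
print(selected)
```

Only the domain names are compared here; the TM and SS flags that follow the domain list in each row are ignored in this sketch.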

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar accession number, GenBank accession number 

UniGeneID: UniGene number

Pred.Prot.Domains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:275-280)
UniGene Title: UniGene gene title 

R1: 90th percentile of bladder tumor AIs divided by the 50th percentile of normal tissue AIs
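
As a non-limiting worked example, the R1 ratio defined above can be computed as follows. The specification does not state which percentile estimator was used, so linear interpolation between order statistics is an assumption here, and the AI values are made-up numbers chosen only to show the arithmetic.

```python
import math


def percentile(values, q):
    """q-th percentile (0-100) by linear interpolation between order statistics.

    The interpolation rule is an assumption; the source does not specify one.
    """
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def r1_ratio(tumor_ais, normal_ais):
    """R1: 90th percentile of tumor AIs over 50th percentile of normal AIs."""
    return percentile(tumor_ais, 90) / percentile(normal_ais, 50)


# Illustrative average-intensity (AI) values, not data from the tables.
tumor = [120.0, 340.0, 560.0, 810.0, 900.0]
normal = [20.0, 30.0, 40.0, 50.0, 60.0]
print(round(r1_ratio(tumor, normal), 2))
```

With these inputs the 90th percentile of the tumor values is 864 and the median of the normal values is 40, giving a ratio of 21.6.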

Pkey; ExAccn; UnigeneID; Unigene Title; Pred.Prot.Domains; R1

430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin;TM=M;; 35.25 

422282; AF019225; Hs.114309; apolipcp. nl 1o ExbB;TM S=M; 3 5 

414555; N98569; Hs.76422; phospholipase A2, group IIA (platelets, ; phoslip;TM=M;SS=Y; 31.60

415192; D17793; Hs.78183; aldo-keto reduclasefamil/ , m to k red;TM=t ) ! 

417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 28.50

?™™fll^ r r | ch r pknase Recep L domain.Peptidase M24; 27.43 

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; IL1;SS=M; 25.98

413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrcq n cm fH none; 25.38 

418818; AA228899; Hs.101307; Homo sapiens HUT11 protein mRNA, partial; UT,none; 25.28

425397; J04088;Hs 156346, topoisomerase (DW) II (170kD];Dt ,yr B,DN topoPoiV.HATPase c;SS=M; 23.58 

421733; AL11 9671 ;Hs.1 420; fibroblast growth factor receptor 3 (a i ig.pkinasc I" f;SS=V; 21.24 

424008; R02740; Hs.137555; putaUvechemokine receptor, GTPbindirn trr,J;TM=Y;SS=f i 

447343, AA256641 Hs ^ I; ESI hly similar to SO Ipl w none 19 78 

408243; Y00737 Hs.624; inteileukin 8; HLH,PAS,lL8;TM=J 18 90 

427490; Z951 52 H 1 rr I rotein kir rl- na e TM=M;; 18.75 

' h c < i it- r f 

444381; BE387335; Hs.283713; ESTs, Weakly similar to S64054 hypotheti; Collagen;TM=M;SS=M; 18.60
452747; BE153855; Hs.61460; Ig superfamily receptor LNIR; ig.Rhabd q ycop;Tf*YS8-M; 18.55' 
415444; BE247295; Hs.78452; solute carrier family 20 (phosphate tran; PHO4,LIM;TM=M;; 18.25
413132; NM_006823; Hs.75209; protein kinase (cAMP-dependent, catalyti; PKI;SS=M; 17.73

427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 17.58

450746; D82673; Hs.278589; general transcription factor II, i; none,SH3,PX; 17.12

418945; BE246762; Hs.89499; arachidonate 5-lipoxygenase; lipoxygenase,PLAT;TM=M;; 15.88

420981; L40904; Hs.100724; peroxisome proliferative activated recep; hormone- rec.zf ZMV M 16 78 

439941; AI392640; Hs 18272, amino acid transporter j I 1 1 T =Y 16 75 

431846; BE019924; Hs.271580; uroplakin 1B; transmembrane4;TM=Y;SS=M; 16.56 

J 1 ' Hs.23473 , I , l/s,ig,rAC nil dl Idl pkin 3 M 

414883; AA926960; ; CDC28 protein kinase 1 ; CKS;; 16.20 

438091; AW373062; ; nuclear receptor subfamily 1, group I, m; hormone rec,zf-C4,none; 15.80 
S63;AW24752M H I t 3 fac cety'hydrola; PAF-AH_lb,Li[ GDSL TM=M;; 15.70 

428450; NM_014791, Hs 184339, KIAA0175 gene product, Ka1 pkin TM=r 1 5.6 
422278; AF072B73; Hs.114218; frizzled (Drosophila) homolog 6, Fz , rizzled,7tn " rM=Y SS=M; 15.45 
434293; NM_004445; Hs.3796, EphB6, EPHJbd fn3 pkinas SAM i >o, ■ ^ , 

417880; BE241595; Hs.82848; selecUn L (lymphocyte adhe ion r EGF,ledin_c rl M <=s M; 15.28 

iS'K r r 1 MED K - d " H,r| 1 n TM=fSS=M; 15.10 

416305; AU076628;Hs.79187; coxsackievirus and adenovirus rec p j;T M; 14.90 

432306; Y18207; Hs.303090; protein phosphatase 1, regulatory (inhib; CBM_21;TM=M;; 14.80
413076; U10564; Hs.75188; wee1 (S. pombe) homolog; pkinase;TM=M;; 14.73
429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAM; 14.58 

449230;BE613348;Hs.211579;melanomaceUjl oni ailc-;ig od bosomalj box <-> SS-M 14 55 
421508; NM_004833; Hs. 105115; absent in melanoma 2 P L )APIN,HIN;TM=f 1 
446006; NM_004403; Hs.13530; deafness, autosomal dominant 5; none;TM=M;SS=M; 14.35
429556; AW139399; Hs.98988; ESTs; none;TM=M;; 14.18

416224; NM.002902; Hs.79088, reticrK is [ ,,j Icium bindin elhand SS=M; 14,13 

^, 88 „ 4 5 07: Hs ' 21 15951 pratein tyrosine Phosphatase, non-recepl; Y_phosphatase,Band_41,PDZ;SS=M; 13.90 
426657; NM_015865; Hs.171731; solute carrier family 14 (urea transport; UT;TM=Y;; 13.83
428157; AI738719; Hs.198427; hexokinase 2; hexokinase,hexokinase2,none; 13.80

^T^T^f^'^ 0 " 3 ^" h ' a r ' ^ » ' r r p( *W-«eP» b,granLlin,BNR;TM=Y;SS=M: 13.78 

439453; BE264974; Hs.6566; thyroid hormone receptor inte * lor 1 AAA.AB n W=M;;13.3£ 

432314; AA533447 H 1 1 I89; ESTs ini none 13 25 

413109; AW389845; Hs.110855; ESTs; PHO4,none; 13.15

424490; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pkinase;TM=M;; 13.13

426490; NM_001621; Hs.170087; aryl hydrocarbon receptor; PAC,PAS;TM=M;; 12.93

r I7? AF1 5 H I! ' 7 ni ea, ptori , ting proteir no S~M ' 8 

11 I r hi r I 0 C f 11 r 

' \ 1 EGP1 EGFJNFF JB 1 r.tin B2TIL;TM=M;SS=M; 11.93 

443195; BE148235; Hs.193063; Home sapiens cDNAFU14J)1 li cloi NT;/ trai r.on, 11 £ 

WD00 UMWh : cCHAP™* 

412182; AA205588; Hs.155150; Splicing factor, arginine/serine-rich, 4; rrm,hormone_rec,zf-C4,sugar_tr; 11.85
433470; AW960564; ; transmembrane 4 superfam \) r t t r , T , 34 1.80 
425118; AU076611;Hs.154672; methylene tetrahydrofolate dehydrogenase; myb DNA- 

418870^,4^ L 1 ' F ' r "^nur P ermease,HC03.cotransp,™=M;; 11.69 

418870, AF147204;Hs.89414;chemokme(C-X-C motif], receptor 4 (fi 7tm_1,7lm_2;TM=Y;SS=M 1.50 

426761; AI015709; Hs.172089; Homo sapiens mRNA; cDNA DKFZ P 5S6I2022 I lone; i M=Y;SS=M; 11.48 

2S £K55 I S 2 H °, m ,° "P 6nS mRNA M lenglh msert cDNi ,MPDH -CJMroHJJ,CBS,inlegrin_B,Ricin.BJeciin ; 11.38 

446742; AA232119; Hs.16085; putative G-protein coupled receptor; none;TM=Y;SS=M; 11.23

436729; BE621 807; ; transmembrane 4 superfamily member 1; none, [!.'-- ,, C F M, 11 lb 

409960; BE261944; Hs.339573; hexokinase 1; none,none; 11.02

426539;AB011155;Hs.170290,drc I 3 e(Dro phi homolog 5; SH3,PDZ,Guanylale kin;TM=M;; 10.78 
17321; BE245149; Hs.02643; pr , I m i 

4 1; AA410183; H 475; ESTs; ion _ti c,IGFBP,tsp_1; 1 

41 B004;U3751 9; Hs.87539; aldehyde dehydrogen se3famil Tiber i Jh TM-M SS=M' 10 53 

447365; BE383676; Hs.334; Rho guanine nucleotide exchange factor ( c =>H,RhoGEPTM=M 0 53 

449437; AI702038; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone K; none,none; 10.52

436856; AI469355; Hs.127310; ESTs; pkinase,rrm;TM=M;; 10.48

451035; AU076785; Hs.430; plastin 1 (I isoform); efhand,CH,Adaptin_N;SS=M; 10.38

«?™ ^ 2 L 9 , 53 °, ; Hs 83968; integrtn ' beta 2 (anli9en 0018 * integr;n_B,EGF,PS!;TM=Y;SS=M; 10.35 

H 1 3 , D49B, alpha 2 sul it Ft M; 10.34 

440006; AK000517; Hs.6844; hypothetical protein FLJ20510; AAA,NB-ARC,PAAD_DAPIN;NA;NA; 10.25
420344; BE463721; Hs.97101; putative G protein-coupled receptor; Mcth u, I TM=Y;SSH 1 
437852; BE001836; Hs.256897; EST. e imilarto 1365012.1 [I I 3P3~ In 2 TM-Y '0 13 
]' 2 NM - 0031 < Pi ililmreated EGF,fn3,ldl_recept_a Ial_r6 ot_b,grai li 1 1 1 ^ TM Y SS-M; 1 0.08 
427700; AA262294; Hs.180383; dual specificity phosphatase 6; Rhodanese,DSPc;TM=M;; 10.05

fmlla m ^lt^I^ m fUnCti0na ' d0man (FTPRr lnl ' ,ci H3 K PH spectrin,RhoGEF;TM=M;; 10.05 

400496; ; ; ENSP00000224716*:GTP-binding protein SAR; none;TM=Y;; 10.01

413899;AF083892 Hs 7 tight j lion protein 2 (z jc S O PDZ Guanylate km,TM=M" 10 00 

404568; ; ; NM_022071*:Homo sapiens hypothetical pro; SH2;TM=M;; 10.00

444823 BE ^62981 H 1 15; p Mral.f TN MSS 1 1.93 

421425; AK001564; Hs.104222; hypothetical protein FLJ10702; efhand,kazal,arf,ras,7tm_1;TM=M;; 9.90

413441; AI929374; Hs.75367; Src-like-adapter; SH2,SH3;TM=M;; 9.90

424954; NM_000546; Hs.1846; tumor protein p53 (Li-Fraumeni syndrome); P53,WD40,IRK;TM=M;; 9.88
452289; BE568205; Hs.28827; mitogen-activated proteir na i pkinasej 1;; 9 
439223; AW238299; Hs.250618; UL16 binding protein 2; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 9.83
429238; NMJ02849; Hs.1982B8; protein tyros re pi rr l phospt 11 f 8 

SmK 4 ! 9 '' 1 128 Jua ' ! - lticlt y Phosphatase 5; Rhodanese.DSP-'. phosp 3 ta e;TM= 1.73 
452239; AW379378; Hs.170121; protein tyrosine phosphatase, receptor t; none,none; 9.73
!S M284166;Hs - 84113;cyc,ln - dependenlkl ! Phosphatase^ M 

403912; ;; C5000394*:gi|12737280|ref]XP_006682.2| I* r,one;TM M '0 

427315; AA179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp564N0763 (f; none,spectrin,SH3,PH,CH; 9.70 
428428; AL037544; Hs.184298; cyclin-dependent kinase 7 (homolog of Xe; pkinase;TM=M;; 9.68

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor ■ kinesir in -, , t rcspha ase;TM=M;; 9.63 

429083; Y09397; Hs.227817; BCL2-re!ated protein A 1 t 2 f \ 163 

425322; U6363C Hs.1556 pr leii ina 5 Dil, activated, catalytic. F Pt4_kinase,F T FATC;TM=M;; 9.55 
430259 BE550182 H E P F I 10 c EF;TM M SS-M 9.50 

428520; AA331901; Hs.184736; hypothetical protein FLJ10097; none;TM=M;; 9.50
418969;W33191;Hs.28907.hypothfic»i"i ein FLJ202 43'TM-M I 

448913 c^2PHD 2 B T H t r \ r -^r 5 r ss ' Gs ™ 

414911, NM_000107 H- 602; damag n>cFc DNA binding protein 2 (4: WD40,homeobox,LIM;TM=M;; 9.48 
^H? 5;AI557212;Hs - 17132; ES Ts, Moderatelysimilartol54374 gene ; pkinas e ,DAG_PE-bir,d,pkin a se C,OPR,none; 9.45 
402328; ; ; Target Exon; pkinase;TM=M;; 9.44

443710; AI928136; Hs.9691; Homo sapiens cDNA: FLJ23249 fis, clone C; G-alpha,none; 9.42
414987; AA524394; Hs.294022; hypothetical protein FLJ14950; SH2;TM=M;; 9.42 

440G7 r AW0053 4 I 4 I , A'eakly similar to KCC1_HUVAN CALCI; pkinase,none; 9.35 
433376; AI249361;Hs.74122;caspase 4, apoptosis-related cysteine pr; CARD,ICE_p1 0.ICE p20;3S -M- 9 23 
410668; BE379794 Hs 654C3 hypothetical prctem i th.TNFF 6;TM= = S=M9 25 
430024; AI808780; Hs.227730; integrin, alpha 6; integrin_A,FG-GAP;TM=Y;SS=M; 9.23
452696; AI826645; Hs.211534; ESTs; ArfGap,PH,ank,Guanylate_kin,PDZ,SH3; 9.13
434263; N34895; Hs.44648; ESTs; ig,none; 9.13

407949;W21874 Hs?47i, EST , iky imtar to 2109260A B cell RibosomaLS14,ank,pkinase,death,none; 9.10 
429332;AF030403; Hs.199263; Ste-20 related krr e ol r r = f 

17426; NM_002291 nn I i 4 i 

443951; F13272; Hs 111334 ferritin light potyprptijo PVF j none; 9.0/ 
414368; W70171; Hs.75939; uridine monophosphate kinase; PRK,CoaE;; 8.98 
451292; AB037716; Hs.26204; KIAA1295 protein; SH3;TM=M;; 8.93 
438000; AI825880; Hs.5985; non-kinase Cdc42 effector protein SPEC2; none;TM=M;; 8.90
446620; AA128808; Hs.179902; transporter like pr I n c TN" S=M;8.90 
436075; BE090176; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 8.88 
437056; AM 47061; ; gb:ok33a11.s1 Soares_NSF_F8_9W_OT_PA_P_S; none,spectrin,SH3,PH,CH; 8.78
445496; AB007860;Hs.12802; development ndd tiation enhancin; SH3.ank.PI T 
418203, X54942 H - r 3 icier km CKS 8.75 
434608; AA805443; Hs.179909; hypothetical protein FLJ22995; none;TM=M;; 8.70
445033; AV652402; Hs.72901; mucin 13, epithelial transmembrane; ank;; 8.65
417640; D30857; Hs.82353; protein C receptor, endothelial (EPCR); none;TM=M;SS=M; 8.65
432841; M93425; Hs.62; protein tyrosine phosphatase, non-recept; Y_phosphatase;SS=M; 8.65
430397;AI924533;Hs.105607;bicarbonatetransporterrelotcdprot3in HCO solransp;TM=i 64 
448888; AW196663; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;; 8.60
442994; AI026718, Hs 16954; ESTs , ( r,t,okinase,deatf,RbosDmal_S14; 8.60 
4 c L HI P ytosol IT, 4 r r T , . n 

455439;AW945484,Hs1B4252 FT kly similar to ALU8 HUMAN Al irr Ir I 5 
437763; AA469369 H 58 I ti su inhil iter of metalloproteinase 1 ; TIMP,pkinase,DAG_PE-bind,RBD; 8.43 
417035; AA192455; Hs.22968; Homo sapiens clone IMAGE:451939, mRNA se; none,none; 8.40
418478; U38945; Hs.1174; cyclin-dependent kina e nhibil i !A me ank 339 
448209; AW160489; Hs.20709; tetra-paa I ansa r ran l;TM=Y;SS=M 8.33 
450139; AK001838; Hs.296323; serum/glucocorticoid regulated kinase; none,none; 8.33
445350; AF052112; Hs.12540; lysopho pholip e I; abhydrolase TM=M831 
427509; M62505; Hs.2161; complement component 5 receptor 1 (C5a I; 7tm_1 :TM=Y;SS=U; 8.30 

1-1 ESTs, A -i 689 hypotheti; IRK.none: 3.2 

446719; W39500; Hs.301872; hypothetical protein MGC4840; AAA,SKI;TM=M;; 8.23 
424762; AL119442; Hs.183684 eira.ri, trans tio initialioi actor onercne820 
?lVn} n ^- mm H 1 11 n 1 Jl 1 tnacTMYSS=M;8.18 

416094; AW995512; Hs.225977; nuclear receptor coactivator 3; none,none; 8.15
419262; AA834664; Hs.29131; nuclear receptor coactivator 2; PAS,zf-C2H2,SET; 8.15
417386; AL037228; Hs.82043; D123 gene product; NUDIX,secY,E1 dehydrog.lranstet pyr;TM=Y SS-M 8 13 
431236 AV656840 1- 11 kin 13 r T t US M 8 10 

439658; AA332057; Hs.6639; hypothetical protein MGC15440; none;TM=M;SS=M; 8.09
425424; NM_004954; Hs.157199; ELKL motif kinase; pkinase,UBA,KA1;TM=M;; 8.08
422573; AW297985; Hs.295726; integrin, alpha V (vitronectin receptor: FG-GAP,inlegrin_A none' 6 05 
408912; AB01 1084; Hs.48924; KIAA0512gene product, ALEX. Am if 3 M 8 03 

447674; BE270640; Hs.19192; cyclin-dependent kinase pkinose;S -\' 3 33 
404891; ; ; Target Exon; none,none; 7.95

414278; AA33011 H 7727 Hma glucose transporter pseudogene; none.none; 7.93 

452888; AW955454; Hs.30942; ephrin-B2; Ephrin,fn2;TM=Y;SS=M; 7.93

433211 H1'E5C h 2606; N pkin UBA.KA1 S=M 7 91 

438485; W57578; Hs.237955; RAB7, member RAS oncogene family; pkinase,ABC1,none; 7.90 

1^2o? :J ^?Z! 2 ? 0:Hs - 118727; Homosapien S cDNAFLJ13692fi5, clone PL; HLH.death.TNFR c5.Acyl-CcA hydro; 7.90 

417381;AF164142;Hs.82042;solutecamerfanily2 ni eo x n_ r.permease.RA; 7.88 

427122; AW057736; Hs.323910; HER2 receptor tyrosine kinase (c-erb-b2 , Funn i ke ptir r i p_L_domain YLP-TM=Y'SS=M- 7 B6 
405036; ; ; NM_021628*:Homo sapiens arachidonate lip;iipoxygenase,compeAl_4 )Kd.PLAT;TM M 783 

lSSL 6 f^ f tr LRRiRRCT.MAV.Nucleopi n,n Tropomyosin,OPR,filament,bZIP,G i amma,M,DUF164;TM=M;; 7.83 

431912; AI660552 H 76549 EST Vv i | Act su e j r\ 1h N 780 

432981; NM_00273.<, Hs 3136; proton kinase, AMP-aotivated, gamma 1 n; CBSAe_trans:TM=M;; 7 73 

422940; BE077458; ; gb:RC1-BT0606-090500-015-b04 BT0606 H r Se 7.PH.ANF receptor lig ran,WD40,IRK- 7 78 

1 15767; citron (rho-inl ne rJH t bind.PF II 

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endoplasmic retic; ER_lumen_recept;TM=M;SS=M; 7.78

400845; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 7.73

11 ~! I e b > c* | > n n 

«719 AW9 86; Hs.7535 ubiquitin conjugating enzyme E2N (I ioI U3 con.Y pho I 3.SH 7 I 
403212;; ; NM_019595:Homo sapiens intersectin 2 ( r i H,RlioGEF;TM=M;; 7.70 

441190; H09073; Hs.25046; ESTs; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase,none; 7.68
42BBU £m£E?hV i'EEI 4612 C " me J Feta ' Brai " CDNA; 7t m J,zf-C3HC4 1 fn3,aPRY,KRAB, Z f-C2H2.rve,zf-B bcx;TM=Y;SS=M; 7 68 

425289,AW1C^V , 530; interferoi ir indiinble protein 16; PMD DAPiN HIN'SS=M' 7 63 
401927; ;; C17000914*gi|83943o7|r |NP_0 854E I none, 7 60 

421489; AI922821 H . Ts; none,Pi-PI PI-PL 

SEED'S u 24U " Ch nrd .« lc '™^ted,fani;none ; TM=Y ; SS=M;7.53 

431605;AW972407;Hs.124370;gb:EST384498 MAGEreseqi P n- - r der ci e 3RP C 4 TM-M-- 7 'in 

430570 AI417881 H 12464; EST 2 Fz Fnzzled.none; 7.50 ^ny,atek,nase,SRP54,TM-M„ 7.50 

™™ Su 27 ! 5 " 1 '™ Channel ' nom °Ne-gated 1 alpha; ASC;TM=Y;; 7.48 
^ ; n^ 4780;H5 - 4248;vav2cncoaene RhoGEr| " lU L| " PE bind none- 7 48 
444252 P^11 5;H '.Ft nonr;7.47 ~ 

AF 4 nQ« 4 ^ HS J 7 oS al f' 3 telan 9 iectasia and Rad3 rela ' e * FAT,FATC,PI3_PI4Jlnase;TM=M;; 7.47 

418546; AA224827;;gb:nc32g04.s1NCI_CGAP_Pr2 Homo sapn is i inteqrin ~G "AP none 1 7 45 
437860; AA333063; Hs.279898; Homo sapiens cDNA: FLJ23 G' e L | NA 7 4 
S : a*™? H f 49 ° 6; CST5 ' Weak " Simi,ai 10 T17210hypotheti; none.pkinase; 7.10 * 

^^^m^^tfTS^r ' ' V r " nare death SP^.SAP.BIxaoinaLiate.SRPSMDENN.DENN uDENN"TM=M" 740 

417141 : U22~662 n^°T ? ' f f™"'- 0rmln 9 ? d+!i; '-RR,PAAD_DAPIN,AAA,CARD,NB-ARC;NA;NA 7.38 ' " M ° 

«nn« S S nuclear receptor subfamily 1, group H, m; hormone_rec,zf-C4;SS=M; 7.38 
430016; NM_004736; Hs.227656; xenotropic and polylropic letrov r rec' SPX E>'S'TM-> 7 38 
422813;AV656571;Hs.121068;lransmernbrane4superfam!lyn-cmt r6 ! t r men'br UTM YSS-M-7 35 
433390; AA586950; Hs. 260180; Homo sapiens mRNA; cDNA DKFZp7> I , 7 pc n SH3 PH CH 7 25 
409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; SH2.SH3 pkin Tf , 1 SH3 ' PH ' CH ' 7 ' 25 
444745; AF117754; Hs.11861 thyroid hormone receptcr-asso-iated prat noncTM-M" 7 "3 
K m HOm ° Sa P! e ; S v PAK2 mRNA ' ^ s <* Pkinas e ,PBD;TM=M;; 7 19 

407591; NM_000910; Hs.37125; neuropeptide Y receptor Y2- 7tm MM=Y- 7 18 
427231; AW851989; Hs.285814 prouly ), phila) ho 3 g 4 t- SH3 TM M SS=M; 7 18 
Pectm donai, indinq prot in 1 SH3 TM=f I 
'334 M Memo ens solute my c-Tf Y 15 

1 I 1 4 hloiid n o I 1 ' \ 7 14 

^ ;N3 m1 10;H5 ' 305971;50|uIc ,rr r ' "ttatecTg gar t'r'.TM-Y,SS=r 14 
^ 7 ^^- 022819:Ho ™saP'ensphosp l oli F d l ,h i'7 1' 

433618; AA602539; Hs.345494; ESTs; G-alpha,A_deaminase; 7.10

X ^m KC H5 u °v^ H0m0 , Sapien5 CDNA: FU21425 fs ' olone °- Sufale_lransp,STAS,HMG box; 7.08 
426655; AL049589; Hs.171723; neuronal cell death-related protein; TFIID-31;TM=M;; 7.05

01 1 oactive mtestir pepti cceptor 1; 7tm. 2,HRf 1 , 03 

4JU41,AA307211;Hs.251531;proteasome(prosome,macropaini-i J t.unH nrolea meTM=M 6 93 
405275;AB028989;;mitogenactivatedproterkr i T l a c AvdTIL ntlFHO-qs-iui. K <n 

415392; Z44067; Hs.10957; ESTs; PIP5K none- 6 89 ^ s - K " 01 ' reF beta,vwa,vwc,vwd,TIL,DUF139,SS=M; 6.93 

429355 AW973253 H- 689; [ r rt - fcZIP Armadillo seq none- 6 88 
429663; M68B74; Hs.211587; phospholipase A2, group IVA (cytosoli'c ■ C2 PLA' B TM=M- 6 85 

401917; AL050149, , RAN binding plot In 3; O.exin.St T T STAt' bind STAT prot ion trans PAC PAS nn™>- R 7R 
Su073 M 58 00 H 3, 21,rn, Pn i \ 

4?fi7il ;A Si 3 ^ H n^ 823 ^ " ' r r ' a f ' ' 1 J plraQsSANLPNT;TM=U ; ;6.70 

408056; AA312:.r ' J 1 I; ephrin-A4; Ephrinjf =M;SS=M M 
401057; BE563196; ; eukaryotic translation elongation faclon ior.Jrans IQTM=Y'' 6 60 
!K H8 m» 1™™ r? 0 "' Homosa P iens cDNA FLJ13357 fs^lone PL; nor.e'.none; 6.60 
400528; ; ; NM_020975*:Homo sapiens ret proto-onccge; cadherin ckinaseTM-\ ' f< 65(1 
418562; R60669; Hs.124831; CGI-67 protein; none,Skp1,AAA; 6.57
453826; AL138129; ; gb:DKFZ P 547F152.r1 547 (synonym: hfbri) ; PK,PK_C.none- 6 55 
437419- RFOfiooon. u. Homo sapiens mRNA . C J NA DKF ^ p547C ^ 6 ' (fr . A b C 
serum/glucocorticoid regulated kina- r er 

lypothetical protein DKFZp434N185; F5 F8 type C,pMnase tis - t 

7?S S - 6727; R^-STPase^tivahng protein SH3 domain; TM-fe ! 6 4 

, 1 663904; Hs.89862;TNFRSF1 ociated via death domain: deatlvTM=M ■ C 4i 
441712; AW391927; Hs.7946; KIAA1288 protein; AIP3;TM=M;; 6.44
414557; AA3401 11; Hs.100009; acyl-Coenzyme A oxidase 1, palmhoyh pkinase 7trn V 6 43 

Apnl^ 3 ? HS ' 273333 ^ ^Po.he,ic a r protein FLJ 1 0986; FGGyIc-™^ ; G 43 
- - J;" I "I I k i il LJ108 r T t • 

S ;AF 5 5 ° 47H * LT nel BSH3 art none 6 40 

^on lA S 81 H ' lL " T r ' M rl H none 6 33 

438698; AW297855; Hs.125815; ESTs, Weakly simila lo I38022 IivddII etr liDoxvaBnase PI AT nnno- fi « 
427801; AW979155; Hs.298275; amino acid transporter 2; Aa_trans;TM=M;SS=M; 6.32
407970; AW403814; Hs.41714; BCL2-associated athanogene; ubiquitin,BAG,Tropomyosin;; 6.30
438464; AA669735; Hs.324743; protein phosphatase 4 regulatory subunit; none,none; 6.30
453977, AA886006 K 7F I pkinase,P2X_receptor.E1-E2_ATPase,Hydrolase; 6.30 
412491; W31589; Hs.73957; RAB5A member RAS once I FF Tf M I 

413235; BE243445; Hs.75248; topoisomerase (DNA) II beta (180kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c,DNA_gyraseB,DNA_topoisoIV,HATPase_c; 6.29
404342; ;;C7002192-:gi|7299207|gb|AAF544041 1 (AEf r r TM=M;; 6.27 
409274; NM_003930; Hs.52644; SKAP55 homologue; SH3,PH;SS=M; 6.25

419693; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 6.23
405429; ; ; Target Exon; Y_phosphatase,none; 6.23

404975; AL042279; ; uncharactenzed hypothalamus protein HT0; ingle;TM=Y;S S=M; b 2i i 
452929; AW954938; Hs.172816; neuregulin 1; Neuregulin,EGF,ig,Neuregulin,EGF,ig; 6.18

446883; AW452756; Hs.16364; hypothetical protein FLJ10955; DEAD,helicase_C,rrm,Ndr,Cys_knot,TIL,vwa,vwc,vwd,IQ,RIIa,abhydrolase,TGF-beta,DUF139,TPR,DSPc,tsp_1,Ribosomal_S21,rvp;TM=M;; 6.18
433233; AB040927; Hs.301804; KIM1494 protein £H3 f-C3HC4;TM=M;; fi.1 
458791; BE615453; Hs.346509; dedicator of cyto-kinesis 1; none;TM=Y;; 6.15
421836; AF109219; Hs.108787; phosphatidylinositol glycan, class N; none,none; 6.14
447727; AI421 079,, tumor necros fa , erfami; nor.e.syi ptobrevin. 6 13 
401536; ; ; NM_002530*:Homo sapiens neurotrophic tyr; ig,pkinase,LRR,LRRNT,LRRCT;TM=M;SS=M; 6.11

' l40566;Hs.1 6; ESTs, Weakly simil PLHU p II W,l i p i, PI-PLC C2.SH2.PH.SH3 PI-PI PAN;6.1C 
NM_00 H r%57 plasminogen activator, urokinase recepto; UPAR_LY6,ET PLA2 inh-SS=M- 6.08 
450447; AF212223; Hs.25010; hypothetical protein P1S-2; NTF2;TM=M;; 6.08 
450107; A1873287; Hs.257K12; ESTs; ICE_p20,DED;TM=M" 6 05 

418175; AW967054;H '06312 ES , f r,H2,BTB,KJetra,Syntaxin,r,one; 6.05 

408983; NMJ00492, Hs ( r cystic ft is -ansmembraneconductanc; ABC_tran,ABC_membrane 1 PRK,Bac_export_3;TM=Y;; 6.05 
439738; BE246502; Hs,9598; sema domain, immunoglobulin domain (lq ma.PSI.in qrin B;Tf , f 3 
427625; AF008216 H 2 013: putat 1 1 1 lan HLA class II associated p- none none' 603 ~ 
40943C R21945 H 46735; s pi if nine/se e-rich5 DSPc Rhodanese, none; 6.03 
I I I 1 a ! c Tf ' r 

405102; ; ; C15001220*:gi|4469558|gb|AAD21311.1|(AF; DAG_PE-bind,PH,RhoGEF,DC1;SS=U; 6.03

I E H I I I rq n I Tf 1 M b 03 

415327; H22769; ; gb:ym54c02.r1 Soares infant b n 1MB H PC I M-CC 
404148; ; ; NM.002944 , ;Homo sapiens v-ros avian UR2;fn3,pkin se.DUFI 3;TM=Y'SS f 
405531; ; ; Target Exon; PDZ,CARD,Guanylate_kin;TM=M;; 6.00

• A 4829 Is 275163; non c I pro 114238) ; ND "0 

427270; H47921; Hs.174139; chloride channel 3; voltage_CLC,CBS,none; 5.99 
423774; L39064; Hs.1702; interleukin 9 receptor; none;TM=M;SS=M; 5.98 
424124; AA335609; Hs.7589; ESTs, Weakly similar Id A46010 X-linked ; pkir.ase.TBC; 5 90 
411040; AF007393; Hs.177574; protein-kinase, interferon-inducible dou; HLH;TM=M;; 5.95
423422; AC005175; Hs.128425; NY-REN-24 ant grn Sulfotra fe 7;m 1,none;5.88 
453902; BE502341; Hs.3402; ESTs; none,none; 5.88

409636; AA305729; Hs.18272; amino acid transporter system A1; Aa_trans;TM=Y;; 5.86
436154; AA764950; Hs.119898; ESTs; efhand,DAG_PE-bind,DAGKa,PHD,DAGKc,PSI,none; 5.85
455358;AW902641;;gb:QV3-NN1024-10050O-181-d08NN1024 Horr,o;Sulfatase.3Drr,atomed„i B.Phosphodiest.none; 583 
414108; AI267592; Hs.75761; SFRS protein kinase 1; ank,PH,Oxysterol_BP,pkinase;TM=M;; 5.82
452547; AA335295; Hs.74120; adipose specific 2; LEA;TM=M;; 5.82

41 5204; T27434; ; gb:hbc2294 Human pancreatic islei Homo s; Na_Ca_Ex,Calx-beta none' 5 80 
435563; AF210317; Hs.95497; solute carrier family 2 (facilitated glu; sugar_tr;TM=Y;; 5.73
456097; C15702; Hs.288028; ESTs, Moderately similar to I54374 gene ; dsrm.FKBP; 5.73 

S^u 52 , 6 ^ 356 ' 03 tra " s P° rtln 9' cardiac ™ sc,e ■ C Z ' ^ TPase Ca ion ATPase_C,Ca[ion_ATPase.N,Hydrolase;TM=Y;; 5.77 
436246; AW450963; Hs.119991; ESTs; none,DNA_gyraseB,DNA_lnpni i HATF e_ ,75 
422953; AA488860; Hs. 245043; hypothetical protein FU14297 AB' FF If'-, S-M b 75 
425854; AA749190; ; ESTs; RhoGAP,SH2,pkinase,POLO_box,none; 5.74 
424160; T74062; ; gb:yc81f01.r1 Soares infant brain 1NIB H; ROK,none; 5.70 
417389; BE260964; Hs.82045; midklne(neuritegro*th p moting fact PTN.Mk TIH1 3S=Y; 5.69 
437613; R19892; Hs.10267; MIL1 protein; none,none; 5.68

410820; BE391493; Hs.16475; Human DNA sequence from clone RP5-852V4 ; TBC;SS=M; 5.68
423393; R37772; Hs.21420; p21-activated protein kinase 6; pkinase,PBD;TM=M;; 5.66
426500; NM_014638; Hs.170156; KIAAO450 gene piodu: C2.PI-PCC , TM=M , 5 63 
406930; U04691; ; gb:Human olfactory receptor (OR17-219) g; none;TM= f;SS=M 51 O 
401044; ; ; Target Exon; none,ICE_p20,ICE_p10,CARD,Peptidase_M1; 5.56 

428479; Y00272 H 4 t r , e ICE plOICE p'OTM M SS-M 5 

421970; AF227156; Hs.110103; RNA polymerase I transcription factor RR; aa_permeases,pyridoxal_deC,bromodomain,PHD,MBD,AT_hook,DDT,PI3_PI4_kinase,FAT,FATC,BolA,RUN;TM=M;; 5.53
426248; T18988; Hs.293668; ESTs; pkinase,none; 5.50

418426; NMJ03804; Hs.296327 rec 1 1 TNFRSF) a* lug erine-thi pkinase,death;TM=M" 5 43 

417 -, A ,194446 ErT We ,kly simitar to 550 4rebu! an! lead lUS.EGF.kringle.tryf in,Nebulin,LIM;SS=M i '-2 

447437; U07225;Hs.339;purinergic receptor P2Y, G-protein coup 71 i_1 SH2;TM=', SS-M 5 40 

"- — - - ~ k'""'. v., :.Yv,.Mi V.:.. =E'.-,F .". = '.' = 4- 

434938; AW500718; Hs.8115; Homo sapiens, clone MGC:16169, mRNA, com; pkinase,TBC,Rhodanese;TM=M;; 5.38
419355; AA428520; Hs.90061; progesterone binding protein; heme_1;TM=Y;SS=M; 5.35 
411188; BE161168; ; gb:PM0-HT0425-170100-002-a10 HT34'5 F i a r e none- 5 35 

422461; NMJ03417; Hs.1 17077; zinc finger protein 264; zf-C2H2hF TFII TM=M r > 
426348; BE466586; Hs.17433; hypothetical protein FLJ20957; none,none; 5.25

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; Rhodanese,DSPc,Y_phosphatase,Ribosomal_S3_N;TM=M;; 5.23
429592; AB029041; Hs.209646; KIAA1118 protein; Troponin,Exo_endo_phos,IQ;TM=M;; 5.22

^V^^^- 2 ^ V ' } ^^ e ^ ms]pMe ' rinm Ll ' nen_-ecept,RibosomaLC11,RibosomaLL11_N;TM=Y;SS=M;5.21 

434368; AW519020; Hs.73893; dopamine receptor D2; pkinase,SH3,none; 5.15

10 NM_000299:Homo sapiens plakophilin 1 (ec; Armadilb_seg;TM=M;; 5.13 

408176; AK001553; Hs.43436; adenylate kinase 3 alpha li<e ade tat i nc e 512 

407443; AF227138;;gb:Hcmo sapiens candidate taste re eon none;TM=T SS=M 5 

416817; AA398045; Hs.104679; ESTs; Furin-like,pkinase,Recep_L_domain,fn3,none; 5.10

401886;;;NM_021783:HomosapiensXEDAR(<EDAr, ml TIIFR 6;TM=M;SS=M 508 

410314; AW860708; Hs.18851; hypothetical protein FLJ10875; myb_DNA-binding,PAH,BAH,bromodomain,PHD,SET;TM=M;; 5.08
401579; AL031447; ; Homo sapiens, clone IMAGE:4053044, mRNA,; Neur_chan_LBD,Neur_chan_memb,none; 5.05

445873; AA250970; Hs.251946; poly(A)-binding protein, cytoplasmic 1-1; PABP,rrm,pkinase,14-3-3; 5.05

417529; AA203634; ; gb:zx58b09.r1 Soares_fetal_liver_spleen_; pkinase,UBA,KA1,none; 5.03

417527; AA203524; ; gb:zx56e10.r1 Soares_fetal_liver_spleen_; SH3;SS=M; 4.98

1 BE252211 H doxal ( min B6) kin B;Tf 

431321; AW136372; Hs.1852; acid phosphatase, prostate; acid_phosphat,none; 4.93

404298; ; ; C6001238*:gi|121715|sp|P26697|GTA3_CHICK; none,GST_C,GST_N,pkinase; 4.85

407603; AW955705; Hs.62604; Homo sapiens, clone IMAGE:4299322, mRNA,; none;TM=M;; 4.82 

424099; AF071 202; Hs.139336; ATP-binding cassette sit family CFTR B( ran.ABi niembrane;TM=Y;;4.73 

422366; T83882; Hs.97927; ESTs; pkinase,none; 4.64

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a)-related k; pkinase;TM=M;; 4.48

426925; NM_001196; Hs.315689; Homo sapiens cDNA: FLJ22373 fis, clone H; Esterase,enolase,Peptidase_S9;TM=M;; 4.45

439606; W79123; Hs.58561; G protein-coupled receptor 87; 7tm_1;TM=Y;SS=M; 4.40

400749; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 4.38

425721; AC002115; Hs.159309; uroplakin 1A; transmembrane4;TM=Y;SS=M; 4.33 

444006; BE395085; Hs.10086; type I transmembrane protein Fn14; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 4.31

1 1 N" I I r ^ i per ti'in-re r fn3,ld a Idl recepLb,granulin.BNR:TM=Y;SS=M;4.18 

408908; BE296227; Hs.250822; serine/threonine kna-e l c i , h r 3=M 1 9( 

422152; AA909249; Hs.112282; solute carrier family 30 (zinc transport; none,none; 3.88

45876ft AI498631; Hs.1 1 1 334; ferritin, light polyp-pti t y atin.fereif his r HO , uansp,SH3,RhoGAP,xanjr_permease,FCH;SS=M;3.85 
441218 BE3275G1 H 1 FT WC40 El E2_ATPase,Ca!ion.ATPase_C,Cation_ATPase N.Hydrolase; 3.78 
419073;AW372170;Hs.183918;HomosapienscONAFLJ12797 fis clone Nl deail ZU5;SS=M 3 76 

451385; AA017656; ; gb:ze39h01.r1 Soares retina N2b4HR Homo ; Atrophin-1,enolase,Atrophin-1,Y_phosphatase,SH2,fibrinogen_C,TIM; 3.60

412604; AW978324; Hs.1904; protein kinase C, iota; pkinase,DAG_PE-bind,pkinase_C,OPR;TM=M;; 3.30

409582; R27430; Hs.271565; ESTs; none,Neur_chan_LBD,Neur_chan_memb; 3.28

441155; AW161008; Hs.7719; GABA(A) receptor-associated protein; MAP1_LC3;SS=M; 3.23

436740; AW975133; ; gb:EST387239 MAGE resequences, MAGN Homo; none,EPH_lbd,fn3,pkinase,SAM; 3.20

418319; AW611703; Hs.190173; ESTs, Weakly similar to A46010 X-linked ; none,IRK; 3.20

409744; AW675258; Hs.56265; Homo sapiens mRNA; cDNA DKFZp586P2321 (f; none;NA;NA; 3.13

418764; N30531; Hs.42215; protein phosphatase 1, regulatory subuni; none,none; 3.10

400846; ; ; sortilin-related receptor, L(DLR class) ; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 3.09
422005; BE266556; Hs.110702; Homo sapiens mRNA; cDNA DKFZp76l E2i 2 (fr; none,Na_H_Exchanger; 3.03
426440; BE382756; Hs.169902. solutecarrier family 2 ifi. I - ug TM=Y;SS=M; 3.02 
424187;M336561;Hs.17287;ESTs,WeaklysirnilartoS26689hypn>p Rl non 

425852; AK001504; Hs.159651; death receptor 6, TNF superfamily member; death,TNFR_c6;TM=Y;SS=M; 2.93
401279; ; ; C13000351*:gi|2494033|sp|Q64398|KDGD_MES; none,none; 2.88

406671; AA129547; Hs.285754; met proto-oncogene (hepatocyte growth fa; Sema,pkinase,TIG,PSI,none; 2.83 

^i I n 1 channel, s; IRK;TM=Y;; 2.30 

453019; H87648; Hs.33922; Homosapiens, clone MGC:9084, niRt.'A com; ■ na ;e 1 M=M,; 2 75 
441699; AW511126; Hs.127572; ESTs; none,Aa_trans; 2.73

458781; AI444821; Hs.63085; ESTs, Weakly similar to MPP3_HUMAN MAGUK; SH3,PDZ,Guanylate_kin,L27;TM=M;; 2.73

446913; AA430650; Hs.16529; transmembrane 4 superfamily member (tet; transmembrane4;TM=Y;SS=M; 2.70

453487; R31770; Hs.56562; ESTs; 7tm_1,none; 2.68

421279; AW664878; Hs.106645; ESTs; pkinase,none; 2.68

419720; AA249131; Hs.337778; hypothetical protein FLJ11068; none,none; 2.65

452345; AA293279; Hs.29173; hypothetical protein FLJ20515; DSPc;TM=M;; 2.63

422247; U18244; Hs.113602; solute carrier family 1 (high affinity a; SDF;TM=Y;; 2.62

12521 902253, H 17161E i e.rane; 2.6C 

427344; NM_000869; Hs.2142; 5-hydroxytryptamine (serotonin) receptor; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 2.58

423629; AW021173; Hs.18612; Homo sapiens cDNA: FLJ21909 fis, clone H; voltage_CLC,CBS,none; 2.55

456737; BE247203; Hs.124831; CGI-67 protein; abhydrolaspJJM M ri ~M 2 

421585; U95626; Hs.302043, chemokme (C-C notiti t -r'o ike m_1;TM=Y;S ' 2.5 

424028; AF055084; Hs.153692; Homo sapiens cDNA FLJ14354 fis, clone Y7; none,none; 2.50

448324; AI571355; Hs.34174; ESTs, Moderately similar to ALU8_HUMAN A; ICE_p20,CARD,ICE_p10,none; 2.50

402256; ; ; Target Exon; pkinase,UBA,none; 2.43

452256; AK000933; Hs.28661; Homo sapiens cDNA FLJ10071 fis, clone HE; GDI,7tm_1,none; 2.40

419169; AW851980, Hs.26234G, ES I Weakly simila c 7 . f hvpotheti; none.sp nn,SH3,PH,CH; 2.40 

420634; S42457; Hs.1323; cyclic nucleotide gated channel alpha 1; cNMP_binding,ion_trans;TM=Y;; 2.35

419630; W57756; ; gb:zd20g10.r1 Soares_fetal_heart_NbHH19W; zf-C3HC4,none; 2.35

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mR; 7tm_1;TM=Y;SS=M; 2.35

400704; ; ; Target Exon; lig_chan,SBP_bac_3,ANF_receptor;TM=Y;SS=M; 2.33

V.1-V ,L-,: Ml i i ■: :.f I •■, .TV- , .: : --v.. 1 ■■ 

459327: AW14970G lls.7! 9;gb;xf41d02 1 NCI_CGAP_Bm50 Homo sapien; PHD.PWWP.SET.pkinase.ig; 2.30 
452220; BE158006; Hs.212296; ESTs; integrin_A,FG-GAP,none; 2.25 
416690; H84078; Hs.108551; ESTs; pkinase,none; 2.23
408354; AI362803; Hs.159235; ESTs; none,none; 2.23

452203; X57522; ; transporter 1, ATP-binding cassette, sub; ABC_tran,ABC_membrane,SRP54,Thymidylate_kin;TM=Y;SS=M; 2.21

•u r I93 i i 'iii.i U«1" • 1 i l 7E22S43|pir|rT32733AMPAg; none.none; 2.20 

412723; AA648459; Hs.335951; hypothetical protein AF301222; none;TM=M;; 2.20 

418738; AW388633; Hs.6682; solute carrier family 7, (cationic amino; none,none; 2.18

417185; NM_002484; Hs.81469; nucleotide binding protein 1 (E.coli Min; ParA,fer4_NifH,ArsA_ATPase;TM=M;; 2.18

433222; AW514472; Hs.238415; dickkopf (Xenopus laevis) homolog 4; none,PHO4; 2.18

413627; BE182082; Hs.246973; ESTs; Armadillo_seg,IBB;TM=M;; 2.18

407415; AF073328; ; gb:Homo sapiens tetracaine transporter-; none,none; 2.15

450592; AI701555; Hs.202562; ESTs; pkinase,none; 2.15

428767; AI421972; Hs.98802; ESTs, Moderately similar to T14342 NSD1 ; none,pkinase,ig; 2.15

429012; AW629596; Hs.194726; BCL2-associated athanogene 4; Sm,BAG;SS=M; 2.15 

419122; AI401360; Hs.44410; ESTs; ABC_tran,ABC_membrane,none; 2.10

446420; AW015693; Hs.135614; ESTs; ion_trans,none; 2.05

420076; AA827830; Hs.293717; ESTs; DUF59,pkinase; 2.05 

409416; AW388359; Hs.10667; ESTs; transmembrane4;TM=Y;S£--M 2.1 i 

428766; AA477989; Hs.98800; ESTs; TPR,7tm_1; 2.03

427001; NM_006482;Hs.173135; dual-specificity tyrosine-(Y)-pho^h pkin TM=f 2J 
453709; AL119133; ; protein kinase C substrate 80K-H; nonhistone; 2.03
423341; AW242394; Hs.108660; ESTs; none,none; 2.00 

456772; AW604297; Hs.129711; hepatitis A virus cellular receptor 1; ig;TM=Y;SS=M; 1.88
427786; BE407863; Hs.256871; ESTs; none,FG-GAP,7tm_1; 1.65 
123508; AW6 Hs.129711 i ellul r 1 If M; 1.00 

447993; AW139525; Hs.170362; ESTs; none,none; 1.00

TABLE 58B 



AF274943 BG494894 AI719075 M908783 AI935150 AI422391 AA910644 AA583187 BM272167 A1828996 AA527373 AW972459 AI831360 AA77241 8 
AI033892 AA100926 AU154749 A1459432 A142351 3 AI094597 AA740817 AI9919B8 AI090262 AI312104 BI256707 AA459522 AA41S871 AI075239 
AI339996 AA701623 AI139549 AI336880 AA63364S AI98938Q AI36283S AA399239A1146955 BF514270 N92892 AI348243 AI278887 AA459292 
AI494230 BF507531 A1492600 AA962596 AW613002 AA293140 AA235549 BF108854 AA954344 N49682 AI457100 AW589407 AW300758 BE220715 
BE220698 BE569091 BM009647 BF900351 AI537692 A1203723 AI857576 M584410 AW371667 BM172363 

AK054860 AV652198 AV652192 AV652138 AV6521 27 AV6521 94 BE935919 AV652017 AV651995 AV651548 AV646063 AV651985 AV646184 
AV646179 AW880409 AA345002 BF155189 BE068931 X56197 AL603014 AW953629 BM263546 BE550772 M701084 AI681352 AA358689 
AW938841 BF438147 W05391 H75313 BF326185 AV646335 AVS51589 AV646340 AV651992 AV646384 AV646364 AV687497 BF155183 AV646370 
AW797876 AI906821 X561 96 BE833835 AA628440 BE833808 BF224205 AA70912B BE673807 A1923886 M947932 AI2761 25 AM 85720 AW51 0698 
AA987230 BE467708 AW898628 AW898544 AM 46984 AW043642 AI288245 AI186932 AI635262 A11 39455 AI298739 AI81 3854 AI024768 BE699445 
BE699444 AI707807 D52654 AI214518 AI004723 AI698085 AW08742D AI565133 AA845571 AW898622 BF1 10144 AW51 3280 A1061 126 BF362770 
AI268939 AI43S818 BF475318 A1024767 BE174213 AA757S98 AA513019 AA902959 AI860794 AI334784 BF108411 BM310532 AW513771 AI951391 
A1337671 BF095606 BF095601 BF095468 AW890091 BF095753 AW243400 AW898607 AW898616 BF362762 AI922204 AW898625 BE699468 
BE174196 AW102923 D52715 BEG9945G D52477 D55017 BF955933 BG623563 AV646254 AA463522BI003244AI299190 W40186 BE174210 
BF939091 BF434180 AW579001 T55662 H0131 1 T52522BF945037 BF955938 D54679 D53933 R67100 BG925552 BF999056 R83430 Z29922 T85791 
W03942 H63289 A1091537 BF086583 AA345570 H48870 H80720 T83523 BI039626 BI037700 R00353 BF155184 N98343 N79072 H01812 T55581 
X75684AL573167AI445461 AI453743 A1983655 A;S64G44 AA977180 A1694111 AI591358 AW071625 AI678712 AI720939 A1927769 BE439796 
A1963432 AA292956 AW192593 AI865838 AI696905 AI424384 AI161312 A1911921 AI597801 BI494959 AI24098B A1492554 AW262737 BE044033 
AW008570AW629505B!494958M088439AA706057BF222820 BF593608 BE501957M524526 BE044134 AW572531 AW015724 BE349186 
AA043217 BE219704AI^9981 III F 1 c 2SaA903050 M147228AA404570AI075878 W38161 AI972739 

AW673152 AA723200 C061 23 BF057147 AA627686 AA1 57944 AI990245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 AI990827 
AW275048 AA182640 AA478328 AI298935 AW085158 AW471421 AW103470 AW300456 AW191997 AIB234G6 AA962397 AA13GG58 AI251817 
AW3391 04 AA724739 AA41 1 1 00 AA1 91 349 AA757735 AA037696 AI76951 6 AW772283 M010631 AI692846 AI061 065 H80983 R79933 AI950693 
AI245632 AI349390 AA1 48284 A1798502 AA487893 AI621 320 AW1 94272 C06365 AA953883 BE858936 AI91 B523 AI872628 AI92721 7 AI453453 
i AI032569 AI972899 A!985583 Z28771 AI363829 AIG93030 AAG03586 BE773488 AW339301 
BE773489 BE773462 BE773495 AI650338 BE773499 AI745717 BE81 1475 BE81 1470 BE811464 BE811418 BE811415 BE81 1400 BE811398 
BE811388 BE811352 BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773461 BE811350 BE811337 BF593847 BG055071 
AW675302BF003068AA719173BE811348A1582462 AI686240 BE773500 AI244845AI565439 AI918453 AI472527 AI446740 AA03557G AA1 9 1 4 14 
AWG74145 C05782 AI5892G4 D5755B A1468237 AI432033 AA989662 R21752 BF002457 M9B8297 AL574D95 AL576200 AL571074 AL574525 
AL578810 BG498381 AI928364 BE879732 M479834 M479712 C17732 BM091258 BF843901 AW820230 C17476 BE3271 20 AA1 29574 M136645 
BF843900 AW806193 AA502832 AA649494 AL568520 AL5479G0 BE706937 BE81 1360 BE773498 BE81 1401 BE773484 BE811437 BE811380 
BE81 1399 BF997171 BF757734 BE926037 AI377596 C06111 AW08896B BE81 1404 BE81 1472 AI865912 AI9256D7 AI871950 AI093510 BE905927 
BE81 1435 AA191387 AW772000 BE81 1453 BE614379 BF844522 BI044896 A1744233 AW984527 C17504 BF843883 AI248307 BE773483 AI567995 
W60075 BF941183 AI738844 BE811458 BE773481 AI262930AA948565BE706942BE156360TC502GAW24295BAW197954BE905184AA72220G 
AI344943AI348877AI334860 BE621857BE156230AA454099AA037722 BF843897AW806183AA043216BG482896M182734AA877242 
AW372926 H27252 R381 14 BF851858 BE1 56214 AA190427 T91762 M035067 AA837326 T10930 BF905587 BI755027 BG506731 BC008442 
BC01 01 66 AL5501 34 AL553096 AL548700 AL550751 AL547978 AL545286 AL540643 AIM 1 8627 AL601379 BI259321 BG741786 BI868522 AU 1 35866 
BI552770 BI259210 BI256520 BI255569 BG485098 BI25822BBG498501 BM044512AU133984 AL556586 BE745111 BI222633 AU133917 BG288151 
BI260715 BI550550 BG500773 BI551761 BG707601 BI818593 BF691383 BG721 129 BG541578 BE906666 BG751098 BI224135 BG400746 BG478065 
BE790436 AW080238 AU137549 BG429896 BE392486 AW961686 BG721056 BE908365 BE546656 BG541235 AW533735 BG528290 BI260395 
AW651691 BM048974 BM043805 BG142185 AA315188 AI446615 C0630D BG497644 AA088544 A1815987 BG528631 BE619182 AW239185 
AW062910 AW062902 AA347236 F1 1 933 M488005 AA301 831 AA376800 D56120 AA343532 M308636 F00242 AA376086 AA31 6968 AA343799 
BI870221 BE910282 BG538748 AW960564 AV732879 D16854 AA192519 BF922148 AA216013 BG624091 BE544387 BG507008 AW176446 
BF790033BE088925BE088854AA921353 R21800AA011222T97525 

X75684 AL573167 A1445461 AI453743 A1983655 A1564644 AA977180 AI694111 A1591353 AW071 625 AI67871 2 AI720939 A5927769 BE439796 
AI963432 AA292956 AW 192593 AI865838 AI696905 AI424384 AH 61 31 2 AI91 1 921 AI597801 BI494959 AI240988 AI492554 AW262737 BE044033 
AW008570 AW629505 BI494958 M088439 AA706057 BF222620 BF593608 BE501957 AA524526 BE0441 34 AW572531 AW015724 BE349186 
AA043217 BE219784 AI799814 AA129575 AI671 727 AI470033 BE646195 AW779725 M903050 M147228 AA404570 AI075878 W381 61 AI972739 
400843 9188605



405036 7543748 Minus 121957-122129

400845 9188606 Plus 34428-3401 2 

403212 7630897 Minus 156037-156210 

401927 3873185 Minus 112000-112137 



Table 59A lists about 1170 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected
from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" prostate cancer to
"average" normal adult tissues was greater than or equal to 8.2. The "average" prostate cancer level was set to the
75th percentile amongst prostate cancers. The "average" normal adult tissue level was set to the 50th percentile
amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the
10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator
before the ratio was evaluated.
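
The selection rule described for Table 59A can be illustrated with a short sketch. It assumes that the expression levels of one probeset are available as plain lists of numbers, one list for prostate cancer samples and one for non-malignant tissues. The function names `percentile` and `r1_ratio`, the use of linear interpolation for percentiles, and the treatment of a non-positive denominator are assumptions made for illustration only; the text does not specify them.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Assumption: linear interpolation between ranks; the text does not
    state which percentile definition was used.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def r1_ratio(tumor_levels, normal_levels):
    """Background-corrected tumor-to-normal ratio for one probeset.

    'Average' tumor level  = 75th percentile amongst the tumor samples.
    'Average' normal level = 50th percentile amongst the normal tissues.
    Background             = 10th percentile amongst the normal tissues,
                             subtracted from numerator and denominator.
    """
    background = percentile(normal_levels, 10)
    numerator = percentile(tumor_levels, 75) - background
    denominator = percentile(normal_levels, 50) - background
    if denominator <= 0:
        # Assumption: the ratio is treated as undefined in this case.
        return None
    return numerator / denominator


# Hypothetical expression levels for a single probeset.
tumor = [100, 200, 300, 400, 500]
normal = [10, 20, 30, 40, 50]
ratio = r1_ratio(tumor, normal)
selected = ratio is not None and ratio >= 8.2  # threshold stated for Table 59A
```

With these made-up values the background is 14, so the ratio is (400 - 14) / (30 - 14), which is above the 8.2 threshold stated for Table 59A.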



TABLE 59A: ABOUT 1170 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES



Unique Eos pre 

ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title 

R1: Ratio of prostate tumor to normal adult body tissue 



Pkey ExAccn 



Unigene ID Unigene Title 

Hs.2256 
Hs.171995 
Hs.273321 



Purkinje cell protein 4
acid phosphatase, prostate
pleiotrophin (heparin binding growth fac
437145 AF007216 Hs.5462 solute carrier family 4, sodium bicarbon

446057 AI420227 Hs.149358 EST 5010 X linked

431579 AW971082 Hs.222886 ESTs, Weakly similar to TRHY_HUMAN TRICH

400299 X07730 Hs.171995 kallikrein 3, (prostate specific antigen

427674 NM_003528 Hs.2178 H2B histone family, member Q

432374 W68815 Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PL

422487 AJ010901 Hs.198267 mucin 4, tracheobronchial

415314 N88802 Hs.5422 glycoprotein M6B

431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta

409262 AK000631 Hs.52256 hypothetical protein FLJ20624

423582 BE000831 Hs.23837 Homo sapiens cDNA FLJ11812 fis, clone HE

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of

Hs.190394 ESTs, Weakly similar to B23096 line-1 pr

Hs.79386 leiomodin 1 (smooth muscle) 



433043 W57554 

400302 N48056 

453096 AW294631 

419239 AA468183 

427665 AF134803 

415539 AI733881 

428411 AW291464 

428819 AL135623 
401424 

452114 N22687 

426990 AL044315 

448045 AJ297436 

421552 AF026692 

432101 AI918950 

427398 AW390020 

426716 NM_006379 

434025 AF114264

414591 AI888490 

453355 AW295374 

408380 AF123050 

429500 X78565 

453370 AI470523

408369 R38438 

408001 AA046458 

440274 R24595 

417332 AW972717 



459247 N46243 

418829 AA516531 

459290 NM_001546

418827 BE327311 

410330 AW023630 

450377 AB033091 

424012 AW358377 



Hs.115838 ESTs



416585 X54162 
447726 AL137638
426174 AA547959
425071 NM_013989 Hs.154424
417067 AJ001417 Hs.81086 
Hs.279898 
Hs.125019 
Hs.1915 
Hs.11325 
Hs.184598 
Hs.180141 
Hs.72472 
Hs.10338 
Hs.193914 



deiodinase, iodothyronine, type II
solute carrier family 22 (extraneuronal
Homo sapiens cDNA: FLJ23165 fis, clone L
lymphoid nuclear protein (LAF-4) mRNA
folate hydrolase (prostate-specific memb



Homo sapiens mRNA for KIAA1750 protein,



Hs.8236 
Hs.173094 
Hs.20166 
Hs.105700 
Hs.123642 
Hs.20415 

Hs.171921 sema domain, immunoglobulin domain (Ig), 
Hs.216381 Homo sapiens clone HH409 unknown mRNA
Hs.55902 ESTs, Weakly similar to ALU8_HUMAN ALU S
Hs.31412 Homo sapiens cDNA FLJ11422 fis, clone HE



:in Ccytoiactin) 
ATP-binding cassette, sub-family C (CFTR
solute carrier family 15 (H+/peptide transport



Hs.182575

Hs.95296

Hs.7122

Hs.288462

Hs.166196

Hs.102720

Hs.110373

Hs.55999

Hs.34853

Hs.47166

Hs.46786

Hs.74313

Hs.137569

Hs.112242

Hs.198760

Hs.505

Hs.1395

Hs.54416

Hs.109370
Hs.283683 
Hs.8769 



407709 M456135 

433444 AW975324 

415989 AI267700 

430539 AK001489 

408221 AA912183 

448519 AW175665 

437179 M393508 

426457 AW894667 

444484 AK002126 

400292 M250737 

433647 AA603367 

429220 AW207205 

427871 AW992405 

448106 AI800470 

415992 C05837 

452792 AB037765 

407910 AA650274 

428862 NM_000345

418961 AW967646 



ESTs, Highly similar to T42626 secreted 
NK homeobox (Drosophila), family 3, A
inhibitor of DNA binding 4, dominant neg 
HT021 



neurofilament, heavy polypeptide (200kD) 
ISL1 transcription factor, LIM/homeodoma
early growth response 2 (Krox-20 (Drosop 
sine oculis homeobox (Drosophila) homolo 
hypothetical protein dJ462023.2 
ESTs 

chromosome 8 open reading frame 4 
hypothetical protein DKFZp761J17121 
NM_005656*:Homo sapiens transmembrane



Hs.317584 ESTs 



Hs.11260 
Hs.72472 
Hs.222294 
Hs.136319 
Hs.59622 
Hs.171941 
Hs.145807
Hs.30652 



Homo sapiens prostein mRNA, complete c 
serologically defined colon cancer antig
chimerin (chimaerin) 1
hypothetical protein FLJ11264



Homo sapiens, clone IMAGE:35C7281, mRNA,



SRY (sex determining region Y)-box 9 (ca 






WO 03/042661 



PCT/US02/36810 



451027 AW519204 

413517 N76712 

409731 AA125985

418396 AI765805 

412088 AI689496 

429673 AA884407 

411644 H92064 

409619 AK001015 

433466 AA508353 

431657 AI345227 

440995 T57773 

432278 AL137506 

428695 AI355647 

424720 M89907 

440260 AI972867 

443622 AI911527 

426418 M90464 

425383 D83407 

450642 R39773 

417421 AL138201

435981 H74319 

450693 AW450461 

444784 D12485 

423242 AL039402 

408621 AI970672 

408000 L11690 
404210 

417622 AW298163 

419526 AI821895 

442799 AI564739 

424846 AU077324 

420195 N4434B 

429922 Z97630 

422511 AU076442 

437330 AL353944 

431474 AL133990

430887 N66801 

416655 AW968613 

408912 AB011084 

445133 AW157646

421513 X00949 

413746 AA133243

446795 AI797713 

440774 AI420611 

407168 R45175 

417511 AL049176 

431130 NM_006103 

451277 AK001123 

411800 N39342 

446469 BE094848 

427138 N77624 

409758 AW474960 

424897 D63216 

430280 AA361258 
401197 

452814 AI092790 

429918 AW873986 

423073 BE252922 

453469 AB014533 

444922 AI921750 

436396 AI683487 

419440 AB020689 

429170 NM_001394

441690 R31733 

423690 M329648 

423044 M320829 

410929 H47233 

427019 M001732 

431556 AF016028 

409632 W74001 

444190 AI878913 

407118 AA156790 

416370 N90470 

431217 NM_013427

430187 AI799909 

420120 AL049610 

434423 NM_006769

417381 AF164142 

409062 AL157488

409038 T97490 

436032 M150797 

431548 AI834273 

418565 AK001529 



ESTs, Weakly similar to I38022 hypotheti



Hs.211595 
Hs.278626 
Hs.55220 
Hs.105314 
Hs.105448 
Hs.10263 
Hs.274256 



ESTs, Weakly similar to B34087 hypotheti



89999 purinergic receptor (family A group 5) 



Hs.152292 
Hs.7130 
Hs.11805 
Hs.169825 
Hs.156007
Hs.7130 
Hs.82120 
Hs.188620



collagen, type IV, alpha 5 (Alport syndr 



ESTs 



receptor subfamily 4, group A, m 



Hs.203965 ESTs 



lonucleotidepyophospha:ase/phosphc.j: 
DEME-6 protein 

chromosome 11 open reading frame 8
bullous pemphigoid antigen 1 (230/240kD)
NM_005936:Homo sapiens myelc' '
WAS protein family, member 3
ESTs
ESTs

neuropeptide Y

Homo sapiens cDNA FLJ11177 fis, clone PL

H1 histone family, member 0

collagen, type XVII, alpha 1

Homo sapiens mRNA; cDNA DKFZp761J1112 (f

KIAA1841 protein

BCL2/adenovirus E1B 19kD-interacting pro
KIAA0512 gene product; ALEX2 



Hs.105314 

Hs.171553 

Hs.156471 

Hs.127832 

Hs.117183

Hs.82223 

Hs.2719 

Hs.26176 

Hs.103042 



ESTs, Moderately similar to CLC3_HUMAN C 20.8

ESTs 20.7

ESTs 20.7

ESTs 20.6

chordin-like 20.6

epididymis-specific, whey-acidic protein 20.6

hypothetical protein FLJ10261 20.6

microtubule-associated protein 1B 20.5

homogentisate 1,2-dioxygenase (homogenti 20.5

phosphatidic acid phosphatase type 2B 20.5
20.5

frizzled-related protein 20.5

[ rr>3i'- n 7 receptor 20.4

ENSP00000229263*;HSPC213. 20.3

hypothetical protein FLJ14529 20.3

ESTs 20.2

MAD (mothers against decapentaplegic, Dr 20.2

KIAA0633 protein 20.2

Homo sapiens cDNA FLJ13752 fis, clone PL 20.0

wingless-type MMTV integration site fam 20.0

KIAA0882 protein ?nn 
dual specificity phosphatase 4 



ESTs, Weakly similar to PN0099 son3 prof 




protein FLJ10970
sarcospan (Kras oncogene-associated gene 
serine (or cysteine) proteinase inhibito 
cysteine and glycine-rich protein 2 
E r >, akl mila ' Z2 HUMAN ZIN( 

C! Fs, Weakly simi o 133022 hy- bed 

RhoGTPase activating protein 6 







407690 R47799 

421913 AI934365 

418819 AA228776 

450831 R37974 

415691 AW963979 

417124 BE122762 

445162 AB011131 

426647 AA243464 



420344 BE463721 

436420 AA443966 

427315 AA179949 

418432 M14156 

410765 AI694972 

444931 AV652066 

437446 AA788946 

439569 AW602166 

417061 AI675944 

424682 AW604804 

432435 BE218886 

443547 AW271273 

431121 AW971157 

410467 AF102546 

407110 AA018042 

440703 AL137663

429925 NM_000786

427958 AA418000 



419355 AA428520 

417412 X16896 

423943 AF163570

434217 AW014795 

439444 AI277652 

447809 AW207605 

430177 AW969233 

432473 AI202703 

421823 N40850 

418522 AA605038 

420092 AA814043 

414602 AW630088 

417426 NM_002291

400297 AI127076 



Hs.266957 hypothetical protein FLJ14281 



Hs.191721 
Hs.25255 
Hs.24723 
Hs.25338 
Hs.12376 
Hs.294101 
Hs.169387
Hs.203270 

Hs.31595
Hs.175563 
Hs.85112 
Hs.66180 
Hs.75113 
Hs.16869 
Hs.222399 
Hs.188691 
Hs.151717 
Hs.282070 
Hs.23767 

Hs.63931 
Hs.252085 
Hs.7378 
Hs.226213 
Hs.98280 
Hs.30212 
Hs.241552 KIAA0268 pr( 
Hs.90061 
Hs.82112 
Hs.135756 
Hs.23349 
Hs.54578 
Hs.164230 
Hs.302746 
Hs.152414 



piccolo (presynaptic cytomatrix protein)
pre-B-cell leukemia transcription factor
KIAA0036 gene product
ESTs, Weakly similar to ALU1_HUMAN ALU S
putative G protein-coupled receptor 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564N0753 (f 
insulin-like growth factor 1 (somatomedi 
nucleosome assembly protein 1-like 2 
general transcription factor IIIA 
ESTs, Moderately similar to CA1C RAT COL 



ESTs 

hypothetical protein FLJ12666

uences, MAGL Homo 
(Drosophila) homolog 
"" igelman syndrome-5 

mRNA; cDNA DKFZp434G227 (fr 
■30, 51 (lanosterol 14-alpha 



ESTs 
ESTs 

Homo sapiens cDNA: FLJ21950 fis, clone H
ESTs 

Homo sapiens mRNA; cDNA DKFZp564B1264 (f 
hypothetic I prate r D 



432966 AA650114 

426657 NM_015865

419536 AA603305 

443745 AB039570 

438899 AF085833 

418555 AI417215 

442064 AI422867 

409385 AA071267 

453070 AK001465 

443801 AW206942 

437536 X91221 

409196 NMJM874 

438337 AK002058 

416239 AL038450 

448290 AK002107 

407192 AA609200 

409060 AI815867 

442571 C06338 

419713 AW968058 

426125 X87241 

450164 AI239923 

429663 M68874 

408990 AL022395 

417094 NM_006895 

436024 AI800041 

412652 AI801777 

456516 BE172704 

452576 AB023177 

430299 W28673 



ESTs 



Hs.325198 ESTs 



:er family 14 (urea transport 
gb:np12d11.s1 NCI_CGAP_Pr3 Homo sapien
ALEX1 protein 
ESTs 

hypothetical protein FLJ12577
ESTs

gb:zm61g01.r1 Stratagene fibroblast (937



Hs.144465 ESTs
Hs.334873 
Hs.6166 



nudix (nucleoside diphosphate linked m< 



Hs.50130 
Hs.165464
Hs.92381 



Hs.211587 phospholipase A2, group IVA (cytosolic,

Hs.49526 F-box and leucine-rich repeat protein 4

Hs.81182 histamine N-methyltransferase 

Hs.190555 ESTs 

Hs.6774 ESTs 

Hs.222746 KIAA1610 protein 

Hs.29900 KIAA0960 protein 

Hs.106747 serine carboxypeptidase 1 precursor prot

Hs.132458 ESTs 

Hs.83337 latent transforming growth factor beta b 



458082 AW978811

409264 NM_014937

432729 AK000292 

422270 AF114494

410339 AI916499 

431992 NM_002742

424432 AB037821 

431933 AI187057 

440749 W22335 

416434 AW163045

450382 AA397658 

407021 U52077 

413786 AW613780 

437866 AA156781



Hs.117183 ESTs 
Hs.164599 ESTs 
Hs.102754 ESTs 
Hs.75454 
Hs.101307 
Hs.293616 



413597 AW302885 

439677 R82331 

432527 AW975028 

413654 AA331881 

418818 AA228899 

426501 AW043782 

435056 AW023337 

414504 AW069181 

443960 AI093577 

434988 AI418055 

435380 AA679001 

410268 AA3161B1 

437252 AI433833 

446377 AW014022 

425009 X58288 

415672 N53097 

450325 AI935962 

400294 N95796 

443674 AI081330 

415263 AA948033 

442081 AA401863 

415788 AW628686 

445467 AI239832 

436299 AK000767 

418693 AI750878 

422163 AF027208 

443180 R15875 

432437 W07088 

415786 AW419196 

446091 AW022192 

409341 A1963376 

430376 AW292053 

434096 AW662958 

453698 AA037615 

444805 AB007899 

427707 NM_005578

453308 AW959731 

442607 AA507576 

433927 AI557019 

443162 T49951 

412490 AW803564 

424580 AA445539 

443912 R37257 

410297 AA148710

431448 AL137517 

432306 Y18207 

451652 AA018968 

406038 Y14443 

408784 AW971350 

453510 AI699482 

407894 AJ278313 

433908 AW298141 

440529 AW207640 

413249 AF167160 

408430 S79876 

430665 BE350122 

447752 M73700 

426044 AA502490 



427982 NM_016156 Hs.181326

407634 AW016569 Hs.136414 

448362 AA641767 Hs.21015 

433226 AW503733 Hs.9414 

427472 AA522539 



hypothetical protein MGC2747 
RAB18, member RAS oncogene family 
NM_004930*:Homo sapiens capping protein 

' ns clone DT1 PI B6 mRNA, CAG rep 



glycoprotein M6B 

sterile-alpha motif and leucine zipi 

hypothetical protein FLJ21986 



Hs.193579 
Hs.26289 
Hs.278695 



Hs.112360 
Hs.258576 
Hs.293685 
Hs.257924 
Hs.200197 
Hs.12532 
Hs.12532 
Hs.75825 
Hs.42746 
Hs.12017 
Hs.180398 
Hs.323099 



Hs.28 
Hs.116467 



Hs.30120 
Hs.314451 
Hs.52463 



iin tyrosine phosphatase, receptor t 



3in mRNA, complete cds 



open reading frame 21 



il iomoipl icadi nomagc-ne lil : 1 



Hs.303090 
Hs.133536

Hs.88219 zincfng( 
Hs.63386 ESTs 
Hs.42151 ESTs 



Hs.157975 
Hs.16478 
Hs.75251 



LIMi 
ESTs 

Homo sapiens cDNA: FLJ22696 fis, clone H

small nuclear protein PRAC

DKFZP434G032 protein

Homo sapiens cDNA: FLJ22528 fis, clone H

ESTs, Weakly similar to A46010 X-linked

ESTs 

hypothetical protein DKFZp56401278 



Homo sapiens cDNA: FLJ21718 fis, clone C 
DEAD/H (Asp-Glu-Ala-Asp/His) box binding
dipeptidylpeptidase IV (CD26, adenosine 
ESTs, Weakly similar to I78885 serine/th 



NM_001615*:Homo sapiens actin, gamma 2, 
KIAA1073 protein 

UDP-GlcNAc:betaGal beta-1,3-N-acetylgluc 
hypothetical protein DKFZp564L0864 simil 

transposon-derived Buster3 transposase-l 
ESTs, Weakly similar to NUCL_HUMAN NUCLE
ESTs, Weakly similar to ALU1_HUMAN ALU S



Hs.298258 ESTs 



Hs.146858 protocadherin 10



hypothetical protein MGC3199 
nuclear factor, interleukin 3 regulated 
Homo sapiens cDNA FLJ13598 fis, clone PL
gb:Human mariner1 transposase gene, comp
ESTs 

metallothionein 1E (functional) 



454078 AA601518

411990 AW963624 

443280 M299688 

416836 D54745 

436860 H12751 

425174 D87450 

442242 AV647908 

443790 NM_003500

418994 AA296520 

419563 AA526235 

437718 AI927288 

412828 AL133396

432712 AB016247 

433332 AI367347

451533 NM_004657 

424701 NM_005923 

446494 AA463276 

456508 AA502764 

432251 AW972983 

426423 NM_012446 

442281 N34742 

442145 AI022650 

418848 AI820961 

447072 D61594 

439518 W76326 

443967 AW294013 

424775 AB014540 

411190 M306342 

447384 AI377221 

444880 AW118683

433409 AI278802 

423201 NM_000163 

413724 AA131466

454219 X75042 

429165 AW009886 

430598 AK001764 

444172 BE147740

416774 AI005169 

410310 J02931 

428738 NM_000380

400296 AA305627 

407939 W05608 

452281 T93500 

408622 AA056060 

428054 AI948688 

444636 T96667 

416292 AA179233

442315 AA173992 



449474 AA019344 

42227B AF072873 

445817 NM_003642

434629 AA789081 

416795 AI497778 

410001 AB041036 

452242 R50956 

413231 D87461 
404641 

428648 AF052728 

414279 AW021691 

411573 AB029000 

417632 R20855 

431467 N71831 

450313 AI038989 

445472 AB006631 

419264 AA877104 

418663 AK001100 

452277 AL049013 

447896 AI436124 

447082 T85314 

451468 AW503398 

415443 T07353 

427212 AW293849 

442113 BE622187 

433517 AW022133 

430829 AW451999 

453111 AB014598 

435839 AF249744 

429113 D28235 



Hs.31707 
Hs.24183 
Hs.80247 



cholecystokinin
PR01914 protein

Homo sapiens cDNA: FLJ23285 fis, clone H
acyl-Coenzyme A oxidase 2, branched chai
selectin E (endothelial adhesion molecul
Homo sapiens cDNA FLJ11983 fis, clone HE
ESTs

prion protein (p27-30) (Creutzfeldt-Jakob
sterol-C5-desaturase (fungal ERG3, delta
Homo sapiens clone TCCCTA00151 mRNA seq
serum deprivation response (phosphatidyl
mitogen-activated protein kinase kinase
WW Domain-Containing Gene
ESTs, Weakly similar to AF208855 1 BM-01
polycythemia rubra vera 1; cell surface
igle-stranded-DNA-binding protein

.' cDNA FLJ13492 fis, clone PL
erbb2-interacting protein ERBIN



Hs.288906 
Hs.123469
Hs.232165 



Hs.170065 Homo sapiens 



gb:zd60d04.r1 Soares_fetal_heart_NbHH19W




Hs.28274
Hs.62192
Hs.192803



Hs.28792

Hs.266619 

Hs.17877 

Hs.42390 

Hs.7956 

Hs.28726 

Hs.194687 

Hs.226434 

Hs.93304 

Hs.134460

Hs.2055

Hs.114218

Hs.13340

Hs.4029

Hs.20509

Hs.57771

Hs.159993

Hs.75244 



r associated protein 1 
hypothetical protein FLJ10902 
ESTs, Moderately similar to I38022 hypot 
Homo sapiens cDNA: FLJ22049 fis, clone H
coagulation factor III (thromboplastin, 
xeroderma pigmentosum, complementation g 
ATP-hindin s If nl m Z (C \r 
ESTs, Weakly similar to A49019 dynein he
Homo sapiens cDNA FLJ11041 fis, clone PL
Homo sapiens cDNA FLJ12166 fis, clone W\ 
ESTs 
ESTs 



Hs.1 8 



histone acetyltransferase 1 
glioma-amplified sequence-41 
HBVpX associated protein-8 

glycosyltransferase
BCL2-like 2 

NM_0 ' -HnnDsapipnsphcsphoglucomuta 



GCN5 (general control of amino-acid synt





AW937826 


Hs.103262 


ESTs, Weakly similar to ZN91_HUMAN ZINC 


13.3 


427286 


AW732802 


Hs.2132 


epidermal growth factor receptor pathway 
leptin receptor 


13.3 


437323 


AA371145 


Hs.226627 


13.2 


421878 


AA299652 


Hs.111496 


Homo sapiens cDNA FLJ11643 fis, clone HE


13.2 


433037 


NM_014158 


Hs.279938 


HSPC067 protein 


13.2 


407938 


AA905097 


Hs.85050 


phospholamban 


13.2 


400860 






Target Exon 
KIAA1323 protein 


13.2 


411031 


W37943 


Hs.34892 


13.2 


436797 


AA731491 


Hs.336454 


hypothetical protein MGCK87S 


13.2 


409277 


T05558 


Hs.156880 


ESTs 


13.2 


434036 


AI659131 


Hs.197733 


hypothetical protein MGC2849 


13.2 


448520 


AB002367 


Hs.21355 


doublecortin and CaM kinase-like 1 


13.2 


450755 


AA010984 


Hs.159464 


ESTs 




450649 
408495 


NM_001429 
W68796 


Hs.25272 
Hs.237731 


E1 A binding protein p300 
ESTs 




429732 
428839 


U20158 


Hs.2488 
Hs.823Q2 


lymphocyte cytosolic protein 2 | SH2 doma 
Homo sapiens cDNA FLJV814 fis, clone NT




416987 


D86957 


Hs.80712 


KIAA0202 protein 


110 


453006 


AI362575 


Hs.167133 


ESTs 


13.0 




AK000768 


Hs.107872 


hypothetical protein FLJ20761 
putative transmembrane protein 


13.0


415752 


BE314524 


Hs.78776 


13.0 

13.0 


411529 


AA430348 


Hs.317596 


Homo sapiens cDNA FLJ12927 fis, clone NT


417318 


AW953937 


Hs.12891 


ESTs 


13.0 
12.9 


410763 


AF279145 


Hs.8966 


hypothetical protein FLJ21776 


426034 


AI276989 


Hs.56123


Homo sapiens cDNA FLJ13443 fis, clone PL


12.9 


447966 


M340605 


Hs.105887 


ESTs, Weakly similar to Homolog of rat Z 


12.9 


428218 


AA424266 


Hs.123642 


EphA3 


12.9 


409799 


D11928 


Hs.76845 


12.9 


448779 


BE042877 


Hs.177135 


ESTs 


12.8 




AB014605 


Hs.22599 


atrophin-1 interacting protein 1; activi 


12.8




AI080042 


Hs.336901 




12.6 


429697 


AW296451 


Hs.24605 


ESTs 


12.8


432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to I38022 hypotheti 


12.8


445309 
451099 
424806 


AL157474


Hs.12504 


likely ortholog of mouse Arkadia


12.8


R52795 
AA382523 


Hs.25954 
Hs.105689 


interleukin 13 receptor, alpha 2 
MSTP031 protein 


12.6 


442343 


AA992480 


Hs.129874 


ESTs 


12.7 


432244 
459513 


AI669973
AI032946


Hs.200574 


ESTs 

gb:ox06g09.s1 Soares fetal liver spleen


12.7 
12.7 


452512 


AW363486 


Hs.337635 


ESTs 




415079 


R43179 


Hs.22895 


hypothetical protein FLJ23548 


12.7




AF035537 


Hs.115521


REV3 (yeast homolog)-like, catalytic sub 


12.7 


411067 


AI68100S 




ESTs 


12.7 


442501 


AA315267 


Hs.23128 


ESTs 


12.7 


434466 


AB037829 


Hs.3862 


regulator of nonsense transcripts 2; DKF 


12.7 


437052 


AA861697 


Hs.120591 


ESTs 


12.7 


433234 


AB040928 


Hs.65366 




12.6 




AA534296 


Hs.20953 


ESTs 


12.6 


409995 


AW960597 


Hs.30164 


ESTs 


12.6 




AI568801


Hs.71721 


ESTs 


12.6 


417248 


AA329449 


Hs.247302 


twisted gastrulation 


12.6 


418624 


AI734080 


Hs.104211 


ESTs 


12.6 


412654 


AI093480 


Hs.29263 


hypothetical protein FLJ11896


12.6 


450253 
446619 


AL133047


Hs.24715 


Homo sapiens mRNA; cDNA DKFZp434D0215 (f 
secreted phosphoprotein 1 (osteopontin,


12.6 


AU076643 


Hs.313 


12.6 


427078 


AI676062 


Hs.111902


ESTs 


12.5 


420838 


AW118210 


Hs.5244 


ESTs 


12.5 




AW161319 


Hs.12915 


ESTs 


12.6 


431325 


AW026751 


Hs.5794 


ESTs, Weakly similar to 21 09260A B cell 


12.5 






Hs.78353 


SFRS protein kinase 2 


12.5 


426110 


NM_002913


Hs.166563


replication factor C (activator 1) 1 (14 
estrogen receptor 1 


12.5 


400301 


X03635 


Hs.1657 


12.5 


434926 


BE543269 


Hs.50252 


mitochondrial ribosomal protein L32 


12.5 


426316 


NM_002430 


Hs.268515 


meningioma (disrupted in balanced transl


12.5 


420345 


AW295230 


Hs.25231 


ESTs 


12.4 


432205 


AI806583 


Hs.125291 


ESTs 


451893 


AW192083


Hs.290855 


ESTs 


12.4 


435266 
427003 


AK001942 


Hs.4863 


hypothetical protein DKFZp566A1524 


12.4 


U19487 


Hs.2090 


prostaglandin E receptor 2 (subtype EP2) 


12.4 


404642 






NM_021965*:Homo sapiens phosphoglucomuta 


12.4 




AA425085 


Hs.4007 


Sarcolemmal-associated protein 


12.4 


429716 






collagen, type XIII, alpha 1 




420871 


AA702972 


HSJ5300 


ESTs 


12.4


448072 


AI459306 


Hs.24908 


ESTs 


12.4 


441269 


AW015206 


Hs.178784 


ESTs 


12.3 


427761 


AA412205 


Hs.140996 


ESTs 


12.3 


425322 


U63630 


Hs.155637 


protein kinase, DNA-activated, catalytic 


12.3 


417733 


AL048678 


Hs.82503 


H.sapiens mRNA for 3'UTR of unknown prot


12.3 


436521 
408920 




Hs.213003 
Hs.48998 


ESTs 

fibronectin leucine rich transmembrane p 


12.3 
12.3 


446332 
453994 


AK001635 
BE180964 


Hs.14838 
Hs.165590 


hypothetical protein FLJ10773 
ribosomal protein S13 


12.2 
12.2 


420397 


NM_007018


Hs.97437 




12.2
442562 BE379584
400880 

425920 AL049977 

419743 AW408762 

414222 AL135173 

427051 BE178110 Hs.173374 

452994 AW962597 

425154 NM_001851 



437124 AA554458 

411450 H49619 

421559 NM_014720

449625 NM_014253

419459 AW291128 

426252 BE176980 

414171 AA360328 

436169 AA888311 

435047 AA454985 

417625 U59305 

439820 AL360204 

443646 AI085198 

434874 N62448 

443837 AI984625 

451640 AA195601 

424232 AB015982 

414555 N98569 

446874 AW968304 

454119 BE549773 

436746 AA730045 

432359 M076049 

416508 R39769 

444758 AL044878 

413991 H44725 

431645 AF078849 

421406 AF179897 

408826 AF216077 

424903 T26477 

412977 AA125910 

426981 AL044675 

410853 H04588 

444670 H58373 

414381 BE387335 

434398 AA121098 

438032 BE045624 

433212 BE218049 

424330 AW073953 



sprouty (Drosophila) homolog 2 

-•nli' In mhosphooligosaccharide-prola 

NM_000611*:Homo sapiens CD59 antigen p18 

3 mRNA sequence 



423782 AI472209 

425770 NM_014363

437108 AA434054 

430200 BE613337 

453857 AL080235 

411096 U80034 

422805 AA436989 

422538 NM_006441

412677 AW029608 

421896 N62293 

453930 AA419466 

443123 AA094538 

415293 R49462 

443161 AI038316 

420185 AL044056 

445527 W39694 

428829 R14050 

416852 AF283776 

449919 AI674685 

414844 AA296874 

449543 AF070632 

429784 M89796 

407786 AA687538 

414407 AA147026 

441247 AW118681 

456804 AI421645 

422546 AB007969 

417350 U50928 

417683 AW566008 

424800 AL035588 

433577 AW007080 

453935 AI633770 

415060 AJ223810 

425710 AF030880 

448766 AI473827 



Hs.31305
Hs.154850 
Hs.197751 
Hs.127301 
Hs.105751
Hs.23796 
Hs.278422 
Hs.28917 
Hs.865 
Hs.17602
Hs.54973 
Hs.44708 
Hs.283853 
Hs.164226 
Hs.293970 
Hs.9884 
Hs.26771 
Hs.143460 
Hs.76422 
Hs.56156 
H3.4QS10 
Hs.187866 ESTs 
Hs.274415 
Hs.206088 
Hs.11899 
Hs.42683 
Hs.266483 
Hs.104105 
Hs.48376 
Hs.22883 
Hs.191461 
Hs.173081 
Hs.30469 
Hs.332938 
Hs.283713 
Hs.3838 
Hs.152992
Hs.121820 
Hs.333396 
Hs.83883 
Hs.156933 
Hs.323117 
Hs.159492 
Hs.80624 
Hs.234896 
Hs.35861 



sorbitol dehydrogenase 
Homo sapiens cDNA FLJ10500 fis, cl(
KIAA1547 protein
collagen, type IX, alpha 1 



e NT 



Ste20-related se 
odz (odd Oz/len- 
DKFZP586G112 



spindle pole body protein 
Human DNA sequence from clone 747H23 on 
protein kinase C, nu 
phospholipase A2, group IIA (platelets,



ESTs, Weakly similar to I38022 hypotheti 
ESTs 

KIAA0530 protein 



Homo sapiens cDNA FLJ13596 fis, cl<



DKFZP586E1621 protein 



Hs.121017 H2A histone family, member A 

Hs.118131 5,10-methenyltetrahydrofolate synthetase

Hs.17384 ESTs 

Hs.45107 ESTs 

Hs.36727 hypothetical protein FLJ10903

Hs.272B08 putative transcription regulation nuclea 

Hs.106541 ESTs 

gb:ox48c08.x1 Soares_total_fetus_Nb2HF8_

Hs.158047 ESTs 

Hs.83286 ESTs, Weakly similar to S14747 sphingomy 

Hs.194051 Homo sapiens mRNA; cDNA DKFZp566B2'3 (fr

Hs.80285 Homo sapiens mRNA; cDNA DKFZp586C1723 (f

Hs.200141 ESTs 



05 mRNA sequence 



Hs.23729 
Hs.30 

Hs.38972 tetraspan 1 

Hs.76704 ESTs 

Hs.128051 Homo sapiens thymic stromal lymphopoieti

Hs.139851 caveolin 2

Hs.301478 KIAA0500 protein 

Hs.82001 polycystic kidney disease 2 (autosomal d 

Hs.239154 ankyrinrep i fa lil F FXANK-IIke), 

Hs.153203 MyoD family inhibitor 

Hs.8817 ESTs 

Hs.42572 ESTs 

Hs.43213 ESTs, Weakly similar to IEFS_HUMAN TRANS

Hs.159275 solute carrier family, member 4

Hs.31793 ESTs 






444151 AW972917 

410108 AA081659 

418450 RB4397 

415457 AW081710 

452110 T47667 

414212 AA136569 
401519 

458509 AA654550 

416913 AW934714 

452784 BE463857 

436758 AW977167 

409401 AI201895 

434715 BE005346 

427283 AL119796

428342 AI739168 

419465 AW500239 

430519 AF129534 

418838 AW385224 

429163 AA884766 

434158 T86534 

414341 D80004 

414650 AA150435 

44B475 BE613134 

437575 AW954355 

423566 AW976434 

425289 AW139342

452598 AI831594 

417433 BE270266 

438379 N23018 

433230 AW136134 

412622 AW664708 

450353 AI244661 

434349 NM_015678



422890 Z43784 

437162 AW005505 

450244 AA007534 

417169 R13560 

421481 AW391972 

431576 M76665 

408988 AL119844 

439706 AW872527 

435133 AJ010482 

433658 L03678 

428364 M426565 

441540 C01367 

431154 AW971228 

416777 AF146760 

439556 AI623752 

428280 H05541 

453942 AW190920 

447982 H22953 

422779 AA317036 

447595 AW379130 

427115 AW972853 

448019 AW947164 

419326 W94915 

435163 AA668884 

417578 T91443 

407930 AA045847 

450206 AI796450 

426413 AA377823 

421977 W94197 

437967 BE277414 

408243 Y00787 

439492 AF086310 

413492 D87470 

436962 AW377314 

435572 AW975339 

422583 AA410506 

417665 AW852858 

433285 AW975944 

419693 AA133749 

424878 H57111 

449659 R60031 

415910 U20350 

433556 W56321 

452241 AL050204 

450330 AW500775 

430589 AJ002744 



Hs.318775 
Hs.193651 
Hs.7369 
Hs.28005 
Hs.10848



Hs.282906 ESTs 



ESTs, Weakly similar to ALU 1.H I 
Homo sapiens cDNA FLJ11309 fis, clone ML
KIAA0187 gene product
C15000476*:gi|12737279|ref|XP_012163.1| 



Hs.151258 
Hs.155272
Hs.181309
Hs.116410



proteasome (prosome, macropain) subunit,



Hs.14372
Hs.75909 
Hs.72063 
Hs.247474 
Hs.36529 
Hs.3623 
Hs.155530 
Hs.68647 
Hs.82128 
Hs.171391
Hs.220277
Hs.171959
Hs.103296
Hs.3821 
Hs.237225 
Hs.98732 
Hs.139336 
Hs.7413 
Hs.75893 
Hs.5464 
Hs.125062 
Hs.246773 
Hs.104696 
Hs.275215 



Homo sapiens cDNA FLJ13458 fis, clone PL
Homo sapiens cDNA: FLJ23068 fis, clone L
F-box only protein 4

ectonucleotide pyrophosphatase/phosphodi
gb:am20a10.s1 Soares_NFL_T_GBC_S1 Homo s 



ESTs 

hypothetical protein FLJ21032
hypothetical protein MGC11242
hypothetical protein FLJ11220
interferon, gamma-inducible protein 16
ESTs, Weakly similar to ALU7_HUMAN ALU S
5T4 oncofetal trophoblast glycoprotein 



ESTs, Weakly similar to S65657 alpha-1C- 
neurobeachin 

hypothetical protein HT023 

Homo sapiens Chromosome 15 BAC clone CIT 

ATP-binding cassette, sub-family C (CFTR

ESTs 



KIAA1324 protein 

hydroxysteroid (11-beta) dehydrogenase 1 
Homo sapiens clone TUA8 Cri-du-chat regi 
ESTs, Weakly similar to DAP1_HUMAN DEATH 



Hs.156110 immunoglobulin kappa constant
II 31 103 EST Modei el/ sim toALU1_ 



Hs.290259 
Hs.79844 
Hs.163603 



Hs.118787 transforming growth factor, beta-induced



Hs.112237
Hs.195641 
Hs.42419 
Hs.19155 
Hs.193963 
Hs.188361 
Hs.201600 

Hs.110165 



Hs.103159 ESTs 

Hs.75400 KIAA0280 protein 

Hs.5364 DKFZP564I052 pre 

Hs.239828 ESTs, Weakly simi 

Hs.27973 KIAA0874 protein 

Hs.22862 ESTs 

Hs.237396 ESTs 

Hs.301350 FXYD domain-containing ion transport reg 

Hs.221132 ESTs 

Hs.198899 eukaryotic translation initiation factor 

Hs.78913 chemokine (C-X3-C) receptor 1

Hs.111460 calcium/calmodulin-dependent protein kin

— ■ cDNADKFZp586F1223(f 



ESTs, Moderately similar to I38022 hypot 

ESTs 
ESTs 

Homo sapiens cDNA FLJ12807 fis, clone NT
ESTs 

gb:EST90805 Synovial sarcoma Homo sapien 

ribosomal protein L26 homolog 

mel transforming oncogene (derived from 



to GAG2_HUMAN RETRO 



Hs.28540 
Hs.24817 
Hs.246315 UDP-! 



lin FLJ20136 






433923 AI823453 

418196 AI745649 

416155 AI807264 

430057 AW450303 

429350 AI754634 

418601 AA279490 

437267 AW511443 

432169 Y00971 

421982 AF206019 

414085 AA114016 

418662 AI801098 

449685 AW296669 

441111 AI806867 

436671 AW137159 

447974 R76886 

417916 NM_006416 

424562 AI420859

408331 NM_007240 

443015 R33261 

439584 AA838114 

426298 AW965058 

420929 AI694143 

418318 U47732 

414565 AA502972 

418329 AW247430 

424433 H04607 

420230 AL034344 

419441 AW023731 

426011 AW996096 

451900 AB023199 

424692 AA429834 

439999 AA115811

424368 AB037766 
402076 

413886 AW958264 

445041 T64183 

428927 AA441837 

414831 M31158 

407902 AL117474

426429 X73114 

452055 AI377431 

438913 AI38C429 

443684 AI681307 

429281 AA830856 

421040 AA715026 

425277 NM_001241

418564 AA631143 

440099 AL080058 

434958 T99949 

419972 AL041465 

416182 NM_004354 

418365 AW014345 

452286 AI358570 

430361 AI033S65 

446716 M436575 

419544 AI909154 

433023 AW8647S3 

409151 AA306105 



Hs.146625 ESTs 



413328 Y15723 

453020 AL162039

411359 H86088 

420000 AB036063 

412262 W26406 

419875 AA853410 

414422 AA147224 

426581 AB040956 

424649 BE242035 

407317 AI204033 

440426 AI159800

433495 AW373784 

423250 BE061916 

438459 T49300 

451593 AF151879 

435106 AA100847 

438157 AW137011 



Hs.205442 
Hs.2534 
Hs.131987 
Hs.86368 
Hs.258110 
Hs.2910 
Hs.110347
Hs.75746
Hs.151500
Hs.66095
Hs.126594
Hs.146151

Hs.82921
Hs.150557
Hs.44229 



KIAA1708 protein 

ESTs, Weakly similar to T34036 hypotheti 
bone morphogenetic protein receptor, typ 



ESTs, Weakly similar to A43932 mucin 2 p 



Hs.84072 
Hs.183390
Hs.84152 
Hs.9218 
Hs.284186 
Hs.274368 
Hs.58924 
Hs.27207 
Hs.151791 
Hs.6838 
Hs.146085

Hs.103832
Hs.282982 
Hs.90250 
Hs.77439 
Hs.41181 
Hs.169849 
Hs.141693 
Hs.172445
Hs.55098
Hs.29808
Hs.135280
Hs.155478 
Hs.278695 



KIAA0092 gene product 
ras homolog gene family, mi 
KIAA1345 protein



similar to yeast Upf3, variant B 



protein kinase, cAMP-dependent, regulate 

Homo sapiens mRNA; cDNA DKFZp727C191 (fr 

myosin-binding protein C, slow-type 

hypothetical protein MGC10858

ESTs 

ESTs 

Homo sapiens cDNA: FLJ21122 fis, clone C

ESTs 

cyclin T2

Homo sapiens prostein mRNA, complete cds 

DKFZP564G202 protein 

Homo sapiens cDNA FLJ14832 fis, clone OV

golgin-67 

cyclin G2 



Hs.123933 ESTs, Weakly similar
Hs.239926 sterol-C4-methyl oxi



M ZINC 



Hs.87409 
Hs.50785 
Hs.37528 



445715 AB012958 

443228 W24781 Hs.293798 

424051 AL110203 Hs.138411 

453496 AA442103 Hs.33084 

452741 BE392914 Hs.30503 

428728 NM_016625 Hs.191381



Hs.75295 
Hs.31422 
Hs.22635 
Hs.94262 
Hs.295923 



Hs.125849
Hs.35304
Hs.26706
Hs.193380
Hs.49576 



gb:QV-BT200-010499-007 BT200 Homo sapien 
thrombospondin 1 

SEC22, vesicle trafficking protein (S. c
ESTs, Weakly similar to AF090944 1 PRO06
UV radiation resistance associated gene
KIAA1710 protein

Homo sapiens mRNA; cDNA DKFZp586J1922 (f
solute carrier family 2 (facilitated glu
Homo sapiens cDNA FLJ11344 fis, clone PL

lung type-I cell membrane-associated gly

guanylate cyclase 1, soluble, alpha 3 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 



seven in absentia (Drosophila) homolog 1 



Homeobox A13 
KIAA1523 protein 
embryonic ectoderm development 
ESTs, Weakly similar to I38022 hypotheti
Homo sapiens cDNA FLJ13663 fis, clone PL
alpha-2-glycoprotein 1, zinc
chromosome 8 open reading frame 2
Homo sapiens cDNA FLJ13655 fis, clone PL
CGI-121 protein

ESTs, Highly similar to AF174600 1 F-box
ESTs 



WO 03/042661 



PCT/US02/36810 






443181 AI039201 

449125 AI671439 

411479 AW848047 

446553 AB021179 

418278 AI088489 

419791 AI579909 

418866 T65754 

413753 U17760 

445564 AB028957 

435021 AA922192 

457498 AI732230 

425628 NM_004476

419589 AW973708 

446162 AI531319

414523 AU076633 

427393 AB029018 

408741 M73720 

437743 AI383497 

448172 N75276 

421863 AI952677 

444618 AV653785 

413450 Z99716 

413431 AW246428 

410150 AW382942 

444100 M383343 

448148 NM_016578 

443476 AW068594 

433345 AI681545 

423681 AB023215 

450205 AI219748 

408374 AW025430 

446035 NM_006558 

432589 AL135725

435604 AA625279 

411997 AW673478 

419672 AA465113 

444564 AI167877 

426931 NM_003416

451424 AI862026 

433730 AK002135 



Homo sapiens mRNA for KIAA1657 protein, 
gb:IL3-CT0214-291299-052-A12 CT0214 Homo
HMBA-inducible 



425683 AB037813 

442760 BE075297 

418884 AA230228 

444821 AA053564 

451989 AF169797 

439731 AI953135 

453127 AI696671 

435706 W31254 

410748 BE383816 



430291 AV660345 

440370 AA884000 

412350 AI659306 

428825 AI084336 

441054 AA913591 

415875 AA894876

453078 AF053551 

409203 AA780473 

446720 AI439136 

418475 AI858732 

417708 N74392 

407870 AB032990 

420568 F09247 

418858 AW961605



434579 T55958 

434826 AF155661 

427528 AU077143 

413174 AA723564 

443250 AI041530 

451957 AI796320 



Hs.75517 

Hs.12896 

Hs.54709 

Hs.191737 

Hs.1915 

Hs.201925 

Hs.63841 

Hs.76353 

Hs.177635 

Hs.646 

Hs.131811 

Hs.77558 

Hs.135904 

Hs.108972

Hs.173334 

Hs.75372 



Hs.75 
Hs.6774 
Hs.22116 
Hs.20509 
Hs.133878
Hs.152982
Hs.131525
Hs.11356
Hs.155591
Hs.13565 
Hs.131708 
Hs.26892 
Hs.108323 
Hs.23853 
Hs.143716 
Hs.2076 
Hs.302810 
Hs.3542 
Hs.195839 
Hs.79881 
Hs.159200
Hs.6614
Hs.59197
Hs.12040
Hs.27413
Hs.45140
Hs.294110
Hs.7045 
Hs.12532 
Hs.83918 
Hs.63428 
Hs.104215 
Hs.29279 



gb:yc11c07.s1 Stratagene lung (937210) H

laminin, beta 3 (nicein (125kD), kalinin

KIAA1034 protein 

ESTs 

ESTs 

folate hydrolase (prostate-specific memb 
Homo sapiens cDNA FLJ13446 fis, clone PL
hypothetical protein DKFZp434E2318
serine (or cysteine) proteinase inhibito
KIAA1095 protein
carboxypeptidase A3 (mast cell)
ESTs, Weakly similar to ALU1_HUMAN ALU S 
thyroid hormone receptor interactor 7 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434P228 (fr 
ELL-RELATED RNA POLYMERASE II, ELONGATI 1 
N-acetylgalactosaminidase, alpha-
ubiquitin-conjugating enzyme E2N (homolo 
ESTs 

CDC14 (cell division cycle 14, S. cerevi 

HBVpX associated protein-8 

ESTs, Weakly similar to YCD1_HUMAN HYPOT

hypothetical protein FLJ13117

Homo sapiens mRNA; cDNA DKFZp434E199 (fr 



ESTs 

zinc finger protein 7 (KOX 4, clone HF.1 
Novel human gene mapping to chomosome 20
hypothetical protein FLJ11273
ESTs, Weakly similar to I38022 hypotheti
Homo sapiens cDNA: FLJ23006 fis, clone L
hypothetical protein DKFZp762K222 
ESTs, Weakly similar to A43932 mucin 2 p 
ESTs 

STE20-like kinase 

adaptor protein containing pH domain, PT 
hypothetical protein FLJ14084



Hs.126480 

Hs.5687 

Hs.31584 

Hs.587 

Hs.140546 

Hs.30443 

Hs.50495 

Hs.40719 

Hs.247735 



Hs.179565 

Hs.191343 

Hs.132107 

Hs.10299 

Hs.6295 

Hs.114556

Hs.179982 

Hs.79362 



adenosine monophosphate deaminase (isofo 
ESTs, Weakly similar to Z195_HUMAN ZINC
ESTs, Moderately similar to SPCN_HUMAN S 
eyes absent (Drosophila) homolog 2 
ESTs 

CGI-49 protein 

hypothetical protein FLJ10803

protein tyrosine phosphatase, non-recept

ESTs, Weakly similar to I38022 hypotheti



(formerly 2C), ma 

cytochrome P450, subfamily IVB, polypept
ESTs

sentrin/SUMO-specific protease
ESTs

hypothetical protein KIAA1164
protocadherin alpha 10 
hypothetical protein RG083M05.2 
integrin, alpha 1 
KIAA0033 protein 

gb:yb35f05.r1 Stratagene fetal spleen (9 
pyruvate dehydrogenase phosphatase 



ESTs 

Homo sapiens cDNA FLJ13545 fis, clone PL
ESTs, Weakly similar to T17243 hypotheti
hypothetical protein FLJ20174
tumor protein p53-binding protein
retinoblastoma-like 2 (p130)






439091 AA830144 

418575 AA225313 

432426 AW973152 
404571 

444427 H25094 

439183 AW970600 

432886 BE159028 

432689 AB018320 

417333 AL157545

429302 AU076674 

412324 AW978439 

422072 AB018255 

431318 AA502700 

417171 BE613486 

421709 AA159394 

415156 X84908 

436446 AW016809 

447482 AB033059 

439726 AW449893 

429525 N92540 

409190 AU076536 

419111 AA234172 

452487 AW207659 

416039 AA376989 

408239 AA053401 

409752 AW963990 

453894 AW937825 

439671 AW162840 

407230 AA157857

426083 AW962712 

452221 C21322 

434263 N34895 

412766 BE544475 

418723 AA504428 

443970 AI280341 

424534 D87682 

419407 AW410377 

447889 AW469180 

429504 X99133 

433165 AA578904 

418700 AI963808 

440282 BE262386 

411630 U42349 

408915 NM_016651

448552 AW973653 

432682 AI376400 

435803 Z44194 

437444 H45008 

410252 AW821182 

438497 AA808725 

449226 AB002365 

449465 NM_004380 

420969 AI636310 

449845 AW971183 

434415 BE177494 

420805 L10333 

415083 AI632683 



428250 AW809208 

453293 AA382267 

412446 AI768015 

441102 AA973905 

421689 N87820 

448073 W19789 

422244 YOB890 

452323 W44356 

441499 AW298235 

420184 AA188408 

420061 AW024937 

420337 AW295840 

417407 AA923278 

450580 N40087 

419926 AW900992 

421928 AF013758 

412639 AW961284 

429786 AL080232

446131 NM_000929

451418 BE387790 

416658 U03272 

425139 AW630488 



Hs.31050 ESTs 
Hs.29 



Hs.303261 
Hs.279704 
Hs.278626 
Hs.42179 
Hs.198899
Hs.69504 
Hs.111138 
Hs.293147 
Hs.81412 
Hs.107056 
Hs.78060 
Hs.323795 
Hs.18705
Hs.293707 
Hs.205353 
Hs.50984 
Hs.137418 



chromatin accessibility complex 1
Arg/Abl-interacting protein ArgBP2 
bromodomain and PHD finger containing, 3 
eukaryotic translation initiation factor 



KIAA1233 protein 

ESTs, Weakly similar to I38598 zinc fing 
ectonucleoside triphosphate diphosphohyd 
sarcoma amplified sequence 
ESTs 

Homo sapiens cDNA FLJ13329 fis, clone OV
alcohol dehydrogenase 5 (class III), chi
ESTs, Moderately similar to ALU7_HUMAN A 



Hs.54347 

Hs.10487

Hs.166571 

Hs.150275 

Hs.41502 

Hs.170651

Hs.204238 

Hs.292437 

Hs.86970 

Hs.7137 

Hs.71119 

Hs.48950 

Hs.20104 

Hs.159588 

Hs.4994 

Hs.31518 

Hs.61418 

Hs.291712 

Hs.23311 



Hs.27179 
Hs.240770 
Hs.306088 
Hs.119471 
Hs.183297 
Hs.10653 
Hs.92127 
Hs.331328 
Hs.106826 
Hs.336635 
Hs.113503
Hs.292812 
Hs.101689 
Hs.95665 
Hs.29410 
Hs.14555 
Hs.290905 



Homo sapiens, clone IMAGE:3954132, mRNA,



ESTs, Moderately similar to ALU5_HUMAN A 
clones 23667 and 23775 zinc finger prote 
Putative prostate cancer tumor suppresso 
heptacellular carcinoma novel gene-3 pro 
hypothetical protein FLJ00052



ESTs, Weakly similar to I38022 hypotheti
KIAA0367 protein 

CREB binding protein (Rubinstein-Taybi s 



Homo sapiens cDNA FLJ 12933 fis, clone NT 
nuclear cap binding protein subunit 2, 2 
CT10 oncogene 



DKFZP566F2124 protein 



ESTs, Weakly similar to T33468 hypotheti



Hs.109643 polyadenylate binding protein-int



Hs.220696 

Hs.290 

Hs.26369 

Hs.79432 

Hs.325820 



Homo sapiens mRNA; cDNA DKFZp56GACG1 (fi 
phospholipase A2, group V 
hypothetical protein FLJ20287
fibrillin 2 (congenital contractural ara 
protease, serine, 23 






448807 AI571940 

412505 AA974491 

412314 AA825247 

445704 AI493742 

428465 AW970975 

422564 AI148006

430027 AB023197 

450680 AF131784 

424850 AA151057 

433862 D86960 

429227 AI961456 
401558 

428634 AA811845 

437678 AA829860 

416653 AA768553 

431117 AF003522 

450506 NM_004460 

431187 AW971146 

421202 AF193339

429597 NM_003816

430669 AW969657 

435869 AF255910 



423800 AA331156 

432189 M527941 

434361 AF129755 

443247 BE614387 

419879 Z17805 

411928 AA888624 

448704 AW080932 

448430 AI500642 

451246 AW189232

418821 AA436002 

427213 AW007211 

419103 Z40229 

450654 AJ245587 

434194 AF119847 

447033 AI357412 

420757 X78592 

428593 AW207440 

430223 NM_002514

444984 H15474 

445674 BE410347 

430294 AI538226 

419852 AW503756 

440349 AA884196 

432161 AK000400 
405523 

416662 T25853 

422763 AA033699 

446183 AA354991 

432897 AF155099 

448988 Y09763 

449523 NM_000579

407204 R41933 

413554 AA319145 

414917 C04863 

434539 AW748078 

441805 AA285135 

412530 AA766268 

424683 N87519 

426158 NM_001982

413132 NM_006823

426490 NM_001621

446896 T15767 

441079 AW150697

437231 BE303000 

431316 AA502663 
403137 

433628 AI821784 

418489 U76421 

453344 BE349075 

410227 AB009284 

417173 U61397 

408161 AW952912 

420495 AI338247

433213 AW665130 

451573 AW130351 

408393 AW015318 

434725 AK000796 

418876 AA740616 



Hs.7549 


ESTs 


9.3 


Hs.21734 


ESTs 




Hs.250899 


heat shock factor binding protein 1 


9.3


Hs.165210 


ESTs, Moderately similar to I38022 hypot 


9.3 




NM 001 )':Homo is calronir. 3 id 




Hs.293653 


ESTs 




Hs.222120 


ESTs 


9.3


Hs.227743 


KIAA0980 protein 




Hs.25318


Homo sapiens clone 25194 mRNA sequence 




Hs.153498




9.2 


Hs.3010 


KIAA0205 gene product




Hs.21275 


hypothetical protein FLJ11011 
ENSP00000220478*:SECRETOGRANIN III.


9.2


Hs.106290 


Kelch motif containing protein 




Hs.122834


ESTs 




Hs.74170 


metallothionein 1E (functional)


Hs.250500 
Hs.418 






fibroblast activation protein, alpha 




Hs.293187


ESTs 




Hs.102506 


eukaryotic translation initiation factor 




Hs.2442 






Hs.291029 


ESTs"' 69 "" mela " opro,e " lased ° ma 


Hs.54650 






Hs.289038 


hypothetical protein MGC4126 




Hs.282093 


hypothetical protein FLJ21918




gb E 5T35034 Enbryo, 6 week subtracted; 






gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 


ai 


Hs.117772


ESTs 




Hs.333893 


c-Myc target JP01 




Hs.93564 


Homer, neuronal immediate early gene, 2 




Hs.197289
Hs.249247 


rab3 GTPase-activating protein, non-cata 








Hs.289057 


Homo sapiens cDNA FLJ11404 fis, clone HE




Hs.39140 


cutaneous T-cell lymphoma tumor antigen




Hs.183161


ESTs 




Hs.16131


hypothetical protein FLJ12876




Hs.96423 


hypothetical protein FLJ23033 




Hs.25275 


Kruppel-type zinc finger protein 

Homo sapiens PRO1550 mRNA, partial cds 


9.1


Hs'l57601 






Hs.99915 


androgen receptor (dihydrotestosterone r 




Hs.185973 


degenerative spermatocyte (homolog Droso 




Hs.235935 
Hs.132398 










Hs.13063


transcription factor CA150




Hs.32976 


guanine nucleotide binding protein 4 




Hs.286184 


hypothetical protein dJ551D2.5 


9.0


Hs.31476 


Homo sapiens cDNA FLJ13872 fis, clone TH 




Hs.292807 


- T \ ikly mil oenvelo| 'H 




Hs.7538 


C8001409*:gi|7441226|pir||S31212 collage




Hs.83938 
Hs.14222






Homo sapiens mRNA; cDNA DKFZp761P019 (fr 




Hs.279780 


NY-REN-18 antigen 




Hs.22785 


gamma-aminobutyric acid (GABA) A recepto 




Hs.54443 


chemokine (C-C motif) receptor 5




Hs.140237 


ESTs, Weakly similar to ALU1_HUMAN ALU S




Hs.75426 
Hs.47191 


secretogranin II (chromogranin C)


8.9 


Hs.214410 


ESTs, Weakly similar to MUC2_HUMAN MUCIN 




Hs.301914 






Hs.266273 


hypothetical protein FLJ13346






ESTs 




Hs.199067


v-erb-b2 avian erythroblastic leukemia v 




Hs.75209 






Hs.170087


aryl hydrocarbon receptor 




Hs.22452 


Homo sapiens mRNA for KIAA1737 protein, 




Hs.107418 






Hs.288929 


hypothetical protein FLJ13258 similar to










NM_005381*:Homo sapiens nucleolin (NCL)




Hs.188578 


ESTs 


8.9 


Hs.85302 


adenosine deaminase, RNA-specific, B1 (h


8.9 


Hs.44571 


ESTs 


8.9 


Hs.61152


exostoses (multiple)-like 2 


8.9 


Hs.81424 


ubiquitin-like 1 (sentrin)


8.9 


Hs.300383 


hypothetical protein MGC3032 


8.9 


Hs.98314 


Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
ESTs 


8.9 


Hs.137190


8.9 


Hs.243549 


ESTs 


8.9 


Hs.23165 


ESTs 


8.3 


Hs.4104 


hypothetical protein 


6.3 




gb:ny97f11.s1 NCI_CGAP_GCB1 Homosapiens 


8.3 






444558 AW181975

417355 D13168

424084 AI940675

431706 AI816086

424956 AW198103

441866 BE464341 

444647 H14718 

444858 AI199738

409643 AW450866 

428647 AA830050 

412330 NM_005100

410275 U85658 

425465 L18964 

424113 AI743880

459324 AW080953 

451684 AF216751 

451244 AW008798 

439170 AA332365 

424238 AA337401 

449686 AW072813

440658 H29142 

410870 U81599 

458229 AI929602

429831 AA564489 

453468 W00712 



408949 AF189011 

424558 D83702 

412719 AW016610 

423872 AB020316 

408242 AA251594 

430848 AW021726 

421492 BE176990 

449893 T97999 

410442 X73424 

425913 AA365799 

425704 U79293 

427723 AI355260 

441583 BE564214

447335 BE617695 

428259 AA424793 

433104 AL043002 

420041 AB005142 

432432 AA641323 

452335 AW188944

412673 AL042957 

430335 D80007 

419904 AA974411 

442402 NM_000954

433759 AA680003 

434064 AL049045 
401785 

418407 AL044818

419647 AA348947 

425907 AA365752 

452017 AF109302 

424434 H87028 

400303 AA242758 

414680 AA743331 

413992 W26276 

415071 AK002197 

451230 BE546208 

441790 AW294909 

441124 T97717 

438160 AA779332 

433264 D85782 

434851 AA806164 

420608 BE548277 

426535 AU077012 

411667 BE160198 

454058 AI273419 

435614 R09718

446595 T57448 

435677 AA694142 

447894 AW204253 

452682 AA456193 

418372 AA311833 

447231 AK001293 

416206 AW206248 



Hs.15815 
Hs.21201 
Hs.11506 
Hs.208275 
Hs.257359 
Hs.124344 
Hs.788 
Hs.61796 
Hs.1904 

Hs.12876 ESTs 



Hs.26813 

Hs.267448 

Hs.165539 

Hs.137635 

Hs.270868 

Hs.143032 

Hs.66731 

Hs.177 

Hs.137526

Hs.32990 

Hs.154721 

Hs.90063 

Hs.33074 



endothelin receptor type B 
hypothetical protein FLJ23055 
adenylyl cyclase-associated protein 2 



A kinase (PRKA) anchor protein (gravin) 
transcription factor AP-2 gamma (activat
" Ciota 



ESTs, Moderately similar to ALU4_HUMAN A 
ESTs, Weakly similar to neuronal thread 
homeo box B13 

phosphatidylinositol glycan, class H
ESTs 

DKFZP566F084 protein 
neurocalcin delta 

Homo sapiens, clone IMAGE:3606519, mRNA,

hypothetical protein PP1044

Homo sapiens, Similar to zinc finger pro



Hs.279789 

Hs.102946

Hs.286192 

Hs.24144 

Hs.128246

Hs.94592 

Hs.115831 

H S .S12^2 

Hs.31845 

Hs.239499 

Hs.18672

Hs.8272 

Hs.109363 

Hs.180758 



Hs.91816 
Hs.155965 
Hs.27495 
Hs.146861 
Hs.79136 
Hs.272572 
Hs.136075 
Hs.284270 
Hs.26090 
Hs.132208
Hs.119563
Hs.122671



(photolyase-like) 
ESTs 

uronyl 2-sulfotransferase 
PIBF1 gene product 

gb:df27e02.y1 Morton Fetal Cochlea Homo 
hypothetical protein FLJ21940
ESTs, Weakly similar to B34087 hypotheti
propionyl Coenzyme A carboxylase, beta p
SEC22, vesicle trafficking protein (S. c 
Human clone 23948 mRNA sequence 
histone deacetylase 3 



hypothetical protein FLJ20940 



KIAA0185 protein
ESTs 

prostaglandin D2 synthase (21kD, brain)
Homo sapiens cDNA FLJ20311
hypothetical protein PRO0082
NM_002275*:Homo sapiens keratin 1
nuclear transcription factor Y, beta
hypothetical protein
ESTs 

prostate cancer associated protein 7 
hypothetical protein FLJ20580 
LIV-1 protein, estrogen regulated 
hemoglobin, alpha 2 
RNA, U2 small nuclear
Homo sapiens cDNA FLJ11335 fis, cl
FLJ20272



cysteine dioxygenase, type I 
Hs.116502 ESTs 
Hs.103104 ESTs 

Hs.288582 ESTs, Weakly similar to ubiquitous TPR m 

gb:QV1-HT0413-010200-059-h03 HT0413 Homo
Hs.135146 hypothetical protein FLJ13984
Hs.20403
Hs.15467
Hs.293726 
Hs.21912 
Hs.9071 
Hs.84318 
Hs.33020f 

Hs.11 10 p pi I 






434747 AA837085 

415938 BE383507 

450813 AI739625 

413956 AI821351 

408599 AA056614 

440840 AW629666 

407891 AA486620 

411019 AW993097 

432620 M777749 



418120 AA213437 

419436 AA991639 

432600 AI821085 

448474 AI792014 

434384 AA631910 

452234 AW084176 

411562 AL050201 

421622 AB037748 

431160 AW971174 

449567 AI990790 

411088 BE247593 

452186 M120761 

437872 AK002015 

429393 AA383024 

425692 D90041 

438440 AA807228 



ESTs, Weakly similar to ALU7_HUMAN ALU S
ESTs, Moderately similar to ZN91_HUMAN Z
ESTs, Weakly similar to S64054 hypotheti 



Hs.48617 Homo sapiens cDNA FLJ12540 fis, clone NT

Hs.5978 LIM domain only 7 

Hs.193557 ESTs, Moderately similar to ALU7_HUMAN A

Hs.31835 ESTs 
Hs.30213 
Hs.21943 



Hs.23648 
Hs.192249 
Hs.242413



Hs.162849 ESTs 



NIF3 (Ngg1 interacting factor 3, S.pombe
5-methyltetrahydrofolate-homocysteine me
Homo sapiens cDNA FLJ13872 fis, clone TH
Homo sapiens cDNA FLJ13097 fis, clone NT



hypothetical protein DKFZp434K1421 
gb:ns95a12.y5 NCI_CGAP_Pr3
hypothetical protein FLJ10648



ESTs, Weakly similar to I38022 hypotheti 
hypothetical protein DKFZp586E1923 
KIAA1327 protein 



Hs.145053 ESTs 



Hs.155956 
Hs.225161 
Hs.272251 



4 (formin bind 
RNA binding motif protein 7 
Homo sapiens mRNA; cDNA DKFZp434D0917 (f 
N-acetyltransferase 1 (arylamine N-acety 



446947 AF146747 
403149 

436679 AI127483 

421991 NM_014918

436476 AA326108 

434980 AW770653 

407385 AA610150 

453121 AI968264

420630 AL133101 

426101 AL049987 

419517 AF052107 

425707 AF115402

440080 AW051597 

422049 W25760 

430154 AW583058 



Hs.144959 
Hs.269607 
Hs.5635 
Hs.232165 



KIAA0010 gene product 
ESTs 

ESTs, Weakly similar to AL 



Hs.99508 
Hs.166361 
Hs.90797 



ESTs 

r ! , f a r ra ra 1; ce'l face 
NM_001450:Homo sapiens four and a half L 
ESTs, Weakly similar to unnamed protein 
KIAA0990 protein 
bHLH protein DEC2 

sterol O-acyltransferase (acyl-Coenzyme 
ESTs, Weakly similar to I38022 hypotheti 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434O0921 (f 
Homo sapiens mRNA; cDNA DKFZp564F112 (fr
Homo sapiens clone 23620 mRNA sequence 
E74-like factor 5 (ets domain transcript 



Hs.77531 glycine cleavage system pro



412125 Y17114 

441523 AW514263 

417958 AA767382 

452040 AW973242 

430468 NM_004673

407802 D84145 

420556 AA278300 

414138 AA135884 

438209 AL120659 

408267 AW380525 

429692 N48422 

408108 AI580492 

408418 AW963897 

430334 AI824719 

422083 NM_001141 



Hs.49349 
Hs.19479 
Hs.73393 
Hs.301771 



gb:zn43e07.s1 Stratagene HeLa cell s3 93



Hs.193417 ESTs



ESTs, Weakly similar to ALUF_HUMAN !!!



Hs.293690 ESTs, Weakly similar to I38022 hypotheti 



Hs.3772 ESTs



Hs.44743 
Hs.328700 
Hs.111256



aryl-hydrocarbon receptor nuclear transl 

tubulin-specific chaperone e 

ESTs, Weakly similar to B34087 hypotheti

hypothetical protein 

KIAA1435 protein 

ESTs 

ndtyp 



Unique Eos probeset identifier number
Genbank accession numbers 






423800 
426413 
429163 
430848 
431121 



434415 
434579 
439518 



115301_1 AW953990 AA078196 AW749482 M077468 BE151571 AA376917

1247077_1 AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW343632 AW848140 AW848571 AW848009 AW848067 AW848069 AV
AW848214

1253334_1 BE160198 AW935898 T11520 AW935930 AW856073 AW851034

163001_1 AW934714 BE161007 BE162500 AW749902 AW749864 BE162498 BE161005 M190449 AW513455 BE161006 BE162499

179788_1 T65754 AA229857 AA229658

179960_1 AA740616 AA554854 AA229923

185688_1 AA603305 M244095 M244183

185760_2 AI909154 AA526337 AA244193 AI909153

232161_1 AA331156 AA331157 AA331155

266650_1 AA377823 AW954494 AI022688

300543_1 AA884766 AW974271 AA592975 AA447312

324621_1 AW021726 AA487752 AA488085

328275_1 AW971157 M492575 AA492520

342819_1 AA527941 AI810608 AI620190 AA635266

350959_1 AI821085 AW973464 AA554802 AI821831 AA657438 AA640756 AA550339

385931_1 BE177494 AW276909 AA632849

38916_1 T55958 T57205 AF147346

47334_1 W76326 AF086341 W72300

561305_1 AI038316 AI344631 AI261653

745643_1 R76886 AI453674 R77049



TABLE 59C 



Unique number corresponding to an Eos probeset 
Sequence source. The 7 digit numbers in this column
human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Indicates DNA strand from which exons were predicted.



401197 
401424 
401519 



403137 
403149 
403362 

404571 
404641 
404642 



8176894 
6649315 
7139678 



9662953 
3540153 
9211494 
9799833 
8571772 
5006246 
7249169 
9796810 
9796810 
9454643 



176341-176452 
24223-24428 
157315-157950 
103510-104090 

118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866-131932,132451-132575,133580-134011

165776-165996,165189-166314,166403-166569,167112-167268,167387-167469,168634-168942
128316-128627 



59763-59909 



92349-92572,92958-93084,93579-93712,93949-94072,94591-91748,95214-95337 

25034-25185 

64099-64250 



114550-114688,117265-117407,119490-119599,123237-123395,131140-131217 
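The position entries above are comma-separated start-end ranges for predicted exons. A minimal sketch of how such a field could be parsed and sanity-checked is given below. The function names `parse_exon_ranges` and `total_exon_length` are hypothetical and are not part of the original disclosure, and the inclusive-coordinate convention used for the length calculation is an assumption. Tokens that do not parse, or whose end precedes their start, are set aside for manual review rather than corrected.

```python
def parse_exon_ranges(field):
    """Split a 'start-end,start-end,...' field into (start, end) tuples.

    Returns (ranges, suspect). Tokens that are not two plain integers,
    or whose end is smaller than their start, are collected in `suspect`
    so that likely transcription errors can be checked by hand.
    """
    ranges = []
    suspect = []
    for token in field.split(","):
        token = token.strip()
        if not token:
            continue
        start_text, sep, end_text = token.partition("-")
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            suspect.append(token)
            continue
        start, end = int(start_text), int(end_text)
        if end < start:
            suspect.append(token)
            continue
        ranges.append((start, end))
    return ranges, suspect


def total_exon_length(ranges):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in ranges)


if __name__ == "__main__":
    field = "92349-92572,92958-93084,93579-93712,93949-94072,94591-91748,95214-95337"
    ranges, suspect = parse_exon_ranges(field)
    print(len(ranges), "usable ranges; needs review:", suspect)
```

Flagging rather than repairing keeps the sketch from guessing at digits that the source does not confirm.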



Table 60A lists about 1166 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 pi
GeneChip array such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 3.0. The '
75th percentile amongst prostate cancers. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues.
' of non-specific hybridization, the 10th percentile '
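The selection rule described above can be sketched as follows. This is a minimal illustration only: it assumes per-probeset lists of expression values for prostate cancer samples and for non-malignant tissue samples, and it assumes linear interpolation between ranks for the percentile, which the text does not specify. The function names are hypothetical. The adjustment for non-specific hybridization mentioned in the truncated sentence is not modeled, because its details do not survive in the text.

```python
def percentile(values, pct):
    """Return the pct-th percentile using linear interpolation between ranks.

    The interpolation scheme is an assumption; the source does not say
    how percentiles were computed.
    """
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


def cancer_to_normal_ratio(cancer, normal):
    """Ratio of the 'average' cancer level to the 'average' normal level.

    'Average' cancer  = 75th percentile amongst prostate cancer samples.
    'Average' normal  = 85th percentile amongst non-malignant tissues.
    """
    cancer_level = percentile(cancer, 75)
    normal_level = percentile(normal, 85)
    if normal_level <= 0:
        # Guard against division by zero; this handling is an assumption.
        return float("inf") if cancer_level > 0 else 0.0
    return cancer_level / normal_level


def is_up_regulated(cancer, normal, threshold=3.0):
    """Apply the 'greater than or equal to 3.0' cut-off stated in the text."""
    return cancer_to_normal_ratio(cancer, normal) >= threshold


if __name__ == "__main__":
    cancer = [10, 20, 30, 40, 50]      # made-up illustrative values
    normal = list(range(1, 12))        # made-up illustrative values
    print(is_up_regulated(cancer, normal))
```

Using a high percentile for the normal tissues makes the cut-off conservative, since a probeset must exceed most normal samples by the stated factor before it is listed.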



TABLE 60A: ABOUT 1166 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 
Pkey: Unique Eos probeset identifier number 



UnigeneID Unigene Title



426747 
419526 
420154 



AW292425 
AF109298
AA508552 
X07730 
AI420227 



Hs.11 

Hs.195839 

Hs.171995 

Hs.149358

Hs.1852 

Hs.61635 

Hs.190394



Hs.163484 ESTs



prostate cancer associated protein 1 
ESTs, Weakly similar to I38022 hypotheti 
kallikrein 3, (prostate specific antigen 
ESTs, Weakly sin 



ESTs, Weakly similar to B28096 line-1 pr



409262 AK000631 

428819 AL135623
401424 

453370 AI470523 

400292 AA250737 

407202 N58172



407709 
407168 
433444 
409731 



418961 
452792 
403047 



408C21 
448999 
450642 



429290 
410467 
439444 
450382 
452340 
417332 
407021 
419553 



HS.222BB5 
Hs.123642
Hs.182575 
Hs.11325 
Hs.181350 
Hs.52256 
Hs.193914 

Hs.139336 
Hs.72472 
Hs.109370 



folate hydrolase (prostate-specific memb 
ESTs, Weakly similar to TRHY_HUMAN TRICH 
EphA3 

solute carrier family 15 (H+/peptide transport



R45175 
AW975324 
AA125985



AF071202 Hs.139336 
AW967646 Hs.23023 
AB037765 Hs.30652



433647 
432240 
430487 
440260 
429170 
429220 
416370 
418819 
421513 



NM_001394
AW207206 
N90470 
AA228776 



400297 
423073 
431474 



Hs.22791 
Hs.7130 
Hs.12532 
Hs.325198 
Hs.159425 
Hs.112259
Hs.306201 
Hs.123119 
Hs.190642 



AW014795 
M14156 
AI249368 
AI654133 



440529 
424846 
448519 



Hs.30212 
Hs.18081 
Hs.255416 
Hs.20843 
Hs.109276 
Hs.102754 
Hs.132458
Hs.97594
Hs.16478
Hs.1832



AL137517 



KIAA0575 gene product
NM_001172:Homo sapiens arginase, type II
ATP-binding cassette, sub-family C (CFTR



ATP-binding cassette, sub-family C (CFTR



Hs.203697 
Hs.191721 
Hs.105314 
Hs.119383
AA814043 Hs.88045
AI202703 Hs.152414
AW450461 Hs.203965
AI834273 Hs.9711 
AA641092 Hs.257339 
AI799909 Hs.158989 
R81733 Hs.33106 
AI970672 
AF179274
R39773 
AW292053 
AA650114 
AW023630 
AF151103 



dual specificity phosphatase 4 
ESTs 

ESTs, Weakly similar to I38022 hypotheti 



chromosome 11 open reading frame 8
transmembrane protein with EGF-like an
copine IV

1 open reading frame 21 



Hs.117183
Hs.13804
Hs.306201
NM_005923 Hs.151988
BE218886 Hs.282070 
AF203032 Hs.198760 
AF102546 Hs.63931 
AI277652 Hs.54578 
AA397658 Hs.60257 
NM_002202 Hs.505 
AW972717 Hs.288462 
U52077 



ESTs, Weakly similar to NUCL_HUMAN NUCLE



thyroid receptor interacting protein 15

Homo sapiens clone DT1P1B5 mRNA, CAG rep

hypothetical protein FLJ21986

Homo sapiens cDNA FLJ11245 fis, clone PL



Homo sapiens prostein mRNA, complete cds
ESTs, Moderately similar to SPCN_HUMAN S

gb:ns95a12.y5 NCI_CGAP_Pr3 Homo sapiens 

ESTs 

hypothetical protein dJ462023.2 
hypothetical protein DKFZp56401276 
mitogen-activated protein kinase kinase 
ESTs 

neurofilament, heavy polypeptide (200kD)
dachshund (Drosophila) homolog
ESTs, Weakly similar to I38022 hypotheti
Homo sapiens cDNA FLJ13598 fis, clone PL
ISL1 transcription factor, LIM/homeodoma
hypothetical protein FLJ21511
gb:Human mariner1 transposase gene, comp
Homo sapiens cDNA FLJ11983 fis, clone HE



400301 
419536 
445472 
415539 
419264 
424432 
439518 
434036 
415263 
431217 
444805 
437252 
444190 
418293 
432244 
451640 



415079 
409799 
433234 
400296 
422728 
425154 



416795 
422805 
418848 
408430
415788 
404641 
432437 



433043 
442082 
436671 
415786 
440774 
420120 
427138 
430722 
422487 
443646 
414565 
428728 
434926 
437162 
437866 
453006 
418723 
421040 
414212 



412715 
428829 
447156 
428330 
417412 
424592 
435981 
442081 
434988
407910 
420345 
412324 
420757 
408374 
435604 
453160 
424051 
416182 
452110 



AK000292 

X03635 

AA603305 

AB006631 

AI733881 

AA877104 

AB037821

W76326

AI659131

AA948033

NM_013427



Hs.12784 
Hs.72472 
Hs.293672 
Hs.146858 

Hs.197733
Hs.130853
Hs.250830
Hs.12017
Hs.164159
Hs.10526
Hs.16063
Hs.200574 
Hs.26771 
Hs.63931 



hypothetical protein FLJ20265 
estrogen receptor 1 

gb:np12d11.s1 NCI_CGAP_Pr3 Homo sapiens 
Homo sapiens mRNA for KIAA0293 gene, par
BMP-R1B

ESTs, Weakly similar to ALUB_HUMAN !!!!
protocadherin 10

gb:zd60d04.r1 Soares_fetal_heart_NbHH19W

hypothetical protein MGC2849 

ESTs 

Rho GTPase activating protein 6 
homolog of yeast ubiquitin-protein ligas 
ESTs, Weakly similar to ALU1_HUMAN ALU S 
cysteine and glycine-rich protein 2 
hypothetical protein FLJ21877
ESTs 

Human DNA sequence from clone 747H23 on 



5.17279 
Hs.22895 
Hs.76845 
Hs.65366 
Hs.139336
Hs.103262 
NM_001851 Hs.154850 



AA305627 



M412205 
M419466 
AI497778 



Hs.26289 
Hs.140996
Hs.36727
Hs.20509
Hs.121017
Hs.193465 
Hs.44926 
Hs.78851 



W57554 

R41823 

AW137159 

AW419196 

AI420611 

AL049610 

N77624 



KIAA1495 protein
ATP-binding cassette, sub-family C (CFTR
ESTs, Weakly similar to ZN91_HUMAN ZINC
collagen, type IX, alpha 1 
small nuclear protein PRAC 
hypothetical protein MGC5370 
■!(oddOz/ien-m,Dr. 



W07088 Hs.293685 ESTs
AI683487 Hs.15221 
AW377314 Hs.5364 



dipeptidylpeptidase IV (CD26, adenosine 
KIAA0217 protein
021965*:Homosi 



Hs.7413 

Hs.146151 

Hs.257924 

Hs.153934

Hs.95243 

Hs.173717 

Hs.203270 

Hs.198267 

Hs.164226 



Hs.5464 



wingless-type MMTV integration site fami 
DKFZP564I052 protein 
nucleosome assembly protein 1-like 2
Homo sapiens cDNA FLJ13458 fis, clone PL
Homo sapiens cDNA FLJ11041 fis, clone PL
lymphoid nuclear protein (LAF-4) mRNA 
ESTs 
ESTs 

hypothetical protein FLJ13782
ESTs

transcription elongation factor A (SII)-
phosphatidic acid phosphatase type 2B 
ESTs, Weakly similar to ALU1_HUMAN ALU S 



Hs.135280 ESTs



AA156781 

AI362575 Hs.303171 ESTs 
AA504428 
AA715026
AA136569

NM_006416 Hs.82921
W22335 Hs.7392
NM_000947 Hs.74519
R14050 Hs.194051
AW274731 Hs.157920
L22524 Hs.2256
X16896 Hs.82112
AA429834 Hs.151791
H74319 Hs.188620
AA401863
AI418055
AA650274
AW295230
AW978439
X78592 
AW025430 
AA625279 
AI263307 



FLJ1359C 
hypothetical protein 

thyroid hormone receptor coactivating pr 
metallothionein 1E (functional)



KIAA0187 gene product 
solute carrier family 35 (CMP-sialic aci 
hypothetical protein MGC3199
primase, polypeptide 2A (58kD)
Homo sapiens mRNA; cDNA DKFZp565B213 (fr



Hs.161160 ESTs 
.41296 fibronect 



Hs.25231
Hs.69504
Hs.99915
Hs.155591
Hs.26892
Hs.239884
Hs.138411
NM_004354 Hs.79069 
T47667 Hs.28005 



ne rich transmembrane p 



androgen receptor (dihydrotestosterone r



Homo sapiens mRNA; cDNA DKFZp586J1922 (f 

cyclin G2 

Homo sapiens cDNA FLJ11309 fis, clone PL



451099 
427003 
443180 



416140 
451230 
400294 



AW602186

AI918035 

BE546208 

N95796 

N23874 

AW408752 

AW863261 

AW821182 

BE242035 

AA424266 

T86534 

AF167160 

AA305127 

NM_003816 

AW976434 



418G64 
437872 
448148 
449G23 



prostaglandin E receptor 2 (subtype EP2) 
claudin 12 
uncoupling protein 4 
thrombospondin 1 
KIAA1523 protein 

ankyrin repeat, family A (RFXANK-like), 
CEGP1 protein 

roundabout (axon guidance receptor, Dros 
hypothetical protein FLJ20272
Homo sapiens prostein mRNA, complete cds
RAB27A, member RAS oncogene family
Homo sapiens clone 24416 mRNA sequence
hypothetical protein DKFZp434K1421 



457728 AW974811 



Hs.105448 



.14372 ESTs 



AA702972 
AI088489 

BE172704 Hs.222746 KIAA1610 

AL049176 Hs.82223 
BE160198 

AA147224 Hs.24919: 

AA156790 HS.26203E 

NM_000399 Hs.1395 

AA631143 Hs.27869; 

AK002015 Hs.5887

NM_016578 Hs.20509

NM_000579 Hs.54443
AI345227 



DEAD/H (Asp-Glu-Ala-Asp/His) box binding 
hypothetical protein HT023 
a disintegrin and metalloproteinase doma
hypothetical protein FLJ11220
calcium/calmodulin-dependent protein kin 



gb:QV1-HT0413-010200-059-h03 HT0413 Homo 
Homeo box A13

ESTs, Weakly similar to Z223_HUMAN ZINC 
early growth response 2 (Krox-20 (Drosop 
Homo sapiens prostein mRNA, complete cds 
RNA binding motif protein 7 
HBV pX associated protein-8 
chemokine (C-C motif) receptor 5 
ESTs, Weakly similar to B34087 hypotheti
gb:EST386916 MAGE resequences, MAGN Homo
■ ' (C-X3-C) ret 
irotein L26 hi 
ESTs, Weakly similar to envelope [Rsapi 



411479 
420954 
439492 
449919 






431770 
418575 
443745 



N40850 

ALQ44675 

AU077315 

AW848047 

AA282074 

AF086310 

AI574685 

C21322 

AI815470 

AI805867 

AI732230 

R49462 

AA884766 

AW298141 

T15767 



AB039670 
AA418000 
AW627932 
T57773 



Hs.28625 
Hs.173081
Hs.154970

Hs.237323 
Hs.103159 
Hs.200141 
Hs.288057 
Hs.260024 
Hs.126594 
Hs.191737 
Hs.106541 

Hs.157975 
Hs.22452 
Hs.268555 
Hs.222886 
Hs.9728 



KIAA0530 protein 
transcription factor CP2 

gb:IL3-CT0214-291299-052-A12 CT0214 Homo
N-acetylglucosamine-phosphate mutase



gb:am20a10.s1 Soares_NFL_T_GBC_S1 Homo s
ESTs

Homo sapiens mRNA for KIAA1737 protein,

ESTs, Weakly similar to TRHY_HUMAN TRICH
ALEX1 protein 

potassium intermediate/small coi 



BE385828 Hs.250619 phorbolin-like protein MDS019 

NMJB2742 Hs.2891 protein kinase C, mu 

AAB51697 Hs.120591 ESTs 

AW021254 Hs.135055 ESTs 

D54745 Hs.80247 cholecystokinin 

AI357412 Hs.157501 ESTs
AA441837 
AA527941 



448779 BE042877 



gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 



432682 AI376400 

441499 AW298235 

441676 BE564206 

421077 AK000051 

452784 BE463B57 

452055 AI377431 

431576 M76665 

418994 AA296520 

452277 AL049013 



Hs.247112 
Hs.71119 
Hs.159588
Hs.101689

Hs.101590
Hs.151258 
Hs.141693 



hypothetical protein 
hypothetical protein FLJ21062
hypothetical protein MGC10858
hydroxysteroid (11-beta) dehydrogenase 1
WO 03/042661 PCT/US02/36810







nuclear transcription factor Y, beta 






432169 Y00971 



AW970976 Hs.293653 

AA502663 Hs.145037

AA649530 Hs.348148 

AW963624 Hs.31707 

AA768553 Hs.193145 

AW020782 Hs.79881 
Hs.2910 



R27496 Hs.1378 
AI241733 Hs.43871 
AI940675 Hs.20914 



418866 
428364 
431467 
436679 
447188 
45^21! 

430412 
42325(1 
421433 
435572 
458571 
441054 
431725 
433409 
441102 
453387 
421928 



421896 
410870 
420729 
425066 
429467 
447816 
446553 
453308 
452576 
421437 
422295 
453942 
■10012-1 



448944 
413991 
444454 



Hs.176090



AB014605 
H44725 
BE018316 

AI797713 Hs.156471

AA102616 Hs.120769
AA534222

BE094848 Hs.15113

AA382267 Hs.10653

AL120071 Hs.48998 

AW450866 Hs.257359 

AI355260 Hs.279789 

AW137011 Hs.49576 

BE463721 Hs.97101 

AW972983 Hs.232165 

AW192535 Hs.19479

AW900992 Hs.93796 

BE313280 Hs.159627

AW445167 Hs.126036

AW058558 Hs.20943 

NM_004917 Hs.218366

AB033070 Hs.194408

NM_015156 Hs.78398

AI356967 Hs.43086

AK002126 Hs.11260

AW854793 Hs.87409 

D86960 Hs.3610 

AW008798 Hs.343877 

AI734080 Hs.104211 

AV660345 Hs.238126 

R42216 Hs.12342



gb:ns44f05.s1 NCI_CGAP_Alv1 Homosapiens 
ESTs, Weakly similar to YEW4_YEAST HYPOT 
metallothionein 1E (functional)
Homo sapiens cDNA: FLJ23006 fis, clone L



AK001465 Hs.31575 

AW009886 Hs.118258
T65754 

AA426565 Hs.311103 

N71831 Hs.256398 

AI127483 Hs.1 20^61 

H65423 Hs.17631 

AW972565 Hs.32399 

AW029608 Hs.17384

AW341754 Hs.189305 
BE061916 

AI829192 Hs.22380 

AW975339 Hs.239828 

AV653731 Hs.282829 

AA913591 Hs.126480

X65724 Hs.2839 

AI278802 Hs.25661 
M973905 

AI990741 Hs.252809 

AF013758 Hs.109643

AI823453 Hs.146625 
AA38Q153 

AF162666 Hs.18895

N62293 Hs.45107 

U81599 Hs.66731 

AW964897 Hs.290825 

M82882 Hs.154365

NM_004477 Hs.203772

NM_007233 Hs.274329

AB021179 Hs.15299 

AW959731 Hs.323099 

AB023177 Hs.29900 

AWB21252 Hs.104336

AFC51151 Hs.114408

AW190920 Hs.19928

AJ276315 Hs.287374 

AW372884 Hs.240770 
Hs.110637



ESTs 

centrosomal protein 1
tryptophan rich basic protein
SEC63, endoplasmic reticulum translocon
prostate cancer associated protein 1 
gb:yc11c07.s1 Stratagene lung (937210) H 
ESTs, Moderately similar to ALU1_HUMAN A
Homo sapiens mRNA; cDNA DKFZp434E0528 (f 
ESTs, Weakly similar to unnamed protein 
hypothetical protein DKFZp434E2135 
ESTs, Weakly similar to S51797 vasodilat
ESTs 
ESTs 

25849 chromosome 8 open reading frame 2 



gb:EST93093 Skin tumor I H 



E74-like factor 1 (ets domain transcript

FSHD region gene 1

TP53 target gene 1 

HMBA-inducible 

ESTs 

KIAA0960 protein 
hypothetical protein 

hypothetical protein SP329 

zinc finger protein 304 

nuclear cap binding protein subunit 2, 2

homeo box A10

atrophin-1 interacting protein 1; activi 



gb:zn43e07.s1 Stratagene HeLa cell s3 93
gb:nj21d02.s1 NCI_CGAP_AA1 Homo sapiens
homogentisate 1,2-dioxygenase (homogenti
ESTs 

fibronectin leucine rich 
ESTs 

histone deacetylase 3 



ESTs 

KIAA1244 protein 

KIAA0071 protein 

ESTs, Weakly similar to A46010 ) 

hypothetical protein FLJ11264 

thrombospondin 1 

KIAA0205 gene product

hypothetical protein FLJ20039 



420218 AW958037 



408331 
417407 
431325 



AW381569 Hs.40334 

BE242B03 Hs.262823 

AF086534 Hs.187561

AI659306 Hs.73826 

AA007534 Hs.125062 

AI953135 Hs.45140 

AF119908 Hs.235516

AI963808 Hs.86970 

H20899 Hs.13399

AA923278 Hs.290905

AW026751 Hs.5794 

AW972917 Hs.128749 



Hs.93304 
Hs.170065 
Hs.103104 
Hs.100293 
Hs.101616
Hs.161954



H1 histone family, member 0 
Hs.155287 KIAA0010 gene product 



hypothetical protein FLJ10326

ESTs, Moderately similar to ALU1_HUMAN A

protein tyrosine phosphatase, non-recept

ESTs

hypothetical protein FLJ14084
hypothetical protein PRO2955
ESTs, Moderately similar to ALU5_HUMAN A 
Homo sapiens clone 25032 mRNA sequence 
dual specificity phosphatase 12 
ESTs, Weakly similar to protease (H.sapi 
ESTs, Weakly similar to 2109260A B cell 
alpha-methylacyl-CoA racemase 
NM_004496*:Homo sapiens hepatocyte nucle
nuclear factor, interleukin 3 regulated 
phospholipase A2, group VII (platelet-ac 
Homo sapiens cDNA FLJ13492 fis, clone PL 
ESTs 

O-linked N-acetylglucosamine (GlcNAc) tr 



438321 
452744 
418818 






420210 
425905 
428647 



416030 
420948 
423749 



Hs.6153 

Hs.246107 

Hs.101307 

Hs.154978

Hs.225706 

Hs.278695 



412846 
418836 
401558 
441647 



W80363 

Z42047 
AA610150 
AI557257 
AB032959 
AA830050 
AA844518 
AI521936 
AL022395 
H15261 
AB016898 

U09848 Hs.132390

AA316181 Hs.61535 

W44356 Hs.292812 

AI681307 Hs.55098 

AI648602 Hs.55458 

AA281062 Hs.29493 

H04607 Hs.9218 

AW880614 Hs.146381 

NM_003729 Hs.27076

AF007835 Hs.32417 

AW449285 Hs.313636 

AI925195 Hs.130891 

NM_016179 Hs.262960 
AA453208 

AW961245 Hs.55896 



Homo sapiens mRNA; cDNA DKFZp434E0B2 (fr 
Homo sapiens HUT11 protein mRNA, partial
KIAA0261 protein 

ESTs, Weakly similar to I38022 hypotheti 
Homo sapiens prostein mRNA, complete cds 
ESTs, Weakly similar to MUC2_HUMAN MUCIN
ESTs 

Kruppel-type zinc finger protein 

Homo sapiens PRO2751 mRNA, complete cds

ESTs, Weakly similar to I38022 hypotheti



Hs.318584 novel C3HC4 type Zinc finger (ring finge

Hs.124344 ESTs
Hs.300876
Hs.107149

Hs.49526 f-box and leucine-rich

Hs.21948 ESTs

Hs.100469 myeloid/lymphoid or

Hs.132390 zinc finger protein 36 (KOX 18)

Hs.61535 six transmembrane epithelial antigen of

Hs.292812 ESTs, Weakly similar to T33468 hypotheti



AF01025B 
AI268407 
AI783767 
AI005169 
AW500239 
AL039402 
AI122843 



Hs.237146 
Hs.246773 
Hs.127428 
Hs.211458 
Hs.148635 
Hs.28274 



444951 
416774 
419465 
423242 
452627 
419005 



416602 NM_006159 Hs.79389



426011 
442138 
446015 



AW996096 Hs.58924 

AA445973 Hs.13303

T30968 Hs.13531 

AW962597 Hs.31305

AW022192 Hs.200197 

AIB61896 Hs.304505 



RNA binding motif protein, X chromosome 
RNA 3'-terminal phosphate cyclase
hypothetical protein MGC4309 
EST 

hypothetical protein MGC4400
transient receptor potential channel 4 
RAB9, member RAS oncogene family 
Homo sapiens PAC clone RP5-978E18 from 7 
ESTs 

ENSP00000220478*:SECRETOGRANIN III,
Homo sapiens cDNA: FLJ22096 fis, clone H
hypothetical protein FLJ12752



homeo box A9 
DC-specific tra 
ESTs, Moderately similar to ALUB_HUMAN !
Homo sapiens cDNA: FLJ22049 fis, clone H
nn | 'Ml" fis I L 
DEME-6 protein 

ESTs, Weakly similar to KIAA1006 protein 
ESTs, Weakly similar to I54374 gene NF2 
NM_005656*:Homo sapiens transmembrane pr
nel (chicken)-like 2 

gb:Homo sapiens F-box protein Fbl3b (FBL 
ESTs, Weakly similar to JC5594 jerky gen
Homo sapiens cDNA: FLJ21784 fis, clone H 



KIAA1547prc 






430226 BE245562 

431673 AW971302 

446183 AA354991 

452355 N54926 

432093 H28383 

448402 BE244226 

409019 AW3B5412 

413812 AW1BB687 

414343 AL036166 
M 006769 



407355 

412125 
437330 
446720 

415083 
423412 
433507 
428966 
423782 
456362 
415451 
419745 



AB037832 

Y17114 

AL353944 

AI439136 

AI879317 



AF059214 

AI472209 

AW973003 

H19415 

AF042001 



429784 
101519 
417576 
427484 
117061 
418858 
433517 
129559 



409190 
444922 
416547 



Homo sapiens mRNA; cDNA DKFZp761FC19 (fr 



AA339449 Hs.82285 



AI985345 

AW608920 

AU076536 

AI921/50 

H62914 

BE245290 

AW959074 



AI167877
AI381900 
AL049280 



gb:yl52c03.r1 Soares breast 3NbHBst Homo
RAB18, member RAS oncogene family
myosin regulatory light chain 2, smooth 
ESTs 

coated vesicle membrane protein 
LIM domain only 4 

ESTs, Weakly similar to ALU1_HUMAN ALU S 



Hs.50115 
Hs.140546
Hs.334691 
Hs.27179 

Hs.191791 
Hs.194687 
Hs.323117
Hs.179909 
Hs.268720 
Hs.93005 
Hs.53126 
Hs.294030 



hypothetical protein FLJ22995 

ESTs, Moderately similar to ALU1_HUMAN A

slug (chicken homolog), zinc finger prot



HS.2Q5442 ESTs, Weakly sin 
Hs.46988 ESTs 



to T34036 hy| 



eukaryotic translation initiation factor
polycystic kidney disease 2 (autosomal d
Homo sapiens mRNA; cDNA DKFZp564D1164 (f
gb:xc28c12.x1 NCI_CGAP_Co18 Homo sapiens
membrane-spanning 4-domains, subfamily A
31 5000476-:gi;i2737279;ref}XP_012165 1|
phosphoribosylglycinamide formyltransfer
nuclear receptor subfamily 1, group D, m
Homo sapiens cDNA FLJ12033 fis, clone HE
hypothetical protein RC 



Hs.189838 ESTs



zinc finger protein 75 (D8C5) 



Hs.239218 
Hs.23648 
Hs.291712 
Hs.162849 ESTs 
Hs.143716 ESTs 
Hs.159212 ESTs 



IA;cDNADKFZp564K143(fr 
ESTs 

nuclear receptor subfamily 4, group A, m 



417728 
420407 
425810 



AW369351 Hs.287955 

R18374 Hs.117956 

AF035537 Hs.115521

AW138437 Hs.24790 

AA814732 Hs.145010 

AI923627 Hs.31903 

NM_013257 Hs.279696 
AL161983 



417845 AL117461



408527 
419852 
424534 
424850 
435023 
442914 



450330 
439764 



W84446 
AL135018 
AW503756 



Hs.61152 
Hs.82719 
Hs.61272 
Hs.211933 
Hs.226434 
Hs.33074 
Hs.286184 
Hs.150275 
Hs.153498 



Homo sapiens mRNA; cDNA DKFZp586F1622 (f
ESTs 

collagen, type XIII, alpha 1 

hypothetical protein MGC4643 

Homo sapiens, clone IMAGE:3606519, mRNA, 

hypothetical protein dJ551D2.5 

KIAA0241 protein 

chromosome 18 open reading frame 1 
gb:wd73f12.x1 NCI_CGAP_Lu24 Homo sapiens
hypothetical protein FLJ14007 



Hs.189023 
Hs.24817 
Hs.157113 
Hs.22744 



AW444551 Hs.35380 x 001 protein 



ESTs, Weakly similar to T23762 hy 
ESTs 

hypothetical protein FLJ20136
coenzyme Q, 7 (rat, yeast) homolog
hypothetical protein MGC13105 







407198 
447752 
4Q2855 
443161 
415827 
4Q8161 



Hs.66095 
Hs.140237 
Hs.74313 

Hs.! 



Hs.190044
Hs.291029 
Hs.142517 
Hs.134460



AA310393 
AW969657 
AL039482 
H73972 

N77630 Hs.13895 
NM_016010 Hs.118821
D29954 Hs.13421 
AL049985 Hs.19180 
H91679 

M73700 Hs.105938 



H17462 

AW952912 

AW014345 

N92540 

R44477 

AI335191 

AW117261 

D80007 

AW505076 

AL157474

AW271273 

AF279145 

R94088 

AA329796 



430253 
425211 
443273 
428055 
425707 

430519 

431117 



Hs.29894 
Hs.49210 
Hs.23294 



811845 
AK001514 
M 18667 
AI042063 
M420564 
AF115402
AI457338
AF129534
R53790 
AF003522 
AW449650 
AW503398 
BE158773



AI732083 
AW043782 
M731491 
H51299 
NM_007020 Hs.93502



Hs.187619 
Hs.293616 
Hs.334477 



sorbitol dehydrogenase 
KIAA0135 protein

SEC22, vesicle trafficking protein (S. c
Homo sapiens cDNA: FLJ21721 fis, clone C
hypothetical protein FLJ20424
ESTs, Weakly similar to KIAA1205 protein
ESTs

ESTs, Weakly similar to ALU1_HUMAN ALU S



Homo sapiens mRNA; cDNA DKFZp434P0810 (f 
ESTs 

Homo sapiens cDNAFLJH 654 fis, clone HE 
CGI-62 protein 
KIAA0056 protein 

Homo sapiens mRNA; cDNA DKFZp564E122 (fr
gb:yv04a07.s1 Soares fetal liver spleen



Hs.300383 hypothetical protein MGC3032



de triphosphate diphosphohyd 
hypothetical protein FLJ14621 
ESTs, Weakly similar to 2109260A B cell
ESTs 

KIAA0185 protein 

DiGeorge syndrome critical region gene 8
likely ortholog of mouse Arkadia
hypothetical protein FLJ12666
hypothetical protein FLJ21776 
ESTs 

DKFZp434J1813 protein 



Hs.293663 ESTs, Moderately similar to I38022 hypot 
Hs.213207 ESTs 



Hs.79432 



MGC14879 
gb:yp07c06.s1 Soares breast 3NbHBst Homo 
U1-snRNP binding protein homolog (70kD)
claudin 3 



Hs.91662 
Hs.213659 
Hs.98732 



ESTs 

KIAA0888 protein 

ESTs, Weakly similar to KIAA1357 protein 
Homo sapiens Chromosome 16 BAC clone CI" 

IK M':gi|112 I % ||E 2AM 
ubiquitin-like 1 (sentrin)



Hs.199429 Homosapiensml 






BE410107 Hs. 

NMJXJ1429 Hs. 

NMJJ12421 Hs. 

H27332 Hs. 

AI767751 Hs. 



SRY (sex determining region Y)-box 9 (ca 
N-acetylgalactosaminidase, alpha- 
CGI-82 protein 
E1 A binding protein p300 
rearranged L-myc fusion sequence 
MGC5338 




AA377823 
AI801777 
AA769045 



408242 
443484 
422083 
428043 
407192 
421590 
453020 
437372 
432278 



427635 
430968 
452234 
453698 
408875 
431197 
434072 
438138 
447906 
415992 
435655 
430507 
439398 
448515 
445242 
41B576 
436024 
407783 
436207 
442861 



AAQ32276 
AA251594 
AI091458 
NMJ0114' 



Hs.90424 
Hs.97635 
Hs.107872 

Hs.274415 

Hs.2391 

HS.2QQ113 

Hs.260024

Hs.280699 
Hs.42294 
Hs.99010 
Hs.43913 



Hs.31422 
Hs.283631 
Hs.274256 



Hs.223296 
Hs.42746 
Hs.48604 
Hs.250745 



Hs.177502 ESTs
Hs.19999 
Hs.145807 



protein phosphatase 3 (formerly 2B), cat 
ESTs 

KIAA0853 protein 

hypothetical protein FLJ23142

Homo sapiens cDNA: FLJ23285 fis, clone H

hypothetical protein FLJ20761



Homo sapiens cDNA FLJ10229 fis, clone HE
apical protein, Xenopus laevis-like
Homo sapiens cDNA FLJ11379 fis, clone HE
gb:EST90805 Synovial sarcoma Homo sapien
ESTs

gb:oa80h07.s1 NCI_CGAP_GCB1 Homo sapiens



ESTs, Moderately similar to T14342 NSD1 

PIBF1 gene product 

ESTs 

arachidonate 15-lipoxygenase, second typ
uteroglobin 

gb:af12e02.s1 Soares_testis_NHT Homo sap

jerky (mouse) homolog-like 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 

hypothetical protein DKFZp547G163 

hypothetical protein FLJ23563 

RAD50 (S. cerevisiae) homolog 

ESTs 

tumor protein p53-binding protein 
gb:EST384925 MAGE resequences, MAGL Homo 
ESTs, Weakly similar to I38022 hypotheti
ESTs 

DKFZP434B168 protein 
polymerase (RNA) III (DNA directed) (62k 
Homo sapiens PRO1082 mRNA, complete cds



Hs.247324 
Hs.221504 
Hs.13528
Hs.21108 
Hs.289104 
Hs.190555 



Hs.57787 
Hs.170577
Hs.204715 



hypothetical protein FLJ13593
HSPC069 protein 

mitochondrial ribosomal protein S14 



, 1FU14054 
ESTs, Weakly similar to ALU1_HUMAN ALU S
)'2,CaT-likeA 



AI335773 
AK002055 
AI379451 
AA365649 



452129 
453927 
432336 



Hs.44198 
Hs.270123 
Hs.151046 
Hs.153636
Hs.259478 
Hs.9222 



hypothetical protein FLJ11193
far upstream element (FUSE) binding prot 
ESTs, Weakly similar to PC4259 ferritin 
estrogen receptor binding site associate 







434470 AA634818 Hs.298138 ESTs 

452032 BE244005 Hs.27610 retinoic acid- and interferon-inducible

420026 AI831190 Hs.166676 ESTs 

421838 AW881089 Hs.108806 Homo sapiens mRNA; cDNA DKFZp566M0947 (f 

449916 T60525 Hs.299221 pyruvate dehydrogenase kinase, isoenzyme 

452270 AW975014 Hs.26 ferrochelatase (protoporphyria)

445941 AI267371 Hs.172636 ESTs 

416882 A1633Q44 tryptophanyl tRNA synthetase 2 (mitochon

412533 AA679863 Hs.69606 ESTs 

431740 N75450 Hs.183412 ESTs, Moderately similar to AF1 1 5721 67 

442710 AI015631 Hs.23210 ESTs 

448212 AI475858 

413431 AW246428 Hs.75355 

453038 AW292415 Hs.20509 

427661 AA410292 Hs.104761

420923 AF097021 Hs.273321 

414344 AA296874 Hs.77494 

441856 BE464341 Hs.21201 

406315 AA833930 Hs.288036 tRNA isopentenylpyrophosphate transferas

423482 BE280172 Hs.129228 galactokinase 2

424677 U09414 Hs.151689 zinc finger protein 137 (clone pHZ-30)

430160 AW968210 Hs.293957 ESTs, Weakly similar to ALUC_HUMAN !!!!

433672 BE281165 Hs.288038 TLS-associat gimne piol I

433887 AW204232 Hs.279522 ESTs

452670 AF068227 Hs.30213 ceroid-lipofuscinosis, neuronal 5

411890 H92738 Hs.75811 N-acylsphingosine amidohydrolase (acid c 

432728 NMJW6979 Hs 278^21 HLA class II region expressed gene KE4

431724 AA514535 Hs.283704 ESTs 

435703 AW630133 Hs.83313 GK003 protein 

445674 BE410347 Hs.13063 transcription factor CA150 

414178 AW957372 Hs.46791 ESTs, Weakly similar to I38022 hypotheti

416111 AA033813 Hs.79018 chromatin assembly factor 1, subunit A (

432840 AK001403 Hs.279521 hypothetical protein FLJ20530

439726 AW449893 Hs.293707 ESTs, Weakly similar to I38598 zinc fing

445704 AI493742 Hs.167700 ESTs, Moderately similar to I38022 hypot

449115 AW959952 Hs.37528 ESTs,Weakl> milar I * F090 q 44 1 PRO06 

439379 AA835002 Hs.125611 ESTs

414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein)

434579 T55958 gb:yb35f05.r1 Stratagene fetal spleen (9

451367 AA923729 Hs.26322 cell cycle related kinase

420285 AA258124 Hs.293878 ESTs, Moderately similar to ZN91_HUMAN Z 

431263 AW129203 Hs.322915 ESTs 

431952 Z70695 Hs.272240 Homo sapiens cDNA FLJ11086 fis, clone PL

439092 AA830149 gb:oc44f08.s1 NCI_CGAP_GCB1 Homo sapiens

411562 AL050201 Hs.70769 hypothetical protein DKFZp586E1923

417771 AA804698 Hs.82547 retinoic acid receptor responder (tazaro

419440 AB020689 Hs.90419 KIAA0882 protein 



AL157545

419241 AA523939 

410762 AF226053 Hs.66170 HSKM-B protein 

413384 NM_000401 Hs.75334 exostoses (multiple) 2

424081 NM_006413 Hs.139120 ribonuclease P (30kD) 

428004 AA449563 Hs.161393 glutamate-cysteine ligase, catalytic sub

420390 AA330047 Hs.191187 ESTs 

416662 T25853 Hs.7538 ESTs 

433280 AA581404 Hs.289037 Homo sapiens cDNA FLJ14135 fis, clone M/

433285 AW975944 Hs.237396 ESTs 

421991 NM_014918 Hs.110488 KIAA0990 protein

413950 AA249095 Hs.32793 ESTs 

431318 AA502700 Hs.293147 

400289 X07820 Hs.2258 

411149 N68715 Hs.269128 

417601 NM_014735 Hs.82292 KIAA0215 gene product 

418334 AA319233 Hs.5521 ESTs 

422583 AA410506 Hs.27973 KIAA0874 protein 

425717 X07282 Hs.171495 retinoic acid receptor, beta 

425856 AA364908 Hs.98927 hypothetical protein FLJ13993

434569 AI311295 Hs.344478 KIAA0196 gene product

419436 AA991639 Hs.242413 hypothetical protein DKFZp434K1421

437296 AA350994 Hs.20281 KIAA1700 

432697 AW975050 Hs.293892 ESTs, Weakly similar to ALU4_HUMAN AL

439605 AF086431 Hs.22380 ESTs

442145 AI022650 Hs.8117 erbb2-interacting protein ERBIN

432589 AL135725 Hs.131708 ESTs 

425910 AA830797 Hs.184760 

412095 AI624707 Hs.5921 

421129 BE439899 Hs.89271 

429433 AA452899 Hs.213586 ESTs, Weakly similar to KIAA1353 protein 

434839 AI743069 Hs.134736 ESTs

435166 AI391470 Hs.153618 ESTs

448966 AW372914 Hs.86149 phosphoinositol 3-phosphate-binding prot 



Hs.78531 Homo sapiens, Similar to RIKEN cDNA 5730
hypothetical protein FLJ11457 
Hs.123648 ESTs, Weakly similar to AF108460_1 ubimi






445817 
417251 



M004681 
NM.003642 
M013051 
AW015242 



ESTs 



437187 
423645 
432370 
434966 
404571 
418727 
436374 
452721 
427472 
418245 



Hs.129781 
Hs.115325 
Hs.75212 
Hs.234896 
Hs.306325 

Hs.274424



AA400709 Hs.96716 

AJ269529 Hs.301871 

AA522539 Hs.131250 

AA088767 Hs.83883 



topoisomerase (DNA) II binding protein 
ESTs, Weakly similar to YK54_YEAST HYPOT
chromosome 21 open reading frame 5 
RAB7, member RAS oncogene family-like 1 
ornithine decarboxylase 1 
geminin

Homo sapiens mRNA; cDNA DKFZp586C1523 (f
ESTs 



ESTs 

ESTs, Weakly similar to T17210 hypotheti
solute carrier family 37 (glycerol-3-pho
transposon-derived Buster3 transposase-l
transmembrane, prostate androgen induced 
ENSP00000251056*:Plasma membrane calciu 
gb:ye53h05.s1 Soares fetal liver spleen 



437367 
407908 
430144 
416292 



408832 
422564 
424842 
429984 
435256 

440348 
442910 
443242 
445469 
449517 



440370 
438825 
446874 
408101 
419131 
423178 



420041 
426174 
427083 
441021 
436664 
441124 
444169 
421476 
422165 

450747 
401197 
433404 
422546 
443884 
414341 



AI148006
AA034127 
AL050102 
AF193766
AI056769 
AW015802 
AI365130 
BE243910 
AW298370 
AW500106 
U62027 
NMJ14158 
AA320829 
W93170 
AW971183 
AW952412 



Hs.13872 



Hs.197698 ESTs 

Hs.155956 N-acetyltransferase 1 (arylamine N-acety



M731533 Hs.270751 

M749316 Hs.271879 

BE379758 Hs.110853 

AI732722 Hs.187694 



Hs.344249 
Hs.195614 
Hs.279704 
Hs.343564 
HS.6342B 
Hs.222120 
Hs.153487



AI064821 



s, Moderately similar to ALU1_HUMAN A 



splicing loci r 3b, ;u i it 3 13 kD 



signal transducing adaptor molecule (SH3 
hypothetical protein FLJ21617 
cytokine-like protein C17



ESTs, Weakly similar to T19326 hypotheti
nucleoporin p54

complement-c1q tumor necrosis factor-rel
serine/threonine protein kinase MASK



Hs.284164 
Hs.9683 
Hs.65874 

AA884000 Hs.8173 

BE327427 Hs.79953 

AW968304 Hs.56156 

AW968504 Hs.123073 

AA406293 Hs.41167 

AI033140 Hs.124983 

AW753676 Hs.39982 

H29142 Hs.143032 

AA102670 Hs.70725 
AA418841 

AA907153 Hs.190060 

AI807320 Hs.227630 

AA099802 Hs.83883 

AB005142 Hs.94592 

AA547959 Hs.115838 

NM_006363 Hs.173497

AW578716 Hs.7644 

AW197887 Hs.253353 

T97717 Hs.119563

AV648170 Hs.58756 

AW953805 Hs.21887 

AL041199 Hs.1481



DnaJ (Hsp40) homolog, subfamily C, membe

EST Vea lo A4034B El ex 

i FU10803 



CDC2-related protein kinase 7 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564C142 (fr 
ESTs 

ESTs, Weakly similar to neuronal thread 
gamma-aminobutyric acid (GABA) A recepto
gb:zw01e11.s1 Soares_NhHMPu_S1 Homo sapi



Hs.318535 



T32982 Hs.102720

AB007969 Hs.301478 

N20617 Hs.194397 

D80004 Hs.75909 

NM_005824 Hs.155545



KIAA0500 protein 
leptin receptor 
KIAA0182 protein

37 kDa leucine-rich repeat (LRR) protein 
M1 47026


Hs.76704 


ESTs 


3.3 


420324 


AF163474


Hs.96744 


prostate androgen-regulated transcript 1 
CGI-09 protein 


3.3 


423453 


AW450737 


Hs.128791




444170 


AW613879 


Hs.102408


ESTs 


3.3 


445474 


AI240014 


Hs.259558 


ESTs 


3.3 


450582 


AI339732 




G-rich RNA sequence binding factor 1 




451246 




Hs.39140 


cutaneous T-cell lymphoma tumor antigen 




434194 


AF119847


Hs.283940 


Horn r ii r> T r j mRNA, parli cd 




444324 


AI301330 


Hs.143838


ESTs 






NM_014788


Hs.179703


KIAA0129 gene product 


3.3


429258 






gb:zx10e09.r1 Soares_total_fetus_Nb2HF8_


3.3 


400880 






NM_000611*:Homo sapiens CD59 antigen p18




445525 


BE149866


Hs.14831


Homo sapiens, Similar to zinc finger pro 


3.3


451968 


H56196 


Hs.26409 


Homo sapiens mRNA; cDNA DKFZp547K204 (fr 


3.3 


411190 


AA306342 




protein kinase C-like 2 




410660 


AI061118 


Hs.65328 






420493 


AI635113 


Hs.270366 


ESTs, Weakly similar to 178885 serine/th 




423740 




Hs.293007 


aminopeptidase puromycin sensitive 




428523 


AW974540 


Hs.98626 






428743 


AL080060 


Hs.301549 


Homo sapiens mRNA; cDNA DKFZp564H172 (fr


3.2


432363 


AA534489 




gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens 




435102 


AW899053 


Hs.76917 


F-box only protein 8 




438147 


AW250553 




H-2K binding factor-2 




445808 


AV655234 




ESTs, Moderately similar to PC4259 ferri




453058 


AW612293 


Hs.288684 


Homo sapiens cDNA FLJ11750 fis, clone HE




453286 


AA034319 


Hs.29041 


Homo sapiens cDNA FLJ14177 fis, clone NT




453802 


AL134757




gb:DKFZp547G1090 r1 547 (synonym: hlbrl) 




422017 


NM_003877 


Hs.110776


STAT induced STAT inhibitor-2 


3.3 


411373 


BE326276 


Hs.8861 


ESTs 


3.3 


435114 


AA775483 


Hs.288936 


mitochondrial ribosomal protein L9 


3.3


443674 


AI081330 


Hs.145008


ESTs 


3.3 


427528 


AU077143 


Hs.179565


minichromosome maintenance deficient (S. 




433800 


AI034361 


Hs.135150


lung type-l cell membrane-associated gly 






AI636310 


Hs.28310 


ESTs 




421654 


AW163267 


Hs.106469


suppressor of varl (S.cerevisiae) 3-like 




409542 


AA503020 


Hs.36563 


hypothetical protein FLJ22418 




410119 


F07841 


Hs.13926


ESTs 


3.3 


417379 


AA196390




gb:zp99b10.s1 Stratagene muscle 937209 H 




423201 


NM_000163


Hs.125180


growth hormone receptor 




442787 


W93048 


Hs.250723 


hypothetical protein MGC2747 




407930 




Hs.188361


Homo sapiens cDNA FLJ12807 fis, clone NT 




423427 










424903 


T26477 




ESTs, Weakly similar to I38022 hypotheti




426775 


AA384564 




ESTs 




432378 


AI493046 


Hs.146133 


ESTs 




438875 


AA827640 


Hs.189059


ESTs 


3.3 




AI933416 


Hs.189674 


ESTs 




453124 




Hs.125790 


leucine-rich repeat-containing 2




417560 


U73338 


Hs.82283 






433002 


AF048730 


Hs.279906 






430929 


AA489166 


Hs.156933 




3.2 




H46008 


Hs.31518 


ESTs 




446297 


AI346930 


Hs.149728 


ESTs 




407870 


AB032990 


Hs.40719 


hypothetical protein KIAA1164 




429377 


M813192 


Hs.200596 


KIAA0547 gene product 




426030 


BE243933 


Hs.108642


zinc finger protein 22 (KOX 15) 




420583 




Hs.65450 




425264 


AA353953 


Hs.20369 


ESTs, Weakly similar to gonadotropin ind




442562 


BE379584 




dolichyl-diphosphooligosaccharide-protei




418871 


NM_001608 


Hs.1209 


acyl-Coenzyme A dehydrogenase, long chai 




421029 


AW057782 


Hs.293053 


ESTs 




452937 




Hs.44656 


ESTs 




435035 


BE568487 


Hs.47668 


x 006 protein 




408412 


AW193033


Hs.124436 


ESTs 


421684 


BE281591 


Hs.106768 


hypothetical protein FLJ10511 




421887 




Hs.109201 


CGI-86 protein 




432409 


M806538 


Hs.130732 






434378 


M631739 


Hs.335440 


EST 




409401 


AI201895 


Hs.181309 


proteasome (prosome, macropain) subunit, 




420195 


N44348 


Hs.26243 


Homo sapiens cDNA FLJ11177 fis, clone PL




410264 


AK001853 


Hs.61508 


Homo sapiens cDNA FLJ10991 fis, clone PL


3.2 




AW630675 


Hs.271946 


ESTs 


3.2 


413838 


AV661185 


Hs.75574 


mitochondrial ribosomal protein L19


3.2 


414342 


AA742181 


Hs.75912 


KIAA0257 protein 


3.2 




AA704703 


Hs.77031 


Sp2 transcription factor 


3.2 


425475 


W56339 


Hs.107057 


ESTs 


3.2 


437814 


AI088192 


Hs.135474


ESTs, Weakly similar to DDX9_HUMAN ATP-D




449897 


AW819642 


Hs.24135 


transmembrane protein vezatin; hypotheti


3.2




BE379936 


Hs.28866 


programmed cell death 10 


3.2 


437517 


AI927675 


Hs.244482 


Homo sapiens, clone IMAGE:3511719, mRNA,


3.2 


432960 


AW150945


Hs.144758 


ESTs 


3.2 


452498 


AK000101 


Hs.29700 


hypothetical protein FLJ20094 


3.2 


427299 


AA830210 


Hs.214263 


ESTs, Moderately similar to ALU1_HUMAN A


3.2 
427715 BE245274

408699 AA056614 

415715 F30364 

426172 AA371307 

434825 N87549 

436246 AW450963 

436401 AI087958 

408784 AW971350 

432125 AW972667 

426716 NM_006379 

434747 AA337085 

414591 AI888490 

426991 AK001536 

426171 AI12860B 
402802 

412530 AA766268 

433050 AI093930 

439221 AA737106 

440351 AF030933 



ESTs, Weakly similar to A47582 B-cell gr 
ESTs, Moderately similar to ZN91_HUMAN Z



zinc finger protein ZNF140-like pro



Homo sapiens cDNA FLJ12300 fis, clone MA



NM_001397:Homo sapiens endothelin conver

hypothetical protein FLJ13346

Homo sapiens cDNA: FLJ21000 fis, clone C

ESTs, Moderately similar to 178885 serin 

RAD1 (S. pombe) homolog 

Homo sapiens clone 24407 mRNA sequence

ESTs 



438666 
419875 
456660 
418701 
425071 
440300 
444100 
431416 



T64904 

AW014493 
AA853410 



AA383343 
AA53271B 
DE274778 



s.1 19007 RAB4, member Rf 



430039 
430188 
446146 



hypothetical protein FLJ12577
solute carrier family 2 (facilitated glu 
KIAA1048 protein 



ESTs, Weakly similar to T24396 hypotheti 



441791 
457718 
452017 



AA143654 

AA402414 

AA452244 

AI821399 

AW007080 

AK002135 

H80696 

AA767537 

AW976218 

AW372449 

F18572 

AF109302 

T79957 



AI806533 
AL080058 
AA424793 



hypothetical protein FLJ13149 
gb:zo65a02.r1 Stratagene pancreas (93720 
coatomer protein complex, subunit beta



hypothetical protein FLJ11273



hypothetical protein FLJ21159

ESTs, Weakly similar to ALU4_HUMAN AL

prostate cancer associated protein 7 

ESTs 

KIAA1627 protein 

ESTs, Weakly similar to unnamed protein 
ATPase, Ca transporting, type 2C, member



434614 AI249502 



Hs.24144 ESTs 










Hs.28891 


hypothetical protein FLJ11360; artemis p


3.1 




AI689496


Hs.108932


ESTs 


3.1 


434361 


AF129755 


Hs.88474 


ESTs 


3.1 








NM_002425:Homo sapiens matrix metallopro


3.1 


AI879252 


Hs.5151 


RAN binding protein 7 


3.1 


449474 




Hs.2055 


ubiquitin-activating enzyme E1 (A1S9T an 


3.1 




AW629223 


Hs.64794 


zinc finger protein 183 (RING finger, C3
ESTs 


3.1 


N59650 


Hs.27252 


3.1 


418647 


AA226198 




gb:nc26a07.s1 NCI_CGAP_Fr1 Homo sapiens 


3.1 


434808 


AF155108 


Hs.256150 


Homo sapiens, Similar to RIKEN cDNA 2810




453716 


AA037675 


Hs.152675 


ESTs 


3.1 




AA813887 


Hs.188173


Homo sapiens cDNA FLJ12187 fis, clone MA


3.0 




AA312135 


Hs.46967 


HSPC034 protein 
interleukin 7 receptor 


3.0 


430280 


AA361258 


Hs.237868 




445919 


T53519 


Hs.334692 


hypothetical protein MGC14141 


3.0 


441790 


AW294909 


Hs.132208


ESTs 


3.0 






Hs.75746 


aldehyde dehydrogenase 1 family, member 


3.0 


AW298597 


Hs.61884 


Homo sapiens, clone IMAGE:4298026, mRNA,


3.0 


422788 


AL117352


Hs.77196 


Human DNA sequence from clone RP5-876B10 


3.0 


439703 


AF086538 


Hs.196245


ESTs 


3.0 


444489 


AI151010 


Hs.157774


ESTs 


3.0 


453878 




Hs.19025


DC32 


3.0 




AW902312 


Hs.7037 


Homo sapiens clone 24923 mRNA sequence 
ESTs 


3.0 


433680 


A1805366 


Hs.199945


3.0 


426363 


M58524 


Hs.2025 


transforming growth factor, beta 3 


3.0 


447595 


AW379130 


Hs.18953 


phosphodiesterase 9A 


3.0 


426044 


AA502490 


Hs.170290 


ESTs 


3.0 


431962 


AL049385 


Hs.272251 


Homo sapiens mRNA; cDNA DKFZp586M1418 (f 


3.0 






Hs.250848 


hypothetical protein FLJ14761


3.0 




AK001680 


Hs.30488 


DKFZP434F091 protein 


3.0 


458229 


AI929602 


Hs.177 


phosphatidylinositol glycan, class H 


3.0 


435712 


AA694607 


Hs.176956 


ESTs 


3.0 


414993 


AA299264 


Hs.57776 


ESTs, Moderately similar to I38022 hypot


3.0 


AW819403 


Hs.77724 


KIAA0586 gene product 


3.0 


421988 


AW291128 
AW450481 


Hs.278422 
Hs.161333


DKFZP586G1122 protein
ESTs 


3.0 
3.0 




N51075 


Hs.110028


ESTs 


3.0 
3.0 
3.0 


447620 


H09323 
AW290951 


Hs.27133 
Hs.224965 


ESTs 
ESTs 


449375 


R07114 


Hs.271224 


ESTs 






AI355647 


Hs.189999


purinergic receptor (family A group 5)




443967 


AW294013 


Hs.200942 


ESTs 


3.0 


441224 


AU076964 


Hs.7753 




3.0 
3.0 


448822 


BE149845 


Hs.289038 


hypothetical protein MGC4126 


435688 


H72286 


Hs.128387


ESTs 


3.0 


441889 


AI090455 


Hs.268371 


hypothetical protein FLJ20274


3.0 




AI499220 


Hs.71573 


hypothetical protein FLJ10074


3.0 


415954 


AA171850 


Hs.42251 


ESTs 


3.0 


419G44 


AI909154 




gb:QV-BT200-010499-007 BT200 Homo sapien


3.0 


420077 


AW512260 


Hs.87767 


ESTs 


3.0 


443475 


AI066470 


Hs.132809


ESTs 


3.0 


447231 


AK001293 


Hs.330208 


crystallin, zeta (quinone reductase)* 


3.0 


438436 


AA807168 


Hs.271552 


ESTs 


3.0 




AW992405 
AA299688 


Hs.59622 


Homo sapiens, clone IMAGE:3507281, mRNA, 
ESTs 


3.0 
3.0 


448264 


AI478933 


Hs.188260


ESTs 


3.0 


428673 


AW601325 


Hs.337757 


Homo sapiens mRNA; cDNA DKFZp566M063 (fr 


3.0 


453843 


D25215 


Hs.35804 


hect domain and RLD 3


3.0 


400303 


AA242758 


Hs.79136 


LIV-1 protein, estrogen regulated 


3.0 


445943 
432286 


AW898533 
AW327432 


Hs.181574 
Hs.255843 


ESTs 
ESTs 


3.0 




R21326 


Hs.267905 


hypothetical protein FLJ10422


3.0 


432675 


AI791855 


Hs.105884


ESTs 






T49951 


Hs.9029 


DKFZP434G032 protein


3.0 


448073 






Homo sapiens, clone IMAGE:4179482, mRNA




419713 


AW968058 


Hs.92381


nudix (nucleoside diphosphate linked moi 


3.0


447966 


AA340605 


Hs.105887


ESTs, Weakly similar to Homolog of rat Z


3.0 


423583 
418304 


AL122055 




KIAA1028 protein 


3.0 


AA215702




gb:zr97g10.r1 NCI_CGAP_GCB1 Homo sapiens


3.0 


417206 


AA291183 




hypothetical protein FLJ11021 similar to


3.0


410011 


AB020641 


Hs.57856


PFTAIRE protein kinase 1 


3.0 


411850 


AK002033 


Hs.72782 


hypothetical protein FLJ11171 


3.0 


438986 


AF085888 




ESTs 


3.0 


445921 


AW015211 


Hs.146181


ESTs 


3.0 


447124 


AW976438 


Hs.17428


RBP1-like protein


3.0 


452953 


AI932884 




ESTs, Weakly similar to A46010 X-linked


3.0 



Pkey: Unique Eos probeset identifier number 






WO 03/042661 



PCT/US02/36810 



CAT number: Gene cluster number



419536 
419544 
423412 



CAT Number 

1247077_1

1253334_1

147525_1

156454_1

162718_1

163001_1
167238_1
173658_2
177521_1
179788_1
185688_1
185760_2
228001_1
232161_1
262918_1
266650_1

271683_1

27415_1



429163 
429220 
429258 

430968 



432125 
432189 
432340 
432363 
432600 
432810 



440840 
441102 



443161 
445808 
447082 



300543_1
301384_1
301917_1
325772_1



341776_1
342819_1
345248_1
345469_1



38916_1
396504_1
398093_1
44433_2



45074_1
468554_1
47334_1
479942_1



561305_1
65133_1
707248_1



AW848047 AW848202 AW848631 AW848142 AW843702 AW848121 AW848632 AW848140 AW848571 AW848009 AW848067 AW848069
AW848905 AW848214

BE160198 AW935898 T11520 AW935930 AW356073 AW861034
AA143654 AW753140 AA213770 AW970865 AA569075 AA492132

AA743331 AA837388 AW664540 AA775711 AA150965 BE465475 AA968994 AA886905 AI141054 AW194991 AA252147 AA319500 AI184288
AA708749 AA644620 AA652769 AA242975 AA151074 T19890

AI267700 AI720344 AA191424 A:023543 AI469533 AA172056 AW958465 AA172236 AW953397 AA355086
H51299 H44619 H46391 R86024 H51892 T72744

AI633044 AW016212 AW241143 AA769058 R43272 AW068958 AA210918 AA293774 AI748815 AI763294 AI333114 AI277384 AI088297 AI468477

AI824624 AW189606 AI631751 Z40749 AI984573 AI671316 AA189024 AW235412 BE178426 R24677 R40635 H05100 R40597

AW934714 BE161007 BE162500 AW749902 AW749864 BE162498 BE161005 AA190449 AW513465 BE161006 BE162499

AA196390 AA507837 AA196468

AA215702 AA368006 AA215703 BE066555 BE006876

AA226198 AA226513 AA383773

T65754 AA229857 AA229658

AA603305 AA244095 AA244183

AI909154 AA526337 AA244193 AI909153

AF109300 AI299378 AI202654

AA331156 AA331157 AA331155

AA769045 AA372590 AW963633

AA377823 AW954494 AI022688

AA380153 AA380233 AW963529

AA3845C4 W/9CC47! II'. 21 N4129 DG3213 AAB8S888 AI922414 AW044240 AW196808 AI076736 AA599294 AI954433 AW1 17617 AI640323 
H98134 

AK001536 AA191092 AW510354 AI554258 AL353968 AA134266

AI739168 AA426249 AI199636 AW505198 AW977291 AA824583 AA883419 AA724079 AI015524 AI377728 AW293682 AI928140 AA731438

AI092404 AI085630 AA731340 

AA884766 AW974271 AA592975 AA447312 

AW207206 AW341473 AA448195 AI951341 

AA448765 C04967 C03045 AA658293 

AW072916 AI184913 AA489195 AW465994 AW459044 N59350 AI819642 AI280239 AI220572 AA789302 AI473611 AW841126 D60937
AW972830 AA527647 AA489820 AA570362

AF072813 AF119297 AA362885 AF059524 NM_006054 AA157365 AW163623 AA056148 AA227062 AA418057 AA227076 AA078753 AA233594

D58629 AA232373 AA233577 T35956 BE618035 AA354497 AA359082 T32010 AA134519 BE299901 BE268096 BE395826 AA324268 AL120308

AA187561 AA311680

H28383 AW972670 H28359 AA525808

AW972667 AA526539 AI057032 AW167842

AA527941 AI810608 AI620190 AA635266

AA534222 AA632632 T81234

AA534489 AW970240 AW970323 

AI821085 AW973464 AA554S02 AI821831 AA657438 AA640756 AA650339

AA863400 AI991439 AW016017 AW014704 AI367512 H17550 AA744752 R46187 AW471324 AI126670 AA526033 AI276287 AI094253 AI286003
AI147163 AI911443 AW512612 AA972102 AA999975 AI684428 AI335035 D63102 AI524234 AI539156 AA565542
T55958 T57205 AF147346
AA657494 AI582663 AI581639

AI692552 AI393343 AI800510 AI377711 F24263 AA661876

AA156781 AW293839 U52054 AA024963 AA778445 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992

AA837481 AW468444 BE185091 AW468002 AA687333 AA811830 AA581806 AI856686 AI572124 AA043777 AA040926 D20160 AI536733

AA812489 AW874142 AI47188

AW250553 UD7876 Z36843 R30693 AM90097 AW955317 



W76326 AF086341 W72300



AW629666 AW959831 AW205739 BE620243 AA412357 AW300025 AW051920 AI288591 AW236114 AI302852 AI038548 AA534496 AI797207
AA921877

AA973905 AI299888 AA917019 H63235 T90771

BE379584 R34211 BE544763 AW973709 AI653056 AI653173 AI288043 AI656750 H74180 AI492830 AI376090 AI472184 D59940 AW170056
AI082443 AW021142 AI167921 AI343677 AI278577 AW130886 AA761517 AI698203 AA115535 AI264790 R34328 D59939 AW205074 AA554902
D62102 AI0007
AI038316 AI344631 AI261653



AI475858 AW959013

NM_014253 AF100772 BE088769 AL022718 BE151779 AW863569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760
N48674 AI375997 R45432 D59344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T61382 AI016320 N45526 T61415 AA331486
N40087 H12925 AA460779 AA096372 BE160847 AW816890 AW816893 AW816577 AW816891 AW816882 AW816868 AW816889 AW816940
AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA010295 BE160823 Z25353 AA150883 AW887764 AW023806 AW022095
AA164518 AA730973 W00417 W65303

AI339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118 AA865829 AW936878

AA453208 NM_004251 U44103 A 6715 17 N 163 AV -21 AA479232 T82809 R67109 AW796099 AI833027 AI765395 AI763029 AI758228
BE326331 AI934576 AI922373 AW276431 AI718466 N35566 AA904753 BE464245 AI338752 AI659875 AW272338 AI423136 AI089270 AI160904
AA664354 BE1

AI831594 AW970667 AW027959 AI129800 AI927949 AI650270 AI625105 AW514661 AI708393 AL138076 BE180510 AI926721 AI399955 AA749139
AI862160 AW874011 AI242763 AA252795 AA039864 H73499 A 093249 BE245661 AI816834 N25206 AA828301 AI084565 AI302816 AA026905
M77255

AA418841 AI452657 AI768876 AA028973 BE179873 C00215 AA418930
AL134757 AW079131

BE160198 AW935898 T11520 AW935930 AW856073 AW861034




AW974811 AA651634 AA650072



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
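As an illustration of how the nucleotide-position entries in this table can be read, the following minimal sketch parses one entry into exon intervals and sums their lengths. The function names are hypothetical, and the treatment of each range as inclusive start and end coordinates is an assumption; the legend states only that the column indicates nucleotide positions of predicted exons.

```python
def parse_nt_positions(field):
    """Split an entry such as '29235-29336,36363-36580' into (start, end) pairs.

    Assumption: each comma-separated item is one predicted exon written as
    'start-end', with both coordinates inclusive.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a stray trailing comma
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"exon end precedes start in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Total number of nucleotides covered, counting both endpoints of each exon."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_positions("160456-160567,164757-164873")
    print(exons, total_exon_length(exons))
```

The strand column (Plus or Minus) is not used here; a reader working with the Minus strand would additionally need to reverse-complement the extracted sequence.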



400880 
401197 
401424 
401451 
401519 
401558 
402031 
402802 



Ref



6634068 
6649315 
7139678 
7656761 
3287156 



7249169 Minus 

9796668 Plus 

9796810 Plus 

9454643 Plus 

1621110 Plus 



Nt_position

13558-13721,13942-14090,14554-14679

160456-160567,164757-164873 

29235-29336,36363-36580 

176341-176452 

24223-24428 

119926-121272 

157315-157950 

103510-104090 

33080-33263,33939-34094,36103-36507 



55707-55859,56369-56511 




102999-103145 

114550-114688,117265-117407,119490-119599,123237-123395,131140-131217 
4502-4644,5983-6083 



Table 61A lists about 440 genes up-regulated in prostate cancer compared to normal prostate and normal adult tissues. These were selected from 59680 probesets on the
Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" prostate cancer to "average" normal prostate tissue was greater than or equal to 2.0 and the ratio to normal adult
tissues was greater than or equal to 2.0. The "average" prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average" normal prostate tissue level was
set to the [illegible] percentile amongst prostate tissues. The "average" normal adult tissue level was set to the [illegible] percentile amongst [illegible]. In order to remove gene-
specific background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator
before the ratio was evaluated.
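The selection rule described above can be sketched in a few lines. This is an illustrative sketch only, not the procedure actually used to build the table: the function names are hypothetical, the linear-interpolation percentile and the handling of a non-positive denominator are assumptions, and the percentiles for the two normal tissue levels are left as required arguments because they are not legible in this passage.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between closest ranks (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sample is undefined")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def corrected_ratio(tumor, normal, non_malignant, normal_pct,
                    tumor_pct=75.0, background_pct=10.0):
    """Ratio of 'average' tumor to 'average' normal level for one probeset.

    The background (a low percentile amongst non-malignant tissues) is
    subtracted from both numerator and denominator before dividing.
    """
    background = percentile(non_malignant, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        # Assumption: a normal level at or below background gives an
        # unbounded ratio when any tumor signal remains, else zero.
        return float("inf") if numerator > 0 else 0.0
    return numerator / denominator


def is_selected(tumor, normal_prostate, normal_adult, non_malignant,
                prostate_pct, adult_pct, threshold=2.0):
    """Apply both ratio cut-offs stated for Table 61A."""
    return (corrected_ratio(tumor, normal_prostate, non_malignant, prostate_pct) >= threshold
            and corrected_ratio(tumor, normal_adult, non_malignant, adult_pct) >= threshold)
```

Subtracting the same background value from both terms means a probeset with a high non-specific hybridization floor is not selected merely because its raw intensities are large.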

TABLE 61A: ABOUT 440 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE AND NORMAL ADULT TISSUES 



UnigeneID Unigene Title



444151 
421566 
420729 



450096 AI682088



443271 
434078 
431117 

416182 
421913 
434217 



425905 
425018 
452221 
421552 



AW972917 Hs.128749



Hs.79375 
Hs.171353 
Hs.195704 
Hs.283683 
Hs.250500 
Hs.180450 
Hs.79069 
Hs.109439 
Hs.23349 
Hs.159525 
Hs.23128 
Hs.227209 
Hs.22920 



U66468 
AA315267 
AL050102 
Z45051 



AL049610 Hs.95243 



AB032959 
BE245277 
C21322 



cell growth regulatory with EF-hand doma 
ESTs 

hypothetical protein FLJ21617
similar to S68401 (cattle) glucose induc
ESTs, Weakly similar to protease [H.sapi
transcription elongation factor A (SII)-



Hs.105700 

D11928 Hs.76845 

BE326276 Hs.8861 

NM_014324 Hs.128749

AA410506 Hs.27973 

AL120659 Hs.6111 

AL049964 Hs.8358 



novel C3HC4 type Zinc finger (ring finge
E4F transcription factor 1
hypothetical protein FLJ22242
secreted frizzled-related protein 4
phosphoserine phosphatase-like
ESTs 






446896 
449118 
427472 
431548 
417153 



T16971 

T15767 

R67477 

AA522539 

AI834273

X57010

NM_003816
NM_006441
X72755 
AW973653 



414343 
451684 
421470 
426350 
436476 
437571 
447072 
444917 
417318 
434170 






NM_003245
AA326108
AA760894



451952 
431676 
462242 
401519 
414342 



439518 
409151 
420544 
444100 



448664 
432242 
433887 



418432 
433409 
447247 
423749 
444755 
431912 



Hs.289014 

Hs.22452 

Hs.23103 

Hs.131250

Hs.9711 

Hs.81343 

Hs.66744 

Hs.118131
Hs.77367
Hs.20104
Hs.79018

Hs.323378
Hs.26813 
Hs.1378 



ESTs, Weakly similar to A43932 mucin 2 p 
Homo sapiens mRNA for KIAA1737 protein,
Bed (S. cerevisiae) homolog
transposon-derived Buster3 transposase-l
novel protein

collagen, type II, alpha 1 (primary oste
twist (Drosophila) homolog (acrocephalos
a disintegrin and metalloproteinase doma
5,10-methenyltetrahydrofolate synthetase
monokine induced by gamma interferon
hypothetical protein FLJ00052
chromatin assembly factor 1, subunit A (
ESTs, Weakly similar to T43458 hypotheti 
coated vesicle membrane protein 
CDA14 



R68651 Hs.144997 

AW953937 Hs.240845 

AA626509 Hs.122329

AF155099 Hs.279780

AI925195 Hs.130891

N40850 Hs.28625

AW008798 Hs.343877

AI470523 Hs.139336

D13635 Hs.155287

R96696 Hs.35598

D20569 Hs.169407

AI952677 Hs.108972

BE566249 Hs.20999

AL120173 Hs.301663
AI685464 

R50956 Hs.159993 

AA742181 Hs.75912 

AA004410 Hs.100009

NM_004477 Hs.203772

AF123050 Hs.44532

AA365752 Hs.155965 

AI676062 Hs.111902 

AW500239 Hs.21187 

H56585 Hs.196308 

BE242803 Hs.262823 

AI041530 Hs.132107 

AP000692 Hs.129781 

AA329796 Hs.1098 

N34742 Hs.170065 

AL048842 Hs.194019 

H65423 Hs.17631 

AA814948 Hs.96343 

AA460421 Hs.30875 
W76326 

AA306105 Hs.50785 

AA677577 Hs.98732 

AA383343 Hs.22116

AI911527 Hs.11805 

AB020725 Hs.58009 

BE439899 Hs.89271 

AF059214 Hs.194687

AI879317 Hs.334691 

AW022715 Hs.162160 

AW204232 Hs.279522 

AI364997 Hs.7572 

X03635 Hs.1657 

AI085198 Hs.164226 

AL117429 Hs.240845

M14156 Hs.85112 

AI278802 Hs.25661 

AW369351 Hs.287955 

U09848 Hs.132390 

AA431791 Hs.113823 

AI660552 Hs.76549 

AK001745 Hs.184628 

AB033059 Hs.18705 

AF097994 Hs.301528 

NM_004655 Hs.127337
AA889120 Hs.110637
NM_014700 Hs.119004

AI204033 Hs.271461
AW023630 Hs.159425



hypothetical protein FLJ20039
ATP-binding cassette, sub-family C (CFTR
KIAA0010 gene product
ESTs 

SAC2 (suppressor of actin mutations 2, y 
Homo sapiens mRNA; cDNA DKFZp434P228 (fr 
hypothetical protein FLJ23142
ESTs

gb:tt88f04.x1 NCI_CGAP_Pr28 Homo sapiens
glycosyltransferase

C15000476*:gi|12737279|ref|XP_012163.1|
KIAA0257 protein

acyl-Coenzyme A oxidase 1, palmitoyl
FSHD region gene 1



Homo sapiens cDNA: FLJ23068 fis, clone L
tryptophan rich basic protein 
hypothetical protein FLJ10326 
ESTs 

chromosome 21 open reading frame 5 

DKFZp434J1813 protein 

Homo sapiens cDNA FLJ13492 fis, clone PL



gb:zd50d04.r1 Soares_fetal_heart_NbHH19W
SEC22, vesicle trafficking protein (S. c
Homo sapiens Chromosome 16 BAC clone CIT
CDC14 (cell division cycle 14, S. cerevi



cholesterol 25-hydroxylase 

splicing factor 3a, subunit 1, 120kD 

ESTs, Weakly similar to ALU4_HUMAN ALU S



DKFZP434D146 protein



ESTs 



NT 



Homo sapiens cDNA FLJ13090 fis,
zinc finger protein 36 (KOX 18)
ClpX (caseinolytic protease X, E. coli)
ESTs, Weakly similar to A56154 Abl subst
hypothetical protein FLJ10883 
KIAA1233 ' ' 



axin 2 (conductin, axil) 
homeo box A10
KIAA0665 gene product
ESTs, Weakly similar to I38022 hypotheti
414664
437162 
421828 
418821 
422049 



448807 
439209 
420077 
451009 
452281 
452561 
447033 
427138 



425900 
431689 
440749 



451367 
445472 
408101 
421246 
426227 
402802 
419168 
436207 
453864 
448944 
444489 
453124 
453078 
439444 
413492 
434804 
445840 



Hs.122669 
Hs.38085 
Hs.66295 



AA436002 Hs.183161

W25760 Hs.77631 

AI263307 Hs.239884 

AI566086 Hs.153716 

AA151730 Hs.301789 

AA317439 Hs.28707 

AI571940 Hs.7549 

AF087993 Hs.91954 

AW512260 Hs.87767 

AA013140 Hs.115707

T93500 Hs.28792 

AI692181 Hs.49169 

AI357412 Hs.157601 

N77624 Hs.173717 

BE281238 Hs.8886 
AA743331 

T06809 Hs.332086 

AF030880 Hs.159275 

AW118681 Hs.128051 

NM_003416 Hs.2076

BE266659 Hs.293659 

AI918950 Hs.123642 

AU076674 Hs.198899 

AW404075 Hs.42785 

M78878 Hs.7314 

AI224483 Hs.16063 

AK000166 Hs.288809 



Y00971 Hs.2910 

AK002015 Hs.5887 

R37257 Hs.184780

AA502972 Hs.183390

R61680 Hs.26904 

AI174947 Hs.295789 

AL035588 Hs.153203 

AA923729 Hs.26322 

AB006S31 Hs.12784 

AW968504 Hs.123073

AW582962 Hs.102897

U67058 Hs.154299

AI336132 Hs.33718

AA334774 Hs.12845

AW021407 Hs.21068 

AB014605 Hs.22599 

AI151010 Hs.157774 
AI139058

AF053551 Hs.31584 

AI277652 Hs.54578 

D87470 Hs.75400 

AA649530 Hs.348148 

AI277811 Hs.146291 
AW976434 



KIAA0264 protein 

Homo sapiens, Similar to RIKEN cDNA 0610
multi-PDZ-domain-containing protein
thyroid hormone receptor coactivating pr
histone deacetylase 3 
ESTs 

glycine cleavage system protein H (amino 
H2B histone family, member L 
Homo sapiens mRNA for Hmob33 protein, 3'
nudix (nucleoside diphosphate linked moi 
signal sequence receptor, gamma (translo 



phosphatidic acid phosphatase type 2B 
hypothetical protein FLJ20424
hemoglobin, alpha 2 
ESTs 

solute carrier family, m 



430226 
400296 
420154 



447726 
447816 
420407 



zinc finger protein 7 (KOX 4, clone HF.1 
Homo sapiens, Similar to RIKEN cDNA A430 
EphA3 

eukaryotic translation initiation factor 
DC11 protein 

hypothetical protein FLJ21877
hypothetical protein FLJ20159
UDP-Gal:betaGlcNAc beta 1,3-galactosyltr



Eos Control 

phosphoribosyl pyrophosphate synthetase

RNA binding motif protein 7 

ESTs 

hypothetical protein FLJ13590
ESTs, Moderately similar to Z195_HUMAN Z
Homo sapiens mRNA; cDNA DKFZp554D1154 (f 
MyoD family inhibitor 



CGI-47 protein 

Human proteinase activated receptor-2 mR 
NM_001397:Homo sapiens endothelin conver
Homo sapiens cDNA FLJ12641 fis, clone NT
hypothetical protein MGC13159
hypothetical protein 



Hs.125790 leucine-rich repeat-containing 2 



ESTs, Weakly similar to I38022 hypotheti 
KIAA0280 protein 

gb:ns44f05.s1 NCI_CGAP_Alv1 Homo sapiens



Hs.300383 

D00723 Hs.77631 

AF129534 Hs.49210

D87742 Hs.241552 

M90516 Hs.1674 

BE245562 Hs.2551 

AA305627 Hs.139336 

AI093155 Hs.95420 

AA382267 Hs.10653 

F18572 Hs.22978 

BE293466 Hs.20880 

AB020689 Hs.90419 

AW975944 Hs.237396 

AI356967 Hs.43086 

AW015211 Hs.146181 
AI633044 

AW578849 Hs.283552
N40087

AL137638 Hs.19368

NM_007233 Hs.274329

AA814732 Hs.145010

AK000181 Hs.114556



ESTs 

hypothetical protein FLJ11220 
hypothetical protein MGC3032 
glycine cleavage system protein H (amino 
F-box only protein 4 
KIAA0268 protein 

glutamine-fructose-6-phosphate transamin
adrenergic, beta-2-, receptor, surface 
ATP-binding cassette, sub-family C (CFTR 
JM27 protein 
ESTs 

ESTs, Weakly similar to ALU4_HUMAN ALU S
ESTs, Weakly similar to I38022 hypotheti
KIAA0882 protein 
ESTs 

ESTs, Weakly similar to A46010 X-linked 



TP53 target gene 1 
lipopolysaccande spi cific nr. 
hypothetical protein FLJ20174 



420297 
427701 
437812 
445808 



451796 
407783 
459029 
441676 



AI628272 Hs.88323 

AA411101 Hs.243886 

AI582291 Hs.16846 
AV655234 

AW863261 Hs.242413 

AI287817 Hs.129636 

AL044818 Hs.84928 

AI093930 Hs.163440 

AL133019 Hs.27038

AW996872 Hs.172028

AA131376 Hs.343809



Hs.49889 ESTs 



ESTs, Weakly similar to ALU1_HUMAN ALU S
nuclear autoantigenic sperm protein (his
ESTs, Weakly similar to 04HUD1 debrisoqu 
ESTs, Moderately similar to PC4259 ferri 
hypothetical protein DKFZp434K1421 
ESTs 

nuclear transcription factor Y, beta 
Homo sapiens cDNA: FLJ21000 fis, clone C
Homo sapiens mRNA; cDNA DKFZp434G2127 (f
a disintegrin and metalloproteinase
fibroblast growth factor 12B 



BE614347 Hs.169615 

AW243307 Hs.83937 

AW978439 Hs.69504

BE086548 Hs.42346

AF013758 Hs.109643

BE391727 Hs.102910

N29862 Hs.44104

NM_000163 Hs.125180

AW970602 Hs.105421 
AW934714 

AW015242 Hs.99488 

X95097 Hs.2126 

AA654650 Hs.282906

AA844518 Hs.300876 



calcineurin-binding protein calsarcin-1 
polyadenylate binding protein-interactin
general transcription factor IIH, polype 



gb:RC1-DT0Q01-C31299-01 1-a1 1 DT0CC1 Homo 
EST'.:. We -■■ y ■jimilv:- c 'r <54. YEAST HYPOT 
vasoactive intestinal peptide receptor 2 



433577 
439093 
442833 

409619 
432729 
452721 
447342 
409205 
422805 
446238 
447313 
411630 



417683 
414312 
417135 
437748 
422728 
453070 
406627 
407151 
425815 

431474 
436204 
439328 
449115 
437531 
445424 
403047 
416198 
417576 
430607 
436401 
450654 
426216 
431325 
410102 



BE622585 Hs.3731 

AA010200 Hs.175551 

AW975014 Hs.26 

AI000341 Hs.220491 

AA922153 Hs.132760 

L32137 Hs.1584 

AA447718 Hs.107057 

AI569936 Hs.296178 

AW007080 Hs.284192 

AA534163 Hs.5476 

AA328153 Hs.88201 

BE618582 Hs.97661 

AK001015 Hs.55220 



Hs.130732

AJ269529 Hs.301871 

AI199268 Hs.19322 

AI952884 Hs.14832 

AA436989 Hs.121017 

T95143 Hs.14511 

U92981 Hs.18081 

U42349 Hs.71119 

R94088 Hs.43569 

U61397 Hs.81424 

NM_014735 Hs.82292 

AI830342 Hs.211272 

AF107454 Hs.107537 

AA446539 Hs.339024 

AA659037 Hs.163780 

AW024937 Hs.29410 

AB037821 Hs.146858 

AW566008 Hs.239154 

AA155694 Hs.191060 

AA422067 Hs.50547 

AF234882 Hs.5814 

AW937826 Hs.103262 

AK001465 Hs.31575 

T64904 Hs.163780 

H25836 Hs.301527 

R94023 Hs.94560 

AA706910 Hs.112742 

AL133990 Hs.190642 

AK002014 Hs.47546 

W07411 Hs.118212

AW959952 Hs.37528 

AI400752 Hs.112259 

AB028945 Hs.12696 

H27332 Hs.99598 

AA339449 Hs.82285 

AW973521 Hs.247324 

AI087958 Hs.29088 

AJ245587 Hs.25275 

N77630 Hs.13895 

AW026751 Hs.5794 

AW248508 Hs.279727 



hypothetical protein MGC15729



hypothetical protein FLJ22637



hypothetical protein FLJ20285
solute carrier family 37 (glycerol-3-pho
Homo sapiens, Similar to RIKEN cDNA 2010
ESTs, Moderately similar to unnamed prot
H2A histone family, member A 
SCO (cytochrome oxidase deficient, yeast 
Homo sapiens clone DT1P1B6 mRNA, CAG rep 
Putative prostate car 



KIAA0215 gene product 



suppression of tumorigenicity 7 

ESTs, Weakly similar to ZN91_HUMAN ZINC

SEC63, endoplasmic reticulum translocon 



Homo sapiens cDNA FLJ 11' 
ESTs, Ml 



toALU3_Hl 



ne HE 



ESTs, Weakly similar to AF090944 1 PRO06 
T cell receptor gamma locus 
cortactin SH3 domain-binding protein 
NM _00666fi*:Homo sapiens transmembrane pr 
hypothetical protein MGC5338 



mitochondrial ribosomal protein S14

Kruppel-type zinc finger protein
Homo sapiens cDNA FLJ11554 fis, clone HE
EST , al mi 109260A I el
Homo sapiens cDNA FLJ14035 fis, clone HE






446791 
418004 
419791 
447210 



AI632278 

U37519 Hs.87539 

AI579909 Hs.105104 

AF035269 Hs.17752 



Hs.195922 ESTs 



aldehyde dehydrogenase 3 family, member 




435102 
456161 
407021 
437517 
408267 
447713 
456848 
437617 

443180 
445929 
424701 
445255 
409731 
421709 
412707 
418882 
451071 



421059 
426413 
431147 
442677 
446091 
431563 
439478 
413049 
405685 
407917 
450378 
429303 
417248 
429277 



432453 
447499 
452994 
450325 
422390 



410733 
442995 



AW899053 Hs.76917 

BE264645 Hs.282093 
U52077 

AI927675 Hs.244482 

AW380525 Hs.343564 

AI420733 Hs.207083 

AL121087 Hs.296406 

AI026701 Hs.5716 

AI493046 Hs.146133 

R15875 Hs.258576 

AI089660 Hs.323401 

NM_005923 Hs.151988

NM_014841 Hs.12477 

AA125985 Hs.56145 

AA159394 Hs.107056 

AW206373 Hs.16443 

NM_004996 Hs.89433 

AW138807 Hs.204939 

BE463721 Hs.97101 

NM_014158 Hs.279938

AW247430 Hs.84152 
AI909154 

AK002055 Hs.151046 

AW450461 Hs.203965 

AI680772 Hs.306094 

AA099802 Hs.83883

AI129767 Hs.182874 

AA406400 Hs.12482 

AI954968 Hs.279009 

AA157291 Hs.21479 

AA102616 Hs.120769

AI683487 Hs.152213

AA737106 Hs.32250
AL134757

AI654133 Hs.30212
AA377823 

AI767751 Hs.20300 
AI557914 

AW022192 Hs.200197 

AI027643 Hs.120912 



gb:zx10e09.r1 Soares_total_fetus_Nb2HF8_
F-box only protein 8
hypothetical protein FLJ21918
gb:Human mariner1 transposase gene, comp
Homo sapiens, clone IMAGE:3611719, mRNA,
tubulin-specific chaperone e 
ESTs 



ESTs 

dpy-30-like protein 

mitogen-activated protein kinase kinase 
synaptosomal-associated protein, 91 kDa
thymosin, beta, identified in neuroblast 
CED-6 protein 

Homo sapiens cDNA: FLJ21721 fis, clone C
ATP-binding cassette, sub-family C (CFTR
ESTs, Highly similar to A54677 homeotic
putative G protein-coupled receptor
HSPC067 protein
cystathionine-beta-synthase
gb:QV-BT200-010499-007 BT200 Homo sapien
hypothetical protein FLJ11193



ESTs 

trinucleotide repeat containing 12



2 nucleotide binding protein (G pr 



matrix Gla pre 

gb:zn43e07.s1 Stratagene HeLacells393 
wingless-type MMTV integration site fami 
ESTs, Moderately similar to 178885 serin 
gb:DKFZp547G1090_r1 547 (synonym: hfbrl) 
thyroid receptor interacting protein 15
gb:EST90805 Synovial sarcoma Homo sapien 
ESTs 



Hs.6574 

NM_002151 Hs.823

U63139 Hs.41587

AW249181 Hs.19954

AW137635 Hs.44238

AA329449 Hs.247302 

AW452016 Hs.127863 

AI815470 Hs.260024 

AL080058 Hs.6909 

AA768553 Hs.193145 

H63010 Hs.5740 

U09550 Hs.1154 

AF051151 Hs.114408

AI885537 Hs.27172 

AW262580 Hs.147674 

AW962597 Hs.31305 

AI935962 Hs.26289 

AW450893 Hs.121830 

BE539108 Hs.22051 

AI127483 Hs.120451

NM_001197 Hs.155419

U80456 Hs.27311 

AW821182 Hs.61418 

AF086534 Hs.187561 

BE548555 Hs.118554

AF007835 Hs.32417

D84284 Hs.66052

Y15221 Hs.103982

AA532511 Hs.288455 



RAD50 (S. 
ESTs, Weakly similar to T19873 hypotheti
ESTs, Weakly similar to S65657 alpha-1 C- 
twisted gastrulation 

ESTs, Weakly similar to SFR4_HUMAN SPLIC 



DKFZP564G202 pn 



ESTs, Moderately similar to PC4259 ferri 
protocadherin beta 16 
KIAA1547 protein 
ESTs 

ESTs, Weakly similar to T42682 hypotheti 
hypothetical protein MGC15548 
ESTs, Weakly similar to unnamed protein
BCL2-interacting killer (apoptosis-induc
single-minded (Drosophila) homolog 2



ESTs, Moderately sin 
CGI-83 protein 
hypothetical protein fx 
CD38 antigen (p45) 

5 cytokine 

: cDNA: FL 



422424 AI186431 Hs.296638

433444 AW975324 Hs.129816 

446354 AW449650 Hs.346335 

438869 AF075009 

425628 NM_004476 Hs.1915 

424036 AA770688

440146 AW014231 Hs.90790 

407253 AA411175 Hs.141939 



415162 
423349 
424534 
407756 
440658 
416547 



416208 
407182 
421689 
432833 
425170 
436278 
438719 



424081 
433800 
413627 
414133 
411096 
450244 
422493 
409219 
419986 
421969 
434128 
434503 
443292 
443555 
448172 
452039 
400290 
449625 
441406 
451369 



437323 
410076 
421501 
419733 



AF035718 Hs.78061 

AF010258 Hs.127428 

D87682 Hs.150275

AA116021 Hs.38260 

H29142 Hs.143032 

H62914 Hs.268946

AK001239 Hs.274263

NM_004786 Hs.18792

AW372914 Hs.86149 

AW819642 Hs.24135 

AI933416 Hs.189674

NM_006363 Hs.173497

AB023191 Hs.44131

AA227609 Hs.94834

R35252 Hs.130558

AA508353 Hs.105314

D63480 Hs.278634

NM_016156 Hs.181326

AA250970 Hs.251946

AI311295 Hs.344478

AW372449 Hs.61271 

AA706319 Hs.98416 

AA298489 Hs.303171 

AI0S1118 Hs.65328 

AA909249 Hs.112282

AI523398 Hs.17617

D29643 Hs.34789

AK000039 Hs.132826

AW291168 Hs.41295 

AA312551 Hs.230157 
N87820 

N51075 Hs. 

AU077315 Hs.154970 

BE396290 Hs.5097 

AA357129 Hs.239625 

NM_006979 Hs.278721

J02931 Hs.62192

R25685 Hs.211933

NM_006413 Hs.139120

AI034361 Hs.136150 

BE182082 Hs.246973 

AW022188 Hs.41167 



prostate differentiation factor 



gb:Homo sapiens full length insert cDNA 



H2A histone family, member L
Homo sapiens cDNA: FLJ22930 fis, clone K
NM_021965*:Homo sapiens phosphoglucomuta
ESTs, Moderately similar to S65657 alpha
C2002008:gi|7303957|gb|AAF590CC.1|(AEOO 
transcription factor 21 

KIAA0241 protein 

ubiquitin specific protease 18

ESTs, Weakly similar to neuronal thread

ESTs, Weakly similar to PC4259 ferritin

hypothetical protein FLJ10377




ir, family 51, subfamily 
Fanconi anemia, complementation group F 
'2 (zinc transport 



dolichyl-diphosphooligosaccharide-protei
Homo sapiens cDNA FLJ14913 fis, clone PL
ESTs, Weakly similar to MUC2_HUMAN MUCIN 



106826 KIAA1696 protein 



collagen, type XIII, alpha 1 
ribonuclease P (30kD)
lung type-I cell membrane-associated gly



hypothetical protein FLJ13158 



AI345455 Hs.78915 

AF052217 Hs.110099

W93170 Hs.284164 

T96231 Hs.17762 

AK000213 Hs.9196 

N71710 Hs.21398 

N75276 Hs.135904 

AI922988 Hs.172510 

H18836 Hs.31608 
NM_014253



AW953437 Hs.5486 

AW847814 Hs.289005 

AI432672 Hs.288539 

N62096 Hs.293185 

AW138343 Hs.141867 

AA371145 Hs.194397 

T05387 Hs.7991 

M29971 Hs.1384 



hypothetical protein FLJ20041 

odz (odd Oz/ten-m, Drosophila) homolog 1

phosphoprotein regulated by mitogenic pa 

ESTs 

clone FLB5214 

Homo sapiens cDNA: FLJ21532 fis, clone C 

hypothetical protein FLJ22191

ESTs, Weakly similar to JC7326 amino aci
Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers 



AA743331 AA837388 AW664540 AA775711 AA150965 BE465475 AA968994 AA886905 AI141054 AW194991 AA252147 AA319500 AI184288
AA708749 AA644620 AA652769 AA242975 AA151074 T19890

AI633044 AW016212 AW241143 AA759058 R43272 AW068958 AA210918 AA293774 AI748815 AI763294 AI333114 AI277384 AI088297 AI468477
AI824624 AW189606 AI631751 Z40749 A964673 AI671316 AA169024 AW235412 BE1784

AW934714 BE161007 BE162500 AW749902 AW749864 BE162498 BE161005 AA190449 AW513485 BE161006 BE162499
AI909154 AA526337 AA244193 AI909153

AW362955 H59488 AI040666 W60959 W94209 H27231 T84S25 H75715 W04957 W63S7S AA659693 AA514302 W63789 BE046412 T91396
AI951970 AW044233 N20018 AW663548 T9Q114 AI139947 AA809643 AA846232 AA581966 AA789002

AA770688 H15373 AW161070 BE304523 BE378517 AA939300 AA904029 BE254211 AA449148 AI268420 AI300495 AI215637 AI300494 AI268551

AA928971 AA179427 AA947684 BE393792 H98018 AI885781 AI188567 AI290658 C15404

AA377823 AW954494 AI022688

AA448765 C04967 C03045 AA658293 

AI685464 AW971336 AA513587 AA525142

AB018320 H56457 AA247916 N83488 N87920 AA095653 T19858 AL134279 AA094167 AI873378 AI000340 R47500 W16595 AW152297 AI625937
AA002027 AI814851 AA902666 AI039729 AW975053 BE302243 AI240793 AA193203 N5581
AF075009 R63109 R63068
W76326 AF086341 W72300

AI557914 W81031 AW473764 AI814081 W81068 AW182826 AW173296 AI376594 AI220500 BE257195 BE246486 R55637 C20788 AI014407 




AV655234 AW966332 AA340239

NM_014253 AF100772 BE088769 AL022718 BE161779 AW863569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760
N48674 AI375997 R45432 D59344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T613

N40087 H12925 AA460779 AA096372 BE160847 AW816890 AW816893 AW816577 AW816891 AW816882 AW816868 AW816889 AW816940

AW816892 AW816941 AW816578 AA029183 T83320 H85B50 AA010295 BE160823 Z25353 AA150883 AW8

AL134757 AW079131



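Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.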



402802 
403047 
404641 



9719705 Plus 

6649315 Plus 

7139678 Plus 

3287156 Minus 

3540153 Minus 

9796810 Minus 

4508129 Minus 

9114084 Plus 



176341-176452 
157315-157950 
103510-104090 
53242-53432 



Table 62A lists about 600 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip
array such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 3.0. The "average" prostate cancer level was set to the 75th
percentile amongst prostate cancers. The "average" normal prostate tissue level was set to the 95th percentile amongst normal prostate tissues. In order to remove gene-specific
background levels of non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the
ratio was evaluated.

TABLE 62A: ABOUT 600 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE TISSUES 



ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title

R1: Ratio of prostate tumor to normal prostate tissue

Pkey ExAccn UnigeneID Unigene Title

444151 AW972917 Hs.128749 alpha-methylacyl-CoA racemase 

421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop

427274 NM_005211 Hs.174142 colony stimulating factor 1 receptor, fo 

420729 AW964897 Hs.290825 ESTs 

412326 R07566 Hs.73817 small inducible cytokine A3 (homologous 

416188 BE157260 Hs.79070 v-myc avian myelocytomatosis viral oncog

417696 BE241624 Hs.82401 CD69 antigen (p60, early T-cell activati

424247 X14008 Hs.234734 lysozyme (renal amyloidosis) 

401197 ENSP00000229263*:HSPC213. 

426793 X89887 Hs.172350 HIR (histone cell cycle regulation defec 

450096 AI682088 Hs.79375 holocarboxylase synthetase (biotin-[prop

449156 AF103907 Hs.171353 prostate cancer antigen 3, nor-coding DD 

443271 BE568568 Hs.195704 ESTs

434078 AW880709 Hs.283683 chromosome 8 open reading frame 4 

423396 AI382555 Hs.127950 bromodomain-containing 1 
418036
410023 
421684 
426490 



448520 
417315 
433332 
456614 
427019 
432606 






Y14443 

AB001914 

N34145 

Z37976 

AB017169 



AF003522 

Y14443 

AB002367 

AI080042 

AI367347 

AV653110 

AA001732 

NM_002104

AF164142 

D87450 

AF055581 

NM_004354 

U97018 

AI934365 

AW014795 

AW129357 

U66468 

AA315267 

AA394183 

AL050102 



Hs.83337 
Hs.57929 
Hs.106768 
Hs.170087 



zinc finger protein 200 
paired basic an ' 
ESTs, Moderately similar to ZN91_HUMAN Z
latent transforming growth factor beta b
slit (Drosophila) homolog 3
hypothetical protein FLJ10511
aryl hydrocarbon receptor
metallothionein 1E (functional)



449048 
444880 
417407 
436962 
450313 
420120 
438613 



451900 
447769 
452436 
452359 
425018 
447737 
424897 
433867 
427308 
432359 
452221 
416051 
421181 



AW118683

AA923278 

AW377314 

AI038989 

AL049610 

C05569 

M93119 



AW873704 
BE077546 
BE167229 
BE245277 
AK000643 
D63216 



NM_005574
AJ133123 
AI590401 
AA179233 



417022 NM_014737 Hs.
451957 AI796320 



413431 
408405 
421552 



443020 
415752 
412482 
417880 
444381 
448019 



AW246428 
AK001332 
AF026692 



Hs.106650
Hs.173233 
Hs.3066 

Hs.154978

Hs.13131 

Hs.79069 

Hs.12451 

Hs.109439

Hs.23349 

Hs.329700 

Hs.159525

Hs.23128 

Hs.26873 

Hs.227209 

Hs.9884 

Hs.22920 

Hs.154150 

Hs.290905 

Hs.5364 

Hs.332633 

Hs.95243 

Hs.243122 

Hs.89584 

Hs.318584 

Hs.320831

Hs.31447 

Hs.29206 

Hs.154196 

Hs.19404 

Hs.153684 

Hs.3618 

Hs.174905 

Hs.274415 

Hs.288057 

Hs.25253 

Hs.184585

Hs.20196 

Hs.21213 

Hs.42390 

Hs.8941 

Hs.80905
Hs.10299
Hs.288158 
Hs.75355 
Hs.44672 
Hs.105700 



Homo sapiens clone TCCCTA00151 mRNA sequ



hypothetical protein FLJ10970
granzyme K (serine protease, granzyme 3; 
solute carrier family 23 (nucleobase tra 
KIAA0261 protein 
lymphocyte adaptor protein 
cyclin G2 



cell growth regulatory with EF-hand do



hypothetical protein FLJ21617 

spindle pole body protein 

similar to S68401 (cattle) glucose induc

ESTs 

ESTs, Weakly similar to protease [H.sapi 
DKFZP564I052 protein 
Bardet-Biedl syndrome 2 
transcription elongation factor A (SII)-
hypothetical protein FLJ13057 similar to



novel C3HC4 type Zinc finger (ring finge
KIAA0982 protein

Homo sapiens cDNA FLJ14597 fis, clone NT
ESTs, Moderately similar to A46010 X-lin
hypothetical protein MGC14376
E4F transcription factor 1
DKFZP564L0862 protein
frizzled-related protein



Homo sapiens cDNA FLJ10229 fis, clone HE
hypothetical protein FLJ22242
mannosidase, alpha, class 1A, member 1



BE241595 

BE387335

AW947164 

D11928 

AW975531 

BE326276 

NM_014324

AA410506 



AL049964 

T16971 

T15767 

AA251594 

AW292577 



Ras association (RalGDS/AF-6) domain fam
Homo sapiens cDNA FLJ13545 fis, clone PL
hypothetical protein FLJ23591
ubiquitin-conjugating enzyme E2N (homolo
hypothetical protein FLJ10470
secreted frizzled-related protein 4
programmed cell death 5
ESTs, Weakly similar to ALU7_HUMAN ALU S
Hs.44313 v-rel avian reticuloendotheliosis viral
Hs.106129 ESTs

Hs.78776 putative transmembrane protein
Hs.334885 mitochondrial GTP binding protein
Hs.82848 selectin L (lymphocyte adhesion molecule
Hs.283713 ESTs, Weakly similar to S64054 hypothet
Hs.195641 ESTs, Moderately similar to I38022 hypot 
Hs.76845 phosphoserine phosphatase-like 

maintenance deficient (S. 



ESTs 

alpha-methylacyl-CoA racemase 
KIAA0874 protein 
KIAA1708 protein 
aryl-hydrocarbon rec 



Hs.128749 
Hs.27973 
Hs.26549 
Hs.6111 



Hs.269014 ESTs, Weakly similar to A43932 mucin

Hs.22452 Homo sapiens mRNA for KIAA1737 pre 

Hs.43913 PIBF1 gene product 

Hs.94445 ESTs 
449118
427472 
431548 



414734 
429597 
419047 



434540 
452144 
407938 
445467 
414812 
448552 
406672 



414516 
414343 
412490 



436476 
422746 
437571 
407902 
116985 
447072 
449969 
453204 
452092 
438510 
444917 
414212 
451593 

444172 
433600 
434170 



R67477 

AA522539 

AI834273 

X57010 

X91662 

AA151712 

NM_003816

AW952771 

AI831760 

AB012958 

NM_006441

AL157645

NM_016045 

AA032197 



419159 
421470 
411486 



407235 
430935 
412953 



AF226053 

N40850 

NM_006895 

J04088 

AW008798 

AI470523 

AK001776 

AA305599 

W44356 

AA788946

NM_002890

N77999 

D13635 

AA126419 

R96696 



Hs.81343 collagen, type II, alpha 1 (primary oste 
Hs.66744 twist (Drosophila) homolog (acrocephalos
Hs.82572 ESTs 



Hs.90043 
Hs.155111 
Hs.13137 



hypothetical protein FLJ14428



Hs.48793 

Hs.3945 

Hs.102558 

Hs.85050 

Hs.15617 

Hs.77367 

Hs.20104 

Hs.198253 

Hs.79018 



Homo sapiens, clone MGC5352, mRNA, comp 



ESTs, Weakly similar to ALU4.HUMAN ALU S 
monokine induced by gamma interferon 
FLJ00052 



AU076903 

AI796870 
AF216751 
AW959311 
AW449065 
M004825 
AW974945 
R27496 



AI767756 

NM_002291 

AB032947 

NM_003245

AA326108 

NM_004484 

AA760894 

AL038704 

D61594 

AW295142 

R10799 

BE245374 

AL080220 

R68651 

AA136569

AF151879 

AW953937 

BE147740 

R42833 

AA626509 

X99699 

NM_007240 

W31254 

AL138201



assembly factor 1, subunit A ( 
hypothetical protein SP329 
tuberous sclerosis 1 

CDW52 antigen (CAMPATH-1 antigen) 
ESTs, Weakly similar to T43458 hypotheti 
coated vesicle membrane protein 
Homo sapiens cDNA: FLJ22523 fis, clone H
selectin P ligand 

collagen, type XIV, alpha 1 (undulin) 

DNA segment on chromosome X (unique) 992 

CDA14 



hypothetical protein 
annexin A3

eukaryotic translation elongation factor
Homo sapiens mRNA; cDNA DKFZp564C0472 (f 
C50021 54«:gi|729S015|gb|AAF5421 7.1 1 (AEO 
Homo sapiens cDNA FLJ14814 fis, clone NT 
laminin, beta 1 

Ca2+-dependent activator protein for secr
transglutaminase 3 (E polypeptide, prote 
bHLH protein DEC2 



Homo sapiens mRNA; cDNA DKFZp727C191 (fr 
ESTs, Weakly similar to ALU1_HUMAN ALU S
tyrosylprotein sulfotransferase 1 
Homo sapiens cDNA FLJ14337 fis, clone PL



Hs.27842 
Hs.6285 

Hs.144997 ESTs 

Hs.10848 KIAA0187 gene pre 

Hs.26706 CGI-121 protein 

Hs.240845 ESTs 

Hs.104558 ESTs, Moderately

Hs.22232 ESTs 
Hs.122329



Hs.279780
Hs.130891 
Hs.66170 
Hs.28625 
Hs.81182 
Hs.156346 
Hs.343877 
Hs.139336 
Hs.143954 
Hs.238205
Hs.292812 
Hs.101302 
Hs.758 



ESTs 

XIAP associated factor-1
dual specificity phosphatase 12 
GL004 protein 

nuclear receptor subfamily 4, group A, m 
NY-REN-18 antigen
hypothetical protein MGC4400 
HSKM-B protein 
ESTs 



= (DNA) II alpha (170kD) 
hypothetical protein FLJ20039
ATP-binding cassette, sub-family C (CFTR 
hypothetical protein FLJ10914 
hypothetical protein PRO2013 
ESTs, Weakly similar to T33468 hypotheti 
ESTs, Moderately similar to CA1 C RAT COL 
RAS p21 protein activator (GTPase activa
Homo sapiens mRNA full length insert cDN 
KIAA0010 gene product 
inositol polyphosphate-4-phosphatase, ty



WO 03/042661 



PCT/US02/36810 



AI952677 
BE566249 
AL120173 



441390 
447382 
431676 



419111 

425840 
414020 
427078 
419326 
419465 
408705 
429250 
431585 
449673 
438440 
427210 
428673 
456974 
416814 
443123 
436420 
443250 



417986 
438828 
408088 

414002 



416782 
418701 
419083 
429900 
437672 
439518 
450661 



N-acetylgalactosaminidase, alpha- 
Homo sapiens mRNA; cDNA DKFZp434P228 (fr
hypothetical protein FLJ23142 
ESTs 

ESTs, Weakly similar to T34036 hypotheti 
activating transcription factor 3
step II splicing factor SLU7 



AA742181
AA004410
NM_004477



AW971493 

AA365752 

AA234172

AW978731 

NM_002984 

AI676062 

W94915 

AW500239 

AA312135

H56585 

BE242803 

AA002064 

AA807228 

BE396283 

AW601325 

M12529 

AW192307 

M094538 

AA443966

AI041530 

AA099050



Hs.75912 
Hs.100009
Hs.203772 
Hs.75909 



Hs.134269 ESTs, Highly similar to cytokine recepto

Hs.155965 ESTs 

Hs.137418 ESTs 

Hs.301824 hypothetical protein PRO1331

Hs.75703 small inducible cytokine A4 (homologous 

Hs.111902 ESTs

Hs.42419 ESTs 

Hs.21187 Homo sapiens cDNA: FLJ23068 fis, clone t

Hs.46967 HSPC034 protein 

Hs.198308 tryptophan rich basic protein

Hs.262823 hypothetical protein FLJ10326



AB011792 

AA426080

N34742 

AW016809 

H85157 

AW014493 

H60720 

AL048842 

AW503976 

H65423 

AW080339 

AA279530 

AW067805 

NM_013257

AA353113 

AA481003 

AL134275 

AW157022

H04588 

NM_006732

AA502700 

AA836381

L35035 

AA814948 



Hs.337757 
Hs.169401 
Hs.80042 
Hs.272808 
Hs.31595 
Hs.132107

Hs.129781 

Hs.1098 

Hs.130489 

Hs.127826 

Hs.35094 

Hs.98489 

Hs.170065

Hs.119021

Hs.40696 

Hs.126727 

Hs.81892 

Hs.194019 

Hs.10649 



eukaryotic translation initiation factor 

Homo sapiens mRNA; cDNA DKFZp566M063 (fr 

apolipoprotein E 

dolichyl-P-Glc:Man9GlcNAc2-PP-dolichylgl
putative transcription regulation nuclea 
ESTs 
ESTs 

gb:zk85d12.r1Soares_pregnant_uterus_NbH 



DKFZp434J1813 protein
ATPase, aminophosph " 
RalGEF-like protein 3. 



KIAA0101 gene product 



AW952160 
AU076484 
AA306105
AW972830 
AI421645 
AI289619 
AA677577 
AI500642 



Hs.139851 

Hs.13040 

Hs.98732 

Hs.289067 

Hs.2441 

Hs.22116 

Hs.11805



integrin, beta 2 (antigen CD18 (p95), ly
methylenetetrahydrofolate dehydrogenase
serum/glucocorticoid regulated kinase-li
Homo sapiens cDNA: FLJ22743 fis, clone H



FBJ murine osteosarcoma viral oncogene h 
ESTs, Moderately similar to A46010 X-lin 
nuclear receptor co-repressor/HDAC3 comp
ribose 5-phosphate isomerase A (ribose 5 
ESI Ve; 1 milarl LUC HUM t 
Homo sapiens cDNA FLJ12252 fis, clone WA
ESTs 

flavohemoprotein b5? 

gb:zd60d04.r1 Soares_fetal_heart_NbHH19W
ESTs 

TYRO protein tyrosine kinase binding pro 
SEC22 trafficking | oteir i 

3S, MAGL Homo 



G protein-coupled receptor 86 

Homo sapiens Chromosome 16 BAC clone CIT 

Homo sapiens cDNA FLJ114C4 fis, clone HF

KIAA0022 gene product 

CDC14 (cell division cycle 14, S. cerevi 

ESTs 
Hs.10029


cathepsin C










Homo sapiens, clone IMAGE:3636299, mRNA, 






BE439899 




















AF059214 


Hs.194687 


cholesterol 25-hydroxylase 


4.1 




AI861854


Hs.210778


hypothetical protein FLJ10989
putative nuclear protein






AI567669












MAWD binding protein




448664 
404390 






splicing factor 3a, subunit 1, 120kD
Target Exon 


4.1 








hypothetical protein FLJ22341 








Hs.82226 


glycoprotein (transmembrane) nmb 


4.1




AL039185 




thyroid hormone receptor interactor 7 


4.1 




AW970043 


Hs.238039 


hypothetical protein FLJ11090 


4.1 




AW022715 


Hs.162160 


ESTs, Weakly similar to ALU4_HUMAN ALU S


4.1 














AI364997




ESTs 


4.1 




BE348706 






4.1 




NM_006052 




Down syndrome critical region gene 3 


4.1 




AV654978 


Hs.19904 


cystathionase (cystathionine gamma-lyase








estrogen receptor 1 






AI085198










AI972094




Homo sapiens cDNA FLJ13741 fis, clone PL






AW084176 




ESTs, Weakly similar to I38022 hypotheti 


4.1 






Hs.240845 


DKFZP434D146 protein 






W94748 


Hs.48821 




4.0 








insulin-like growth factor 1 (somatomedi










Homo sapiens clone 24538 mRNA sequence 


4.0 
















WW domain binding protein 4 (formin bind 






AB011151 




hypothetical protein MGC14139 










translocase of inner mitochondrial membr






AW369351 


Hs.287955 


Homo sapiens cDNA FLJ13090 fis, clone NT




415710 


J04543 


Hs.78637 




4.0






Hs.246850 








AI186203


Hs.31432 






424401 




Hs.169681












KIAA0456 protein 
















BE386750 


Hs.86978 


prolyl endopeptidase 








Hs.132390 


zinc finger protein 36 (KOX 18) 


3.9 












AW952703 




synaptic nuclei expressed gene 1b 






AW977167 


Hs.155272




3.9 


451030 


H40697 


Hs.181426 


ESTs, Moderately similar to YSHUT threon
Target Exon 


3.9
3.9 




AW581935 


Hs.141480 


Homo sapiens mRNA; cDNA DKFZp434N079 (fr 






AA431791


Hs.113823 


ClpX (caseinolytic protease X, E. coli)


3.9 


AL049795 




hypothetical protein MGC1203 






AI660552 


Hs.76549 


ESTs, Weakly similar to A56154 Abl subst 






AK001745 


Hs.184628


hypothetical protein FLJ10883
















AF273304 


Hs.235376


XPMC2 protein 


3.9 






Hs.18705


KIAA1233 protein 






AA694070


Hs.268835 




3.9 


450899 


AF097994 


Hs.301528 
Hs.177864


ESTs 


3.9

3.9 








axin 2 (conductin, axil) 
















NM_014700


Hs.119004


KIAA0665 gene product 


3.8 


437231 


BE303000 




hypothetical protein FLJ13258 similar to
fibroblast activation protein, alpha 


3.8 












AI204033 


Hs.271461 


ESTs, Weakly similar to I38022 hypotheti




410330 


AW023630 


Hs.159425 


ESTs 


3.8




AW747996 


Hs.160999


ESTs, Moderately similar to A56194 throm










phosphorylase kinase, beta 






AA347720 


Hs.122669 


KIAA0264 protein 


i:i 






Hs.116784 


C200 'i 'gil7296271lgblAAF51562.il (AEC 
thyroid hormone receptor interactor 4 


3.8 


424163 


NM_007071




HERV-H LTR-associating 3 


3.8


424806 


AA382523 


Hs.105689


MSTP031 protein 


3.3 


425878 
432363 


AW964806 
AA534489 


Hs.38085 


Homo sapiens, Similar to RIKEN cDNA 0610
gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens


3.8 
3.8 




AW291196 


Hs.92195 


ESTs 


3.8 


432529 
432908 


AI989507 
AI861896


Hs.304505 


ESTs 
ESTs 


53 


424723 


BE409813 


Hs.152337 


protein arginine N-methyltransferase 3(h


33 


424855 


AW204725 


Hs.25560 


ESTs 


3.8 


414664 


AA587775 


Hs.66295 


multi-PDZ-domain-containing protein


3.3 


437162 


AW005505 


Hs.5464 


thyroid hormone receptor coactivating pr


3.3 


421828 


AW891965 


Hs.279789 


histone deacetylase 3


3.3 
453160
416072 
424906 
446554 



AK001466 
AA436002
W25760 



435401 
448807 
413403 
418323 



452260 
439506 
420077 

436473 

443067 



427138 
442991 
434974 
443399 
401807 
413979 
414680 
417380 
425710 
441247 
445183 



AA317439

AI940507 

AK001952 

R44477 

AI571940 

AA129105

NM_002118 

AI671439 



AF087993 

H46612 

AW977724 

AV647917 

AW978811

AA676939

NM_015156
X54942
AL119796

AI361238

AW512260 

AA013140 

AI193122 

AW157712 

AI077389 

T93500 

AI692181

AI343543 

NM_003642

AI357412 

AA360328

N77624 

BE281238 

AA778711



Hs.26516 

Hs.183161 

Hs.77631 

Hs.239884 

Hs.79000 

Hs.153716 

Hs.301789 

Hs.54443 

Hs.28707 

Hs.318526 

Hs.238039 

Hs.10056 

Hs.7549 

Hs.198425 

Hs.1162 

Hs.196029 

Hs.42644 

Hs.2250 

Hs.91954 

Hs.293815 

Hs.75968 

Hs.107153 

Hs.314451 

Hs.69285 



451246 
415528 
435513 



H2B histone family, member L 

growth associated protein 43 

Homo sapiens mRNA for Hmob33 protein, 3 ; 

nudix (nucleoside diphosphate linked moi 

chemokine (C-C motif) receptor 5 



FLJ12661
hypothetical protein FLJ11090 
-' ' - FU14621 



ESTs, Moderately similar to S65657 alpha 



leukemia inhibitory factor (cholinergic



Homo sapiens HSPC285 mRNA, partial cds 
thymosin, beta 4, X chromosome 
inhibitor of growth family, member 1-lik 
ESTs, Weakly similar to ALU1_HUMAN ALU S
neuropilin 1 



Hs.174185 ectonucleotide py



Hs.115707

Hs.47534 
Hs.269818 
Hs.28792 
Hs.49169 
Hs.126890
Hs.13340 
Hs.157601
Hs.865 



AW118681 

AB007877 

NM_003416 

BE266659 

AA013051 

AV658299 

AW578003 

Z43784 

AI918950 

BE298808 

M50721 

H56499 

AU076674 

AW189232 

R17236 

AW404075 

M78878 

AW236991 

AI765082 
N59027 
AA172106 



Hs.123642 
Hs.33363 
Hs.23262 
Hs.74870 
Hs.252692 



432125 AW972667 



ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to Z195_HUMAN ZINC
Homo sapiens cDNA FLJ11041 fis, clone PL
KIAA1634 protein



RAP1A, member of RAS oncogene family 
phosphatidic acid phosphatase type 2H 
hypothetical protein FLJ20424



Hs.279453 ESTs 

[ 

Hs.332086 
Hs.159275 
Hs.128051
Hs.12385 



Hs.39140 

Hs.42785 
Hs.7314 
Hs.102495 
Hs.38363 
Hs.48317 



zinc finger protein 7 (KOX 4, clone HF.1 
Homo sapiens, Similar to RIKEN cDNA A430
topoisomerase (DNA) II binding protein
ESTs

tropomodulin 3 (ubiquitous)

ankyrin 3, node of Ranvier (ankyrin G) 

EphA3 

DKFZP434N093 protein 
ribonuclease, RNase A family, k6 
H2.0 (Drosophila)-like homeo box 1 
ESTs, Weakly similar to I38022 hypotheti 
eukaryotic translation initiation factor
cutaneous T-cell lymphoma tumor antigen
gb:yg12c04.r1 Soares infant brain 1NIB H
DC11 protein 



Hs.3085 
Hs.23598 
Hs.204715 

Hs.138211 ESTs



ESTs 
ESTs 

gb:yv59d11.r1 Soares fetal liver spleen
Rag C protein 
Target Exon 

KIAA0054 gene product; Helicase 

" 9 protein (Rubinst:- 'n-Taybi 3 



ESTs 



Homo sapiens cDNA FLJ12300 fis, clone MA



KIAA115( 
ESTs 
MSTP032 protein 
AI732358




ESTs, Moderately similar to A37413 calbi 




AW970536 


Hs.105413 


ESTs 


440474 


AI207936 




gamma-aminobutyric acid (GABA) A recepto




AF151063 


Hs.184456


hypothetical protein 


447397 


BE247676 


Hs.18442


E-1 enzyme 




NM_000020


Hs.172670


activin A receptor type II-like 1


418293 


AI224463


Hs.16063


hypothetical protein FLJ21877




AL046851 


Hs.153053 


CD37 antigen 


415015 


NM_002109 


Hs.77798 


1 tid) I riA synthetase 




AK000166 


Hs.288809 


hypothetical protein FLJ20159


408113 


T82427 


Hs.194101 


Homo sapiens cDNA: FLJ20869 fis, clone A




AK001972 


Hs.30822 


hypothetical protein FLJ11110




AA832479 


Hs.72060 


ESTs 




AA305688 




UDP-Gal:betaGlcNAc beta 1,3-galactosyltr




AI150730


Hs.265640 


ESTs, Moderately similar to A47582 B-cel








23001 j* ,| ^70 o, "i tj|B =,31613.1! (AE 


440749 




Hs.7392 


hypothetical protein MGC3199 




AW025430 


Hs.155591


forkhead box F1 








protein S (alpha) 




BE620738 


Hs.173125 


peptidylprolyl isomerase F (cyclophilin




AW504027 


Hs.15301 


Homo sapiens cDNA FLJ12596 fis, clone NT 
Eos Control 








KIAA1323 protein 




AW993524 








T49300 


Hs.35304


Homo sapiens cDNA FLJ13655 fis, clone PL








phosphoribosyl pyrophosphate synthetase 




AK002015 


Hs.5887 


RNA binding motif protein 7 


443912 


R37257 


Hs.184780 


ESTs 




AI762154


Hs.54982 


Homo sapiens cDNA FLJ14014 fis, clone HE


425836 


AW955696 


Hs.90960 


ESTs 


414565 


AA502972 


Hs.183390 


hypothetical protein FLJ13590




AI963808 




ESTs, Moderately similar to ALU5_HUMAN A




AW504300 


Hs.295605 


mannosidase, alpha, class 2A, member 2 




AJ276678 


Hs.20101


HEF like Protein




R61680 


Hs.26904 


ESTs, Moderately similar to Z195_HUMAN Z 


426848 


H72531 


Hs.36190 


ESTs 


439653 


AW021103 


Hs.6631 


hypothetical protein FLJ20373


450094 


AI174947 


Hs.295789 


Homo sapiens mRNA; cDNA DKFZp564D1164 (f


402474 






NM_004079:Homo sapiens cathepsin S (CTSS


424800 


AL035588 


Hs.153203 


MyoD family inhibitor 




AA361258


Hs.237868 


interleukin 7 receptor 


408968 


AI652236 


Hs.49376 


hypothetical protein FLJ20644


453548 


AL079983 


Hs.116774


integrin, alpha 1 


451367 


AA923729


Hs.26322 


cell cycle related kinase 


444445 


AA342329


Hs.115920


Homo sapiens cDNA: FLJ22816 fis, clone K 


445472 


AB006631 


Hs.12784 


Homo sapiens mRNA for KIAA0293 gene, par 


400658 






ENSP00000237081*:KIAA1217 PROTEIN (FRAGM 


408101 


AW968504 


Hs.123073


CDC2-related protein kinase 7






U67058 
AW979177 
AV650179 
F09247 



Hs.302689 L, r _..._. , 
Hs.67991 hypothetical protein DKFZp434G0522 
Hs.102897 CGI-47 protein 
Hs.83938 ESTs, Moderately similar to MAS2_HUMAN M
Hs.154299 Human proteinase activated receptor-2 mR

gb:EST391287 MAGE resequences, MAGP Homo 



Hs.282431 ESTs 
Hs.247735 



alpha 10 



431304 
406247 
411653 
412639 
419168 
433561 
436207 
438461 
455473 



442355 
422473 
407816 



AA334774 
AW075485 
AW984788 
BE564214 
AL037228 
AW732802 

U94780 



Hs.71168 Homo sapiens clone 24674 mRNA sequence 

Hs.296235 ESTs 

Hs.33718 Homo sapiens cDNA FLJ12641 fis, clone NT

Hs.20104 hypothetical protein FLJ00052

Hs.12845 hypothetical protein MGC13159 



gb:RC1-HN0015-120400-021-c07 HN0015 Hor 

Hs.102946 ESTs 

Hs.82043 D123 gene product 

Hs.2132 epidermal growth factor receptor pathway 

Hs.8262 lysosomal-associated membrane protein 2

Hs.117242 meningioma expressed antigen 6 (coiled-c

Hs.40137 anaphase-promoting complex 1; meiotic ch

Hs.20985 sin3-associated polypeptide, 30kD

Hs.21088 hypothetical protein

Hs.22599 atrophin-1 interacting protein 1; activi

Hs.267086 ESTs 

Hs.153678 reproductions

II ji 1 ul ti d jin q i ieE2, RAD 1 

Hs.737 immediate early protein

Hs.10283 RNA binding motif protein 8B

gb:QVO-CT0224-261099-035-g02 CT0224 Hor

Hs.269496 ESTs, Weakly similar to I38022 hypotheti



AI927212 

AF053551 

AW846787 

AI277652 

T97717 

AA173632 

T75202 

W21874 

D87470 

AF061871 



Hs.30211 hypothetical protein FLJ22313
Hs.157774 ESTs 
Hs.125790 
Hs.212715 I 
Hs.3734 
Hs.31584 



441124 
416128 
445101 
407949 
413492 
421044 
422603 
427399 



442906 AW296888 

445840 AI277811 

446084 AL137658 

435293 AI040777 



414922 D00723 



AA910353
AL050163 
D87742 
AB029015 



408051 AI623351 



414279 
440426 
423872 
420154 



AA220238
AA305627
AW021691 



Hs.12314 
Hs.247057 
Hs.75400 
Hs.101302 



CDC14 (cell division cycle 14, S. cerevi
Homo sapiens mRNA; cDNA DKFZp586C1019 (f
ESTs, Weakly similar to 2109260A B cell 
KIAA0280 protein 

Human DNA sequence from clone RP1-238D15 



Homo sapiens mRNA; cDNA DKFZp434i1216 (f 
ESTs 

Target Exon 

Homo sapiens mRNA; cDNA DKFZp586C1723 (f
hypothetical protein FLJ11220
Homo sapiens mRNA for FLJ00116 protein,
NM_003763:Homo sapiens syntaxin 16 (STX1
Target Exon 

ENSP00000252117:Myelin transcription fac
Target Exon 

hypothetical protein MGC3032 
gb:MR4-ST0121-141099-010-G06_1 ST0121 Ho 
glycine cleavage system protein H (amino 
ESTs, Weakly similar to ALU1_HUMAN ALU S
F-box only protein 4 

ESTs, Weakly similar to T23482 hypotheti
DNAX-activation protein 10
KIAA0268 protein
phospholipase C, epsilon 2
glutamine-fructose-6-phosphate transamin
myelin gene expression factor 2 



Hs.300383 

Hs.77631 
Hs.191534 
Hs.49210 
Hs.292815 
Hs.117339 
Hs.241552 
Hs.54886 
Hs.1674 
Hs.44268 
Hs.2551 
Hs.172148 
Hs.94986 



Hs.95420 JM27 protein



ribonuclease P (38kD) 
ATP-binding cassette, sub-family C (CFTR 
GCN5 (general control of amino-acid synt
Homo sapiens cDNA FLJ13663 fis, clone PL



phospholipase A2, group IVC (cytosolic, 






444001 AI095087 



441401 
457718 
429500 
424240 
447476 
419440 
421786 



445784 
446677 
426448 
445921 
406673 
430387 
427157 



AW975944 
AI906055 
BE616160 
AA351584 
AW003850 
AI356967 
AA179949 
BE049102 
AK001455 

X63692 

AI253155 

AI800311 

R06054 

AW015211 

M34996 

AW372884 



Hs.143535 
Hs.20880 
Hs.90419 
Hs.21351 
Hs.237396 
Hs.57749 
Hs.82829 
Hs.100543
Hs.12532 
Hs.43086 
Hs.175563
Hs.179982 
Hs.5198 
Hs.77603 
Hs.77462 

Hs.156291

Hs.146181
Hs.198253
Hs.240770 
Hs.173824 
Hs.258397 



dual specificity phosphatase 5 
solute carrier family 21 (organic anion
Homo sapiens cDNA FLJ13542 fis, clone PL
ESTs, Moderately similar to S65657 alpha
gb:PM3-LT0032-030100-006-e08 LT0032 Homo
ESTs, Weakly similar to I38022 hypotheti 
ESTs 

ESTs, Weakly similar to ALU4_HUMAN ALU S 
hexabrachion (tenascin C, cytotactin)
calcium/calmodulin-dependent protein kin 
ESTs, Weakly similar to I38022 hypotheti 
KIAA0882 protein 



ESTs 
ESTs 

synaptic nuclei expressed gene 2; KIAA10
protein tyrosine phosphatase, non-recept 
Homo sapiens clone 24505 mRNA sequence 
chromosome 1 open reading frame 21 
ESTs, Weakly similar to A46010 X-linked
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 
ESTs, Weakly similar to TRHY_HUMAN TRICH

ESTs 



gb:ye89g07.r1 Soares fetal liver spleen 



nuclear cap binding protein subunit 2, 2 






451102 AA015683 

416882 AI633044 

414052 AW57884! 

450580 N40087 



Hs.41185 Homo sapiens mRNA; cDNA DKFZp56401262 (f

tryptophanyl tRNA synthetase 2 (mitochon
Hs.283552 ESTs, Weakly similar to unnamed protein 



Pkey: Unique Eos probeset identifier number



CAT Number Accession 



410531 
410886 
411315 
411962 



1225822.1 
1238570.1
126744.1
143227.1



1539409.1
157163.7 
162718.1 

219674.1 
267323.1 
325772.1 



468139.1 
47334.1 
836.1 



452260 
453024 
455473 
456719 



AW752953 H88044 BE156092
AW809324 BE144977 BE144956
AW836547 AW836513 AW836587
AA099050 AA099526 T47733

R17236 R52580 F11642 

AA173632 AI174858 AA581361 AI700024 AA173988 BE165417 AI366964

AI633044AW016212AW241143AA769056 R43272 AW068958 AA21D918 M293774 AI74BB15 AI763294 AI333114 AI277384 AI088297 AI468477 
AI824624 AW189606 AI631751 Z40749 A1934673 AI67131 6 AM 89024 AW235412 BE1784 ? 
N59027 AA314694 N53937 R08100 
R06054 AA378789 AW956453

AiiunToaon «a^w? AA489 ^ ^5 AW466994 AW469044 N59350 AI819642 AI280239 AI220572 AA7B9302 AI473611 AW841 1 26 D60937 

AW972830 AA527647 AA489820 AA570362 

BE157283 BE157287 AA502438

AI685464 AW971336 AA513587 AA525142 

AW972667 AA526539 AI057032 AW167842

AA534489 AW970240 AW970323 

™°1 98 , T J 73644 AW439860 AW0D7307 AI762577 W8651 6 AA1 60485 AA974203 AI589521 AW451 857 AW45D602 AI702529 AA630766 
AI801808AW611634AI393b36AV, 3 3 w ?3012AW51 «iruo«AA^bb 

AA156781 AW293839 U52054AA024963AW8446BE073977AVV4449D4AW6D2574BE164040BE164012BE163972BE163974BE163992 
AA837481 AW468444 BE185091 AW468002 M687333 AA81 1830 AA581305 AI866686 AI572124 AA04 4BE163S92 
AW979177 AA846994 AA829672 
W76326 AF086341 W72300

»,1^ 6 ° 009278 U76833 AWB30055 AW47 1 1 33 C02434 W45237 AW79351 8 BED701 1 2 AI587479 AI624429 AW1 90536 AI446661 AI478772 
AW022667 AA528235 AA599775 AW613820 AI435793 AW594230 AI051758 AI200109 AI680296 

N4ra7H12925AA460779AA096372BE160847AW616890AW816B93AW81B577AW816891 AW816882AW816868AW816889AW816940 
AW816892AW816941 AW81657BAA029183T83320 H86850AA01C295BE1b082^ 35 f> I50SB3 WB 

^S 8 S° 4251 U ' l4103AlD7i NE 463 1 ,,7 ^ l ' ,l479232 T82809 R67109AWra6099AI833027AI765395AI763029AI758228 
BE326331 AI934576 AI922378 AW276431 AI71 8466 N36566 AA904753 BE4S4245 AI338752 Al 

^^l^^°^^^^^2hWmOS2W!i2%^ rj " 1 MAW903374AW839045AW903373AW903352 
AW839043AW839049AW903372AWB46755AW846-67AA9n f i 66BE14662E f- \A84"302 AW 

ZITllo ZT" 1 A ™ 34807 AW98 " 81 9 AW98479 ° AW984782 AW9M784 AW984780 AW964614 AW984795 AW984793 

AW984789 AW984823 AW948021 AW984802 AW984800 AW984799 AW984825 AW984792 AW984821 AW984820AW 
^Jni^ 3 . 382 ^ 572911 7 ^ 44 " 1 ' 1 M ( ' 4C C BE5D2530Z25032 AA805324AA449241 AI651B25 AI264863 

AW196918 AA948267 AI953735 AI263703 AA319159 AW964436 A1903440 AW594171 AI867447 AW2 



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
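As an illustration of how a nucleotide-position entry of the kind listed in this table might be read programmatically, the following sketch parses one value into exon intervals. It assumes that entries are comma-separated start-end pairs and that coordinates are inclusive; the text does not state the coordinate convention, and the function names are hypothetical.

```python
def parse_nt_position(field):
    """Parse 'start-end[,start-end...]' into a list of (start, end) integer tuples."""
    exons = []
    for part in field.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum of exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Example value taken from the position column of the table.
exons = parse_nt_position("198991-199168,199316-199548")
length = total_exon_length(exons)
```

An entry with a single range, such as 115386-116348, parses to a one-element list in the same way.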



Ref 

8439786 
8118459 
8119067 
9719705 
6649315 
3293210 
2576349 
7331536 



9438331 
7105492 
7708872 



Nt_position

115386-116348 

73525-73644 

198991-199168,199316-199548 

176341-176452 

157315-157950 

65838-66031 

14595-14751 

152325-152912 



13683-13874 

163415-163634 

69340-69615 



39624-40072 

99861-100054 

159913-160605 

37764-37877 

46234-46461 



Table 63A lists about 366 genes significantly down-regulated in prostate cancer compared to normal prostate. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" normal prostate to "average" prostate cancer tissues was greater than or equal to 2.5. The "average" normal prostate level was set to the 75th percentile of normal prostate tissues. The "average" prostate cancer level was set to the 75th percentile amongst tumor samples. In order to remove gene-specific
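The selection rule described for Table 63A can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the procedure actually used to build the table: the function and variable names are hypothetical, the expression values are invented, the 75th percentile is computed with linear interpolation (the text does not say which percentile convention was used), and the gene-specific correction mentioned in the text is not modeled because its description is incomplete here.

```python
from statistics import quantiles


def p75(values):
    """75th percentile using linear interpolation (the 'inclusive' method)."""
    return quantiles(values, n=4, method="inclusive")[2]


def down_regulated(normal, tumor, min_ratio=2.5):
    """Return {probeset: ratio} where p75(normal) / p75(tumor) >= min_ratio.

    normal, tumor: dicts mapping a probeset key to a list of expression values.
    """
    selected = {}
    for pkey, normal_values in normal.items():
        tumor_level = p75(tumor[pkey])
        if tumor_level <= 0:
            continue  # ratio undefined; a real pipeline would handle this explicitly
        ratio = p75(normal_values) / tumor_level
        if ratio >= min_ratio:
            selected[pkey] = round(ratio, 1)
    return selected


# Hypothetical expression values, invented purely for illustration.
normal = {"A": [100, 120, 140, 160, 180], "B": [50, 55, 60, 65, 70]}
tumor = {"A": [20, 30, 40, 50, 60], "B": [40, 50, 60, 70, 80]}
result = down_regulated(normal, tumor)
```

In this toy input, probeset "A" passes the threshold and probeset "B" does not, which mirrors how a row would or would not appear in a table of this kind.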



TABLE 63A: ABOUT 366 GENES SIGNIFICANTLY DOWN-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title



425932 M81650 

407245 X90568 

425545 N98529 

426752 X69490 

400440 X83957 

412519 AA196241 

413778 AA090235 

420813 X51501 

415337 Z44881 

407013 U35637 

433331 AI738815

432117 AL036195 

453863 X02544 
431847 



Unigene ID 

Hs.172004 
Hs.158295
Hs.172004 
Hs.33870 
Hs.73980 
Hs.75535 
Hs.99949 
Hs.9012 



403612 
405001 
441490 



AI791314 
T83061 
AI459021 



Homo sapiens, clone MGC12401, mRNA, com



404606 
432326 
459708 
440808 
458339 
454278 
417032 
434352 



AF217525 
AA192469 
AF129505



Hs.274361

Hs.168812

Hs.7432

Hs.172843

Hs.49002 



417364 N73749 
400831 

416935 AA190712 

442082 R41823

452625 AA724771 



Hs.271113 
Hs.157752 
Hs.222475 



ESTs 

Down syndrome cell adhesion molecule 
ESTs 

small muscle protein, X-linked 
ESTs 

Homo sapiens mRNA full length insert cDN 
ESTs 



430611 
415981 
441040 
442764 
411426 



458895 
417729 
431627 
420721 
449519 
442089 
436781 



418626 
423772 
457136 
436404 



Hs.131122 
Hs.188757 
Hs.213049 



gb:QVO-HT0101-061099-032-c04 HT0101 He 
Homo sapiens, clone MGC 
Target Exon 



Hs.126083 
Hs.137240
Hs.174782



Hs.119740 ESTs 



EAP30 subunit of ELL complex 
ESTs 

Homo sapiens mRNA for partial 3'UTR, seq 
ESTs, Weakly similar to KIAA1437 protein
Target Exon 



gb:EST35805 Embryo, 8 week I Homo sapien 




436190 
400164 
435519 






R46799 
AA167670
AI028767 
AL120600 



463412 
401411 
415925 



Hs.125461
Hs.21417 
Hs.23966 



H09474 

H24185 

BE048401 

AA682421

AW189097 

AI821465 

AW836646 

BE250521 



446175 
459707 
406917 



408229 
427318 
417620 
404660 
409144 
436524 
459200 
418833 
424887 
404984 
454549 
426736 
434552 
438040 
443618 
442097 



gb:Homo sapiens cDNA FLJ10197 fis, clone
Eos Control

hypothetical protein FLJ11539



BE154173 
AI796730 
M025647 



family 12, (potassium-chl 



gb:DKFZp762E152_r1 762 (synonym: hmel2)
Target Exon 

gb:AJ003290 Selected chromosome 21 cDNA 
ENSP00000247172*:HYPOTHETICAL 126.2 kDa 

Hs.202341 Homo sapiens cDNA: FLJ23573 fis, clone L 
hypothetical protein 

Hs.196511 ESTs

Hs.59125 ESTs

Hs.166597 ESTs

Hs.188810 ESTs, Weakly similar to ALU6_HUMAN ALU S
gb:PM3-LT0032-090100-008-e05 LT0032 Homo 
ras homolog gene family, member A 
gb:HSC13A121 normalized infant brain cDN 



gb:np86b01.s1 NCI_CGAP_Thy1 H< 
gb:H.sapiens nestin gene. 
gb:PM1-HT0340-201299-00«12 HT0340 Homo 
ESTs 

gb:ze85d01.r1 Soares_fetal_heart_NbHH19W



Hs.279714 
Hs.221037 
Hs.30148 
Hs.292776 
Hs.259461 



Hs.130722 
Hs.325116 
Hs.121846 
Hs.300700 
Hs.128474



C9000841':gi|12654691 kjb|AAH01 1 85.1 |AAH0 



Hs.210628 
Hs.287631 
Hs.335440 



Homo sapiens, clone MGC:2962, mRNA, comp
ESTs 

hypothetical protein FLJ20727



butyrophilin-like 3
hypothetical protein FLJ11729
hypothetical protein FLJ20036
NM_003105*:Homo sapiens sortilin-related

Homo sapiens cDNA FLJ14269 fis, clone PL 



440207 
445045 
406177 



L09078 
AI253414



442027 
447889 
405152 
402460 
448516 
450588 
447600 



Hs.190312
Hs.128395
Hs.346398 



Target Exon 

gb:Homo sapiens mRNA fragment 
ENSP00000252164*:KIAA1578 protein (Fragm
gb:aq14f04.x1 Stanley Frontal NS pool 2



Target Exon 

C1001261*:gi|2695979|emb|CM7O854.1|(YO 
gb:RC1-NN0073-260400-011-g09 NN0073 Homo



446516 
415696 
410140 



Hs.291742 
Hs.12915
Hs.13277
Hs.257557
Hs.14822
Hs.247837
AI733837 Hs.145661
AW410380 Hs.116056 
BE090690 

AK001494 Hs.202596 



Hs.283105 ESTs 



IDSC43 

gb:RC1-BT0720-280300-011-g02 BT0720 Homo
Homo sapiens cDNA FLJ10632 fis, clone NT








AI830851 


Hs.200014 


ESTs, Weakly similar to ACIDIC PROLINE-R


3.2 




AW002425 


Hs.224142 


ESTs 


3.2 






Hs.22833 


ESTs 


3.2 




AW974812 


Hs.291971 


ESTs 


3.2 




AW235786 


Hs.195359


hypothetical protein MGC10954


3.2 




Hs.161008 


ESTs 


3.2 


434407 


AW815333 




gb:QVO-ST0215-060100-083-g01 ST0215 Homo 


3.2 


429973 


AI423317 


Hs.164680 


ESTs 


3.2 


418092 


R45154 


Hs.338439 


ESTs 




411356 


H45377 




gb:yn99h03.r1 Soares adult brain N2ti5HB5 


11 








ESTs, Moderately similar to 2109260A B c 


3.1 


422069 


AJ010063 


Hs.343603 


titin-cap (telethonin)




420778 


AW970512 




gb:EST382593 MAGE resequences, MAGK Homo 


3.1 


403451 








3.1 


451686 


AA059246 


Hs.110293


ESTs 


3.1 


423837 


AW937063 


Hs.275150 


gb:PM3-DT0037-231299-001-g11 DT0037 Hoto 


3.1 


422731 






gb:DKFZp434A1229 r1 434 (synonym: Hes3) 


3.1 


450049 


AI681234 


Hs.258609 


EST 


3.1 


428656 


AB037798 


Hs.188790 


KIAA1377 protein 


3.1 








Target Exon 


3.1 


418087 


M961613 


Hs.127838 




421813 


BE048255 




gb:tz49b05.y1 NCI_CGAP_Bm52 Homo sapien 


11 


402490 






Target Exon 


3.1 


448001 


AW293237 


Hs.202037 


ESTs 


3.1 




AI219833 


Hs.166767 


ESTs 


3.1 


405150 






Target Exon 


3.1 




BE165819 


Hs.207684 


ESTs 


3.1 


425198 


M352090 


Hs.128003 


hypothetical protein FLJ21213 


3.1 




AB040919 


Hs.210958 


KIAA1486 protein 


3.1 


421894 


AI418464 


Hs.1 90836 


ESTs 


3.1 


419386 


M236867 




ESTs, Weakly similar to I38022 hypotheti


3.1 




BE305242 


Hs.1 6098 


claudin 2 


3.1 


405422 






ENSP00000216658*:HYPOTHETICAL 133.5 kDa 


3.1 
3.0 




M278221 


Hs.173344 


ESTs 


3.0 


410627 


M181339 


Hs.929 


myosin, heavy polypeptide 7, cardiac mus 


3.0 




W87707 


Hs.82065 


interleukin 6 signal transducer (gp130,


3.0 








3.0 


408025 


AI692784 


Hs.41767 


PTD002 protein 


3.0 


426349 


AI308855 


Hs.301497 


arginyltransferase 1 


3.0 






Hs. 182045 


ESTs 


3.0 




AC005551 


Hs.130714 


Homo sapiens HSPC323 mRNA, partial cds 


3.0 


441335 


BE222470 


Hs.150825 


ESTs, Weakly similar to antigen NY-CO-33


3.0 


459647 


R34107 


Hs.321450 


pregnancy specific beta-1-glycoprotein I 


3.0 


435713 


M699313 


Hs.1 14071 


ESTs 


3.0 


407275 


AI364186 




gb:qw34h07.x1 NCI_CGAP_Ut4 Homo sapiens 


3.0 


456103 


Z39430 


Hs.72350 


ESTs 


3.0 


435200 


AA670310 


Hs.145903 


ESTs 


3.0 


449245 


A1636539 


Hs.224296 


ESTs 


3.0 


428132 
443454 


AI057494 




gb:zu24g02.s1 Soares_NhHMPu_S1 Homo sapi 


3.0 


Hs.133421 


ESTs 


2.9 


423732 


AF058056 


Hs.132183 


solute carrier family 16 (monocarboxylic


2.9 


411944 






gb:QV2-PT001 0-1 6040O-1 33-gD1 PT0010 Homo 


2.3 


432718 


AA563943 


Hs.244371 


ESTs 


2.9 


435534 


AA830927 


Hs.1 17306 


ESTs 


2.9 


430348 


AA476915 




ESTs, Weakly similar to I38022 hypotheti 


2.9 


410289 
453126 




Hs.61935 


Homo sapiens mRNA; cDNA DKFZp761 1071 (fr 


2.9 


AA032155 


Hs.61622 


ESTs 


2.9 


459421 


AA233154 


Hs.12532 


chromosome 1 open reading frame 21 


2.9 




Al 73301 4 


Hs.269715 


ESTs 


2.9 


442252 


AI733395 


Hs.129124 


ESTs 


2.9 




AI469453 


Hs.222760 


ESTs 


2.9 


443727 




Hs.1 8459 


ESTs 


2.9 








gb:HSZ99362 DKFZphamyl Homo sapiens cDNA 


2.9 


453898 


AW003512 


Hs.232770 


arachidonate lipoxygenase 3 


2.9 


415098 


D59687 




gb:HUM056E10B Clontech human fetal brain 


2.9 


406398 


D61119 




gb:HUM153C11B Clontech human feta' brain 


2.9 






Target Exon 


2.9 


433942 


AW272166 


Hs.123465 


2.9 








Target Exon 


2.9 


442640 


AI205646 




2,3 






Hs249596 






438361 


AA805666 


Hs.146217 


Homo sapiens cDNA: FLJ23077 fis. clone L 


2.9 


422482 
407142 


AI439905 




gb:ti57g08.x1 NC:_CGAP_Lym1 2 Homo sapien 


2.3 


AA412535 




gb:zt99b10.s1 Soares testis NHT Homo sap 


2.3 


454319 


AW247736 


Hs.101617 


ESTs, Weakly similar to T32527 hypotheti 


2.8 


442907 


AI023763 


Hs.79707 


ESTs 


2,3 


456075 


N73442 




gb:yz31h09.r1 Soares multiple sclerosis 
ESTs 


438690 


AA815031 


Hs.123598 


2.8 


452570 


AW861293 


Hs.336940 


Homo sapiens cDNA: FLJ20861 fis, clone A


2.3 


408493 


BE206854 
AI885187 


Hs.46039 


phosphoglycerate mutase 2 (muscle)


2.3 


447556 


Hs.210710 


ESTs, Moderately similar to ALU6 HUMAN A 
EST 


2.8 


451632 


BE005934 


Hs.310625 


2.3 






451004 
425223 
405770 
455227 
454445 
419494 
427639 
428079 
406337 



T84239 

AW749432 

W01060 

AW444530 

AA421020 



450044 
403630 
445514 
446362 
432492 



AI241280 

AW612481 

AW275110 

U22491 

AW411330 

BE067952 

AF015041 

NMJ03063 



Hs.148906 
Hs.104105 
Hs.271106 
Hs.248117 
Hs.1 18796 



gb:EST376014 MAGE resequences, MAGH Hoi 
Target Exon 

gb:Homo sapiens mRNA for immunoglobulin 



gb:zf53d09.r1 Scares retina N2MHR Homo 
gb:EST60880 Activated T-cells XX Homo sa 
NM_002362:Homo sapiens melanoma antigen, 

gb:RC3-BT0386-301299-01 1-a09 BT0386 Homo 
ESTs 

Homo sapiens, clone MGC:1 8257, mRNA, com 
ESTs 

C14000021 :gi|7242973|dbj|BAA92547.1 1 (AB 
gb:Human gene for cytochrome P-450c1 1, e 



C3001 708-:gi|4758028|ref|NP_004360. 1 1 co 



405678 
432789 
455791 



404979 
412318 
424361 
412542 
441975 
457021 
457861 



AW936911 
AK001551 
AW961516 
AW173240 
AW968934 
AW450205 
AF087987 
U19878 
T51454 



Hs.326729 
Hs.145944 
Hs.95097 
Hs.344285 
Hs.173108 
Hs.305890 
Hs.42696 



Hs.82845 



gb:CMO-BT0365-061299-122-g09 BT0365 Homo 

numb (Drosophila) homolog-like 

sarcolipin 

gb:601 1 1 1 034F1 NIH_MGC_1 6 Homo sapiens c 
gb:DKFZp547F152_r1 547 (synonym: lifbrl) 
CX001454:gi|8393794|ref|NP_0586o1.l | myo 
KIAA0042 gene product 

gb:RC1-BT0720-280300-01 1-108 BTD720 Homo 
ESTs, Weakly similar to ALU7_HUMAN ALU S 

gb:yg21 M 1 ,r1 Soares infant brain 1MIBH 
C10001 18*:gi|9951 913|ref|NP_052832.1 1 pr 
U6 snRNA-associated Sm-like protein 
ribosomal protein S1 4 
Target Exon 

hypothetical protein MGC1 1082 

Homo sapiens cDNA FLJ10689fis, clone NT 

ESTs 

EST 




443650 
427419 
423741 



402706 
436054 
402749 
442472 
445762 
441420 



AI215667 
AA813125 
AW511049 
R47948 
AW299606 
AI698330 
NHL000200 
AA330362 



Hs.188732 
Hs.232777 
Hs.151444 
Hs. 177888 



AI734002 Hs.264590 



KIAA0460 protein 
ABO blood group (transferase A, alpha 1-
gb:yd22f08.s1 Sc " 



gb:EST34074 Embryo, 12 week II Homo s a P i 

ESTs 

ESTs 

Target Exon 
ESTs 

Target Exon 

gb-.MR0-ST002O-081 1 99-004-c03 ST0020 Homo 

T ' imi Ml HUM 

gb:oo57d07.s1 NCI_CGAP_Lu5 Homo sapiens 
Target Exon 






406003 
459584 
441597 
411280 



A1910884 

AW1 35032 

N50B17 

M83554 

AA830811 

BE172762 

T78001 



451255 
455737 
445326 



444130 
422210 
424241 
451606 
400427 



Hs.346429 
Hs.203625 
Hs.80506 
Hs.1314 



BE072246 
AI220072 
AW1 95262 
BE326671 
AA913819 

AL040177 

NM_004504 



Hs.90744 
Hs.344672 



tumor necrosis factor receptor superfami 
ESTs 

ESTs, Weakly similar to ALU5JHUMAN ALU S 
hypothetical protein FLJ21120 
C16000902:gi|403440|gb|AAA7316e.1j(M:i7 
gb:za62d04.r1 Soares fetal tiver spleen 
ESTs, Weakly similar to I38022 hypotheli 
ESTs 



gb:nc25c06.r1 



;i_CQAP_Pr1 Homo sapiens 



gb:QV4-BT0536-271299-059-b1 2 BT0536 Homo 
ESTs 

gb:xnS7b05.x1 NCI_CGAP_CML1 Home sapiens 



Hs.309071 ESTs 



gb:DKFZp434F0213_r1 434 (synonym: htes3) 
gb:PM1 -CT0247-0801 00-008-hC9 CT0247 Homo 
HIV-1 Rev binding protein 
ESTs 

steroid dehydrogenase-like

Homo sapiens pyruvate dehydrogenase k:na 

AIE-75 binding protein protein 



419354 
447610 
450724 
414523 
414776 
447730 
434152 
412671 
456401 
404678 



gb:MRO-HT0165-140200-009-d04 HT0155 Homo 
ESTs 

H (beta-2 glycoprotein I) 



M1 55598 
AI421251 
AI792665 
AW977734 
W28I46 



gb:yJ79b05.r1 Soares I 
serine (or cysteine) proteinase inhibito
hypothetical protein FU14195; KIAA1714 
" no sapiens mRNA for K1AA1755 protein, 



s.291190 ESTs 



gb:EST389963 MAGE resequences, MAGO Homo 
gb:43f1 1 Human retina cDNA randomly prim 
Target Exon 



AL080137 
AF090887 
BE539951 
AW450551 



Hs.22580 

1s.1 93743 ESTs 

Hs.306562 Homo sapiens clone HQ0085 

Hs.306996 Homo sapiens, clone IMAGE:3447073, mRNA, 



415131 
415386 
415981 



1048462J 
1058667J 
114787J 
1204995.1 
1238624_1 
1239102_1 
1240273J 
1245515„1 



1351 148J 
1365950_1 
1373392_1 
151B27J 
1522174_1 
1523680J 
1535560_1 
1564242_1 
1604B48J 
163179J 
184356_1 
1888662J 



AW176091 H24234 
AW195262 R2786BAWB11262 

BE250521 AA076837 BE249870 AAS64291 AW502442 AW501 551 AI221491 AA194239 D63046AA1 93426 AA773243 AA193293 
BE163567 BE073689 BE073747 BE073780 BE073739 BE073748 BE163495 AW750178 BE163491 BE073763 BE073671 
AWB36646 AW836580 AW836610 AW836636 AW836503 AW836632 
AWB3721 2 AW837408 AW837265 AW837380 AW83721 3 AW33741 1 AW837418 



BE141714AW845993AW845989 



AW8771 39 AW8771 35 AW877018 AW99 1 835 AW8771 28 AWB771 08 AW877017 AW877107 

BE067952 BE067945 BE067942 BE067943 BE057949 BE067954 BE067944 BE067953 BE057956 BE067946 

BE090690 BE090688 BE090681 BE090693 BE090675 

BE144228 BE144291 

AI733907 AA1 59708 At 73261 4 

D59687 D59694 D59656 D59589 

D61119D81508 D81734 

Z43087 F07410 H15506 H54108 R95033 H98000 

R35694 H12035R53312 

W03955 H82332 H69247 H72486 

AA190712 AA1 90665 AA252564 

AA236867 AA237066 AA354236 AW957759 H08S61 



AW970512AA280251 AI652287BE466438AI650725AA551854AA281574AW571481 




421342 
421813 
422731 
423741 



423871 
426481 
428132 
431847 
434407 



441420 
442472 
445797 
447518 
447600 
448516 



451004 
452351 
453412 



454549 
455048 
455210 
455649 
455737 
455791 



201427J 
207654.1 
220507J 
231582J 



232749.1 
257878.1 
287430.1 
338402.1 
385744.1 
405360.1 
41555.1 
45946.1 
516748.1 
543371.1 



766241.1 
81926.1 
844585.1 
85453.1 



1476756 1 
177089.1 
1844649.2 



AA504749AA287498 
BE048255AA313083AA298419 
AL138411 AL138412 AA315860 
M330362AW962525H87796 

M306637 NM.007241 AF156102 BE388339 BE614027 AW93501 6 AA315261 AW37641 8 AW304951 AI859820 AA367436 AW51 6957 BE61 451 9 
AI453152AI453149AI453139AW168378AM39491 A I 538368AW468227AI580027AW090513AA662830F30995AI351985AI424349AW009599 
C02215AI6525 
AA331906AA332484 

AW963941 AW963944 AA379825 AA379564 
AA421765 AA456076 AI290275 AA455579 
AI791314 A1791434 AA516511 
AW81 5333 AW81 5409 AA632563 
R61083 R13743AA679174 AA679193 Z42903 

65L09102 

8110 
32 

AI253414A1366014 R34822 
T80061 AI382804 
AI42Q990 A1399725 A1401757 

AW898595 AW898588 AW898590 AW898663 AW89S592 AI525093 
BE258051 R45758AA004732BE255126 



AL040177 

AL1 20800 BE378580 
AL138129 AL138179 BE064231 
AW749432AW749434R47332 

AW806910 AW866461 AW865396 AW866373 AW86661 1 AW86661 6 
AW852749 AW852755 AW852620 
AW866599 AW866294 AW866468 AW866467 

BE055051 BE155165 BE064764 BE155231 BE064648 BE064671 BE054636 
BE072246BE072229BE072225 BE072210 BE072221 BE072256 BE072211 BE072242 
BE090689 BE090685 BE090697 BE090680 BE090691 BE090696 BE090698 BE090686 
BE154173 BE154098 BE154096 
N73442 R981Q0 BE410380 



TABLE 63C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt.position: Indicates nucleotide positions of predicted exons.
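The Nt.position entries in this table are written as comma-separated start-end pairs (for example, 68787-68882,76602-76768). As an illustration only, the following minimal sketch shows one way such a field could be read into numeric exon intervals. The function names are hypothetical, and treating both endpoints as inclusive is an assumption that the table legend does not state.

```python
def parse_nt_positions(field):
    """Split a field such as '68787-68882,76602-76768' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas or trailing whitespace
        start_text, end_text = part.split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons):
    """Sum the interval lengths, assuming inclusive coordinates (an assumption)."""
    return sum(abs(end - start) + 1 for start, end in exons)


# One of the position fields listed in Table 63C.
exons = parse_nt_positions("68787-68882,76602-76768")
print(exons)
print(total_exon_length(exons))
```

A parser of this kind would reject fields that the text extraction has damaged (for instance, digits separated by spaces), which can be useful for flagging rows that need manual checking against the original table.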



400831 
401278 
401411 
401656 
401716 
401884 
401896 
401917 
402422 



402706 
402749 
402797 
402817 
402844 
403451 
403612 
403630 
403649 
404260 
404267 
404606 



9796071 
7331445 
8576271 
9799936 



9796344 
9796884 
9797648 
8894426 
9212740 
3421043 
6822166 



108901 -1 09254, 1 1 0246- 1 1 058 1 , 1 1 36 1 3-1 1 3960 

149982-150929 

148640-148805 

68787-68882,76602-76768 

15758-15930 

48611-49012 

54958-55313 

77382-78300 



13909-14466,15251-15760,16898-17431,4174242440 

27141-27247 

51396-51513 

12209-12313,18241-18397 

22310-23269 

168215-168916 

115196-115448 

87762-88217 
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405152 
405258 
405422 
405564 



406085 
406177 
406337 



61057-62075 

42079-42516 

18665-18843 

18930-19148 

90117-90337 

118691-118959 

36296-36610 



Table 64A lists about 703 genes up-regulated in BPH compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos

normal adult tissues was greater than or equal to 3.0. The "average" BPH level was set to the 75th percentile

non-malignant tissues. In order to remove gene-specific background

from both the numerator and the denominator before the ratio was evaluated.

TABLE 64A: ABOUT 703 GENES UP- REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title 

R1: Ratio of BPH tissue to normal adult body tissue 
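
The selection rule described for Table 64A (a background-subtracted ratio of an "average" BPH level to an "average" normal level, kept when the ratio is at least 3.0) can be sketched as follows. This is only an illustration under stated assumptions: the text gives the 75th percentile for the BPH level and the 3.0 cutoff, but the percentile used for normal tissues and the background value are not legible here, so they are left as caller-supplied parameters. All function and parameter names are hypothetical, and the percentile interpolation method is an assumption.

```python
def percentile(values, pct):
    """Percentile with linear interpolation between ranks (method is an assumption)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no expression values supplied")
    rank = (len(ordered) - 1) * pct / 100.0
    low = int(rank)
    high = min(low + 1, len(ordered) - 1)
    return ordered[low] + (ordered[high] - ordered[low]) * (rank - low)


def bph_to_normal_ratio(bph_levels, normal_levels, normal_pct, background, bph_pct=75.0):
    """Ratio of 'average' BPH to 'average' normal expression.

    bph_pct=75.0 follows the text; normal_pct and background are NOT given
    legibly in the text and must be chosen by the caller (assumptions).
    The background is subtracted from both numerator and denominator.
    """
    numerator = percentile(bph_levels, bph_pct) - background
    denominator = percentile(normal_levels, normal_pct) - background
    if denominator <= 0:
        raise ValueError("normal level does not exceed background")
    return numerator / denominator


def is_up_regulated(ratio, cutoff=3.0):
    """Apply the 'greater than or equal to 3.0' criterion stated for Table 64A."""
    return ratio >= cutoff


# Made-up expression values for a single probeset, for illustration only.
ratio = bph_to_normal_ratio(
    bph_levels=[100, 200, 300, 400, 500],
    normal_levels=[40, 60, 80, 100, 120],
    normal_pct=50.0,   # hypothetical choice
    background=20.0,   # hypothetical choice
)
print(round(ratio, 2), is_up_regulated(ratio))
```

Subtracting the same background from both terms means a probeset with a high non-specific signal in every tissue does not pass the cutoff merely because both levels are large.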




407275 
452340 
432473 
410330 
431579 
452792 
418848 



Hs.282070 
Hs.22791 
Hs.13804 
NMJH1851 Hs.154850 
AI364186 

NMJ02202 Hs.505 
AI202703 Hs.152414 
AW023630 Hs.159425 
AW971082 Hs.222886 
AB037765 Hs.30652 
AI820961 Hs.193465 
AA250737 Hs.72472 



AW207206 
AA421773 
AA056060 



H2B histone family, member L 
hypothetical protein FLJ20624
CEGP1 protein 



Hs.161008 
Hs.202577 
Hs.12364B 
Hs, 132458 
Hs. 10531 4 
Hs.190394 
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429290 
407709 
415293 
429918 
440260 
453096 
410467 
456516 



AA456135 

R49462 

AW873986 

AI972867 

AW294631 

AF102546 

BE172704 



Hs.198760 
Hs.23023 
Hs.106541 
Hs.1 19383 



Hs.63931 
Hs.222746 
Hs.203270 
Hs.271923 



neurofilament, heavy polypeptide (200kD)



dachshund (Drosophila) homolog
KIAA1510 protein 

ESTs, Weakly similar to ALU1_HLWAN ALU S 
Homo sapiens cDNA: FU 22785 (is, clone K 



420424 
450642 
434036 



R39773 Hs.7130 

AI659131 Hs.197733 

AA441837 Hs.90250 

AA516531 Hs.55999 

AW295230 Hs.25231 



419743 
417169 



hypothetical protein MGC2849 
ESTs 

NK homeobox (Drosophila), family 3, A



AW40B762 

NM 016625 

R13550 

AI990741 

AF1511Q3 

AA767382 



404592 
409557 
420352 
438231 
44C529 
449300 
417332 
453930 
449249 
430187 
434217 
431217 



R81733 

AI470523 

AW118681 

AI277652 

AL1 37517 

AW302885 



Hs.246773 
Hs.252809 
Hs.1 12259 
Hs.193417 
Hs.121513 
Hs.278081 
Hs.1 39336 
Hs.1 46625 
Hs.72325 
Hs.33106 
Hs.1 39336 
Hs.1 28051 
Hs.54578 
Hs.306201 
Hs.1 171 83 

Hs.211193 

Hs.1 55689 

Hs.346514 



hypothetical protein 



T cell receptor gamma locus
ESTs 

Homo sapiens clone Z'3-1 placenta expres 
ESTs 

ATP-binding cassette, sub-family C (CFTR 
ESTs 

Human DNA sequence from clone RP1-187J1 1 
ESTs 

ATP-binding cassette, sub-family C (CFTR 
Homo sapiens thymic stromal lymphopoiet! 
ESTs, Weakly similar to 138022 hypotheti 
hypothetical protein DKFZp56401278 
ESTs 

NM_022739*:Homo sapiens C3 ubiquitin lig 
ESTs

gb:601117374F1 NIH_MGC_16 Homo sapiens c 
ESTs 

Homo sapiens cDNA: FU21718 (is, clone C 
ESTs 

hypothetical protein FU21511 
hypothetical protein FU 10903 
Homo sapiens mRNA for KIM1764 protein, 



Rho GTPase activating protein 6 
Homo sapiens mRNA: cDNA DKFZp564D083 (fr 
hypothetical protein FU13782 
thyroid receptor interacting protein 15 




408197 
454457 
421154 



450497 
425312 
426140 
440911 
412350 
432600 
435375 
416370 
431359 



424846 

435396 
448004 



AA284333 Hs.287631 

AA452899 Hs.213586 

N71831 Hs.256398 

H04607 Hs.9218 

AI239923 Hs.63931 

AW976434 Hs.3623 

H64159 Hs.15328 

AA354940 Hs.1 45958 

AF13179B Hs.343768 

AA909536 Hs.1 43562 
Is.73826 



ESTs, Weakly similar to Z223_HUMAN ZINC 



ESTs, Weakly similar to A46010 X-linked 
gb:QV2-CT0261-261099-011-d11 CT0261 Homo 
Homo sapiens cDNA FU 1 4269 fe, clone PL 
ESTs, Weakly similar to KIAA1353 protein 
Homo sapiens mRNA; cDNA DKFZp434E0528 (f 
ESTs 



AI821085 
AI733610 
N90470 
AW993522 
AF230877 



AI683487 
AW451477 
BE221880 
AI026838 



ls.203697 ESTs, Weakly similar to 138022 hypotheti 

1s.292934 ESTs 

is.152701 microtubule-inte-acting protein that ass 

Target Exon 

ts.1832 neuropeptide Y 

ls.222399 CEGP1 protein 

Is. 152213 wingless-type MMTV integration site fami 



Hs.268555 
Hs.30120 
Hs.55468 



5 -J ex. 



ESTs, Weakly similar to NUCLJUMAN NUCLE 
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436873 
442481 
432527 
447156 
404003 
424853 
427726 
433043 
438138 
440129 
434973 
408527 
459241 
434485 
429716 



448148 
420948 
425810 
414312 
404571 
424850 






AI623511 

R25685 

AI093577 

AA033B32 

AA418703 

AA650114 

NM_016578 



Hs.132967 
Hs.143688 
Hs.125019 
Hs.177502 
Hs.174936 
Hs.313636 
Hs.33074 
Hs,99010 
Hs, 11 8567 
Hs.211933 
Hs.255416 
Hs.212433 



437267 
441916 
452784 

452531 
426503 
423101 
445704 
450813 






454968 
441054 
408385 



419536 
421040 
446495 
415263 
426748 
436338 
445238 



AA767376 
AI459306 
AA603305 
AA715026 



Target Exon 

Human EST done 1 22887 mariner transposo 
Homo sapiens cDNA: FLJ23031 fis, clone L 
lymphoid nuclear protein (LAM) mRNA 
ESTs 

ESTs, Weakly similar to S71886 Ste20-lik 



Hs.325198 ESTs 



410196 AI936442 
436578 AI091435 
424051 AL1 10203 
450325 AI935962 
421863 A1952677 
432161 AK000400 
435072 AW592176 
428647 AA830050 
441111 AI806867 
433087 AI720686 
450244 AA007534 
400294 N95796 
415539 AI733881 
AW511443 
M993571 
BE463857 
Al 337735 
AA4294S2 
AA380153 
M83941 
AI493742 
AI739625 
W87707 



gb:zv98c03.s1 Soares_NhHMPu_S1 Homosspi 



AA151057 

AA830149 

AI676062 

AA436989 

AL049176 

NM_014253 

AI557019 

BE564206 

X00949 

AI376400 

AA922192 

AW854832 

AB040956 



AA397658 
AW972565 
AI339732 

AK001764 

T57773 

AA709046 



NM_015902*:Homo sapiens progestin induce 
chromosome 18 open reading frame 1 
gboc44ID8.s1 NCI_CGAP_GCB1 Homo sapiens 



Hs.54709 ESTs 

Hs.1 35890 
Hs.38348 
Hs.134214 



Hs.59838 

Hs.1 38411 
Hs.26289 

Hs.341181 

Hs.1 24344 
Hs.126594 
Hs.152520 
Hs.125062 
Hs.278695 
Hs72472 



Homo sapiens mRNA; cDNA DKFZp586J1922 (f 



Hs.123642 
Hs.1 67700 
Hs.203376 
Hs.82065 
Hs.268712 

Hs.126480 
Hs.44553 
Hs.291631 
Hs.24908 

Hs.1 35280 
Hs.1 53460 
Hs.130853 
Hs.97177 
Hs.118394 
Hs.187506 
Hs.60257 
Hs.32399 



ESTs 
ESTs 

hypothetical protein FLJ21062 

ESTs, Moderately similar to ZN91_HUMAN Z 

ESTs, Weakly similar to I38022 hypothec 

gb:EST93093 Skin tumor I Homo sapiens cD 

EphA3 

ESTs, Moderately similar to 538022 hypot 
ESTs 

interleukin 6 signal transducer (gp130, 
ESTs 

gb:IL3-CT0214-150300-085-H0G CT0214 Homo 



gb:np12d11.s1 NCI_CGAP_Pr3 Homo sa 



ESTs, Weakly similar to ALU1.HUMAN ALU S 



ESTs 
ESTs 

Homo sapiens cDNA FLJ 13598 lis, clone PL 
ESTs, Weakly similar to S51797 vasodllat 
G-rich RNA sequence binding factor 1 
hypothetical protein FU13409; KIAA1711 
hypothetical protein FLJ10902 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586N2424 (f 
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AL049280 
M319233 
R44477 



452891 
457374 
430261 
408829 
448041 
451244 
443180 
414422 
418866 



435136 
438132 
447058 
419187 
433523 
420871 
450317 
453843 
430172 
431657 



427212 
443361 
409731 



417173 
420931 
434408 
454024 
447805 
438875 



417601 
430697 
430701 
437252 

420133 
431740 
439752 
447816 
431060 

452055 
416602 
418019 
423352 
438042 



416706 
420608 
428249 
442242 
444030 
435655 
400533 



432960 
459527 
426413 



M147224 

T65754 

M281062 



M907076 
AI939456 
AA234852 
H29882 
AA702972 



AW293849 

AI792628 

AA125985 

AI831190 

AW500775 

AW628686 

U61397 

AF044197 

AI031771 



AA130914 
AV647908 
AW021254 
AW105663 



Homo sapiens mRNA; cDNA DKFZp5E4K1 43 (fi 
ESTs 

hypothetical protein FLJ14621 

Homo sapiens prostein rnRNA, complete cds 

gb:yv04a07.s1 Soares fetal liver spleen 

C4001100 , :gi|5852342|gb|AAD54015.1|(AI-"lJ 

ESTs, Weakly similar to DYH9.HUMAN CILIA 

gb;nh05d12.s1 NC_CGAP_Thy1 Homo sapiens 

hypothetical protein HT023 

Te)3-0*uhot 



Hs.81424 
Hs.100431 
Hs.132586 



AA484207 Hs.211867 

AI760833 Hs.293971 

AI433833 Hs.164159 

R41933 Hs.140237 

AA426117 Hs.155543 

N75450 Hs.183412 

T78968 Hs.14411 

NMJ07233 Hs.274329 

AF039307 Hs.249171 

N36417 Hs.144928 

AI377431 Hs.141693 
NMJ06159 
R68911 



AW272603 Hs.266134 

AA873345 Hs.60226 

AW150945 Hs.144758 

AW977556 Hs.291735 
AA377823 
AW968485 



Hs.343877 
Hs.258576 
Hs.249195 



Hs.53126 ESTs 



FU20039 
claudin 12 
HomeoboxA13 

gb:yc11c07.s1 Stratagene lung (937210) H 
u • -J20-42 



Hs.1 61889 ESTs 
Hs.105448 ESTs, W 
Hs.208320 ESTs 



phosphatidylinositol glycan, class H 
N acetylglucosamine-phosphate mutase
RAP1A, member of RAS oncogene family 
ESTs 

ESTs, Weakly similar to ALU7 .HUMAN ALU S 
ESTs 

thymosin, beta, identified in neuroblast 
ESTs 

hypothetical protein FLJ20136 

KIAA021 7 protein 

ubiquitin-like 1 (sentrin)

small inducible cytokine B subfamily (Cy



Hs.293907 hypothetical protein FU23403 



Hs.302421 
Hs.189059 
Ho.75113 
Hs.21108 



general transcription factor IIIA 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

EphA3 

KIAA1 223 protein 



ESTs 



•toALUIJ 



toALUtHUMANALUS 

ESTs, Moderately similar to AF1 1 572 1 67 
ESTs 

TP53 target gene 1 

homeoboxAH 

ESTs 

hypothetical protein MGC10858 



Hs.176275 
Hs.193576 
Hs.180610 
Hs.61523 
Hs.136433 
Hs.82112 
Hs.27133 
Hs.288945 
Hs.103104 
Hs.183291 

Hs.90424 Homosa 
Hs. 135055 ESTs 



interleukin 1 receptor, type I 
ESTs 

1 FLJ13448 



HSPC069 protein 



5*:PRED65 protein (Fragmen 

Homo sapiens, clone IMAGE:3621638. mRNA, 
ESTs 

ESTs, Weakly similar to I78885 serine/th 
gb:EST90805 Synovial saxoma Homo sapien 
gb:EST380561 MAGE resequences, MAGJ Homo 
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25 
30 



416312 


W0264Q 


Hs.16247 


ESTs, Weakly similar to 2004399Achromos 


4.3 


454159 


AW968065 


Hs.44143 


polybromo 1 


4.3 


435176 


AA744B75 


Hs.189413 


ESTs 


4.3 


451061 


AW291487 


Hs.213659 


ESTs, Weakly similar to KIAA1 357 protein 


4.3 


407910 


AA650274 


Hs.41296 


fibronectin leucine rich transmembrane p 


4.3 


420111 


AA255652 




gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens 


4.3 


420757 


X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


4.3 


459045 


N69101 


Hs.40730 


ESTs 


4.3 


405348 






C7001664:gi|1.2698061|dbj|BAB21349.1|(AB 


4.3 


422906 


U80773 


Hs.121580 


Human EST clone 42944 mariner transposon 


4.3 


449603 


AI655662 


Hs.197698 


ESTs 


4.3 


409705 


M37762 


Hs.56023 


brain-derived neurotrophic factor 


4.3 


423782 


AI472209 


Hs.323117 


ESTs 


4.3 


432887 


AI926047 


Hs.1 62859 


ESTs 


4.3 


420905 


AA521307 


Hs.186651 


ESTs 


4.2 


416814 


AW192307 




dolichyl-P-Glc:Man9GlcNAc2-PP-dotichylgl 


4.2 


427908 


AA417272 


Hs.24122 


ESTs 


4.2 


432765 


AJ003429 




gb:AJ003429 Selected chromosome 21 cDNA 


4.2 


444246 


H93281 


Hs.10710 


hypothetical protein FLJ2041 7 


4.2 


450597 


AI701635 


Hs.207077 


ESTs 


4,2 


434022 


R18374 


Hs.117956 


ESTs 


4.2 






Hs.140996 


ESTs 


4.2 


433209 


AB040907 


Hs.278436 


KIAA1474 protein 


4.2 


413525 


BE145899 




gb:MFSO-HT0208-221299-204-b10HT0208 Homo 


4.2 


436679 


AI127483 




ESTs, Weakly similar to unnamed protein 


4.2 


449655 


AI021987 


Hs'.59970 


ESTs 


4.2 


440774 


A1420611 


Hs.153934 


ESTs 


4.2 


427521 


AW973352 


Hs.290585 


ESTs 


4.2 


440594 


AW445167 


Hs.126036 


ESTs 


4.2 


458912 
410790 


AI911066 
AW803357 




ESTs 

gb:IL2-UM0079-090300-050-A08 UM0079 Homo 


4.2 


415245 


N59650 


Hs.27252 


ESTs 


4.1 


435743 


T66861 


Hs.12962 


ESTs 


4.1 


442786 


H50733 


Hs.256261 


ESTs, Moderately similar to ALU8_HUMAN A 




436714 


AA728964 


Hs.293399 


ESTs 




423789 


AK002084 


Hs.1 32851 


hypothetical protein FLJ11222 




408177 


AI241733 


Hs .43871 




4.1 


424200 


AA337221 




gb:EST41944 Endometrial tumor Homo sapie 




430523 


AW451385 


Hs.161954 




4.1 


4452C6 


AI35C199 


Hs.269990 


ESTs 




452335 


AW1 88944 


Hs.61272 


ESTs 


4.1 


438431 


AW207860 


Hs.293116 


ESTs 




44S907 


AA004825 


Hs.103281 


ESTs 




430487 


D87742 


Hs.241552 


KIAA0268 protein 


41 


448152 


AI741053 


Hs.170770 


ESTs 




448515 




Hs.13528 


hypothetical protein FU 14054 


AA 


423242 


AL039402 


Hs.125783 


DEME-6 protein 


4.1 


412715 


NM.000947 


Hs.74519 


primase, polypeptide 2A (58kD) 


4.1 


400746 






Target Exon 


4.1 


413081 


BE064415 




gb:RC4-BT0311-241 199-01 2-b03 BT031 1 Homo 


4.1 


446416 


AV658299 


Hs.163959 


ESTs 


4.1 




AA195601 


Hs.26771 


Human DNA sequence from clone 747H23 on 




427203 


AW629517 


Hs.244855 


ESTs 


4'.0 


440840 


AW629666 




ESTs, Weakly similar to S64054 hypotheti 


4.0 


442338 


AI761976 


Hs.156080 


ESTs 


4.0 


446307 


T50083 


Hs.22247 


ESTs 


4.0 


455276 


BE176479 




gb:RC3-HT0585-160300-022-b09 HT0585 Homo 


4.0 


415861 


Z43123 


Hs.144513 


ESTs 




418259 


AA215404 




ESTs 




418818 


AA228899 


Hs.101307 


Homo sapiens HUT1 1 protein mRNA, partial 


4.0 


432229 


AW290976 


Hs.143587 


ESTs 


4.0 


418310 






ESTs 


4,0 


416760 


H85182 


Hs'.191327 


ESTs, Highly similar to KIAA1 1 02 protein 


4.0 


419083 


AI479560 


Hs.98613 


Homo sapiens cDNA FLJ12292 fls, clone MA 


4.0 


421312 


AA824627 


Hs.291670 


ESTs 


4.0 


448131 


AI675054 
AI953135 


Hs.200481 
Hs.45140 


ESTs 

hypothetical protein FU 14084 


4.0 


418243 


W51873 


Hs.171857 


Homo sapiens testis protein mRNA, penia 






H70854 


Hs.283059 




4.C 


434844 


AF157116 


Hs.22350 


hypothetical protein LOC56757 ' 


4.0 


452221 


C21322 
AW975050 


Hs.288057 
Hs.293892 


hypothetical protein FLJ22242 

ESTs, Weakly similar to ALU4 HUMAN ALU S 


4.0 

4.0 


446354 


AW449650 


Hs.346335 


ESTs 


4.0 




D61594 


Hs.17279 


tyrosylprotein sulfotransferase 1 


4.0 


417333 


AL1 57545 


Hs.173179 


bromodomain and PHD finger containing 3 


4.0 


441568 


AI733322 


Hs.127176 


ESTs 


4.0 


441736 


AW292779 


Hs.8182 


ESTs 


4.0 




AI732230 


Hs.191737 


ESTs 


3.9 


433234 


AB040928 


Hs.65366 


KIAA1495 protein 


3.9 


434222 


AF119886 


Hs.283941 


Homo sapiens PR02591 mRNA, complete cds 


3.9 


448568 


AA149121 


Hs.71947 


ESTs 


3.9 


415068 


Z19448 


Hs.131887 


ESTs, Weakly simila- to T24396 hypotheti 


3.9 


424940 


AA9B5308 


Hs.283902 


ESTs 


3.9 






429066 
435878 
446862 
447530 



411084 
419629 
423453 
434833 






410352 
407961 
410252 






AW450737 
AF1 56548 
BE064420 
AW848047 
AW134924 
BE439899 

R53790 
AI357412 
BE220199 

AW672939 
AW821182 
BE565647 
AI333129 
AI268407 
AW973708 
M7BB65 
BE617907 
N23018 
Z43062 
H69043 
AA320829 
NMJ05923 
AI817336 



428055 
447785 
451746 

436671 
407437 
40841 8 
420092 
446947 
441865 
419875 
431231 
418348 



431316 
414178 
450630 
411067 



Hs.22030 
Hs.1 78222 
Hs.20152 
Hs.282700 
Hs.248865 
Hs.61884 

Hs.177135 
Hs.1 18971 
Hs.125472 
Hs.91662 
Hs.128791 
Hs.1 92969 



1s.1 90325 ESTs 



Hs.1 50444 
Hs.41694 
Hs.61418 



Hs.211458 
Hs.201925 
Hs.275215 
Hs.97635 
Hs.171391 



141765 
M86178 
AA382267 
AW137159 
AF220264 
AW963897 
AA814043 
AF1 45747 
AA38472B 



AI537167 

X07876 

D16471 

AA8847B6 

AF085933 

AA910403 

U91935 

BE245290 

AI419991 

AK001536 

AA502663 

AW957372 

AA010429 

AI681006 

BE085236 

AA316181 



ESTs, Moderately similar to JC5238 galac 
Homo sapiens, clone IMAGE4296026, niRNA, 
Target Exon 



ESTs, Weakly similar to AT1A HUMAN POTEN 
gb:RC4-BT031 1-241 1 99-012-c08 BT0311 Homo 
gb:IL3-CT0214-291299-052-A12 CT0214 Homo 



Hs.1 57601 ESTs 



WD40 protein Ciaol 

KIAA0373 gene product 

origin recognition complex, subunit 2 (y 

MLuflnlld-d souated protein 1 



Homosapiens cDNA FLJ13446fe" clone PI 
hydroxysteroid (11-beta) dehydrogenase 1



Hs.191791 
Hs.196779 

AW448930 Hs.5415 

AW270580 Hs.189311 

AL044675 Hs.173081 

AA505138 Hs.291341 

W03928 Hs.1 14524 ESTs 

AA999723 Hs.129607 ESTs 

AW298141 Hs.157975 ESTs' ^ 

A1624707 
AA227879 
AK000181 
BE173515 
AA732584 
AW998716 
M420564 



gb:HSC12E041 normalized infant brain cDN 
Homo sapiens cDNA FLJ14366 lis, clone HE 
protocadherin 18 

mitogen-activated protein kinase kinase 



hypothetical protein FLJ20174 
gb:RC2-HT0560-210200-012-f03 HT0550Homo 
progestin induced protein 
gb:PM4-BN0067-250300-002-f11 BN0067 Homo 



Hs.340375 
Hs.311258 
Hs.10653 
Hs.146151 

Hs.44743 
Hs.88045 
Hs.232165 
Hs.5722 



Hs.292620 ESTs 



hypothetical protein FLJ 23560 
wingless-type MMTV integration site fami 
Human mRNA, Xq terminal portion 
gb:am20a10.s1 SoaresJIFLJLGBCSI Homos 



Hs.233321 
Hs.239218 
Hs.145225 

Hs.1 45037 
Hs.46791 
Hs.191939 
Hs.71721 



Homo sapiens cDNA FLJ10674 fis, clone NT 
ESTs 

ESTs, Weakly similar to I38022 hypotheft 



"s, Weakly similar to A40348 Elavfiex- 



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452498 
401132 
421105 
435177 
449343 
457653 
419563 
419994 

434763 
447497 
449988 



AL1 17474 
AF010258 
AA913814 
AI381837 
AI732637 

AK000101 

AA766501 
AI018174 
AI151418 
AI820719 
AA526235 
AA282881 
AA761526 
AA648618 
AW1S7254 



Hs.12532 

Hs.44098 

Hs.48948 

Hs.10526 

Hs.41181 

Hs.127428 

Hs. 172854 

Hs.155335 

Hs.277901 

Hs.1904 

Hs.29700 



protein kinase C, iota 

hypothetical protein FLJ20094 

C1 200051 7":gi|47587 1 2 |ref|NP_004659.1 1 £ 

ESTs 

ESTs 

protein phosphatase 3 (formerly 2B), cat
DnaJ (Hsp40) homolog, subfamily A, membe 
Homo sapiens cDNA FLJ11933 fis, clone HE 



AA504428 
AA906121 
AI953425 
AI042063 

AA694192 

H86374 

H74319 

AW971183 

AI800041 

AA455889 

R44013 

AI000341 

H72286 

AA454985 



Hs.205722 
Hs.201420 
Hs.1 81035 
Hs.10487 
Hs.173421 
Hs.345291 
Hs. 1321 56 



Hs.190555 
Hs.1 67279 
Hs.164225 
Hs.220491 



Hs.163853 ESTs 



gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens 

Moderately similar to ALU7.HUMAN A 
hetical protein MGC11296 
Homo sapiens, clone IMAGE:3954132, mRNA, 
ESTs 

ESTs, Weakly similar to I38022 hypotheti 



DnaJ (Hsp40) homolog, subfamily C, mei 
ESTs 

FYVE-f nger-containing Rab5 effector pre 



404848 
412494 
417435 
4304Q3 
436340 
441696 
442231 
447124 
449517 
450297 






AW500106 

AW901347 

T79703 

T89832 

AI301330 

NM.018432 

AA811B57 



400362 
425523 
426237 
427473 
434520 



Hs.82129 

Hs.21606 
Hs.206768 
Hs.222413 
Hs.17428 
Hs.23643 



453942 
431421 
408784 
420184 
420721 
422728 

429165 
427674 
412505 
431408 



ENSP00000240769':BG1 5303.1 (simila- tc 
ADP-ribosylation factor domain protein 1
carbonic anhydrase III, muscle specific
tumor necrosis factor (ligand) superfami



AA188408 
AA927802 
AW937826 
AWB80614 



Hs.1 37576 
Hs.192817 
Hs.19928 
Hs.108144 
Hs.63386 



AA825971 

AV653771 

AA225313 

AA873008 

N50128 

BE410107 

AI080271 

AF068294 

AB007948 

AK001104 

AW274439 

AA205273 



ribosomal protein L34 pseudogene 1 



E \ nil toZN9 HMf 'I I 

RNA binding motif protein, X chromosome
prostate cancer associated protein 1 
H2B histone family, member Q 



ls.173400 ESTs 



Hs.134533 
Hs.272414 
Hs.1 58244 



Hs.252709 
Hs.1 77011 
Hs.197505 
Hs.8108 



414818 BE541217 Hs.23606 ESTs 



Homo sapiens HDCMB45P mRNA, partial cds 
KIAA0479 protein 
hypothetical protein FU 10242 
ESTs 

hypothetical protein 
ESTs 

disabled (Drosophila) homolog 1 



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418051 
418719 
437714 
455710 
421590 
445707 
411436 



AI961456 
AA888624 
AW1 92535 



BE072049 

AF004715 

AI248720 

AW846433 

NM_015865 

AF109300 

AW377314 

AA659823 

AW861930 



AA088767 

T92248 

AA610150 

R82826 

U52077 

AI123555 

AW753311 

W05355 

AA001793 

AA495800 



Hs.87128 
Hs.1 73911 
Hs.5232 
Hs.17585 
Hs.147487 
Hs.21275 
Hs.1 97289 
Hs.19479 
Hs.151707 
ls.293242 



Hs.5364 
Hs.34955 
Hs.102500 
Hs.1 371 90 



Hs.81796 
Hs.346690 
Hs.102971 



KIAA0801 gene product 



DKFZP564I052 protein 

Homo sapiens cDNA FLJ13485 fis, clone PL

hypothetical protein dJ511E16.2

ESTs

ESTs, Weakly similar to ALU1_HUMAN ALU S



hypothetical protein FLJ14751 
gb:zh86c06.r1 Soares_fetal_liver_spleen_
gb:zw05b07.s1 Soares_NhHMPu_S1 Homo sa 






451025 
401416 
421928 
454860 



445218 
447597 
437866 
416882 
410870 



418319 
419088 
422165 
448044 



432589 
420512 
417206 
410821 
423855 



421823 
416812
445784 



AW028689 Hs.301985 ESTs 



AW835767 

AI160015 

AK001331 

AL049610 

AA766296 

X81895 

AI243596

T79971 



C14000338':gi|74595O2[pir||S74665 outer 
polyadenylate binding protein-interactin
gb:QV4-LT0016-240200-110-b08 LT0016 Homo



Hs.99200 
Hs.194765 
Hs.94830



R51034 

AI636310 

R60891 

AL080116 

T06809 

AW611703 



Hs.28310 
Hs.260274 
Hs.74420 
Hs.332086 
Hs.190173 
Hs.52620 



U04343 
AA364908 
D54745 
AL135725
AA262886 



AA412290 

AW969706 

AA909249 

N40850 

H91010 



Hs.98927 
Hs.80247 
Hs.131708 
Hs.143817 
Hs.81648 
Hs.92526 
Hs.254859 



H.sapiens GENX-5624 mRNA, 3' UTR 
ESTs, Moderately similar to T03094 A-kin 
Homo sapiens clone 24407 mRNA sequence 



Hs.135624 ESTs 



origin recognition complex, subunit 3 (y



Hs.221612 
Hs.93304 
Hs.79881 
Hs.138343 



gb:tk13e01.x1 NCI_CGAP_Lu24 Homo sapiens
ESTs 

Target Exon 



Hs.112282
Hs.28625
Hs.44940
Hs.146065
Hs.162849
Hs.301198 



CD86 antigen (CD28 antigen ligand 2, B7 
hypothetical protein FLJ13993
cholecystokinin 



rel C3HC4 type Zinc finger (ring fmqe 

Ts 

Ts 

solute carrier family 30 (zinc transport



roundabout (axon guidance receptor, Dros 
416423
418037 
419197 
420179 
433610 
436295 
444800 



AF129533 

H54375 

AI990212 

N48921 

N74530 

AA806822 

N73895 

AW119071



407349 
410869 
425354 
425480 
441492 
447078 
459324 
433852 



416642 

426174 
430459 
439182 



Hs.268921 ESTs 



Hs.112547 ESTs



gb:Homo sapiens F-box protein FbI3b (FBL 



AA825449 
AW808361 
U62027 



Hs.155935 
Hs.158135 
Hs.146346



gb:za62d06.s1 Soares fetal liver spleen 
ESTs 

hypothetical protein RG083M05.2 
homogentisate 1,2-dioxygenase (homogenti
protein kinase C, mu 

ENSP00000233779-:Hypothetical 53.0 kDa p 

Homo sapiens cDNA: FLJ22437 fis, clone H

gb:MR1-ST01 11-1 1 1099-003-104 ST01 1 1 Homo 

complement component 3a receptor 1 

KIAA0981 protein 

ESTs 

ESTs 

gb:xc28c12.x1 NCI_CGAP_Co18 Homo sapiens 
ESTs 

ESTs, Moderately similar to A53959 throm 



AA411176 Hs.141939 

AW084175 Hs.223296 

AI821803 Hs.136580 

AW968263 Hs.123126 



Z42387 

T96118 

AI217097 

AA547959 

BE178539 

AF086030 

AI283476 

AA158659



AA234759 
AA400540 
AA743240 
AI864668 
NM_002759



NM_030813':Homo sapiens suppressor of po
ESTs, Moderately similar to S65657 alpha
ESTs, Weakly similar to I38022 hypotheti



Target Exon 

gb:hn42h03.x1 NCI_CGAP_R0F2 Homo sapiens 
transmembrane, prostate androgen induced 
ESTs 

gb:qd43h07.x1 Soares_fetal_heart_NbHH19W



Hs.21621 
Hs.263478 
Hs.334712 
Hs.221931 
Hs.132950
Hs.135282 
HS.S1582 
Hs.48832 




hypothetic; 

gb:CM0-CT0103-120899-037-g07CT0103Homo 

gb:ye53h05.s1 Soares fetal liver spleen 
ESTs 

splicing factor 3b, subunit 3, 130kD 
ESTs 

Homo sapiens cDNA FLJ14913 fis, clone PL
ESTs, Weakly similar to TSGA_RAT TESTIS
hypothetical protein FLJ14600



TABLE 64B 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number

Accession: Genbank accession numbers 
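A row of this table can be read programmatically. The following is a minimal Python sketch, assuming whitespace-separated fields in the order given in the legend above (Pkey, CAT number, then one or more accessions). The function name `parse_cluster_row` and the normalized spelling of the sample row are illustrative assumptions and are not part of the disclosure.

```python
def parse_cluster_row(line):
    """Split one table row into Pkey, CAT number and Genbank accessions.

    Assumes whitespace-separated fields: Pkey, CAT number, accessions...
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    pkey, cat_number, *accessions = fields
    return {
        "pkey": int(pkey),          # Eos probeset identifier
        "cat_number": cat_number,   # gene cluster number, kept as text
        "accessions": accessions,   # Genbank accession numbers
    }


# Sample row (spacing normalized for illustration).
row = parse_cluster_row("411436 1245660_1 AW846433 AW846159 AW846377 AW846528")
print(row["pkey"], row["cat_number"], len(row["accessions"]))
```

Keeping the CAT number as text avoids losing the underscore suffix that distinguishes clusters.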



Pkey CAT Number Accession 

410790 1221131_1 AW803357 AW803423 AW812233 R06814

410869 1225123_1 AW808361 AW808404 AW808386 AW808594 AW808654 AW808813 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934
AW808829 AW808385 AW808422 AW808401 AW808409 AW808760 AW808863 AW808521 AW808539 AW808609 AW808472 AW808739
AW808704 AW808558 AW808714 AW808420 AW8

411436 1245660_1 AW846433 AW846159 AW846377 AW846528
412988
413081 
413525 



419536 
420111 
420352 
423412 
424200 
426413 
426503 
426991 



442481 
415432 
448044 



452462 
453682 
454037 
454096 
454171 
454457 
454860 



50357_2

505904_1
543588_1
63943_1
747196_1
80517_2



83933_1
84327_1
918580_1



1049240 
1207274 
1237732 
1247029 
1272541 



455710 
457374 
458912 



BE064415 BE064430 BE064448 

BE145899 BE145848 BE145349 BE145853 BE145927 BE145925 

AI267700 AI720344 AA191424 AI023543 AI469633 AA172056 AW958465 AA172236 AW953397 AA355086 
Z43062 R13213 H14422 

H51299 H44619 H46391 R86024 H51 892 T72744 

A1633044 AW016212 AW241 143 AA769G58 R43272 AW068958 AA210918 AA293774 AI748815 A1763294 AI333114 AI2773B4 A1088297 AI468477 
AI824624 AW189606 AI631751 Z40749 AI984673 AI671316 AA189024 AW235412 BE178426 R24677 R40635 H05100 R40597 
AA215404 AI990909 BE464132 AW271 459 N74332 AI262061 
T65754 AA229857 AA229658

AI217097 AW886090 W38035 W33792 AA232835 AW936043 
AA603305 AA244095 AA2441 83 
AA255652 AA280911 AW967920 AA262684 

BE258835 AW968316 AA258918 AW843305 R14744 AI580388 BE071923 R36280
AF109300 AI299378 AI202654 
AA337221 AA336756 AW966196
AA377823 AW954494 AI022688 
AA380153 AA380233 AW963529 

AK001536 AA1 91 092 AW510354 AI554256 AL353968 AA1 34266 
AA418703 AA418711 BE071915 BE071920 BE071912 

AI739168 AA426249 AI199636 AW5051 98 AW977291 AAB24583 AA883419 AA724079 AI015524 AI377728 AW293682 AI928140 AA731438 

AI092404 AI085630 AA731340 

AA884766 AW974271 AA592975 AA44731 2 

AW207206 AW341473 AA448195 AI951341 

AW968485 AW968670 AA480922 BE350425 

AI821 085 AW973464 AA554802 AI821 831 AA657438 AA640756 AA650339 
AJ003429 AJ003367 AA564825

H29882 AW665533 AW1 49901 AI57291 7 AA598500 AI686466 AI336390 AW864390 AW864320 

AA648618 AW974389 H51771 

N73895 AJ001872

BE085236 BE08531 7 X04236 AA577934 AA578392 AA502836 AA595852 AA57B25B AW270791 AA507151 AA5591 52 T57040 BE503281 
AWS9340S A182575S AI3S0499 AIGE5710 AI9722B1 AI654949 BE0739B1 BE073962 BE041399 AW750214 AA228488 BE074016 AI908706 
AW270601 AW873282 

AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 

AA837481 AW468444 BE185091 AW468002 AAG87333 AA81 1 830 AA581806 AI8666B6 AI572124 M043777 AA040926 D201BO AI536733 

AA812489AW874142 AI47188 

AA8301 49 AW978407 M85983 AW503637 

DE220199W01813 AF0B511B N707B0 BE221405 

AW629666 AW959831 AW205739 BE620243 AA41 2367 AW300025 AW05192O AI288591 AW23B1 14 AI302B52 AI03B54B M534496 AI797207 
AA921877 

AA91 0403 AI81 5593 W58361 AW1 62520 AIB1 5550 
N99828 BE079873 AI1 1 0738 AF074645 
AV653771 BE089370 

AI458682 H24240 R14537 R18426 AW867082 

AI151418W60401 AW631238AI346936AA862B55W60310 N72501 H90060BE150445AW380821 AI540906 C04881 W03542AA641764H97053 
AW889353 AA521308 AA001203 W92328 A[207798 M74B555 R78710 W24617 AA024605 C01747 AW173095 W61229 W92585 AA742467 
H00789 R76925AW1828 
AA001793 AA001871

NM_014253AF100772BE088769AL022718BE161779AW8635S9 BE161640 AL039060 BE168542 AW295554 AA323193 AA235370 AW779760 
N48674 AI375997 R45432 D59344 A1203107 F07491 R35360 R25094 AI913631 AI49B402 T513B2 AI016320 N45526 T51415AA331486 
AI692689 R14223 R18395 

N40087H12925AA460779AA096372BE160847AW8168SOAWB16893AW815577AWB15891 AW816882AW816868AW816889AW816940 
AW81 6892 AW81 6941 AW816578 AA0291 83 T83320 H86650 AA01 0295 BE1 50823 225353 AA1506B3 AWBB7764 AW023806 AW022095 
AA164518 AA730973 W00417 W65303 

AI339732 AA01 0300 AW51 5041 AA768334 N29860 AA425874 AA4251 1 B AA865829 AW936878 

AA495800 AA495737 AA010736 AA654716 AA640726 

BE173515 BE173560 AI902860 

T79703 T96307 AL079725 

AW998716 AW022148 N68020

AW062757 AW176890 AW062758 AW176895 AW176859 AW175900 AW176897 AW176928 AW176868 AW176892
AW854832 AW854798 AW854857 AW854816 AW854834 AW854817
AW753456 AW753036 AW854868 AW854862
AW835767 AW835537 BE160187

AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033

BE176479 BE176578 BE176357 BE176550 AW8E6079 BE176S76 BE176515 BE176555 BE176489 BE176610 BE176362 
BE064420 BE064435 BE064429 BE064414 BE064400 BE064517 

BE072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464 

AA493662 AW897396 BE154814 

AI91 1056 AI933734 AI680888 AJ003599 



TABLE 64C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
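The nucleotide-position column holds one or more exon ranges per probeset, for example `1344-1442,1545-1697`. A minimal Python sketch for reading such a field follows. It assumes comma-separated `start-end` pairs with inclusive coordinates; the inclusive convention and the function names are assumptions made for illustration, since the table does not state them.

```python
def parse_exon_positions(field):
    """Parse 'start-end[,start-end...]' into a list of (start, end) tuples."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"range start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (assumed)."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_exon_positions("1344-1442,1545-1697")
print(exons, total_exon_length(exons))
```

A field with a single range, such as `61866-62027`, yields a one-element list under the same function.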



Pkey Ref Strand Nt_position






401403 
401416 
401424 
403242 
403481 
403510 
403667 



7710966 Plus 



7637817 Minus 
9965004 Plus 
7652047 Plus 



8248647 
7329374 
2914717 



147703-147896 
85679-85795 
146180-146294 
121456-121626 
24223-24428 
11 237-1 2D11 



61866-62027 

1344-1442,1545-1697 

143467-143634 

198349-199096 

59039-70100 

112450-112648 

39067-39225 

23955-24034,25143-25264 
28556-28684 



Table 65A lists about 347 genes up-regulated in BPH compared to prostate cancer tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" BPH to "average" prostate cancer tissues was greater than or equal to 3.0. The "average" BPH level was set to the 75th percentile amongst BPH
tissues. The "average" prostate cancer tissue level was set to the 85th percentile amongst malignant prostate tissues. In order to remove gene-specific background levels of
non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
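The selection rule described above can be sketched in a few lines of Python. This is an illustrative sketch only: the linear-interpolation percentile, the guard for a non-positive denominator, the function names, and the sample intensities are all assumptions, since the text gives only the percentiles and the 3.0 threshold.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    The interpolation rule is an assumption; the text does not say how
    percentiles were computed.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def bph_to_cancer_ratio(bph, cancer, non_malignant):
    """Background-subtracted ratio of "average" BPH to "average" cancer level.

    Returns None when the background-subtracted denominator is not positive
    (a hypothetical guard; the text does not describe this case).
    """
    background = percentile(non_malignant, 10)          # 10th percentile, non-malignant body tissues
    numerator = percentile(bph, 75) - background        # "average" BPH level
    denominator = percentile(cancer, 85) - background   # "average" prostate cancer level
    if denominator <= 0:
        return None
    return numerator / denominator


# Hypothetical intensities for a single probeset (not data from the tables).
bph = [100, 200, 300, 400, 500]
cancer = list(range(40, 145, 5))          # 40, 45, ..., 140
non_malignant = list(range(10, 120, 10))  # 10, 20, ..., 110
ratio = bph_to_cancer_ratio(bph, cancer, non_malignant)
selected = ratio is not None and ratio >= 3.0
print(ratio, selected)
```

With these made-up values the background is 20, the numerator is 400 - 20 and the denominator is 125 - 20, so the ratio is about 3.62 and the probeset would pass the 3.0 cutoff.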

TABLE 65A: ABOUT 347 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER TISSUES 



ExAccn UnigeneID Unigene Title

AA421773 Hs.161008 ESTs 

AW815036 Hs.151251 ESTs 

BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A

AI890347 Hs.271923 Homo sapiens cDNA: FLJ22785 fis, clone K 

ENSPOOr ' ' 

AA814100 Hs.86693 ESTs 



438231 
418387 
442481 



412988 

454171 
400086 
440911 
425312 
426140 
419015 
453789 
424940 



R18085 
N99828 
T52285 
AI820719 



Hs.321190 
Hs.155689 



AA909536 

AA354940 

AF131798 

T79262 

AA628517 



Hs.132851 
Hs.83870 

Hs.143562 
Hs.145958
Hs.343768 
Hs.14463 



NM_022739':Homo sapiens E3 ubiquitin lig 
gb:QV2-CT0261-261099-011-d11 CT0261 Homo 
Eos Control 

gb:QV3-OT0063-290300-1 35-cD4 OT0063 Homo 
gb:601117374F1 NIH_MGC_15 Homo sapiens c
ESTs 

gbygl 6b1 2.r1 Soares infant brain 1 NIB H 
gb:za32c04.r1 Soares fetal liver spleen
Homo sapiens mRNA for KIAA1764 protein,
DnaJ (Hsp40) homolog, subfamily A, membe
Target Exon 

gb:EST380561 MAGE resequences, MAGJ Homo 
gb:QV4-LT0016-240200-110-b08 LT0016 Homo
hypothetical protein FLJ11222
gb:hn42h03.x1 NCI_CGAP_RDF2 Homo sapiens 



Homo sapiens clone 25119 mRNA sequence



404003 
453200 
428002 
432319 
445444 



Hs.191754 ESTs 



AA418703 
AW510770 
AA380876 



solute carrier family 6 (neurotransmitte 



Hs.212433 ESTs 



Hs.128386 
Hs.270 
Hs.133273 

Hs.38348 
Hs.196011
Hs.107410 



gb:zv98c03.s1 Soares_NhHMPu_S1 Homo sapi
ESTs

pleckstrin homology, Sec7 and coiled/coi
ESTs

gb:IL3-CT0219-291099-021-E07 CT0219 Homo

ESTs 

ESTs 

ESTs, Weakly similar to A46010 X-linked 






436578 AI091435 



435072 
414403 
443744 



408385 
437752 
446495 



AW969551 
AI084326 
AI720686 
AA993571 
BE541217 
AA429462 
AW849046 
H08225 



Hs.134859 
Hs.133324 
Hs.116932 
Hs.76064 
Hs.271548 
Hs.152520 
Hs.129075 
Hs.23606 



Hs.268712 
Hs.44553 
Hs.291631 



AI733610 
AL048409
AW954525 



AA883971 
BE074910 
AI809510 



ESTs, Weakly similar to ALU1_HUMAN ALU S 
gb:EST366595 MAGE resequences, MAGC Homi 
Target Exon 

hypothetical protein FLJ13448 



gb:RC5-BT0580-170300-021-F12 BT0580 Homo



Hs.118971 ESTs 



AA889386 Hs.125468 ESTs 



gb:nh05d12.s1 NCI_CGAP_Thy1 Homo sai 



AA284658 
AA907076 



Hs.223000 ESTs 



427302 
442123 
438727 






AI697790 
AW978756 
AF044197
AI337127
AI827243
AF147363



Hs.100343 

Hs.208320 
Hs.865 
Hs.135282 
Hs.159961

Hs.100431

Hs.224398



AA484207 
AI620661 
BE140643
AW119071 

AW207860 
AA300591 
T78968 
AA234759 



Hs.153287 
Hs.293115 



small inducible cytokine B subfamily (Cy 
ESTs 

Homo sapiens cDNA FLJ11469 fis, clone HE

gb:Homo sapiens full length insert cDNA 

ENSP00000251890':Monocytic leukemia zinc 

Human EST clone 42944 mariner transposon 

C7001976':gi|4758712|retiNP_004659.1|aI 

ESTs 

ESTs 

gb:RC0-HT0015-310599-016 HT0015 Homo sap 



gb:EST13437 Testis tumor Homo sapiens cD 



416312 
415467 
459045 



AA029994 

AW296971 

AA211586 

NM_014809 

AI928037 

AI051603 

AA345504 

W02640 



427908 
432765 
448108 
436345 
409500 
413525 
403305 
436338 
427521 
430124 
410790 



Hs.180610 ESTs 

Hs.26441 
Hs.158832 
Hs.133141 



Hs.16247 
Hs.260274 
Hs.40730 
Hs.140970 
Hs.7984 
Hs.26675 
Hs.24122 



Hs.170685 ESTs



gb:AJ003429 Selected chromosome 21 cDNA 



sulfotransferase, estrogen-preferring
gb:URO-HT0208-221299-204-b10HT0208 Ho 
transmembrane protein (63kD), endoplasmi 



Hs.127780 
Hs.205241 
Hs.256261 



gb:IL2-UM0079-090300-050-A08 UM0079 Homo






AI350199 Hs.269990 



gb:EST41944 Endometrial tumor Ho 



453713 
400746 
413081
442338 
455388 
455276 
421312 
431421 
448882 



453387 
434222 
454806 
429066 



421353 
401459 
403433 



BE064415 
AI761976 
AW936234 
BE176479 
AA824627

AJ001531
AI733322
BE064420
AA428211
AA993527
AI990741
AF119886
AW872430 
AA868555 
R0B330 



452320 
402145 
458438 
440450 
437587 
416009 
434381 
416534 

411436 
424830 
431447 
435932 
442447 
419831 
403242 
455490 
449254 



448135 
405510 
407437 



U06117 

AA713661 

R44574 

AW754192 

AA042873 



AW846 

AW270580 

AA505138 



AW953477 

AI637649 

AI080230 

AI039624 

BE173515 

AA227879 

H64159 

AL041765 

M86178 

H94331 

AI470874 



427726 
434306 
456354 
419261 
422899 
447458 
411421 
439075 



Hs.156080 ESTs



404561 
429073 
419002 



Hs.252809 
Hs.283941 
Hs.273743 
Hs.178222
Hs.20152
Hs.31917
Hs.248865 
Hs.26100 



gb:RC4-BT0311-241199-012-c08 BT0311 Homo
hypothetical protein FLJ14033 similar to
hypothetical protein FLJ23403
ESTs 

Homo sapiens PR02591 mRNA, complete cds 



Hs.283879 ubiquilin 3 
Hs.255130 ESTs 



Hs.250 

Hs.121091 

Hs.107510 



gb:RC2-CT0321-131299-012-a04 CT0321 Homo 



Hs.224961 Homo sapiens cDNA FLJ14366 fis, clo



Hs.189311 
Hs.291341 
Hs.114524 
Hs.129607
Hs.5415 



Hs.196105 
Hs.134214
Hs.98388

Hs.187621
Hs.15328 
Hs.340375 
Hs.311258 
Hs.34024 
Hs.343799 



gb:RC2-HT0560-210200-012-f03HT0560Ho 



D16471 
AI741082 
BE272110 
AF085933 Hs.292620 ESTs



ESTs 
ESTs 

ENSP00000233779*:Hypothetical 68.0 kDa p
gb:Homo sapiens MOST-1 mRNA, complete cd
ESTs, Weakly similar to ALU1_HUMAN ALU S
gb:PM2-CT0328-281299-003-e04 CT0328 Homo
gb:zn27h06.r1 Stratagene neuroepithelium
Homo sapiens FKSG41 (FKSG41) mRNA, compl
Homo sapiens cDNA: FLJ23031 fis, clone L
Homo sapiens cDNA: FLJ21447 fis, clone C
alcohol dehydrogenase 4 (class II), pi p
wingless-type MMTV integration site fami
Human mRNA, Xq terminal portion 
ESTs 
ESTs 



AI203405 
U91935 
H71739 



Hs.47831 
Hs.233321 
Hs.200227 

Hs.47385 
Hs.268594 
Hs.191939 



Retina-derived POU-domain factor-1
ESTs, Moderately similar to A53959 throm 
trichorhinophalangeal syndrome I gene(T 






419622 
449821 
451193 



AF086294 Hs.125844 ESTs

AA452054 Hs.119338 ESTs

AI671141 Hs.211122 ESTs 

N29850 Hs.44098 ESTs 

AW984757 < 

AB031479 Hs.91600 

AI732637 Hs.277901 

AI1262BB Hs.192232 

BE072483 Hs.278337 



405264 
417675 
449988 
434763 
416379 
407557 
437956 
403510 
451722 
435681 
427125 






AW372068 

AA648618

N38857 

Z83803 

AA773283

H86374 



AL133105 

434497 AI821803 

453682 T79703 
404848 

417435 NM_005181

442231 W02434 

450297 AW901347 

436340 R42246 

441596 AA939300 

456995 T89832 

456304 AI820973 

427033 AI457449 

400379 NM_018432

423357 AI285124 

450582 AI339732

437662 AA765387

442388 AW663442 



Hs.203559 

Hs.40861 
Hs.148979 
Hs.97612 
Hs.144633 
Hs.136580 



NM_030813':Homo sapiens suppressor of po 
similar to murine leucine-rich repeat pr
ESTs, Moderately similar to ALU7_HUMAN A
gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens



gb:H.sapiensmRNAfor; 
hypothetical protein FLJ12701 
Target Exon 



Hs.82129 
Hs.222413 
Hs.38592 



431408 
428923 
407344 



hypothetical protein DKFZp434F2322 
ESTs 

gb:yd71e08.r1 Soares fetal liver spleen 
ENSP00000240769':BG153O3.1 (similar to C 
carbonic anhydrase III, muscle specific 
ESTs 

hypothetical protein FLJ23342 
ESTs 
ESTs 
ESTs 

gb:nc21c02.y5 NCI_CGAP_Pr1 Homo sapiens 
ESTs 

:er related prot 

Hs.157505 ESTs 

G-rich RNA sequence binding factor 1 

Hs.145095 ESTs 

Hs.129485 ESTs 

Hs.148701 ESTs 

Hs.207077 ESTs 

Hs.124284 ESTs

gb:AV653771 GLC Homo sapiens cDNA clone 

Hs.191995 ESTs 

Hs.105738 ESTs

Hs.188785 ESTs

Hs.271418 gb:ox29f07.x1 Soares_total_fetus_Nb2HF8_

Hs.158244 KIAA0479 protein



AA744703 Hs.129030 ESTs 



447282 AI989963 



418719 
402703 
455710 
452011 
453973 
420355 
418898 
418986 
449570 
447516 
408304 
402454 



Hs.197505 
Hs.346145 
Hs.87128 
Hs.59847 
Hs.164018 
Hs.123642 
Hs.132967 
Hs.65641 
Hs.161707 



hypothetical protein FLJ23309



EphA3 

Human EST clone 122887 mariner transposo 
protein FLJ20073



BE072049 

AW628911 

AI291895 

AW968263 

T66847 

AI123555 

AA001793 

W05355 



AW851255 

407896 D76435 

435786 H09175 

447597 AIB86036 

432625 AI243596 

459535 AV654907 

447183 AI554733 

426629 AI203933 

447892 AI435848 



Hs.211429 

Hs.61993 ESTs, Weakly similar to ALU1_HUMAN ALU S 
Hs.123126 ESTs 

Hs.194040 ESTs, Weakly similar to I38022 hypotheti

Hs.81796 ESTs 

gb:zh86c06.r1 Soares_fetal_liver_spleen_

Hs.102971 hypothetical protein FLJ14751 

gb:MR4-ST0125-151299-029-a09 ST0125 Homo 
C1002501':gi|129092|sp|P23270|OLF7_RATO 
gb:IL3-CT0220-160200-066-H02 CT0220 Homo



Hs.26085 
Hs.213675 
Hs.94830 

Hs.173182 
Hs.97142 
Hs.172978 
Hs.126083



ESTs 



416812 
452102 
434512 
458251 



AA677981 
AA059412 
AI754634 



457021 
420301 
420721 
438013
441492 



Hs.326728 ESTs 

gb:zw05b07.s1 Soares_NhHMPu_S1 Homo sapi

Hs.44940 ESTs 

Hs.27954 CD86 antigen (CD28 antigen ligand 2, B7- 

Hs.188941 ESTs

Hs.210422 ESTs * 

Hs.293561 ESTs 

Hs.138343 ESTs, Weakly similar to I78885 serine/th

Target Exon

Hs.119023 SMC2 (structural maintenance of i

Hs.47986 hypothetical protein MGC10940

Hs.131987 ESTs 

Hs.105635 ESTs 



427773 
415901 
445758 
403291 
436295 



427813
405733 
426743 
449913 



Hs.98124 
Hs.76118 
Hs.25804 



ESTs 

ubiquitin carboxyl-terminal esterase L1 
ESTs 

Target Exon 

gb:za62d06.s1 Soares fetal liver spleen
gb:52g11 Human retina cDNA randomly prim



Hs.86447 ESTs 
ESTs 



R15337 

AW968934 

AA767526 

AA927802

AA703419 

AI1 49998 

AW808361

AA236867 

W88732 

AI939357 

AA131782 

AF1295C5 

AK001991 

H60457 

T96118 

AF086030

AI217097 



AA383833 

AA004696

AW062757 

AA865818 

H47233 

D56919 

AI381837 

AA017410



Hs.268921 
Hs.21958 
Hs.173108 
Hs.22030 
Hs.159471
Hs.287749 
Hs.146346 

Hs.36107 
Hs.270710 
Hs.182314
Hs.36492 



Hs.174936
Hs.30643 
Hs.265848 



ESTs 

Homo sapiens mRNA; cDNA DKFZp547D086(fr 
Homo sapiens cDNA: FLJ21897 fis, clone H
paired box gene 5 (B-cell lineage specif
ZAP3 protein

Homo sapiens cDNA: FLJ23593 fis, clone L



Hs.100861 hypothetical



AW810279 BE146684 BE146693 BE146S94 BE146679 AW810472 AWB10208 AW810356 AW810193 AW1 78838 AW1 78837 AW1 78857 AW810515 
AW81 0330 AW81 0514 AW81 0441 AW810358 AW178852 AW81 0359 AW81 0322 AW81 0327 AW81021 1 AW178835 AW81 0635 AW81 0288 
AW81 0263 AW81 0325 AW81 0443 AW8 

M125984 M1 271 89 AA065075 AA070377 AA1 00017 AA079891 AA1 1 3255 AA0751 68 AA082764 M083380 NB4829 AA084752 AA076512 
M085119M085208AA085045 

AW754192 W00554 AW857797 AW754203 AW754197 AVV7541 93 
AW803357 AW803423 AW812233 R06814 

AW808361 AW808404 AW808386 AW808594 AW808654 AW808813 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934

AW808829 AW808385 AW808422 AW808401 AW808409 AW808760 AW808863 AW808521 AW808539 AW808609 AW808472 AW808739

AW808704 AW808558 AW808714 AW808420 AW8 

AW846433 AW846159 AW846377 AW846528 

AW850246 AW850251 AW850302

AW851255 AW851432 AW850955

AW984757 AW984797 AW984734 AW984745 

BE046680 BE046738 BE044958 

BE064415 BE064430 BE064448 



1208283_1
1221131_1
1225123_1

1245660_1
1248692_1
1249255_1
1322288_1
1342150_1



413525 
416009 
416422 



Z43062 R13213 H14422 
H60457 H68709 H73528 H54335 R87154 
AA211586 F35799 AA211641 F29720 AW937387 AW937408
R18085 AA219028 R17712 Z44345
AI217097 AW886090 W38035 W38792 AA232835 AW936043



419386 
420352 
421926 
424200 

424994 
428002 
430535 



440947 
442481 
445432 



455067 
455135 
455276 



455747 
455772 
455801 



184356.1 
192979 1 
209246 1 



918580J 
977454J 
1007449 J 
1049240J 
1207274J 



AA236867 AA237066 AA354236 AW957759 H08961

BE258835 AW968316 AA258918 AW843305 R14744 AI580388 BE071923 R36280

AA300591 AW963893 AA300493

AA337221 AA336756 AW966196 

AA345504 AA345251 AW963243 

AW954525 AI372685 AA349501 AI372687 H1 0564 

AA418703 AA418711 BE071915 BE071920 BE071912 

AW968485 AW968670 AA480922 BE350425 

AJ003429 AJ003367 AA564825

H29882 AW665533 AW149901 AI57291 7 AA598500 AI686466 AI336390 AW864390 AW864320 

AA631834 AA633425 AA632455 AI792312 AI792311 

AF147363 T47219 T47218

AA648618 AW974389 H51771

N73895 AJ001872

AA910403 AI815593 W58361 AW162520 AI816550
N99828 BE079873 AI110738 AF074645
AV653771 BE089370 
AA001793 AA001871 



1370508J 
17682CJ 
328758_1 



AI339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118 AA865829 AW936878

AA495800 AA495737 AA010736 AA654716 AA640726

BE173515 BE173560 AI902860

T79703 T96307 AL079725

AW062757 AW176890 AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892
AW854832 AW854798 AW854857 AW854816 AW854834 AW854817
AW753456 AW753036 AW854868 AW854862
BE069128 BE069023 AW809375

AW812866 AW812746 AW812747 AW812884 AW812763 AW812722
AW835767 AW835537 BE160187

AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033
AW854538 AW854418 AW854412

AW857989 AW858016 AW861677 AW861689 AW861691 AW858056

BE176479 BE176678 BE176357 BE176550 AW886079 BE1 76676 BE17661 5 BE1 76555 BE176489 BE176610 BE176362 

AW936234 AW936074 AW936181 AW936179 AW936217 AW936077 AW936227 AW936191 

AW953477 Z41970 F12435 T73989 T09387 

BE064420 BE064435 BE064429 BE064414 BE064400 BE064517

BE072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464
BE074910 BE074913 BE074911 BE074903 BE074892 BE074935 
W28799 BE086078 

BE140643 BE140645 BE140644 BE140557 BE140660 BE140659 BE140561 
AI820973 AI734077 AI820984 AA225795 AA225060 AA225101
AA493662 AW897396 BE154814



TABLE 65C 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



400746 
401132 
401459 
402145 



403481 
403510 
403667 
403805 
404003 
404561 
404592 



7230870 Plus 



7329374 
3419846 
2914717 



Nt_position

277132-277595 

147703-147896 

85679-85795 

182001-183323 

113086-114800 

14826-15803 

15335-15500 

11297-12511 

95177-95435 

114632-114805 

124794-124941 



72225-72437 

93496-93633 

61866-62027 

1344-1442,1545-1697 

51483-51742,53429-53511 



39067-39225 



89944-90729 

154015-154123 

28556-28684 

44654-45210 

43310-43462 

101028-101174 

10878-11048 

124832-125051 






Table 66A lists about 370 genes up-regulated in BPH compared to prostate cancer and normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" BPH to "average" prostate cancer and normal adult tissues was greater than or equal to 3.0. The "average" BPH level was set to the
75th percentile amongst BPH tissues. The "average" prostate cancer and normal adult tissue level was set to the 85th percentile amongst malignant prostate tissues and normal adult
tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the
numerator and the denominator before the ratio was evaluated.
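Applied across many probesets, the rule above amounts to a filter in which the comparator pools malignant prostate and normal adult tissues. The Python sketch below illustrates this under stated assumptions: the nested-dictionary layout, the group names, the probeset labels, the interpolation rule, and the handling of a non-positive denominator are all hypothetical and are not taken from the disclosure.

```python
def percentile(values, q):
    """q-th percentile (0-100) with linear interpolation (assumed method)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def select_probesets(expression, threshold=3.0):
    """Return {probeset: ratio} for probesets whose ratio meets the threshold.

    `expression` maps a probeset label to four intensity lists:
    "bph", "cancer", "normal_adult" and "non_malignant" (hypothetical keys).
    """
    selected = {}
    for pkey, groups in expression.items():
        background = percentile(groups["non_malignant"], 10)
        comparator = groups["cancer"] + groups["normal_adult"]  # pooled tissues
        numerator = percentile(groups["bph"], 75) - background
        denominator = percentile(comparator, 85) - background
        # Skip probesets with a non-positive denominator (assumed handling).
        if denominator > 0 and numerator / denominator >= threshold:
            selected[pkey] = numerator / denominator
    return selected


# Made-up intensities for two hypothetical probesets.
expression = {
    "probe_A": {"bph": [400] * 5, "cancer": [50, 60, 70],
                "normal_adult": [40, 50, 60], "non_malignant": [20] * 5},
    "probe_B": {"bph": [100] * 5, "cancer": [90, 100, 110],
                "normal_adult": [80, 90, 100], "non_malignant": [20] * 5},
}
hits = select_probesets(expression)
print(sorted(hits))
```

In this toy input only `probe_A` clears the cutoff, because its BPH level is far above the pooled comparator after background subtraction.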

TABLE 66A: ABOUT 370 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER AND NORMAL ADULT TISSUES 



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title 

R1: Ratio of BPH tissue to prostate tumor and normal body tissue 



Pkey 



453387 
454457 
441247 
431576 
400080 
431448 



H47233 

AW450461 

M14156 



AA421773 

AI127076 

AA508353 

R49462 

AI890347 

AA441837 

AW295230 

AI990741 

AW753456 



AW854832 

AA282262 

U20158 

AA284333 

N71831 

H04607 



UnigeneID Unigene Title
Hs.30643 ESTs 
Hs.203965 ESTs 



Hs.85112 
Hs.152414 
Hs.151251 



Hs.306201 
Hs.105314 
Hs.106541
Hs.271923 
Hs.90250 
Hs.25231 
Hs.252809 



gb:qw34h07.x1 NCI_CGAP_Ut4 Homo sapiens 
ESTs 

hypothetical protein DKFZp56401278 



Hs.107410 

Hs.2438 

Hs.287631 



425312 AA354940 Hs.145958 

426140 AF131798 Hs.343768 

440911 AA909536 Hs.143562 
400533 

418310 AA814100 Hs.86693 



434485 AI623511 

453200 AA033832 

428002 AA418703 

424850 AA151057 

445444 AA380876

443361 AI792628 

421513 X00949 

439079 AF085937 

430535 AW968485 

436578 AI091435 

424051 AL110203 

421863 AI952677 

435072 AW592176 

435375 AI733610 

444609 AW571659 



Homo sapiens cDNA: FLJ22785 fis, clone K



ESTs 
ESTs 

gb:QV2-CT0261-261099-011-d11 CT0261 Homo 
Homosa itr tl rnic tr I lymphop 3l 
hydroxysteroid (11-beta) dehydrogenase 1 
Eos Control 

hypothetical protein DKFZp56401278 

NM_022739*:Homo sapiens E3 ubiquitin lig

gb:601117374F1 NIH_MGC_15 Homo sapiens c 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1764 protein, 
hypothetical protein FLJ13189 (FLJ13189) 
gb:QV2-CT0261-201099-01 1-fOS CT0261 Homo 
ESTs, Weakly similar to A46010 X-linked
lymphocyte cytosolic protein 2 (SH2 doma
Homo sapiens cDNA FLJ14269 fis, clone PL
Homo sapiens mRNA; cDNA DKFZp434E0528 (f 
ESTs 

gb:za32c04.r1 Soares fetal liver spleen , 



wingless-type MMTV integration site fami 
Target Exon 

Human EST clone 122887 mariner transposo 



Hs.283902 ESTs 
Hs.118567 ESTs



Hs.212433 ESTs 



Hs.153498 

Hs.270 

Hs.133273 

Hs.105314 

Hs.38348 

Hs.134859
Hs.138411
Hs.108972
Hs.116932 
Hs.187832 
Hs.278081 



gb:zv98c03.s1 Soares_NhHMPu_S1 Homo sapi
chromosome 18 open reading frame 1
pleckstrin homology, Sec7 and coiled/coi



gb:EST380561 MAGE 



AW511443 Hs.258110 

AA993571 Hs.129075

BE463857 Hs.151258 

AA429462 Hs.293946 



nel (chicken)-like 2 
ESTs 

gb:oc44f08.s1 NCI_CGAP GCB1 Homosa 






437752 
446495 
422906 
427726 



416706 
440129 
445238 
450582 






AW849046 

AF055634 

AA767376

D60923 

U80773 

AI359144 

AL048409 

W92147 

AA314676

AA865818 

MBB3971 

AI339732 



Hs.123642
Hs.72325 
Hs.268712 



Hs.291631 
Hs.153460
Hs.121580 
HS.14368B 
Hs.97177 



gb:IL3-CT0214-150300-085-H06 CT0214 Homo

unc5 (C.elegans homolog) c

ESTs, Moderately similar to S65657 alpha

Human EST clone 42944 mariner transposon
Homo sapiens cDNA: FLJ23031 fis, clone L
ESTs, Weakly similar to ALU1_HUMAN ALU S 



Hs.288945 hypothetical protein FLJ13448
Hs.174936 ESTs, Weakly similar to S71886



AA493652 

NM_006042

AW292769 

AI820719 

H64159 

AA907076 

BE541217 

AA452899 

H29882 



443635 AI080230 



450497 
438132 
414818 



Hs.118971 
Hs.30732 
Hs.125468 
Hs.212875 

Hs.48384 
Hs.206228 
Hs.154662 



G-rich RNA sequence binding factor 1 
ESTs 

hypothetical protein FLJ13409; KIAA1711 
ESTs 

ESTs, Weakly similar to DYH9_HUMAN CILIA
gb:nh05d12.s1 NCI_CGAP_Thy1 Homo sapiens
heparan sulfate (glucosamine) 3-O-sulfot 



452843 
427302 
452031 
423789 
417173 
420931 
454024 



433433 
427908 
432765 
413525 
420355 
427521 



AI796769 
AA400540 
AA741314 
AK002084 
U61397 

AA993527 

AA429750 
AI939456
AA424266 
AA484207 
AA581404 
AI671141 
AA521307 



AL049242 

AA324808 

AW296971 

AA029994 

AI093155 

AA130914 

AV647908 

AK000101 

AW977556 

AA708205 

W02640 

AA255652 

M37762 



Hs.15328 ESTs
Hs.122060 ESTs
Hs.23606 ESTs 

Hs.213586 ESTs, Weakly similar to KIAA1353 protein 
ESTs 

gb:wd86g05.x1 NCI_CGAP_Lu24 Homo sapiens 
Hs.134214 ESTs 
Hs.161889 ESTs 
Hs.208320 ESTs 

Hs.135282 Homo sapiens cDNA FLJ11554 fis, clone HE 
Hs.865 RAP1A, member of RAS oncogene family 
Hs.132851 hypothetical protein FLJ11222
Hs.81424 ubiquitin-like 1 (sentrin)
Hs.100431 small inducible cytokine B subfamily (Cy
Hs.293907 hypothetical protein FLJ23403 

C7001664:gil1269B051|dbj|DAD21849.1|(AD 
general transcription factor IIIA



ESTs 
EphA3 



Hs.75113 
Hs.160870 
Hs.123642 
Hs.211867 
Hs.289037 
Hs.211122 
Hs.186651 ESTs 
Hs.14411 ESTs 
Hs.141693 
Hs.234794 
Hs.193576 
Hs.180610 
Hs.61523 
Hs.95420 
Hs.183291 



cDNA FU14135 fis, clone IA\ 



ESTs 
ESTs 

JM27 protein 
zinc finger protein 268 
Homo sapiens cDNA: FLJ23285 fis, clone H 
hypothetical protein FLJ20094 
ESTs, Weakly sin " 



Hs.16247 
Hs.40730 



AW968263 
AW973352 
AI911066 
AW803357 



Hs.27252 
Hs.127780 
Hs.256261 
Hs.207077 



gb:IL2-UM0079-090300-050-A08 UM0079 Homo 



ESTs, Moderately similar to ALU8_HUMAN A



413081 
404967 
427203 



AW207860 
AI741053 
AA827640 

BE064415 

AW629517 



Hs.244855 ESTs 



Target Exon 

gb:RC4-BT0311-241199-012-b03 BT0311 Ho
Target Exon 






Hs.156080 ESTs 



435136
416760 
421312 
441568 
441736 
448882 
434222 



435878 
447530 
412988 
410196 






R27299 

H85182 

AA824627 

AI733322 

AW292779 

AF119886
AA767526 
AA868555 



Hs.191327 ESTs, Highly similar to KIAA1102 pro



Hs.127176 

Hs.8182 

Hs.22404 

Hs.283941 

Hs.22030 

Hs.178222

Hs.20152 

Hs.248865 



414441 AA234759 
425810 AI923627 Hs.31903 
409111 AL043352 Hs.7984 



AW969118 Hs.108144 
Z43062 

H69043 Hs.224961 
H91679 
AW448930 



431447 AA505138



Hs.5415 
Hs.189311 
Hs.291341 



AA999723 Hs.129607 ESTs



AW298141 
AW998716 
AI918950 
AA227879 
BE173515
AA149121



442447 
403242 
433908 
454037 
432101 
418759 
452462 
448568 
447785
451746 
436345 AA873008 



AA853410

AA715026

AW835767

AA495800

AF220264

AL133990

AW451477 

X07876 

D16471 



protease, serine, 12 (neurotrypsin, moto 
Homo sapiens PR02591 mRNA, complete cds
paired box gene 5 (B-cell lineage specif 



gb:hn42h03.x1 NCI_CGAP_RDF2 Homo sapiens
hypothetical protein FLJ10808
ESTs, Moderately similar to KIAA0877 pro
KIAA0888 protein

gb:RC4-BT0311-241199-012-c08 BT0311 Homo



Hs.156147 ESTs 



Homo sapiens cDNA FLJ13458 fis, clone PL
ESTs, Weakly similar to unnamed protein
gb:HSC12E041 normalized infant brain cDN
Homo sapiens cDNA FLJ14366 fis, clone HE
gb:yv04a07.s1 Soares fetal liver spleen
ESTs 

ESTs, Weakly similar to putative p150[H 



407437 
431474 
448004 



Hs.157975 ESTs



Hs.187621 ESTs 



Hs.71947 
Hs.340375 
Hs.311258 
Hs.121572
Hs.282700
Hs.93557
Hs.135280



gb:RC2-HT0560-210230-012-f03 HT0560 Homo 



Hs.190642 
Hs.257456 
Hs.89791 
Hs.121571 



gb:QV4-LT0016-240200-11D-b08 LT0016 Homo
gb:zw05b07.s1 Soares_NhHMPu_S1 Homo sapi
gb:Homo sapiens MOST-1 mRNA, complete cd
CEGP1 protein 



Human mRNA, Xq terminal portion



Hs.158961 ESTs



AI203405 
U91935 
H71739 
AI021987 
AA010429 



51193 N29850 

J20026 AI831190 

26917 AA913814 

,29430 AI381837 

133563 AI732637 

W3744 AI084326 

156373 BE247706 
AW973708 



401132 
421105 
435177 
434763 



Retina-derived POU-domain factor-1 



Hs.47831 
Hs.233321 
Hs.200227 
Hs.59970 ESTs 
Hs.191939 ESTs 



Hs.3623 
Hs.44098 ESTs 
Hs.166676 ESTs 
Hs.172854 DKFZP586B0923 pre 
Hs.155335 ESTs 
Hs.277901 ESTs 
Hs.271546 
Hs.89751 



ESTs, Weakly similar to I78885 serine* 
membrane-spanning 4-domains, subfamily A
Homo sapiens cDNA FLJ13446 fis, clone PL 
C12000517*:gi|4758712|ref]NP_004659.1 1 £ 



Hs.42936 ESTs 



gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens
ESTs, Moderately similar to ALU7_HUMAN A 
t ns, clone IMAGE:36 I IA 

Target Exon 









428647 
412775 
429643 
443324 



404848 
408480 
412494 
417435 
430403 
435340 
441596 
442231 
447124 
450297 
453682 



AA709046 
M455889 
R44013 
AW204994 



Hs.5232 
Hs.153287 
Hs.124344
Hs.27552 
Hs.167279 

Hs.253450
Hs.293971 
Hs.293399 



Hs.206768 
Hs.222413 
Hs.17428 



Hs.170278 ESTs 



ESTs 
ESTs 

ENSP00000240769*:BG153O3.1 (similar to C 
fibroblast growth factor 7 (keratinocyte 
ADP-ribosylation factor domain protein 1
carbonic anhydrase III, muscle specific
tumor necrosis factor (ligand) superfami
ESTs 
ESTs 
ESTs 

RBP1-like protein 
hypothetical protein FLJ23342 
gb:yd71e08.r1 Soares fetal liver spleen 



459646 AW883968 



431408 
438801 
445432 
450003 
448108 



418985 
418719 
455710 



Hs.220913 
Hs.192817 
Hs.181035
Hs.194024
Hs.159471
Hs.188578
Hs.146381
Hs.211933
Hs.192232
Hs.321190 



AA504757 
AA825971 
AV653771 
AA777809 
AW300021 
AB007948 
AW274439 
AA205273 



AI042330 
AW975590 
BE072049 
AA258245 



hypothetical protein MGC11296 



gb:QV3-OT0053-29030O-135-c04 OT0063 Homo 



Hs.191995 
Hs.170685 
Hs.158244 
Hs.252709 
Hs.177011 
Hs.197505 
Hs.17585
Hs.143587 
Hs.87128 
Hs.161707 

Hs.127573 



gb:AV653771 GLC Homo sapiens cDNA clone 
ESTs 

KIAA0479 protein 

hypothetical protein 

KIAA0801 gene product 
ESTs 

hypothetical protein FLJ23309 
ESTs 



444246 
453973 
434497 
439306 
420608 
418986 
438118 
447516 
449570 
424994 
417675 



W05355 
AA001793 
AW954525 



Hs.53126 ESTs 
Hs.102500 
Hs.10710 
Hs.61993 
Hs.136580 

Hs.103104 
Hs.81796 
Hs.346690 



rdJ511E16.2 
hypothetical protein FLJ20417
ESTs, Weakly similar to ALU1_HUMAN ALU S



hypothetical protein FLJ14751 
gb:zh86c06.r1 Soares_fetal_liver_spleen_
gb:EST366595 MAGE resequences, MAGC I 
Hs.3781 similar to murine leucine-rich repeat pr

Target Exon 
Hs.200481 ESTs 
Hs.99200 ESTs 

Hs.94830 ESTs, Moderately similar to T03094 A-kin 



AI031771 
X56411 
AA628517 
AL080116 



Hs.260274 

Hs.132586 

Hs.1219 

Hs.118502

Hs.74420

Hs.190173

Hs.52620 



alcohol dehydrogenase 4 (class II), pi p 
ESTs 

origin recognition complex, subunit 3 (y 
ESTs, Weakly similar to A46010 Winked 
integrin, beta 8 



histidine decarboxylase 

gb:IM3e01.x1 NCI_CGAP_Lu24 Homo sapiens
gb:RC3-ST0186-300100-017-b03 ST0186 Homo









434512 
452102 
433444 
410821 
415861 
422299 
432527 
427773 
441817 



AA454993 

AW139932 

U04343 

AW975324 

AI114811 



Hs.138343
Hs.188941 
Hs.27954 
Hs.129816 



CD86 antigen (CD28
ESTs

ESTs, Weakly similar to T00365 hypotheti



Hs.114556 hypothetical protein FLJ20174 
Hs.102754 ESTs 
Hs.98124 ESTs 



418037 
419197 
420179 



Hs.268921 
Hs.86447 
Hs.27441 
Hs.21168 



gb:Homo sapiens F-box protein, Fbl3b (F3L 



447078 
435021 
408832 
450580 
432319 
453713 
445784 



U62027 

AI149998 

AW885727 

AA922192 

AW085690 

N40087 

AW510770 

R20640 

AI253155 

T96118 

AI217097 

AA810867 



Homo sapiens PR01 082 mRNA, col 
NM_030813*:Homo sapiens suppressor of po 
gb:MR1-ST0111-111099-003-f04 ST01
complement component 3a receptor 1



Hs.146346 ESTs



ESTs, Weakly similar to Z195_HUMAN ZINC



gb:qd43h07.x1 Soares_fetal_heart_NbHH19W



AI419991 
H73972 
AA426117 
AA045084 



449913 
454096 
419622 
449745 
428412 



Hs.145225 
Hs.134460
Hs.155543
Hs.29725 
Hs.245022 
Hs.124813 



hypothetical protein FLJ13197 
ESTs 

hypothetical protein MGC14817 
ESTs 

gb:CM0-CT01 03- 1 2089G-037-g07 CT0103 Homo 



AA428240 Hs.126083 

AI039624 Hs.98388 

AW794931 Hs.100861 

AA521136 Hs.190176 

AA017410 Hs.40568 



hypothetical protein FLJ14600



Unique Eos probeset identifier number
Genbank accession numbers 
CAT Number Accession 



1247077.1 
1342150J 



416009 
418948 
420111 



AW803357 AW803423 AW812233 R06814 

AW808361 AW808404 AW808386 AW808594 AW808654 AW808813 AW808551 AW80B675 AW808350 AW808406 AW808694 AW808934 
AW808829 AW808385 AW808422 AW80B401 AW808409 AW808760 AW808863 AW808521 AW808539 AW808609 AW806472 AW808739 
AW808704 AW808558 AW80S714 AW80B420 AW8 
AW846433 AW846159 AW846377AW846528 

AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW848140 AW848571 AW848009 AW848067 AW848069 

AW848905AW848214 

BE046680BE046738BE044958 



BE145899 BE145848 BE145849 BE145853 BE145927 BE145925 
Z43062R13213H14422 

AI217097AW8860O' A'38 35\< ! 7 12 V.232835 AW93B043 
AA255652AA280911 AW967920 M252684 

BE258835 AW968316 AA258918 AW843305 R14744 AI580388 BE071923 R36280 

M337221 M336756AW966196 

AW954525 AI372685 AA349501 A1372687 H10564 

AA418703 AA41871 1 BE071915 BE071920 BE371912 
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WO 03/042661 






428342 290035.2 AI739168 M426249 AI199636 AW505198 AW977291 AA824583 AA883419 M724079 AI015524 AI377728 AW293682 AI928140 M731438 

AI092404 AI085630 AA731340 

430535 31 9643 J AW958485 AW968670 AA480922 BE350425 

432765 353907J AJ003429 AJ003367 M564825 

433523 368873.1 H29882 AW665533 AW149901 AI57291 7 AA598500 AI686466 AI336390 AW864390 AW864320 

434763 392847 1 AA648618 AW974389H51771 

436295 41 733 J N73895 AJ001872 

439092 468554J AA830149 AW978407 M85983 AW503637 

439306 470B8 1 BE220199 W01813 AF086118 N70760 BE221405 

440947 505904J AA910403 AI815593 W58361 AW1B2520 AI816550 

442481 543588 1 N99828 BE079873 AI1 10738 AF074645 

445432 63943 T AV653771 BE089370 

448044 747196J AI458682 H24240 R14537 R18426 AW8670B2 

449570 81018 1 AA001793 AA001871 

449745 814534 1 AI668593 AI820774 R86205 H39971 H221 77 H26241 

450317 831956J AI692689 R1 4223 R1 8395 

450580 83929 1 N40087 H12925AA460779 AA096372 BE160847 AW81B390 AW81S893 AW816577 AW816891 AW816882 AW81S8B8 AW81BB89 AW816940 
AW816892 AW816941 AW816578 AA029183 T83320 H86850 M010295 BE160823 Z25353 AA150B83 AW8B7764 AW023806 AW022095 
AA164518 AA730973 W00417W65303 

450582 83933.1 AI339732 AA010300 AW515041 AA7B8334 N29863 AA425874 AA425118 AA865829 AW93B878 

450687 84327.1 AA495800 AA495737 AA010736 AA654716AA640726 

452462 918580J BE173515 BE173560 AI902860 

453682 977454.1 T79703 T96307 AL079725 

454037 996287.1 AW998716AW022148N68020 

454096 1007449.1 AW062757 AW176890 AW062758 AW1 76395 AW1 76869 AW1 76900 AW176897 AW176928 AW17B868 AW176892 

454171 1049240.1 AW854832 AW854798 AW854857 AW854816 AWB54834 AW854817 

454457 1207274.1 AW753456 AW753036 AW854868 AW854862 

454665 1 228599.1 AW81 2866 AW81 2746 AW812747 AW812884 AW81 2763 AW31 2722 

454860 1237732 1 AW835767 AW835537 BE160187 

454968 1247029 1 AW849046 AW847956 AW849039 AW847957 AVV848279 AWB48698 AW849034 AW849033 
455067 1252050.1 AW854538 AW854418 AW854412 

455276 1272541 1 BE176479 BE176678 BE17B357BE176550AW886079BE176676BE176615 BE176555 BE176489 BE176610 BE1763B2 
455646 1348557.1 BE064420 BE064435 BE064429 BE064414 BE064400 BE064517 

455710 1352368.1 BE072049 BE069471 BE069489 BE069478 BE069479 DE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464 
457374 328758.1 AA493662 AW897396 BE1 54814 
458912 823104.1 AI911066 AI933734 AI68088BAJ003599 



TABLE 66C 



ed The DNA sequence of 



Unique number corresponding to an Eos pre 



401132 
402145 
403242 
403481 
403510 
403667 
404003 
404561 
404592 



277132-277595 

147703-147896 

85679-85795 

113086-114800 

11297-12511 

93496-93633 



1344-1442,1545-1697 
198349-199096 
69039-70100 
39067-39225 

23955-24034.25143-25264 

89944-90729 

154015-154123 

28556-28684 

44654-45210 

43310-43462 

101028-101174 

11552-11686 



TABLE 67A: 689 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES

Table 67A lists about 689 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneChip
array that showed some expression in human xenograft tumors and cell lines. The 689 genes were selected such that the ratio of "average" prostate cancer to "average" normal adult
tissues was greater than or equal to 2.5. The "average" prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average" normal adult tissue level was set
to the 85th percentile amongst non-malignant tissues. To remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst
non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
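
The selection rule described above can be illustrated with a short sketch. This is a minimal example under stated assumptions, not the procedure actually used to build the table: the function names are hypothetical, the linear-interpolation percentile is an assumed definition (the text does not say which percentile convention was applied), and the handling of a non-positive denominator is likewise an assumption. Only the 75th, 85th and 10th percentiles and the 2.5 cutoff are taken from the text.

```python
import math
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Return the pct-th percentile (0-100) by linear interpolation.

    Assumption: the text does not state a percentile convention, so a
    common interpolating definition is used here for illustration.
    """
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(tumor: Sequence[float], normal: Sequence[float]) -> float:
    """Background-subtracted ratio of 'average' tumor to 'average' normal.

    tumor  -- expression levels of one probeset across prostate cancers
    normal -- expression levels of the same probeset across
              non-malignant tissues
    """
    # Gene-specific background: 10th percentile of non-malignant tissues.
    background = percentile(normal, 10.0)
    # "Average" tumor level: 75th percentile amongst prostate cancers.
    numerator = percentile(tumor, 75.0) - background
    # "Average" normal level: 85th percentile amongst non-malignant tissues.
    denominator = percentile(normal, 85.0) - background
    if denominator <= 0:
        # Assumption: the text does not cover this case; treat it as an
        # unbounded ratio when the tumor signal exceeds background.
        return math.inf if numerator > 0 else 0.0
    return numerator / denominator


def is_up_regulated(tumor: Sequence[float], normal: Sequence[float],
                    cutoff: float = 2.5) -> bool:
    """Apply the 'greater than or equal to 2.5' selection criterion."""
    return tumor_to_normal_ratio(tumor, normal) >= cutoff
```

For instance, with hypothetical tumor levels `[10, 20, 30, 40, 50]` and normal levels `0` through `10`, the background is 1, the numerator is 40 - 1 = 39, the denominator is 8.5 - 1 = 7.5, and the ratio of 5.2 passes the 2.5 cutoff.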

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue













Pkey ExAccn UnigeneID Unigene Title R1

448999 AF179274 Hs.22791 transmembrane protein with EGF-like and 73.2
426747 M535210 Hs.171995 65.0
432441 Hs.163484 ESTs 56.0
446057 Hs.149358 Trp-p8 transient receptor potential cati 55.0


400302 
414569 


N4B056 




folate hydrolase (prostate-specific memb 


41.4 


432240 


AI694767 


Hs.129179


prostate cancer associated protein 1
Homo sapiens cDNA FLJ13581 fis, clone PL
ESTs 


40.3 


419526 


AI821895 


Hs.193481


35.9 


400298 


AA032279 


Hs.61635 




35.7 


400299 






kallikrein 3, (prostate specific antigen 


34.9 


453370 


AI470523 


M 1 


ATP-binding cassette, sub-family C (CFTR 


33.9 




AA125985 




thymosin, beta, identified in neuroblast 


32.8 


424099 




Hs.139336


ATP-binding cassette, sub-family C (CFTR 




400292 


AA250737 


Hs.72472 


BMP-R1B 


2s!a 


400287 




Hs.181350 


kallikrein 2, prostatic 


28.8 


425075 


M506324 


Hs.1852


acid phosphatase, prostate 


28.6 


409361 


NM_005982


Hs.54416 


sine oculis homeobox (Drosophila) homolo 


27.8 


415539 


AI733881 


Hs.72472 


BMP-R1B 


27.0 




AL135623


Hs.193914 


KIAA0575 gene product 






AK000631 




hypothetical protein FLJ20624






W68815 


Hs.301885 


Homo sapiens cDNA FLJ11346 fis, clone PL 


24.8


407168 


R45175 


Hs.117183


ESTs 


24.5 


400296 


M305627 


Hs.139335 


ATP-binding cassette, sub-family C (CFTR 


23.6


428336 


AA503115 


Hs.183752




23.5 


433043 


W57554 


Hs.125019 


lymphoid nuclear protein (LAF-4) mRNA


23.4 


403047 






NM_005656*:Homo sapiens transmembrane pr


23.4 




AK002126 


Hs.11260


hypothetical protein FLJ11264


22.0 


423073 


BE252922 


Hs.123119 


MAD (mothers against decapentaplegic, Dr


21.6 


407709 


M456135 


Hs.23023 


ESTs 


21.5 


401424 






NM_001172:Homo sapiens arginase, type II


20.6 


448290 


AK002107 


Hs.20843 


I - p cDNA FLJ1 1 I lei L 


19.7 


434666 


AF151103 


Hs.112259


T cell receptor gamma locus 


18.5 


415989 


AI267700 




ESTs 


17.9 




AA861697 




ESTs 


17.8 


451939 


U80456 


Hs.27311 


single-minded (Drosophila) homolog 2


16.8 


431548 


AI834273 


Hs.9711 


novel protein 




425628 


NM_004476




folate hydrolase (prostate-specific memb 


16.5 


428862 


NM_000346


Hs.2316


SRY (sex determining region Y)-box 9 (ca 


16.5 


450096 


AI682088 


Hs.79375 


holocarboxylase synthetase (biotin-[prop


16.5




AW302885 






16.1 


411096 


U80034 




mitochondrial intermediate peptidase 


15.7 


435677 


AA694142 


Hs.293726


E T We klysil lar tc fSGA F T rEST! 


15.3


419743 


AW408762 




Homo sapiens clone 24416 mRNA sequence 


15.3 










15.1 


427958 


AA418000 






14.1 


418829 


AA516531 


Hs.55999 


NK homeobox (Drosophila), family 3, A 


13.9 


426501 


AW043782 


Hs.293616 


ESTs 


13.9




AB033070 




KIAA1244 protein




418961 


AW967646 


Hs.23023 








AW175665 


Hs.278695 










Hs.193465
















429220 


AW207206 




ESTs 


12.7


401451 






NM_004496*:Homo sapiens hepatocyte nucle


12.4 




AW368377 


Hs.137569


tumor protein 63 kDa with strong homolog




419078 


M93119 


Hs.89584 


12.3 




AA397658 




Homo sapiens cDNA FLJ13598 fis, clone PL


11.9 




AW292053 




chromosome 1 open reading frame 21 


11.8 


408826 


AF216077 


Hs.48376 


Homo sapiens clone HB-2 mRNA sequence 




429918 


AW873986 


Hs.119383


ESTs 


11.3


427212 


AW293849 


Hs.58279 


ESTs, Weakly similar to ALU7_HUMAN ALU S




412446 


AI768015 




ESTs 




433404 


T32982 










X78592 


Hs.99915 




10.9 




NM_00352f 


Hs.2178




10.6 




AW972917 




alpha-methylacyl-CoA racemase 


10.4 




AI199268 


Hs.19322 


Homo sapiens, Similar to RIKEN cDNA 2010






AI088489 


Hs.83937 






430226 


BE245562 


Hs.2551 


adrenergic, beta-2-, receptor, surface 


10.2


433927 


AI557019 


Hs.116467


small nuclear protein PRAC 


10.2 


408000 


L11690 


Hs.198689 


bullous pemphigoid antigen 1 (230/240kD) 


10.1 


449625 


NM_014253 


odz (odd Oz/ten-m, Drosophila) homolog 1


10.1 




AF003522 


Hs.250500 


delta (Drosophila)-like 1


10.0 


439444 


AI277652 


Hs.54578 


ESTs, Weakly similar to I38022 hypotheti


9.8 


437718 




Hs.196779




9.8 


440529 


AW207640 


Hs.16478 


Homo sapiens cDNA: FLJ21718 fis, clone C


9.7 


453160 


AI263307 




H2B histone family, member L 


9.7 


432715 


AA247152 


Hs.200483 


ESTs, Weakly similar to KIAA1074 protein 


9.7 


450325 


AI935962 


Hs.91973 


ESTs 


9.6 


434170 


AA626509 


Hs.122329 


ESTs 


9.6 


428600 


AW863261 




hypothetical protein DKFZp434K1421 


9.6 
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447033 
401747 
434423 
411887 
457211 
415263 
432729 

432101 
451640 
428046 
420218 






424800 
428728 
418564 
442049 
429597 
410889 
443180 



432586 
431325 
435047 
447476 



425465 
450377 
451418 
418250 
416111 



433293 
452340 
451027 
436063 
440749 



AI357412 Hs.157601 ESTs 



NM_006769 Hs.3844 

AW182924 Hs.128790 

AW972565 Hs.32399 

AA948033 Hs.130853 

AK000292 Hs.130732 

N62293 Hs.45107 

AI918950 Hs.123642 

M195601 Hs.26771 

AW812795 Hs.337534 



ESTs, Weakly similar to S51797 «a 
ESTs 

hypothetical protein FLJ20285



BE543269 Hs.50252 

M429834 Hs.151791

AL157545 Hs.173179

AU076405 Hs.29981 

AW967109 Hs.13804 

N95796 Hs.278595 

NM_004354 Hs.79069 

H63010 Hs.5740 

BE247675 Hs.18442 

AA641092 Hs.257339 

AA909249 Hs.112282

AI087412 Hs.143611 

AF216751 Hs.26813 
AA554458 



AL035588 Hs.153203 

NM_016625 Hs.191381

AA631143 Hs.278695

AA310393 Hs.190044

NM_003816 Hs.2442

X91662 Hs.66744 

R15875 Hs.258576 

AW975944 Hs.237396 

U81599 Hs.66731 

AI655499 Hs.161712 

AI921750 Hs.144871

NM_002742 Hs.2891

AB032959 Hs.318584 
AA568548 

AW026751 Hs.5794 

AA454985 Hs.54973 

BE293466 Hs.20880 



mitochondrial ribosomal protein L32 

bromodomain and PHD finger containing, 3
solute carrier family 26 (sulfate transp
hypothetical protein dJ462023.2
Homo sapiens prostein mRNA, complete cds

ESTs, Weakly similar to I38022 hypotheti
solute carrier family 30 (zinc transport
ESTs, Weakly similar to 2004399A chromos
CDA14 

KIAA0666 protein 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
MyoD family inhibitor 
hypothetical protein 

Homo sapiens prostein mRNA, complete cds 



Homo sapiens cDNA FLJ13752 fis, clone PL 

protein kinase C, mu 

novel C3HC4 type Zinc finger (ring finge

ESTs

ESTs, Weakly similar to 2109260A B cell
cadherin-like protein VR20
ESTs, Weakly similar to I38022 hypotheti
ESTs, Weakly similar to ZN91_HUMAN ZINC



AA622037 Hs.166468

AA805132 Hs.159142

AF010258 Hs.127428

U41514 Hs.80120

NM_001851 Hs.154850
AF075009 

AW247430 Hs.84152 

U71207 Hs.29279 eyes absent (Drosophila) homolog 2

AA279490 Hs.86368 calmegin 

NM_004917 Hs.218366

AW377314 Hs.5364 

L18964 Hs.1904 



BE387790 Hs.26369 

U29926 Hs.83918 

AA033813 Hs.79018 

NM_003245 Hs.2022 

AI420611 Hs.153934

W26276 Hs.136075

AF013758 Hs.109643

AF007835 Hs.32417

NM_002202 Hs.505

AW519204 Hs.40808 
AK000028 

W22335 Hs.7392 

T15803 Hs.272458 

X63629 Hs.2877 

AW993097 Hs.48617 

NM_000399 Hs.1395

AI267652 Hs.246107

AL049280 Hs.155397

AW503680 Hs.5957 

AJ297436 Hs.20166 
AI739168 

AI572438 Hs.32976 

AW022133 Hs.189838
AA770688 



transglutaminase 3 (E polypeptide, prote
ESTs 

RNA, U2 small nuclear 

polyadenylate binding protein-interactin 

hypothetical protein MGC4309 

ISL1 transcription factor, LIM/homeodoma 

ESTs 

ribosomal protein S24 
hypothetical protein MGC3199 
protein phosphatase 3 (formerly 2B), cat 
cadherin 3, type 1, P-cadherin (placenta 
Homo sapiens cDNA FLJ12540 fis, clone NT 
early growth response 2 (Krox-20 (Drosop 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr
Homo sapiens mRNA; cDNA DKFZp564K143 (fr
Homo sapiens clone 24416 mRNA sequence 
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20 
25 




445873 
412935 
415068 
434826 
415621 
416653 
433332 
455497 
418245 



447805 
434808 
453308 
421470 
418555 
404632 
450861 



410037 
410240 
403046 
419647 



423242 
434485 
407103 
414085 
422486 
422083 
439735 
420039 

423445 
433730 
425006 



451468 
429467 
451/52 



Z19448 

AF155661 

AI648602 



Hs.55468 ESTs 



splicing factor 3b, subunit 3, 130kD 
poly(A)-binding protein, cytoplasmic 1-1
tubulin-specific chaperone c
ESTs, Weakly similar to T24396 hypotheti
pyruvate dehydrogenase phosphatase 



AA112573 Hs.278695

M088767 Hs.83883

AW207440 Hs.185973

M18667 Hs.1867 

M502764 Hs.123469 

AW972512 Hs.20985 

AF226053 Hs.6617Q 

AW445167 Hs.126036 

Y15723 Hs.75295 

AW627932 Hs.302421 

AF155108 Hs.256150 

AW959731 Hs.323099 

R27496 Hs.1378 

AI417215 Hs.87159 



H18836 
AB020725 
AL157424



416276 
441021 
453171 
416795 
427615 
419440 



metallothionein 1 E (functional) 

Homo sapiens cloneTCCCTA00151 mRNA seq- 

Homo sapiens prostein mRNA, complete cds 

transmembrane, prostate androgen induced 

degenerative spermatocyte (homolog Droso

progastricsin (pepsinogen C)

ESTs, Weakly similar to AF208855 1 BM-01

sin3-associated polypeptide, 30kD 

HSKM-B protein 

ESTs 

guanylate cyclase 1, soluble, alpha 3 
Homo sapiens, Similar to RIKEN cDNA 2810 



S-adenosylmethionine decarboxylase 1 
LIV-1 protein, estrogen regulated 
hypothetical protein FLJ20041 



450164 AI239923 Hs.63931 

417318 AW953937 Hs.240845 

417153 X57010 Hs.81343 

412350 AI659306 Hs.73326 

456088 BE177320 Hs.155148 

433852 AI378329 Hs.126629 

452679 Z42387 Hs.83883 

AW372884 Hs.240770 

U41060 Hs.79135 

AW578716 Hs.7644 

R76472 Hs.65646 

AI497778 Hs.20509 

BE410107 Hs.179817

AB020689 Hs.90419

NM_012445 Hs.288126

H18835 Hs.31608 

AB016898 Hs.100469 



AF189723 Hs.106778 

AA514535 Hs.283704 

AL039402 Hs.125783 

AI523511 Hs.118567 

AA424881 Hs.256301 

AA114016 Hs.75746

BE514492 Hs.117487 

NM_001141 Hs.111256 

AI635386 Hs.142846 

NM_004605 Hs.94581 

AW9711B3 Hs.6019 

NM_014324 Hs.128749

AK002135 Hs.3542 

R49031 Hs.22627 

F13036 Hs.27373 

AA150000 Hs.137396 

D11928 Hs.76845 

AL122055 Hs.129836 

AF109302 Hs.27495 

AW005505 Hs.5464 

AW503398 Hs.293663 

NM_004477 Hs.203772



collagen, type II, alpha 1 (primary oste 
protein tyrosine phosphatase, non-recept 
hypothetical protein FLJ13231 



AI538226 Hs.32976 
AA317439 Hs.28707 
AW973003 Hs.179909



ESTs 



CGI-82 protein, PSDR1 
KIAA0882 protein 

spondin 2, extracellular matrix protein 
hypothetical protein FLJ20041
myeloid/lymphoid or mixed-lineage leukem
ESTs, Weakly similar to G02075 transcrip
ATPase, Ca transporting, type 2C, member
ESTs

DEME-6 protein
ESTs

hypothetical protein MGC13170
aldehyde dehydrogenase 1 family, member
gene near HD on 4p16.3 with homology to
arachidonate 15-lipoxygenase, second typ
hypothetical protein
sulfotransferase family, cytosolic, 2B,
DnaJ (Hsp40) homolog, subfamily C, membe
alpha-methylacyl-CoA racemase
hypothetical protein FLJ11273
ESTs 



KIAA1028 protein 

prostate cancer associated protein 7 
thyroid hormone receptor coactivating pr
ESTs, Moderately similar to I38022 hypot
FSHD region gene 1 
KIAA1171 protein 

guanine nucleotide binding protein 4 
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Hs.22920 



427308 
435706 
416854 



458440 
446416 
417873 
430598 



444108 
422890 
417379 



438825 
433233 
415276 
440146 
407819 
450402 
432527 
419733 
422134 
432435 
450680 
436420 
430122 
411031 
429259 
407813 
417067 
410227 
417958 
427176 






421829 
437252 
437967 
417061 



ESTs 



D26Q67 Hs.174905 

W31254 Hs.7045 

H40164 Hs.80296 

AI694320 Hs.6295 

T05387 Hs.7991 

U85658 Hs.61796 

AA468183 Hs.184598 

AA315267 Hs.23128 ESTs 

AI817776 Hs.236557 ESTs

AA640891 Hs.102406 ESTs

AI364997 Hs.7572 ESTs 

AI095468 Hs.135254 Homo 

AV658299 Hs.163959 ESTs 



similar to S68401 (cattle) glucose induc
Homo sapiens, Similar to RIKEN cDNA 5830
KIAA0033 protein
GL004 protein
Purkinje cell protein 4
ESTs, Weakly similar to T17248 hypotheti



AK001764 Hs.247112 

BE397354 Hs.324830 

AI620463 Hs.347408 

AJ278313 Hs.41143 

AA587775 Hs.66295 

AW963897 Hs.44743 

U80669 Hs.55999 
AA534489 

AF162666 Hs.13395

AW379130 Hs.18953

AL080235 Hs.35861

R55784 Hs.140942
Z43784 
AA196390 

AA228399 Hs.101307 

AW248508 Hs.279727 

AA436002 Hs.183161 

BE327427 Hs.79953 

AB040927 Hs.301804 

U88666 Hs.78353 

AW014231 Hs.90790 
R42185 

BE218027 Hs.89969 

AW975028 Hs.102754 

AW179019 Hs.112110 
BE218886 Hs.282070 
AF131784 Hs.25318 
AA443966 Hs.31595 
NM_013342 Hs.233765
W37943 Hs.34892
AA420450 Hs.292911
AL120247 Hs.40109
AJ001417 Hs.81086
AB009284 Hs.61152
AA767382 Hs.193417
AW381569 Hs.40334
AI580252 Hs.293246
AA340605 Hs.105887
AI791855 

AW968058 Hs.92381 
BE410092 Hs.178471 
T49951 Hs.9029 
AA249096 Hs.32793 
AW247529 Hs.6793 
AI479560 Hs.98613 
NM_016578 Hs.20509
AW500239 Hs.21187 
AK000382 Hs.274251 
AW290951 

AJ002744 Hs.246315 
U37519 Hs.87539 
AW968613 Hs.79428 
AF1556S0 Hs.300496 
AB018330 Hs.108708 



NM_004655 Hs.127337
AW385224 Hs.35198 
AB014533 Hs.33010 



Homo sapiens, Similar to RIKEN cDNA A430 
hypothetical protein FLJ10902
diptheria toxin resistance protein requi 
hypothetical protein MGC13102 



ti-PDZ-domain-contain:ng protein 
KIAA1435 protein 

NK homeobox (Drosophila), family 3, A
gb:nl76g11.s1 NCI_CGAP_Co3 Homo sap
tousled-like kinase 1 



DKFZP586E1621 protein
ESTs 

ankyrin 3, node of Ranvier (ankyrin G) 
gb:zp99b10.s1 Stratagene muscle 937209 H
Homo sapiens HUT11 protein mRNA, partial 
ESTs; homologue of PEM-3 [Ciona savignyi 



KIAA1494 protein 

SFRS protein kinase 2 

Homo sapiens cDNA: FLJ22930 fis, clo



TCF3 (E2A) fusion partner (in childhood 
KIAA1323 protein 
Plakophilin 
KIAA0872 protein 



nudix (nucleoside diphosphate linked moi
KIAA0798 gene product
DKFZP434G032 protein 
ESTs 

platelet-activating factor acetylhydrola 

Homo sapiens cDNA FLJ12292 fis, clone MA

HBVpX associated protein-8

Homo sapiens cDNA: FLJ23068 fis, clone L

hypothetical protein FLJ20375; KIAA1797 

ESTs 

UDP-N-acetyl-alpha-D-galactosamine:polyp
BCL2/adenovirus E1B 19kD-interacting pro 



ESTs, Weakly similar to ALU1_HUMAN AL
mel transforming oncogene (derived from
Homo sapiens cDNA FLJ12033 fis, clone r 
axin 2 (conductin, axil) 



AI628272 Hs.88323 

AW957137 Hs.98541 

AW612232 Hs.254835 

AA243464 Hs.294101 pre-B-cell leukemia transcription fac
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456177 


NM_012391


Hs.79414 


prostate epithelium-specific Ets transcr




W16480 




ESTs, Moderately similar to reduced expr 


432370 


AA308334 


Hs.274424 


N-acetylneuraminic acid phosphate syntha




AI186431


Hs.296638 


r i I t rentiation factor 


446157 


BE270828 




Homo sapiens cDNA: FLJ22562 fis, clone H


433339 


AF019226 


Hs.8036 


glioblastoma overexpressed


421887 


AW161450 


Hs.109201


CGI-86 protein 




AA775483 


Hs.288936 


mitochondrial ribosomal protein L9




AA442324 




H2J i i fii il i mbe ( 


436476 


AA326108 


Hs.33829 


bHLH protein DEC2 




AU076964 












NM_003071 :Homo sapiens SWI/SNF related, 


432278 


AL137506


Hs.274256 


hypothetical protein FLJ23663 




BE153855 


Hs.61460 


Ig superfamily receptor LNIR




AI871613 


Hs.28538 


H ioe en DNA FLJ2I 6 done ( 




AA046458 


Hs.95296 


ESTs 


428330 






matrix metalloproteirc l 1 1 1 1 i 


418113 


AI272141 


Hs.83484 


SRY (sex determining region Y)-box 4 




AA420564 








BE327311 




HT021 






Hs.44 


pleiotrophin (heparin binding growth fac 


427701 


M411101 


Hs.243886 


nuclear autoantigenic sperm protein (his 






Hs.33718 


Homo sapiens cDNA FLJ12641 fis, clone NT




AA306105 




Ira jpr 


439671 


AW162840


Hs.6641 


in innnlymemberSC 






Hs.301871 


solute carrier family 37 (glycerol-3-pho 




AW157098


Hs.324104 


Human DNA sequence from clone RP1-63M2 o 


432615 


AA557191 


Hs.55028 


ES \" , 1 mil 1 15437 j ne NF2 




N20617 


Hs.194397


leptin receptor 


403752 






NM_002753':Homo sapiens mitogen-activate 


427723 


AI355260 


Hs.279789 


histone deacetylase 3




AA299652 




Homo sapiens cDNA FLJ11643 fis, clone HE


440494 


BE618768 




acetyl-Coenzyme A carboxylase alpha 


426716 


NM_006379 


Hs.171921










ESTs 




T97307 




gb:ye53h05.s1 Soares fetal liver spleen


432886 


BE159028


Hs.279704 


chromatin accessibility complex 1 




BE245274 


Hs.180428


KIAA1181 protein




AI026701 




KIAA0310 gene product 




W79940 


Hs.21906 


Homo sapiens clone 24670 mRNA sequence 


414407 


AA147026 




ESTs 




BE383488 


Hs.105547


neural proliferation, differentiation an 




AW105663 


Hs.5947 


HSPC069 protein 




AW992405 


Hs.59622 


Homo sapiens, clone IMAGE:3507281 , mRNA, 




NM_014141


Hs.106552


cell recognition molecule Caspr2 




R41933 


Hs.140237


'vi U1_HUMAN ALU 


407908 


BE379758 


Hs.110853






AA426202 


Hs.40403 


Cbp/p300-interacting transactivator, wit 




F09247 


Hs.247735 




436873 


N23874 


Hs.50477 


RAB27A, member RAS oncogene family 




BE439553 


Hs.250528 


Homo sapiens, clone IMAGE:4098694, mRNA, 




AA336522 


Hs.12854 


angiotensin II, type I receptor-associat 




AW505076 


Hs.301855 


DiGeorge syndrome critical region gene 8 


443991 


NM_002250 Hs.10082




409960 


BE261944 




hexokinase 1 


433891 


AA613792 




gb:no97h03.s1 NCI_CGAP_Pr2 Homo sapiens




AL117429 


Hs.240845 


DKFZP434D146 protein 




AW022715 


Hs.162160 


ESTs, Weakly similarto AW4.HUMAN ALU S 






Hs.182982 


ESTs 67 




AI248720


Hs.114390






AI972402 


Hs.306051 


hypothetical protein MGC2648 


425263 


NM_001197 Hs.155419 


BCL2-interacting killer (apoptosis-induc




Z17805 


Hs.93564 




433345 


AI681545 


Hs.152982 


hypothetical protein FLJ13117




L24203 


Hs.82237 


ataxia-telangiectasia group D-associated








glycine cleavage system protein H (amino 


437898 


W81260 


Hs.43410 


ESTs 




AL118747


Hs.26691 






AA523172 


Hs.103135 


ESTs, Weakly similar to SFR4_HUMAN SPLIC




NM_001429 Hs.25272


E1A binding protein p300 


404210 






NM_005936:Homo sapiens myeloid-lymphoid 


434954 


AF161455 


Hs.284295 


Homo sapiens HSPC337 mRNA, partial cds 


401519 






C1 5000476*:gi| 1 273727S|reflXP_01 21 63. 1 1 


419517 




Hs.90797 


Homo sapiens clone 23620 mRNA sequence 


431585 


BE242803 


Hs.262823 


hypothetical protein FLJ10325 


427461 


AA531527 


Hs.332040 


hypothetical protein MGC13010 


453288 


AW583292 


Hs.274412 


similar to yeast Upf3, variant A 


412513 


AA322599 


Hs.5163 


ESTs, Weakly similar to AF151 840 1 CGI-8 


432579 


AF043244 


Hs.278439 


nucleolar protein 3 (apoptosis repressor


439593 


BE073597 


Hs.124853


ESTs 


451945 


BE504055 


Hs.211420 


ESTs 


434614 


AI249502 


Hs.29669 


ESTs 


445525 


BE149866 


Hs.14831 


Homo sapiens, Similar to zinc finger pro 
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408101 
425810 
443123 
447439 



AJ000098 Hs.94210 eyes absent (Drosophila) homoloi 
AW9685C4 Hs.123073 CDC2-related protein kinase 7 
AI923627 Hs.31903 ESTs 



436087 
414222 
419749 
418559 
408784 
424285 
417193 



445943 
434194 
432426 
416974 
451131 
423453 
417215 



431578 
409757 
407136 



429638 
452908 
426030 



421674 
432302 
415172 



ESTs 
ESTs 

nuclear receptor subfamily 2, group F, m 

hypothetical protein FLJ22795 

xeroderma pigmentosum, complementation g 

zinc-binding protein Rbcc728 

ESTs, Moderately similar lo T2928E hvpot 

Homo sapiens cDNA FLJ10674 fis, clone NT 

ESTs 

Homo sapiens PRO1550 mRNA, partial cds 



RALBP1 associated Ep 
fatty-acid-Coenzyme A ligase, long-chain 
CGI-09 protein



BE300296 Hs.5054 
AL135173 

X73608 Hs.93029 

AA225048 Hs.104207 

AW971350 Hs.63386 

BE207168 Hs.144630 

AI922189 Hs.288390 

NM_000380 Hs.192303

AJ272269 Hs.121429 

BE242758 Hs.190223 
AK001536 

AW898533 Hs.181574
AF119847 

AW973152 Hs.31050 

AF010233 Hs.80667 

AI267586 Hs.268012 

AW450737 Hs.128791

BE253181 Hs.81687 

BE396290 Hs.5097 
AI361896 

AA813887 Hs.188173 

AI049624 Hs.283390 

NM_016156 Hs.181326 

AB037759 Hs.261587 

NM_001898 Hs.123114

T64896 Hs.287420 

NM_004445 Hs.3796 

AB041036 Hs.57771 kallikrein 11

AW272262 Hs.225767 ESTs 

BE391046 Hs.278962 AIM-1 protein 

AA195037 Hs.169341 HTPAP protein 

AI806583 Hs.125291 ESTs

AI571940 Hs.7549 ESTs

AW600291 Hs.6823 hypothetical protein FLJ10430

AA351978 Hs.4943 hepatocellular carcinoma associated prot

AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f

AA081594 Hs.158311 Musashi (Drosophila) homolog 1

AI298501 Hs.21192 ESTs, Weakly similar to T46428 hypotheti

AI916662 Hs.211577 kinectin 1 (kinesin receptor)

AB001451 Hs.30965 neuronal Shc adaptor homolog

BE243933 Hs.108642 zinc finger protein 22 (KOX 15)

W25317 Hs.37616 Human D9 splice variant B mRNA, complete

T99949 Hs.303428 Homo sapiens cDNA FLJ14832 fis, clone OV
hypothetical protein MGC2742
Homo sapiens cDNA: FLJ21880 fis, clone H



Homo sapiens cDNA FLJ12187 fis, clone MA
ESTs, Weakly similar to 2109260A B cell
KIAA1073 protein
GCN2 eIF2alpha kinase
cystatin SN

Homo sapiens cDNA FLJ11533 fis, clone HE



447958 
409340 
420061 

417622 
420344 
446791 



AW245993 Hs.223394 

AW974911 Hs.184793

W56321 Hs.111460

AA393273 Hs.75133 

BE546846 Hs.195048 

H66220 Hs.278177 

N34895 Hs.44648 

BE266822 Hs.344097 



AI801777 

BE208843 Hs.129544

AW970602 Hs.105421

AA863045 Hs.10669

AA150797 Hs.109276

AA013051 Hs.91417

T10707 Hs.296355

AA345857 Hs.274307

AF079529 Hs.78106

AI879076 Hs.75061

M81740 Hs.75212
AA393508

AA311833 Hs.84318

AW971945 Hs.293235

AW390020 Hs.20415

AW796524 Hs.68644

BE174629 Hs.321130

AW024937 Hs.29410

NM_016569 Hs.267182

AW298163 Hs.82318 

BE463721 Hs.97101 



transcription factor 6-like 1 (mitochond 



hypothetical protein MGC15438 
ESTs 

ESTs, Weakly similar to T00050 hypotheti 



macrophage myristoylated alanine-rich
ornithine decarboxylase 1 
serologically defined colon cancer antig 
replication protein A1 (70kD) 
ESTs 

chromosome 21 open reading frame 11 
Homo sapiens microsomal signal peptic 
hypothetical protein MGC2771 
ESTs 

TBX3-iso protein 

WAS protein family, member 3 

putative G protein-coupled receptor 



W20128 Hs.296039 ESTs 



AW970976 Hs.293653 

AI693815 Hs.127179 

BE245374 Hs.27842 

AI811807 Hs.108646 



437396 
413125 
401785 
433495 
452099 
437296 
421254 
445109 
423551 
421154 
410193 
435854 
423396 
442202 
441345 
444367 



444101 
425320 
431631 
448804 
431416 
447881 
445309 



431674 
422017 
443181 
448913 
440193 
452941 
451338 
411190 
434629 
407192 
434747 
428171 



446494 
438157 
452260 
409648 



453390 
426514 
425294 
405387 
441266 
432388 
450937 
447210 

451404 
409650 
452707 
451134 



BE612992 Hs.27931



AF039916 Hs.12330

AA327598 Hs.89633 

AA284333 Hs.287631 

AJ132592 Hs.59757 

AJ278120 Hs.4996 

AI382555 Hs.127950

BE272862 Hs.106534 

AW068579 Hs.7780 

H54892 Hs.10974 

AI023428 Hs.34649 

Y07604 Hs.9235 
AI754416 

BE614070 Hs.326416 

W03011 Hs.306881 

BE161065 Hs.167531 

R19175 Hs.169793 

U29344 Hs.83190 

AA548906 Hs.122244 

AW512213 Hs.342849 



M489323 Hs.182825 

NM_015865 Hs.171731

AA9~10353 Hs.75432 

AW390282 Hs.31130 

AL079409 Hs.38176 

R89543 Hs.12942 

NM_003512 Hs.28777 

AA463276 Hs.288906 



AW451449 Hs.57749 



ESTs 

cryptic gene 

hypothetical protein FLJ11210
Homo sapiens cDNA FLJ14934 fis, clone PL
ESTs, Moderately similar to PC4259 ferri
hypothetical protein DKFZp762O076

NM_002275*:Homo sapiens keratin 15 (KRT1
alpha-2-glycoprotein 1, zinc
hypothetical protein FLJ10607 similar to

coat protein gamma-cop



Homo sapiens cDNA FLJ14269 fis, clone F

zinc finger protein 281 

putative ankyrin-repeat containing prote 



hypothetical protein FLJ22625 

Homo sapiens mRNA; cDNA DKFZp564A072 (fr 

hypothetical protein FLJ22390 

ESTs, Highly similar to S94541 1 clone 4 



AK001403 Hs.279521 

AA010200 Hs.175551

AA098901 Hs.301642

NM_003877 Hs.110776

AI039201 Hs.283316 

AA194422 Hs.22564 

AW902312 Hs.7037 

AL110347 Hs.31074 

AW612322 Hs.19131 

AA306342 Hs.69171 

AA789081 Hs.4029 



Cdc42 effector protein 3 

Homo sapiens mRNA; cDNA DKFZp564M916 (f
MSTP043 protein 

methylcrotonoyl-Coenzyme A carboxylase 2 



ADP-ribosylation factor-like 5 



GCN1 (general control of: 
likely ortholog of mouse Arkadia
hypothetical protein FLJ10846 



AA862496 Hs.28482

BE616633 Hs.170195

AF033827 Hs.155553

H15968 Hs.293845

X15218 Hs.2969

R49131 Hs.26267

AF035269 Hs.17752

AL050272 Hs.12305

AA460775 Hs.6295 

T08490 Hs.288969 

AI093823 Hs.45070 

AA318315 Hs.25999 



ribosomal protein L35 
solute carrier family 14 (urea transport
ESTs, Weakly similar to T23482 hypotheti
transmembrane 7 superfamily member 2
KIAA0606 protein; SCN Circadian Oscillat



RAB9, member RAS oncogene family 
ESTs 

estrogen receptor 1
ESTs 

bone morphogenetic protein 7 (osteogenic 



nelMAGE:3502329, mRNA, 



ATP-dependant interferon response pro:rj 

phosphatidylserine-specific phospholipas

DKFZP566B183 protein 

IT i r f pothel 

HSCARG protein 

ESTs 

hypothetical protein FLJ22195 



AL080220 I 

T54904 Hs.163780 ESTs 

AA173992 Hs.7956 

AF230924 Hs.107187 

AA830797 Hs.184760 CCAAT-box-binding transcription factor 



ESTs, Moderately similar to ZN91_HUMAN Z
401197
442607 
410297 
447963 
433101 
437546 



425174 

413435 
428293 
428180 

424395 

113 62 
429922 
449518 



408196 
424339 
416292 
451040 
445636 
419175 
422000 
441128 
446237 
431797 
406789 
423217 
423115 



AA507576 Hs.288361 

AI452973 Hs.165900

AW572317 Hs.12082 

AW074836 Hs.173984 

AW245680 Hs.9701 

AA325512 Hs.71472 

AA047685 Hs.62946 

L10333 Hs.99947 

BE280478 Hs.182695 

AI015709 Hs.172089 

AI355647 Hs.189999 

X65724 Hs.2839 

D87450 Hs.154978 

AF102546 Hs.63931 

X51405 Hs.75360 

BE250944 Hs.183556

AI129767 Hs.182874 

AW245741 Hs.58461 

AA165082 Hs.146388 

W79632 Hs.256301 

AW411479 Hs.848 

Z97630 Hs.226117 

BE395253 Hs.30861 

T53519 Hs.334692 

AK000785 Hs.199480 

AJ245820 Hs.6314 

AK001023 Hs.256549 

AA045650 Hs.53125

AW294020 Hs.117721 

R33261 Hs.6614 

AB023177 Hs.29900 

AF007555 Hs.74624 
AW371048 Hs.93758 
AL034548 Hs.43627 
BE257148 
AA179233 I 
AA324743 I 
AW105401 
AW270037 
M30599 
AA570256 
AW270515 



NM_000094 Hs.1640 

AA421973 Hs.169119 

AA262331 Hs.48376 

AI373544 Hs.331328 

AI557486 Hs.119122 



cold inducible RNA-binding protein

replication factor C (activator 1)1 (14 

fatty acid amide hydrolase 

ENSP00000229263*:HSPC213. 

Homo sapiens cDNA: FLJ22696 fis, clone H



ESTs 

hypothetical protein MGC3243 

Homo sapiens mRNA; cDNA DKFZp5BSI2022 (f 

purinergic receptor (family A group 5) 

Norrie disease (pseudoglioma)

KIAA0261 protein 

dachshund (Drosophila) homolog

carboxypeptidase E 

solute carrier family 1 (neutral amino a 

guanine nucleotide binding protein (G pr 

ESTs, Weakly similar to A35659 krueppel- 

microtubule-associated protein 7 

hypothetical protein MGC13170



H1 histone family, member 0 



ESTs, Weakly similar to A43932 mucin 2 p 
KIAA0960 protein 

protein tyrosine phosphatase, receptor t 

H4 histone family, member H 

SRY (sex determining region Y)-box 22 



ESTs 



iin L29 



KIAA0779 pre 
homeo box A10

E T Weakly simila 273 hypo - 
Homo sapiens, Similar to RIKEN cDNA 2310
hypothetical protein FLJ20280

collagen, type VII, alpha 1 (epidermolys 
ESTs, Weakly similar to T25731 hypotheti 
Homo sapiens clone HB-2 mRNA sequence 
intermediate filament protein syncoilin
ribosomal protein L13a
ESTs 

N-acylsphingosine amidohydrolase (acid c 



438821 
421091 
439414 
400263 

449051 
431615 
433037 



S70114 Hs.239489 

AI720078 Hs.291997 

AA826425 Hs.192375 
W22821 

NM_001183 Hs.6551 

AW083384 Hs.11067 

AW961400 Hs.333526 

AW295859 Hs.235860 

NMJ14158 Hs.279938 

AA304961 Hs.699

AK001231 Hs!53940 

AK000767 Hs.5111 

AW382884 Hs.204715

X89887 Hs.172350

AI267593 Hs.250535 

W79485 Hs.173980 

L32832 Hs.101842

AW270608 Hs.170195 

AF154335 Hs.79691

AA157291 Hs.21479

X56777 Hs.273790 



erase B (cyclophilin 
hypothetical protein FLJ10369
hypothetical protein FLJ20729
447050
408461 
421594 
443837 
421867 
435021 
435750 
435025 
407255 
425976 
449458 
428013 



AL135352 Hs.255883 

AA579668 Hs.41072 

AW975063 Hs.343443 

N27605 Hs.16492 

AI393666 Hs.42315 

M24470 Hs.1435 

U66515 Hs.172280 

NM_016314 Hs.17200

AB037756 Hs.45207 

R456B9 Hs.21889 

AI984625 Hs.9884 

AA481078 Hs.109045 



AA012992 Hs.256301 
C75094 Hs.334514 
AI805078 Hs.208261 



ESTs, Moderately similar to S65657 alpha 

ESTs, Weakly similar to I38022 hypotheti

serine (or cysteine) proteinase inhibito 

ribosomal protein L36 

DKFZP564G2022 protein 

p10-binding protein 

guanosine monophosphate reductase 

SWI/SNF related, matrix associated, acti 

STAM-like protein containing SH3 and ITA 

hypothetical protein KIAA1335 

Homo sapiens cDNA FLJ12978 fis, clone NT

spindle pole body protein

hypothetical protein FLJ10498

ESTs 

KIAA1089 protein 

anchor attachment protein 1 (Gaatp, yeas 
hypothetical protein MGC13170 
NG22 protein 



438682 
427515
418700 



gb:yd78d06.s1 Soares fetal liver spleen 
EBP50-PDZ interactor of 64 kD
integral type I protein

ESTs, Moderately similar to ALU5_HUMAN A



415989 

429220 
412446 



420218 
437124 
AA724542 AW102730 AA909978 AW118134 AW827241 W56431 BM127381 BF436987 AI016509 AW663972 BM127686 C15552 N63435 N51744
T98800 N56980 BG108636 N49381 R49336 D61278 BI755612 AA6Q8234 R49885 BFB50422 
AL562079 AI700926 AI470561 BF063058 AW196387 AU132984 BI064046 AI970157 R02122 H55924 AI521721 AA808206 AA725223 AI766003
AW339821 AA305951 AI287969 AW664827 

BC021085 AL527872 AL526296 AL557067 B1255090 AU1 43499 AL560356 BG823170 BE736988 AU141 388 AL580262 BI764173 AU120299 
AU141755 AU141988 AU142941 AU141330AU141784 BI770885BI91 1394 BE901426 BI918039 BG760842 BE883026 BI254740 BE732690 
BG430761 BE792868 AW328267 AL1 35173 AA102674 BF726986 AL564735 BE155962 BE1 55979 BF741 679 H67776 H59234 H89665 AW1 17774 
AW274435 L29008NM 003104U07361 BF002824 BI222S49 AI45B045 AW951537 BF196474 AI669049 AI042523 BE463928 AU160125 AU160580 
AW069877 AW015214 AI948718 BE219706 AI953605 BE217755 AI991382 AI245005 BF431 179 AI521843 BE048908 AI420449 AW275385 
AA336950 BE501521 BF740566 AA311404 AA3B4639 BI772535 BG473076 BE891298 BE24B928 BM012986 BE242693 BE901342 BG746358 
BF374053 AL564430 AU143835 A!635707 BF1 95492 AI2B0559 BF7416B5 AA385257 BE247655 W94974 BE163702 AI025167 AI827118 N78641 
AL581093AU158964AU158917AI282516AU146399A<\713947AI285028AA101228AI338522AA832316AI284986AA857926BF372568 
M670172AI753825AA171565AU159257 BM194320W93390AW132101 AA550898 BI259678 AA522554 N55172 AW013929 AI826274 AI871237 
C75260 AA934846 AA555036 AA526579 AA526466 N80270 AI536347 AW61E805 AU158875 AU15BBB3 AU159396 AI016B67 AU 159238 AI282517 
AA406317 AI285043 N53050 AA969446 N57718 H93323 W93374 AI873751 H5601 1 AI936174 AA937B30 AW438877 AI800550 AW32826B 
AI244886 M948424 AA7CC604 AA832317 AA01 1 555 H97671 AA98871 1 AW150529 N709B3 H44325 AV657614 AA902238 AA644018 AA03405O 
T71053 N71444AA831158 T40892 AA706106 

AK001536 AK056135 BM47481 3 BE887303 AK022914 AW581996 AW81 2S45 BE882302 AA1342S6 BI043873 AA019433 BI862088 BM468657 
AU128438 BE384458 AL353967 BI8571 17 BF586525 Bi465223 BM450132 AU129B77 BI222283 BG171592 B1043544 BG496295 BG750710 
BI256542 BG108520AU150719 AAM. A15542 Al 353933 AA191092 BF132635 
AF119847AA437261 AA436987 AU 32965 
AF150424AI861896AA570057AV738855 

AI801777 BE677762 AW00821 0 AW009441 BE350994 BE207949 AI091475 AIB02774 A1B27533 AI075353 AI659979 AA687855 AI078125 AI090285 
AA670058 AA602411 AA683472 AI436058 AA612B26 AI038932 BG057725 AM 67355 BF449023 AI289476 AW074381 BF97291 2 AI991780 
AA8891 19 A1537472 Z39730 AI868953 A!1 92337 BE812978 BE812939AA115248 H99006AI915784 F08973T16748 D2046B AA609899 BF081234 
AA115336 BE812B76 BE812972 BE812982 BE813006 BE813019 R43B83 BEB129B1 

AK055109 BC019085 AA187684 BG656226 BM023227 AI93231 1 AW264381 AA398371 BM021483 AI432433 AI375777 AI129580 AW262782 
AA134107 BM023515AA977504AI859222 AI348454 R69725 AA975268 BM021207 AL080074 AA129218 AW207842 N90581 AA771919 AI092259 
AI028416 AI0741 14 BG656536 BE501677 AW193419 AA917040 W90430 AI342984 AI37B957 AL0364B6 AW02006B BI491093 BF476021 R41226 
R69631 F041 25 C02343AA1 15589 R56480AI400988 R54266 R31422 

AK055915BE867252AI523348AA765350BF446858 Z43675R19529AL133837BG389444AW382942BE702956AA081961 BE835247 BE835308 
BE835295 AA376302 BE645790 AA375690 AA376592 AW954423 N85732 AA249770 AI754416 AA213816 BF592044 AAB1 1729 AW514842 
AI633485 At096810 AW183016 AI635738 N27524 BE645916 AI247842 AW991230 AI754277 H16814 AI76B892 BF108422 A1300361 T95129 
F11313 R97946 R93989 AA375242 BF109388 AI860939AI680050 AW953899 BF971486AI972337 AW953841 AA372437 AI216746 H11384 
R33484AA249043AA249732 

AW969151 AA532718 AA504784 AI791 194 AI821930 AI821485 F37127 M654206 F27974 

AK074291 AW293424 BE6761 35 AI8321 25 BE01 9146 BE465019 AI761 124 AA617778 AI279232 AW575897 AI672039 F28618 BF924261 
M722184 BF934174 BE004328 AV749301 BEB80282 BI01 979B B1019389 BF928776 AW813409 AV726604 AA077560 BE272975 BF9491 19 
AW814195 BE879126 AI697926 BF5941 55 BE205787 BF063513 N35828 AI948557 AI433839 AI379679 BG056182 AI589094 N23123 AA588805 
AW316581 AI080272 AI421980 A1493318 BF194830 N37590 AA495993 N32996 M699844 H96845 H96592 N28741 BI035539 BF747723 
BF171066 W01 350 H05495 AI243785 Z39622 AA8B7432 AI350659 R46102 
AA602964AA609200 

AW976537 AI033582 AA837085 AA745261 AA648395 

BC013939 BI494690 BI491211 A1928393 AA843540 BG938644 AU185628 BI495842 AW1 73255 BM052709 AI743999 AI690144 AI922209 AI740907 
AW340368 M928759 AW1 18737 BF513970 AA707807 BF435295 AI339463 Al 373842 AI433809 BE222392 AA602308 AA428261 AI460355 
AW662760 AI888087 AI342098 AA72241 B W781 51 N64382 BE221S48 AW025901 AA452120 AI150479 AI016166 AA779515 AA661791 BM474307 
BI91 1169 BG575154AW953303 T33304 D59141 AA385765 AA14B64B BM461961 BG6B1168 BI602483 BE8B9592 AW95431 1 BM052986 BI962893 
AI989299 Z42328 BF029504 T35668 BG402602 AU1 85770 AI023271 AA14771 9 AI434079 AI569000 AI276488 AA992453 M342821 AA648303 
A1349364 AI051 008 AA926941 AA350894 AW071451 N22249AI784138AA0B3B47 N22258 AW440B25 AA661570 AA376687 AA659125 AI356299 
R70463 AI383586 AA827189 BI494872 AW021094 B1494871 AA905500 AA460923 BI492041 AW028965 AI62461 1 BG271780 AI497723 H88862 
D59858 N89979 AA658425 N61154 D62341 AI274437 N66697 H95993 AI370663 AA728850 H05232 R59374 H12223 AI935759 A1362553 D60006 
N29S72 AI91 6833 N75273 AA1487 1 0 BI597117 BG740471 AA332671 AA333874 AA643052 AW020175 R70550 BG623469 AA452342 AW965441 
W19723 R32966BE883841 R61003 A191 0374 A1835262 R55325 AI46B927 R34681 H96211 Z39807 BF9543B6 

NM 015720 AF219137 AL534420 AL524055 AL537346 AL538442 BG765888 AL530054 AL525377 BG474596 BG473144 BE251553 BG706099 
AL538039 BG73Ji" L r,; BF805256 F12128 AL566773 BI828536 BF761480 AI204971 BG818818 BI199246AL534816 BF529941 AA324163 
AL523285 BG914330 H07952 AL534815 BE769SC3 AI867302 EM310135 AL533702 BE2544B4 BF528852 BE867462 BE740130 AL134164 
AL567115 AL533701 AL524054 AL515904 AL523284 AL568203 AL534419 BF981162 BE257148 AL561833 
445636 8561 5 BF339388 AI345516 BG391657 BE708S67 BG026034 BE261703 H56716 H65572 H93801 T48830 R96953 R96989 R39707 BE867593 AA090310

AA090672 

419175 35068J AB018322 BC012480 BI524873 AW665554 AI93446S AI479915 BF096179 BF096162 BF096132 AA744972 AI951988 AI858339 BE076331 

AA88699SAI570585 AI916F 1 QAI r8135 AA569046 AA961Q64 AI018062 H80618 BE221942 R52609 AI915164 AA36562S 

Z44671 BI052776 BFB82486 BG2861 84 AI5B9558 M931663 AA534979 AI275392 AI273455 R52553 AA829920 H80652 AA360728 F1061 8 
AW953666 AW1 76773 H85527 AA765570 AA081927 BF093262 BC-743753 AL037576 AA534314 BE814964 BE973713 N49493 BE006634 
BE006630 AW270037 M234765 AI334004 BF057179 AI857450 AI341 191 AI434143 AI917449 AW517207 AA255424 AWQ0B334 AA847572 
AA994211 AA861901 AASB1 873 At580157 AI364363 AW242357 AW235291 N56645AA319869 R36911 AA256551 AW044188 AI203159 N49403 
F02090 AI187299 AI609644 Z40516 AW952314 

441128 20932 1 BC014072 BE328850 AI356567 AI143171 A1022165 BC-149661 BF000671 AA233101 AA573721 AA447991 AW016855 AI005068 AA554071 

BF478215AA906902 AW014761 BE905651 BE512923 BM047129 AA243352 AA232991 AA127550 AA127551 AA570256 AI473237 BF033706 



A1041403Z49148 

AK0577GC BC015899 BE867103 AL526923 AL527436 BG913023 AI884867 BE85B451 AI885227 AI935218 BE645596 AI922406 AA778161 
BF34S973 AW195S53 AI3S7121 AI336147 A1091 364 AI769203 AW627B59 BE677432 BF439954 BGB20230 AA2836BS AA812396 AI358396 
AA932022 BE222881 AW016109 BM352667 BF528099 BG818616 AI935916 AA912295 BG910887 AL568301 AL567278 BI522445 BI754384 
BG81 9375 BFB35950 AA32371 S AA860955 AI089847 AL520775 AL526045 AL538507 AL567798 AL536876 AL574332 BF834531 BF3401 16 
BF835954 AL573997 AL581252 BF95?tf7 AL 58 ) 1 70 A / 352324 AL569332 AW148695 AL559234 BG034270 BI913839 AL581327 AL565842 
H43568 BF945188 AL559686 AL539326 T15431 AW895092 AL5826B4 BI519896 

Z11692 X51466 NMJM961 M19997 BI224253 BG830478 AU122147 AU123437 BG113591 BG752624 BE886804 BI868669 BG337216 AW629935 
BM016525 AI560409 AL562866 AI90917B BF849556 AA371735 BF038841 BF727115 BC006547 BG757526 AL555664 BI261304 BG770095 
BI033486BI517580 BG876486 BI01 1 828 AI313235 BGB31724 BF869862 BG998348 BI011834 BFB88337 BF89B627 BF092380 AW803215 F01241 
BF805719 BG876487 AW498536 BF98B866 BG998849 AA248724 BG829202 BG756455 BG032392 BIB59287 BM01 6990 BG332369 BE933685 
BE166758 BM452445AI937808AW026128 N23584 AW006041 AI337621 F33111 BF344301 BG105450 BG387343 BF569547 BF1 54671 
BM007368 BF569385 BE772007 BI199437 BF761700 BI251 519 BF944452 BF898506 AI038390 BM044934 AW381 142 BG743618 BE769206 
BE893973 BI015Q47 BF886479 BF761350 BE769769 BG766117 BF847365 BE397834 AW371121 BF089125 BE082996 BF1B3193 BG180964 
BF089940 BI00Q274 BG255503 BG674499 BG774174 BI015084 



TABLE 67C 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



401424 8176894 
401451 6634068 
401747 9789672 



404632 9796668 Plus 45096-45229 



59793-59968 
!4223-24428 
19926-121272 

1 8596-1 1 881 6, 1 191 19-1 19244, 1 1 9609-1 1 976 



403046 3540153 

404922 7341893 

403752 7678857 

404210 5006246 

401519 6649315

401785 7249190 



^5707-55859,56369-56511 
13248-13428 



401866 8018106 Plus 73126-73623 

405387 6587915 Minus 

401197 9719705 Plus 

403532 8076842 Minus 



169926-170121 
157315-157950 

165776-165996,166189-166314,166408-16656 
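The column layout described for Table 67C (Pkey, Ref, Strand, Nt_position) can be read mechanically once a row is intact. The following is a minimal sketch, assuming whitespace-separated fields and comma-separated `start-end` exon spans as they appear in the intact rows above; the names `ExonPrediction` and `parse_row` are hypothetical and are not part of the source.

```python
from typing import List, NamedTuple, Optional, Tuple


class ExonPrediction(NamedTuple):
    """One row of a Pkey / Ref / Strand / Nt_position table (hypothetical type)."""
    pkey: int                     # Eos probeset number
    ref: str                      # sequence source, e.g. a Genbank GI number
    strand: Optional[str]         # "Plus", "Minus", or None when the column is missing
    exons: List[Tuple[int, int]]  # (start, end) nucleotide positions of predicted exons


def parse_row(line: str) -> ExonPrediction:
    """Parse one whitespace-separated table row into an ExonPrediction."""
    fields = line.split()
    if len(fields) < 2:
        raise ValueError(f"expected at least Pkey and Ref: {line!r}")
    strand = fields[2] if len(fields) > 2 else None
    if strand not in (None, "Plus", "Minus"):
        raise ValueError(f"unexpected strand value: {strand!r}")
    exons: List[Tuple[int, int]] = []
    if len(fields) > 3:
        # Nt_position is assumed to be a comma-separated list of start-end spans.
        for span in fields[3].split(","):
            start, end = span.split("-")
            exons.append((int(start), int(end)))
    return ExonPrediction(int(fields[0]), fields[1], strand, exons)
```

For example, `parse_row("404632 9796668 Plus 45096-45229")` yields a record with one exon span, while a row whose position column was lost in extraction, such as `"405387 6587915 Minus"`, yields an empty exon list rather than an error.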



TABLE 68A: 995 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 
Table 68A lists about 995 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 [...] and cell lines. The 995 genes were selected such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 2.5. The "average" prostate cancer level was set to the 85th percentile amongst prostate cancers. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, [...] malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
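The selection rule stated for Table 68A can be sketched in a few lines. This is only an illustration under stated assumptions: the function names are hypothetical, the percentile uses linear interpolation (the source does not say which method was used), and the background value is supplied by the caller because the statistic used for it is not legible in the text above.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values, with linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must be non-empty")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def upregulation_ratio(cancer, normal, background, avg_pct=85.0):
    """Ratio of background-subtracted 'average' cancer to 'average' normal level.

    `background` is an assumption: a caller-supplied gene-specific background,
    subtracted from both the numerator and the denominator.
    """
    numerator = percentile(cancer, avg_pct) - background
    denominator = percentile(normal, avg_pct) - background
    if denominator <= 0:
        # Assumed handling (not from the source) for a non-positive denominator.
        return float("inf") if numerator > 0 else 0.0
    return numerator / denominator


def is_upregulated(cancer, normal, background, threshold=2.5):
    """Apply the 'greater than or equal to 2.5' cutoff described in the text."""
    return upregulation_ratio(cancer, normal, background) >= threshold
```

With hypothetical expression values `normal = [10, 20, 30, 40, 50]`, `cancer = [200] * 5`, and a background of 10, the "average" normal level is 44, the ratio is 190/34, and the gene passes the 2.5 cutoff.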




with EGF-like and 
3, (prostate specific antigen 
Trp-p8 transient receptor potential cati 
folate hydrolase (prostate-specific memb 



prostate cancer as: 

ATP-binding cassette, sub-family C (CFTR 
Homo sapiens cDNA FLJ13531 fis, clone PL
six transmembrane epithelial antigen of 
ATP-binding cassette, sub-family C (CFTR 
X07730

AA305627 

AA250737 



415539 
432374 
433043 
423073 
403047 
444484 
428336 
401424 
407709 
407122 



448290 
437052 
411096 



419743 
427958 
431548 



Hs.56145 
Hs.171995 
Hs.139336 
Hs.72472 
Hs.1852 



thymosin, beta, identified in neuroblast 
kallikrein 3, (prostate specific antigen 
ATP-binding cassette, sub-family C (CFTR 
BMP-R1B 



AK000631 Hs.52256 



hypothetical protein FLJ20624 
Hs.193914 KIAA0575 gene product 



Hs.117183 
Hs.181350 
Hs.72472 
Hs.301885 
Hs.125019 
Hs.123119 



NM_004476
AI267700 
AK002107 
AA861697 



AF151103 



Hs.9711 
Hs.112259
Hs.293616
AW302885 Hs.117183
NM_000346 Hs.2316
AA694142 Hs.293726
AW207206

AF216077 Hs.48376
AW967646 Hs.23023
AB033070 Hs.194408
X78592 Hs.99915 



449625 
401451 
447033 
452594 



429918 
424012 
418278 
408000 
443822 
432101 
447342 
412446 
437718 
433404 
431089 
427212 
429597 
444151 
428046 
453160 



415263 
416182 
420218 
411887 
425154 
430226 



ESTs 
kallikrein 2, prostatic 
BMP-R1B 

Homo sapiens cDNA FLJ11346 fis, clone PL
lymphoid nuclear protein (LAF-4) mRNA
MAD (mothers against decapentaplegic, Dr
NM_005656*:Homo sapiens transmembrane pr
hypothetical protein FLJ11264
microseminoprotein, beta-
NM_001172:Homo sapiens arginase, type II
ESTs
ESTs

folate hydrolase (prostate-specific memb
ESTs

Homo sapiens cDNA FLJ11245 fis, clone PL



ne 24416 mRNA sequence 



T cell receptor gamma locus 



Homo sapiens clone HB-2 mRNA sequence 
ESTs 

KIAA1244 protein 



AI357412 Hs.157601 

AU076405 Hs.29981 

AW175665 Hs.278695

AA516531 Hs.55999 

AW292053 Hs.12532 

AI820961 Hs.193465 

AW873986 Hs.119383 

AW368377 Hs.137569 

AI088489 Hs.83937 

L11690 Hs.198689 

AI087412 Hs.143611 

AI918950 Hs.123642 

AI199268 Hs.19322 
AI768015 

AI927288 Hs.196779 
T32982 

BE041395 Hs.156110 

AW293849 Hs.58279 

NM_003816 Hs.2442

AW972917 Hs.128749

AW812795 Hs.337534
AI263307

AI557019 Hs.116467

AF003522 Hs.250500

AL035588 Hs.153203

NM_003528 Hs.2178



Homo sapiens cDNA FLJ13598 fis, clone PL
odz (odd Oz/ten-m, Drosophila) homolog 1 
•■M_004496':Homosar ' 



rle earner family 26 (sulfate transp 
no sapiens prostein mRNA, complete cds 
homeobox (Drosophila), family 3, A 
umosome 1 open reading frame 21 



mg homolog 

(230/240kD) 
o 2UU4399A chromos 

to RIKEN cDNA 2010 



AA429834 
AA948033 
NM_004354
AW958037 
AW182924 
NM_001B51 



5.151791 
s.1 30853 
ls.73059 



s.1 28790 ESTs 



alpha-methylacyl-CoA rac 
ESTs, Moderately similar to I38022 hypot
H2B histone family, member L
small nuclear protein PRAC
delta (Drosophila)-like 1
MyoD family inhibitor 
H2B histone family, member Q 
KIAA0092 gene product 
ESTs 
cyclin G2 



AI935962 Hs.91973 

AI277652 Hs.54578 

AW207640 Hs.16478

AW972565 Hs.32399 

AW863261 Hs.138860 



ESTs, Weakly similar to I38022 hypotheti
Homo sapiens cDNA: FLJ21716 fis, clone C 
ESTs, Weakly similar to S51797 vasodilat 
hypothetical protein DKFZp434KH21 






WO 03/042661 PCT/US02/36810 



422728 
417333 
434423 
434170 
432729 
446100 
432715 
421 B96 
401747 
451640 
431542 
447397 



432415 
418601 
433332 
437124 
428728 



AW937826 Hs.103262



Hs.130732
Hs.13804 
Hs.200483 
Hs.45107 



AA195601
H63010 
BE247676 
AW975944 



AA310393 Hs.190044

R15875 Hs.258576
M21305

NM_003245 Hs.2022

AA909249 Hs.112282



AA279490 
AI367347
AA554458 
NM_016625 



ESTs, Weakly similar to ZN91_HUMAN ZINC
bromodomain and PHD finger containing, 3
LIM domain only 4 



Dtl r ;a! pre 
Ts, Weakly: 



similar to KIAA1074 protein



ESTs, Weakly similar to 2109260A B cell
twist (Drosophila) homolog (acrocephalos
ESTs, Weakly similar to I38022 hypotheti
mitochondrial ribosomal protein L32 



transglutaminase 3 (E polypeptide, prote 

solute carrier family 30 (zinc transport 

prostate stem cell antigen

COA14 

ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
calmegin 

Homo sapiens clone TCCCTA00151 mRNA sequ
KIAA0666 protein
hypothetical protein

gb:Homo sapiens full length insert cDNA

protein kinase C, mu

RNA, U2 small nuclear

Homo sapiens cDNA FLJ13458 fis, clone PL

ESTs 



418250 U29926 

425905 AB032959 

418329 AW247430 

450203 AF097994 

425465 L18964 

416239 AL038450 
AW377314 



AI267652 

AI921750 
U81599 
AA454985 



Hs.83918 
Hs.318581 
Hs.84152 



450164 
452744 
440774 



435047 
418564 
451027 



421928 
432586 
426108 
417153 



Hs.5364 
Hs.63931 
Hs.246107 
Hs.153934 

Hs.66731 
Hs.54973 
Hs.278695 
Hs.40808 
Hs.251946 
Hs.127428 
Hs.109643 



447476 BE293466 Hs.20880 

450377 AB033091 

429769 NM_004917 Hs.218366

416315 U41514 Hs.80120 

433323 M805132 Hs.159142 

451952 AL120173 Hs.301663 

452367 U71207 Hs.29279 

421470 R27496 Hs.1378 

420092 M814043 Hs.88045 

416111 M033813 Hs.79018 

440749 W22335 Hs.7392 

441866 BE464341 

411019 AW993097 



ESTs 

DKFZP564I052 protein 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
ESTs 

Homo sapiens cDNA FLJ13752 fis, clone PL

homeo box B13

cadherin-like protein VR20 



ESTs 

poly(A)-binding protein, cytoplasmic 1-1 
polyadenylate binding protein-interactin 



collagen, type II, alpha 1 (primary oste 
metallothionein 1E (functional) 
ESTs, Weakly similar to I38022 hypotheti
KIAA1265 protein 

kallikrein 4 (prostase, enamel matrix, p 

UDP-N-acetyl-alpha-D-galactosamine:polyp 

ESTs 

ESTs 

eyes absent (Drosophila) homolog 2 
ESTs 

chromatin assembly factor 1, subunit A ( 
hypothetical protein MGC3199 
nectin3; DKFZP566E0846 protein 
Homo sapiens cDNA FLJ12540 fis, clone NT 
early growth response 2 (Krox-20 (Drosop 
hypothetical protein FLJ20287 



Hs.21201 
Hs.43617 

421566 NM_000399 Hs.1395 
451418 BE387790 Hs.26369 
433517 AW022133 Hs.189838 ESTs 
415621 AI648602 Hs.55468 ESTs 
442592 BE566623 Hs.29899 ESTs, Weakly sir 
452340 NM_002202 Hs.505 
450244 AA007534 Hs.125062 
433293 AF007835 Hs.32417 
425259 AL049280 Hs.155397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr
434485 AI623511 Hs.118567 ESTs 
442772
425211 
431958 
408418 






M18667 Hs.1867 
X63629 Hs.2877 
AW963897 Hs.44743 



Homo sapiens clone 24416 mRNA sequence 



412350 AI659306 Hs.73826 



AA587775 
AW970976 
AA326108 
BE242803 
AF155108 
T15303 
BE177320 
AJ001443 
BE514492 
AB009284 
AI572438 
AB020725 
AI628272 

NMJM6578 
AA316181 
AW179019
415068 Z19448 

412935 BE267045 

453308 AW959731 

449048 Z45051 

410762 AF226053 

436032 AA150797 

456508 M502764 

413328 Y15723 



436476 
431585 
434808 
432103 
456088 
429084 
422486 
410227 
453439 
410037 
420297 
409648 
448143 
410268 
422134 



cadherin 3, type 1, P-cadherin (placenta

KIAA1435 protein

ribosomal protein S24

protein tyrosine phosphatase, non-recept

multi-PDZ-domain-containing protein

ESTs

bHLH protein DEC2

hypothetical protein FLJ10326

Homo sapiens, Similar to RIKEN cDNA 2810

protein phosphatase 3 (formerly 2B), cat

hypothetical protein FLJ13231

splicing factor 3b, subunit 3, 130kD 

gene near HD on 4p16.3 with homology to 



guanine nucleotide binding protein 4 



Hs.256150 
Hs.272458 
Hs.156148 

Hs.117487
Hs.61152
Hs.32976



Hs.112110 mitochondrial ribosomal protein L42
Hs.131887 ESTs, Weakly similar to T243S6 hypotheti
NM_022490:Homo sapiens hypothetical prot
tubulin-specific chaperone c



418245 
450861 
418821 



433730 
449845 
420948 

458440 
419647 

455497 



M033767 
AI523898 
M436002 
AW972512 
AW975028 
AK002135 
AW971183 
AB016393 
U79293 



Hs.75064 
Hs.323099 
Hs.22920 
Hs.66170 
Hs.109276 
Hs.123469
Hs.75295 



Hs.83883 
Hs.17617 
Hs.183161 



similar to S68401 (cattle) glucose induc
HSKM-B protein
latexin protein



419440 
435705 
431583 
442409 
453171 
437252 
422083 
429638 
403046 
447805 
443162 

437162 
419465 
440146 
427308 
409151 



W31254 
AL042613 
BE208843 



AW005505 

AW500239 

AW014231 

D26057 

M306105 

AK001403 

AL039402 



Hs.21187 
Hs.90790 
Hs.174905 

Hs.125783
Hs.31608 
Hs.7644 



guanylate cyclase 1, soluble, alpha 3 
deiodinase, iodothyronine, type II 
H2A histone family, m



Hs.102754 ESTs 
Hs.3542 
Hs.6019 
Hs.100469 
Hs.159264 
Hs.135254 
Hs.91816 



sin3-associated polypeptide, 30kD

nFU11273 
DnaJ (Hsp40) homolog, subfamily C, membe
i ymplioid or mixed-li l I jr r 
Human clone 23948 mRNA sequence 
Homo sapiens clone 1 thrombospondin mRNA



446416 AV658299 Hs.163959 ESTs 

407819 R42185 ESTs 

430387 AW372884 Hs.240770 nuclear c 

410102 AW248508 Hs.279727 ESTs; he 

429165 AW009886 



no sapiens prostein mRNA, complete cds 
degenerative spermatocyte (homolog Droso 
'in, estrogen regulated 



Hs.293659 
Hs.61289 
Hs.324830 
Hs.6295 



AL157424
BE397354 
AI694320 
AI497778 
AA411101 



AI378329 Hs.126629 
NM_013257 Hs.279696
AB020689 Hs.90419 



diptheria toxin resistance protein requi 
ESTs, Weakly similar to T17248 hypotheti 
HBVpXas: 



lar to I38022 hypot 



ESTs, Mr. 
ESTs 

serum/glucocorticoid regulated kinase-li 
KIAA0882 protein 
GL004 protein 

S-adenosylmethionine decarboxylase 1 
hypothetical protein MGC15438 

ESTs, Weakly similar to ALU1_HUMAN ALU S 



AI916662 Hs.211577 



itor) 
to sapiens tra 

DKFZP434G032 protein 

hypothetical protein FLJ12577

thyroid hormone receptor coactivating pr

Homo sapiens cDNA: FLJ23058 fis, clone L

Homo sapiens cDNA: FLJ22930 fis, clone K 

KIAA0033 protein 

SEC22, vesicle trafficking protein (S.c 
hypothetical protein FLJ20530 
DEME-6 protein 
hypothetical protein FLJ20041 
H1 histone family, member 2 
415276 U88666

400290 H18836

432435 BE218886

mm AW953937

429467 NM_004477

416276 U41060

410076 T05387

420039 NM_004605

423445 NM_014324

436420 AA443966

414085 AA114016

422048 NM_012445

427176 AW381559

450832 AW970602

448807 AI571940

420568 F09247

421685 AF189723 

427615 BE410107 

430598 AK001764 

452576 AB023177 

431724 AA514535 

409757 NM_001898

432242 AW022715 

407103 AA424881 

421154 AA284333 

400301 X03635 

426006 R49031 

452679 Z42337 

446880 AI811807 

444108 R55784 

452259 AA317439 

419083 M479560 

416354 H40164 

422890 Z43784 

428330 L22524 

419168 AI336132 

452017 AF109302 

456362 AW973003 

442501 M315267 

439735 A1635386 

451752 AB032997 

423583 AL122055 

446157 BE270828 

432363 AA534489 

433313 W20123 

408101 AW968504 

453857 AL080235 

451982 F13036 

447574 AF162666

429299 AI620463 

420522 AW957137 

417379 AA196390 

411031 W37943 

431725 X65724 

441224 AU076964 

407813 AL120247 



hypothetical protein FLJ20041



protocadherin alpha 10
ATPase,Catra
CGI-82 protein, PSDR1
hypothetical protein FLJ10902




ESTs, Weakly similar to ALU4_HUMAN ALU S

hypothetical protein MGC13170

Homo sapiens cDNA FLJ1426S fis, clone PL



cDNA FLJ14934 fis, clone PL
ESTs

signal sequence receptor, gamma (translo
Homo sapiens cDNA FLJ12292 fis, clone MA
Purkinje cell protein 4 
ankyrin 3, node of Ranvier (ankyrin G) 

i r t i ( l 17 / 

Homo sapiens cDNA FLJ12641 fis, clone NT
prostate cancer associated protein 7
hypothetical protein FLJ22995
ESTs 



KIAA1028 protein 

Homo sapiens cDNA: FLJ22562 fis, clone H 
gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens 
ESTs 

CDC2-related protein kinase 7

DKFZP586E1621 protein 

Homo sapiens mRNA; cDNA DKFZp56401763 (f 

tousled-like kinase 1 

hypothetical protein MGC13102 

hypothetical protein 

gb:zp99b10.s1 Stratagene muscle 937209 H 

KIAA1323 protein 

Norrie disease (pseudoglioma)

calumenin 

KIAA0872 protein 

ESTs, Weakly similar to T00050 hypotheti 
guanine nucleotide binding protein 4 
ESTs, Weakly simil I ! 
glycine cleavage system protein H (amino 
NK homeobox (Drosophila), family 3, A 
ESTs 

KIAA0261 protein 



ESTs 

eukaryotic translation initiation facto
phosphodiesterase 9A 
ESTs, Moderately similar to 
phosphoinositide-specific pi , 
Homo sapiens HUT11 protein mRNA, partial 
calcium/calmodulin-dependent protein kin
putative transcription regulation nuclea
eyes absent (Drosophila) homolog 1
topoisomerase (DNA) II binding protein 



AA046458 Hs.95296 

AA249096 Hs.32793 

AW968613 Hs.79423 

U37519 Hs.87539 
AW362955 

AB040927 Hs.301804 



418004 
419733 
433233 
439979 



417958 AA767382 



ESTs 
ESTs 

BCL2/adenovirus E1B 19kD-interacting pro
aldehyde dehydrogenase 3 family, member
Homo sapiens cDNA FLJ14415 fis, clone HE
KIAA1494 protein
hypothetical protein FLJ10430
UDP-N-acetyl-alpha-D-galactosamine:polyp
ESTs 



AW505076 Hs.301855 

BE140396 Hs.21621 

AI923627 Hs.31903 

NM.0M110 Hs.172028 

BE153855 Hs.61460 

AA420450 Hs.292911 

BE245374 Hs.27842 

BE277414 Hs.5947 



417067 AJ001417 



T97307 

AI580252 Hs.293246 

AW968058 Hs.92381 

AW020782 Hs.79881 



411145 
449459 
421662 



a disintegrin and metalloproteinase doma
Ig superfamily receptor LNIR
Plakophilin

hypothetical protein FLJ11210

mel transforming oncogene (derived from 

mitochondrial solute carrier 

solute carrier family 22 (extraneuronal 



gb:ye53h05.s1 Sc 
ESTs, Weakly similar to putative p150 [H
nudix (nucleoside diphosphate linked moi
Homo sapiens cDNA: FLJ23006 fis, clone L



Hs.101760 ESTs 



platelet-activating factor acetylhydrola 



chromatin ac 
Homo sapiens, Similar to RIKEN cDNA 23
Homo sapiens cDNA FLJ11643 fis, clone HE
axin 2 (conductin, axil)
putative ankyrin-repeat containing prote
histone deacetylase 3
TCF3 (E2A) fusion partner (in childhood 
roundabout (axon guidance receptor, Dros 
nuclear autoantigen 

ESTs, Weakly similar to Homolog of rat Z
hypothetical protein FLJ20375; KIAA1797
ESTs, Weakly similar to T46428 hypotheti
calcineurin-binding protein calsarcin-1
gb:af12e02.s1 Soares_testis_NHT Homo sap
sema domain, immunoglobulin domain (Ig),
KIAA0798 gene product 
Homo sapiens PRO1550 mRNA, partial cds 
KIAA0712 gene product



likely ortholog of mouse Arkadia 
H2A histone family, member O
hypothetical protein MGC14797 
ESTs, Moderately similar to reduced expr 



zinc finger protein 22 (KOX 15) 



Homo sapiens mRNA full length insert cDN 
CGI-62 protein 
KIAA0633 protein 
HT021 



Hs.292453 ESTs 



C15000476*:gi|1 2737279|ref|XP_01 21 63.1 1 
Cbp/p300-interacting transactivator, wit
UDP-Gal:betaGlcNAc beta1,3-galactosyltr 
ESTs 

HSPC069 protein 

ESTs, Weakly similar to I54374 gene NF2 
ADP-ribosylation factor-like 5 
DKFZP434D146 protein 
hypothetical protein FLJ23563

gb:am20a10.s1 Soares_NFU_GBC_S1 Homos



AI87B918 Hs.10526 

NMJM2250 Hs.10032 

AF019226 Hs.8036 gliobla 

M922192 Hs.73952 ESTs 

R77123 Hs.79831 Homo 

M507576 Hs.288361 

AW612232 Hs.254835 

AA313887 Hs.188173 

NM_012391 Hs.79414 



cysteine and glycine-rich protein 2



AW161450 Hs.109201 



434614 AI249502 



AA830797 Hs.184760 

M308334 Hs.274424 

AA393273 Hs.75133 

M775483 Hs.288936 



AK001536 

AW992405 Hs.59622 

AW162840 Hs.6641 

AI272141 Hs.83484 

AI186431 Hs.296638 

AA313565 Hs.14502Q 

AW957372 Hs.46791 

AB032953 Hs.173560 

BE379758 Hs.110853

AW088363 Hs.246240 

AI871613 Hs.28538 

NM_004786 Hs.18792

M306342 Hs.69171 

AL118747 Hs.26691 

N34895 Hs.44648 

AA147026 Hs.76704 

AW973152 Hs.31050 

NM_014363 Hs.159492 

AF043244 Hs.278439 

R19175 Hs.169793 



Hs.35598 ESTs 



Homo sapiens cDNA FLJ12187 fis, clone MA
prostate epithelium-specific Ets transcr
hypothetical protein FLJ13117
CGI-86 protein 



CCAAT-bc 
N-acetylneuraminic acid phosphate syntha
transcription factor 6-like 1 (mitochond 
mitochondrial ribosomal protein L9 



432908 AI861896 



Homo sapiens cDNA FLJ10674 fis, clone NT
Homo sapiens, clone IMAGE:3507281, mRNA,
kinesin family member 5C 
SRY (sex determining region Y)-box 4 
prostate differentiation factor 
ESTs, Weakly similar to KIAA1205 protein
ESTs, Weakly similar to I38022 hypotheti
odd Oz/ten-m homolog 2 (Drosophila, mous 



Homo sapiens cDNA: FLJ21086 fis, clone C

thioredoxin-like, 32kD

protein kinase C-like 2 



spastic ataxia of Charlevoix-Saguenay (s
nucleolar protein 3 (apoptosis repressor
ribosomal protein L32



S.1B2982 golgin-67 



431103 
433577 
435017 



AW024937 Hs.29410 

M57399 Hs.44 

AW007080 Hs.284192 

AA336522 Hs.12854 
L10333 



pleiotrophin (heparin binding growth fac
ESTs 

angiotensin II, type I receptor-associat
Target Exon 

Dvl-binding protein IDAX (inhibition of
KIAA1073 protein 



449933 AW157098 Hs.324104



438157 
451945 
452721 



440494 
407748 
453049 



AW137011 Hs.49576 

BE504055 Hs.211420 

AJ269529 Hs.301871 

AW957744 Hs.278469

AF010233 Hs.80667 

BE383488 Hs.105547

NM_000380 Hs.192803

BE618768 Hs.7232 

AL079409 Hs.38176 

BE537217 Hs.30343 

AA523172 Hs.103135

BE250162 Hs.83765 

R41933 Hs.140237 

NM_001429 Hs.25272

AJ272269 Hs.121429 

W79940 Hs.21906 

BE207168 Hs.144630 

AA453208 Hs.330994 

BE463721 Hs.97101 



426793 
419879 
403752 



AA225048 Hs.104207

H66220 Hs.278177 

AI049624 Hs.283390 

AI093577 Hs.255416 

AA322599 Hs.5163 



solute carrier family 37 (glycerol-3-pho 

lacrimal proline rich protein 

RALBP1 associated Eps domain containing 



ESTs, Weakly similar to SFR4_HUMAN SPLIC 

dihydrofolate reductase 

ESTs, Weakly similar to ALU1 HUMAN ALU S 

E1A binding protein p300 

zinc-binding protein Rtcc728 

Homo sapiens clone 24670 mRNA sequence 

nuclear receptor subfamily 2, group F, m 

RAB9, member RAS oncogene family 

putative G protein-coupled receptor 

KIAA1181 protein 

HIR (histone cell cycle regulation defec 
Homer, neuronal immediate early gene, 2
NM_002753*:Homo sapiens mitogen-activate
ESTs 
ESTs 

ESTs, Weakly similar to 210926OA B cell 
hypothetical protein FLJ21986
" toAF1518401 CGI-8
428695


AI355647 


Hs.189999 


purinergic receptor (family A group 5) 


3.5 


419517 


AF052107 


Hs.90797 


Homo sapiens clone 23620 mRNA sequence 


3.5 


446494 


M463276 


Hs.288906 


WW Domain-Containing Gene 


3.5 


417515 


L24203 


Hs.82237 




3.5 


433891 


AA613792 




jl i 1 It 3AP Pr2Homo pien 


3.5 


427315 




Hs.175563 


Homo sapiens mRNA; cDNA DKFZp564N0763 (f 


3.5 


445525 


BE149866 


Hs.14831 


Homo sapiens, Similar to zinc finger pro




434672 


AW294020 


Hs.117721




3.5


453288 


AW583292 


Hs.274412 


similar to yeast Upf3, variant A 


3.5 




AW272262 


Hs.225767


ESTs 


3.5 






Hs.288390 


hypothetical protein FLJ22755


3.5 


437617 


AI026701 


Hs.5716 


KIAA0310 gene product


3.5 


445943 


AW398533 


Hs.181574 


ESTs 


3.5 


412628 


AI972402 


Hs.306051 


hypothetical protein MGC2648 


3.5 








Homo sapiens cDNA: FLJ23013 fis, clone L




410467 


AF102546 


Hs.63931 


dachshund (Drosophila) homolog 


3.5 




AW902312 




Homo sapiens clone 24923 mRNA sequence


3.5 




AW974911 




Homo sapiens cDNA: FLJ21880 fis, clone H


3.5




NM_001394 




dual specificity phosphatase 4 






M081594 




' c r iu jplua) homolog 1 


3.5 


436087 


BE300296 


Hs.5054 


CGI-133 protein 


3.5 






Hs.43410 


ESTs 


3.5 


451131 


A1267586 


Hs.268012 


fatty-acid-Coenzyme A ligase, long-chain


3.5 


439609 


AW971945 


Hs.293236 


ESTs 


3.5 






Hs.21398 


ESTs, Moderately similar to A Chain A, H 


3.5 


418610 
409960 


AW245993 
BE261944 


Hs.223394 


hypothetical protein MGC2742


3.5 
3.5 


425263 


NM_001197 


Hs.155419 


BCL2-interacting killer (apoptosis-induc


3.5 


404210 




NM_005936:Homo sapiens myeloid/lymphoid 


3.5 


442323 


AW016669 


Hs.29190 


ESTs 


3.5 


452707 


AI093823 


Hs.45070 


ESTs 


3.5 


421437 


AW821252 


Hs.104336


hypothetical protein 


3.5 




BE616633 


Hs.170195


bone morphogenetic protein 7 (osteogenic


3.4 


450628 


AW382884 


Hs.204715 


ESTs 




446825 


BE266822 


Hs.344097 


filamin A, alpha (actin-binding protein-


3.4


421867 


AA481078 


Hs.109045 


hypothetical protein FLJ10498




434954 


AF161455 


Hs.284295 


Homo sapiens HSPC337 mRNA, partial cds 


3.4 




AD001451 


Hs.30965 


neuronal Shc adaptor homolog








Hs.226117 


H1 histone family, member 0 






BE614070 


Hs.326416 


Homo sapiens mRNA; cDNA DKFZp564H1916 (f




434293 


NM_004445


Hs.3796 


EphB6 






AL135173 




sorbitol dehydrogenase 








Hs.296355 


hypothetical protein FLJ23138






M805166 


Hs.154762 


HIV-1 rev binding protein 2 


3.4 




AI222020 




Cocoa Crisp 


3.4 








sparc/osteonectin, cwcv and kazal-like d


3.4 


432302 


AA345857 


Hs.274307


KIAA1442 protein 


3.4 


407944 




Hs.239727 


desmocollin 2 


3.4 






Hs.301642 


G-protein coupled receptor 


3.4 


408291 


AB023191 


Hs.44131 


KIAA0974 protein 


3.4 




AB037759 


Hs.261587


GCN2 eIF2alpha kinase


3.4 


440789 


AB007857 


Hs.7416 


KIAA0397 gene product 


3.4 


434958 


T99949 


Hs.303428 


Homo sapiens cDNA FLJ14832 fis, clone OV


3.4 




BE379936 


Hs.28866 




3.4 


417229 


AA975096 




hypothetical protein PRO2849


3.4 


439593 


BE073597 


Hs.124863


ESTs 


3.4 


407136 


T64896 


Hs.287420 


Homo sapiens cDNA FLJ11533 fis, clone HE


3.4 


421044 


AF061871 


Hs.101302 


Human DNA sequence from clone RP1-238D15


3.4 




AA531527 




hypothetical protein MGC13010 


3.3 




AI080042 


Hs.180450


ribosomal protein S24 


3.3 




M1 95037 


Hs.169341


HTPAP protein 




412652 


AI801777 




ESTs 


3.3 




AF079529 


Hs.73106 


phosphodiesterase 8B 


3.3 


418372 


M311833 


Hs.84318 


replication protein A1 (70kD)


3.3 




AW796524 


Hs.68644 


Homo sapiens microsomal signal peptidase 


3.3 


425010 


T16837 


Hs.4241 


ESTs 


3.3 




AA262331 




Homo sapiens clone HB-2 mRNA sequence 


3.3 




AI039201 


Hs.283316 




3.3 




AI422867 


Hs.88594 


ESTs 


3.3 




NM_016569 


Hs.267182 


TBX3-iso protein 


3.3 


412926 


AI879076 


Hs.75061 


macrophage myristoylated alanine-rich C • 


3.3 


420818 


AW969635 


Hs.33032 


ESTs, Weakly similar to 1207289A reverse


3.3 




NM_006042


Hs.48384 


heparan sulfate (glucosamine) 3-O-sulfot 


3.3 


408681 


AW953853 


Hs.292833


ESTs, Weakly similar to I38022 hypotheti 


3.3 


436278 


BE396290 


Hs.5097 


synaptogyrin 2 


3.3 


424560 




Hs.150555


protein predicted by clone 23733 


3.3 


417215 


BE253181 


Hs.81687 


non-metastatic cells 3, protein expresse 


3.3 


418166 


A1754416 




Cdc42 effector protein 3 


3.3 


434629 


AA789081 


Hs.4029 


glioma-amplified sequence-41


3.2 


427398 


AW390020 


Hs.20415 


chromosome 21 open reading frame 11


3.2 


457489 


A1693815 


Hs.127179 


cryptic gene 


3.2 


434747 


M837085 






3.2 


413125 


BE244589 


Hs.75207 


ESTs


3.2 



767 



WO 03/042661 



PCT/US02/36810 



435561 
447210 
422522 
423396 
401785 
441345 
451404 
422173 



M351978 Hs.4943 

AF035269 Hs.17752 

AI023428 Hs.34549 

AI382555 Hs.127950 

AW068579 Hs.7780 

AA460775 Hs.6295 

BE385828 Hs.250619 

BE612992 Hs.27931 



AA179233 Hs.42390 

M814859 Hs.160074

AW665130 Hs.137190 

D16532 Hs.73729 

BE174629 Hs.321130 

M81740 Hs.75212 

AW612322 Hs.19131 
M393508 

AI963808 Hs.86970 

AA805666 Hs.146217 



Hs.192375 ESTs 



hepatocellular carcinoma associated prot
phosphatidylserine-specific phospholipas
ESTs, Highly similar to S94541 1clone4 

NM_002275*:Homo sapiens keratin 15(KRT1 
Homo sapiens mRNA; cDNA DKFZp564A072 (fr 
ESTs, Weakly similar to 1 17243 hypotheti
phorbolin-like protein MDS019 
hypothetical protein FLJ11607 similar to
peroxisomal biogenesis factor 6 
ESTs, Moderately similar to PC4259 fern 
Human D9 splice variant B mRNA, complete 
Mf' ) 37 Horn i ; I roxycterc I 
nasopharyngeal carcinoma susceptibility 
ESTs 
ESTs 

very low density lipoprotein receptor 
hypothetical protein MGC2771 
ornithine decarboxylase 1 
transcription factor Dp-2 (E2F dimerizat 
serologically defined colon cancer antig
ESTs, Moderately similar to ALU5_HUMAN A
Homo sapiens cDNA: FLJ23077 fis, clone L
ESTs 
ESTs 

ESTs, Moderately similar to I38022 hypot



NM_024638:Homo sapiens hypothetical prot 
N-sulfoglucosamine sulfohydrolase (sulfa 
KIAA0182 protein
protocadherin 18 
collagen, type XIII, alpha 1 
hypothetical protein FLJ22390 
ESTs 



448044 
422017 
421254 



ESTs 

putative transmembrane protein 
gb:tk13e01.x1 NCI_CGAP_Lu24 Homo sapiens 
STAT induced STAT inhibitor-2 
coat protein gamma-cop 
endoglycan 

ectonucleoside triphosphate diphosphohyd 
fatty acid synthase 

methylcrotonoyl-Coenzyme A carboxylase 2

bone morphogenetic protein 7 (osteogenic 

hypothetical protein FLJ22625

WAS protein family, member 3

ESTs, Weakly similar to A47582 B-cell gr



J02931 

AW074836 

AA318315 



fibronectin leucine rich tra



hypothetical protein FLJ20280
KIAA1700

ESTs, Weakly similar to KIAA1006 protein
myosin VI 

hypothetical protein FLJ23548 



Hs.9235 
Hs.62192 
Hs.173984 



Hs.189703 ESTs



hypothetical protein FLJ22195



BE250944 Hs.183556

NM_015865 Hs.171731

T64904 Hs.163780

NMJXE913 Hs.166563 

AA532718 



ESTs 



R10723 Hs.20573 ESTs 

AA778711 eukaryotic translation initiation factor

AA910353 Hs.75432 ESTs, Weakly similar to T23482 hypotheti

R49131 Hs.26267 ATP-dependant interferon response protei 

NM_006979 Hs.278721 HLA class II region expressed gene KE4

NM_014158 Hs.279938 HSPC067 protein



407355 AA846203 



BE336654 Hs.70937 
AK000737 Hs.237480 
AW245680 Hs.9701 
AW971213 

Hs.193974 
Hs.6111 



H3 histone family, member A 
hypothetical protein FLJ20730
growth arrest and DNA-damage-inducible,
gb:EST383301 MAGE resequences, MAGL Homo 
" to ALU1_HUMAN ALU S 



AW983709 Hs.250824 Homo sapiens cDNA FLJ23435 fis, clone H

TIA1 cytotoxic granule-associated RNA-bi
vesicle trafficking protein
hypothetical protein MGC13170



424396 
423472 
425920 
409927 
447881 



438510 
423217 
427581 



Hs.23054 

S70114 Hs.239489 

R89543 Hs.12942 

W79632 Hs.256301 

AA418775 Hs.47234 

AW245741 Hs.58461 

AI745325 Hs.271923 

AA165082 Hs.146388 

AF041260 Hs.129057 

AL049977 Hs.162209



AA838114 Hs.221612 

NM_014735 Hs.82292

NM_003512 Hs.28777
BE548555 



ESTs, Weakly similar to A35659 krueppel- 
Homo sapiens cDNA: FLJ22785 fis, clone K
microtubule-associated protein 7
breast carcinoma amplified sequence 1 
claudin 8 

gb:yo19d03.r1 Stratagene lung (937210) H 
GCN1 (general control of amino-acid synt 



NM_000094 
NM_014788
AI452973 



Hs.1640

Hs.179703 

Hs.165900 



DKFZP586PC123 protein 

collagen, type VII, alpha 1 (epidermolys 

KIAA0129 gene product

ESTs, Weakly similar to ALU1_HUMAN ALU S

ESI A to A43932 mucin 2 | 

hypothetical protein FLJ10628




transmembrane 7 superfamily member 2 



432140 
434517 
448718 
408196 
419849 



412719 
438552 
439621 
437050 
414256 

416941 
421594 
436213 



AW277121 
AI584156 
AK000492 
AI129767 
AI675881 
AA761668 



Hs.254881 ESTs 



AL034548 
BE041436 
AK000933 



AA971014 
AW016610 
AJ245820 



Hs.337251
Hs.153959
Hs.43627 
Hs.93379 
Hs.28661 
Hs.93758 
Hs.288969 
Hs.75432 



Homo sapiens, clone IMAGE:4139775, mRNA,
hypothetical protein

guanine nucleotide binding protein (G pr 

gb:nz24c08.s1 NCI_CGAP_GCB1 Homo sapiens
hypothetical protein FLJ20397 
hypothetical protein MGC2487 
hypothetical protein MGC15436 
SRY (sex determining region Y)-box 22 
eukaryotic translation initiation factor 
Homo sapiens cDNA FLJ10071 fis, clone HE
H4 histone family, member H 
HSCARG protein 

IMP (inosine monophosphate) dehydrogenas
ESTs 



AW410035 HS.75B52 

AK000785 Hs.199480

BE000150 Hs.48778

R45689 Hs.21889

AA325512 Hs.71472 



441266 H15968 
426761 AI015709 
413746 AA133243 



ESTs, Highly similar to B34087 hypotheti 
ESTs 

MAD (mothers against decapentaplegic, Dr
Homo sapiens, Similar to epsin 3, clone 
niban protein 

Homo sapiens cDNA FLJ12978 fis, clone NT
hypothetical protein FLJ10774; KIAA1709
ESTs, Weakly similar to I38022 hypotheti



421717 
453912 
435703 
431108 
451040 
443837 

450447 
409330 
413762 
433586 
424720 
422027 
422717 



AL121031 
AW630133 
AA99150B 



Hs.131765 
Hs.264190 
Hs.107187 

Hs.83313 
Hs.105317 
Hs.40808 

Hs.26549 
Hs.25010 
Hs.53940 
Hs.848 



Hs.306319 
Hs.119475
Hs.74624 
Hs.235516 



ESTs 
Target Exon 

intermediate filament protein syncoilin 
ESTs, Moderately similar to I38937 DNA/R 
vacuolar protein sorting 35 (yeast homol
divalent cation tolerant protein CUTA 
SWI/SNF related, matrix associated, acti 
GK003 protein 
ESTs 
ESTs 

spindle pole body protein 
KIAA1708 protein 
hypothetical protein P15-2 
hypothetical protein FLJ10369
FK506-binding protein 4 (59kD)
gb:yd78d06.s1 Soares fetal liver spleen
SWI/SNF related, matrix associated, acti 
fatty acid amide hydrolase 
cold inducible RNA-binding protein 






407255 AA012992 
431472 AK001023 
423318 AW467064 




Hs.266619 ESTs 



431609 
447896 
431525 



418663 
442013 
449051 



Hs.264330 
Hs.294069 
Hs.6185 



M506476 
AW961400 
AW292415 



Hs.20509 
Hs.172280 



peptidylprolyl isomerase B (cyclophilin
ESTs

Homo sapiens cDNA FLJ13384 fis, clone PL
KIAA1557 protein

Homo sapiens cDNA: FLJ23270 fis, clone C
Homo sapiens mRNA; cDNA DKFZp566L203 (fr 
ESTs 

desmocollin 3

Human DNA sequence from clone RP" "-353C1 
HER2 receptor tyrosine kinase (c-erb-b2, 
HBVpX associated protein-B
KIAA0779 protein

SWI/SNF related, matrix associated, acti
ribosomal protein L26



419452 
425491 
414556 

435602 
441128 
440340 



AW973744 
AI859390 
AW732837 



Hs.293100 
Hs.2a8940 
Hs.42390 
Hs.90572 
Hs.255221 



AI557486 
AI041403 
AL050136 



T07353 

AI821926 

AA045650 

R87622 

H16423 

BE409813 

AW083384 



Hs.140945
Hs.30861

Hs.110637

HsJ948 

Hs.53125 

Hs.82585 
Hs.15?337 
Hs.11067 
400263 

435750 AB029012 Hs.4990 



421709 AA159394 

421077 AK000061 

416359 AL042210

442104 L20971 

411605 AW006831 



p10-binding protein 
uncharacterized bore 
ESTs, Weakly similar to T23273 hypotheti



ribosomal protein L13a
ribosomal protein L29

Homo sapiens mRNA; cDNA DKFZp585L141 [fr 
ESTs 

KIAA0036 gene product 
homeobox A10

Prader-Willi/Angelman syndrome-5 
ESTs 

gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 
small nuclear ribonucleoprotein D2 polyp
KIAA1831 protein 

CD47 antigen (Rh-related antigen, integr 
protein arginine N-methyltransferase 3 (h
E I H jh'y in r T46395 h pi 



Hs.255883 ESTs.W 
Hs.107056 
Hs.101590 
Hs.16493 
Hs.188 



sin DKFZp762N2316; KIAA 



Hs.5460 
Hs.173980
Hs.183006
Hs.180107



matrix protein NMP200 related to 
apiens cDNA FLJ123C0 fis, clone MA 
rase (DNAdlrected), beta 
lal protein L29 



R77776 

BE218049 

BE613416 

AA318224 

X55079 

AL050201 



ts.336425 Homo sapiens, clone MGC:17296, mRNA, cc 



AA547959 

T53519 

AA421973 






427087 
412524 
435176 



AA354489 Hs.2221(K 

NM_000546 Hs.1846

AW978811 Hs.31445' 

BE073913 Hs.17351! 



441094 U33319 Hs.7647 

451356 M748418 Hs.164577 

408461 AB037756 Hs.45207 

418516 NM_006218 Hs.85701

423476 AL035633 

444034 AL161957 Hs.10177 

407378 AA299264 Hs.57776 



ATP-binding cassette, sub-family A (ABC1 
ESTs 

ESTs, Moderately similar to S65657 alpha

Homo sapiens mRNA; cDNA DKFZp434N2412 (f

EBP50-PDZ interactor of 64 kD

tumor protein p53 (Li-Fraumeni syndrome)

ESI V i U nvlai oALU1 HUMAN ALU 

uncharacterized hypothalamus protein HT0 

hypothetical protein FLJ23153

ESTs 

hypothetical protein MGC11138 
hypothetical protein 

MYC-associated zinc finger protein (puri

hypothetical protein KIAA1335
phosphoinositide-3-kinase, catalytic, al
Human DNA sequence from clone RP5-1046G1



430512 
433301 
444381 



AA913591 
AI815395 
AF182294 



410935 
407887 
430341 



AI458179 Hs.41528 
BE515037 Hs.177556 
NM_001183 Hs.6551 
M24470 Hs.1435 
BE440108 Hs.106127 
Hs.342849 
Hs.66881 
Hs.41072 



Hs.184641 fatty acid desaturase 2
Hs.241578 " 
Hs.152016 
Hs.283713 



440191 
453187 
412133 
426841 



NM_006348 

AW198103 Hs.18724 

AB011128 Hs.30512 

AA297567 Hs.43728 

BE408359 Hs.43621 

C75094 Hs.334514 

AW009892 Hs.31924 

AW339037 Hs.24908 

AW952619 Hs.17235 

AA120865 Hs.23136 

AI337316 Hs.147998
NM_000786

AD001528 Hs.89718 

NM_004922 Hs.81964

AW295840 Hs.14555

AL110245 Hs.47367

AW953437 Hs.5486 
AI963821 

NM_000202 Hs.172458

BE539108 Hs.22051 

W88920 Hs.29341 

AL137445 Hs.28846

AL119305 Hs.26409

AI37B424 Hs.288761 

BE387162 Hs.280858 

NM_006823 Hs.75209

AK001015 Hs.55220 

BE250716 Hs.87614 

AI283476 Hs.263478 
AI990417 

AH 61 383 Hs.34549 

U83460 Hs.73614 

AI052358 Hs.193725 
AA830893 Hs.119759
AW410646 Hs.164649 



435025 T08990 

416751 T48130 

453946 AA009716 

424762 AL119442 

449458 AI805078 

432409 AA806538 

425184 BE278288 



Hs.42311 
Hs.183684 
Hs.208261 
Hs.130732 
Hs.155048 



, :in LSmS 

Homo sapiens cDNA: FLJ22140 fis, clone H
ESTs, Weakly similar to S64054 hypotheti 
ESTs, Weakly similar to neuronal thread 
melanoma antigen, family D, 1 
ATPase, H transporting, lysosomal (vacuo 

RNA polymerase I 16 kDa subunit
ADP-ribosylation factor-like 5 
dynein, cytoplasmic, intermediate polype 
serine (or cysteine) proteinase inhibito 
golgi transport complex 1 (90 kDa subuni 
ESTs, Weakly similar to granule cell mar 
Homo sapiens mRNA for KIAA0555 protein, 
hypothetical protein 

Homo sapiens, Similar to hypothetical pr 

NG22 protein 

ESTs 

ESTs 

Homo sapiens clone TCCC1A00176 mRNA sequ

ESTs 

ESTs 

cytochrome P450, 51 (lanosterol 14-alpha 
spermine synthase 

SEC24 (S. cerevisiae) related gene famil
Homo sapiens cDNA: FLJ21513 fis, clone C
KIAA1785 protein
clone FLB5214 

iduronate 2-sulfatase (Hunter syndrome) 

hypothetical protein MGC15548

hypothetical protein FLJ22376

Homo sapiens mRNA; cDNA DKFZp5660134 (fr 

ESTs 

hypothetical protein FLJ21749

ESTs, Highly similar to A35661 DNA excis

protein kinase (cAMP-dependent, catalyti

BCL2-associated athanogene 2



ESTs 

hypothetical protein DKFZp434H247 
hypothetical protein FLJ21918
regulator of G-protein signalling 10
anchor attachment protein 1 (Gaa1p, yeas
Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
ESTs 

eukaryotic translation initiation factor 
ESTs 

KIAA1575 pro



od group (Auberger b antigen 



AF106697 
AW294316 
AI174783 






436972 AA284679 



Hs.25540 
Hs.29705 
NM_006855 Hs.250696 
AI792670 Hs.144405 



435522 N64214 

406764 AA429825 

422032 AA476966 
409125 



446091 
419848 
415015 



Hs.8752 
Hs.9774 
Hs.343443 
Hs.110857
Hs.343567 
X98834 Hs.79971 
AW022192 

AA251242 Hs.103238 
NM_002109 Hs.77798
Hs.27141' 



426178 
414528 
425494 
407656 
437330 
449139 
448153 
4173B6 



AA524394 

R79445 
M371409 
M148950 
AL119528
AW747986 
AL353944 
BE268315 
Y10805 
AL037228 
Z15008 



Hs.294022 
Hs.301872 
Hs.76230 



Hs.170098
Hs.37443 
Hs.50115 
Hs.23111 
Hs.20521 
Hs.82043 
Hs.54451 
Hs.170345 
407627 AI419020 Hs.62620 



KDEL (Lys-Asp-Glu-Leu) endoplasr



polymerase (RNA) III (DNA directed) poly 

axonal transport of synaptic vesicles 

sal (Drosophila)-like 2 

ESTs 

ESTs 

histidyl-tRNA synthetase 
beta-site APP-cleaving enzyme 2
hypothetical protein FLJ149S0 
hypothetical protein MGC4840 
ribosomal protein S10 
hypothetical protein FLJ22294 
ESTs 

KIAA0372 gene product

Homo sapiens mRNA; cDNA DKFZp434B2119 (f
Homo sapiens mRNA; cDNA DKFZp761J1112 (f
phenylalanine-tRNA synthetase-like 



410006 



AW956345 Hs.12926 

AL080243 Hs.180920

NM_016831 Hs.12592 

AW732308 HS.S7783 

452906 BE207039 Hs.306123 

450341 N90956 Hs.17230 

419430 AI878S42 Hs.80336 

440099 AL080058 Hs.6909 

452518 AA280722 Hs.24758 
431843 M516420 

424795 AW102850 Hs.153177 

448019 AW947164 Hs.196641 



429332 AF030403 

438572 BE267017 

425692 D90041 

431188 W05656 

408803 AF248953 



Hs.4863 
Hs.193172
Hs.204041 

Hs!850 
Hs.26954 
Hs.137507
Hs.199263
Hs.6315
Hs.155956
Hs.159755 
Hs.24049 
Hs.76252 
HS.154S79 
Hs.97031 
Hs.200350 



421026 
407687 
433683 



AI659076 
AI004377 
AK001171 

NM_005781 Hs.153937 

H25835 Hs.301527 

AB014537 Hs.13604 
H78517 



laminin, gamma 2 (nicein (100kD), kalini
ESTs



collagen, type IV, alpha 5 (Alport syndr 
ESTs, Weakly similar to A47582 B-cell gr 
ESTs, Weakly similar to I38022 hypotheti
ESTs 

ribosomal protein S9 
period (Drosophila) homolog 3 
eukaryotic translation initiation factor
MAGEF1 protein
hypothetical protein FLJ22087
ATPase, H transporting, lysosomal (vacuo
DKFZP564G202 protein 
ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to I38022 hypotheti
ribosomal protein S28 
ESTs, Moderately similar 
ESTs 
Eos Control 

hypothetical protein DKFZp566A1524 
ESTs, Weakly similar to I38022 hypotheti
chromosome 14 open reading frame 3
CDP-diacylglycerol synthase (phosphatida
IMP (inosine monophosphate) dehydrogenas 
Homo sapiens mRNA; cDNA DKFZp762G123 (fr 
vang (van gogh, Drosophila)-like 2 
Ste-20 related kinase 
acetylserotonin O-methyltransferase-like 



1 1(at, 

golgi autoantigen, golgin subfamily a, 2 



.33905 ESTs 



hypothetical protein MGC13047 
Homo sapiens cDNA FLJ13027 fis, clone NT
hypothetical protein MGC4549 
activated p21cdc42Hs kinase 
ESTs, Moderately similar to unknown [H.s 
KIAA0637 gene product 



*I5 (general control of 
hypothetical protein FLJ11149
hypothetical protein FLJ21832



Unique Eos probeset identifier number



CAT Number Accession 



BC013389 BC017398 AI023543 AA191424 AI267700 A1469633 AW958465 AW953397 AA172056 BE940298 BF909208 BF909980 BF095153 
BG285837 AI720344 BF541715 AA355086 AA172236 
AW341473 AA448195 AW207206 AI951341 AA969259 






433404 
453160



420218 
437124



191547J 
59408J 






407819 
409151 



422890 61425.1 



432363
417379 
414922 




447520 
407192 
434194 



BI918158 AW779760 N48674 AI375997 AA235370 BG69S146 AI913531 AI498402 AI016320 AA323193 R49021 D59344 BG986750 N45526 
BG985917 T61382 R49391 R45432 AI203107 R35004 F07491 R25094 R35360 

BC021735 AI669212 AL120184 AI769949 BE701002 BE184363 BE819031 BG702238 BF090049 BF953318 BF961912 BF943013 M934514 
AA151245 BF960659 AA987907 Z41449 BF908059 BF908053 BF908049 BE699424 BF908060 BF962832 BF952020 BF953134 BI035538 
BF908052 BF908057 BF090026 BF943158 AI532924 BF512340 BF952021 BF960776 BF943437 BF942847 AI768015 F09778 F04816 F02721 
M102645 AI63383S M817929 BF947001 BI035448 BE935876 AW890837 AW898604 BF957405 BF963433 BG704815 
BC022980 BF224081 BG149908 AW672842 BE670687 AI702161 AW341832 BE222503 N71836 AI026061 AW953116 AW083132 AI9792S1 
AV725377 AI423298 AI640707 AW675518 AI032611 AI818044 AI299508 AI91 1386 AI270418 BE219257 BM141826 AA825491 Z25159 AA587421 
N59447 Z39436 T32982 R541 10 BF1 15783 F09044 BF808433 

BC009612 NM_003526 BI597615 AV761592 AV760377 AL501008 BI604131 BE645918 BG187760 BG181525 BG210634 BG192999 AI263307 
AA344186 AW952966 AA033609 AA037552 AA722183 R79452 H70775 BF674991 BE769437 BG007356 AA037483 AW572535 AI143991 
M034581 AA033610AV742510AV735788 R08336 

AW958037 R42557 AI337047 AA948360 AI638005 AA459950 AI624915 AI638047 AI467856 AI521826 AA860305 AI932315 AW003092 AW271756 
AW779380 AA609879 AI634791 AI493770 AJ565211 Z41 1 45 AI627952 AA303734 BE349457 AW196765 AA256527 BE089727 
AL050013 BG939500 AW969191 AA769925 AI377973 AI625545 M81 1365 AA521 1 14 N24705 AI379579 AA424899 AI684671 M829715 AI453010 
N35401 AA677452 M504340 AI209149 M883574 AI379062 AI084455 AI280147 M644327 BF432508 N27873 N47364 N34880 Al 147024 T86860 
AI219716 AA960926 H25544 BI857123 AW960489 M599099 AA765246 AA328537 BG434703 M890373 AA424765 AI292318 AA829886 N95742 
AI218758 H25588 N35282 AA024987 N36687 BI919187 N4S471 M889970 AW166152 AA468546 AI262504 AI452782 AA554458 M807080 
AA724542 AW1 02730 M909978 AW1 18134 AW827241 W56431 BM1 27381 BF436987 AI016509 AW663972 BM1 27686 CI 5552 N63435 N51744 
T98800 N56980 BG108636 N49381 R49886 D61278 BI756612M508234 R49885 BF850422 
AF075009 R63109 R63068 

AK056315AI015524AA724079 BI713619AI377728AW293682AI928140AI092404AI085630AA731340BM469629AW968804 M425658 
M769094 BF446026 AW118719 AI332765 AW500888 AW576556 AI859571 AW499664 AW614573 AW629495 AW505314 W74704 AI356361 
AI923640AW070509AI521500AL042095AA609309M761319AI381489 H45700 M761333AW265424 M909524AA635311 AA649040 
AI392620 Z4070S AI985564 AW26351 3 AA913892 AI693486 AW263502 AI8061 64 AW291 1 37 BI061 872 BI059498 AA134476 AW084888 AA036967 
AW370823 T55263 BI002756 M489664 BF827261 W74741 BF963166 

AK055952 BG182168 BG220105 BG191569 BG188964BG187388 BG220104 BG183714 BE645998 AI819354AW974068 AI393635 AI580846 
AI024796 AW020098 BI491127 AI393644 N74993 AW472959 BM478854 BI597437 H12165 BI458612 BE543192 
BC022881 AU150944 BG750783 AW754175 AW857737 AI911559 AI050036 AA554053 AI826259 AA568548 

AB033091 AL520743BE811813N53332 N99716AL561910AA280655BE710392AW05100AW293978AW444556AA281459AI679751 AI873695 
BG700891 BI553S17 R8QS18 BG779771 BG534451 AA479402AW961580 BF061430 AIB57643 AI768486 AW512118 M479302 AW770384 
AW072470 AI041596 AI049699 AW592865 M975261 BE879747 BG1 14119 AW183811 M909938 BF571621 BF350794 BF351375 BE925699 
AL050294 BC010371 BF982270 AL042656 BF095732 AW812618 BF095731 BG212397 BF678765 BI038602 BG388664 AW675337 BG289398 
BE939598 AW805570 AA527097 AW1 50540 AI693720 M743364 AI915793 N481 85 AI573107 AA043474 AI35161 5 AI969490 AI910763 R50866 
AI699181 N73808 H08164 

AK000028 AA494483 AI298674 AA720773 AV761529 AI884670 AI936202 AW294235 D61 652 BF881 1 84 AV71 1384 N 271 54 AI926970 AV734970 
N40094 N28596 AA884747 M512890 BG436593 

NM_033445BC001193AI885781 BF794032 M475620AA810906M810905AI291244AI885097AI359708AI335629 H97395AI344589AA300377 
AA457566AW771833BE465621 AI364 0 68 AI364452 AI648505 AI918342 AI928670 AA886580 AL531029 AA885344 AI186419 BG329096 
BM045465 AL531028 BG437151 BE868021 M179427 

AK056626 AI800896 BF939022 BE644718 AI954754 BE218177 BE348567 AI962405 AW293122 AI968798 AI457321 BE327228 BG913531 
AW939055 T30280 RM1BS Z43366 BG819153 BF003119 BE646274 BF940881 R18246 R42185 

NMJHM892 AF047442 BE275338 BF724863 BI917206 BE276993 AL602308 AA306105 BM152505 BC001364 AW993471 AW993481 AW993283 
AW992919 AW992921 AW992980 AW992861 AW993220 BG573124 AA456385 AA234796 AI902726 AA354813 BI092644 BG778400 BI260001 
BG007325 AI267455 AA426574 AI160782 AI4721 86 AA255500 AA43400S BG435520 AI3561 1 1 H00525 AV749060 BG944497 BG292031 
AA902153 C04925 AA902160 M383100 AW073533 AA256706 AA150809 R55766 AW958448 BE090972 BF693195 AV738979 R65855 R80136 
AA484677 

AK057805 AW162343AI190479 AI093318 BE048820 Ai 198397 AA654667 BE219303Z39851 F02555Z28734T16575 F10145Z45266 AW572911 
AW964436 AW004030 AI632565 BE502530 B1792383 BF056928 M449241 AI651825 M805324 AI264853 AW1 9691 8 M948257 AI953735 
AI263703 BF055387AW594171 AI867447 AA3191S9 AI903440 AW956110 AI366013 A1867923 BG91 1906 D81142 C1561B AL538697 Z25032 
Z43784R13382AV746924AA44S369M318815BF36«65H17038H10054F04161T87230 R40898AW204071 BI819428 M683393AA683376 
Z43192T74078 T05103 F12527T77951 H10118H17037 BF855407R19603 

AB032997 AI141578 AW978722 BE4671 19 AI761408 BF727385 AW237035 AI934521 BF436248 AI479658 Z4D532 AA832081 AW295901 
BF057835 BE465977 AI621269 BE465983 BF756369 N74056 AI817896 AA71 5567 AA934774 H62600 H09497 BF943762 BE395335 BE883333 



AA196390 AA507837 AA196458 

BG1 07484 AA632009 AI432670 AI656660 AI650884 AI521919 AI264553 AW1 50793 AW61 1894 A1917098 BI091 245 AI651454 BF434889 AI580285 
AI88073S BE301995 AI392959 AW61396S BM023S28 AW51S374 A1460102 BM023318 BE3281B8 AI952820 A1581363 AA557165 A1695677 
AL562079AI700926 AI470561 BF083058AW196387AU132984BI064046AI970157 R02122 H55924 AI521721 AA808206 AA725223AI766003 



AK027321 W63676 W63789 BE046412 BF1 14614 BE646183 BU126230 AW044233 AI951970 AW663548 AI139947 AA514302 AA845232 
BM126251 AA789002AA581966AA809643AW188870BE706664BE70S539BE153177BF084925AL133779AW961788AA659693M347970 
AA295134 AA525037 AA449282 BG190454 H61476 T91396 N20018 T901 14 H75644 BE710736 BF687723 H28581 AA249370 BF726698 
BE341554 B1045099 T84625 AW129678 BG770826 
AW973834 AI791 932 AI791855 AI732640 AA558833 AA559897 AI821 6 1 0 

BC013939 BI494690 BI49121 1 AI928393 AA843540BG938644AU1 85628 BI495842AW173255 BM052709 AI743999AI690144 AI922209 AI740907 
AW340368 AA928759 AW1 18737 BF513970 M707807 BF435295 AI339463 AI373842 AI433809 BE222392 AA602308 AA428261 AI450355 
AW662760AI888087AI342098 M722418 , A78151N64382BE221848AW025901AA452120AI150479AI016166M779515AA661791 BU474307

Table 69A lists about 200 genes that are upregulated in human umbilical vein endothelial cells (HUVEC) compared to normal body tissues. These genes were selected from 59680
probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized
value reflecting the relative level of mRNA expression.

Table 70A lists about 148 genes that are downregulated, based on the mean of vessels (veins and arteries) compared to the mean of HUVEC. These genes were selected from 59680
probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized
value reflecting the relative level of mRNA expression.

TABLE 69A: 

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of the mean of HUVEC AIs to the mean of the normal body tissue AIs
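
The R1 column defined above is a ratio of two means of average-intensity (AI) values. A minimal sketch of that calculation follows; the function name and the sample AI values are hypothetical and are not taken from the tables.

```python
from statistics import mean


def ratio_of_means(sample_ai, reference_ai):
    """Return mean(sample_ai) / mean(reference_ai) for one probeset.

    sample_ai    -- AI values for the tissue of interest (e.g. HUVEC samples)
    reference_ai -- AI values for the comparison tissues (e.g. normal body tissues)
    Both lists are illustrative inputs; the source does not give raw AI values.
    """
    reference_mean = mean(reference_ai)
    if reference_mean == 0:
        raise ValueError("reference mean is zero; ratio is undefined")
    return mean(sample_ai) / reference_mean


# Hypothetical AI values for a single probeset.
huvec_ai = [300, 500]
normal_tissue_ai = [100, 100]
r1 = ratio_of_means(huvec_ai, normal_tissue_ai)
```

A probeset with R1 above 1 in this sketch would be more highly expressed in HUVEC than in the reference tissues; the cutoff actually used to build Table 69A is not stated in this section.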



418994 
413795 
441689 
412567 
424432 



UnigeneID
Hs.105689 
Hs.83337 
Hs.283713 



Unigene Title 
MSTP031 protein

latent transforming growth factor beta b
ESTs, Weakly similar to S64054 hypotheti
hypothetical protein DKFZp434G171
collagen, type V, alpha 2
cadherin 13, H-cadherin (heart)
POLYMERASE I AND TRANSCRIPT RELEASE
protease, serine, 23
NM_014624:Homo sapiens S100 calcium-bind

caveolin 1, caveolae protein, 22kD
NM_016931:Homo sapiens NADPH oxidase 4 (
small inducible cytokine A2 (monocyte ch
sushi-repeat-containing protein, X chrom



Hs.44155 

Hs.89546 selectin E (endothelial adhesion molecul 

Hs.142003 ESTs 

Hs.74034 Homo sapiens clone 24651 mRNA sequence 




BE566411 
AW956889 
AA301867 
AW391423 



AA155598 

AI570189 

AI076459 

AW072215 

AA861271 

W31178 

X97748 



Hs.154210 

Hs.76224 EGF-containing fibulin-like extracellula
Hs.288555 Homo sapiens cDNA: FLJ22425 fis, clone H



Hs.25338 ESTs 

Hs.200360 Homo sapiens cDNA FLJ13027 fis, clone NT

Hs.94030 Homo sapiens mRNA; cDNA DKFZp53SE1 624 (f 

Hs.212839 hypothetical protein FLJ14195; KIAA1714

Hs.25132 KIAA0470 gene product 

Hs.15978 KIAA1272 protein 

Hs.208470 ESTs 

Hs.222024 transcription factor BMAL2 

Hs.154140 ovary-specific acidic protein 

gb:H.sapiens PTX3 gene promoter region.



WO 03/042661 



PCT/US02/36810 



434846 AW295389 



441020 
401234 
420174 
445684 






Hs.119768 ESTs
Hs.103483 KIAA1798 protein

metallothionein 1E (functional)
Hs.118502 ESTs

Hs.143954 hypothetical protein FLJ10914
Hs.35962 ESTs



mitogen-activated protein kinase 8 inter
Hs.199749 ESTs
Hs.13109 Ran binding protein 11
Hs.87409 thrombospondin 1

ESTs, Weakly similar to ubiquitous TPR m
A kinase (PRKA) anchor protein (gravin)
transmembrane protein (63kD), endoplasmi
434523 AA703709 Hs.23410 translocase of inner mitochondrial membr
Hs.74122 caspase 4, apoptosis-related cysteine pr



AI824144 
AK001696 
AI750878 
AU077012 
NM_005100 
412646 NM_006825



445119 
438142 
437473 



417640 
407252 
418825 
417805 



Hs.18 



AA156022 
AA004368 
AF035121 



AL390172 Hs.317432 



BE047032 

AF151073 

AL037544 

X06256 

D30857 



415714 NM_002290 



412351 AL135960 



T59538 
AL135960
AI821409 
D28235 
NM_002837 
M782114 
M319798 



Hs.257789 

Hs.8645 

Hs.184298 

Hs.149609 

Hs.82353 

Hs.163780 

Hs.22394 

Hs.82587 

Hs.172670

Hs.78672 

Hs.16714 

Hs.283072 

Hs.72912 

Hs.278585 

Hs.73828 
Hs.304471 
Hs.196384 



ESTs 

hypothetical protein 

cyclin-dependent kinase 7 (homolog of Xe 
integrin, alpha 5 (fibronectin receptor, 
protein C receptor, endothelial (EPCR) 
(AB075832) KIAA1952 protein [Homo sapien 
FLJ10893



activin A receptor type II-like 1
laminin, alpha 4

Rho guanine exchange factor (GEF) 15
actin related protein 2/3 complex, subun
cytochrome P450, subfamily I (aromatic c
NM_024756:Homo sapiens hypothetical pro
gb:yb65g12.s1 Stratagene ovary (937217)
T-cell acute lymphocytic leukemia 1



se2(p 



M452970 

M334384 

AL036923 
AL157424
NM_005795 
AW236021 



protein tyrosine pho: 
ESTs 

eukaryotic translation elongation factor 
Hs.16450 ESTs

Target Exon 

Hs.149609 integrin, alpha 5 (fibronectin receptor, 
Hs.97199 complement component C1q receptor 
Hs.171844 NM_006505:Homo sapiens poliovirus recept
Hs.239527 E1B-55kDa-associated protein 5
Hs.22968 intron of VEGFR
Hs.149420 ESTs 
Hs.22245 ESTs 
Hs.322710 ESTs 



410057 
446727 
412564 
414786 



430129 
429276 
407704 
448694 
434398 



AB011095 

X83703 

AI246482 

AI554545 

T54904 



Hs.152175

Hs.78531 Homo sapiens, Similar to RIKEN cDNA 5730

Hs.337986 Homo sapiens cDNA FLJ10934 fis, clone OV

Hs.268107 multimerin

Hs.16032 KIAA0523 protein



Bone morphogenetic protein 6 
Hs.118893 Melanoma associated gene 
Hs.30822 hypothetical protein FLJ11110
Hs.181097 tumor necrosis factor (ligand) superfami



BE301708 

AF056085 Hs.198612 

BE315072 Hs.78768 

AA478756 Hs.194477 

AA121098 Hs.3838 



ESTs 

bone morphogenetic protein 2 
MAD (mothers against decapentaplegic, Dr
Homo sapiens cDNA FLJ14445 fis, clone HE
ngenital contractural ara
ns cDNA: FLJ22743 fis, clone H



R19015 
AI369275 
U03272 



428206 AB020643 

412755 BE144306 

456977 AK000252 

456977 AK000252 

427897 NM_017413 Hs.303084 apelin; peptide ligand for APJ receptor 






444409 
421340 
445701 
444009 
409220 
416940 
418922 
431548 



AW152225 
AI792140 
F07783 



417933 X02308 



414430 
412140 
442104 



AA256444 
X02308 
M443966 
AI346201 
AA219691 



ESTs, Weakly similar to I38022 hypotheti 



Hs.135104 
ir 131 S7 



AI834273 

434927 H46612 

434558 AW264102 

438183 BE263252 

428438 NM_001955

453365 M035211 

41805B AW161552 

439410 M632012 



AA084248 Hs.85339 
AW247252 Hs.75514 



Hs.78824 
Hs.156110 
Hs.109706
Hs.78146 



tumor necrosis factor rec



ESTs 
ESTs 



Homo sapiens HSPC285 mRNA, partial cds
ESTs

hypothetical protein MGC3178
endothelin 1 

ng region Y)-box 



guanine nucleotide binding pn 



Hs.126485 
Hs.82962 
Hs.31595 
Hs.76118 
Hs.73625 
Hs.188 
Hs.156346 



hypothetical protein FLJ12604; KIAA1692

thymidylate synthetase 

ESTs 

ubiquitin carboxyl-terminal esterase L1 
RAB6 interacting, kinesin-like (rabkines 
phosphodiesterase 4B, cAMP-specific (dun 
topoisomerase (DNA) II alpha (170kD)
G protein-coupled receptor 39
nucleoside phosphorylase
mitogen-activated protein kinase 8 inter
karyopherin alpha 2 (RAG cohort 1, impor 
tyrosine kinase with immunoglobulin and 
ESTs, Weakly similar to unknown protein 



418054 NM_002318 Hs.83354 lysyl oxidase-like 2






419172 
447560 



AA486620 
M1 50797 
BE247684 
AW338625 
AW975183 
AF065214 
AA127221 
NM_002291
AW014795 
AL134430 
AW248322 
AF217513 



449722 BE280074 



AB018259 
BE280074 
AF234532 
BE242587 
M2130G 



418738 
418185 
417308 
407975 



M68874 



U84722 

NM_007351

NM_005402 

NM_006823 

F06972 

C18356 



Hs.27372 
Hs.295944 
Hs.72116 
Hs.49265 



Hs.109276 
Hs.103070 
Hs.22120 



Hs.6906 

Hs.95835 

Hs.279905 

Hs.8881 

HSJ567 

Hs.118140

Hs.23960 

Hs.61638 

Hs.118651

Hs.202509 

Hs.49500 

Hs.211587 

Hs.6682 

Hs.347326 



ESTs, similar to TRANSMEMBRANE 4 SUPERF
ESTs, Weakly similar to S72482 hypotheti
IVC (cytosolic, 



Homo sapiens cDNA: FLJ23197 fis, clone R
ESTs, Weakly similar to unnamed protein
clone HQ0310PRO0310p1
ESTs, Weakly similar to S41014 chromosom

KIAA0716 gene product

hematopoietically expressed homeobox 
gb:Human alpha satellite and satellite 3 
ESTs 

KIAA0746 protein 

phospholipase A2, group IVA (cytosolic, 
"'■■'n earner family 7, (cationic amino 



KIAA0101 ge 
endothelial cell-specific molecule 1 
cadherin 5, type 2, VE-cadherin (vascula 
NM_002425:Homo sapiens matrix metallopr

v-ral simian leukemia viral oncogene hom
protein kinase (cAMP-dependent, catalyt
— '-"--"il tyrosine kinase (Etk) (BMX), 



FU20992 similar to 



Hs.83169 matrix metalloproteinase 1 (interstitial 



CAT number: Gene cluster number 



CAT Number Accession 






453085 10017 1 BC017336 BG716430BG501286 BI458528 AI582223 N98532 AI338138 AI273442 AW102617 AA831177 AA745642 AA412583 AA355375 

BG547492 AW954243 BI766546 BG057641 AI192435 AI338935 AI31 2651 A1708679 AI191125 AI206832 AA676899 AI078010 A1888718 AA452830 
BF445542 N69930 AA71 5017 BF446713 BE046352 AW771 909 AA907729 AM 43749 AI761290 AA890233 BF925759 AI78371 3 AI767267 AA81 4538 
W56778 AA91 8481 BG743526 BE645242 AI025328 Al 298436 AI290445 H2771 0 AI475034 BG740023 AI090348 AI340003 BI602481 W38495 
AI183314 AI927418 BG397181 AA878310 W19369W56507 C05751 AW380760 AW380770 AW380790 BF930729 H28425 AA037326 AA375805 
436729 6624.1 X75684 AL5731 67 AI445461 AI453743 AI983655 AI584644 AA9771 80 AI6941 1 1 AI591358 AW071 625 AI678712 AI720939 AI927769 BE439796 

AI963432 AA292956 AW192593 AI865838 AI696905 AI424384 AI161 31 2 AI91 1 921 AI597801 BI494959 AI240988 AI492554 AW262737 BE044033 
AW008570 AW629505 BI494958 AA088439 AA706057 BF222820 BF593608 BE501957 AA524526 BE044134 AW572531 AW01 5724 BE349186 
AA043217 BE219784AI799814AA129575AI671727AI470033BE546195AW779725AA903050AA147228AA404570AI075878W38161 AI972739 
AW673152M723200 C06123BF057147AA627686AA157944AI990245AA662517T32487AI800106AI333170AI859160W45410AI990827 
AW275048 AA1 82640 AA478328 AI298935 AW085168 AW471421 AW103470AW300456AW191997 AI823466 AA962397 AA136658 AI251817 
AW339104AA724739AA411100M191349AA757735AA037696AI769516AW772283AA010631 AI692846 AI061065 H80983 R79933 A1950693 
AI245632 AI349390 AA148284 AI798502 AA487893 AI621 320 AW194272 C06365 AA953883 BE858936 AI918523 AI872628 AI927217 AI453453 
AM 89366 AW338678 AI261 359 AI500576 BF477735 AI032569 AI972899 AI985583 Z28771 AI363829 AI693030 AA603586 BE773488 AW339301 
BE773489 BE773462BE773495 AI6S0338 BE773499 AI745717 BE811475 BE811 470 BE81 1464 BE811418 BE81 1415 BE811400 BE811398 
BE81 1 388 BE81 1 352 BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773461 BE811350 BE811337 BF593847 BG055071 
AW675302 BF003068 AA71 91 73 BE811348 AIE824S2 AI586240 BE773500 AI244845 AI565439 AI918453 AI472527 AI446740 AA035576 AA191414 
AW674145 C05782 AI589264 D57558 AI468237 AI432033 AA989662 R21752 BF002457 AA988297 AL574095 AL576200 AL571074 AL574525 
AL578810 BG498381 AI928364 BE879732 M479834 AA479712 C17732 BM091258 BF843901 AW820230 C17476 BE327120 AA129574 
AA136645 BF843900 AW8061 93 AA502832 AAB49494 AL568520 AL547950 BE706937 BE81 1 360 BE773498 BE81 1 401 BE773484 BE81 1 437 
BE811380 BE81 1399 BF997171 BF757734 BE926037 AI377596 C061 1 1 AW0889BB BE81 1404 BE81 1472 AI865912 AI925607 AI871950 AI093510 
BE905927 BE81 1435 M191 387 AWT72000 BE81 1453 BE614379 BF844522 BI044896 AI744233 AW984527 C17504 BF843883 AI248307 
BE773483 AI567995 W60076 BF941 1 83 AI738844 BE81 1 458 BE773481 AI252930 AA948565 BE706942 BE1 56360 T65026 AW242958 AW1 97954 
BE905184 M722206 AI344943 AI348877 AI33486C BES21 857 BE156280 AA454099 AA037722 BF843897 AWB06183 AA043216 BG482896 
AA182734 M877242 AW372926 H27252 R38114 BF851858 BE156214 AA190427T91762 AA035067 AAB37326 T10930 BF906587 BI755027 
BG506731 BC008442 BC010166 AL550134 AL553096 AL548700 AL550751 AL547978 AL5452BS AL540643 AU1 1 8627 AL601379 BI259821 
BG741786 BI868522 AU1 35866 BI552770 B1259210 BI256520 BI255569 BG485098 BI258228 BG498501 BM044512 AU133984 AL556586 
BE745111 BI222633AU133917BG288151 BI260715 BI550550 BG500773 BI551761 BG707601 BI81 8593 BF691 383 BG721 129 BG541 578 
BE906666 BG751098 BI224136 BG400746 BG478065 BE790436 AW080238 AU137549 BG429896 BE392486 AW961686 BG721056 BE908365 
BE546656 BG541235 AW583735 BG528290 BI2G0895 AWS51691 BM048974 BM043805 BG142185 AA315188 AI446615 C06300 BG497644 
AA088544AI815987BG528631 BE619182AW239185AW062910AW062902AA347236F11933AA488005AA301631 AA376800D56120 
AA343632AA308636 F00242AA376086AA316968 M343799BI870221 BE910282BG538748 AW960564AV732879 D1S854AA192519 BF922148 
AA216013 BG624091 BE544387 BG507008 AW17G446 BF790033 BE088925 BE088854 M921353 R21800 M01 1222T97525 
442506 29197.1 NM_002640 L40377 BG227962 AI925897 W50100 BG980023 AA853496 AA36O401 AW9561 68 W61361 A1074846 AA373798 BG193973 BG499418

BF81 6267 AI675932BG61 1734 BG777123 AW4494B5 BF912016 BE566411 BM068728AW976385AL121194AI767324BM054718AW366882 
436772 1239464.1 AA156151 Z25109 C05177 AW975G88 AA731063 NG7084 
434596 14701.1 AF147374T59538T59589T59598T59542 

412564 18571.1 X83703 NM_014391 BC018667 AU139209 BE924924 BE924899 BE924977 BE924985 BE924947 BF229055 BE924917 BE924938 BE812621

BE924937 AA9G9134 BE924898 BF2290GG BF229054 BF081177 BE924934 BF229040 BE925037 BE925011 BG943776 AA488072AA486364 
Z36249 AW455806 AW582753 BF081 1 31 BE92491 0 BF229058 BF081 1 72 BE924956 AW378093 BF081 132 BF081 1 68 BE81 2735 BE81 2738 
BF081165 C04160C04483BF229048 

442006 1239046.1 AW975183 AA973583 A1365103AIG99495 AI301787 

TABLE 69C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Ntposition: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Ntposition

402463 9796896 Minus 8818-8952 

400494 9714719 Plus 169845-170272 

401234 9929642 Plus 120173-120337 

406506 7711374 Minus 6843-8077 

405025 7107727 Plus 105267-105343,106184-106294,106387-10653 

405121 8102330 Minus 35816-36004,36587-36684 

400666 8118496 Plus 17982-18115,20297-20456 
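
The Ntposition column above packs one or more predicted exons into a comma-separated list of start-end pairs. The following sketch shows one way such a row could be split into exon intervals; the function names are hypothetical, and treating the coordinates as inclusive on both ends is an assumption that the legend does not confirm.

```python
def parse_nt_position(field):
    """Parse an Ntposition string such as '35816-36004,36587-36684'.

    Returns a list of (start, end) integer tuples, one per predicted exon.
    """
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start {start} exceeds end {end}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# One row copied from Table 69C: Pkey, Ref, Strand, Ntposition.
row = "405121 8102330 Minus 35816-36004,36587-36684"
pkey, ref, strand, nt_position = row.split()
exons = parse_nt_position(nt_position)
length = total_exon_length(exons)
```

Splitting on whitespace works here because none of the four columns contains a space; a truncated Ntposition entry would still parse, but its last interval should not be trusted.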



TABLE 70A: 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of the mean of the vessel AIs to the mean of the HUVEC AIs

Pkey ExAccn UnigeneID Unigene Title

428928 BE409838 Hs.194657 cadherin 1, type 1, E-cadherin (epitheli

439180 AI393742 Hs.199067 v-erb-b2 avian erythroblastic leukemia v

412636 NM_004415 desmoplakin (DPI, DPII)

426158 NM_001982 Hs.199067 v-erb-b2 avian erythroblastic leukemia v

414320 U13616 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G)

417878 U90916 Hs.82845 Homo sapiens cDNA: FLJ21930 fis, clone H

414572 AU077174 Hs.288181 cathepsin H

415314 N88802 Hs.5422 glycoprotein M6B

431103 M57399 Hs.44 pleiotrophin (heparin binding growth fac

406973 M34996 Hs.198253

456974 M12529 Hs.169401 

430560 Z28942 Hs.243960 
Hs.84298 



N-myc downstream-regulated gene 2 



NM_012445 Hs.288126 spondin 2,






406824 AW515961 
418299 AA279530 
437536 X91221 



442402 NM_000954 Hs.8272



Hs.144465 ESTs 



454042 
413283 
424247 
422103 
415994 
427393 
414217 
414063 



A984330 Hs.1 11676 
M.002923 Hs.78944 
B029018 Hs.177635 



hypothetical protein FLJ20093
cathepsin K (pycnodysostosis)
lysozyme (renal amyloidosis)
protein kinase H11; small stress protein
regulator of G-protein signalling 2, 24k
KIAA1095 protein

Homo sapiens cDNA: FLJ23165 fis, clone



453464 
410677 
452685 
425869 
406643 
453767 
424614 
444195 
417233 



BE184455
D63209 
AW960427 
AW960427 



T19239 

BE175494 

AU076484 



AI634651 

M524547 

N77976 

AB011792 

X54486 

AB002351 

W25005 

U54617 

M21574 

AL031846 



423915 
410036 
412047 



Hs.5944 

Hs.342874 

Hs.342874 

Hs.342874 

Hs.303205 

Hs.3603 

Hs.1940 

Hs.75811 

Hs.9963 

Hs.32989 

Hs.65424 

Hs.30250 

Hs.160318 

Hs.347939 

Hs.35094 

Hs.151242 

Hs.10587 

Hs.24395 

Hs.8364 

Hs.74615 

Hs.196647 
Hs.317596 
Hs.323511 



solute carrier family 11 (proton-coupled
transforming growth factor, beta recepto 
transforming growth factor, beta recepto 
transforming growth factor, beta recepto 



N44348 Hs.251385 

U53445 Hs.15432 

AI827976 Hs.24391 

AFC39018 Hs.1 35281 

R57171 Hs.57975 

AA934589 Hs.49696 



FXYD domain-containing ion transport reg 



s.16578 ESTs 



extracellular matrix protein 2, female o 
serine (or cysteine) proteinase inhibito 
KIAA0353 protein 

small inducible cytokine subfamily B (Cy 
Homo sapiens pyruvate dehydrogenase kina 
platelet-derived growth factor receptor, 
plakophilin 4 
KIAA0527 protein 

Homo sapiens cDNA FLJ12927 fis, clone NT
Homo sapiens cDNA: FLJ23176 fis, clone L



Homo sapiens cDNA FLJ11177 fis, clone
downregulated in ovarian cancer 1
hypothetical protein FLJ13612
alpha-actinin-2-associated LIM protein



hairy/enhancer-of-split related with YRPW



421834 
423961 
447384 
425383 
447023 



AW468397 Hs.100000 S100 calcium-binding protein A8 (calgran
AF114264 Hs.216381
Hs.288771
Hs.136348
Hs.40528 



Hs.195851



443604 C03577 

421913 AI934365 

451331 AK00203S 

416908 AA33399C 

407938 AA905097 



D13666 
AI377221 
D83407 
M356764 
AW801090 
M32053 

NM_003480 Hs.300946 

AF128847 Hs.204038

NM_001928 Hs.155597 

AA345519 Hs.9641 

AI091195 Hs.65029 

AI917901 Hs.208641 



Hs.153684 

Hs.173594 

Hs.9615 

Hs.109439

Hs.251385 

Hs.80424 

Hs.85050 

Hs.83164 

Hs.32478 

Hs.91296 



periostin (OSF-2os)
ESTs 

Down syndrome critical region gene 1-lik 

actin, alpha 2, smooth muscle, aorta 
gb:Human H19 RNA gene, complete cds.
Microfibril-associated glycoprotein-2 



D component of complement (adipsin) 
complement componei 
growth arrest-specific 1 
ESTs 

four and a half LIM domains 1

frizzled-related protein 

serine (or cysteine) proteinase inhibito 

myosin regulatory light chain 2, smooth 

osteoglycin (osteoinductive factor, mime 

Homo sapiens cDNA FLJ11177 fis, clone PL

coagulation factor XIII, A1 polypeptide

phospholamban 

collagen, type XV, alpha 1 

ESTs 

integrin, alpha 8 

NM_019111*:Homo sapiens major histocompa




407815 


AW373860 


Hs.183860 


hypothetical protein FLJ20277


421B53 


AL117472


Hs.108924 


SH3-domain protein 5 (ponsin) 




AI017574 


Hs.17409


cysteine-rich protein 1 (intestinal) 




NM_000088


Hs.172928


collagen, type I, alpha 1






Hs.84298 


CD74 antigen (invariant polypeptide of m 








scrapie responsive protein 1 


42781 B 


AW511222 


Hs.193765 


ESTs 


433465 


AV657778 


Hs.3314 


selenoprotein P, plasma, 1 




AI769067 


Hs.127824 


ESTs, Weakly similar to T28770 hypotheti 


459290 


NM_001546


Hs.34853 


inhibitor of DNA binding 4, dominant neg 




NM_003734




amine oxidase, copper containing 3 (vasc


453874 




Hs.36131 


collagen, type XIV, alpha 1 (undulin) 




NM_003881


Hs.194679 


WNT1 inducible signaling pathway protein 






Hs.159263 


collagen, type VI, alpha 2 


417043 


NM_004369


Hs.80988 


collagen, type VI, alpha 3




X76534 


Hs.82226 


glycoprotein (transmembrane) nmb 


452669 


AA216363 


Hs.262958 


hypothetical protein DKFZp434B044 




BE440002 


Hs.180324 


Homo sapiens, clone IMAGE:4183312, mRNA,




AI342493 


Hs.24192


Homo sapiens cDNA FLJ20767 fis, clone CO




AI478427 




esophageal cancer related gene 4 protein 






Hs.245188 


tissue inhibitor of metalloproteinase 3 




N40850 


Hs.28625 


ESTs 




BE297626 


Hs.296049 
Hs.8944 


microfibrillar-associated protein 4 
procollagen C-endopeptidase enhancer 2 




AW939251 




v-fos FBJ murine osteosarcoma viral onco 




Y18207 


Hs.303090


protein phosphatase 1, regulatory (inhib




AU076772 


Hs.1279 


complement component 1, r subcomponent 




AW887604 


Hs.78065 








Hs.75794 


endothelial differentiation, lysophospha 


452114 


N22687 


Hs.8236 


ESTs 




AI624300 


Hs.172928


collagen, type I, alpha 1 




AW291464 


Hs.10338 


ESTs 






Hs.74602


aquaporin 1 (channel-forming integral pr 


406849 


M454809 


Hs.172928


collagen, type I, alpha 1 






Hs.89463 


potassium large conductance calcium-act


429500 


X78565 


Hs.289114 


hexabrachion (tenascin C, cytotactin)




AF017986 


Hs.31386 


secreted frizzled-related protein 2




L12350 


Hs.108623 


thrombospondin 2 


427373 


AB007972 


Hs.130760 


myosin phosphatase, target subunit 2 


414290 


AI568801 


Hs.71721 


ESTs 




AF001548 


Hs.78344 


myosin, heavy polypeptide 11, smooth mus


417130 




Hs.81256 


S100 calcium-binding protein A4 (calcium 




AA334592 


Hs.79914 




453355 


AW295374 


Hs.31412 


Homo sapiens cDNA FLJ11422 fis, clone HE




AA912183 


Hs.47447 


ESTs 


430223 


NM_002514 




nephroblastoma overexpressed gene 


416585 


X54162 


Hs.79386


leiomodin 1 (smooth muscle) 


422737 


M26939 


Hs.119571


collagen, type III, alpha 1 (Ehlers-Danl


426406 


AI742501 


Hs.169756


complement component 1, s subcomponent 


448429 


D17408 


Hs.21223 


calponin 1, basic, smooth muscle 


413G24 


BE177019 


Hs.75445 


SPARC-like 1 (mast9, hevin)



Accession: Genbank accession nu 



M77830NM 0044 1 5 AF 139065 BG68 1 1 1 5 BG740377 BI712964 BG0Q0656 AA128470 BI438324 H27408 BE931 630 B E1 671 65 AW370827 
AW370813 J05211 BG698865 BG740734 BG6B0518 BG739778 BI765807 BM353403 BM353248 AW177784 AW205789 AW951576AW848592 
BE182164 BF149266 BE940187 BI060445 BI050444 BF350983 BE720095 BE720069 BE715154 BE082584 BE082576 BE004047 AA857316 
BI039774 BE71381 8 BE713548 AW1 70253 BE160433 BI039775 AW886475 BM452504 BE931734 BF149264AA340777 BF381183 BG621737 
AU127260 AW364859 BF993352 BG223489 BE81 9009 BF381 1 84 EE715956 R58704 AA852212 AW366566 BI090358 BF087707 BE819046 
BE819005M377127 BE073467 BE819059 BE819048 BI03S306 BG990973 B I040954 BF919911 AU 1 401 55 AI951 7S6 AI43451 8 AW804674 
BF752969 BE837009 BE925826 BF149265 AW99561 5 BEB1 4264 B1039782 AU140407 BE144243 BE709863 BF985642 BE001923 BF933510 
AW265328 BG436319 BE182166 AW365175 AW847688 EE8182B0 AW177933 BF873679 AW1 78000 BE082526 BF476866 BF086994 BF592276 
BE082507 BE082514 BE082505 BF873693 AW053840 AW847678 BF804153 AW365157 BE813930 BE002030 AW365153 BE184941 BF749421 
BE184920 BF839562 BE184933 BF842254 BE698470 BE931048 BF999889 BF368816 BE184924 BE159646 BE714632 BE184948 BG986845 
AA131128 AA099891 W39488 C04715 BF095124 EE865341 AW799304 AL603116 BE149760 BE705967 BE705966 BE705968 AW848723 
AW376699 AW376817 AW376697 BG005097 BF751 1 1 5 BE696084 AWB48371 AW376782 AW848789 AW849074 AW351413 BF927725 BF09421 1 
AW997139 BE865474 BE185187 BE156621 BE715089 BE713297 BE713298 BE179915 AW799309 BF872345 BF088676 BE705939 AW752599 
BG005197 BF350086 BE715196 BE715155 BF752396 BF093817 BFB31 190 BF752409 BE006561 BG959922 BF094833 BF094748 BF094583 
AW377699 AW607238 BE082519 AW377700 BF349467 AI190590 AI554403 A1392926 AU158477 BI4S7252 AU159919 AI760816 BF082516 
AI439101 AA451923 AI34032SAI590975 BI791553 AI700963 A1142882 M039975 AA946936 AA644381 BM314884 AA702424 A1417612 
AW1 90555 AI220573 AI304772 A1270345 AI627383 AA552300 AI91 1702 AW166807 A1346078 W95070 AA1491 91 AA026864 AI830049 AW780435 
AI078449 AI819984 AI858282 BI468588 AI860584 AI025932 AA026047 AA703232 AA658154 AA515500 AW192085 AA918281 T77861 AI927207 
AI205263BF082491 AW021347AI568096BE939M2M088ro6D12062AA056527AA782109W19287W02156AVV150038AA022701T87181 
H44405AI910434 BF082513 AI4940S9/ 70027 Al 1 i M128330BG681425BE706078 R20904BG680059BG676647BF764409AA026654 
AV745530 BI762796 BG287391 AW798780 BE706045 BE926470 AW7991 18 BF087996 BE002273 AW879451 AI571075 BE067786 AV721320 
AI022862 N29754 C03378 N84767 M131077 H30146 BE714290 AI686869 AI568892 AI915596 AW105614 AI887258 AI538577 BE926474 
BE067737 BG319486 AA247685 AW798883 AW103521 BF989173 AW860878 BE939707 BE185750 BE714064 BE713903 BE713868 BE713763 
BG950164 BE71 3810 AW365151 BG955489 BE005272 BF915937 AW365148 AI905927 BF992780 AW853812 BG954443 BI770853 BG679406 




BG740832 BG681087 BG698430 AA455100 T87267 BE696209 BE696210 BI089483 BE006273 BE872225 AW391912 BE925515 BG677012 
BG741970 AA026480 BE705999 BG677157 BE009090 BG681378 BE712291 BG961498 BG678984 BI040941 AA337270 AW384371 AW847442 
BI058659 BE813665 W95048 W25458 AW177786 M025851 BE931733 BF154837 BG949393 BE714441 AW996245 BE711801 AI284090 
BE064323 BE719390 BE940148 BG991 21 2 BF37571 4 BF349522 BG996267 T48793 BI013292 BE001925 AW365156 AW365154 AW606653 
BF7S3109 BE931637 BE167181 BE713879 BF354008 BF678726 H90899 AW365145W38382AI498487 

AJ420458 AI018523 M708686 BF949633 AL1 1 9553 BF945960 AI081 305 AA041 432 AI921 01 3 AI68491 0 AI654847 AW8741 99 AI206120 
AW241428 R43035 T66767 AW1 0371 5 W28478 BF953052 H45926 BF8075S8 AW903943 BE170143 BI040435 BF931989 BI600000 AV722350 
W27787 H45331 BI549761 RS39SS BI549855 BG991 583 BI491075 AW020049 AW1 29293 H45263 M41 0309 AA340613 R4241 0 M707199 
AI431587 BE858679 AW292267 AI421 57S AA041 195 BE4667E3 AI24391 3 AI358894 AW1 37298 AI366468 NS4350 AA779107 AW025969 R49056 
AA34701 1 R55722 AW771106 F04969 Z38381 F01659 H17396 BI493714 AI880103 AW771447 AI202561 AA788851 AI494436 BF8561 14 H22570 
AW964381 BG007409 BM314056 AA465642 T30661 T3311 1 Z42834 C04542 BF948152 BF944325 BF858895 M935284 AI267360 N64249 
W67500 F07962 AA322394 BI489987 BE644965 BM313782 AA910364 AI809246 AA836750 BF115228 AA829730 N27413 BM141766 AI742325 
AA455261 AA938708AI420241 Al 130039 BF222341 BF941301 AA771807Z41034 BF447457BF447426AW515347 T03874F02360 F02302 
N34898AI678586AA807824AA948556AW204638 BM1 42045 BF446775AV648364AI801368AA971739AI017351 AA760722AI460007 AI458383 
AI694152 AA226536 BE467282 N47808 BE348825 BE830581 EE830583 N53009 N59351 N41056 W67501 AA016246 H43293 F21282 R59121 



TABLE 70C 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Ntposition: Indicates nucleotide positions of predicted exons.



Pkey 



Strand 



TABLE 71A: 774 GENES UP-REGULATED IN EWING SARCOMA COMPARED TO NORMAL ADULT TISSUES

Table 71A lists 774 genes up-regulated in Ewing Sarcoma compared to normal adult tissues. These were selected from 35371 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" Ewing Sarcoma to "average" normal adult tissues was greater than or equal to 2.1. The "average" Ewing Sarcoma level was set to the 75th percentile
amongst numerous Ewing sarcomas. The "average" normal adult tissue level was set to the 85th percentile amongst various non-malignant tissues. In order to remove gene-specific
background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the
ratio was evaluated.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title

R1: 75th percentile of Ewing sarcoma to 85th percentile of body map
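
The selection rule described for Table 71A (75th percentile of tumor samples over 85th percentile of normal tissues, each reduced by the 10th percentile of the normal tissues, with a cutoff of 2.1) can be sketched as below. The function names and sample intensities are hypothetical, and the linear-interpolation percentile is an assumption, since the source does not say how percentiles were computed.

```python
def percentile(values, p):
    """Return the p-th percentile (0-100) using linear interpolation.

    The interpolation method is an assumption; the source does not specify one.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one value")
    rank = (len(ordered) - 1) * p / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def background_subtracted_ratio(tumor, normal,
                                tumor_pct=75, normal_pct=85, background_pct=10):
    """Ratio of tumor to normal 'average' levels after background subtraction."""
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        return None  # ratio undefined when normal signal is at background
    return numerator / denominator


def is_upregulated(tumor, normal, cutoff=2.1):
    """Apply the 'greater than or equal to 2.1' criterion from the text."""
    ratio = background_subtracted_ratio(tumor, normal)
    return ratio is not None and ratio >= cutoff


# Hypothetical intensities for one probeset.
tumor_ai = [100, 200, 300, 400, 500]
normal_ai = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110]
ratio = background_subtracted_ratio(tumor_ai, normal_ai)
```

Subtracting the same background value from numerator and denominator keeps a probeset with uniformly high non-specific signal from passing the cutoff on background alone.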



101447 
105782 
115881 



131291 
101063 
121619 



ExAccn UnigeneID

AW9G9769 Hs.105201
M21305 

H09748 Hs.57987 

NM_005756 Hs.184942

AW969726 Hs.98381 

AW862258 Hs.169266 

AF061573 Hs.19492 

AA316181 Hs.61635 

AL008583 Hs.91622 

AA918317 Hs.57987 

U29690 Hs.37744 

AA737106 Hs.32250 

AF050147 Hs.97932 

NM_004350 Hs.170019

D54745 Hs.80247 

AA528339 Hs.178062 
AW975398 Hs.293836 
D49493 Hs.2171 
NM_000399 Hs.1395



124006 
116301 
121231 
106533 



116790 
105316 
102123 



AL13470B 
AA219691 

R96290 Hs.75874 
AW161357 

AI671245 Hs.24835 

NM_001809 Hs.1594 

AL049601 Hs.13649 

AA554907 HS.5B291 

AW292315 Hs.7215 



AA314490 
AA80B001 
C14187 



ESTs, 

neuropeptide Y receptor Y1 
protocadherin 8
six transmembrane epithelial antigen of
neuronal pentraxin receptor
B-cell CLL/lymphoma 11B (zinc finger pro



ESTs" 1 3 



tode-a ysii 



to I7B885 serin 



runt-related transcription factor 3



ESTs, Weakly similar to phosphatidylseri 



early growth response 2 (Krox-20 (Drosop
gb:no97c02.s1 NCLCGAP_P:2 Homo sapiens 
ESTs; homologue of PEM-3 [Ciona savignyi 
ESTs 

ESTs, Weakly similar to ALUC_HUMAN !!!



145998 ESTs 



Hs.74376 
Hs.26369 
Hs.27669 
Hs.184411 
Hs.103538 ESTs 



RAB6 interacting, kinesin-like (rabkines
ribosomal protein L44
microtubule-associated protein 1
hypothetical protein FLJ14594
centromere protein A (17kD) 
Novel human ! 
ESTs 
ESTs 

neuropeptide Y receptor Y5 






126077 M78772 Hs.210836 

100698 AA884407 Hs.211595

106821 NM_006594 Hs.28298

110288 H40665 Hs.31564 

106498 AI221919 

112134 R41823 Hs.7413 

115266 AA334274 Hs.18368 

128639 AW582962 Hs.102897 

125698 AF078847 Hs.191356

105200 AA328102 Hs.24641 

119456 AA248897 Hs.48784 



AA648459 Hs.335951 

BE252983 Hs.35086

AF234532 Hs.61638

NM_006276 Hs.184167

W20090 Hs.6616 

AA495904 Hs.103316 

AI249502 Hs.29669 



hypothetical protein FLJ10582



Homo sapiens, Similar to TEA don 
hypothetical protein AF301222 
ubiquitin specific protease 1 



123289 

123532 

125556 
127240 
123049 



AA084273 Hs.76561 
AB033064 Hs.236463 
AJ005683 Hs.86998 
BE047680 Hs.211869 
AF091086 Hs.44563 
BE538082 Hs.8172 
NM_003616 Hs.102456
Hs.35120 
Hs.26411 
Hs.101265 
Hs.59761 



131945 NM_002916 
110730 N67655 
119186 AI979147 



ESTs 
ESTs 

gb:aeS6f06.s1 St 
ESTs, Weakly similar to S47072 finger pr
KIAA1238 protein
nuclear factor of activated T< 
dickkopf (Xenopus laevis) hoi 



. iimilartoA4601OX-!in 
af motor neuron protein interac 
in factor C (activator 1)4 (37 



ESTs, 



121916 AW117207 Hs.98523 ESTs



ESTs, Weakly similar to DAP1_HUMAN DEATH 1 0.3 



131402 
101879 
133181 
131307 



127003 
115414 
117829 
126098 
121910 



120217 

131601 
12G232 



Hs.113503
Hs.243886 
Hs.66744 



karyopherin (importin) be 



AA457211 

X98834 

AA148859 

W07288 

AW816515 



Hs.173540 
Hs.283099 
Hs.247117 






M79088 

AI204600 Hs.96978 

AF176012 Hs.260720 

R42216 Hs.7759 

AW247529 Hs.6793 

AI940675 Hs.20914 

AA862257 Hs.66035 

T03927 Hs.293147 

NM_007115 Hs.29352 

AI0B5153 Hs.143764 

AW997484 Hs.5003 

AU077228 Hs.77256 

M187554 Hs.300496 

D82343 Hs.74376 

AF013758 Hs.109643 

AB026187 Hs.159156 

AI347487 Hs.132781 

AB023191 Hs.44131 
AA443241 

AW970602 Hs.105421 

AW952412 Hs.65874 



NM_015156 Hs.78398 KIM0071 pp 



twist (Drosophila) homolog (acr
adrenergic, beta-3-, receptor
bromodomain adjacent to zinc finger doma
sal (Drosophila)-like 2
hypothetical protein FLJ22995
ESTs, Weakly similar to LNHUER IgE Fc re
ATPase, Class V, type 10D
AF15q14 protein 

ESTs, Moderately similar to ALUF_HUMAN ! 

gb:EST01236 Subtracted Hippocampus, Stra 

hypothetical protein MGC10764 

J domain containing protein 1 

Homo sapiens clone 24538 mRNA sequence 

platelet-activating factor acetylhydrola 

hypothetical protein FLJ23056

ESTs 

ESTs, Moderately similar to A46010 X-lin 
tumor necrosis factor, alpha-induced pro 
ESTs, Weakly similar to unknown [H.sapie
ESTs, Weakly similar to A46010 X-linked
KIAA0456 protein

enhancer of zeste (Drosophila) homolog 2



polyadenylate binding protein-interactin 



I cytokine receptor 
KIAA0974 protein 
ribosomal protein L44 



Is, Weakly similar to A40348 Elav/Sex- 



AA745781 
AI654133 



Hs.27688 
Hs.287733 
Hs.14600 



107254 Z45998 



Hs.5364 



Hs.22543 

AA339449 Hs.82285 

119082 AF252297 Hs.91546 

125400 AL110151 Hs.128797

129413 AW377610 Hs.11123 

116766 AI608657 Hs.95097 

129075 BE250162 Hs.83765 



hypothetical protein FLJ23189
ESTs

hypothetical protein MGC2454

SWI/SNF related, matrix associated, acti 

thyroid receptor interacting protein 15 

DKFZP564I052 protein 

gb:zn27h06.r1 Stratagene neuroepithelium

Homo sapiens mRNA; cDNA DKFZp761l1912 (f

phosphoribosylglycinamide formyltransfer



dihydrofolate reductase 






ESTs, Weakly similar to I38022 hypotheti







AW771958 

W37943 

AW877787 

N45018 

AI674126 



120325 AA195651 



133276 
127742 
127664 



AA806164 
AF133123 
M412488 
BE614410 
M319146 



Hs.69504 
Hs.180138 
Hs.116502 
Hs.90847 
Hs.48820 
Hs.23044 
Hs.75426 

Hs.22003 



gb:nc26a07.s1 NCI_CGAP_Pr1 Ho
gb:nc08a07.r1 NCI_CGAP_Pr1 Homosai 
ESTs, Moderately similar to PC4259 fern 



hemoglobin, beta 

Homo sapiens cDNA: FLJ22559 fis, clone H
hypothetical protein FLJ20624
AP-2betatra 



general transcription factor IIIC, polyp
TATA box binding protein (TBP)-associate 
RAD51 (S.cerevisiae) homolog (Ecoli Re 



103346 


X87613 


Hs.5464 


thyroid hormone receptor coactivating pr


6.4 




AA205976 


Hs.50141 


gb:zq48a10.r1 Stratagene hNT neuron (937 
EST 


6.4 
6.4 


105301 


AW352357 


Hs.7457 


MAGE1 protein 






AI869372 


Hs.17207 


Homo sapiens cDNA FLJ11922 fis, clone HE


6.3


127968 


AA830201 


Hs.124347


ESTs 


6.3


134719 


AA852985 


Hs.89232


chromobox homolog 5 (Drosophila HP1 alph 


6.3 


110748 


AW581992 


Hs.301434 


KIAA1387 protein 


6.3 


115844 


AI373062 


Hs.332938 


hypothetical protein MGC5370 




121885 
102398 


AA934883 


Hs.98467 


ESTs, Highly similar to AF257737 1 cilia 
gb:Human N33 protein form 1 (N33) gene, 


6.2 
6.2 


130882 


AA497044 


Hs.20887


hypothetical protein FLJ10392


6.2 


132256 


AI078645 


Hs.431 


murine leukemia viral (bmi-1) oncogene h 


6.2 


106383 




Hs.27860 


Homo sapiens mRNA; cDNA DKFZp586M0723 (f 


6.2 


135186 


U73328 




distal-less homeobox 4 


6.2


105510 
109261 


Z42047 


Hs.283978


Homo sapiens PRO2751 mRNA, complete cds
Homo sapiens cDNA FLJ13591 fis, clone PL


6.2 
6.1 




R41677 


Hs.6107


Homo sapiens cDNA FLJ14839 fis, clone OV


6.1 


128501 


AL133572 


Hs.199009 


protein containing CXXC domain 2 




109747 


AI223001 


Hs.22969 


ESTs, Weakly similar to Z141_HUMAN ZINC


6.1 


128361 


AW172570 


Hs.14600 




6.1 


115773 


AW445044 


Hs.38207 


Human DNA sequence from clone RP4-530115 


6.0 


104689 


AA420450 


Hs.292911 


Plakophilin 


6.0 


113575 


AW138168 


Hs.15671


ESTs, Weakly similar to KBF3_HUMAN NUCLE 


6.0 


121830 


AW394055 


Hs.98427 


ESTs, Weakly similar to I38022 hypotheti 


5.9 


119298 


NM_001241
H01052 


Hs.155478 


cyclin T2

gb:yj32h01.s1 Soares placenta Nb2HP Homo 


5.9 
5.9 


115622 


AI088691 


Hs.208414 


Homo sapiens mRNA; cDNA DKFZp564D0472 (f 


5.8 


122969 


AW821252 




hypothetical protein 


5.8 


109872 




Hs.28653 


ESTs 


5.8 


114208 


AL049466 


Hs.7859 


ESTs 


5.8 


113494 




Hs.86538 


ESTs 


5.8 




AA668631 


Hs.159971 


KIAA0379 protein 


5.8 


129300 


W94197 


Hs.110165 


ribosomal protein L26 homolog


5.8 


127489 


AA650250 


Hs.272076 




5.8 


133479 




Hs.238797 


ESTs, Moderately similar to I38022 hypot


5.8 


105909 


AA195191 




hypothetical protein FLJ20729


5.8 


101255 


BE385864


Hs.149894 


mitochondrial translational initiation f 


5.8 


134676 


W28051 


Hs.87819 


Homo sapiens, clone MGC:2492, mRNA, comp 


5.7 


130895 


AA641767 
AI468004 


Hs.21015 


hypothetical protein DKFZp564L0864 simil 
hypothetical protein FLJ12929


5.7 
5.7 


130723 


BE247676 


Hs.18442


E-1 enzyme 


5.7 


107230 


AI034467 


Hs.34650 


ESTs 


5.7 


102745 


AW753865 


Hs.74376 


olfactomedin related ER localized protei 


5.6 


108699 


AA121514 


Hs.70832 


ESTs 


5.5 


128080 


F12310 




gb-JHSC38D041 normalized infant brain cDN 


5.5 






117357 


N24829 




gb:yx98h12.s1 Soares melanocyte 2NbHM Ho 


120827 


AA382525 


Hs.132967 


Human EST clone 122887 mariner transposo


123494 


AW179019 


Hs.112110 


mitochondrial ribosomal protein L42 


100438 


AA013051 


Hs.91417 


topoisomerase (DNA) II binding protein 


126165 


AI741816 


Hs.125897


ESTs 


125934 


AA193325


Hs.32646 


hypothetical protein FLJ21901


124370 


AI243499 


Hs.170652 


ESTs 


106135 


AL117474


Hs.41181 


Homo sapiens mRNA; cDNA DKFZp727C191 (fr




R49390 


Hs.254129 


KIAA1678 






Hs.61635 


six transmembrane epithelial antigen of 


109606 


Z43371 




ESTs 




BE042915 


Hs.22572 


Homo sapiens cDNA FLJ13675 fis, clone PL




AA610620 


Hs.181165 


major histocompatibility complex, class 


131185 


BE280074 


Hs.23960


cyclin B1 


111227 


T06701 


Hs.12268




105516 


AK001269 




hypothetical protein FLJ10407




NM_004454


Hs.43697 


ets variant gene 5 (ets-related molecule 


116008 


T79153 


Hs.48589 


zinc finger protein 228


106394 


Z42993 


Hs.25320 


Homo sapiens clone 25142 mRNA sequence 


115121 


AI634549 


Hs.325422 


ESTs 


125464 


N71807 




gb:yz29d09.r1 Soares multiple_sclerosis_


115596 


AW298597 


Hs.61884 


Homo sapiens, clone IMAGE:4298026, mRNA, 


119040 


R02394 


Hs.269436 


L T } i ly sin to 1 f n 


128040 


AW500486 




splicing factor proline/glutamine rich ( 


110151 




Hsl31608 


hypothetical protein FLJ20041


105539 


AB040884 


Hs.109694 


KIAA1451 protein 


130557 


M383092 


Hs.1608 


replication protein A3 (14kD)


111077 


N41367 


Hs.173002 


ESTs, Weakly similar to I38022 hypotheti 


120830 


AI568170 


Hs.96886 


ESTs 


106516 


AL137311 


Hs.234074 


Homo sapiens mRNA; cDNA DKFZp761G02121 


103316 


X83301 


Hs.324728 


SMA5 


116129 


AF189011


Hs.49163 


putative ribonuclease III 


131253 


R71802 


Hs.24853 


ESTs 


116680 


AW902848 


Hs.273829 


ESTs 


123949 


M621665 


Hs.208957 


EST 


105511 


AB037829 


Hs.3862 


regulator of nonsense transcripts 2; DKF 


125847 


AW161885 


Hs.269745 


ESTs 


108301 


AA069728 


Hs.184582 


ribosomal proteinic 


110799 


AI089660 


Hs.323401 


dpy-30-like protein 




AA054726 


Hs.285574 


ESTs 


125972 


AI927475 


Hs.35406 


ESTs, Highly similar to unnamed protein 


107869 


AW975998 


Hs.58595 


ESTs, Weakly similar to I38022 hypotheti 




AA293834 


Hs.97312 


ESTs 


125321 


T86652 


Hs.178294 


ESTs 


102627 


AL021918 


Hs.158174


zinc finger protein 184 (Kruppel-like)


101446 


AF084555 




cyclic AMP phosphoprotein, 19 kD 




H79863 


Hs.114243


ESTs 



116814 
130622 
110818 

108768 
120484 
114518 



H50834 
AI582291 
AL157503 



Hs.27552 
Hs.129074 
Hs.61345 
Hs.96473 
Hs.106469 

Hs.102243 
Hs.125034 
Hs.130853 
Hs.28274 
Hs.17109 



j ,uf jiii I ,c fel ivi rspleen 
ESTs, Weakly similar to 04HUD1 debrisoqu 
Homo sapiens mRNA; cDNA DKFZp586N2424 (f 
ESTs, Moderately similar to MEG1 MOUSE M 



suppressor of var1 (S.cerevisiae) 3-like



Homo sapiens cDNA: FLJ22049 fis, clone H
integral membrane protein 2A
gb:zn09g08.r1 Stratagene hNT neuron (937
gb:yr77g01.r1 Soares fetal liver spleen
D123 gene product



127987 
129706 
129598 



100448 
127542 
132452 



AW805510 

AW954064 

AI022103 

AA443241 

N30436 

AL096739 

AA034922 

AF234887 

AA703684 

AW352149 

AI475110 

AW136703 

X02404 

AA377165 



Hs.97056 
Hs.24951 
Hs.124511



KIAA0863 protein 

follicular lymphoma variant translocatio
Homo sapiens mRNA; cDNA DKFZp547D086 (fr 
ESTs 

hypothetical protein FLJ21634



cadherin, EGF LAG seven-pass G-type rece
ESTs.Moderal i > L HUMANE 
mitochondrial ribosomal protein S14 



Hs.203933 ESTs 

Hs.17268 ESTs 

Hs.274534 calcitonir 

Hs.44833 ESTs 






113974 


AW969756 


Hs.34145 


ESTs, Weakly similar to B49647 GTP-bM 


133098 


AW629223 


Hs.64794 


zinc finger protein 183 (RING finger, C3 


109920 


H05840 


Hs.111323 


ESTs 


103487 


AA743603 


Hs.172108 


nucleoporin 88kD 


125353 


AB033043 


Hs.149377 


hypothetical protein DKFZp761L0424 


100893 


BE245294 


Hs.180789 


S164 protein 




AL134197 


Hs.93597 


cyclin-dependent kinase 5, regulatory su 


111148 


AB020690 


Hs.7782 


paraneoplastic antigen MA2 




BE379594 


Hs.49136 


ESTs, Moderately similar to ALU7_HUMAN A 




AL133092 


Hs.68055 


hypothetical protein DKFZp434l0428 


132742 


AA025480 


Hs.292812 


ESTs, Weakly similar to T33468 hypotheti


104912 


AA813192 


Hs.200596 


KIAA0547 gene product 


110223 


H19836 


Hs.31697 


ESTs 


113047 


AI571940 


Hs.7549 


ESTs 


101031 


J05070 


Hs.151738


matrix metalloproteinase 9 (gelatinase B


113002 


BE243513 


Hs.7212 


hypothetical protein PP1044 


105304 


AW134924 


Hs.190325 


ESTs 




AA831785 


Hs.171914 


Homo sapiens cDNA FLJ14209 fis, clone NT


106489 


AA452054 


Hs.119338


ESTs 


122732 


AW188551 


Hs.99519 


hypothetical protein FLJ14007


112651 


AA595802 




ESTs, Weakly similar to T17279 hypotheti 




AW970976 


Hs.293653 


ESTs 


123596 


M421130 


Hs.112640


EST 




AI582222 


Hs.128686 


ESTs 


106178 


AL049935 


Hs.301763 


KIAA0554 protein 


102250 


AI249361 


Hs.74122 


caspase 4, apoptosis-related cysteine pr 


105868 


M378780 


Hs.334842 


tubulin alpha 1 


127496 


AI031650 




ESTs 




AF116622




gb:Homo sapiens clone FLB4217 mRNA seque 


110193 


AI004874 


Hs.310764 


Homo sapiens mRNA; cDNA DKFZp434M082 (fi 


105012 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


129445 


W52452 


Hs.29797 


ribosomal protein L10 


114721 


D61939 


Hs.103822 


ESTs 


120922 


AA481003 


Hs.97128 


ESTs 


115167 


AA749209 


Hs.43728 


hypothetical protein 


102407 


AW602154 


Hs.82143 


E74-like factor 2 (ets domain transcript




AW875398 


Hs.6451 


PRO0659 protein 


123829 


AF251237 


Hs.112208


XAGE-1 protein 


103616 


NM_002647


Hs.32971 


phosphoinositide-3-kinase, class 3


100269 


NM_001949


Hs.1189


E2F transcription factor 3 



112728 R31313 Hs.272104 EST" "II i I ALU1.HUMAN A 

135098 AW27452G Hs.277721 membrane component, chromosome 17, surfa 

106034 AW952005 Hs.14928 hypothetical protein FLJ12903 

133571 BE515Q37 Hs.177556 melanoma antigen, family D, 1 

106246 AL036917 Hs.288821 KIAA1638 protein 

125724 AL360190 Hs.318501 Homo sapiens mRNA full length insert cDN 

132206 AA425204 Hs.334721 hypothetical protein FLJ13391

130227 BE397151 Hs.153003 serine/threonine kinase 16 

129650 AF109298 Hs.118258 prostate cancer associated protein 1

130382 NM_003450 Hs.155204 zinc finger protein 174 

106073 AL157441 Hs.17834 downstream neighbor of SON 

105403 AI473827 Hs.31793 ESTs 

118138 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, 

134126 NM_003747 Hs.131814 tankyrase, TRF1-interacting ankyrin-rela

115041 AA252457 Hs.86543 ESTs, Moderately similar to T00256 hypot 

106012 AI240665 ESTs 

116732 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 

130832 AW838006 Hs.20082 zinc finger protein 3 (A8-51) 

125960 AI754693 Hs.145968 ESTs 

133916 AL039185 Hs.77558 thyroid hormone receptor interactor 7

106232 AB037828 Hs.15370 KIAA1407 protein 

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 

109929 AA773187 Hs.294027 ESTs 

123729 AL039779 Hs.278672 membrane component, chromosome 11, surfa

101266 L35545 Hs.73964 EphA4 

132389 AA310393 Hs.190044 ESTs 

124320 H95749 Hs.102342 EST 

103023 AW500470 Hs.11795C multifunctional polypeptide similar to S 

116133 AW449597 Hs.313652 EST, Weakly similar to I38022 hypothetic

129001 AA443323 Hs.107812 BPOZ protein

119271 AI061118 Hs.65328 Fanconi anemia, complementation group F 

114767 AI859865 Hs.154443 minichromosome maintenance deficient (S.

126107 H75477 Hs.93361 ESTs 

115333 BE241812 Hs.87860 protein tyrosine phosphatase, non-recept

100571 L14561 Hs.78546 ATPase, Ca++ transporting, plasma membra

134104 L35253 Hs.79107 mitogen-activated protein kinase 14

120150 BE005771 Hs.153746 hypothetical protein FLJ22490

134470 X54942 Hs.83758 CDC28 protein kinase 2

129948 AI537162 Hs.263988 ESTs

106882 AA149537 Hs.25994 hypothetical protein FLJ20477

126199 AW088276 Hs.125829 ESTs

112727 T91029 Hs.15069 ESTs






Hs.24950 
Hs.82921 
Hs.16875 
Hs.186518 
Hs.79133 



AW630087 

AI470523 

NM_005574

AL038511 

N74210 

NM_002975

AI244489



115399 
133615 
118129 
126522 
131965 



112996 
129428 
118967 
127335 
10GG3G 

125905 

131971 

128895 
110924 
112921 
l?08?0 



W31912 

W79283 

U52154 

M704253 

W57578 

AI133161 

U51166 

AI434760 

BE276112

AA256906 

AA830210 
AW958037 
W45623 
U29725 



130580 
130174 
115577 



AA984531 
N32388 
M29551 
AA393167 



126127 N95428 

103175 X69089 

100154 H60720 

106140 AB006624 



Hs.125316 

Hs.105927
Hs.285724 
Hs.46506 
Hs.182426 
Hs.104413 
Hs.38178 



regulator of G-protein signalling 5 
solute carrier family 35 (CMP-sialic aci 
ESTs, Weakly similar to S23650 retroviru 
hypothetical protein MGC2599 similar to 
ESTs 

AFFX control -DapX-M 
RAN binding protein 6 
trinucleotide repeat containing 1 
ubiquitin-conjugating enzyme E2E 1 (homo
ATP-binding cassette, sub-family C (CFTR
LIM domain only 2 (rhombotin-like 1)
ESTs, Weakly similar to S33990 finger pr 
ESTs 

stem cell growth factor; lymphocyte secr



hypothetical protein FLJ23468
Hs.805U6 small nuclear ribonucleoprotein po
Hs.92023 core histone macroH2A2.2
Hs.75236 ELAV (embryonic lethal, ab

gb:yy54c08.s1 Soares_multiple_sclerosis_
Hs.21168 gb:zc76d03.s1 Pancreatic Islet Homo sapi
Hs.35962 ESTs 

Hs.193044 potassium inwardly-rectifying channel, s



Hs.169359 ESTs



Hs.237955 

Hs.286131 

Hs.173824

Hs.279949 

Hs.7165 

Hs.111364 

Hs.216756 

Hs.214263 

Hs.286 

Hs.74571 

Hs.3080 

Hs.6456 

Hs.82932 

Hs.154938 

Hs.179999



Hs.77196 
Hs.159293
Hs.334370 



Hs.81892 
Hs.14912 
Hs.158210 



AA296B74 Hs.77494 



AW450979 
AA316241 
T47910 



116203 
123377 
122798 
107780 
132733 



117417 AI241733 

106272 AW959731 

110834 AW273860 

123663 AL039482 

124250 AA350256 

127038 AA233014 

107711 W96141 



Hs.90691 

Hs.20976 
Hs.87306 
Hs.271297 
Hs.145696 
Hs.269213 
Hs.123654 
Hs.192843 
Hs.23754 
Hs.135260
Hs.43871



RAB7, member RAS oncogene family 
CGI-101 protein 
thymine-DNA glycosylase 
KIAA1007 protein 
zinc finger protein 259 
ESTs, Weakly similar to ubiquitous TPR m 
ESTs 

ESTs, Moderately similar to ALU1_HUMAN A 
ribosomal protein L4
ADP-ribosylation factor 1 
mitogen-activated protein kinase 7 
chaperonin containing TCP1, subunit 2 (b
cyclin D1 (PRAD1: parathyroid adenomatos 
hypothetical protein MDS025 
Homo sapiens, clone IMAGE:3457003, mRNA 



glioma-amplified sequence-41 

Homo sapiens, Similar to zinc finger pro 

Human DNA sequence from clone RP5-876B10 



ESTs 

gb:zb80d09.s1 Soares_senescent_fibroblas

heparan sulfate (glucosamine) 3-O-sulfot 

myomesin (M-protein) 2 (165kD) 

KIAA0101 gene product 

KIAA0286 protein 

hypothetical protein MGC2556 

tumor necrosis factor receptor superfami 

hypothetical protein MGC3199 

deoxyguanosine kinase 

gb:od71a09.s1 NCI_CGAP_Ov2 Homo sapiens

gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su

gb:yb18b11.s1 Stratagene fetal spleen (9



Hs.105965 ESTs






130850 


AB040922 


Hs.20237 


DKFZP566G134 protein 


119126 






ESTs 


125466 


R08234 


Hs.180461 


ESTs 


105150 


AA631977 


Hs.155995 


KIAA0643 protein 


103163 
104495 


AU077018 


Hs.3235 
Hs.292979 


keratin 4 
ESTs 




AW975687 


Hs.60136




113577 


AI300699 


Hs.111334 


PRO0470 protein 


102681 


Y08890 


Hs.113503 


karyopherin (importin) beta 3 




R98881 


Hs.109655 


sex comb on midleg (Drosophila)-like 1


127563 


AB035898 


Hs.150587


kinesin-like protein 2


131112 


H15302 


Hs.168950 


Homo sapiens mRNA; cDNA DKFZp566A1046 (f 


114118 


F01598 


Hs.175930 


ESTs 


103937 


AA934063 


Hs.13836 


ESTs, Weakly similar to I38022 hypotheti


125174 


W51835 


Hs.231082 


EST 


104799 


AA029703 




gb:ze95h08.s1 Soares_fetal_heart_NbHH19W 


128952 


AL043463 


Hs.6755 


RaP2 interacting protein 8 


135191 


X16866 
AI683183 


Hs.333497 
Hs.99348 


cytochrome P450, subfamily IID (debrisoq 
distal-less homeo box 5 


113560 


T91015 




ESTs 


119232 


AI655226 


Hs.117659


ESTs, Weakly similar to T46481 hypotheti 


113988 


W87536 


Hs.36473 


ESTs, Weakly similar to JC5238 galactosy 


115173 




Hs.88252 


ESTs 


12GG00 




Hs.191385 


ESTs 




AI738610 


Hs.267967 


ESTs, Moderately similar to ALU8_HUMAN A 


123419 


T66087 


Hs.112482


Homo sapiens unknown mRNA sequence 


102025 


U04045 


Hs.78934 


mutS (E. coli) homolog 2 (colon cancer, 


100661 


BE623001 


Hs.132748 


Homo sapiens ribosomal protein L39 mRNA, 


132339 


D80030 


Hs.45127 


chondroitin sulfate proteoglycan 5 (neur 


115691 


AW190215 


Hs.62348 


hypothetical protein FLJ11753


134853 


BE268326 


Hs.90280 




134075 


NM_012201 


Hs.78979 


Golgi apparatus protein 1 


128468 


Hs.150580


putative translation initiation factor 


127229 


AA316181 


Hs.61635 




108451 


AA079195 




gb:zm92h12.s1 Stratagene ovarian cancer 


133421 


AF134160 


Hs.7327 


claudin 1 


135332 


AW393883 


Hs.98968 


hypothetical protein FLJ23058


132520 


AA257992 


Hs.50651 


Janus kinase 1 (a protein tyrosine kinas 


'25002 


AI494372 


Hs.98968 


hypothetical protein FLJ23058 




BE161065 


Hs.167531 


methylcrotonoyl-Coenzyme A carboxylase 2


125558 


R59305 




gb-./hl ISoa ia tb n 1NIB F 


117205 






RAB9-like protein 


130215 


BE301883 


Hs.152707 


glioblastoma amplified sequence 


109482 


AI002238 


Hs.11482


splicing factor, arginine/serine-rich 11






Hs.252716 


oxysterol-binding protein-related protei


132317 


BE262438 


Hs.44592 






AW029507 


Hs.161102


ESTs 


100944 


L07518 




mucin 6, gastric 




R58958 




hypothetical protein MGC15880


125032 


T74884 




gb:yc5Bd02.s1 Stratagene liver (937224) 


123720 


M609734 


Hs.112755


EST 


128846 


M730767 




SCG10-like-protein


116443 




Hs.339808 


LBP protein 32 


128770 


AB015982 


Hs.143460 


protein kinase C, nu 




AA076049 


Hs.274415


Homo sapiens cDNA FLJ10229 fis, clone HE


131244 


AI638429 


Hs.24763 


RAN binding protein 1 


128765 


AF073310 


Hs.143648


insulin receptor substrate 2 


111223 




Hs.334838 


KIAA1866 protein 


104857 


AI920902 


Hs.19058 


ESTs, Moderately similar to S65657 alpha 


100390 


AI580880 


Hs.268149 


putative methyltransferase 



133582 BE391579 Hs.75087 Fas-activated serine/threonine kinase 

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobul

106367 AA504747 Hs.136102 KIAA0853 protein 

103392 X94563 gb:Rsapiens dbi/acbp gene exon U 2. 

101086 AA382524 Hs.250959 histatin 1

133423 T84084 Hs.196008 Homo sapiens cDNA FLJ11723 fis, clone HE

117714 N45226 Hs.46495 EST 

110986 AL133117 Hs.81376 Homo sapiens mRNA; cDNA DKFZp535L1 121 (f 

114096 AF060219 Hs.27007 chromosome condensation 1-like 

117147 AW901347 Hs.38592 hypothetical protein FLJ23342

133347 BE257758 Hs.71475 acid cluster protein 33 

132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 

133765 M62194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteob 

109734 AI927212 Hs.3734 ESTs 

132786 BE083422 Hs.56851 hypothetical protein MGC2668

106685 AU076617 Hs.16251 cleavage and polyadenylation specific fa 

105593 AA279341 Hs.174151 aldehyde oxidase 1 

131331 M92642 Hs.26208 collagen, type XVI, alpha 1 

108687 BE544475 Hs.54347 ESTs 

113115 AI141426 Hs.8705 ESTs 

107234 AA972447 Hs.288833 Homo sapiens mRNA; cDNADKFZp434KG87(fr 

123110 AA486256 Hs.193510 EST 






129079 


AK000157 


Hs.103502 


hypothetical protein FLJ20150 


111122 


N63753 


Hs.16492 


DKFZP564G2022 protein 




BE278111 


Hs.134200 


DKFZP564C186 protein 


128671 


AI885045 


Hs.211536 


phosphoinositide-3-kinase, regulatory su 


123258 


AA490929 


Hs.105274 


ESTs, Weakly similar to RMS1_HUMAN REGUL


128826 


Z40313 


Hs.106330 


Homo sapiens clone IMAGE:23371, mRNA seq


120149 


AA227609 


Hs.94834 


ESTs 


128817 


BE395776 


Hs.168640 


ankylosis, progressive (mouse) homolog 


110679 


AA004798 


Hs.108311 


ESTs, Weakly similar to T00351 hypotheti 


122365 


AA813546 


Hs.99034 


GTP-binding protein Rho7 


134455 


BE378152 


Hs.83530 


hypothetical protein 


115506 


AB037756 


Hs.45207 


hypothetical protein KIAA1335 


134267 


AI174596 


Hs.196209 


RAE1 (RNA export 1, S.pombe) homolog




AA443164 


Hs.23259 


hypothetical protein FLJ13433




BE245294 


Hs.180789 


S164 protein


120120 


BE547267 


Hs.59791 


hypothetical protein MGC13183


126638 


M649257 


Hs.86998 




128531 


H03721 


Hs.63236 


ribosomal protein S15a 




AL049951 


Hs.22370 


Homo sapiens mRNA; cDNA DKFZp564O0122 (f 


108859 


AL121500 




ESTs 


112276 


R53442 


Hs.26038 


ESTs, Weakly similar to I38022 hypotheti


125693 


H23989 


Hs.169743


Homo sapiens clone 25121 neuronal olfact 


113474 




Hs.23856 


hypothetical protein MGC5297


132994 


M1 12748 


Hs.279905 


clone HQ0310 PRO0310p1 


124691 


R05835 


Hs.110153


ESTs 


112511 


AW970420 




dynactin 2 (p50) 


132985 


AL045579 


Hs.62113 


KIAA0717 protein 


125743 


H17151 


Hs.7416 


gb:ym37a05.r1 Soares infant brain 1NIB H 


133363 


AI866286 


Hs.71962 


ESTs, Weakly similar to B36298 proline-r


107908 


AF087999 




ESTs 


105312 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


110256 


H63947 


Hs.237955 


RAB7, member RAS oncogene family


132073 


AA295052 


Hs.38516 


Homo sapiens, clone MGC15887, mRNA, com 


107090 


AW809208 


Hs.183297 


DKFZP566F2124 protein 


105463 


AA825974 


Hs.32646 


hypothetical protein FLJ21901


109592 


AI198059 


Hs.26370 


ESTs 


113649 


N94768 


Hs.16400 


ESTs, Weakly similar to KIAA1435 protein


103898 


AA248884 




gb:k3517.seq.F Human fetal heart, Lambda 


116439 


AA251594 


Hs.43913 


PIBF1 gene product 


129535 


AA397972 


Hs.112603


chimerin (chimaerin) 1 


113283 


T66813 


Hs.12947 


EST 


128992 






ESTs 


133160 


N54968 


Hs.66309 


hypothetical protein MGC11061 


134076 


AF086215 


Hs.78980 


gb:Homo sapiens full length insert cDNA 


128301 


U90552 


Hs.284283 


butyrophilin, subfamily 3, member A1 


127728 


AW404061 




protein kinase C, beta 1 


126516 


R95872 


Hs.117572 


chemokine binding protein 2 


127506 


T61039 


Hs.76067 


ribosomal protein L10a


104769 


AA025887 


Hs.293943


hypothetical protein MGC11266




AA648886 


Hs.151999 


ESTs 


130453 






PAX transcription activation domain inte


107131 


AW961605 


Hs.21145 


hypothetical protein RG083M05.2 


130422 


AW160614 


Hs.180034 


cleavage stimulation factor, 3' pre-RNA,


121292 






gb:zv65f11.s1 Soares_total_fetus_Nb2HF8_


123284 


AA488988 


Hs.293796 


ESTs 


130734 


AW137091 


Hs.18624 


KIAA1052 protein 




AF198620 


Hs.10283 


RNA binding motif protein 8A 


105014 


M121123 


Hs.269267 


ESTs, Weakly similar to AF161361 1 HSPCO 




AA251089 




gb:zs04f05.s1 NCI_CGAP_GCB1 Homo sapiens


132867 


AF226667 


Hs.58553 


CTP synthase II 


124169 


BE079334 


Hs.271630 


ESTs 


114652 


AI521936 


Hs.107149 


novel protein similar to archaeal, yeast 


113876 


AI799751 


Hs.5635 


ESTs 


111520 


AI985369 


Hs.301134 


ESTs 


121748 


BE536911 


Hs.234545 


hypothetical protein NUF2R 


106834 


AL044182 


Hs.28070 


KIAA0753 gene product 


128869 


AA768242 


Hs.80618 








Hs.284243 


tetraspan NET-6 protein


126770 


AI292320 


Hs.81361 


heterogeneous nuclear ribonucleoprotein 


116734 


AW900992 


Hs.93796 
Hs.157425 


DKFZP586D2223 protein 


130999 


M326683 




SkelyorUiTo^of'molse vari-nt polyade 


114092 


H31213 


Hs.14825


ESTs, Weakly similar to KIAA1503 protein


129284 


AA318224 


Hs.296141 


ESTs 


119127 


AA708035 


Hs.12248 


ESTs 


123910 


AA621262 


Hs.179923 


ESTs, Weakly similar to S55657 alpha-1 C- 


132664 


AI740461 


Hs.54542 


ESTs 


118397 




Hs.161492 


ESTs 


126735 


AW138437 


Hs.24790 
Hs.226795 


KIAA1573 protein 


130847 


M69113 
AI672483 


Hs.20220


glutathione S-transferase pi
lipase protein 


101186 


M020956 


Hs.179881 


core-binding factor, beta subunit 


113626 


T94318 


Hs.17359 


ESTs, Moderately similar to RL44_HUMAN 6






121782 


AW452957 


Hs.63348 


Homo sapiens, clone MGC:1520a mRNA, com 


119863 


M081213 


Hs.58608 


Homo sapiens cDNA FLJ14206 fis, clone NT
AI221919 Z19967 AA348780 AW964077 AW166028 BE540193 N94800 AA452368 N99604 A1341345 AW298800 AA724961
M931 158 AI741227 AI806560 AI982626 D81253 D53937 D52496 AA974487 AW043854 N50483Z39997 AI492961 AI361526 
F04002 AA452141 T23551 AI472655 Al 193667 A1341984 N92658 T32870 R52664 N50428 AW089291 AI934175 AI423737 D60665 

R18656 AVTO80U 1 AM62^8 7 8 7 AW968002 X93079 AJ002788 R51324 AI381600 D80031 AW500520 AW593740 F09382 AA810597 

AA443241 ^MRSmsmW AI457612 R63254 R28645 R27616 H01310 R78329 R76046 R76055 M369734 1457914 H94864 
AI953633R31669 AW769278 R82398 AA131925 R21776 R79031 AI129553 N70340 AI276116 AA709381 N63734 AI342605 
AA327133 AI805980 M005377 AW61 1716 T86946 M369083 R67250 H00240 R32578 W86279 R80248 R23734 H00977 R21732 
R78932AA368675 R16780 R26523 H40161 W93671 R99823 R77956 H61601 R98785 R09738H78317 H53537 H81056T53780 
H12600R82345 M653499 AW953717 R63207 R33888H54071 H66993T48748AI702300 R30775 H68996 AI014957 M369082 
AF075351 AI110886 A1742050 AW954245 A1753458 C19062 AI393674 C18911 C18029 M708613 AI168432 R67389 AI168453 
H00188R21233 H030S^ 22 C75E T r 372 W04151 AI220869 AI090290 AA368730 R94434 H81153 H70950 AA367783 
H81514H53536 C16968 C17797 C17677 C170S4 AA082581 C17044 D63217 C18791 D78839 D63290 D78838 N91085 H54070 
R80360 H78318 H40199 R33887 R02529 H94918 R00900 T87029 AA131924 R99891 M004267 D78764 D78757 H61600 C18615 
H60g72 R98784 wef 323 R09737 C17492 D78811 R67494 D78775 M368244 H12650 D79043 AW957062 R82759 C19002 H01715 
W28614W27435AI983043 M364395AW572472AW19D386AI129278AI913081AW473549 M830713AI982371 AI638647 
AI828466 AW572486 N52583 N89687 AW075557 AI571047 AI887479 AI559469 AI685802 AA805256 AI458777 AA974369 AI866929 
AI886032 AI823925 AI823566 AW198135 AI287510 AI565910 M765775 AI866019 AI253697 AI355825 R42668 AA894603 
AW105585 AI824555 AW339175 D20479 

NM 014253 AF100772BE088769 AL022718 BE151779AW863569 BE161640AL039060BE168542AW296554M323193 
AA235370 AW779760 N48674 AI375997 R45432 D59344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T61382 AI01 6320 

ATOM235 AWim9 3 6 1 M888929 BE174517 Z70704 AL042292 H54958 AA310063 AW499643 AW501380 AW501202 AW501293 
AW501596AW630166AL041326AA780690 L40517 U21556 AI093182 ^062487 M883387AA931302BE252601 BE258381 
M827330 AL043930 M356337 AW962330 AW39381 4 Z70703 AW389484 T62231 AW780049 AL043931 BE149744 AW963292 
BE439669 BEO91737 BE091657AA484004A1673O95AA5991O6AW6O1545AI538739AI53873OAI521786AW366369AWO21O10 
AA362576AI289S27 AA382592 



AW206193 AW137594 AI953685 AI919143 AI424371 AW007698 AW025681 AI954289 R40438AW070364 AI679653 AI679081 
AI623213 AI652310 AI63141 1 AI565384 AI566734 M149597 AI538172 AI040831 AI770021 AI914287 AA279311 W73001 AI493117 
AI693374 AI802007 AI990545 N 29752 R94559 AA806475 AA806828 R90998 R94558 AW300112 W24097 AW003016 N91920 
T47910 
M029703 

AF093097 AI869509 T27070 AA326718 AA024743 W23922 AA479593 AI765668 W23908 U92986 AA081632 N50578 AA065245 
\\ 46AW365014 AA.961091 ^ H53426AA234651 N50522 T79343 T87399 N91858 H41179 AW009453 M024744 

T27069 A1346379 H88431 AA152289 R45373 AA477432 AI745607 AI807602 H41152 AA065244 AI2425S9 AI091032 AI251849 
F03857 H88369 Al 1 74488 A!002695 H53427 T87293 N91 869 W35270 AA453723 M453705 AW071829 AI393866 AW071 807 
W23592AI342074 D31158AA833756AI991895 AA447791 AI864125AW377239AA295365AI687970 R34076 F07607AA521310 
AW499855 R94584 R21283 
AA350256H68126 
M21305 
N24829 



AA195191 AA190578 AI632307 AA195227 AA743633 AI823408 A1832203 AI6531 14 AA205307 AW021913 AI687980 A1674198 
AI675563AI417935AA-W C \A13515 41434721 AI151036 AI038305 R52643 M780141 AW207645 D19691 AI474370 
M4C1739 N22905 N70378 



Table 72A shows the Seq ID No, Pkey, ExAccn, UnigeneID, and Unigene Title for all of the sequences in Table 73.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title
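
The column layout described above can be illustrated with a short parsing sketch. This is only an illustration and not part of the disclosed methods: the names `TableRow` and `parse_row` are hypothetical, and the regular expression assumes a row consists of a numeric Pkey, a Genbank-style accession, an optional "Hs." Unigene number, and a free-text title. Rows damaged during text extraction will simply fail to parse.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed row shape: Pkey, ExAccn, optional UnigeneID, Unigene Title.
# The accession pattern (1-2 letters, optional underscore, digits) is an
# assumption that covers forms such as AW952005, U40714 and NM_003450.
ROW = re.compile(
    r"^(?P<pkey>\d+)\s+"
    r"(?P<exaccn>[A-Z]{1,2}_?\d+)\s+"
    r"(?:(?P<unigene>Hs\.\d+)\s+)?"
    r"(?P<title>.+)$"
)


@dataclass
class TableRow:
    pkey: int                  # unique Eos probeset identifier number
    ex_accn: str               # exemplar Genbank accession number
    unigene_id: Optional[str]  # Unigene number, absent in some rows
    title: str                 # Unigene gene title (may be truncated)


def parse_row(line: str) -> Optional[TableRow]:
    """Return a TableRow, or None when the line does not fit the layout."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return TableRow(int(m["pkey"]), m["exaccn"], m["unigene"], m["title"].strip())
```

For example, `parse_row("106034 AW952005 Hs.14928 hypothetical protein FLJ12903")` yields a row with Pkey 106034 and UnigeneID "Hs.14928", while a row that lacks a Unigene number, such as "106012 AI240665 ESTs", parses with `unigene_id` set to `None`.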




AI671245 
AW292315 
U94320 

C14187 
A1917116 
BE047680 



132520 AA257992 



Seq ID number correlation for those sequences in Table 73 

ExAccn UnigeneID Unigene Title

177231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos 

248508 Hs.279727 ESTs; homologue of PEM-3 [Ciona savignyi 

862258 Hs.169266 neuropeptide Y receptor Y1 

161573 Hs.19492 protocadherin 8 

969726 Hs.98381 ESTs, Weakly similar to serine protease 

r37106 Hs.32250 ESTs, Moderately similar to 178885 serin 

528339 Hs.178062 ESTs, Weakly similar to phosphatidylseri

969769 Hs.105201 ESTs 

)748 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro

000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 

3493 Hs.2171 growth differentiation factor 10 

969706 Hs.293332 ESTs

1 34708 Hs.145998 ESTs 
Hs.75874 



NM_002975

DE388665 

AA143045 

AU077228 

AF050147 

AA316181 

U29690 

D54745 



Hs.211869 

Hs.91546 

Hs.7457 

Hs.100343 

Hs.6451 

Hs.2549 

Hs.96886 

Hs.116502

Hs.159156 

Hs.50651 

Hs.17109 

Hs.44685 

Hs.179999
Hs.81665 
Hs.77256 
Hs.97932 
Hs.61635 
Hs.37744 
Hs.80247 



; (a protein tyrosine kinas 



C3HC4-like zinc finger protein 

stem cell growth factor; lymphocyte secr

Homo sapiens, clone IMAGE:3457003, mRNA

v-kit Hardy-Zuckerman 4 feline sarcoma v

enhancer of zeste (Drosophila) homolog 2

chondromodulin I precursor 

six transmembrane epithelial antigen of 

Homo sapiens beta-1 adrenergic receptor 



Seq ID No 

Seq ID No B1 & B2
Seq ID No B3 & B4
Seq ID No B5 & B6
Seq ID No B7 & B8
Seq ID No B9 & B10
Seq ID No B11 & B12
Seq ID No B13, B14, & B15
Seq ID No B16
Seq ID No B17 & B18
Seq ID No B19 & B20
Seq ID No B21 & B22
Seq ID No B23 & B24
Seq ID No B25-B27
Seq ID No B28 & B29
Seq ID No B30 & B31
Seq ID No B32
Seq ID No B33 & B34



Seq ID No B41 & B42
Seq ID No B43
Seq ID No B44 & B45
Seq ID No B46 & B47
Seq ID No B48 & B49
Seq ID No B50-B53
Seq ID No B54 & B55
Seq ID No B56 & B57
Seq ID No B58 & B59
Seq ID No B60
Seq ID No B61 & B62
Seq ID No B63 & B64
Seq ID No B65 & B66
Seq ID No B67 & B68
Seq ID No B69 & B70
Seq ID No B71 & B72
Seq ID No B73 & B74
Seq ID No B75 & B76
Seq ID No B77 & B78
Seq ID No B79 & B80
Seq ID No B81 & B82
Seq ID No B83 & B84



Unique Eos probeset identifier number 
Gene cluster number
Genbank accession numbers 



NM_000025X72861 X70811 W2S932 X70812 S53291 AW015148 AW581776 AI276134 

»3290 HC2411 C18327 AA3( 75 J AA367557 H89632 C17954 AA568860 AI752983 AA699451 H04260 AI128118 AW193364 N94503 
^02 ,14'!- A 68741 \I1( - l 6E047384 AA747591 R22855AI032539AI151343AA148534 H63941T49595AA694405H74226 
AI200363 R79731 AA702947 AI400076 AI088494 C17938 AA599478 H02962 R77565 C17370 R65618 H7371 1 R58545 D79189 AW265710 
R77664T52101 AW953745 AW953739 D79107 AA029105 AW953738 AA456487 R67000 AA1 56623 AA368336 H63662 
AW969706M456258M491881 BE501639 D62113AW969710AI591236AI379869AW968997AA040053AI307206AW663917AA454645 
AA489238 BE241958 AA743491 

AU077231 AA852219 M74092 X59798 M64349 NM.001758 AA226806 M73554 BE409154 AA160096 BE384352 AA160820 BE382880 
BE261734 AA1 13821 BE407745 AA156380 BE390287 BE390020 AA100854 AA127152 AW794066 AW367101 AW367093 U47703 AI347077 
W05266 AI824103 AI499061 AA642944 A1042556 AA906539 W60380 AI571777 AL135581 AA112340 N75459 AA592929 AI085348 AI278890 
AA126942 AI023701 AI873252 AA156319 AI190622 W60289 AI27488S R81309 AA100801 AA227161 AI568929 AA160603 AI074344 AI344561 
AI150778 AA852218 AA158286 N20142 AA622148 AA864225 AA576367 AW182124 T89175 AI758455 AA780573 N71757 R81200 AI659596 
AI674613 AA642544 AW503909 AA128851 W39350 N40420 AA113072 BE168116 AI620604 AI298125 BE075272 N40078 BE075109 BE080779
AI918938 BE168117 BE087369 AW995539 BE080949 BE080727 BE075271 BE075108 BE080955 BE089655 BE081115 AW750304 H66084
AI146884 BE075154 AW992247 AI186525 AI752230 AW263140 W03329 N26056 AA948080 AA113073 H99284 AA227101 AA631077 
AA148042 A1740837 BE082728 M149570 W44495 BE089351 AA375044 N26775 H27771 AA064705 BE091204 R89337 N32676 N27141 
BE164704 H98049 W67603 AA425549 W31090 M807411 BE173280 BE000178 T09020 W23852 AA062709 BE167894 AA076515 R97329
BES41980 N42036 AA102307 AA113772 BE276181 H20622 W44436 WS7604 W46412 AW771113 AI700678 AA502628 AA133137 BE274186
BE396090 BE613371 BE612645 W46650 W95203 W92651 AI087288 R76299 AW604781 N55320 AI912334 AA403248 AW169156 H24970
AW298322 AW080962 AI073747 W24123 M577596 H21715 H27925 H2S436 AI288304 AA143043 AA204678 BE047090 W48631 AA908347
AA599485AI27650^ 95397 563710 H ~ M 47 AA425389 AA516104 AI095335 T77237 AA151696 T92084 AI689037 AI624162 
W49709 AW514883 AA100676AI366087 AA069474 AA525859 AW771076 AA029402 AA994114 AI351505 AW770816 AI333594 AI289794 
A1346589 AA487700 AI081 104 AA6 1 3344 AI377520 A128491 1 AI31 1390 AA622062 AI055890 AI660881 AI3661 17 AA403090 AI272818 
A073353 W46300M Al ' 173AI753 -i f-f i C4A12 4/1 'I '9890 AA699584 AA983722AI079968 AI752231 AA076431 AA113245 
A1168564 AA918965 AI066484 AI123599 AI921 518 W34586 AA535600 AA064B65 AA705388 AA064623 AA962503 AI924926 AW131206 
AW275281 A1280632 T29597 W48728 AW954336 W383 1 7 W94768 AI084717 W46567 AI245645 AW302501 N72201 AW510563 AW079132 
AA207054AI143740AW440672 AA632154 AI290286 AI350704 AI271377 AA025369 AI864756 T77451 H97348 AA852165 AI932951 N98526 
AA487486 R92970 AA934071 AI080448 AA063257 C05786 N99099 R42969 AA887065 AA6626EE c i < V ">04 H51748 BE539444 
AI382164 AI814595 BE537043 AI168307 BE408935 AA453506 R89428 AA936527 AA936890 AW369618 AW264602 R18074 AI474189 
AW372354 AI094358 R37210 AA948510 AA226909 BE172527 AI086652 BE408324 AW292848 AI768952 BE540703 BE409478 AA931692

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132520 45737,1 AA257992 M317029 M3620S7 D17042 H48100 AW838107 AW838106 AW838110 AW838231 AW610241 AW338442 AA045061 AW384991 

BE000988 AA131806 BE180577 AW838269 AL039831 AI754380 C06051 AA131737 AW838239 AI767465 AI480134 AA610312 AW838190
AA379252 AW838282 AA484027 AI423413 AI685064 BE328307 AI241857 AW838281 AW838284 AI671267 AI814928 AA828367 AI476306
AI270180 AA622362 AI076497 AI635919 AI444994 AI194028 AI669159 AI928131 AA448353 AI350143 AW079289 AW467807 AA480442
AA522935 BE180570 AW380087 AW380109 AI081015 AI690818 AI589485 AI698510 AAB42019 AA714366 AI580430 AA985527 AI740475

C21398 AA257993 A1302393 AIS8901 8 AV/7701 94 AW753750 AI079164 BE550338 AA559851 T16108 A1864822 A1932827 AA045095 
AA045062 AI954225 AA768559 AA7093Q8 AW958363 

117602 10565_-4 N35020 

102725 11582.1 AB026187 NM_0H522 U79247 F13304 M224524 F10902 R39431 

102745 13186 1 AW753865 F07644 F11280 F06355 F08136 U79299 F07459 F08750 F12419 F12842 F06488 F08585 F13403 F05921 F05512 F11683 F05416

F12841 F12810 F05418 F12850 R24551 F06275 H11803 H10237 H11542 H22894 R59563 H19351 H18722 F11237 F08507 H23123 R61595 
F07796 F06201 F12289 F07107 T78113 F08734 F13344 F05760 F11784 F08780 F05835 R20588 F07739 F11497 R17410 H17414 R20440
T66090 R25292 T66236 T78766 T79908 R25286 R60071 C147S1 AW905192 AA331914 AW965291 H09000 F05212 F08313 F05825 H06399
H15135 H11378 R66424 R61541 AA235405 AI205041 R59564 H23124 H18638 R37359 AI571275 R41780 T66174 T66159 R54102 W07657

H10794 R52337 R42890 H14354 R40952 T17391 H12068 R56797 R51000 R60011 R37617 R39434 H29286 T16403 H09129 R49114 H15161

H11729 R39160 H10468 H24454 H11432 R54433 H08768 H24042 H09511 H17281 H15292 R37849 AI991965 R52815 R42850 T17390 T16433 
T17388 T16285 R48963 R50782 R44247 R46729 R59558 R40696 R43575 R44420 T17403 R37754 R46637 R51039 R40513 T23785 F10433
T16350 H101S1 R43688 T23543 H08591 R44351 R37575 R49508 F04379 F04056 F05067 F02010 F09158 F10036 F08900 F02559 F09914 
F09434 F05034 H17415 F02644 H10650 F10451 H14589 F10444 T16440 R44517 R46643 F03993 F10443 F02170 F09343 F10414 F04794

F11122 F04997 F03894 AW005932 F02767 F10941 F01777 R45260 F03386 R45261 T65009 F10990 F08942 F01673 H23253 F04533 F02082

F01669 F03689 H09568 R40763 H06400 R50958 N80570 AI082296 AI198226 T23763 AA814486 AW015823 AI886272 H06514 AA515969 
F10049 AI984267 AI886279 AA558516 F02071 AI783843 AI804884 F01468 AA730438 AF035301 R38389 R41440 R59468 R14437 T89036 
R43560 H10821 R52850 R56722 R14559 T09299 H10467 R25829 F13546 R56796 R51151 D82657 T32952 T66257 H06569 F12430 T74326
H08489 R34430 R13083 H23364 R21134 T77098 AW957157 H24150 H11403 R52336 F12830 T77173 T74281 T77663 T74736 H10649 H12067

R20092 T74735 R54530 R17552 R59557 R54529 H17280 T75171 H14694 T74975 H15291 R18574 R54157 R13446 H15221 R35120 R35031

R21277 R19632 R35820 R50899 R13923 R36771 R26485 R33061 R20100 H24426 H09184 H15958 R19253 H15196 H09510 R36032 R19691 
H14402 H14885 T17068

119082 34131 1 AF252297 N27888 F13415 R34552 R1 4377 AA3239S2 AW61 151 4 AW614097 AW195487 AI026040 A1097663 AA012834 AW969885 AA988198 

R49069 AW873673 R37774 F08840 AA484668 AW873654 AA170805 AA1 70825 AA585275 M585280 AA012833 
30 102836 25380J U94320 NM_006174 U6627S 

125770 16581 2 AA143045 AW271802 AI017093 AA450183 W93694 AIB35044 AI890068 AW264522 AW073559 AI355325 AI051302 AI567686 AI143554 

AI457891 AW468784 AI567693 AW884460 AI424892 AA401298 AW244098A1 122936 N24824 AA552932 AA873164 A1308810 N26098 H81852 
N20798 AW263947 AI696267 AA449422 AW877967 

120147 386607.2 A1917116 AA633698 AIE80389 Z39483 

113003 420161J AW292315 AI090408 AW517041 AI928695 AI092060 AA716741 N98235 T23629

104659 82594.1 AW969769 AA484137 A1291362 N54753 AL133784 AA007153 BE046923 D59990 C151SS D59991 D60194 AA648108 N76160 D61027 D60630 

104691 41308.1 U29690 AF272890 AA708166 M010748 R23839 M59572 R23383 A1625747 AA01 1 176 AW665324 H02S74 H59573 AA757931 H12502 R66851 

AA702310 AA748431 AA749352 C01127 AI003679 T96352 

105301 27795 5 AW352357 AA852419 T80568 AA332510 AW149659 AA341609 M809550 AI183690 AA730875 T31719 AW965361 AA779709 T05461 

AA564806 AI142536 AI553857 D53384 D53747 D54101 D55112 AW071935 AI082447 AI290911 AW168872 AA199743 AI091646 AI246702

AI271662 AW117295 AI032138 AI741 489 AI538705 AA233541 A1682685 AW1 36268 AI689641 M599916 AI174443 AI432175 AI131328 
AW136277 AA852418 AA243035 T03902 CI 5651 D52650 H08108 M844525 AI420648 AI282445 AW068986 AW497637 AA779042 AA298564 

105316 181763.1 AI671245 D81062 AI08587S AI193252 AI870535 N23132 H97661 H18098AA552173M233924 M503379 H19618 D60732 F04762 H11343 

AW590283

127664 394361.1 AA806164 AA678564 AW978234 AW023951 AA651859 A1699785 AA913686 AW974559 AI631287 

1061 1 1 4671 1 AW875398 AI064861 NM.0141 38 AF090943 BE071887 B E547232 BE622599 AI345700 AA28431 5 AA1 47700 AW376340 BE220170 BE348475 

AA025037 AA057284 AA025038 T16193 AW130196 AW594516 A1989991 AI139091 AW675747 M830435AI924977 AI670778 AI051887 
AA912574 AW130213 AI828335 AI570587 AI332760 AI3131 15 AA804653 AA477920 AA147645 AI916912 AW763339 M715098 AA292723 
50 AA732421 T33122 AI631560 AI698686 A1185386 R43829 AA421638 M961400 C00503 AA604025 M554026 M553885 BE172244 R24040 

128478 370722.2 AA708205 H15963 R20851 L13829 AI934608 AI950163 H92893 AI267243 N66614 H28983 R42031 H06198 AI468628 H92779 L13828 T34896 

105782 246361 1 H09748 R61554AA350215 R40551 

120830 244476 1 AI568170 M731212AW294848 AW466878 M568908 AI281804 AI923633 M347885 AI718854 M714940 

128797 20410 T NM_002975 AF020044 AB009244 BE620464 AAS41683 AW000991 AW089850 AW664612 AI636676 AI627854 AI885823 AI864548 AI554122

AI499682 AI570905 AI376008 AI871364 AI074230 AI383719 M905016 AA977446 AA844732 T78922 AW244007 AI924747 AI968230 AI824575

AI699224 AI569490 AW379856 AW292314 AI016874 AW292037 
106533 305354 1 AL134708 AW206610AW665919 AW594227AI917093 AI4983S1 AA635141 AA453786 D54178D53224 AI804341 D53111 M78885AL1 34707 

AA453903 AI278054 AI334339 Al 380496 AW235939 D60373 D60993 D81246 
121619 284072.1 AA528339 AI017218 M416911 

123049 30801 1 BE047680 AF177395 NM_014421 AB033208 AB035181 W55979 AI093106 AI863392 AA715452 AA300480 AW022313 AA648103 AA720544

□61856AA481794AA737112AW316551 AA714937 AW974378 AI370632 AI184289 A1333962 AA714945 
129703 64196 1 BE388665 M7406U AI394542 AA484451 AW070675 AA877S54 AI360981 AI475799 BE349560 AL037351 N29437 AI246453 AW083444 

M844441 AI150871 AI440477 AI889061 M682422 M680053 M740212 AI858728 M490589 AW243692 AI249450 AI304448 AA534421
AI500317 AI273337 AW027729 AI979132 C06120 A1860916 AW591923 AI803174 AI559264 BE250500 AW675522 AA883284 T71033 AI346838 
AI066754 AI197913 AA931979 M427599 AI304730 AI285159 AI273325 AI038450 AI033760 N74972 AA136517 AI304446 AI262652 AA401348

AW514617 AI3S1795 AW514389 N30307 N93439 AA190775 BE349902 AI342114 AI186831 AA039628 H74242 AI914825 AI955761 AI343851 
AA693856 T71096AI498539AA778995AA084278AI274678AI022673AW339621 AA704929AW089735 M041252M161337A1185333 
AW189397 AI160990 AI082392 AI935848 AI144404 A1141633 AA447987 AA464241 AI342103 AW026118 M334025 AW664459 AA603777 
AA630752AA115450 M49944AW166071 AV\769795AA991877AA873609AW083441 H99130AA910538AA486199AW083358AA151767 
AI161193 N54945 AA180193 AI423477 AI188387 AI608747 N58900 AI03B449 M136852 AA565913 AA130502 H61741 AA701125 AI127678

AA278541 R76523 AA666187 H85083 F24755 R77729 M135137 BE250395 R76239 AW794625 AW794404 H61957AA308197 AW996031 
AA345524 N52726 M354360 AA1797B5 N57375 R16447 AA160645 W21481 
100299 2801 1 D49493 NM_004962 D49492 AA948479 AA993B43 AA017558 M992567 AA912697 AI185363 AA018254 N93253 AA297544 T19187 R53287

AA069337 AA297489 N62906 AW023929 AA018253 AA017615 W25110 R52085 R52633 W16549 H88916 H89134 AA069338
129977 2039.1 NM_000399 AA018140 AF139463 J04076 AW390694 AW361756 AL079830 AI190587 AA731059 AA742743 AI635915 M878209 AI262612

AI264201 AA446300 AA715016 AA446027 AI274101 AA593330 AA593338 AW015388 AI681724 AI347984 AL045275 AA970069 AA018188 
AW014396 AW769445 AI588869 AW338296 AI801548 AI367702 AI888410 
123308 322009.1 C14187 AA496211 D80456 AA484051 D80455 AA528446 

101104 26442.2 AW862258 L07615 

130637 7040 1 AA3567B4 NM_004867 AF038953 W16828 R50284 AI929589 R01594 AA010474 W37917 N64212 N78175 R57511 AA329322 W77834

AA169869 AA285240 M356670 T931 45 AA325779 AA357064 AA328151 AA306939 AA306801 N31260 AA306202 AA312530 BE242703 
AA354371 N42259 AL021 786 H50965 H38038 A1655955 AI536732 AI953952 AW972835 AI077300 AI948482 A17609B6 AA648335 W72197 
N32815 AA768073 AW338181 AW001742 A1819896 AI015666 N58011 AA677584 AH 83750 AW105514 W37875 AW301679 AA775257 
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M730357 AI128690 AA630H9 AW24151 5 AI147003 AI183499 N90923 A1968582 AA354145 A1147297 BE550502 AI085777 AI709087 
AA773947 AA4691 S2 Al 199239 AI1 27085 H89232 N3071 S AI0281 55 AW029469 AI70931 4 AA780834 AA778057 H891 25 AI654762 T93061 

„j 6 ?5AA719 AI24B24 23392! 2512 AW438666 M01 0373 AA4B4010 AW438790 N22875 M48371 8 Al 185407 
AA736720 AA664587 AA648672 AA503406 AA508123 AA503140 H46633 N75812 AA730938 D60546 AA483039 AA507705 AA480448 
A1768654 AA658955 AA730803 M33031 2 AW021127 BE172031 AW961052 AA339070 AA648172 BE172892 AA174175 AA492012 AA491590 
AW1 1 8733 AA4841 30 AA730783 AA73051 0 D62577 D62828 AI469294 AI801343 N53447 AA740226 A1831983 AA516146 D79799 AI758212 
AA484040H38039AI038772AI039410AA528387M730576AA527231 AA748111 D62946 U38439AA721708H22130AA559229H51620 
AA283355AA730970R01595N724[f W4691 V 7 ("9M480477AA283306AA347822AA328013 

AW243508 A1278454 AI369343 AW291 235 AI343946 AI347189 AA970152 AI279668 AI971894 AA664359 AI191039 AA084598 AA081641 
AA316181 BE439545 AC005053 AF186249 AW386101 AC004969 AA730199 AA032221 AI686139 AI167942 M809228 AI184070 AI394674 
AW969977 M032279 AW079284 AA513174 M888312 AI453179 AA483363 AA528432 AA57951 1 

AU077228 U61145 AU044256 AA206185 AL044975 NM_004456 X95653 AW890905 M332806 AW965453 BE299226 U52965 BE275009
AA428252AA769212AA312778AA810901M461315A1969286AI953668AW296248AI291422AA704747Z4.4716BE541235AW608806 
AA588583 AW403788 AW408386 AA053521 T87571 AA581163 BE183241 T97906 R39149 AI808196 BE618521 AI970278 AI652680 AW130143 
AI970272 AW197254 BE550449 A1528403 AA010143 AW005107 AA251815 M805836 N53459 AA055309 AA643678 AI345616 AI345618 
AI635728 AI371286 A1018437 AI262514 H75287 All 133155 AI203388 AA205802 AI475265 AW594186 AI696059 A1333312 AW264710 AA876764 
AA864892 AA173156 AI292066 T79B37 W93995 AA503427 AA815135 AI376342 AA010144 AI310421 AI223350 AA970576 N33925 T89364 
AI090913 N55098 AA101673 AA626655 M251640 N68857 AI890053 H05930 AI474199 AI436166 AA628810 N64452 AI205538 AA055310 
BE244495AI365179T99536AI203088 Z40549AW139858 M599627 R91847AA483681 AW068982H51746AA091201 H13030 AA459732 
T99430 AI051764 R45991 

D54745 AU077189 NM_000729 R71188 H87563 R80701 AW956319 AA376289 R82442 H46452 R22481 R64323 AW043690 L00354 AW005571
H1 981 3 AI745S97 AI769202 A1205982 AW205841 R64324 AI201975 H46993C18067 R71189 N29729 A1799838 AI091209 H87455 R82443 
AW970995 AW072136 N40841 H1 31 48 AI21 9052 N30191 T28007 AI817428 M503602 

AF050147 NM_007015 AB006000 AE005999 BE218114 AI703147 AI985891 BE222034 BE549774 BE327618 AI651459 BE549987 AI657194
BE222414 BE046984 AW470357 AW339338 C02526 AW590220 AI298082 AW296998 AI333135 AI367816 AA730514 BE348513 AA405500 



287965 1 AW969726 AA424028 AI266636 AI271571 M423988 AI266634 M702780 AA452037 AI424866 AA742931 M483361 

46993J AA737106 N35765 N35779 AF086045 N30152 AW374028 AI478237 M923562 AI332236 AA846668 AI627389 AI371709 H98083 H95983 

H26576 H95984 AA745021 AA639180 H26575 
7355 1 AF061573 NM_002590 AA780138 AW135620 AI197884 AA483348 R05468 AW969724 H29216
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c Acid Accession 



I I I I I I

CACACGGACT ACAGGGGAGT TTTGTTGAAG TTGCAAAGTC CTGGAGCCTC C" 
, . TCGGCGCAGT AGCAGCGAGC AGCAGAGTCC GCACGCTCCG GCGAGGGGCA G 

AGGGAGCGCG GGGCAGCAGA AGCGAGAGCC GAGCGCGGAC CCAGCCAGGA CCCACAGCCC 180

TCCCCAGCTG CCCAGGAAGA GCCCCAGCCA TGGAACACCA GCTCCTGTGC TGCGAAGTGG 240 

AAACCATCCG CCGCGCGTAC CCCGATGCCA ACCTCCTCAA CGACCGGGTG CTGCGGGCCA 300 

TGCTGAAGGC GGAGGAGACC TGCGCGCCCT CGGTGTCCTA CTTCAAATGT GTGCAGAAGG 360 

AGGTCCTGCC GTCCATGCGG AAGATCGTCG CCACCTGGAT GCTGGAGGTC TGCGAGGAAC 420

AGAAGTGCGA GGAGGAGGTC TTCCCGCTGG CCATGAACTA CCTGGACCGC TTCCTGTCGC 480

TGGAGCCCGT GAAAAAGAGC CGCCTGCAGC TGCTGGGGGC CACTTGCATG TTCGTGGCCT 540

CTAAGATGAA GGAGACCATC CCCCTGACGG CCGAGAAGCT GTGCATCTAC ACCGACAACT 600 

CCATCCGGCC CGAGGAGCTG CTGCAAATGG AGCTGCTCCT GGTGAACAAG CTCAAGTGGA 660 

ACCTGGCCGC AATGACCCCG CACGATTTCA TTGAACACTT CCTCTCCAAA ATGCCAGAGG 720 

CGGAGGAGAA CAAACAGATC ATCCGCAAAC ACGCGCAGAC CTTCGTTGCC CTCTGTGCCA 780

CAGATGTGAA GTTCATTTCC AATCCGCCCT CCATGGTGGC AGCGGGGAGC GTGGTGGCCG 840 

CAGTGCAAGG CCTGAACCTG AGGAGCCCCA ACAACTTCCT GTCCTACTAC CGCCTCACAC 900 

GCTTCCTCTC CAGAGTGATC AAGTGTGACC CAGACTGCCT CCGGGCCTGC CAGGAGCAGA 960 

TCGAAGCCCT GCTGGAGTCA AGCCTGCGCC AGGCCCAGCA GAACATGGAC CCCAAGGCCG 1020 

CCGAGGAGGA GGAAGAGGAG GAGGAGGAGG TGGACCTGGC TTGCACACCC ACCGACGTGC 1080

GGGACGTGGA CATCTGAGGG CGCCAGGCAG GCGGGCGCCA CCGCCACCCG CAGCGAGGGC 1140 

GGAGCCGGCC CCAGGTGCTC CCCTGACAGT CCCTCCTCTC CGGAGCATTT TGATACCAGA 1200 

AGGGAAAGCT TCATTCTCCT TGTTGTTGGT TGTTTTTTCC TTTGCTCTTT CCCCCTTCCA 1260 

TCTCTGACTT AAGCAAAAGA AAAAGATTAC CCAAAAACTG TCTTTAAAAG AGAGAGAGAG 1320 

AAAAAAAAAA TAGTATTTGC ATAACCCTGA GCGGTGGGGG AGGAGGGTTG TGCTACAGAT 1380

GATAGAGGAT TTTATACCCC AATAATCAAC TCGTTTTTAT ATTAATGTAC TTGTTTCTCT 1440 

GTTGTAAGAA TAGGCATTAA CACAAAGGAG GCGTCTCGGG AGAGGATTAG GTTCCATCCT 1500 

TTACGTGTTT AAAAAAAAGC ATAAAAACAT TTTAAAAACA TAGAAAAATT CAGCAAACCA 1560 

TTTTTAAAGT AGAAGAGGGT TTTAGGTAGA AAAACATATT CTTGTGCTTT TCCTGATAAA 1620 

GCACAGCTGT AGTGGGGTTC TAGGCATCTC TGTACTTTGC TTGCTCATAT GCATGTAGTC 1680

ACTTTATAAG TCATTGTATG TTATTATATT CCGTAGGTAG ATGTGTAACC TCTTCACCTT 1740 

ATTCATGGCT GAAGTCACCT CTTGGTTACA GTAGCGTAGC GTGGCCGTGT GCATGTCCTT 1800 

TGCGCCTGTG ACCACCACCC CAACAAACCA TCCAGTGACA AACCATCCAG TGGAGGTTTG 1860 

TCGGGCACCA GCCAGCGTAG CAGGGTCGGG AAAGCCCACC TGTCCCACTC CTACGATACG 1920 

CTACTATAAA GAGAAGACGA AATAGTGACA TAATATATTC TATTTTTATA CTCTTCCTAT 1980

TTTTGTAGTG ACCTGTTTAT GAGATGCTGG TTTTCTACCC AACGGCCCTG CAGCCAGCTC 2040 

ACGTCCAGGT TCAACCCACA GCTACTTGGT TTGTGTTCTT CTTCATATTC TAAAACCATT 2100 

CCATTTCCAA GCACTTTCAG TCCAATAGGT GTAGGAAATA GCGCTGTTTT TGTTGTGTGT 2160 

GCAGGGAGGG CAGTTTTCTA ATGGAATGGT TT IGAATAT T'TACTT GTTTGCAAGC 2220 

AGGACTTTGA GGCAAGTGTG GGCCACTGTG GTGGCAGTGG AGGTGGGGTG TTTGGGAGGC 2280

TGCGTGCCAG TCAAGAAGAA AAAGGTTTGC ATTCTCACAT TGCCAGGATG ATAAGTTCCT 2340 

TTCCTTTTCT TTAAAGAAGT TGAAGTTTAG GAATCCTTTG GTGCCAACTG GTGTTTGAAA 2400 

GTAGGGACCT CAGAGGTTTA CCTAGAGAAC AGGTGGTTTT TAAGGGTTAT CTTAGATGTT 2460

TCACACCGGA AGGTTTTTAA ACACTAAAAT ATATAATTTA TAGTTAAGGC TAAAAAGTAT 2520 

ATTTATTGCA GAGGATGTTC ATAAGGCCAG TATGATTTAT AAATGCAATC TCCCCTTGAT 2580

TTAAACACAC AGATACACAC ACACACACAC ACACACACAC AAACCTTCTG CCTTTGATGT 2640 

TACAGATTTA ATACAGTTTA TTTTTAAAGA TAGATCCTTT TATAGGTGAG AAAAAAACAA 2700 

TCTGGAAGAA AAAAACCACA CAAAGACATT GATTCAGCCT GTTTGGCGTT TCCCAGAGTC 2760

ATCTGATTGG ACAGGCATGG GTGCAAGGAA AATTAGGGTA CTCAACCTAA GTTCGGTTCC 2820 

GATGAATTCT TATCCCCTGC CCCTTCCTTT AAAAAACTTA GTGACAAAAT AGACAATTTG 2880

CACATCTTGG CTATGTAATT CTTGTAATTT TTATTTAGGA AGTGTTGAAG GGAGGTGGCA 2940 

AGAGTGTGGA GGCTGACGTG TGAGGGAGGA CAGGCGGGAG GAGGTGTGAG GAGGAGGCTC 3000 

CCGAGGGGAA GGGGCGGTGC CCAGACCGGG GACAGGCCGC AGCTCCATTT TCTTATTGCG 3060

CTGCTACCGT TGACTTCCAG GCACGGTTTG GAAATATTCA CATCGCTTCT GTGTATCTCT 3120 

TTCACATTGT TTGCTGCTAT TGGAGGATCA GTTTTTTGTT TTACAATGTC ATATACTGCC 3180

ATGTACTAGT TTTAGTTTTC TCTTAGAACA TTGTATTACA GATGCCTTTT TTGTAGTTTT 3240 

TTTTTTTTTT ATGTGATCAA TTTTGACTTA ATGTGATTAC TGCTCTATTC CAAAAAGGTT 3300 

GCTGTTTCAC AATACCTCAT GCTTCACTTA GCCATGGTGG ACCCAGCGGG CAGGTTCTGC 3360 

CTGCTTTGGC GGGCAGACAC GCGGGCGCGA TCCCACACAG GCTGGCGGGG GCCGGCCCCG 3420 

AGGCCGCGTG CGTGAGAACC GCGCCGGTGT CCCCAGAGAC CAGGCTGTGT CCCTCTTCTC 3480

TTCCCTGCGC CTGTGATGCT GGGCACTTCA TCTGATCGGG GGCGTAGCAT CATAGTAGTT 3540

TTTACAGCTG TGTTATTCTT TGCGTGTAGC TATGGAAGTT GCATAATTAT TATTATTATT 3600 

ATTATAACAA GTGTGTCTTA CGTGCCACCA CGGCGTTGTA CCTGTAGGAC TCTCATTCGG 3660 

GATGATTGGA ATAGCTTCTG GAATTTGTTC AAGTTTTGGG TATGTTTAAT CTGTTATGTA 3720 

CTAGTGTTCT GTTTGTTATT GTTTTGTTAA TTACACCATA ATGCTAATTT AAAGAGACTC 3780

CAAATCTCAA TGAAGCCAGC TCACAGTGCT GTGTGCCCCG GTCACCTAGC AAGCTGCCGA 3840

ACCAAAAGAA TTTGCACCCC GCTGCGGGCC CACGTGGTTG GGGCCCTGCC CTGGCAGGGT 3900 

CATCCTGTGC TCGGAGGCCA TCTCGGGCAC AGGCCCACCC CGCCCCACCC CTCCAGAACA 3960 

CGGCTCACGC TTACCTCAAC CATCCTGGCT GCGGCGTCTG TCTGAACCAC GCGGGGGCCT 4020 

TGAGGGACGC TTTGTCTGTC GTGATGGGGC AAGGGCACAA GTCCTGGATG TTGTGTGTAT 4080
CGAGAGGCCA AAGGCTGGTG GCAAGTGCAC GGGGCACAGC GGAGTCTGTC CTGTGACGCG 4140 
CAAGTCTGAG GGTCTGGGCG GCGGGCGGCT GGGTCTGTGC ATTTCTGGTT GCACCGCGGC 4200 
GCTTCCCAGC ACCAACATGT AACCGGCATG TTTCCAGCAG AAGACAAAAA GACAAACATG 4260 
AAAGTCTAGA AATAAAACTG GTAAAACCCC AAAAAAAAAA AAAAAA 
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HEHQLLCCEV ETIRRAYPDA NLLNDRVLRA MLKAEETCAP SVSYFKCVQK EVLPSMRKIV SO 

ATWMLEVCEE QKCEEEVFPL AMNYLDRFLS LEPVKKSRLQ LLGATCMFVA SKMKETIPLT 120

AEKLCIYTDN SIRPEELLQM ELLLVNKLKW NLAAMTPHDF IEHFLSKMPE AEENKQIIRK 180

HAQTFVALCA TDVKFISNPP SMVAAGSVVA AVQGLNLRSP NNFLSYYRLT RFLSRVIKCD 240
PDCLRACQEQ IEALLESSLR QAQQNMDPKA AEEEEEEEEE VDLACTPTDV RDVDI

Seq ID NO: B3 DNA sequence
Nucleic Acid Accession #: XM_0441S6 
Coding sequence: 1..1576

1 11 21 31 41 51 

I I I I I I 

CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 

TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120

CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 

CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGGGGCGCC CCCGCGCAGC CGACAGCCCC 240 

CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 

CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 

CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420

TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 

GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 

GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600

AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 

GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720

GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 

CGTGGAGCGT GCGCGCGAGG AGATGGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 

CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 

CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020

GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 

GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 

CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 

GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 

CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320

GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 

TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 

AGTGACTGCC GCCCTTCTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 

CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 

CCGAATATTC TCCTAAGCCC CGTGCCCCAT GCCTCCGGGG CCCACTCCAC TGGGCCCACC 1620

CTGGACCTGT TTTCCACTAA AGCCTTTTGG AAAGCGGTGA TTTGAGGGGC AAGGTGCTTA 1680 

GAGATACTCG CTCGCTGGGG AAGGGGGGAG GGAGGCAGTG GTGGCTGGAG GGTGCGCCAC 1740

TTTCAGAGCC TCTGGTCACC CTGTCCTGGA AAGATTGGGA GGGGGCCAGA CTGAAAATTT 1800

TACTAGAGTT ACAACTCTGA TACCTCAACA CACCCTTAAA TCTGGAAGCA GCTAAGAGAA 1860

ACTTTTGTTT TGCCAGAGGT GGCCACTAAG GCATTCTGAC GCCCTCTGCC CACCTCCCCC 1920

GCTGTGTGTC ACTCCACCCC TTCTTCCGAG GAGGGGGTGG GTAAAAGGGA GAGGGAGAAT 1980 

TACCACCTGT ATCTAGAGGT GCTCTTTGCA ATCCCTAAGC CCTCTGGTCC TGACCTCCGA 2040 

CCTCCCAGCT CTGTCTTGTT CCTTGTCTTT GTCTTTCTTC CCTTCCCCCT GCCCCTGCCC 2100 

CTACCAGCCC AGCTTTGGGG ACACCATCCT TCTGGGGAGA AGTAGGGGGA GGAATATTTG 2160

GATGGTCCCT CCATTCCTCT TCAGGCATCT GGAGGCCCTC TCCCCCACTC CTCCAAAGAA 2220

ACATCTCAAA TTATTGATGG AATGTATCCC CATTCTCAGT GAAAATGTGA GGAGGGGACT 2280 

AATACTGGGG TAAAGGGTCA AACCCCCACC TTCATCACTA TGGGCATTAT ATTTAGGGAG 2340 

TAGTTCTTGG GCTGGATTTT CTGGTTGTGG AAGTGGGGGC GCCAGAGTAG TGTGTCTGCT 2400 

ATTTAAAGGA GCAGGAAAGG GCGTGAGGCA GGAGGAGAGA CTGGTGGAGG GAAGAGCTGC 2460 

TCCTCCCATG CAGTGCCCGA CTCCCTGCAC CCCTCTCAAC CTGACCTGAA CCTTTATTGA 2520

ATCCTTATTA GCTTGAATCC TTATTAGCTT GAATCCTCCA TGCAAATCAT GGAGTCTGTG 2580

TCCCACCTGA TGTGGTTGAG GAGAAGCCAG GTCTTCAAAG AGGGGTCAGC CTGGGGCAAA 2640 

GCAGGACTGG GGGGAGGTGG GCAGCAGGGC CTATTCTGAG AATCACATAT TGTTACAGGC 2700 

CTTGCACCCC CTTTGCTGCT TCCCTCCTGC TCATTTGGGG CTGCCACCAG CTCTCCACCC 2760 

TCCTGGTTCC GCTGGCCGGG CCAAGAGAGG ATGGAGGGAT GGGAGTCCCA GGAGATCCTT 2820

GTAAATAGTG GGGTGGGACT GTTCTGAGTG ATCACCCGAG CACTTAAAGC TCCAGAGTCC 2880 

CATTCTTCCT GGATGGAGCA GGTGGAGGTG CAGAGGGGAT TTCCTCCTCT CCTTCCTCCT 2940 

GTCGAGAATT AACACCTCTC CACAGCCTTC CCCTCCAGAA CACCAGCCAG GGAGGGGTGG 3000 

GGAAGGAGGT CACAGCCAAG AAAACTGCCC TGTGACGACT TCCCTCCTTC CCGCCTATGT 3060 

GAGCCATCCT GAGATGTCTG TACAATAGAA ACCAAACCAA ATGGGCACCC TCGGTTGCCG 3120

GGGGGCAGGT GGGGAGGGGG GTGGGAAGAA GGGATGTCTG TCTGTCGTCC CCCTCCCCCT 3180 

CTCCACTCTT TACCCACAAA GGCAGAAGAC TGTTACACTA GGGGGCTCAG CAAATTCAAT 3240 

CCCACCCTTA CCAATTGAGC CAAACCTAGA AACAAACACA AAACACGAAT AGTGAGAGAC 3300 

AAAATAGAGG AGAGAAAGAG AGCATGAGAG GGAGCGAGAC AGGCGACCAA CACAGAGGAG 3360 

AGAAAACAAA AATAGCAAAA AAAAAAAAAA AAAGCAGTTC TTTATAATTT AATATTCTAT 3420
TTTAATAAAG GCGTTTATTA CCATATAAAT GTAGCAAAGA ACCTGGGCTA ATATGAA 



FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 
FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRVVG
LVVGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREEMET HIAVRTGKIL
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 









Nucleic Acid Accession #:



1 11 21 31 41 51 

I I I I I I 

CATTCCCACC CTTCCTTCTT TAATAAGCAG GAGCGAAAAA GACAAATTCC AAAGAGGATT 
GTTCAGTTCA AGGGAATGAA GAATTCAGAA TAATTTTGGT AAATGGATTC CAATATCGGG 
AATAAGAATA AGCTGAACAG TTGACCTGCT TTGAAGAAAC ATACTGTCCA TTTGTCTAAA 
ATAATCTATA ACAACCAAAC CAATCAAAAT GAATTCAACA TTATTTTCCC AGGTTGAAAA 
TCATTCAGTC CACTCTAATT TCTCAGAGAA GAATGCCCAG CTTCTGGCTT TTGAAAATGA 
TGATTGTCAT CTGCCCTTGG CCATGATATT TACCTTAGCT CTTGCTTATG GAGCTGTGAT 
CATTCTTGGT GTCTCTGGAA ACCTGGCCTT GATCATAATC ATCTTGAAAC AAAAGGAGAT 
GAGAAATGTT ACCAACATCC TGATTGTGAA CCTTTCCTTC TCAGACTTGC TTGTTGCCAT 
CATGTGTCTC CCCTTTACAT TTGTCTACAC ATTAATGGAC CACTGGGTCT TTGGTGAGGC 
GATGTGTAAG TTGAATCCTT TTGTGCAATG TGTTTCAATC ACTGTGTCCA TTTTCTCTCT 
GGTTCTCATT GCTGTGGAAC GACATCAGCT GATAATCAAC CCTCGAGGGT GGAGACCAAA 
TAATAGACAT GCTTATGTAG GTATTGCTGT GATTTGGGTC CTTGCTGTGG CTTCTTCTTT 
GCCTTTCCTG ATCTACCAAG TAATGACTGA TGAGCCGTTC CAAAATGTAA CACTTGATGC 
GTACAAAGAC AAATACGTGT GCTTTGATCA ATTTCCATCG GACTCTCATA GGTTGTCTTA 
TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT TGTTTTATAT T 
CTTCAAGATA TATATACGCC TAAAAAGGAG AAACAACATG ATGGACJ 
TAAGTACAGG TCCAGTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTGTGGTAGC 
ATTTGCAGTC TGCTGGCTCC CTCTTACGAT CTTTAACACT GTGTTTGATT G 
GATCATTGCT ACCTGCAACC ACAATCTGTT ATTCCTGCTC TGCCACC 
ATCCACTTGT GTCAACCCCA TATTTTATGG GTTCCTGAAC AAAAAC] 
GCAGTTCTTC TTCAACTTTT GTGATTTCCG GTCTCGGGAT GATGATTATG AAACAATAGC 

t taaaaacaag cacaacctgc aacatacttt gattacctgt 
Pl agattttctt g 



r TTATGCATAT 1620

T TCTTTAAAGT ATTACGATGC 1680

GCTGACTTCA GAAGTACCTG CCATCCAATA CGGTCATTAG ATTGGGTCAT CTTGATTAGA 1740 

TTAGATTAGA TTAGATTGTC AACAGATTGG GCCATCCTTA CTTTATGATA GGCATCATTT 1800 

TAGTGTGTTA CAATAGTAAC AGTATGCAAA AGCAGCATTC AGGAGCCGAA AGATAGTCTT 1860

QAAGTCATTC AGAAGTGGTT TGAGGTTTCT GTTTTTTGGT GGTTTTTGTT TGTTTTTTTT 1920 

TTTTTTCACC TTAAGGGAGG CTTTCATTTC CTCCCGACTG ATTGTCACTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTGGGCACCC 2040

ACAGGAATGA AGAGAGAAAG CAGCTCCCCA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAGCATTTT AGAGTAATTA ATTTAATAAA GTAAATTAGT ATTGCTGCAA ATAGCTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAGAGAT TTTCCATTTT TTTTACAGAC TGTTCAGTGT 2220

TTGTCAAGCT TCTGGTCTAA TATGTACTCG AAAGACTTTC CGCTTACAAT TTGTAGAAAC 2280 

ACAAATATCG TTTTCCATAC AGCAGTGCCT ATATAGTGAC TGATTTTAAC TTTCAATGTC 2340

CATCTTTCAA AGGAAGTAAC ACCAAGGTAC AATGTTAAAG GAATATTCAC TTTACCTAGC 2400 

AGGGAAAAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

3 TGGCGTCTTA TAAATAATGC ACTGTAAAGA TTACTGAATA GTTGTGTCAT 2520 

C CTAATTTCAT GTATCTTGTA ATCATGATTG AGCCTCAGAA TCATTTGGAG 2580 

AAACTATATT TTAAAGAACA AGACATACTT CAATGTATTA TACAGATAAA GTATTACATG 2640

TGTTTGATTT TAAAAGGGCG GACATTTTAT TAAAATCAAT ATTGTTTTTG CTTTTTCTGA 2700 
GGAGTCTCTT TCAGTTTCAT TTTTTCTCAT CCCATGACTT CCCTCCGATG GT 

Seq ID NO: B6 Protein sequence 



MNSTLFSQVE NHSVHSNFSE KNAQLLAFEN DDCHLPLAMI FTLALAYGAV IILGVSGNLA 
LIIIILKQKE MRNVTNILIV NLSFSDLLVA IMCLPFTFVY TLMDHWVFGE AMCKLNPFVQ
CVSITVSIFS LVLIAVERHQ LIINPRGWRP NNRHAYVGIA VIWVLAVASS LPFLIYQVMT
DEPFQNVTLD AYKDKYVCFD QFPSDSHRLS YTTLLLVLQY FGPLCFIFIC YFKIYIRLKR
V AFAVCWLPLT IFNTVFDWNH QIIATCNHNL 



Seq ID NO: B7 DNA sequence



GGATCGCACT TGAGGCTGGA GGCATGAGTC CTGTGAGGCG TTGGGGCAGC CCCTGCCTTT 
TCCCCTTGCA GCTCTTCAGC CTCTGCTGGG TGCTCTCAGT GGCCCAGAGC AAAACAGTCC 
GATACAGCAC CTTCGAGGAG GATGCCCCCG GCACGGTCAT CGGGACCCTG GCCGAGGACC 
TGCATATGAA AGTATCGGGT GACACAAGCT TCCGCCTGAT GAAGCAATTC AACAGCTCTC 
TGCTCCGGGT GCGCGAAGGC GACGGGCAGC TGACCGTCGG GGACGCCGGC CTGGACCGCG 
AGCGGCTGTG TGGCCAGGCC CCGCAGTGCG TGCTGGCCTT CGATGTGGTC AGCTTCTCGC 
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AGGAGCAGTT CCGGCTGGTG CACGTGGAGG TAGAGGTGAG GGACGTCAAC GACCACGCGC 500 
CGCGCTTCCC CAGGGCCCAG ATCCCGGTAG AGGTGTCCGA GGGTGCGGCA GTGGGCACGC 660
GCATCCCCTT GGAGGTGCCG GTGGACGAGG ACGTGGGCGC CAACGGGCTG CAGACCGTGC 720 
GCCTGGCCGA GCCGCACAGC CCCTTTCGCG TGGAGCTGCA GACGCGAGCG GACGGCGCTC 780 
AGTGCGCAGA CCTGGTGCTG CTGCAGGAGC TGGACCGCGA GAGCCAGGCC GCCTACAGCC 840
TGGAGCTGGT GGCCCAGGAC GGCGGCCGCC CGCCGCGCTC CGCCACGGCT GCCCTCAGCG 900
TGCGCGTCCT GGATGCGAAT GACCACAGCC CGGCCTTCCC GCAGGGCGCC GTGGCCGAAG 960
TGGAGCTGGC GGAAGACGCG CCCGTGGGCT CCCTGCTTCT CGACCTGGAC GCAGCCGACC 1020
G ACCTAACGGC GACGTGGTGT TCGCATTTGG CGCCCGCACC CCGCCGGAGG 1080 
T CTTTCGGCTT GACCCGCGAT CAGGCCGCCT CACCCTGGCC GGGCCCGTGG 1140

G TCAGGACACC TACGAGCTGG ACGTGCGGGC GCAGGACCGC GGACCCGGGC 1200 

CCCGCGCTGC CACCTGCAAG GTCATCGTGC GCATCCGAGA CGTCAATGAC AACGCACCCG 1260 
ACATCGCCAT CACCCCGCTG GCCGCCCCAG GCGCGCCGGC AACCTCACCC TTCGCCGCTG 1320 
CCGCCGCCGC CGCTGCACTC GGGGGAGCGG ACGCTAGCTC GCCGGCGGGA GCCGGGACGC 1380 
CGGAGGCTGG TGCCACTTCG CTGGTGCCGG AGGGGGCGGC GCGCGAGAGC CTGGTGGCCC 1440 
TGGTCAGCAC CTCGGACAGG GACTCGGGCG CCAACGGGCA AGTGCGCTGC GCCCTCTATG 1500
GGCACGAGCA CTTCCGGCTG CAGCCGGCCT ACGCGGGCAG CTACCTGGTG GTGACCGCGG 1560 
CGTCGCTGGA CCGCGAACGC ATCGCCGAGT ACAACTTGAC GCTGGTGGCC GAGGATCGCG 1620 
GCGCGCCCCC GCTGCGCACA GTGCGGCCCT ACACGGTGCG TGTGGGCGAC GAGAACGACA 1680 
ACGCGCCGCT CTTCACGCGG CCGGTCTATG AGGTGTCGGT GCGCGAGAAC AACCCGCCAG 1740 
GCGCCTACCT GGCCACGGTG GCCGCCCGCG ACCGGGACCT GGGCCGCAAC GGCCAGGTCA 1800 
CCTACCGGCT GCTGGAGGCC GAGGTGGGCC GCGCCGGGGG CGCCGTGTCC ACTTATGTCT 1860
CGGTGGACCC AGCTACCGGA GCCATCTACG CGCTGCGCAG CTTCGACTAT GAGACGCTGC 1920 
GCCAACTCGA CGTTCGCATC CAAGCTAGCG ACGGCGGCTC CCCTCAGCTT TCCAGCAGCG 1980 
CCCTAGTGCA AGTGCGCGTG CTGGACCAGA ACGACCATGC GCCAGTCCTG GTGCACCCGG 2040 
CGCCAGCCAA TGGCTCCCTA GAAGTGGCGG TGCCTGGGCG CACCGCAAAG GACACGGTTG 2100 
TGGCCCGTGT GCAGGCCCGG GATGCAGACG AGGGAGCCAA CGGGGAGCTG GCGTTCGAGC 2160 
TGCAGCAGCA GGAGCCGCGC GAAGCCTTCG CCATCGGCCG CCGCACGGGG GAGATACTGC 2220 
TCACCGGCGA CCTCTCGCAG GAGCCACCCG GTCGCGTGTT CAGGGCGCTC CTGGTCATAT 2280 
CCGACGGCGG CCGTCCCCCG CTCACCACCA CCGCAACTGT CAGCTTCGTG GTAACAGCAG 2340
GGGGCGGGCG TGGGCCGGCT GCGCCTGCCA GTGCAGGAAG CCCGGAGCGT TCCCGCCCGC 2400 
CTGGCTCTCG GCTCGGGGTG TCCGGGTCGG TGCTGCAATG GGACACGCCG CTGATCGTCA 2460 
TCATCGTGCT GGCCGGGAGC TGCACGCTGC TGCTGGCCGC CATCATCGCC ATCGCCACCA 2520 
CCTGCAACCG CCGCAAGAAG GAGGTGCGCA AAGGGGGGGC CCTCCGGGAA GAGCGGCCCG 2580 
GGGCGGCGGG CGGCGGAGCC TCGGCTCCCG GCTCCCCGGA GGAGGCCGCC CGGGGAGCCG 2640 
GGCCCAGGCC CAACATGTTC GACGTGCTCA CCTTCCCTGG CACCGGCAAA GCGCCCTTTG 2700 
GCAGCCCCGC GGCGGACGCG CCTCCGCCTG CGGTCGCCGC GGCCCAAGTC CCGGGCTCAG 2760 
AGGGCGGCAG CGCCACTGGG GAAAGCGCCT GTCACTTCGA GGGGCAGCAG CGGCTCCGCG 2820
GCGCGCACGC CGAGCCCTAC GGTGCCTCCC CGGGTTTTGG AAAGGAGCCG GCGCCCCCTG 2880 
TGGCGGTGTG GAAAGGACAC TCCTTCAACA CCATTTCTGG CAGAGAAGCA GAGAAGTTCA 2940 
GCGGCAAAGA CAGCGGTAAA GGGGACAGTG ATTTCAACGA CAGCGATTCC GACATCAGCG 3000 
CCGACCCTCT GAAAAAGGAT CTCATCAACC ACATGCAGPl I iACI rGG 3CGT CACCG 3060
CTGAGTGTAA GATCCTGGGC CACTCTGACC GCTGCTGGAG CCCATCCTGC AGCGGGCCCA 3120 
ACGCACATCC ATCGCCTCAC CCACCAGCCC AGATGTCAAC CTTCTGTAAG AGCACGTCAC 3180 
TGCCTCGGGA TCCTCTGCGC AGGGACAATT ACTACCAGGC CCAGCTGCCC AAGACAGTGG 3240 
GGCTGCAGAG CGTCTATGAG AAAGTACTGC ACAGAGACTA TGACAGGACA GTCACTCTGC 3300
TCTCCCCTCC CCGTCCAGGG AGGCTCCCAG ACCTGCAGGA GATTGGAGTA CCCCTCTACC 3360
AGTCCCCTCC TGGCAGGTAC CTGTCCCCGA AGAAGGGAGC CAATGAAAAT GTGTAATCCC 3420 
ATGCTGCATG TCTTCACACA TATACAGGTC ACTCCGAGAA GCCCCTAAGT TATTGACCGG 3480
TTTCAGTGTT GTATATATAA ATATGCAAGA TGTGCCTTAC AATGAAGTTG TTGGAAGCTA 3540 
TTTCCAATCA CATTGGTACT GTTTGGTATT TGCAAACAAA AATGTAGTTA ATGTAATTTT 3600 
TATGAAATGT GTGCAGTATT TAATTTTTCT TATGCTATTG ACTTTCATTT CAATTGCGGC 3660 
TTGCCATTTT CTTAGTGTTT TTAACCTGTA CATTGTGTAA TGTAATGTTT GTATATAATG 3720 
AAATTTTGTT ATATTTTTAT ATAATAAAAG CTAAAGTGGA AGTTATTGCC AAAGGAACTG 3780 
TCTGTAAGAC AAAAAACAAA ACATGTTGGA ATTACTTAAT TGAAATTTAT CTTTCACCTG 3840 
AAACAACCTA GTGTTTGAGA AATTTTGCCT TGCCAAGTAT AACTTGTATA TCTTGACTCT 3900
GTGGTAGATT TCAAGTTCAA TGTTATTTAA TTACATTTGG TTTTCCGTAA ACCGTGTCAC 3960 
TTATAAGCAC AGTAATAAAA GATTTGTCAT GTGTTTGAAG AAAAAAAAAA AAAAAAAAAA 4020 
A AAAAAAAAAA AAAAAAAAAA AT 
MSPVRRWGSP CLPELQLFSL CWVLSVAQSK TVRYSTFEED APGTVIGTLA EDLHMKVSGD
TSFRLMKQFN SSLLRVREGD GQLTVGDAGL DRERLCGQAP QCVLAFDWS FSQSQFRLVH 
VEVEVRDWD HAPRFPRAQI PVEVSEGAAV GTRIPLEVPV DEDVGANGLQ T" 
FRVELQTRAD GAQCADLVLL QELDRESQAA YSLELVAQDG GRPPRSATAA L 
HSPAFPQGAV AEVEIjAEDAP VGSLLLDLDA ADPDEGPNGD WFAFGARTP E 
PRSGRLTLAG PVDYERQDTY ELDVRAQDRG PGPRAATCKV IVRIRDVHDN A 
APGAPATSPF AAAAAAAALG GADASSPAGA GTPEAGATSL VPEGAARESL VALVSTSDRD 
SGANGQVRCA LYGHEHFRLQ PAYAGSYLW TAASLDRERI AEYNLTLVAE DRGAPPLRTV 
RPYTVRVGDE NDNAPLFTRP VYEVSVRENN PPGAYLATVA ARDRDLGRNG QVTYRLLEAE 
VGRAGGAVST YVSVDPATGA IYALRSFDYE TLRQLDVRIQ ASDGGSPQLS SSALVQVRVL 
DQNDHAPVLV HPAPANGSLE VAVPGRTAKD TWARVQARD ADEGANGELA FELQQQEPRE 
AFAIGRRTGE ILLTGDLSQE PPGRVFRALL VISDGGRPPL TTTATVSFVV TAGGGRGPAA
PASAGSPERS RPPGSRLGVS GSVLQWDTPL IVIIVLAGSC TLLLAAIIAI ATTCNRRKKE
VRKGGALREE RPGAAGGGAS APGSPEEAAR GAGPRPKMFD VLTFEGTGKA PFGSPAADAP 
PPAVAAAEVP GSEGGSATGE SACHFEGQQR LRGAHAEPYG ASPGFGKEPA PPVAVWKGHS 
FNTISGREAE KFSGKDSGKG DSDFNDSDSD ISGDALKIC INHI , S< LWJ CTAECKILGH 
SDRCWSPSCS GPNAHPSPHP PAQMSTFCKS TSLPRDPLRR DNYYQAQLPK TVGLQSVYEK 
VLHRDYDRTV TLLSPPRPGR LPDLQEIGVP LYQSPPGRYL SPKKGANENV 
Seq ID NO: B9

Nucleic Acid Accession #: AL121939.12 
Coding sequence: 185..1426

1 11 21 31 41 51 

| | | | | |

AGCGGAGTST CAGTGGCGAG CTCCGGGTGC TGTGGCCCGG CCTTGGCGC-G GCGGCCTCCG 60 

GCTCAGGCTG GCTGAGAGGC TCCCAGCTGC AGCGTCCCCG CCCGCCTCCT CGGGAGCTCT 120 

GATCTCAGCT GACAGTGCCC TCGGGGACCA AACAAGCCTG GCAGGACAAA ATTAGAAGAT 180 

CAAAATGGAA AATATGCTGC TTTGGTTGAT ATTTTTCACC CCTGGGTGGA CCCTCATTGA 240 

TGGATCTGAA ATGGAATGGG ATTTTATGTG GCACTTGAGA AAGGTACCCC GGATTGTCAG 300

TGAAAGGACT TTCCATCTCA CCAGCCCCGC ATTTGAGGCA GATGCTAAGA TGATGGTAAA 360 

TACAGTGTGT GGCATCGAAT GCCAGAAAGA ACTCGCAACT CCCAGCCTTT CTGAATTGGA 420 

GGATTATCTT TCCTATGAGA CTGTCTTTGA GAATGGCACC CGAACCTTAA CCAGGGTGAA 480 

AGTTCAAGAT TTGGTTCTTG AGCCGACTCA AAATATCACC ACAAAGGGAG TATCTGTTAG 540

GAGAAAGAGA CAGGTGTATG GCACCGACAG CAGGTTCAGC ATCTTGGACA AAAGGTTCTT 600 

AACCAATTTC CCTTTCAGCA CAGCTGTGAA GCTTTCCACG GGCTGTAGTG GCATTCTCAT 660 

TTCCCCTCAG CATGTTCTAA CTGCTGCCCA CTGTGTTCAT GATGGAAAGG ACTATGTCAA 720

AGGGAGTAAA AAGCTAAGGG TAGGGTTGTT GAAGATGAGG AATAAAAGTG GAGGCAAGAA 780 

ACGTCGAGGT TCTAAGAGGA GCAGGAGAGA AGCTAGTGGT GGTGACCAAA GAGAGGGTAC 840 

CAGAGAGCAT CTGCAGGAGA GAGCGAAGGG TGGGAGAAGA AGAAAAAAAT CTGGCCGGGG 900 

TCAGAGGATT GCCGAAGGGA GGCCTTCCTT TCAGTGGACC CGGGTCAAGA ATACCCACAT 960 

TCCGAAGGGC TGGGCACGAG GAGGCATGGG GGACGCTACC TTGGACTATG ACTATGCTCT 1020 

TCTGGAGCTG AAGCGTGCTC ACAAAAAGAA ATACATGGAA CTTGGAATCA GCCCAACGAT 1080 

CAAGAAAATG CCTGGTGGAA TGATCCACTT CTCAGGATTT GATAACGATA GGGCTGATCA 1140

GTTGGTCTAT CGGTTTTGCA GTGTGTCYGA CGAATCCAAT GATCTCCTTT ACCAATAYTG 1200

CGATGCTGAG TCGGGCTCCA CCGGTTCGGG GGTCTATCTG CGTCTGAAAG ATCCAGACAA 1260

AAAGAATTGG AAGCGCAAAA TCATTGCGGT CTACTCAGGG CACCAGTGGG TGGATGTCCA 1320 

CGGGGTTCAG AAGGACTACA ACGTTGCTGT TCGCATCACT CCCCTAAAAT ACGCCCAGAT 1380 

TTGCCTCTGG ATTCACGGGA ACGATGCCAA TTGTGCTTAC GGCTAACAGA GACCTGAAAC 1440 

AGGGCGGTGT ATCATCTAAA TCACAGAGAA AACCAGCTCT GCTTACCGTA GTGAGATCAC 1500

TTCATAGGTT ATGCCTGGAC TTGAACTCTG TCAATAGCAT TTCWACATTT TTCAAAATCA 1560

GGAGATTTTC GTCCATTTAA AAAATGTATA GGTGCAGATA TTGAAACTAG GTGGGCACTT 1620 

CAATGCCAAG TATATACTCT TCTTTACATG GTGATGAGTT TCATTTGTAG AAAAATTTTG 1680 

TTGCCTTCTT AAAAATTAGA CACACTTTAA ACCTTCAAAC AGGTATTATA AATAACATGT 1740 

GACTCCTTAA TGGACTTATT CTCAGGGTCC TACTCTAAGA AGAATCTAAT AGGATGCTGG 1800 

TTGTGTATTA AATGTGAAAT YGCATAGATA AAGGTAGATG CTAAACCAAT TAGTATCAGA 1860 

ATAGAGACAG AAAGTTACAA CACAGTTTGT ACTACTCTGA GATGGAYCCA TTCAGCTCAT 1920 

GCCCTCAATG TTTATATTGT GTTATCTGTT GGGTCTGGGA CATTTAGTTT AGTTTTTTTG 1980 

AAGAATTACA AATCAGAAGA AAAAGCAAGC ATTATAAACA AAACTAATAA CTGTTTTACT 2040 

GCTTTAAGAA ATAACAATTA CAATGTGTAT TATTTAAAAA TGGGAGAAAT AGTTTGTTCT 2100 

ATGAAATAAA CCTAGTTTAG AAATAGGGAA GCTGAGACAT TTTAAGATCT CAAGTTTTTA 2160 

TTTAACTAAT ACTCAAAATA TGGACTTTTC ATGTATGCAT AGGGAAGACA CTTCACAAAT 2220

TATGAATGAT CATGTGTTGA AAGCCACATT ATTTTATGCT ATACATTCTA TGTATGAGGT 2280 

GCTACATTTT TAGGACAAAG AATTCTGTAA TCTTTTTCAA GAAAGAGTCT TTTTCTCCTT 2340 

GACAAAATCC AGCTTTTGTA TGAGGACTAT AGGGTGAATT CTCTGATTAG TAATTTTAGA 2400 

TATGTCCTTT CCTAAAAATG AATAAAATTT ATGAATATGA CTTAAAAAAA AAARWCGACG 2460 

CGGCCGCGAA TTTAGTAGTA GTAGTCGACC CGGGAATTCC GGACCGGTAC CTGCAGGCGT 2520
ACCAGC 



Protein Accession #: CAC35071.1

1 11 21 31 41 51 

| | | | | |

MENMLLWLIF FTPGWTLIDG SEMEWDFMWH LRKVPRIVSE RTFHLTSPAF EADAKMMVNT 
VCGIECQKEL PTPSLSELED YLSYETVFEN GTRTLTRVKV QDLVLEPTQN ITTKGVSVRR 
KRQVYGTDSR FSILDKRFLT NFPFSTAVKL STGCSGILIS PQHVLTAAHC VHDGKDYVKG 
SKKLRVGLLK MRNKSGGKKR RGSKRSRREA SGGDQREGTR EHLRERAKGG RRRKKSGRGQ 
RIAEGRPSFQ WTRVKNTHIP KGHARGGMGD ATIDYDYALL ELKRAHKKKY MELGISPTIK 
KMPGGMIHFS GFDNDRADQL VYRFCSVSDE SNDLLYQYCD AESGSTGSGV YLRLKDPDKK 
NWKRKIIAVY SGHQWVDVHG VQKDYNVAVR ITPLKYAQIC LWIHGNDANC AYG 




CCGGCGGCCC O 

CTGCCGCCTT CCTCGTGGCC TTCGTGCTGC T 
CCAAGCCCCT CGCCCTGCCC GGGGCGCATG TGGTGGTTAC A 

T TGCTATCGAG TGCTATAAAC AAGGAGCTTT TATAACTCTG GTTGCACGAA 
A GCTGCTGCAG GCAAAGAAAG AAATTGAAAT GCACTCTATT AATGACAAAC 
AGGTGGTGCT TTGCATATCA GTTGATGTAT CTCAAGACTA TAACCAAGTA GAGAATGTCA 
\ CTGGGTCCAG TGGACATGCT GGTAAATTGT GCAGGAATGG 
4 GATCTTGAAG TTAGTACCTT TGAAAGGTTA ATGAGCATCA 
ATTACCTGGG CAGCGTGTAC CCCAGCCGGG CCGTGATCAC CACCATGAAG GAGCGCCGGG 

CCTACTCTGC ATCCAAGTTT GCCATAAGGG GATTGGCAGA AGCTTTGCAG ATGGAGGTGA 

AGCCATATAA TGTCTACATC ACAGTTGCTT ACCCACCAGA CACAGACACA CCTGGCTTTG 

CCGAAGAAAA CAGAACAAAG CCTTTGGAGA CTCGACTTAT TTCAGAGACC ACATCTGTGT 

GCAAACCAGA ACAGGTGGCC AAACAAATTG TTAAAGATGC C 
GTTCCCTTGG CTCAGATGGG TACATGCTCT CGGCCCTGAC CTGTGGGATG GCTCCAGTAA 960

CTTCTATTAC TGAGGGGCTC CAGCAGGTGG TCACCATGGG CCTTTTCCGC ACTATTGCTT 1020 

TGTTTTACCT TGGAAGTTTT GACAGCATAG TTCGTCGCTG CATGATGCAG AGAGAAAAAT 1080 

CTGAAAATGC AGACAAAACT GCCTAATCTT CTTACCCCTT GGAAGAAGAC TGTTTCCAAA 1140

TAATTTGAAC AGCTTGCTGC TAAATGGGAC CCAATTTTTG GCCTATAGAC ACTTATGTAT 1200 

TGTTTTCGAA TACGTCAGAT TGGACCAGTG CTCTTCAGGA ATGTGGCTGC AAGCAAGGGG 1260 

CTAGAAGTTC ACCTCCTGAC AGTATTATTA ATACTATGCA AATATGGAAT AGGAGACCAT 1320 

TTGATTTTCT AGGCTTTGTG GTAGAGAGGT GAAGGTATGA GAATTAATAG CGTGTGAACA 1380 

AAGTAAAGAA CAGGATTCCA GAATGATCAT TAAATTTGTT TCTATTTATT CTTTTTTGCC 1440

CCCCTAGAGA TTAAGTCCAG AAATGTACTT TCTGGCACAT AAAGAAATCT TGAGGACTTT 1500 

GTTTAAACCT TCCATAAAAA AACAATTTTC GGTTTCTCGG GTTCTCTCTC TCTGTCTCTC 1560 

TGTCTCTCTG TCTCTCTGTC TCTCTGTCTC TCTCTCTCTC TCTCTTTCTT TCTTTGTGTA 1620 

TTTTATTCAA GATGAGTTGG ACCCATTGCC AGTGAGTCTG AATGTCACTG ACAGCCCTGT 1680 

GTTGTGCTCA GGACTCACTC TGCTGCTGGT GGAAACTCAT GGCTTCTCTC TCTCTTTGAT 1740 

CCCATAAAGC TACGAGGGGG ACGGGAGAGG GCAGTGCAAT GGGAAGTAAA GAGATATTTT 1800 

CCAGTAGGAA AAGCAATGCT TTCTTGTCTT TAGACTCAAA TGCTTAGGGA ACGTTTCATT 1860

TCTCATTCAT GGGGARAGGC AGCCTCCTTA AATGTTTTCT GAAGAGCGGT AAAATCTAGA 1920 

AGCTTAAGAA TTTACAGTTC CTTCA&TAAC CATGATGACC TGAAGTTCAC CTATCCCATT 1980 

C CTTTCCAATT TTGCTTATAC TGCTGTAATA 2040 

A AAGACCAGCT AAAATTTTCG ACTTGACTTT 2100 



TTAACTTAAC TCATGAATTA ATTAAAGCAA ATGAAAAAAT TAAAAAGTGT GACTTTTTCT 
MLLLAAAFLV AFVLLLYMVS PLISPKPLAL PGAHVVVTGG SSGIGKCIAI ECYKQGAFIT
LVARNEDKLL QAKKEIEMHS INDKQVVLCI SVDVSQDYNQ VENVIKQAQE KLGPVDMLVN
CAGMAVSGKF EDLEVSTFER LMSINYLGSV YPSRAVITTM KERRVGRIVP VSSQAGQLGL 
PGFTAYSASK FAIRGLAEAL QMEVKPYNVY ITVAYPPDTD TPGFAEENRT KPLETRLISE 
TTSVCKPEQV AKQIVKDAIQ GNFNSSLGSD GYMLSALTCG MAPVTSITEG LQQVVTMGLF
RTIALFYLGS FDSIVRRCMM Q



ATTATATATT GCATTATATT TGCCGGTAAA ACTCGGTTAA TTTTTAAAAT 
TGGTCCCGGG CTCCCTTATA GGATAATTGG 



TTTGGATTGC TTAAGTCCAA TTATTAAATG CTGCGGTTTC AAATTCCCAG CTGGAAGGAC 
CACCCATTTA AAAACTTCAG AAGGCAGGAT CCTGCTCAAT TTATAAGGCT TTGGAAAATT
CCAGGCATTG GTTTGACATA TTTCCAGAGC TCAAATCTGC AGTGTTCCAC ATGCACATAC 
AAGATCCAGA GTCTCATGTT AAAATCACTT ACATACCCAG AAAGACCACC ACTTTGCAGG 
TATAATATTG TACTTAAAGA CAGAGAGGAA GTGTTTCTTA ATCCAAACAC ATGTACACCA 
&AGAAI ACAT AAGATGCCTT CTTCCATCAA ATGCACTTGC TTGTGAATTA ATGGACTTGT 
AAATGAAACA ATGCAATCAG TCTTTTATAA TGCACTGTTC AATTTGAGAT TCAAGTATTT 
CTATTTCTTG GAAAAAATTT TAAGAATCAA AAATAAAGAA AATAAAAAGT GCATACAGTT 

Pl aaaaaaaaaa aa 
CTTGTTCATG GCATCTTTAG AAACAAACTG CAATTTTATT TCATTTCCTT GTCGTTCATA 60

CAAAGATTAC AAGACTAGCT TATGTGTGGA CTGTGACTGT TTTAAGGAAA AATCATGTCC 120 

TCGGCTGGGT TATCAAGCCA AGCTATTTAA AGGTGTTTTA AAAGAAAGGA TGGAAGGAAG 180 

ACCTCTTAGG ACCACTGTGT TTTTGGATAC AAGTGGTACA TATCCATTCT GTAGCCTATT 240 

ATTTTGTTCT CAGTATAATT GTTCCAGATA AAACTATGAT GGATGGCTCG TTTTCATTTA 300

AATTATTAAA TCAGCTTGGA ATGATTGAAG AGCCAAGGCT TTATGAAAAG AACAAACCAT 360

TTTATAAACT TCAAGAAGTC AAGATTCTTG CTCAATTTTA TAATGACTTT GTA.WTAHT 420 

CAAGCATTGG TTTGACATAT TTCCAGAGCT CAAATCTGCA GTGTTCCACA TGCACATACA 480 

AGATCCAGAG ACTCATGTTA AAATCACTTA CATACCCAGA AAGACCACCA CTTTGCAGGT 540 

ATAATATTGT ACTTAAAGAC AGAGAGGAAG TGTTTCTTAA TCCAAACACA TGTACACCAA 600

AGAACACATA AGATGCCTTC TTCCATCAAA TGCRCTTGCT TGTGAATTAA TGGACTTGTA 660 

AATGAAACAA TGCAATCAGT CTTTTATAAT GCACTGTTCA ATTTGAGATT CAAGTATTTC 720 

TATTTCTTGG AAAAAATTTT AAGAATCAAA AATAAAGAAA ATAAAAAATG CATACAGTTA 780 
Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51 

| | | | | |

TTTTTTTTTT TTTTTTTTTT TTAAATGCAA AGGAGCAACA AAATTTATTG ACTGAATTAA 
ACACAACAGT AAAATGGCAG TGTTGTAATT TCATTTTCAG ATGTTTGAAT GGAACAAGAA 
T AGCCCAAGCT TCTTACATTC ATTAAAAGAG TGACTATCAA AAACAGCAAC 
G GTAGATATGC ACAAAATGGA ATTATATCAA CAAATATACA AAATACCCAA 
T TTACAGGTTT AAAAATATAA ACATTGATTC CTCTATCCCA TTAAACCATT 
IAGGAAA GACCCTATTG CTATTTAGAA TCCTTTTTAA AACAAGTTTT 
TAAAACATAG AATTAGTTCT AGGAGACAAT TTTTGATGTT TTTCAGGGGT TTAACATTCT 
ATTATAAAAA TAACATCTAT AAACCTACTA ACAATTTTCC TCCTGTGCAC AAAAATAATA 
CTGCCAAAAC CTGTCCTCAA AGACATGCCT GACTTTCAGG AAAGCTAATT ATGGAAATGG 
AGTTTCTCGT TTGGGTTATC TTTGTTACTA TTTTCAAATA ACCAGCAACT CCCTATATTA 
CACTGAGATA CTTTATATAA ATAACGTGGG CGAAACCTGA AGTTCACAAT GAGCCTGCTA 
GGTAGCTGGT GTCAAGTACA AATGATAGGA ATTGACTTTG CCAACAACAT CAAAAGCATT 
TTCCCTGATA TTCCTGAIAG ACCTACCACT ATCAGATCCT CCATATTCAA TAAGATTTAT 
A ATGAAAATGT TAAATATTAC TTTGCTAGAG TTTCTCCTCC TTTATTTAGA 
3 TAGTGGGGAC CAGTGGTTGT AATGTAGATA CTTGAGAAGT TTCATTGATT 
CCTTCAGACC CATGCCG 

Nucleic Acid Accession #: NM_022898.1 



| | | | | |

TTTTCTTGCT TTTCTTCCCT TTTTTTTCTT TTTGCAAACA AAACAAAAAA C»' 
GAAAGAGCAA AATAAAGAAG AAGAAGAGGA GGAAGAGAGG GAAAGAGAGG A 
AAACACCAAC CCGGGCAGAG GAGGAGGTGC GGCGGCGGCG GCGGCGGCGG CAGCGGCGGC 
AGCGGCGCGG CGGCGGCTCG GACCCCCTCC CCCGGCTCCC CCCATCAGTG C 
GGCGATGCCA GAATAGATGC CGGGGCAATG TCCCGCCGCA AACAGGGCAA C 
TTGTCCCAGA GGGAGCTCAT CACCCCAGAG GCTGACCATG TGGAGGCCGC C 

GAAGACGAGG GTCTGGAGAT AGAGGAGCCA AGTGGCCTGG GGCTGATGGT GGGTGGCCCC 420 

GACCCTGACC TGCTCACCTG TGGCCAGTGT CAAATGAACT TCCCCTTGGG GGACATCCTG 480 

GTTTTTATAG AGCACAAAAG GAAGCAGTGT GGCGGCAGCT TGGGTGCCTG CTATGACAAG 540 

GCCCTGCACA AGGACAGCCC GCCACCCTCC TCACGCTCCG AGCTCAGGAA AGTGTCCGAG 600 

CCGGTGGAGA TCGGGATCCA AGTCACCCCC GACCAAGATG ACCACCTGCT CTCACCCACG 660

AAAGGCATCT GTCCCAAGCA GGAGAACATT GCAGGTAAAG ATGAGCCTTC CAGCTACATT 720

TGCACAACAT GCAAGCAGCC CTTCAACAGC GCGTGGTTCC TGCTGCAGCA CGCGCAGAAC 780

ACGCACGGCT TCCGCATCTA CCTGGAGCCC GGGCCGGCCA GCAGCTCGCT CACGCCGCGG 840 

CTCACCATCC CGCCGCCGCT CGGGCCGGAG GCCGTGGCGC AGTCCCCGCT CATGAATTTC 900 

CTGGGCGACA GCAACCCCTT CAACCTGCTG CGCATGACGG GCCCCATCCT GCGGGACCAC 960 

CCGGGCTTCG GCGAGGGCCG CCTGCCGGGC ACGCCGCCTC TCTTCAGTCC CCCGCCGCGC 1020 

CACCACCTGG ACCCGCACCG CCTCAGTGCC GAGGAGATGG GGCTCGTCGC CCAGCACCCC 1080 

AGTGCCTTCG ACCGAGTCAT GCGCCTGAAC CCCATGGCCA TCGACTCGCC CGCCATGGAC 1140 

TTCTCGCGGC GGCTCCGCGA GCTGGCGGGC AACAGCTCCA CGCCGCCGCC CGTGTCCCCG 1200 

GGCCGCGGCA ACCCTATGCA CCGGCTCCTG AACCCCTTCC AGCCCAGCCC CAAGTCCCCG 1260

TTCCTGAGCA CGCCGCCGCT GCCGCCCATG CCCCCTGGCG GCACGCCGCC CCCGCAGCCG 1320 

CCAGCCAAGA GCAAGTCGTG CGAGTTCTGC GGCAAGACCT TCAAGTTCCA GAGCAATCTC 1380 

ATCGTGCACC GGCGCAGTCA CACGGGCGAG AAGCCCTACA AGTGCCAGCT GTGCGACCAC 1440 

GCGTGCTCGC AGGCCAGCAA GCTCAAGCGC CACATGAAGA CGCACATGCA CAAGGCCGGC 1500 

TCGCTGGCCG GCCGCTCCGA CGACGGGCTC TCGGCCGCCA GCTCCCCCGA GCCCGGCACC 1560 

AGCGAGCTGG CGGGCGAGGG CCTCAAGGCG GCCGACGGTG ACTTCCGCCA CCACGAGAGC 1620 

GACCCGTCGC TGGGCCACGA GCCGGAGGAG GAGGACGAGG AGGAGGAGGA GGAGGAGGAG 1680 

GAGCTGCTAC TGGAGAACGA GAGCCGGCCC GAGTCGAGCT TCAGCATGGA CTCGGAGCTG 1740 

AGCCGCAACC GCGAGAACGG CGGTGGTGGG GTGCCCGGGG TCCCGGGCGC GGGGGGCGGC 1800 

GCGGCCAAGG CGCTGGCTGA CGAGAAGGCG CTGGTGCTGG GCAAGGTCAT GGAGAACGTG 1860 

GGCCTAGGCG CACTGCCGCA GTACGGCGAG CTCCTGGCCG ACAAGCAGAA GCGCGGCGCC 1920 

TTCCTGAAGC GTGCGGCGGG CGGCGGGGAC GCGGGCGACG ACGACGACGC GGGCGGCTGC 1980 

GGGGACGCGG GCGCGGGCGG CGCGGTCAAC GGGCGCGGGG GCGGCTTCGC GCCAGGCACC 2040 

GAGCCCTTCC CCGGGCTCTT CCCGCGCAAG CCCGCGCCGC TGCCCAGCCC CGGGCTCAAC 2100 

AGCGCCGCCA AGCGCATCAA GGTGGAGAAG GACCTGGAGC TGCCGCCCGC CGCGCTCATC 2160 

CCGTCCGAGA ACGTGTACTC GCAGTGGCTG GTGGGCTACG CGGCGTCGCG GCACTTCATG 2220 

AAGGACCCCT TCCTGGGCTT CACGGACGCA CGACAGTCGC CCTTCGCCAC GTCGTCCGAG 2280 

CACTCGTCCG AGAACGGCAG CCTGCGCTTC TCCACGCCGC CCGGGGACCT GCTGGACGGC 2340

GGCCTCTCGG GCCGCAGCGG CACGGCCAGC GGAGGCAGCA CCCCGCACCT GGGCGGCCCG 2400 

GGCCCCGGGC GGCCCAGCTC CAAGGAGGGC CGCCGCAGCG ACACGTGCGA GTACTGCGGC 2460 

AAGGTGTTCA AGAACTGCAG CAACTTGACG GTGCACCGGC GGAGCCACAC CGGCGAGCGG 2520 

CCTTACAAGT GCGAGCTGTG CAACTACGCG TGCGCGCAGA GCAGCAAGCT CACGCGCCAC 2580 

ATGAAGACGC ACGGGCAGAT CGGCAAGGAG GTGTACCGCT GCGACATCTG CCAGATGCCC 2640 

TTCAGCGTCT ACAGCACCCT GGAGAAACAC ATGAAAAAGT GGCACGGCGA GCACTTGCTG 2700 

ACTAACGACG TCAAAATCGA GCAGGCCGAG AGGAGCTAAG CGCGCGGGCC CCGGCGCCCC 2760 

GCACCTGTAC AGTGGAACCG TTGCCAACCG AGAGAATGCT GACCTGACTT GCCTCCGTGT 2820 

CACCGCCACC CCGCACCCCG CGTGTCCCCG GGGCCCAGGG GAGGCGGCAC TCCAACCTAA 2880 

CCTGTGTCTG CGAAGTCCTA TGGAAACCCG AGGGTTGATT AAGGCAGTAC AAATTGTGGA 2940

GCCTTTTAAC TGTGCAATAA TTTCTGTATT TATTGGGTTT TGTAATTTTT TTGGCATGTG 3000 

CAGGTACTTT TTATTATTAT TTTTTCTGTT TGAATTCCTT TAAGAGATTT TGTTGGGTAT 3060 

CCATCCCTTC TTTGTTTTTT TTTTAACCCG GTAGTAGCCT GAGCAATGAC TCGCAAGCAA 3120 

TGTTAGAGGG GAAGCATATC TTTTAAATTA TAATTTGGGG GGAGGGGTGG TGCTGCTTTT 3180 

TTGAAATTTA AGCTAAGCAT GTGTAATTTC TTGTGAAGAA GCCAACACTC AAATGACTTT 3240 

TAAAGTTGTT TACTTTTTCA TTCCTTCCTT TTTTTTGTCC TGAAATAAAA AGTGGCATGC 3300 

AGTTTTTTTT TTAATTATTT TTTAATTTTT TTTTTGGTTT TTGTTTTTGG GGTGGGGGGT 3360 

GTGGATGTAC AGCGGATAAC AATCTTTCAA GTCGTAGCAC TTTGTTTCAG AACTGGAATG 3420 
GAGATGTAGC ACTCATGTCG TCCCGAGTCA AGCGGCCTTT TCTGTGTTGA TTTCGGCTTT 3480

CATATTACAT AAGGGAAACC TTGAGTGGTG GTGCTGGGGG AGGCACCCCA CAGACTCAGC 3540

1C CTCTGGGAAA TGGCCCGACA GCATTCTGAG 3600

, GT TGGTGTGCCC TGGGGTGCCA TGGCTGCTAT 3660 

A TTAGGCTACA TAAAATGGGC TGAGGGTACC TTTTTGGGGA GATGGGGTGG 3720 

A CACAGAAAGG AAGAAACTAG CGGTGTTCTT TTAGGCGTTT TCTGGCTTGA 3780

I CTTTTTTTAA ATCACCCCCA CCACATAAAT CTCAAATCCT ATGTTGCTAC 3840

T CCATCATTTC CCAAGCAGAC GAATGCCCTA ATTAATTGAA GTTAGTGTTC 3900

TCTCATTTAA TGCACACTGA TGATATTGTA GGGATGGGTG GGGTGGGGAT CTTGCAAATT 3960

TCTATTCTCT TTTACTGAAA AAGCAGGGGA TGAGTTCCAT CAGAAGGTGC CCAGCGCTAC 4020 

TTCCCAGGTT TTTATTTTTT TTTTCCTATC TCATTAGGTT GGAAGGTACT AAATATTGAA 4080 
CTGTTAAGAT TAGACATTTG AATTCTGTTG ACCCGCACTT TAAAGCTTTT G' 



GTTAAACTGT AAGGGACAGC TGAGATTGAG TGTCAGTATT GCTAAGCGTG GCATTCACAA 4260 
TACTGGCACT ATAAAGAACA AAATAAAATA ATAATTTATA GGACAGTTTT TCTACTGCCA 4320 
TTCAATTTGA TGTGAGTGCC TTGAAAACTG ATCTTCCTAT TTGAGTCTCT TGAGACAAAT 4380 
GCAAAACTTT TTTTTTGAAA TGAAAAGACT TTTTAAAAAA GTAAAACAAG A 
TCTTTAGAAA CTAACAAAGC CACAT1TACT TTAAGTAAAA AAAAAA? 
AGATAGAGGA TATGAAATGC CATAAGACCC AATCAAATGA AGAAATAAAC CCAGCACAAC 
CTTGGACATC CATTAGCTGA ATTATCCTCA GCCCCTTTTG TTTTTGGGAC A " 
AGATATGGAG TGGAGGTGAT TTACTGCTGA ATTAAAACTC AAGTGACACA A 
TGATATCGTT GAATGAAAAG CAAAACAAAA ACAATTCAGG AACAACGGCT A 
AAAGTTAAAT TTAGTGCACT CTGTCTTAAA AATACGTTTA CAGTATTGGG T 
GGTAAAAAAA AAATTGTGTG TATGTGTGTT GGAGCGATCT TTTTTTTTCA AAGTTTGCTT 4860 
AATAGGTTAT ACAAAAATGC CACAGTGGCC GCGTGTATAT TGTTTTCTTT TGGTGACGGG 4920 
GTTTTAGTAT ATATTATATA TATTAAAATT TCTTGATTAC TGTAAAAGTG GACCAGTATT 4980 
TGTAATAATC GAGAATGCCT GGGCATTTTA CAAAACAAGA AAAAAAATAC CCTTTTCTTT 5040 
TCCTTGAAAA TGTTGCAGTA AAATTTAAAT GGTGGGTCTA TAAATTTGTT CTTGTTACAG 5100
TAACTGTAAA GTCGGAGTTT TAGTAAATTT TTTTCTGCCT TGGGTGTTGA ATTTTTATTT 5160 
CAAAAAAAAT GTATAGAAAC TTGTATTTGG GGATTCAAAG GGGATTGCTA CACCATGTAG 5220 
AAAAAGTATG TAGAAAAAAA GTGCTTAATA TTGTTATTGC TTTGCAGAAA AAAAAAAAAT 5280 
CACATTTCTG ACCTGTACTT ATTTTTCTCT TCCCGCCTCC CTCTGGAATG GATATATTGG 5340 
TTGGTTCATA TGATGTAGGC ACTTGCTGTA TTTTTACTGG AGCTCGTAAT TTTTTAACTG 5400
TAAGCTTGTC CTTTTAAAGG GATTTAATGT ACCTTTTTGT TAGTGAATTT GGAAATAAAA 5460 
AGAAAAAAAA AACAAAAACA AACAGGCTGC CATAATATAT TTTTTTAATT TGGCAGGATA 5520 
AAATATTGCA AAAAAAACAC ATTTGTATGT TAAGTCCTAT TGTACAGGAG AAAAAGGGTT 5580 
GTTTGACAAC CTTTGAGAAA AAGAAACAAA AGGAAGTAGT TAAATGCTTT GGTTCACAAA 5640 
TCATTTAGTT GTATATATTT TTTGTCGGAA TTGGCCTACA CAGAGAACCG TTCGTCTTCC 5700
GCTTCTCTCT CAACGCCCCG AACCTTGCAT CAAGGCTCCT TGGTGTGGCC ACAGCAGACC 5760
AGATGGGAAA TTATTTGTGT TGAGTGGAAA AAAATCAGTT TTTGTAAAGA TGTCAGTAAC 5820
ATTCCACATC GTCCTCCCTT TCTCTAAGAG GCCATCTCTA AGATGTCAGA TGTACAGGAG 5880 
AGAGAGCGAG AGAACATCTT CCTTCTCTAC CATCACTCCT GTGGCGGTCA CCACCACCAC 5940 
CTCTCCCGCC CTTACCAGCA GAAAGCAATG CAAACTGAGC TGCTTTAGTC CTTGAGAAAT 6000
TGTGAAACAA ACACAAATAT CATAAAAGGA GCTGGTGATT CAGCTGGGTC CACGTGAAGT 6060
GACCTGCTGT TGAGACCGGT ACAAATTGGA TTTCAGGAAG GAGACTCCAT CACAGCCAGG 6120
ACCTTTCGTG CCATGGAGAG TGTTCGCCTC TTGTCTTTCT TCCCTGCTTT GCTGCTTTGC 6180 
TCTCTGAAAC CTACATTCCG TCAGTTTCCG AATGCGAGGG CCTGCGATCA ATTTGGTGCC 6240 
TTTCCATATC TCGTTCTCTC TCCTTCCCCT GCGTTTCCTC TCCATCCTTC ATCCTCCATT 6300 
GGTCCTTTTT TTTTCTTTCA TTTTTTATTT AATTTCTTTT CTTCCTGTCT GTTCCTCCCC 6360 
TAATCCTCTA TTTTATTTTT ATTTTTTGTA AAGCCAAGTA GCTTTAAGAT AAAGTGGTGG 6420
TCTTTTGGAT GAGGGAATAA TGCATTTTTA AATAAAATAC CAATATCAGG AAGCCATTTT 6480
TTATTTCAGG AAATGTAAGA AACCATTATT TCAGGTTATG AAAGTATAAC CAAGCATCCT 6540 
TTTGGGCAAT TCCTTACCAA ATGCAGAAGC TTTTCTGTTC GATGCACTCT TTCCTCCTTG 6600 
CCACTTACCT TTGCAAAGTT AAAAAAAAGG GGGGAGGGAA TGGGAGAGAA AGCTGAGATT 6660 
TCAGTTTCCT ACTGCAGTTT CCTACCTGCA GATCCAGGGG CTGCTGTTGC CTTTGGATGC 6720
CCCACTGAGG TCCTAGAGTG CCTCCAGGGT GGTCTTCCTG TAGTCATAAC AGCTAGCCAG 6780 
T TGCCAGGACT AAGCCATCCC AAAGCACAAG CATTGTGTGT 6840 
A GAGAATTTTG CTTCTGTTTT GTGTTTAAAA AACCAACACG 6900
3 AGAGGCCTCT AGCATGGSIG ACCCAGCCGA CCTCAGGCCG 6960 
. ,^,„.C TGCCACAACT TTGTTCAAAG TTGCCCCCAA TTGGAACCTG CCACTTGGCA 7020 
TTAGAGGGTC TTTCATGGGG AGAGAAGGAG ACTGAATTAC TCTAAGCAAA ATGTGAAAAG 7080 
TAAGGAAATC AGCCTTTCAT CCCGGTCCTA AGTAACCGTC AGCCGAAGGT CTCGTGGAAC 7140 
ACAGGCAAAC CCGTGATTTT GGTGCTCCTT GTAACTCAGC CCTGCAAAGC AAAGTCCCAT 7200
TGATTTAAGT TGTTTGCATT TGTACTGGCA AGGCAAAATA TTTTTATTAC CTTTTCTATT 7260
ACTTATTGTA TGAGCTTTTG TTGTTTACTT GGAGGTTTTG TCTTTTACTA CAAGTTTGGA 7320 
ACTATTTATT ATTGCTTGGT ATTTGTGCTC TGTTTAAGAA ACAGGCACTT TTTTTTATTA 7380 
TGGATAAAAT GTTGAGATGA CAGGAGGTCA TTTCAATATG GCTTAGTAAA ATATTTATTG 7440 
TTCCTTTATT CTCTGTACAA GATTTTGGGC CTCTTTTTTT CCTTAATGTC ACAATGTTGA 7500 
GTTCAGCATG TGTCTGCCAT TTCATTTGTA CGCTTGTTCA AAACCAAGTT TGTTCTGGTT 7560 
TCAAGTTATA AAAATAAATT GGACATTTAA CTTGATCTCC AAA

Seq ID NO: B18 Protein sequence 



| | | | | |

MSRRKQGNPQ HLSQRELITP EADHVEAAIL EEDEGLEIEE PSGLGLMVGG PDPDLLTCGQ
CQMNFPLGDI LVFIEHKRKQ CGGSLGACYD KALDKDSPPP SSRSELRKVS EPVEIGIQVT 
PDEDDHLLSP TKGICPKQEN IAGKDEPSSY ICTTCKQPFN SAWFLLQHAQ KTHGFRIYLE 
PGPASSSLTP RLTIPPPLGP EAVAQSPLMN FLGDSNPFNL LRMTGPILRD HPGFGEGRLP
GTPPLFSPPP RHHLDPHRLS AEEMGLVAQH PSAFDRVMRL NPMAIDSPAM DFSRRLRELA 
GNSSTPPPVS PGRGNPMHRL LNPFQPSPKS PFLSTPPLPP MPPGGTPPPQ PPAKSKSCEF 
CGKTFKFQSN LIVHRRSHTG EKPYKCQLCD HACSQASKLK RHMKTHMHKA GSLAGRSDDG 
LSAASSPEPG TSELAGEGLK AADGDFRHHE SDPSLGHEPE EEDEEEEEEE EELLLENESR
PESSFSMDSE LSRNRENGGG GVPGVPGAGG GAAKALADEK ALVLCI "J L AL
ELIADKQKRG AFLKRAAGGG DAGDDDDAGG CGDAGAGGAV NGRGGGFAPG TEPFPGLFPR 
KPAPLPSPGL NSAAKRIKVE KDLELPPAAL IPSENVYSQW LVGYAASRHF MKDPFLGFTD
AEQSPFATSS EHSSENGSLR FSTPPGDLLD GGLSGRSGTA SGGSTPHLGG PGPGRPSSKS
GRRSDTCEYC GKVFKNCSNL TVHRRSHTGE RPYKCELCNY ACAQSSKLTR HMKTHGQIGK
EVYRCDICQM PFSVYSTLEK HMKKWHGEHL LTNDVKIEQA ERS

Seq ID NO: B19 DNA sequence 

Nucleic Acid Accession #: NM_000399.2 



TAACTGAGCG AGGAGCAATT GATTAATAGC TCGGCGAGGG GACTCACTGA CTGTTATAAT 60 
AACACTACAC CAGCAACTCC TGGCTTCCCA GCAGCCGGAA CACAGACAGG AGAGAGTCAG 120 
G ACATTTTTCT TATTTCTTAA AAAACAGCAA CTTGTTTGCT ACTTTTATTT 180 



T CGCCGCCACG TCGGTGACCA TCTTTCCCAA TGCCGAACTG G 
TTGACCAGAT GAACGGAGTG GCCGGAGATG GCATGATCAA CATTGACATG ACTGGAGAGA 
AGAGGTCGTT GGATCTCCCA TATCCCAGCA GCTTTGCTCC CGTCTCT 
AGACCTTCAC TTACATGGGC AAGTTCTCCA TTGACCCTCA GTACCCTGGT G 
T ATTGTGAGTG CAGGCATCTT GCAAGGGGTC A 

G GCCACAGGAC 



GATTCTTTCC ATCTCAGTGC CAGAGAGACC TACATGGTAC AGCTGGCCCA G 

G CCCACTGGAC ACCCTGCGGG TGCCCCCTCC ACTCACTCCA CTCTCTACAA 
T TACCCTGGGG GGCCCCAGTG CTGGGGTGAC CGGACCAGGG GCCAGTGGAG 
GCAGCGAGGG ACCCCGGCTG CCTGGTAGCA GCTCAGCAGC AGCAGCAGCC GCCGCCGCCG 
CCGCCTATAA CCCACACCAC CTGCCACTGC GGCCCATTCT GAGGCCTCGC AAGTACCCCA 
ACAGACCCAG CAAGACGCCG GTGCACGAGA GGCCCTACCC GTGCCCAGCA G 
ACCGGCGGTT CTCCCGCTCT GACGAGCTGA CACGGCACAT CCGAATCCAC A 

ACCCCTTCCA GTGTCGGATC TGCATGCGCA ACTTCAGCCG CAGTGACCAC CTCACCACCC 1500

ATATCCGCAC CCACACCGGT GAGAAGCCCT TCGCCTGTGA CTACTGTGGC CGAAAGTTTG 1560

CCCGGAGTGA TGAGAGGAAG CGCCACACCA AGATCCACCT GAGACAGAAA GAGCGGAAAA 1620 

GCAGTGCCCC CTCTGCATCG GTGCCAGCCC CCTCTACAGC CTCCTGCTCT GGGGGCGTGC 1680 

AGCCTGGGGG TACCCTGTGC AGCAGTAACA GCAGCAGTCT TGGCGGAGGG CCGCTCGCCC 1740 

CTTGCTCCTC TCGGACCCGG ACACCTTGAG ATGAGACTCA GGCTGATACA CCAGCTCCCA 1800

AAGGTCCCGG AGGCCCTTTG TCCACTGGAG CTGCACAACA AACACTACCA CCCTTTCCTG 1860 

TCCCTCTCTC CCTTTGTTGG GCAAAGGGCT TTGGTGGAGC TAGCACTGCC CCCTTTCCAC 1920

CTAGAAGCAG GTTCTTCCTA AAACTTAGCC CATTCTAGTC TCTCTTAGGT GAGTTGACTA 1980 

TCAACCCAAG GCAAAGGGGA GGCTCAGAAG GAGGTGGTGT GGGGATCCCC TGGCCAAGAG 2040 

GGCTGAGGTC TGACCCTGCT TTAAAGGGTT GTTTGACTAG GTTTTGCTAC CCCACTTCCC 2100

CTTATTTTGA CCCATCACAG GTTTTTGACC CTGGATGTCA GAGTTGATCT AAGACGTTTT 2160

CTACAATAGG TTGGGAGATG CTGATCCCTT CAAGTGGGGA CAGCAAAAAG ACAAGCAAAA 2220

CTGATGTGCA CTTTATGGCT TGGGACTGAT TTGGGGGACA TTGTACAGTG AGTGAAGTAT 2280 

AGCCTTTATG CCACACTCTG TGGCCCTAAA ATGGTGAATC AGAGCATATC TAGTTGTCTC 2340 

AACCCTTGAA GCAATATGTA TTATATACTC AGAGAACAGA AGTGCAATGT GATGGGAGGA 2400 

ACGTAGCAAT ATCTGCTCCT TTTCGAGTTG TTTGAGAAAT GTAGGCTATT TTTTCAGTGT 2460 

ATATCCACTC AGATTTTGTG TATTTTTGAT GTACCCACAC TGTTCTCTAA ATTCTGAATC 2520 

TTTGGGAAAA AATGTAAAGC ATTTATGATC TCAGAGGTTA ACTTATTTAA GGGGGATGTA 2580 

CATATTCTCT GAAACTAGGA TGCATGCAAT TGTGTTGGAA GTGTCCTTGG TCGCCTTGTG 2640 

TGATGTAGAC AAATGTTACA AGGCTGCATG TAAATGGGTT GCCTTATTAT GGAGAAAAAA 2700 

ATCACTCCCT GAGTTTAGTA TGGCTGTATA TTTATGCCTA TTAATATTTG GAATTTTTTT 2760 

TAGAAAGTAT ATTTTTGTAT GCTTTGTTTT GTGACTTAAA AGTGTTACCT TTGTAGTCAA 2820

ATTTCAGATA AGAATGTACA TAATGTTACC GGAGCTGATT TGTTTGGTCA TTAGCTCTTA 2880 

T CTATTCTAAC GCAAAACCAC TAACTGAAGT TCAGATATAA 2940



| | | | | |

MMTAKAVDKI PVTLSGFVHQ LSDNIYPVED KAATSVTIFP NAELGGPFDQ MNG 
NIDMTGEKRS LDLPYPSSFA PVSAPRNQTF TYMGKFSIDP QYPGASCYPE GIINIVSAGI 
LQGVTSPAST TASSSVTSAS PNPLATGPLG VCTMSQTQPD LDHLYSPPPP PPPYSGCAGD 
LYQDPSAFLS AATTSTSSSL AYPPPPSYPS PKFATDPGLF PMIPDYPGFF PSQCQRDLHG 
DTLRVPP PLTPLSTIRN FTLGGPSAGV TGPGASGGSE GPRLPGSSSA 
Y NPHHLPLRPI LRPRKYPNRP SKTPVHERPY PCPABGCDRR FSRSDELTRH
F QCRICMRNFS RSDHLTTHIR THTGEKPFAC DYCGRKFARS DERKRHTKIH
LRQKERKSSA PSASVPAPST ASCSGGVQPG GTLCSSNSSS LGGGPLAPCS SRTRTP 

Seq ID NO: B21 DNA sequence

Coding sequence: 457..1893
CACACACGGG CGCACGCACA CGGCAGCCGG GCCAGGGACG ACCCTGTCAG CTGCAGCCCC



_____ 3 GCGCGCAGTG GGCTACAAAC TTTCGCAGCG CGAGTCCGCC 

AAGGCAGCGC GCCGACTCGG GCTCGGCTCG GCTCTGCGCT GCTCCGGACG GCTGTGACCG 
CTGGCCGGGG GCTCGGGCCG CCGGTACCCA CGGACCGCGC GCCCGGGTGC CTGCTCCGCT 
AAGCCCCTCG CCCCGCGCGG ACCTCGGTAT CCAGCGCCCT GCTGCCCGGG CTCTCCCCGC 
GCGCCCTACT GCCGCGAGGT CAGTCCGCAG CCTCCGGTGC GCCAGCGCTC GCCTTCCTCC 
TCCTGGACTT CGGCCCTTTG CCGCCCTCAC CACGCCATGG CTCATGTCCC CGCTCGGACC 
AGCCCGGGAC CCGGGCCCCA GCTGCTGCTG CTGCTGCTGC CGTTGTTTCT GCTGTTGCTC 
CGGGATGTGG CCGGCAGCCA CAGGGCCCCC GCCTGGTCCG CACTGCCCGC GGCCGCCGAC 
GGCCTGCAGG GGGACAGGGA TCTCCAGCGG CACCCTGGGG ACGCGGCCGC CACGTTGGGC 
CCCAGCGCCC AGGACATGGT CGCTGTCCAC ATGCACAGGC TCTATGAGAA GTACAGCCGG 
CAGGGCGCGC GGCCGGGAGG GGGCAACACG GTCCGCAGCT TCAGGGCCAG GCTGGAAGTG 
3GAAATGATC 



CTATGCAAGC CGCGGGCCAA GAACGCTTCA GGCCGCCCGC T 
CGCCAGCACC TGCTCTTCCG CAGCCTCTCG CAGAACACGG C 

GGGGCCATGG CCCTGGCGCC CCCACCGCGC GGCCTGTGGC AGGCCAAGGA CATCTCCCCC 
ATCGTCAAGG CGGCCCGCCG GGATGGCGAG CTGCTCCTCT CCGCCCAGCT GGATTCTGAG 
GAGAGGGACC CGGGGGTGCC CCGGCCCAGC CCCTATGCGC CCTACATCCT AGTCTATGCC 
G CCATCTCGGA GCCCAACAGC GTGGCAGTGA CGCTGCAGAG ATACGACCCC 
G GAGACCCCGA GCCCCGCGCA GCCCCCAACA ACTCAGCGGA CCCCCGCGTG
3 CGCAGGCCAC TGGGCCCCTC CAGGACAACG A 
AGGCCGCCGC GCGCCCACGC ACAGCACTTC CACAAGCACC AGCTGTGGCC C 
CGGGCGCTGA AACCCCGGCC AGGGCGCAAA GACCGCAGGA AGAAGGGCCA GGAGGTGTTC 1500 
ATGGCCGCCT CGCAGGTGCT GGACTTTGAC GAGAAGACGA TGCAGAAAGC CCGGAGGAAG 1560
CAGTGGGATG AGCCGAGGGT GTGCTCCCGG AGGTACCTGA AGGTGGACTT CGCAGACATC 1620
GGCTGGAATG AATGGATAAT CTCACCGAAA TCTTTTGATG CCTACTACTG CGCGGGAGCA 1680
TGTGAGTTCC CCATGCCTAA GATCGTTCGT CCATCCAACC ATGCCACCAT CCAGAGCATT 1740
GTCAGGGCTG TGGGCATCAT CCCTGGCATC CCAGAGCCCT GCTGTGTTCC CGATAAGATG 1800
AACTCCCTTG GGGTCCTCTT CCTGGATGAG AATCGGAATG TGGTTCTGAA GGTGTACCCC 1860
AACATGTCCG TGGACACCTG TGCCTGCCGG TGAGACCACT CCAGGGTGGA A 
GCCCAGCAGA GCTGCCTTCT CGGAGCCTTC TGCAAO 

CACAGAGCAC AGCTCATGGG CAACATCACT GGGGCCCAGA GAGAGCTGTC C 
TCATTAGGGG GTCTTTCATT GCTAGTGACT AGCCCCTTAA ATGCCAGCCT GAGTACCTGA 
AGGAATCTGG GAATTAGCCC TGGCCTGAAA GTGGCCCATC A 
AGGCTTGAAA ACAAAACATA TCCACAACAT TGGCTTGATG TGATCATCAT C 

AGCAAGAAGA CTATGCAAAT CTTAGGGCGC TCGCTCCCTG CACACGGAAA GAACTCTGTT 2280 
TAAATGCTCA GTTCAGAACA CTTTGGGCCA CATAGTGATT TTGGAAAACA GGATAATCGT 2340 
GGTGTAAATG AGTGTTTCCT TTCAAAGTCC ACTGCAGAGC TTTTATCCAT ATGGTATGCA 2400 
CATGTAGCCA ATATTGGTTT CTTTTTCTTA ATATATATAT TTTATTTTAA AACAACAAAA 2460 
AGGGAGGGCG TTGACACCAT TCCCCACAGA GATAGTCATG CTGAGTGTGG GTTGTTTAAA 2520
CATGCATATT GAAATAACAC ATATAGTAAC GTGGGAATAC TAAAAAATAA CCAAGATTTT 2580
ATATTTTTGT AAATTATACT TTCTATACTG TAGATTGTGT ATGTTATGTG TTTTTATGGA 2640
AAGCTAATAA ATTAAAGGTA CAGTGGTATC TTGA 

Seq ID NO: B22 Protein sequence 

VAGSHRAPAW SALPAAADGL QGDRDLQRHP 60 

ARPGGGNTVR SFRARLEWD QKAVYFFNLT 120 

in^io^., _.invn__ ( __ KPRAKNASGR PLPLGPPTRQ HLLFRSLSQN 180 

TATQGLLRGA MALAPPPRGL WQAKDISPIV KAARRDGELL LSAQLDSSER DPGVPRPSPY 240

A VTLQRYDPFP AGDPEPRAAP NHSADPRVRR AAQATGPLQD 300

9 PRAHAQHFHK HQLWPSPFRA LKPRPGRKDR RKKGQEVFMA ASQVLDFDEK 360 

TMQKARRKQW DEPRVCSRRY LKVDFADIGW NEWIISPKSF DAYYCAGACE FPMPKIVRPS 420 
NHATIQSIVR AVGIIPGIPE PCCVPDKMNS LGVLFLDENR NVVLKVYPNM SVDTCACR



60 




ATTCTTTAAA T; 

TCCATCATCT GTCATTTTTG CATCTGTTTT TATTGACTAA C 
ACATAT CCTT GCTTCTTCCC ATATTTAGTA AAGTTTAGTT GGATGCTGGA TTGT 



T ACTTGCAAAT CAGCTTGATC A 
AAAGGCTATG CTACTTCAAG TATAATAAAA CCTAGTTTTA G 

A ATTTGAATGT CTCCAACCCT GTGTGAACTC TGAAAATTGT TCAGCTCCCA 
T AATTTTTTTT GTTCAGCCTT GCAGTCTCAT CCTACTCAAG TGTGGCTCTG 
A GATCT CATG A AGATTTCTGA AACTTTTGCT CTGCAGGATA 
CCTTCCTTTG TGGTTATGTG CCCTGAAAAC TCCAGCCTCC GTGTCTTCAA TTCAGTGAAA 



ATCTAAGGGA ATTTCAGGAG TCACCTCATT TATTTTCCTT TTTTTCAGGG ATTATAGTAG 720 

TACTATACCA CCTGTCAGCC AATGTGTGAG AATGGTGGTT TCATACATTT TCTCCAATTC 780 

TTTCTTGTTA CAGGAGCCTC CATTACTTTT CAAACAGTAA TGCAGTTGCA GTTGCCTCTC 840

CTTTTCTTCA TTATGTGTTT ATCTCTGGCA GTTTGAGCCA AGAGAGGGCA CGGAGAAGTA 900

CAATGACTAG AGAGCACTTC TGTTGAGGCT CATTCATTGA CCCCTACCCC AGTGCTTTAT 960

GAATGTGTGC TGCAGATGTC ATACAGCATC ACAGCTTCTT CTCTAATTTA TGAGCCATAA 1020

TTTTTTTTTT TGTATTTTTA TTGTATTACC TGCTTGGTTG CAAGAGGATG ATGAGGAGGA 1080

CTACTATAGG ATAAATTTGT TTTTATAGAG CAATTTCTCA TGGCTCACGA GGGATAGTAC 1140 
T CGGCTTGATC ATCTGAGTCA GTGGGCTCAT TGTGGAAACA 1200

C TTTATGCTTT TTCTCAGGCT GCTTCTGTTA CATGAAATAA AGCCAGAGTT 1260 
GATTGTGAAA AAAAAAAAAA AAAA 



| |
AGAGCCAAGT ~ 

I TTATACTTCA TGTAGTAGCA GTGGGTACTG 

T CTGATGATTT GTGCTCTTAT TTTTCCTAGA TTTACCTCAT CTAGGGCATA 
3 TCTTCCTCTT TACCTTTCCT GGTCCTGATC CCTCTGTACT CAGTTCCCCT 
AAATTATTGG ACTACAAACT AATATACTAG AAAAGCATAC ACTTATTTTA TTTGAATGCA 
GAAATGCTAT CTATCAGTAT ATATACATAA GAATGTATAT TACAGTATAT CTATATATAC 
TTAACACTGT AACTTTCAGT ATTCCCCAGT TAGCGTACCT AACTCTCCTG TGGGTTATGT 
TAATTCTTAT TAGACTACTA GAGAAAAACC AACTGGCAGT TTGCTAAGCA TATCTACTGG 
TGTTGTTTCT GCGCCCTCTT TTGGCTAATT GATGTAATTA TACTGGCTCT AAAGATTTAC 
TGCCCCATAA GTAAATAGTA TAGCCACATT CTGAACATAT CAAAAGTACA AACTTAGGAG 
GAGTGTATGT ACAAAAATGT AAAATTTTAT GAAAATGAAC ATGTTTTTAT GATGTTATTT
CTAGTTCATA AGAATGTGAT GACTGCTTTG CTTCATTTAT GTACGTTCCC ATTATATTCT 
TGCTGTCAAT CAATCACAAA TTTATATCAG ATTAGGATAA ACTAAGCCAT TTTATGTATT 
TTATTTTAAA CCTTATTTTG GCAGAGTAAT TCCTTAGAAT TGGAAAAGCT GTTACTTTGA 
AATTACCAAT TIATTACAAA ACATAGAAAT GTATTGTAGC TACAAAGACA ACCAAGCATT 
TTCTGTGTTT TAATGAATAT CTAAAAAACT ACATTTAGTT TATTTTACTC "~ 
TGATTTTTTT ACTGGCTCTA TTGCCTTAAA ATAACTAAGA 
ATTTTCCTTT TCTTTGTTCT TTTTTTTACA TTTCGCAGAG 
ACAATTTCTT ATGTATTCTG GATAACTGAA AACAACTAAA GGTGTTGGGC ATTAGAAAAT 
AATTGTGAGC AGTAAGATTA CTGATGTAAT ATGTATGTTG GACTGAAGTA TTTCTTTATA 
AACATTCTAT TTGATTTTAA GCAAAATGTA TGTTAAAGCA TGTTTTTACA TCAGTAAAGT 
CATTTGTCGA CCTTCTGGAA ATGAAAGGTT TTTACCTAGA TACTGTAAGT TACACCTCCT 
TAACAATCAT ATTTGTCATT GTTGTTTTCT GCAAACAAAA ATGTTTATGG GCTTCATGTA 
GGCTTAAGAT TGTAGGCAAA AATGGACTGA GTTCAGGACC CTTCAAGCAG TAGGCATTCA 
GTTACAGAGC AGTTGGTACT TTGTAACCCA GACTTACAGT TTAAAAATAT CAAGTTAGCT 
GATGTTTCAT TATAATAAAA ATACTATTTT GCTTAAGAGT TGTATTACAA ATATTTGTGC 
TTAACATTAG AAATAGCTGT TTTAAATTGT AGTTAACATA TTAACTTTTT CAGAAARAAA 
GCATGGTTTA TTTTTAATAA TGAAATAGAG AACATAATAC GTAATGTTCA GTATAACAGC 
TGAGTTAfiAA CATCTGCCAG GATTAACATC AGTGCGTTTT TGCCAATGCA TAGAGGCATT 
TTTCTCTAAC TATGATGGCT AATGATAACT ATTCTTTGTT ACACATTCAA GTCACTCCCA 
TACAAGTAAC TAGTGGGTGA TATGTTTCAC TCCAAAGGGT GTATTAATTC TGAATGCTAA 
TCATGAAGAC TTAAGTTAGG ACAACACTTC AAACCAGGAA GTGTGAACTG ATTTA 
III

ATGGAGCGCG GGATGCACCT CGGTGCAGCG GCCGCCGGCG AGGACGACCT CTTCCTGCAC 60

AAGAGCCTGA GCGCCTCCAC CTCCAAGCGC TTGGAAGCGG CTTTCCGCTC CACGCCCCCG 120 

GGCATGGACC TGTCCCTGGC GCCGCCGCCT CGGGAACGCC CGGCGTCCTC CTCCTCGTCG 180 

CCCCTGGGCT GCTTCGAGCC GGCTGACCCC GAGGGGGCAG GGCTGCTGTT GCCGCCGCCT 240 

GGAGGAGGCG GCGGCGGCAG CGCGGGAAGT GGCGGCGGCG GCGGCGGCGG GGTGGGTGTC 300 

CCCGGGCTGC TAGTAGGTTC AGCCGGCGTT GGGGGCGACC CTAGCCTAAG CAGCCTGCCG 360 

GCCGGGGCCG CCCTTTGCCT CAAGTACGGC GAAAGCGCGA GCCGGGGCTC GGTGGCCGAG 420 

AGCAGCGGCG GCGAGCAGAG CCCCGACGAC GACAGCGACG GTCGCTGCGA GCTCGTGCTG 480 

CGGGCCGGAG TAGCCGACCC GCGGGCCTCC CCGGGAGCGG GAGGTGGTGG CGCGAAGGCA 540 

GCCGAGGGCT GCTCCAATGC CCACCTCCAC GGCGGCGCCA GCGTCCCCCC GGGGGGCCTG 600 

GGCGGCGGCG GCGGCGGGGG TAGCAGCAGC GGTAGCAGTG GCGGCGGTGG CGGTAGCGGT 660 

AGCGGCAGCG GCGGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAAGAAATC CAAAGAGCAA 720 

AAGGCGCTGC GGCTTAACAT CAATGCCCGA GAGCGCCGGC GGATGCACGA CCTGAACGAC 780 

GCGCTGGACG AGCTGCGCGC GGTGATCCCC TACGCGCACA GCCCCTCGGT GCGAAAGCTC 840 

TCCAAGATCG CCACGCTGCT GCTCGCCAAG AACTACATCC TCATGCAGGC GCAGGCCCTG 900 

GAGGAGATGC GGCGCCTAGT CGCCTACCTC AACCAGGGCC AGGCCATCTC GGCTGCCTCC 960 

CTGCCCAGCT CGGCGGCTGC AGCGGCAGCA GCTGCTGCCC TGCACCCGGC GCTCGGCGCC 1020 

3 CAGCCGGCTA CCCGTTCAGC GCCGGACTGC CCCCGGCTGC CTCCTGCCCG 1080 

3 CCCTGTTTAA CAGCGTCTCC TCCAGCCTCT GCAAACAGTG CACGGAGAAG 1140 

Protein Accession #: FGENESH predicted 



MERGMHLGAA AAGEDDLPLH KSLSASTSKR LEAAFRSTPP GIffiLSLAPPP RERPASSSSS 
PLGCFEPADP EGAGLLLPPP GGGGGGSAGS GGGGGGGVGV PGLLVGSAGV GGDPSLSSLP 
AGAALCLKYG ESASRGSVAE SSGGEQSPDD DSDGRCELVL RAGVADPRAS PGAGGGGAKA 
AEGCSNAHLH GGASVPPGGL GGGGGGGSSS GSSGGGGGSG SGSGGSSSSS SSSSKKSKEQ
KALRLNINAR ERRRMHDLND ALDELRAVIP YAHSPSVRKL SKIATLLLAK NYILMQAQAL
EEMRRLVAYL NQGQAISAAS LPSSAAAAAA AAALHPALGA YSQAAGYPFS AGLPPAASCP 
EKCALFNSVS SSLCKQCTEK P 



Seq ID NO: B27 DNA sequence
Nucleic Acid Accession # : cat
TTTTTTTTTT TTTTTTTTAA ATGTTTGATC AAAATATTTT AATAAAGATT CTTTCTGACA
TAGATACACA TACAAATGGT CGTACATAGC TGTCATAGTC TGATTGACCT ATTTAATATA 
TATATCATTC TTTACACATC CAAAACCCGC CAACAGATCC ATCACAGCTC CCAACTCACC 
ATCCAACCTG ACAAACTGAA TTTGTATTAT CTGCAAGGAG TGGAAAATAG CAGGACTCCA 
TTTTTAAAAA AGATTTTCTT GATTTTCATA GGGCGGAAAG GCAGTCAAAC AGCCATGCAA 
T GAAAGCTCAC TTTGGGTAAA TAGCTTCTTG TTCTTCCTTA GTTTTCTTTC 
T TTATTTTTTT AGAAAAATAA CAAAGGATTT CACACCATAG GCAAATCAAA 
T AACTTAAATA ATTCTCCACA GTTAAAATAA CATATATGTA CATATATATT 
I AAATTAGACC TTTTAAAAAT GCACAGCACA GCCTGGAAAA ATATTTGCTT 
5TATAGC AACTATTGCT GTGATGTTTT TCCCTTTTGG AATGTAAAGG 
^AAAGAG ATCAATTCAT TCATCAATTA AGAATACACC TTTCCTGTAA 
T GAAGCAATTT ATTAAAGCTC AATTTAAATA CAGGGATGAT GCAACTGAAA 



AGGTGCACAT AAATATT 
TTTTATATCA CTCTCGCCAA AACTTTGAGC ATTTTCACAG GATTAAAGTT 1080 
AGTCTTTCAT AGTAACATGT TCTCTCTCTC TCGCTCTGGG 1140 



TTTATTTTCA AGTGCCCTTT CTGATTTGTC TGAATGAATA TTCCATCTCT CATGCTACCG 1260 

Seq ID NO: B28 DNA sequence
Nucleic Acid Accession #: NM_002581.2
Coding sequence: 368..5251

1 11 21 31 41 51 

I I I I I I 

GGGGGAGGGA ATTCAGCGGA TCAGTCTTAA GAGGAGCTTT TTTTTGGAGC GAGAAATCAT 60 

ATAAAATAAA ATGAAATAAA ACAAGGAGGA AGGCAACCAG CTGTTAGGGG GAAAATAAGG 120 

CAGATAAAGG AGCGGGGAGA GAAATTAATT GCCAACCAGG AGGAGTTGGG CTGTATTTTT 180

CAAAGGTGGG GAGAGTGGAG CACACACCTT GAGGAGGAAA GCGAGAAAGA AAAGAAAAAA 240 

GCAAGTGAAG GGGGGCTCGC CCAAGAAGGG TGAAGAAGCG AAGAAAGTCG AGGCGCCGAG 300 

GCTCCCAAAG CTGGCAGCTC CGGGTGGCGG TGCAGGGGCG AAGGGGGGGG CGGGGGGAAC 360 

GTCGGACATG CGGCTCTGGA GTTGGGTGCT GCACCTGGGG CTGCTGAGCG CCGCGCTGGG 420 

CTCCGGGCTG GCCGAGCGTC CCCGCCGGGC CCGGAGAGAC CCGCGGGCCG GCCGACCCCC 480

GCGCCCCGCC GCCGGCCCGG CCACCTGCGC CACCCGCGGC CCGCGGCCGC CGCGCCTCGC 540 

CGCCGCCGCC GCCGCCGCCG GGCGTGCCTG GGAAGCCGTG CGCGTCCCCC GGCGGCGGCA 600 

GCAGCGGGAG GCGAGGGGCG CCACCGAGGA GCCGAGCCCG CCGAGCCGGG CGCTCTATTT 660

CAGCGGGCGA GGCGAGCAGC TCCGACTCCT CCGGGCCCAC CTCGAGCTGC CCCGGGACGC 720 

GTTCACGCTG CAAGTGTGGC TGCGAGCGGA GGGGGGCCAG AGGTCTCCGG CAGTGATCAC 780

AGGGCTGTAT GACAAATGTT CTTATATCTC ACGTGACCGA GGATGGGTCG TGGGCATTCA 840 

CACCATCAGT GACCAAGACA ACAAAGACCC ACGCTACTTT TTCTCCTTGA AGACAGACCG 900

AGCCCGGCAA GTGACCACCA TCAATGCCCA CCGCAGCTAC CTCCCAGGCC AGTGGGTATA 960

CCTAGCTGCC ACCTATGATG GGCAGTTCAT GAAGCTCTAT GTGAATGGTG CCCAGGTGGC 1020 

CACCTCTGGG GAACAAGTGG GTGGCATATT CAGCCCACTG ACCCAGAAGT GCAAAGTGCT 1080

CATGTTAGGG GGCAGTGCCC TGAATCACAA CTACCGGGGC TACATCGAGC ACTTCAGTCT 1140 

GTGGAAGGTG GCCAGGACTC AGCGGGAGAT ACTGTCTGAC ATGGAAACCC ATGGCGCCCA 1200

CACTGCTCTA CCTCAGCTCC TCCTCCAGGA GAACTGGGAC AATGTGAAGC ATGCCTGGTC 1260 

CCCCATGAAG GATGGCAGCA GCCCCAAAGT GGAATTCAGC AATGCCCACG GCTTTCTGCT 1320 

GGACACGAGT CTGGAGCCTC CTCTGTGCGG ACAGACATTG TGTGACAACA CAGAGGTCAT 1380

TGCCAGCTAC AATCAGCTCT CAAGTTTCCG CCAGCCCAAG GTGGTGCGCT ACCGCGTGGT 1440 

CAACCTCTAT GAAGATGATC ATAAGAACCC GACGGTGACG CGCGAGCAGG TGGACTTCCA 1500

GCACCATCAG CTGGCTGAGG CCTTCAAGCA ATACAACATC TCCTGGGAGC TGGACGTGCT 1560 

GGAGGTGAGC AACTCCTCCC TTCGCCGCCG CCTCATCCTG GCCAACTGTG ACATCAGCAA 1620 

GATTGGGGAT GAGAACTGTG ACCCCGAGTG CAACCACACG CTGACGGGCC ACGACGGCGG 1680

GGATTGCCGC CACCTGCGCC ACCCTGCCTT CGTGAAGAAG CAGCACAACG GGGTGTGTGA 1740

CATGGACTGC AACTATGAAC GGTTCAACTT TGATGGTGGA GAGTGCTGTG ACCCTGAAAT 1800 

CACCAATGTC ACTCAGACTT GCTTTGACCC CGACTCTCCA CACAGAGCCT ACTTGGATGT 1860

TAATGAGCTG AAGAACATTC TTAAATTGGA TGGATCAACA CATCTCAATA TTTTCTTTGC 1920 

AAAATCCTCA GAGGAGGAGT TGGCAGGAGT AGCAACTTGG CCATGGGACA AGGAGGCCCT 1980

GATGCACTTA GGTGGCATTG TCTTGAACCC ATCTTTCTAT GGCATGCCTG GGCACACCCA 2040 

CACCATGATC CATGAGATTG GTCACAGCCT GGGCCTCTAT CACGTCTTCC GAGGCATCTC 2100 

AGAAATCCAG TCCTGCAGTG ACCCCTGCAT GGAGACAGAG CCCTCCTTCG AGACTGGAGA 2160



AAATGACACC TGTGGCTTTC ATAGCTTCTT CAACACTCCT TACAACAACT TCATGAGCTA 2280

TGCAGATGAC GACTGTACGG ACTCCTTCAC GCCCAATCAA GTCGCCAGAA TGCACTGTTA 2340 

CCTGGACCTG GTCTACCAGG GCTGGCAGCC CTCCAGGAAA CCAGCGCCTG TTGCCCTCGC 2400 

CCCCCAAGTT CTGGGCCACA CAACGGACTC TGTGACACTG GAGTGGTTCC CACCTATAGA 2460 

TGGCCATTTC TTTGAAAGAG AATTGGGATC AGCATGTCAT CTTTGCCTGG AAGGGAGAAT 2520 

CCTGGTGCAG TATGCTTCCA ACGCTTCCTC CCCAATGCCC TGCAGCCCAT CAGGACACTG 2580

GAGCCCTCGT GAAGCAGAAG GTCATCCTGA TGTTGAACAG CCCTGTAAGT CCAGTGTCCG 2640 

CACCTGGAGC CCAAATTCAG CTGTCAACCC ACACACGGTT CCTCCAGCCT GCCCTGAGCC 2700 

TCAAGGCTGC TACCTCGAGC TGGAGTTCCT CTACCCCTTG GTCCCTGAGT CTCTGACCAT 2760 

TTGGGTGACC TTTGTCTCCA CTGACTGGGA CTCTAGTGGA GCTGTCAATG ACATCAAACT 2820

GTTGGCTGTC AGTGGGAAGA ACATCTCCCT GGGTCCTCAG AATGTCTTCT GTGATGTCCC 2880

ACTGACCATC AGACTCTGGG ACGTGGGCGA GGAGGTGTAT GGCATCCAAA TCTACACGCT 2940 

GGATGAGCAC CTGGAGATCG ATGCTGCCAT GTTGACCTCC ACTGCAGACA CCCCACTCTG 3000

TCTACAGTGT AAGCCCCTGA AGTATAAGGT GGTCCGGGAC CCTCCTCTCC AGATGGATGT 3060

GGCCTCCATC CTACATCTCA ATAGGAAATT CGTAGACATG GATCTAAATC TTGGCAGTGT 3120
GTACCAGTAT TGGGTCATAA CTATTTCAGG AACTGAAGAG AGTGAGCCAT CACCTGCTGT 3180

CACATACATC CATGGACGTG GGTACTGTGG CGATGGCATT ATACAAAAAG ACCAAGGTGA 3240 

ACAATGCGAC GACATGAATA AGATCAATGG TGATGGCTGC TCCCTTTTCT GCCGACAAGA 3300

AGTCTCCTTC AATTGTATTG ATGAACCCAG CCGGTGCTAT TTCCATGATG GTGATGGGGT 3360

ATGTGAGGAG TTTGAACAAA AAACCAGCAT TAAGGACTGT GGTGTCTACA CGCCCCAGGG 3420 

ATTCCTGGAT CAGTGGGCAT CCAATGCTTC AGTATCTCAT CAAGACCAGC AATGCCCAGG 3480 



CAGTGAAGGC ATTTCCCAGC ATGCCTGGTA CCCTTGCACC ATCAGCTACC CATATTCCCA 3600

GCTGGCTCAG ACCACTTTTT GGCTCCGGGC GTATTTTTCT CAACCAATGG TTGCCGCAGC 3660 

TGTCATTGTC CACCTGGTGA CGGATGGGAC ATATTATGGG GACCAAAAGC AGGAGACCAT 3720 

CAGCGTGCAG CTGCTTGATA CCAAAGATCA GAGCCACGAT CTAGGCCTCC ATGTCCTGAG 3780 

CTGCAGGAAC AATCCCCTGA TTATCCCTGT GGTCCATGAC CTCAGCCAGC CCTTCTACCA 3840

CAGCCAGGCG GTACGTGTGA GCTTCAGTTC GCCCCTGGTC GCCATCTCGG GGGTGGCCCT 3900 

CCGTTCCTTC GACAACTTTG ACCCCGTCAC CCTGAGCAGC TGCCAGAGAG GGGAGACCTA 3960 

CAGCCCTGCC GAGCAGAGCT GCGTGCACTT CGCATGTGAG AAAACTGACT GTCCAGAGCT 4020 

GGCTGTGGAG AATGCTTCTC TCAATTGCTC CAGCAGCGAC CGCTACCACG GTGCCCAGTG 4080 
TACTGTGAGC TGCCGGACAG GCTACGTGCT CCAGATACGG CGGGATGATG A 
G GGACCCAGCG TCACAGTGAC CTGTACAGAG G " 



CTCCTGCCCT GAGGGCACCA CCTTTGGCAG TCAATGTTCC TTCCAGTGCC G 

A GGCAACAACA GCCTCCTGAC CTGCATGGAG GATGGGCTGT GGTCCTTCCC 

GACOGCCCGG TGCCGAGAGA ATAAGCACAA GGTGGGCTCC TTCTGCAAAT ACAAATGCAA 
C CATGTGCCTG GATCCTCTCG GAAGTCAAAG AAACGGGCCT TCAAGACTCA 
G GATGGCAGCT GGCAGGAGGG AGCTTGTGTT CCTGTGACCT GTGACCCACC 
TCCACCAAAA TTCCATGGGC TCTACCAGTG TACTAATGGC T TCCAGTTCA A 
TAGGATCAAG TGTGAAGACA GTGATGCCTC CCAGGGACTT GGGAGCAATG T 

CCGGAAAGAT GGCACCTGGA ACGGCTCCTT CCATGTCTGC CAGGAGATGC AAGGCCAGTG 4800 

CTCGGTTCCA AACGAGCTCA ACAGCAACCT CAAACTGCAG TGCCCTGATG GCTATGCCAT 4860 

AGGGTCGGAG TGTGCCACCT GGTGCCTGGA CCACAACAGC GAGTCCATCA TCCTGCCAAT 4920

GAACGTGACC GTGCGTGACA TCCCCCACTG GCTGAACCCC ACACGGGTAG AGAGAGTTGT 4980 

CTGCACTGCT GGTCTCAAGT GGTATCCTCA CCCTGCTCTG ATTCACTGTG TCAAAGGCTG 5040 

TGAGCCCTTC ATGGGAGACA ATTATTGTGA TGCCATCAAC AACCGAGCCT TTTGCAACTA 5100 

TGACGGTGGG GATTGCTGCA CCTCCACAGT GAAGACCAAA AAGGTCACCC CATTCCCTAT 5160 

GTCCTGTGAC CTACAAGGTG ACTGTGCTTG TCGGGACCCC CAGGCCCAAG AACACAGCCG 5220 

GAAAGACCTC CGGGGATACA GCCATGGCTA AGGAAGGACA AGAAGTTGTC AAAGAATTCC 5280 

CAACGCCAGG ACCCACATCC CTTTGGTATT GATTTCACAG TCAGCTGCTC AACGGAATGG 5340 

CCTCTCCACA CCAGGGATCC TTAGCACCCA ACCGGTCTGC CTTTAATTTT ACCCAGGAAG 5400 

GACTCACATT GGGGCGAATG AACCAAGTTT CGCCATGCTG GATGATGAAA TGGATTCCCA 5460 

TCCCAAAGTC TGAGATGGAT TCCATATACA CTGTGCAGTC CCAGAGCCTC CTAAAATTCT 5520 
AGCCATTTGT CACACAACCA CAGCAAAAAA AAA 

Seq ID NO: B29 Protein sequence
Protein Accession #: NP_002572.1
GLLSAALGCG LAERPRRARR DPRAGRPPRP AAGPATCATR GPRPPRLAAA 60
\ VRVPRRRQQR EARGATEEPS PPSRALYFSG RGEQLRVLRA D 
LQVWLRAEGG QRSPAVITGL YDKCSYISRD RGWVVGIHTI SDQDNKDPRY F

QVTTINAHRS YLPGQWVYLA ATYDGQFMKL YVNGAQVATS GEQVGGIFSP LTQKCKVLML 240 

GGSALNHNYR GYIEHFSLWK VARTQREILS DMETHGAHTA LPQLLLQENW DNVKHAWSPM 300

KDGSSPKVEF SNAHGFLLDT SLEPPLCGQT LCDNTEVIAS YNQLSSFRQP KVVRYRVVNL 360

YEDDHKNPTV TREQVDFQHH QLAEAFKQYN ISWELDVLEV SNSSLRRRLI LANCDISKIG 420

DENCDPECNH TLTGHDGGDC RHLRHPAFVK KQHNGVCDMD CNYERFNFDG GECCDPEITN 480

VTQTCFDPDS PHRAYLDVNE LKNILKLDGS THLNIFFAKS SEEELAGVAT WPWDKEALMH 540

F YGMPGHTHTM IHEIGHSLGL YHVFRGISEI QSCSDPCMET EPSFETGDLC 600

)PGPGND TCGFHSFFNT PYNNFMSYAD DDCTDSFTPN QVARMHCYLD 660 

LVYQGWQPSR KPAPVALAPQ VLGHTTDSVT LEWFPPIDGH FFERELGSAC HLCLEGRILV 720

QYASNASSPM PCSPSGHWSP REAEGHPDVE QPCKSSVRTW SPNSAVNPHT VPPACPEPQG 780

CYLELEFLYP LVPESLTIWV TFVSTDWDSS GAVNDIKLLA VSGKNISLGP QNVFCDVPLT 840

IRLWDVGEEV YGIQIYTLDE HLEIDAAMLT STADTPLCLQ CKPLKYKVVR DPPLQMDVAS 900

ILHLNRKFVD MDLNLGSVYQ YWVITISGTE ESEPSPAVTY IHGRGYCGDG IIQKDQGEQC 960 

DDMNKINGDG CSLFCRQEVS FNCIDEPSRC YFHDGDGVCE EFEQKTSIKD CGVYTPQGFL 1020 

DQWASNASVS HQDQQCPGWV IIGQPAASQV CRTKVIDLSE GISQHAWYPC TISYPYSQLA 1080

QTTFWLRAYF SQPMVAAAVI VHLVTDGTYY GDQKQETISV QLLDTKDQSH DLGLHVLSCR 1140

NNPLIIPVVH DLSQPFYHSQ AVRVSFSSPL VAISGVALRS FDNFDPVTLS SCQRGETYSP 1200

AEQSCVHFAC EKTDCPELAV ENASLNCSSS DRYHGAQCTV SCRTGYVLQI RRDDELIKSQ 1260

TGPSVTVTCT EGKWNKQVAC EPVDCSIPDH HQVYAASFSC PEGTTFGSQC SFQCRHPAQL 1320 

KGNNSLLTCM EDGLWSFPEA LCELMCLAPP PVPNADLQTA RCRENKHKVG SFCKYKCKPG 1380

YHVPGSSRKS KKRAFKTQCT QDGSWQEGAC VPVTCDPPPP KFHGLYQCTN GFQFNSECRI 1440 

KCEDSDASQG LGSNVIHCRK DGTWNGSFHV CQEMQGQCSV PNELNSNLKL QCPDGYAIGS 1500 

ECATWCLDHN SESIILPMNV TVRDIPHWLN PTRVERVVCT AGLKWYPHPA LIHCVKGCEP 1560

FMGDNYCDAI NNRAFCNYDG GDCCTSTVKT KKVTPFPMSC DLQGDCACRD PQAQEHSRKD 1620
LRGYSHG 

Seq ID NO: B30 DNA sequence 

Nucleic Acid Accession #: NM_032808.1 

Coding sequence: 61..1008
GTGGAGCCCT ATGCCTTCCG CGGCCTCAAC TACCTGCGCG TGCTCAATGT CTCTGGCAAC

CAGCTGACCA CACTGGAGGA ATTAGTCTTC CACTCGGTGG GCAACCTGGA GACACTCATC 

CTGGACTCCA ACCCGCTGGC CTGCGACTGT CGGCTCCTGT GGGTGTTCCG GCGCCGCTGG 

CGGCTCAACT TCAACCGGCA GCAGCCCACG TGCGCCACGC CCGAGTTTGT CCAGGGCAAG 

GAGTTCAAGG ACTTCCCTGA TGTGCTACTG CCCAACTACT TCACCTGCCG CCGCGCCCGC 

ATCCGGGACC GCAAGGCCCA GCAGGTGTTT GTGGACGAGG GCCACACGGT GCAGTTTGTG 

TGCCGGGCCG ATGGCGACCC GCCGCCCGCC ATCCTCTGGC TCTCACCCCG AAAGCACCTG
GTCTCAGCCA AGAGCAATGG GCGGCTCACA GTCTTCCCTG ATGGCACGCT GGAGGTGCGC

GACTCCATGC CCGCCCACCT GCATGTGCGC A 




G CCGGGCAGGG 

CCAGTCCTTC CCACCTCCTC CCTACCCTTC TACACACGTT CTCTTTCTCC 
CGTCCCCTGC TGCCCCCCGC CAGCCCTCAC CACCTGCCCT CCTTCTACCA GGACCTCAGA 
AGCCCAGACC TGGGGACCCC ACCTACACAG GGGCATTGAC AGACTGGAGT 
CGAACCGACA CGCGGCAGAG TCAATAATTC AATAAAAAAG TTACGAACTT 
TTGGGTTTCA ATAATTATGG ATTTTTATGA AAACTTGAAA TAATAAAAAG AGAAAAAAAC 
TATTTCCTAT AGCTAGTCGG AATGCAAACT TTTGACGTCC TGATTGCTCC 
TCCAACTCAG TTTCTTGTTT TTCTCTTCCT 
CTTCCCCAGT GGGGAGGGAT CACTCAGGAA AACAGGAAAG 
CTGCCCACCC CGCCCCAGGC ACCATCAGGA GCAGGCTAGG 
CCGGGCTGGC TTTTTGCAGG GCGCAGGTGG AGGGGACAGG TCTGCCGATG GGGGTGGGAG 
CCTGTCTGCT GGGCTGCCAG GCGGCACCAC TGCAAGGGGT GGGAGCCTGG CTTGGGTGTG 
GCTGAGACTC TGGACAGAGG CTGGGGTCCT CCTGGGGGAC AGCACAGTCA GTGGAGAGAG 



GTGTGGCCTC AAGCAGGTCA CTGGCCTCTC TGGGCCTCAG TCTCCACATC TGTACAAATG 
GGAACATTAC CCCCTGCCCT GCCTACCTCA CAGGGCTGTT GTGAGGAATT GATGAGATGA 
TGTATGTGAA ACACTTTGTA ACCTGTAAAG CGCTGTGCAC ACGTG
MLHELLRLQE IQLVGGQLAV VEPYAFRGLN YLRVLNVSGN QLTTLEELVF HSVGNLETLI
LDSNPLACDC RLLWVFRRRW RLNFNRQQPT CATPEFVQGK EFKDFPDVLL PNYFTCRRAR 
IRDRKAQQVF VDEGHTVQFV CRADGDPPPA ILWLSPRKHL VSAKSNGRLT VFPDGTLEVR 
YAQVQDNGTY LCIAANAGGN DSMPAHLHVR SYSPDWPHQP NKTFAFISNQ PGEGEANSTR 
ATVPFPFDIK TLIIATTMGF ISFLCWLFC LVLLFLWSRG KGNTKHNIEI EYVPRKSDAG 



CAGCACATAC AAGAAACATA CAGTGTACCT CAAAAGGGGC CCTTGAAATG TCATCAAAGG 
GTAATAACCT AGTGAGTGAG TTGTGATGTC ATCTGGAACA TAGGAAATGG GGCTCTTAGC 
iGGAAGC CAGGCTGGTC CTGGCAGGAA GTAAATGATA A 



TGGCACAGCA CTGACAAAGG TAGAGGGAAA TGTAATAGCA CATCTACGCT GCAGTCTGGT 
GAAAGTGGCC GGGGTGGTCC TTGGAAAACA GTTGGGCTGT TCTTGGCAGG AATTAGTGAC 
AGCCTTTCCG TCACGGGCAG GGACGCCTTG ATTTAAAAAA AATAAATAAA TAAAACGTCT 



Seq ID NO: B33 DNA sequence
Nucleic Acid Accession #:
Coding sequence: 71..1408



h ATTCTGATTT CCCAGTCTGG GATGACTATA AAAGCAGTGT 
AGATGACTTA CAGTATTTTC TGATTGGGCT C 
GGGGAATCTA CTTATTTTAA TGGCTCTCAT G 
CTTCCTCATA GGCAATCTGG CCTTTTCTGA T 
CACACTGACG TCTGTCTTGC TGGATCAGTG G 
GCCTTTTCTT CAATGTGTGT CAGTTTTGGT TTCAACTTTA A 
TGTCAGGTAT CATATGATAA AACATCCCAT ATCTAATAAT TTAACAGCAA ACCATGGCTA 
CTTTCTGATA GCTACTGTCT GGACACTAGG TTTTGCCATC TGTTCTCCCC TTCCAGTGTT 
TCACAGTCTT GTGGAACTTC AAGAAACATT TGGTTCAGCA TTGCTGAGCA GCAGGTATTT 
ATGTGTTGAG TCATGGCCAT CTGATTCATA CAGAATTGCC TTTACTATCT CTTTATTGCT
AGTTCAGTAT ATTCTGCCCT TAGTTTGTCT TACTGTAAGT CATACAAGTG TCTGCAGAAG 
TATAAGCTGT GGATTGTCCA ACAAAGAAAA CAGACTTGAA GAAAATGAGA TGATCAACTT 
AACTCTTCAT CCATCCAAAA AGAGTGGGCC TCAGGTGAAA CTCTCTGGCA GCCATAAATG
GAGTTATTCA TTCATCAAAA AACACAGAAG AAGATATAGC AAGAAGACAG CATGTGTGTT 
ACCTGCTCCA GAAAGACCTT CTCAAGAGAA CCACTCCAGA ATACTTCCAG AAAACTTTGG 
CTCTGTAAGA AGTCAGCTCT CTTCATCCAG TAAGTTCATA CCAGGGGTCC CCACTTGCTT 
TGAGATAAAA CCTGAAGAAA ATTCAGATGT TCATGAATTG AGAGTAAAAC GTTCTGTTAC
AAGAATAAAA AAGAGATCTC GAAGTGTTTT CTACAGACTG ACCATACTGA T,
TGCTGTTAGT TGGATGCCAC TACACCTTTT CCATGTGGTA ACTGATTTTA ATGACAATCT 1260
TATTTCAAAT AGGCATTTCA AGTTGGTGTA TTGCATTTGT CATTTGTTGG GCATGATGTC 1320
CTGTTGTCTT AATCCAATTC TATATGGGTT TCTTAATAAT GGGATTAAAG CTGATTTAGT 1380 
TGICTTC AT AT GT AATA ATTCTCACTG TTT 

Seq ID NO: B34 Protein sequence



ATRNSDFPVW DDYKSSVDDL QYFLIGLYTF VSLLGFMGNL 



QCVSVLVSTL ILISIAIVRY HMIKHPISNN LTANHGYFLI ATVWTLGFAI C 
VELQETFGSA LLSSRYLCVE SWPSDSYRIA FTISLLLVQY ILPLVCLTVS HTSVCRSISC 
GLSNKENRLE ENEMINLTLH PSKKSGPQVK LSGSHKWSYS FIKKHRRRYS KKTACVLPAP 
ERPSQENHSR ILPENFGSVR SQLSSSSKFI PGVPTCFEIK PEENSDVHEL RVKRSVTRIK 
KRSRSVFYRL TILILVFAVS WMPLHLFHVV TDFNDNLISN RHFKLVYCIC HLLGMMSCCL
NPILYGFLNN GIKADLVSLI HCLHM 



I I I I I I 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCf 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240

TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AGTGCTGCCC ACCAACCCTG AGGAGAGCTG GCAGGTGTAC 420 

AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTCGCCCC ACAGCAGACC 480 

ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA GGTGCAGAAC 540

ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTACGTGGAG 600 

AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 660 

AAGCAACACC TGGCCAGGCA GTTTAAGGCG ATAAAAGCGA AAATGGATGA ACTTAGGCCT 720

TTGATACCTG TGTTGGAAGA GTACAAGGCC GATGCCAAAT TGGTATTGCA GTTTAAAGAG 780

GAGGTCCAGA ATCTGACGTC AGTGCTTAAC GAGCTGCAAG AGGAAATTGG CGCCTATGAC 840

TACGATGAAC TTCAGAGCAG AGTGTCCAAT CTTGAAGAAA GGCTCCGTGC ATGCATGCAA 900

AAACTAGCTT GCGGGAAGTT GACGGGCATC AGTGACCCCG TGACTGTCAA GACCTCCGGC 960

TCGAGGTTCG GATCCTGGAT GACAGACCCT CTCGCCCCTG AAGGCGATAA CCGGGTGTGG 1020

TACATGGACG GCTATCACAA CAACCGCTTC GTACGTGAGT ACAAGTCCAT GGTTGACTTC 1080

ATGAACACGG ACAATTTCAC CTCCCACCGT CTCCCCCACC CCTGGTCGGG CACGGGGCAG 1140

GTGGTCTACA ACGGTTCTAT CTACTTCAAC AAGTTCCAGA GCCACATCAT CATCAGGTTT 1200 

GACCTGAAGA CAGAGACCAT CCTCAAGACC CGCAGCCTGG ACTATGCCGG TTACAACAAC 1260 

ATGTACCACT ACGCCTGGGG TGGCCACTCG GACATCGACC TCATGGTGGA CGAGAGCGGG 1320 

CTGTGGGCCG TGTACGCCAC CAACCAGAAC GCTGGCAACA TCGTGGTCAG TAGGCTGGAC 1380 

CCCGTGTCCC TGCAGACCCT GCAGACCTGG AACACGAGCT ACCCCAAGCG CAGCGCCGGG 1440

GAGGCCTTCA TCATCTGCGG CACGCTGTAC GTCACCAACG GCTACTCAGG GGGTACCAAG 1500

GTCCACTATG CATACCAGAC CAATGCCTCC ACCTATGAAT ACATCGACAT CCCATTCCAG 1560

AACAAATACT CCCACATCTC CATGCTGGAC TACAACCCCA AGGACCGGGC CCTGTATGCC 1620 

TGGAACAACG GCCACCAGAT CCTCTACAAC GTGACCCTCT TCCACGTCAT CCGCTCCGAC 1680 

GAGTTGTAGC TCCCTCCTCC TGGAAGCCAA GGGCCCACGT CCTCACCACA AAGGGACTCC 1740

TGTGAAACTG CTGCCAAAAA GATACCAATA ACACTAACAA TACCGATCTT GAAAAATCAT 1800 

CAGCAGTGCG GATTCTGACA TCGAGGGATG GCATTACCTC CGTGTTTCTC CCTTTCGAGC 1860 

CGGCGGGCCA CAGACGTCGG AAGAAACTCC CGTATTTGCA GCTGGAACTG CAGCCCACGG 1920 

CGCCCCGGTT TTCCTCCCCG CCCTGTCCCT CTCTGGTCAA ACAACATACT AAAGAGGCGA 1980 

GGCAATGACT GTTGGCCAGT TCTCACCGGG GAAAAACCCA CTGTTAGGAT GGCATGAACA 2040

TTTCCTTAGA TCGTGGTCAG CTCCGAGGAA TGTGGCGTCC AGGCTCTTTG AGAGCCATGG 2100 

GCTGCACCCG GCCGTAGGCT AGTGTAACTC GCATCCCATT GCAGTGCCGT TTCTTGACTG 2160 

TGTTGCTGTC TCTTAGATTA ACCGTGCTGA GGCTCCACAT AGCTCCTGGA CCTGTGTCTA 2220

GTACATACTG AAGCGATGGT CAGAGTGTGT AGAGTGAAGT TGCTGTGCCC ACATTGTTTG 2280

AACTCGCGTA CCCCGTAGAT ACATTGTGCA ACGTTCTTCT GTTATTCCCT TGAGGTGGTA 2340

ACTTCGTATG TTCAGTTTAT GCGATGATTG TTGTAAATGC AATGCCGTAG TTTGGATTAA 2400 

TAAGTGGATG GTTTTTGTTT CTAAAAAGAA AAAAAAAATC AGTGTTCACC CTTATAGAGA 2460 

CATAGTCAAG TTCATGTTGA TAATAATCAA AGGAATTACT CTCTTCTTGT TAAATTAGCT 2520 

AAATCATGTA ACCGCAGATA GGAAGGGCTC ACCTGGGGAA ACTCTGGTTT CCGATGGGAC 2580 

AGGAAAGTCA TACGGGCAAC AGTATGCGGA AAGTACGTTT TTTTAAGTAA AAAACAAAGG 2640

CAAACTTTGT ACTATCCAGT TATCTAAGGA ACAATAAAAA CATTAGGAGA AAAAAAAAAA 2700
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 


I I I I I I

MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQ\L?TNE _ ^ " r RCICTW 

APQQTMCSRD ARTKQLRQLL EKVQNMSQSI EVLDRRTQRD LQYVEKMENQ MKGLESKFKQ
VEESHKQHLA RQFKAIKAKM DELRPLIPVL EEYKADAKLV LQFKEEVQNL TSVLNELQEE
IGAYDYDELQ SRVSNLEERL RACMQKLACG KLTGISDPVT VKTSGSRFGS WMTDPLAPEG
DNRVWYMDGY HNNRFVREYK SMVDFMNTDN FTSHRLPHPW SGTGQVVYNG SIYFNKFQSH
IIIRFDLKTE TILKTRSLDY AGYNNMYHYA WGGHSDIDLM VDESGLWAVY ATNQNAGNIV
Nucleic Acid Accession #: NM_006334.1



I I I I I I 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC C' 
CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA A 
C GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGG1GCT 
3 CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCJ 
A CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC A 
C GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 



AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTCGCCCC A 
ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA G 
ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTACGTGGAG 
AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 
AAGCAACACC TGGCCAGGCA GTTTAAGGGC TAACTTAAAA GAGTTTTTTC AATGCTGCAG 
TGACTGAAGA AGCAGTCCAC TCCCATGTAA CCATGAAAGA GAGCCAGAGA GCTTTTTGCA
CCATGCATTT TTACTATTAT TTTCCAATAC TTAGCACCAT TTCACTAAGG AACCTTGAAT 
ACAACCAGGA TCCTCCTTTG CATGCGACTG TAGCTGCATT TCATGAATAG TTTGAACCCT 
TGTCAATGCA TTTTTTGAAA AAGAAAGAAA AAAAAAACTT CGTGTATGTG ACTCAAAGCA 
TGTAACCTTA AGATGTTGCA TTCTAAACTG ACAATAAAGA CCTTTCCCC 

Seq ID NO: B38 Protein sequence 
Protein Accession #: NP_006325.1



Seq ID NO: B39 DNA sequence 

Nucleic Acid Accession #: NM_058199.1 

Coding sequence: 286.. 795 


1 11 21 31 41 51 

I I I I I I 

GCGCGGGGGA CCCATTACCA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 
CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG

GGGACACCAC CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 
TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 
AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 
ATGGGCACTC AACTCACTCA AAATAAAAGA GAAAACAAAG CAGAGAAGAT GGGAGGGCCA 

GAGAGCGAGA GGAAGACCAC AGGAGAGAAG ACACTGAACG AGCTTCCCTT GTTTTGCCTG
GAAGCCCACG CTGGCTCCCT GGCTCTGCCC AGGATGTGCA GTCCAAATCC CAATCCAGCA 
GTGGGGTTAT GTCGTCCCGC TTACCCTCAG AGCCCTTCTC CTGGTGCTGC CCAGACGATC 
AGCCAGTCCC TCCTGGAGAG GTTCTGCATG GCCTCTAGGA GAGAAGTTTT CTTGGCCCCA 
GGAAGGCCTG GTGGAGGGTG GTGGTTGTGC ACTGTTGCTG GACAGATGCA TTCATTCATG 

TGCACACACA CACACACACA TGCACACACA GGGGAGCAGA TACCTGCAGA GAAGAGCCAA
CCAGGTCCTG ATTAGTGGCA AGCTGCCCCA CAAAGGGCTA TGCCTGTGTC TTATTGAGAC 
ACCTTGGCAA AGAGATGGCT GATTCTGGGT GGTCCTGGAC ATGGCCGCAC CCAAGGGCCC 
TCCAAGCCTT AATGGCACCC TGAAGCCTCC ATGCCCAGGC CAAAAGATGC TTTTCCTCCC 
TAAAAAAAAA A 






MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQNKRENKAE KMGGPESERK TTGEKTLNEL

PLFCLEAHAG SLALPRMCSP NPNPAVGLCR PAYPQSPSPG AAQTISQSLL E

VFLAPGRPGG GWWLCTVAGQ MHSFMCTHTH THAHTGEQIP AEKSQPGPD



Coding sequence 



ATGAGCAATC AGTACCAGGA GGAGGGCTGC TCCGAGAGGC CCGAGTGCAA AAGTAAATCT
CCAACTTTGC TCTCCTCCTA CTGCATCGAC AGCATCCTGG GCCGGAGGAG CCCGTGCAAA 
ATGCGGTTGC TGGGAGCCGC GCAGAGCTTG CCTGCTCCGC TGACCAGCCG CGCCOACCCG 
GAAAAGGCCG TGCAAGGCTC CCCTAAGAGC AGCAGCGCCC CGTTCGAGGC CGA( CTGCAi 
CTGCCGCCCA AGCTGCGGCG CCTGTACGGC CCGGGCGGGG GCCGCCTCCT TCAGGGTGCG 



CCACGCGGGG AGGCCCCTCC GCCGCCACCG C 
GGCGCAGGGG CCGCCGCGGC AGCCGCGGCC GCGGCCGCCG CGGCCTGGGA CACGCTCAAG
ATCAGCCAGG CGCCGCAGGT GAGCATCAGC CGCAGCAAGT C ~*
CCCTTCGTGC CGCCGCCGCC CGCGCTGGAC GAGCTGGGCG GCCCGGGGGG CGTCACGCAC 600

T GGCCGGCGGC CCGGGCAGCG CCCCGGCTGC GGGTGGTGGC 660 

3 AGGACGACGA GGAGGAGCTG CTGGAGGACG AAGAAGATGA GGACGAGGAA 720

GAGGAACTGC TGGAGGACGA CGAGGAGGAG CTGCTGGAGG ACGACGCCCG CGCGCTGCTC 780 

AAGGAGCCCC GGCGCTGTCC TGTGGCCGCC ACTGGCGCCG TGGCCGCACC ACCTGCCGCT 840 

GCAGTGGCCA CAGAGGGCGG GGAGCTGTCA CCCAAGGAGG AGCTGCTGCT GCACCCGGAA 900 

3 GCAAGGACGG CGAGGACAGC GTGTGCCTCT CTGCGGGCAG CGACTCGGAG 960 

3AGGCGC TACCGCACCA CGTTCACCAG CTACCAGCTG 1020 

3AAGACG CACTACCCGG ACGTCTTCAC CAGGGAGGAA 1080

I GACCGAGGCC CGAGTCCAGG TCTGGTTCCA GAACCGTCGG 1140 

GCCAAGTGGC GCAAGCGGGA GAAGGCAGGC GCGCAGACCC ACCCCCCTGG GCTGCCCTTC 1200

CCGCACCACC CGGCGCTCGA CTCCGCTTGG ACTGCCGCTG CU < ( LLC 1320 

TTCCCGAGCC TACCTCCGCC TCCGGGCTCG GCCAGCCTGC CGCCCAGCGG GGCGCCGCTG 1380



TTCGGCAGGC TCTTTTCCAC AATGGCCCCC CTGACCAGCG CGTCGACCGC GGCCGCGCTC 1500 

CTGAGACAGC CCACACCCGC CGTGGAGGGC GCAGTGGCAT CGGGCGCCCT GGCCGACCCG 1560 

GCCACGGCGG CCGCAGACAG ACGCGCCTCT AGCATAGCCG CGCTGAGGCT CAAGGCCAAG 1620 

GAGCACGCGG CGCAGCTCAC GCAGCTCAAC ATCCTGCCGG GCACCAGCAC GGGCAAGGAG 1680 



I I I I I 1 

MSNQYQEEGC SERPECKSKS PTLLSSYCID SILGRRSPCK MRLLGAAQSL P."*
EKAVQGSPKS SSAPFEAELH LPPKLRRLYG PGGGRLLQGA AAAAAAJ"' " 
PRGEAPPPPP PTARPGERPD GAGAAAAAAA AAAAAWDTLK ISQAPQVSIS R 

PFVPPPPALD ELGGPGGVTH PEERLGVAGG PGSAPAAGGG TGTEDDEEEL L

EELLEDDEEE LLEDDARALL KEPRRCPVAA TGAVAAAAAA AVATEGGELS PKEELLLHPE
DAEGKDGEDS VCLSAGSDSE EGLLKRKQRR YRTTFTSYQL EELERAFQKT HYPDVFTREE 
LAMRLDLTEA RVQVWFQNRR AKWRKREKAG AQTHPPGLPF PGPLSATHPL SPYLDASPFP 
PHHPALDSAW TAAAAAAAAA FPSLPPPPGS ASLPPSGAPL GLSTFLGAAV FRHPAFISPA 

FGRLFSTMAP LTSASTAAAL LRQPTPAVEG AVASGALADP ATAAADRRAS SIAALRLKAK

EHAAQLTQLN ILPGTSTGKE VC

Seq ID NO: B43 DNA sequence 

Nucleic Acid Accession #: CAT cluster 


1 11 21 31 41 51 

I I I I I 1 

CGCCTGCCCC CCCCTTCCTT CTCTCTCTTC CCTTCTGCCT GCGTGCTCGC CCGTCTTTGG 
TTCCGCCGGT TCCGTGCGCC CTCTCTGGGC TTCGGCTTCA CCCTCCGTCT GGCTGTCGCT 

CTGCTCTCCT CCGCTCTCAT GCCCCTTTCA GCTCCTACTC TGGTGTCGTC CCCCTCCCTC

GCTCGTTTCT CTTCTTCCTC TCCTTCCCGG TTCGTGTTCT GCTTCTTCGT GTTCCCTCTC 
GTCTGTGTTT GTGGGTCGTC CTCTGCGTCC TGCCTCTTTC TCTTGCGTTT CCCCGCGCTT 
CCTGTTTTGG TCCCCCTGGT TCTCGCGCGG TTCTGTGTGC GCGGGCCGCC CCGTTCTTTC 
TTGGTGCTTT GCTCCTGCTC TCTGTCTGGT GTTCTCCGCC GTCTGTGTCT TTCTCCTTTT 

ATCTTCTGTT TTCTCTTTTT CTTGTTTCTT CTTGTTTTTT TTTGTCTTTG TTTTCTCTTA
GCTGAGGTTG GGGAGAGATA ACGCTGTAAA CTTTTATTTT TCAGGAAATC TGGAAACCTA 
CAGTCTCCAT GCCTGCTCAG CCAAGAAGGA GCTCACTGTG GGCACCAGAG ACAGGGACCC 
AATGTGGAGA CCTGTGAGCC TGTGTCCGGC CCTGAACTCT CAAGCACAGG GCAGGCTTCC 
TGAGCATTGA AGAGAATATG TGGGAGAACA AAACAGAAAC TGAAAGAATA TGCAAGGTGT 

CTTTCTTGGA TGTTATTCCA TGATAGATAG TAGGGGCAGG AGTGAGAGAG GCTGACTAGG

TCTGGACATG GAGGCTGGAA GAGTCAGGGT GTGATTCGGA GAGGCGCATG AGAAGGAAGG
TGGATTTTAA GGCTGGAAAT CTGAGGGTCA GTGGTCCAAG TCACTCAGAG ACAGAATCAC







TCCTGATCCC CTACTAAAGC GCCTGCTAAC r^,™.^™ _ 

TTGACACATA AAGGATAGGA AAAGAGAGGA GAGAAAAGCA ACTGAGTTGA A
GCTGATGCGG GCCTCCTGAT CAATTAAGAG GAGAGTTAAA C 
CCAAGGAGGT GCGGGGCAAG A 

TGCACCTTGG GTCACGCCTC CTTGGCGAGA AAGCGCCTCG CATTTGATTG CTTCCAGTTA 
TTGCAGAACT TCCTGTCCTG GTGGAGAAGC GGGTCT 
CCTGAGGCGT GAGACTGAGT T 

AACACAATCT GCAAGCCCCC GCGACCCAAG TGAGGGGCCC CGTGITGGGG T 
CTTTGCATTC CCACCCCTCC GGGCTTTGCG TCTTCCTGGG GACCCCCTCG CCGGGAGATG 
GCCGCGTTGA TGCGGAGCAA GGATTCGTCC TGCTGCCTGC TCCTACTGGC CGCGGTGCTG
ATGGTGGAGA GCTCACAGAT CGGCAGTTCG CGGGCCAAAC TCAACTCCAT C
CTGGGCGGGG AGACGCCTGG TCAGGCCGCC AATCGATCTG CGGGCATGTA C

GCATTCGGCG GCAGTAAGAA GGGCAAAAAC CTGGGGCAGG CCTACCCTTG TAGCAGTGAT 960
AAGGAGTGTG AAGTTGGGAG GTATTGCCAC AGTCCCCACC AAGGATCATC GGCCTGCATG 1020
GTGTGTCGGA GAAAAAAGAA GCGCTGCCAC CGAGATGGCA TGTGCTGCCC CAGTACCCGC 1080
TGCAATAATG GCATCTGTAT CCCAGTTACT GAAAGCATCT TAACCCCTCA CATCCCGGCT 1140
r ACTCAAACCA TGACTTGGGA 1200

3 ATGTCACATA TAAAAGGGCA TGAAGGAGAC 1260 

CCCTGCCTAC GATCATCAGA CTGCATTGAA GGGTTTTGCT GTGCTCGTCA TTTCTGGACC 1320

AAAATCTGCA AACCAGTGCT CCATCAGGGG GAAGTCTGTA CCAAACAACG CAAGAAGGGT 1380

TCTCATGGGC TGGAAATTTT CCAGCGTTGC GACTGTGCGA AGGGCCTGTC TTGCAAAGTA 1440

TGGAAAGATG CCACCTACTC CTCCAAAGCC AGACTCCATG TGTGTCAGAA AATTTGATCA 1500

CCATTGAGGA ACATCATCAA TTGCAGACTG TGAAGTTGTG TATTTAATGC ATTATAGCAT 1560

GGTGGAAAAT AAGGTTCAGA TGCAGAAGAA TGGCTAAAAT AAGAAACGTG ATAAGAATAT 1620 

AGATGATCAC AAAAAGGGAG AAAGAAAACA TGAACTGAAT AGATTAGAAT GGGTGACAAA 1680 

TGCAGTGCAG CCAGTGTTTC CATTATGCAA CTTGTCTATG TAAATAATGT ACACATTTGT 1740 

GGAAAATGCT ATTATTAAGA GAACAAGCAC ACAGTGGAAA TTACTGATGA GTAGCATGTG 1800 
ACTTTCCAAG AGTTTAGGTT G 
TACAAATAAC CTACATGCCA GATTTCTATT C 

T TATACAATAG GTTCTAAAAA TAAAATTGCT AAACAAGAAA TGAAAACATG 1980 




CAATCAAGAG TCTTGGTAGA TAAGAAAAAA ATCAGTCAAT A 
TAATGGCCAG TTGTTTAGGA AGGCCTTTAG G 

T ACCTGTAATT A 

T CCTTCAGATT C 
CCTGCTCTGA ATGCATTATA TTTTCCAAAG T 
TTACACAGAG CAGAATTTTC ACAGATGGCA AAAAAATT 

GAAAAGAGCT AACAGAGAGA TCATTATTTC TTAAAGATTG GCCATAACCT GTATTTTGAT 

AGAATTAGAT TGGTAAATAC ATGTATTCAT ACATACTCTG TGGTAATAGA GACTTGAGCT 2520

GGATCTGTAC TGCACTGGAG TAAGCAAGAA AATTGGGAAA ACTTTTTCGT TTGTTCAGGT 2580

TTTGGCAACA CATAGATCAT ATGTCTGAGG CACAAGTTGG CTGTTCATCT TTGAAACCAG 2640

GGGATGCACA GTCTAAATGA ATATCTGCAT GGGATTTGCT ATCATAATNT TTCCTATGCN 2700

GNTGAATTCN GTGTGAGGTC CTGTGTCCGT CCTATCCTCA AATTATTTAT TTTATAGTGC 2760 

TGAGATCCTC AAATAATCTC AATTTCGGAG GTTTCACAAA ATGGACTCCT GAAGTAGACA 2820 

GAGTAGTGAG GTTTCATTGC CCTCTATAAG CTTCTGACTA GCCAATGGCA TCATCCAATT 2880 

TTCTTCCCAA ACCTCTGCAG CATCTGCTTT ATTGCCAAAG GGCTAGTTTC GGTTTTCTGC 2940 

CAGCCATTGC GGTTAAAAAA TATAAGTAGG ATAACTTGTA AAACCTGCAT ATTGCTAATC 3000 

TATAGACACC ACAGTTTCTA AATTCTTTGA AACCACTTTA CTACTTTTTT TAAACTTAAC 3060 

TCAGTTCTAA ATACTTTGTC TGGAGCACAA AACAATAAAA GGTTATCTTA TAGTTGTGAC 3120 

TTTAAACTTT TGTAGACCAC AATTCACTTT TTAGTTTTCT TTTACTTAAA TCCCATCTGC 3180 

AGTCTCAAAT TTAAGTTCTC CCAGTAGAGA TTGAGTTTGA GCCTGTATAT CTATTAAAAA 3240 

TTTCAACTTC CCACATATAT TTACTAAGAT GATTAAGACT TACATTTTCT GCACAGGTCT 3300 

GCAAAAACAA AAATTATAAA CTAGTCCATC CAAGAACCAA AGTTTGTATA AACAGGTTGC 3360

TATAAGCTTG GTGAAATGAA AATGGAACAT TTCAATCAAA CATTTCCTAT ATAACAATTA 3420

TTATATTTAC AATTTGGTTT CTGCAATATT TTTCTTATGT CCACCCTTTT AAAAATTATT 3480

ATTTGAAGTA ATTTATTTAC AGGAAATGTT AATGAGATGT ATTTTCTTAT AGAGATATTT 3540

CTTACAGAAA GCTTTGTAGC AGAATATATT TGCAGCTATT GACTTTGTAA TTTAGGAAAA 3600

ATGTATAATA AGATAAAATC TATTAAATTT TTCTCCTCTA AAAACTGAAA AAAAAAAAAA 3660



MAALMRSKDS SCCLLLLAAV LMVESSQIGS SRAKLNSIKS SLGGETPGQA A 
LAFGGSKKGK NLGQAYPCSS DKECEVGRYC HSPHQGSSAC MVCRRKKKRC H 
RCNNGICIPV TESILTPHIP ALDGTRHRDR NHGHYSNHDL GWQNLGRPHT KnaMlnUHBLI 
DPCLRSSDCI EGFCCARHFW TKICKPVLHQ GEVCTKQRKK GSHGLEIFQR CDCAKGLSCK
VWKDATYSSK ARLHVCQKI 




ATGCTCTTTG AGGGCTTGGA T 

TCCGTGACGC TGCTGCTGGC CGTGTCGCAG CAGCTGTGGC A 

CGCGACAAGA GCTGCAAGCT GCCCATCCCC AAGGGATCCA TGGGCTTCCC GCTCATCGGA 
GAGACCGGCC ACTGGCTGCT GCAGGGTTCT GGCTTCCAGT CGTCGCGGAG GGAGAAGTAT 
GGCAACGTGT TCAAGACGCA TTTGTTGGGG CGGCCGCTGA TACGCGTGAC CGGCGCGGAG 
AACGTGCGCA A 
ACCCGCATGT T 

AAGCGCAAGG TCTTCTCCAA GATCTTCAGC CACGAGGCCC T 
ATCCAGCTGG TGATCCAGGA CACACTGCGC GCCTGGAGCA GCCACCCCGA G 

G AGGCGCAGAA GCTGACCTTC CGCATGGCCA TCCGGGTGCT GCTGGGCTTC 600 
3 AGGAGGACCT TGGGCACCTC TTTGAGGTCT ACCAGCAGTT TGTGGACAAT 660
GTCTTCTCCC TGCCTGTCGA CCTGCCCTTC AGTGGCTACC GGCGGGGCAT TCAGGCTCGG 720 
CAGATCCTGC AGAAGGGGCT GGAGAAGGCC ATCCGGGAGA AGCTGCAGTG CACACAGGGC 780 
AAGGACTACT TGGACGCCCT GGACCTCCTC ATTGAGAGCA GCAAGGAGCA CGGGAAGGAG 840 
ATGACCATGC AGGAGCTGAA GGACGGGACC CTGGAGCTGA TCTTTGCGGC CTATGCCACC 900 
ACGGCCAGCG CCAGCACCTC ACTCATCATG CAGCTGCTGA AGCACCCCAC TGTGCTGGAG 960 
AAGCTGCGGG ATGAGCTGCG GGCTCATGGC ATCCTGCACA GTGGCGGCTG CCCCTGCGAG 1020 
GGCACACTGC GCCTGGACAC GCTCAGTGGG CTGCGCTACC TGGACTGCGT CATCAAGGAG 1080 
GTCATGCGCC TGTTCACGCC CATTTCCGGC GGCTACCGCA CTGTGCTGCA GACCTTCGAG 1140 
CTTGATGGTT TCCAGATCCC CAAAGGCTGG AGTGTCATGT ATAGCATCCG GGACACCCAT 1200
GACACAGCGC CCGTGTTCAA AGACGTGAAC GTGTTCGACC CCGATCGCTT CAGCCAGGCG 1260
CGGAGCGAGG ACAAGGATGG CCGCTTCCAT TACCTCCCGT TCGGTGGCGG TGTCCGGACC 1320
TGCCTGGGCA AGCACCTGGC CAAGCTGTTC CTGAAGGTGC TGGCGGTGGA GCTGGCTAGC 1380
ACCAGCCGCT TTGAGCTGGC TACACGGACC TTCCCCCGCA TCACCTTGGT CCCCGTCCTG 1440

CACCCCGTGG ATGGCCTCAG CGTCAAGTTC TTTGGCCTGG ACTCCAACCA GAACGAGATC 1500 

CTGCCGGAGA CGGAGGCCAT GCTGAGCGCC ACAGTCTAAC CCAAGACCCA CCCGCCTCAG 1560 

CCCAGCCCAG GCAGCGGGGT GGTGGTTGTG GGAGGTAGAA ACCTGTGTGT GGGAGGGGGC 1620 

CGGAACGGGG AGGGCGAGTG GCCCCCATAC TTGCCCTCCC TTGCTCCCCC TCCCTGGCAA 1680 

ACCCTACCCA AAGCCAGTGG GCCCCATTCC TAGGGCTGGG CTCCCCTTCT GGCTCCAGCT 1740 

TCCCTCCAGC CACTCCCCAT TTACCATCAG CTCAGCCCCT GGGAAGGGCG TGGCAGGGGC 1800

C CGTGACAGTG TTAGGTGTCA GCGCGTGCTA CAGTGTTTTT GTGATGTTCT 1860 

3 ACCCTTTTAG CTGGGGTTGG GGGACGGGAA 1920 

3 CGTCTCCTCC TCCTGCGCCC CCACTGCGTC 1980 

\ CAGCATCCTG GGTAGCAGAA CAGGAGTCAA CCTTGGCGGG GCGGGGGCTG 2040 

CGTCCAACCT GGAGATTGCC CTTCCCTATG CCACGGTTCC CACCCTCCCT CACCAGTTTG 2100 

GACAATTTGA AATTACCTAT TGCTGCTACT TGTTCTGTCC TCTGACCTTG GGGCAAAGGA 2160 

GCCCCAGGCC CTGTCTCCCC AGCATCCTCC CTGGTGGCCC TGGGCAGGTG CACTGACACC 2220 

CCCACCTTCC CATCCCCTGC TGAACCAGGC CCTGTTACAC ACAGCCGCCT AAGGCCCGCG 2280 

GCTCATGTGC TGCCCGCCCC CATATTTATT CACTGATAGA GAATCTTGGG GATGCTGGGG 2340

TCTGGAGTGA ACATCTCCTC CCCTTCATGC CCTAGCCTGT GTTCTAGCTG TCCTGGCGAG 2400 

ACTTCTGTGA GTGAAGAGGA AGGGGTCTCT GGTCAAACCC AGCCCCCAGG GCCTAGGGTT 2460 

GAAAGCCTTC CCCGGCTCCG GGCATTATTT GGGTTTAATC TCGGAGCCTC ACTCCTGGAC 2520 

I CAAAAACCCT GATCAGGTAG ATTTGGAGGC GGCCACGATT TCCTGTTTGG 2640 

H CCCCAGTGCA CTGGCCCTGA CTCCAGGCGT GAGTATGGGG AAGGATACGG 2700 

\ CGGGGAGCAA GGGCCTCCGT CTTCCCTTCC TTAACTCTCC CCCTTTGCCC 2760 

C CTTGAAGTCC CTTCCACCTC TATGCCACTG TCTGCTTAGC 2820

CCAGCTCAGG GGTGGGGAAG AGGCGAAAGC GTGGGGGAGG TGAGCGCAGC GGCAGTTCTG 2880 



C TACGTGCGGT T 
AGTCGGGATG ACTGGGTGAA AGAGGAGTAG CTCCTGCCAC T 

ATTTGGGATG CCTCCTGGGG AAGGTTTCCG GGCGTTTGGT GAGTCTCTAG ATTTTTCCTT 3120 
GCTTTCTGTG TTTATTGGTT TTTGATGTTG TAAAAGCAAT GAATCCCCTT T 
TCGAAAACAC AGAAGAATGA AGGACATGCC AGTCCCCGAT CGCTGCTGTG A 

TGGCTCCCTC AGACCAGATC CCGTAGGCAG CCCCACAGAC CGACCCTGAC CCCACTCACA 3300 

GCCACCCTGA AGATAGACTA TAGGAACGGG CCCATACCAC ACAGACTGCT CTCCAATCCC 33 60 

TGAGTCTCAG ATGTTTCATT TATTTCCTAC TTTTCCACTA CTAAAAAACA GTGTGGAATA 3420 

GACATTATTG GCAAAATTGC TCATCCCTAA TCCTGAAAAA CAGGCCAGAA TGGGTAAAGA 3480 
CTTGTCAAAG CTTGCAACAT AGCTACATGG TGCACCCGGA CCTGTACCCC C 
CACAAAACCA GTGTCTGGGA GGTTCATTTT CCTTTAAACT GATCCAGCTG G 
AATTGTTTTT GACTGAGTAT CTAGGAGAGC AGTAAGTGGA ACTTCAGACA A 
GTCTGGTCCA GGTGAGGGGC AGGGGGCATG GGGCTGGGAG GTCTCAGGGG C 
GGCTGGCCAG CCTGGTAGGG GGCAGAGAAG GAAAAGCTGA G( 

GAAAGAAGGA TCATTTGCCC CGCTGGGTCT CAAAGGCAGT GAGAAGAGAG CTGAAGAAAG 3840

CTCTGGCTGG CTGACAGGAT CCCTGTGTTG TAATTGGTCC CTCCTTTCAG CTCTCTAGTG 3900

AGATGCCCGT GTCTGTGCGT GTGCGTGTGT GTTTCATACA GCTAGCATTA GATGGGTGAT 3960 

GTTTCTTACT TATCATCCCT AACTATTGCA ACTTCACCTT AAAAAGACAA AACCCCACAA 4020 

AACTCTTCCT GCCACGGGCT TGCAGATTGA AGCACTTTCG ATGTTGGGCG CTGGCGTTTG 4080

TGTTCTGGGC ACCACCGTGA CCCTGCCCAG ATGGCTATAA TATTATTTTA TACACAAACC 4140 

TTTTTTTTCA TAAATGTTAT AATTTTGTGT CTGTCTTTAT AAACTATTAT AAGTACTATT 4200 

TTTGTTATAA TTCAAAATAG ATATTTAGTA TAAAGTTTTT GCTGTTAAAT ATTTGTTATT 4260 

TAGTAAAATA TGAATTTTGC TCTATTGTAA ACATGGTTCA AAATATTAAT ATGTTTTTAT 4320 

CACAGTCGTT TTAATATTGA AAAAGCACTT GTGTGTTTTG TTTTGATATG AAACTGGTAC 4380 

CGTGTGAGTG TTTTTGCTGT CGTGGTTTTA ATCTGTATAT AATATTCCAT GTTGCATATT 4440 



1 11 21 31 41 51 

I I I I I I 

MLFEGLDLVS ALATIiAACLV SVTLLLAVSQ QLWQLRWAAT RDKSCKLPIP KGSMGFPLIG 
ETGHWLLQGS GFQSSRREKY GNVFKTHLLG RPLIRVTGAE NVRKILMGEH H 
TRMLLGPNTV SNSIGDIHRN KRKVFSKIFS HEALESYLPK IQLVIQDTLR A 
VYQEAQKLTF RMAIRVLLGF SIPEEDLGHL FEVYQQFVDN VFSLPVDLPF SGYRRGIQAR 
QILQKGLEKA IREKLQCTQG KDYLDALDLL IESSKEHGKE MTMQELKDGT LELIFAAYAT 
TASASTSLIM QLLKHPTVLE KLRDELRAKG ILHSGGCPCE GTLRLD1LSG LRYLDCVIKE 
VMRLFTPISG GYRTVLQTFE LDGFQIPKGW SVMYSIRDTH DTAPVFKDVN VFDPDRFSQA 

RSEDKDGRFH YLPFGGGVRT CLGKHLAKLF LKVLAVELAS TSRFELATRT F "* 

HPVDGLSVKF FGLDSNQNEI LPETEAMLSA TV 

Nucleic Acid Accession #: AB040527.1; AL136582.1



I I I I I I 

GCGGCTGCGG GGTTCCTGGT GCTGAGGACG GACGCCATTG GAGTTCCCGA G" 
TGAGGGAAGC TTCAGCGTGC AATCGGAAAG CTACAGTGTT GAAGAO 
CGACGAAGTC GGGGAGGAAG AGATGGTTGA AGGCAACGAC TATGAAC 
TGGTGGCTAT GGCACCCTCA CCAGCTTTGA CATCCATATC CTCAGAGCCT TCGGAAGCTT 
GGGTCCAGGC CTTCGCATCT TATCGAATGA GCCCTGGGAA CTGGAAAACC CTGTGCTGGC 
CCAGACCCTG GTGGAGGCAT TGCAGCTGGA TCCGGAAACA CTTGCCAATG AGACGGCCGC 
CCGTGCTGCC AACGTAGCCC GCGCCGCCGC CTCCAACCGT GCGGCTCGGG CCGCTGCCGC 
CGCTGCCCGT ACCGCCTTCA GTCAGGTGGT CGCTAGCCAC CGGGTGGCCA CGCCGCAGGT 
CTCAGGAGAG GATACCCAGC CCACGACCTA CGCCGCCGAG GCTCAGGGGC CCACCCCTGA
GCCACCCCTT GCTTCTCCGC AGACCTCCCA GATGTTAGTC ACCAGTAAGA TGGCTGCCCC 

CGAGGCTCCG GCAACCTCCG CACAGTCCCA GACAGGCTCC CCGGCCCAGG AC-GCTGCTAC 660

TGAGGGCCCT AGTAGCGCCT GTGCTTTCTC TCAGGCTCCG TGTGCCAGGG AGGTGGACGC 720 

CAACCGGCCC AGCACAGCCT TCCTGGGCCA GAATGATGTC TTCGATTTCA CTCAGCCGGC 780

AGGTGTCAGT GGCATGGCCT TCCCGCGCCC CAAGAGACCT GCCCCAGCCC AAGAGGCTGC 840 

CACAGAGGGC CCCAGTGCTG CCTCTGGTGT GCCCCAGACG GGACCTGGCA GGGAGGTGGC 900 

AGCCACCCGG CCCAAGACCA CCAAGTCGGG GAAGGCGCTG GCCAAGACTC GGTGGGTGGA 960 

G CAGCTGCTGC CAAGGCCAAG ATGGCCACGA GCATCCCTGA 1020 

T GCAGCTGCTG CCACTGCTCA GCACAGTGCT GAGCCCTGGG CCAGGATGGG 1080

AGGCAAGAGG ACCAAGAAGT CCAAGCACCT GGATGATGAG TATGAGAGCA GCGAGGAGGA 1140 

= CACCCACCTG GAGAGCATCA CAGCCCTCAT TGACGGTGCG 1200 

C CCCCGATGGC CCCGAGGTCC CAGATACCCT CAAGGCACGT 1260 

ACTGTGCCTG CCCCCCCGCA ACGTGACCCT TCTGCAGGAG AGGGCAAATA AGTTGGTGAA 1320 

G ATTAAGGACT ACAAGAAGAT CCCCATCAAG CGCGCAGACA TGCTGAAGGA 1380 

A GAATATGATG AACATTTCCC TGAGATCATT GAACGAGCAA CGTACACCCT 1440 

GGAAAAGAAG TTTGGGATCC ACCTGAAGGA GATCGACAAG GAAGAACACC TGTATATTCT 1500 

A CGGGACTCCT CAGCTCGCCT CCTTGGAAAA ACCA^GGACA CTCCCAGGCT 1560 

C TGGGCGTCAT CTTCATGAAT GG CA^CCGTG CCAGCGAGGC 1620 

TGTCCTCTGG GAGGCACTAC GCAAGATGGG ACTGGGCCCT GGGGTGAGGC ACCCATTCCT 1680 

CGGCGATCTG AGGAAGCTCA TCACAGATGA CTTTGTGAAG CAGAAGTACC TGGAATACAA 1740 

GAAGATCCCC AACAGCAACC CACCTGAGTA TGAATTCCTC TGGGGCCTGC GAGCCCGCCA 1800 

TGAGACCAGC AAGATGAGGG TCCTGAGATT CATGGCCCAG AATCAGAACC GAGACCCCCG 1860

GGAATGGAAG GCTCATTTCT TGGAGGCTGT GGATGATGCT TTCAAGACAA TGGATGTGGA 1920 

TATGGCCGAG GAACATGCCA GGGCCCAGAT GAGGGCCCAG ATGAATATCG GGGATGAAGC 1980 

GCTGATTGGA CGGTGGAGCT GGGATGACAT ACAAGTCGAG CTCCTGACCT GGGATGAGGA 2040 

CGGAGATTTT GGCGATGCCT GGGCCAGGAT CCCCTTTGCT TTCTGGGCCA GATACCATCA 2100 

GTACATTCTG AATAGCAACC GTGCCAACAG GAGGGCCACG TGGAGAGCTG GCGTCAGCAG 2160 

TGGCACCAAT GGAGGGGCCA GCACCAGCGT CCTAGATGGC CCCAGCACCA GCTCCACCAT 2220 

CCGGACCAGA AATGCTGCCA GAGCTGGCGC CAGCTTCTTC TCCTGGATCC AGCACCGTTG 2280 

ACGAACTGCA GCGATCTTAC TGGCCAAGCC AGAGCGCCTC CTCTCAGATT CCTTCTCGAC 2340 

ACAGCACCCT AGGCGGCTTC TTCCTGTCAG TCGGAGGTGG CATGCAAGAT GAAGCTCTCT 2400 

TTGCTCTTCC TGCTTTCATT TTGTGCTTTT CCTTGTGTTT TCATGTTTTG GGTATCAGTG 2460 
T TACATT AAA GTTGCAAAAT TAAA 

Seq ID NO: B49 Protein sequence

Protein Accession #: BAB33378.1; CAB66517.1

| I' I' I' I' f 

HAEGSFSVQS ESYSVEDMDE GSDEVGEEEH VEGNDYEEFG AFGGYGT* T,r " 

SLGPGLRILS NEPWELENPV LAQTLVEALQ LDPETItANET AARAANVi 

AAAARTAFSQ VVASHRVATP QVSGEDTQPT TYAAEAQGPT PEPPLASPQT SQMLVTSKMA 180

APEAPATSAQ SQTGSPAQEA ATEGPSSACA FSQAPCAREV DANRPETAFL GQNDVFDFTQ 240 

PAGVSGMAFP RPKRPAPAQE AATEGPSAAS GVPQTGPGRE VAATRPKTTK SGKALAKTRW 300

VEPOBWAAA AAKAKMATSI PEPEGAAAAT AQHSAEPWAR MGGKRTKKSK HLDDEYSSSE 360

EERETPAVPP TWRASQPSLT VRAQLAPRPP MAPRSQIPSR HVLCLPPRNV TLLQERANKL 420

VKYLMIKDYK KIPIKRADML KnVIREYDEH FPEIIERATY TLEKKFGIHL KEIDKEEHLY 480 

ILVCTRDSSA RLLGKTKDTP RLSLLLVILG VIFMNGNRAS EAVLMEALRK MGLRPGVRHP 540 

FLGDLRKLIT DDFVKQKYLE YKKIPNSNPP EYEFLWGLRA RHETSKMRVL RFIAQNQNRD 600

PREWKAHFLE AVDDAFKTMD VDMAEEHARA QMRAQMNIGD EALIGRWSWD DIQVELLTWD 660 

EDGDFGDAWA RIPFAFWARY HQYILNSNRA NRRATWRAGV SSGTNGGAST SVLDGPETSS 720 
TIRTRNAARA GASFFSWIQH R 

Seq ID NO: B50 DNA sequence 

Nucleic Acid Accession #: XM_084965.1

Coding sequence: 356.. 2014 

1 11 21 31 41 51 

GACCTAGCGT GTGCTCAGCT CTGGACAAGA CATGGATGTA GGCAGACATG CTTCCTGCCT 60 

GCTGGGCTTA GTGCACCAAT GCTAAGGACA GACATGAAGG GCTTGGGAAA GGCATGAAGA 120

TGCAGGGAGT CTAGGAGAGA GGATAAGAAA AACAAGCATT TTGTGGTGGG TTGAATTGCG 180 

TTGTCGTTTT TAGAGGCATT TCGTGATCCA TGGTTGCGCT CCCGTGAAGA GCGAGTCTAG 240 

AGAAGGAATC CGAAGCGGCG GCGGCGAGCG CGGCTCAGGT TGATTTAGAA TACGGGTGAC 300 

AGTGGCCTGG CGCGAGCCCA CTGCTGACGA AAGCGGCTTA TCCCGCGCGG TTTCCATGGA 360 

GACGAGCCGG AGCCGCGGCG GCGGCGGGGC TGTCAGCGAG CGCGGCGGAG CTGGCGCGTC 420 

CGTGGGGGTC TGCAGGAGGA AGGCGGAGGC CGGGGCCGGG ACCGGGACCC TCGCGGCAGA 480 

CATGGACTTG CATTGTGACT GTGCCGCCGA AACGCCGGCC GCCGAGCCGC CGTCGGGGAA 540 

A GCTGCCTTCA AATTATTCAA GAAGAGGAAA TCGGGTGGCA CCATGCCCAG 600 

3 GTCAAAAACA AAGGGGACGG GAA.AAGCTCG GGTCCGACGG GGCTGGTGAG 660 

C TTGCCGAGGT GCTGGTGCTG GAGAGCGGCA GGAAGGAGGA 720 

GCCGCGCGGC GGGGGCGACA GCGGCGGGGG CGGCGGGGGG CGGCCGAACC CGGGGCCCCC 780 

CAGAGCCGCA GGGCCCGGCG GGGGCTCCCT CGCCAGCAGC TCGGTGGCCA AGTCGCACAG 840 

CTTCTTCTCG CTGCTGAAGA AGAACGGGCG CTCGGAAAAC GGCAAGGGAG AGCCTGTGGA 900 

CGCGAGCAAG GCCGGCGGCA AACAAAAGCG GGGGCTGCGG GGGCTGTTCA GCGGCATGCG 960 

CTGGCACAGG AAAGACAAGC GGGCCAAGGC GGAGGCCGCG GAGGGGCGCG CGCCCGGGGG 1020 

CGGCTTGATC CTACCCGGCT CGCTCACCGC CAGCCTGGAG TGCGTCAAGC- AGGAGACGCC 1080 

CAGAGCCGCG CGCGAGCCGG AGGAGCCCAG CCAGGACGCC CCGCGAGACC CAGCAGGCTG 1140 

TGGAGATATT ATTGCAGACC AAGAGGAAGA GGCAGGTCCC AGCTGTGACA AGCATGTCCC 1200 

CGGGCCAGGC AAGCCGGCTC TGTCTAAAAA GAACCCCGGC GTGGTGGCCT ACCAAGGAGG 1260 

CGGGGAAGAG ATGGCCAGCC CGGACGAGGT GGACGACACC TATCTACAGG AGTTCTGGGA 1320 

CATGCTCTCC CAGACCGAGG AGCAGGGACC CGAGCCCCAG GAGGGCGCGG CTAAGGTGGC 1380 

AGCTGCGCTG GAAACCAAGG TGGTGCCCGA GACCCCCAAA GACACCAGGT GTGTGGAAGC 1440 

GGCCAAGGAC GCGTCCTCGG T CAAGCGCAG GAGGCTCAAC CGGATTCCCA TCGAGCCCCA 1500 
TCCTAAGGAG GAGCCCAAGC ACCCGGAGAA GGAGCAGCAG GAAGGCGTCC O 
... ... CG CTCTATGATC TCTATGCTGA 1680

CCCGGACGGA AGTCCAGCAA CCCTTCCTGG AGGGAAGGAC AACGAGGAGA CGTCCTCCCT 1740 

GTCCCGGTTA AAGCCCGTAT CTCCAGGCAC CATCACCTGT CCACTGCGAA CACCAGGCAG 1800 

CTTGCTGAAG GACTCTAAGA TCCCTATTAG CAT CAAGCAC CTGACCAACC TTCCATCTAG 1860 

CCATCCCGTG GTGCACCAGC AACCCTCCAG GAGTGAGATG CCCAGAACAA AAATCCCGGT 1920 

TTCCAAAGTG CTGGTCCGCA GAGTCAGCAA CCGGGGCTTG GCTGGGACCA CCATCAGAGC 1980 

AACGGCCTGC CACGACAGTG CCAAAAAGTT GTGAGGTCTT CCAGGCCAAG GTGGATGGGC 2040 

CCCATGCCAA GGAATACAAC TTTTCCCTGG AAACCACTAA AGTAAGTTTT GCTTTTCCTA 2100 

AAGAAAGTCT TTTAGGACAC CACCCGTCCC CCGCCCTGCT CCAGAGCGTG GACCGAGGAG 2160 

GTCTTTGTGC CCTGAGCAGG GACCGGATAA CACCAGAAAG AGGGATGCTA CACGGGGGTT 2220 

TCTCCTCTCA AGATAAGTCC CTGAGAATTA TTTTCAAGCA CTTTTTTCTT TTTTACCTTT 2280 

AAGTTTTTCT TCCTTTTGCT TTAATATACT GAACACTTGG AAGTCACCTT TACTTGCCTT 2340 

TGCAGAAAAC AGAACTTAGC CAAACCTAAG TAAGAGTCAT GCCTGGATAT TGGGATAAGC 2400 

CAGTGTCTAG AGGCCTGAAG GAACCGCTGA AGAACCAGAG GAGATCTCCT CTTCCAAGAC 2460 
ACGTTTCCTT CCTTCCCTTC TTTCCTCTTT CCTTTCCAGT T 

Seq ID NO: B51 Protein sequence
Protein Accession #: XP_084965.1

1 11 21 31 41 51 

METSRSRGGG GAVSERGGAG ASVGVCB 
GKINKAAFKL FKKRKSGGTM PSIFGVKNKG D 
EEPRGGGDSG GGGGGRPNPG PPRAAGPGGG SLASSSVAKS H 
VDASKAGGKQ KRGLRGLFSG MRWHRKDKRA KAEAAEGRAP GGGLILPGSL T
TPRAAREPEE PSQDAPRDPA GCGDIIADQE EEAGPSCDKH VPGPGKPALS KKNPGVVAYQ
GGGEEMASPD EVDDTYLQEF WDMLSQTEEQ GPEPQEGAAK VAAALETKVV PETPKDTRCV
EAAKDASSVK RRRLNRIPIE PHPKEEPKHP EKEQQEGVPN SDEGYWDSTT PGPEEDSSSS 
GKKAGIPRDS YSGDALYDLY ADPDGSPATL PGGKDNEETS SLSRLKPVSP GTITCPLRTP 
GSLLKDSKIP ISIKHLTNLP SSHPVVHQQP SRSEMPRTKI PVSKVLVRRV SNRGLAGTTI



1 11 21 31 41 51

| I I I I I 

ATGGAGACGA GCCGGAGCCG CGGCGGCGGC GGGGCTGTCA GCGAGCGCGG CGGAGCTGGC 60 

GCGTCCGTGG GGGTCTGCAG GAGGAAGGCG GAGGCCGGGG CCGGGACCGG GACCCTCGCG 120 

GCAGACATGG ACTTGCATTG TGACTGTGCC GCCGAAACGC CGGCCGCCGA GCCGCCGTCG 180 

GGGAAGATTA ATAAAGCTGC CTTCAAATTA TTCAAGAAGA GGAAATCGGG TGGCACCATG 240 

CCCAGCATTT TTGGGGTCAA AAACAAAGGG GACGGGAAAA GCTCGGGTCC GACGGGGCTG 300

GTGAGGAGCA GGACCCACGA CGGACTTGCC GAGGTGCTGG TGCTGGAGAG CGGCAGGAAG 360 

GAGGAGCCGC GCGGCGGGGG CGACAGCGGC GGGGGCGGCG GGGGGCGGCC GAACCCGGGG 420 

CCCCCCAGAG CCGCAGGGCC CGGCGGGGGC TCCCTCGCCA GCAGCTCGGT GGCCAAGTCG 480 

CACAGCTTCT TCTCGCTGCT GAAGAAGAAC GGGCGCTCGG AAAACGGCAA GGGAGAGCCT 540 

GTGGACGCGA GCAAGGCCGG CGGCAAACAA AAGCGGGGGC TGCGGGGGCT GTTCAGCGGC 600 

ATGCGCTGGC ACAGGAAAGA CAAGCGGGCC AAGGCGGAGG CCGCGGAGGG GCGCGCGCCC 660 

GGGGGCGGCT TGATCCTACC CGGCTCGCTC ACCGCCAGCC TGGAGTGCGT CAAGGAGGAG 720

ACGCCCAGAG CCGCGCGCGA GCCGGAGGAG CCCAGCCAGG ACGCCCCGCG AGACCCAGCA 780 

GGTGAGCCCG CAGGGGGAGA GGAGGTGCCC GCCCCCGCCG ACCGCGCCCC AGCGCGGAGC 840 

TGCCGAGAGG CAGAGGGCCT CGCGCACCCC GGCGACACCG GCGCCCGGGG AGAGGACGCC 900 

GCGGGGCATC GGCGCGCCGA GCCGGGGCCC GGGGAGGTCC GCACGGCAGA GGACGCTTCC 960 

AGGACGGGGG CCGTTCCCGT AAAGACGGTC CCCCTTGTCG ACTCCGAAGG CGGCAGCGGC 1020 

CGGGCGCCCG CCGCCCCAGA CCCTGCCTCT GTCGATCCAC CCTCAGACCC GTCGGCAGAT 1080 

CGTATTTGTT TGATGTTTTC TGACGTGACT TCACTGAAAA GCTTTGACTC TCTTACAGGC 1140 

TGTGGAGATA TTATTGCAGA CCAAGAGGAA GAGGCAGGTC CCAGCTGTGA CAAGCATGTC 1200 

CCCGGGCCAG GCAAGCCGGC TCTGTCTAAA AAGAACCCCG GCGTGGTGGC CTACCAAGGA 1260 

GGCGGGGAAG AGATGGCCAG CCCGGACGAG GTGGACGACA CCTATCTACA GGAGTTCTGG 1320 



GCAGCTGCGC TGGAAACCAA G 
GCGGCCAAGG ACGCGTCCTC GGTCAAGCGC AGGAGGCTCA ACCGGATTCC CATCGAGCCC 1500 
3 AGGAGCCCAA GCACCCGGAG AAGGAGCAGC AGGAAGGCGT CCCCAACAGC 1560 



AGCTTGCTGA AGGACTCTAA GATCCCTATT AGCATCAAGC ACCTGACCAA C 
AGCCATCCCG TGGTGCACCA GCAACCCTCC AGGAGTGAGA TGCCCAGAAC AAAAATCCCG 
GTTTCCAAAG TGCTGGTCCG CAGAGTCAGC AACCGGGGCT TGGCTGGGAC CACCATCAGA 
GCAACGGCCT GCCACGACAG TGCCAAAAAG TTGTGA 



EAGAGTGTLA ADMDLHCDCA AETPAAEPPS 
AGHRRAEPGP GEVRTAEDAS RTGAVPVKTV PLVDSEGGSG RAPAAPDPAS VDPPSDPSAD
RICLMFSDVT SLKSFDSLTG CGDIIADQEE EAGPSCDKHV PGPGKPALSK KNPGVVAYQG
GGEEMASPDE VDDTYLQEFW DMLSQTEEQG PEPQEGAAKV AAALETKVVP ETPKDTRCVE
AAKDASSVKR RRLNRIPIEP HPKEEPKHPE KEQQEGVPNS DEGYWDSTTP GPEEDSSSSG
KKAGIPRDSY SGDALYDLYA DPDGSPATLP GGKDNEETSS LSRLKPVSPG TITCPLRTPG
SLLKDSKIPI SIKHLTNLPS SHPVVHQQPS RSEMPRTKIP VSKVLVRRVS NRGLAGTTIR



Seq ID NO: B54 DNA sequence 

Nucleic Acid Accession #: NM_014138.1 

Coding sequence: 60.. 854 

i r r i 1 T i 1 

CTGCAGAGAC TTCCCAGGAA GGTCCAGCGC CCTCTCAGCC TTCGTACTCA GAACAGCCGA 
TGATGGGCCT CAGTAACCTG AGCCCCGGTC CTGGCCCCAG CCAGGCCGTG CCTCTCCCAG 
AGGGGCTGCT CCGCCAGCGG TACAGAGAGG AGAAGACCCT GGAAGAGCGG CGGTGGGAGA 
GGCTGGAGTT CCTTCAGAGG AAGAAAGCAT TCCTGCGGCA TGTGAGGAGG AGACACCGCG 
ATCACATGGC CCCCTATGCT GTTGGGAGGG AAGCCAGAAT CTCCCCATTA GGTGACAGAA 
GTCAGAATCG AT TCCGATGT GAATGTCGAT ACTGCCAGAG CCACAGGCCG AATCTTTCTG 
GGATCCCTGG GGAGAGTAAC AGGGCCCCAC ATCCCTCCTC CTGGGAGACG CTGGTGCAGG 
GCCTCAGTGG CTTGACTCTC AGCCTAGGCA CCAACCAGCC CGGGCCTCTG CCTGAAGCGG 
CACTCCAGCC ACAGGAGACA GAGGAGAAGC GCCAGCGAGA GAGGCAGCAG GAGAGCAAAA 
TAATGTTTCA GAGGCTGCTC AAGCAGTGGT TAGAGGAAAA CTGAGACGTG CACCCCCATG 
GGATGGAGAC CCGAAGGGAC TCAGACGGAG CCGCCGTGTT GGCAGCGCCT GGGTGTGGGC 
CCATTTTGGG GACCAAACAG CAAGCTGTGG TCGGATGAGT GCCAGGACCT GTGTACCGGG 
ACACGTGGGA GTCCTCCCAG CATGATGCTT GACTGACCCG AGGAAGGTCC TCATGTTTCG 
TGCCTGTCAT TCTCGGATGG CTGTGAGGCA TTCCTTGGCA AGGGACGCTG CGTACCAGCG 



TCCATCTCTC TTCCCTTCCT GCTGTCAGTA AGAGATCACA TGTCTGTGTA GTGTGAATGC 
CTTGTCGCTG TCCTGTGCTT TTGCACCATT GAGTTGACTG CCTCTGAGAA GCAGCACTAG 
GCCTGTTGAA ATGCAATGTG CTGCCCTGAG ATCCAGTTTC AAGAATGGGC AGGTAAACGC 
AGTGTGGGAA AGGAATGTGG AATGAGAACT TGGTGGTTCA CCGCTGTACT ATTTGTGTAA 
ATGTTTACGT ATGTGATAAG CTACATGTAT GTAAATGTTG CAATACCCCT AACAGTCGAG 
TAGTAGTCTC CCTTACAGGA ATTTTTGACG GGGTTCCTCA TCATCAATAC CAAATAAATA 
TATGTAGGAA TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 






MMGLSNLSPG PGPSQAVPLP EGLLRQRYRE EKTLEERRWE RLEFLQRKKA FLRHVRRRHR

DHMAPYAVGR EARISPLGDR SQNRFRCECR YCQSHRPNLS GIPGESNRAP HPSSWETLVQ
G TNQPGPLPEA ALQPQETEEK RQRERQQESK IMFQRLLKQW LEEN 



Seq ID NO: B56 DNA sequence

Nucleic Acid Accession #: NM_000025.1 

Coding sequence: 198.. 1424 

1 11 21 31 41 51 

I I I 1 I 

GCTACTCCTC CCCCAAGAGC GGTGGCACCG AGGGAGTTGG GGTGGGC-GGA GGCTGAGCGC 
TCTGGCTGGG ACAGCTAGAG AAGATGGCCC AGGCTGGGGA AGTCGCTCTC ATGCCTTGCT 
GTCCCCTCCC CTGAGCCAGG TGATTTGGGA GACCCCCTCC TTCCTTCTTT CCCTACCGCC 
CCACGCGCGA CCCGGGGATG GCTCCGTGGC CTCACGAGAA CAGCTCTCTT GCCCCATGGC 
CGGACCTCCC CACCCTGGCG CCCAATACCG CCAACACCAG TGGGCTGCCA GGGGTTCCGT 
GGGAGGCGGC CCTAGCCGGG GCCCTGCTGG CGCTGGCGGT GCTGGCCACC GTGGGAGGCA 
ACCTGCTGGT CATCGTGGCC ATCGCCTGGA CTCCGAGACT CCAGACCATG ACCAACGTGT 
TCGTGACTTC GCTGGCCGCA GCCGACCTGG TGATGGGACT CCTGGTGGTG CCGCCGGCGG 
CCACCTTGGC GCTGACTGGC CACTGGCCGT TGGGCGCCAC TGGCTGCGAG CTGTGGACCT 
CGGTGGACGT GCTGTGTGTG ACCGCCAGCA TCGAAACCCT GTGCGCCCTG GCCGTGGACC 
GCTACCTGGC TGTGACCAAC CCGCTGCGTT ACGGCGCACT GGTCACCAAG CGCTGCGCCC 
GGACAGCTGT GGTCCTGGTG TGGGTCGTGT CGGCCGCGGT GTCGTTTGCG CCCATCATGA 
GCCAGTGGTG GCGCGTAGGG GCCGACGCCG AGGCGCAGCG CTGCCACTCC AACCCGCGCT 
GCTGTGCCTT CGCCTCCAAC ATGCCCTACG TGCTGCTGTC CTCCTCCGTC TCCTTCTACC 
TTCCTCTTCT CGTGATGCTC TTCGTCTACG CGCGGGTTTT CGTGGTGGCT ACGCGCCAGC 
TGCGCTTGCT GCGCGGGGAG CTGGGCCGCT TTCCGCCCGA GGAGTCTCCG CCGGCGCCGT 
CGCGCTCTCT GGCCCCGGCC CCGGTGGGGA CGTGCGCTCC GCCCGAAGGG GTGCCCGCCT 
GCGGCCGGCG GCCCGCGCGC CTCCTGCCTC TCCGGGAACA CCGGGCCCTG TGCACCTTGG 
GTCTCATCAT GGGCACCTTC ACTCTCTGCT G 



GTTATGCCAA TTCTGCCTTC AACCCGCTCA TCTACTGCCG CAGCCCGGAC TTTCGCAGCG 
CCTTCCGCCG TCTTCTGTGC CGCTGCGGCC GTCGCCTGCC TCCGGAGCCC TGCGCCGCCG 
CCCGCCCGGC CCTCTTCCCC TCGGGCGTTC CTGCGGCCCG GAGCAGCCCA GCGCAGCCCA 1380
GGCTTTGCCA ACGGCTCGAC GGGGCTTCTT GGGGAGTTTC TTAGGCCTGA AGGACAAGAA 1440 
GCAACAACTC TGTTGATCAG AACCTGTGGA AAACCTCTGG CCTCTGTTCA GAATGAGTCC 1500 
CATGGGATTC CCCGGCTGTG ACACTCTACC CTCCAGAACC TGACGACTGG GCCATGTGAC 1560 
CCAAGGAGGG ATCCTTACCA AGTGGGTTTT CACCATCCTC TTGCTCTCTG TCTGAGAGAT 1620 
GTTTTCTAAA CCCCAGCCTT GAACTTCACT CCTCCCTCAG TC-GTAGTGTC CAGGTGCCGT 
GGAGCAGCAG GCTGGCTTTG GTAGGGGCAC CCATCACCCG GCTTGCCTGT GCAGTCAGTG 
AGTGCTTAGG GCAAAGAGAG CTCCCCTGGT TCCATTCCTT CTGCCAO ' ^^r- 







AGACCTTAGT GTTCTCCAGG CTCTGTGGCC CAGGCTGAGA GCAGCAGGGT AGAAAAGACC 1860

AAGATTTGGG GTTTTATCTC TGGTTCCCTT ATTACTGCTC TCAAGCAGTG GCCTCTCTCA 1920 

CTTTAGCCAT GGAATGGCTC CGATCTACCT CACAGCAGTG TCAGAAGGAC TTCGCCAGGG 1980 

TTTTGGGAGC TCCAGGGTTC ATAAGAAGGT GAACCATTAG AACAGATCCC TTCTTTTCCT 2040 

TTTGCAATCA GATAAATAAA TATCACTGAA TGCAGTTCAT CCTCGGCCCA CTTTCCCTCC 2100 

GTTTGTTTTC TTTTCATAAT CCACTTACTC CCTTCCCTTC TACTCTGCGC TGGCTTTTGA 2160 

CAGAGGCAGT AAATTAGGCC TAATCCTCAC TCTTTTCTTC CTAATCTTCA TCAAACAAAA 2220 

AATGAAAAGT CTGTCTGGAC GAAGGGGAGT GAGCTTGAGC CTTTGATATC TTGCTCCCCC 2280 

ACCCTTCCTG AAACTCTTGA AATCCAGTTG CCATTGAGTA GCAAAGCCAC GCTCCCCACA 2340 

GGACTTGGAC AGAGGGCCCA CAGGGGGATG GGCTGGCTGT GGCCAGGTTT AGGGCAGGGG 2400 

GCATTTGTCC CCTCCATGCT ATAATCCAGT GGTGCCTTAC ATGGTGTGTG TGTGTGTGTG 2460 

TGCGTGTGTG TGTGTGTGTG TGTGTCTGGA GGCACAGGCA CAAAGCATTG CTTGGGTTGG 2520

TCAAATGTCT TGTGTCATAA ATATATTCTG ATGTTTCCCA GCCTTTCCAC AACCTCTACC 2580 

TTCCCACTCA CCTTCCCCAG CTACAAAAAT CTGTATTATC CTCTTAAAGT AAAACTGGAG 2640 



I I I I I 

MAPWPHENSS LAPWPDLPTL APNTANTSGL PGVPWEAALA GALLALAVLA TVGGNLLVIV
AIAWTPRLQT MTNVFVTSLA AADLVMGLLV VPPAATLALT GHWPLGATGC ELWTSVDVLC
VTASIETLCA LAVDRYLAVT NPLRYGALVT KRCARTAVVL VWVVSAAVSF APIMSQWWRV
GADAEAQRCH SNPRCCAFAS NMPYVLLSSS VSFYLPLLVM LFVYARVFVV ATRQLRLLRG
ELGRFPPEES PPAPSRSLAP APVGTCAPPE GVPACGRRPA RLLPLREHRA LCTLGLIMGT
FTLCWLPPFL ANVLRALGGP SLVPGPAFLA LNWLGYANSA FNPLIYCRSP DFRSAFRRLL
CRCGRRLPPE PCAAARPALF PSGVPAARSS PAQPRLCQRL DGASWGVS

Nucleic Acid Accession #: NM_032553.1
Coding sequence: 37.. 1038 



CACCATTAGG CAAAGATAGT TTCTCTAGAG AGAATCATGC CTGCTAATTA CACGTGTACC 60 

AGGCCAGATG GAGACAATAC AGATTTTCGA TACTTTATTT ATGCAGTGAC ATACACTGTC 120 

ATTCTTGTGC CAGGTCTCAT AGGGAATATA TTAGCCCTGT GGGTATTCTA TGGTTATATG 180 

AAAGAAACAA AACGAGCTGT GATATTTATG ATAAACTTAG CCATTGCTGA CTTACTACAA 240

GTTCTTTCCT TGCCACTGAG GATCTTCTAC TACTTGAATC ATGACTGGCC ATTTGGGCCT 300 

GGTCTCTGCA TGTTCTGTTT CTACCTGAAG TATGTCAACA TGTATGCAAG CATCTACTTC 360

TTGGTCTGCA TCAGTGTGCG ACGATTTTGG TTTCTCATGT ACCCCTTTCG CTTCCATGAC 420

TGCAAACAGA AATATGACCT GTACATCAGC ATTGCTGGCT GGCTGATCAT CTGCCTTGCC 480 

TGTGTACTCT TTCCACTCCT CAGAACCAGT GATGATACCT CTGGCAATAG GACCAAATGC 540 

TTTCTGGATC TTCCTACCAG GAATGTCAAC CTGGCCCAGT CCGTTGTTAT GATGACCATT 600 

GGCGAGTTGA TTGGGTTTGT AACTCCGCTT CTGATTGTCC TATATTGTAC CTGGAAGACG 660

GTTTTATCAC TGCAAGATAA ATATCCCATG GCCCAAGATC TTGGAGAGAA ACAGAAAGCC 720 

TTGAAGATGA TTCTAACCTG TGCAGGGGTA TTCCTAATTT GCTTTGCACC TTATCATTTC 780 

AGTTTTCCTT TAGATTTCCT GGTGAAGTCC AATGAAATTA AAAGCTGCCT AGCCAGAAGG 840 

GTGATTCTAA TATTTCATTC TGTGGCATTG TGTCTTGCTA GTCTGAATTC ATGTCTTGAC 900 

CCAGTCATAT ACTACTTTTC CACTAATGAG TTCCGAAGAC GGCTTTCAAG ACAAGATTTG 960

CATGACAGCA TCCAACTCCA TGCAAAATCC TTTGTGAGTA ACCATACAGC TTCCACCATG 1020 

ACACCTGAAT TATGCTAAAA CAAAAAACCA AACTGAATGT GACCTGAAAT GCAAGTACAT 1080

CAGAACATAT CTGCAATACC CAAGCCACAG GGAAGAACTT GCAAAACAAC ACAGCTTTTC 1140 

AGTTCTGCTC TATCTTACTG CTATGGGGAA TTCACTTCTT CAAAGCAGGA CCTATTTGGA 1200 
GCATTACGAT CCACGATTAT TGATGTTGAC ATGTCCATGT AGTAATTTTT CTTCAAGT 



I I I I I I 

MPANYTCTRP DGDNTDFRYF IYAVTYTVIL VPGLIGNILA LWVFYGYMKE TKRAVIFMIN
LAIADLLQVL SLPLRIFYYL NHDWPFGPGL CMFCFYLKYV NMYASIYFLV CISVRRFWFL
MYPFRFHDCK QKYDLYISIA GWLIICLACV LFPLLRTSDD TSGNRTKCFV DLPTRNVNLA
QSVVMMTIGE LIGFVTPLLI VLYCTWKTVL SLQDKYPMAQ DLGEKQKALK MILTCAGVFL
ICFAPYHFSF PLDFLVKSNE IKSCLARRVI LIFHSVALCL ASLNSCLDPV IYYFSTNEFR
RRLSRQDLHD SIQLHAKSFV SNHTASTMTP ELC



I I I I I I 

GATTCGGATT TTAATCCGAC TCACTATAGG GAATTTTGGC CCTCGAGGCC AAGAATTCGG 
CCCGAGGGTT AGACATTTAA CTTGAGTCTT TTCATATCAC GGTATGCTTT TATTTTACCC 
TCATATATAA TTAATTTTCA TCTGGGCATA TATTTCCAGA TTAGAAATCA TTTCTTTTCA 
TACATTTTAA GGAATTTATA CATTTGCATC TGAAATTGTA TTCATTATCT TCTAGAAACC 
AAGAGTGGAC TTGGTATCCG TCTGATTATA TTTTCTTTGT TGGTAAAGAG CCTCTTTTTT 
CCCTTTTTAT TTTCATCTTC TCTTTCTCTA GTGTTTATAA TCTTATTTTI ATGAATGTAG 
TCTAATTATT TAGAACACTG TCTCCGAGTA TGTGTTTATT TGGGTTTGTG TTTTGAACAT 
TGCCTCCAGG TGCGAGTTTA TGTGTGTTTA TGTTTCCTAT TTGAGGTTGG ACACTTAGAA 
AATTTTCTCA ATTCGAAAAC TATTTTCCTT CAGCACTCAG A " 
TTCCACGATT ATTTCCTGTC CTTGGTTTTT CGTGTTCTTA A 
ATGTTAGACC TTACACATCT TCAATGTATC ATATATATTT CACTCATATT ATTTATTGTC
ATATCTTTTG GACTATTGTG TGAAAGCAAT AAATGTATAT TACTTTAAAA AAAAAAAAAA 
AAAACTHCAT TTTCAAATTT AATACACACA AATTGTGCAT ACCCACACAC ATATACCACA 
GATATATT CA TTTTCAGGAT TTCTTTAAGT GTTATTTTAA AAATAATCAT ATTCTTATTT 
TGTGAACTGT GGGATTTCTT GAAACTTTTT GAAAACTCTC ATIAGTCTGT GTTTGTGTGG 
TTTCTTTAAA TTATCTCTTC CTTCAGAATT ACTTTTGTGT TTTGAATTTG AGTCTTTGCC 
ATTCATAAGG CGGGCTTTCC TTGAATACCA GATAACTCTG AGTTTCCTGT TCATATTTAA 
A CCTAATTTGA AAAAAAAAAA AAAAA 



I I 

\ ATACACTCCT GAAATATCCT GAAAACTTTT 60 

TTTTTTCAGA ATCCTTTAAT AAGCAGTTAT GTCAATCTGA AAGTTGCTTA CTTGTACTTT 120 

ATATTAATAG CTATTCTTGT TTTTCTTATC CAAAGAAAAA TCCTCTAATC CCCTTTTCAC 180 

ATGATAGTTG TTACCATGTT TAGGCATTAG TCACATCAAC CCCTCTCCTC TCCCAAACTT 240 

CTCTTCTTCA AATCAAACTT TATTAGTCCC TCCTTTATAA TGATTCCTTG ^CGTTTTA ^00 

TTATTCATTG CCAAGAGAAT AATTGCATTT TAAACCCATA TTATAACAAA GAATAATGAT 420 
TATATTTTGT GATTTGTAAC AAATACCCTT TATTTTCCCT TAACTATTGA ATTAAATATT 480 
TTAATTATTT GTATTCTCTT TAACTATCTT GGTATATTAA AGTATTATCT TTTATATATT 540
TATCAATGGT GGACACTTTT ATAGGTACIC TGTGTCATTT TTGATACTGT AGGTATCTTA 600
TTTCATTTAT CTTTATTCTT AATGTACGAA TT CAT AAT AT TTGATTCAGA ACAAATTTAT 660 
CACTAATTAA CAGAGTGTCA ATTATGCTAA CATCTCATTT ACTGATTTTA ATTTAAAACA 720 
GTTTTTGTTA ACATGCATGT TTAGGGTTGG CTTCTTAATA ATTTCTTCTT CCTCTTCTCT 780 
CTCTCCTCTT CTTTTGGTCA GTGTTGTGCG GGTTAATACA ACAAACTGTA ACAAGTGTAC 840 
CTGGTATGGA CTTGTTGTCC GGGACGTACA TTTTCGCGGT CCTGCTAGCA TGCGTGGTGT 900 
TCCACTCTGG CGCCCAGGAG AAAAA CTACA CCATCCGAGA AGAAATGCCA GAAAACGTCC 960 
TGATAGGCGA CTTGTTGAAA GACCTTAACT TGTCGCTGAT TCCAAACAAG TCCTTGACAA 1020 
CTGCTATGCA GTTCAAGCTA GTGTACAAGA CCGGAGATGT GCCACTGATT CGAATTGAAG 1080
AGGATACTGG TGAGATCTTC ACTACTGGCG CTCGCATTGA TCGTGAGAAA TTATGTGCTG 1140 
GTATCCCAAG GGATGAGCAT TGCTTTTATG AAGTGGAGGT TGCCATTTTG CCGGATGAAA 1200 
TATTTAGACT GGTTAAGATA CGTTTTCTGA T AGAAG AT AT AAATGATAAT GCACCATTGT 12 60 
TCCCAGCAAC AGTTATCAAC ATATCAATTC CAGAGAACTC GGCTATAAAC T CT AAATATA 1320 
I 1 i ,GCTGTTGAT CCTGACGTAG GAATAAACGG AGTTCAAAAC TACGAACTAA 1380 

TTAAG AGT CA AAACATTTTT GGCCTCGATG TCATTGAAAC ACCAGAAGGA GACAAGATGC 1440 
CACAACTGAT TGTTCAAAAG GAGTTAGATA GGGAAGAGAA GGATACCTAC GTGATGAAAG 1500 
\ AGATGGTGGC TTTCCTCAAA GATCCAGTAC TGCTATTTTG CAAGTGAGTG 1560 
C AAATGACAAC CACCCAGTCT TTAAGGAGAC AGAGATTGAA GTCAGTATAC 16 20 
C TCCTGTAGGC ACTTCAGTGA CACAGCTCCA TGCCACAGAT GCTGACATAG 1680 
„.™_C CAAGATCCAC TTCTCTTTCA GCAATCTAGT CTCCAACATT GCCAGGAGAT 1740 
TATTTCACCT CAATGCCACC ACTGGACTTA TCACAATCAA AGAACCACTG GATAGGGAAG 1800 
AAACACCAAA CCACAACTTA CTGGTTTTGG CAAGTGATGG TGGATTGATG CCAGCAAGAG 1860 
CAATGGTGCT GGTAAATGTT ACAGATGTCA ATGATAATGT CCCATCCATT GACATAAGAT 1920 
ACATCGTCAA TCCTGTCAAT GACACAGTTG TTCTTTCAGA AAATATTCCA CTCAACACCA 1980 
AAATTGCTCT CATAACTGTG ACGGATAAGG ATGCGGACCA TAATGGCAGG GTGACATGCT 2040 
TCACAGATCA TGAAATCCCT TTCAGATTAA GGCCAGTATT CAGTAATCAG TTCCTCCTGG 2100 
AGACTGCAGC ATATCTTGAC TATGAGTCCA CAAAAGAATA TGCCATTAAA TTACTGGCTG 2160 
CAGATGCTGG CAAACCTCCT TTGAATCAGT CAGCAATGCT CTTCATCAAA GTGAAAGATG 2220 
AAAATGACAA TGCTCCAGTT TTCACCCAGT CTTTCGTAAC TGTTTCTATT CCTGAGAATA 22 80 
ACTCTCCTGG CATCCAGTTG ACGAAAGTAA GTGCAATGGA TGCAGACAGT GGGCCTAATG 2340 
CTAAGATCAA TTACCTGCTA GGCCCTGATG CTCCACCTGA ATTCAGCCTG GATTGTCGTA 2400 
CAGGCATGCT GACTGTAGTG AAGAAACTAG ATAGAGAAAA AGAGGATAAA TATTTATTCA 2460 
CAATTCTGGC AAAAGATAAC GGGGTACCAC CCTTAACCAG CAATGTCACA GTCTTTGTAA 2520 
GCATTATTGA TCAGAATGAC AATAGCCCAG TTTTCACTCA CAATGAATAC AACTTCTATG 2580 
TCCCAGAAAA CCTTCCAAGG CATGGTACAG TAGGACTAAT CACTGTAACT GATCCTGATT 2640 
ATGGAGACAA TTCTGCAGTT ACGCTCTCCA TTTTAGATGA GAATGATGAC TTCACCATTG 2700 
ATTCACAAAC TGGTGTCATC CGACCAAATA TTTCATTTGA TAGAGAAAAA CAAGAATCTT 2760
ACACTTTCTA TGTAAAGGCT GAGGATGGTG GTAGAGTATC ACGTTCTTCA AGTGCCAAAG 2820 
TAACCATAAA TGTGGTTGAT GTCAATGACA ACAAACCAGT TTTCATTGTC CCTCCTTCCA 2 880 
ACTGTTCTTA TGAATTGGTT CTACCGTCCA CTAATCCAGG CACAGTGGTC TTTCAGGTAA 2 940 
TTGCTGTTGA CAATGACACT GGCATGAATG CAGAGGTTCG TTACAGCATT GTAGGAGGAA 3000 
ACACAAGAGA TCTGTTTGCA AT CG ACCAAG AAACAGGCAA CATAACATTG ATGGAGAAAT 3060 
GTGATGTTAC AGACCTTGGT TTACACAGAG TGTTGGTCAA AGCTAATGAC TTAGGACAGC 3120 
CTGATTCTCT CTTCAGTGTT GTAATTGTCA ATCTGTTCGT GAATGAGTCG GTGACCAATG 3180 
CTACACTGAT TAATGAACTG GTGCGCAAAA GCACTGAAGC ACCAGTGACC CCAAATACTG 3240 
AGATAGCTGA TGTATCCTCA CCAACTAGTG ACTATGTCAA GATCCTGGTT GCAGCTGTTG 3300 
CTGGCACCAT AACTGTCGTT GTAGTTATTT TCATCACTGC TGTAGTAAGA TGTCGCCAGG 3360 
CACCACACCT TAAGGCTGCT CAGAAAAACA AGCAGAATTC TGAATGGGCT ACCCCAAACC 3420 
CAGAAAACAG GCAGATGATA ATGATGAAGA AAAAGAAAAA GAAGAAGAAG CATTCCCCTA 3480 
AGAACTTGCT GCTTAATTTT GTCACTATTG AAGAAACTAA GGCAGATGAT GTTGACAGTG 3540 
ATGGAAACAG AGTCACACTA GACCTTCCTA TTGATCTAGA AGAGCAAACA ATGGGAAAGT 3600 
ACAATTGGGT AACTACACCT ACTACTTT CA AGCCCGACAG CCCTGATTTG GCCCGACACT 3660 
ACAAATCTGC CTCTCCACAG CCTGCCTTCC AAATTCAGCC TGAAACTCCC CTGAATTCGA 3720 
AGCACCACAT CATCCAAGAA CTGCCTCTCG ATAACACCTT TGTGGCCTGT GACTCTATCT 3780 
CCAAGTGTTC CTCAAGCAGT TCAGATCCCT ACAGCGTTTC TGACTGTGGC TATCCAGTGA 3840 
CGACCTTCGA GGTACCTGTG TCCGTACACA CCAGACCC-GT AGGTATCCAA GTTTCTAACA 3 900 
CAACTTTCTA ACTATTTTTT TATTATTATT TTCAGTTGAT GTAGAACTTT ACAAAATCTA 3 960 
TTGACTTCAA AGAGGGATCA AAACAATCAT ATTCTACAGA TGTACCCAAT AGATATATGG 4 020 
ATT CAATTAA GTTTGGTAGA AGATGAGAAC AAAATAACTA CTGATTTAGG AAAATTGGAT 4080 
GCAGAATAAT AATTATAGTA GGGGCAATTT TGTCTGTAGA TGGCAGTATG ACAATTCTTG 4140 
CTAGAGAATA TATTGAAAAA AACTTCAACA CAAAGGGTTG TAGCACTGTC CTCAGTACCA 4200

TTGTGTGCAT GAGGATCAGA ATAGTCTGGG CTAGATACAT CACATTAAAG CTTTTCAGAA 4260

TCTGATAAAT AGCTCTAAAT ACTAATGATA TTGAGAAGCC TAGCTTCACT TGGGAAAATC 4320 

TGTGGCTGTT CACAGAAATT CAGCACCAAG TTATTCCCCC CATACTCTAC CAGGCCTTCA 4380 

GGTCCTCATA ^q^^AAAGTG TCGTTTTCAG ATTAGGAACT CAAAATTATT TTGGTGCATC 4440 

AAATCTACAG TCACACAATA TAACAAGAAT GGGATTAGAA AAATGAAAGC CTACTCATTC 4500

TCATCTTTAA GCCAGAGAAT GAAATATATA TGAGGTCTCT GGATAGCTAT TTAAATATTT 4560 
GCATATTTAT GCAAGGTATT TTGAGCCCTT CAGAAGACAT TCT 



P AVLLACWPH SGAQEKNYTI REEMPENVLI GDLLKDLNLS LIPNKSLTTA 
MQFKLVYKTG DVPLIRIEED TGEIPTTGAR IDREKLCAGI PRDEHCFYEV EVAILPDEIP 
RLVKIRFLIE DINDNAPLFP ATVINISIPE NSAINSKYTL PAAVDPDVGI NGVQNYELIK 
SQNIFGLDVI ETPEGDKMPQ LIVQKELDRE EKDTYVMKUK VEDGGFPQRS STAILQVSVT 
DTNDNHPVFK ETEIEVSIPE NAPVGTSVTQ LHATDADIGE NAKIHFSFSN LVSNIARRLF 



L SEHIPLNTKI ALITVTDKDA DHNGRVTCFT DHEIPFRLRP VFSMQFLLET 
AAYLDYESTK EYAIKLLAAD AGKPPLNQSA MLFIKVKDEN DNAPVFTQSF V 
PGIQLTKVSA MDADSGPNAK INYLLGPDAP PEFSLDCRTG MLTWKKLDR E 
LAKDNGVPPL TSNVTVFVSI IDQKDNSPVF THNEYNFYVP ENLPRHGTVG L 

DNSAVTLSIL DENDDFTIDS QTGVIRPNIS FDREKQESYT FYVKAEDGGR VSRBSSAKVT 660 

INWDVNDNK PVFIVPPSNC SYELVLPSTN PGTWFQVIA VDNDTGMNAE VRYSIVGGNT 720 

RDLFAIDQET GNITLMEKCD VTDLGLHRVL VKANDLGQPD SLFSWIVNL FVNESVTNAT 780 

LINELVRKST EAPVTPNTEI ADVSSPTSDY VKILVAAVAG TITWWIFI TAWRCRQAP 840 

HL KAAQKNKQ NSEWATPNPE NRQMIMMKKK KKKKKHSPKH LLLNFVTIEE TKADDVDSDG 900 

NRVTLDLPID LEEQTMGKYN WVTTPTTFKP DSPDLARHYK SASPQPAFQI QPETPLNSKH 960 

HIIQELPLDN TFVACDSISK CSSSSSDPYS VSDCGYPVTT FEVPVSVHTR PVGIQVSNTT 1020 
F 

Seq ID NO: B63 DNA sequence

Nucleic Acid Accession #: XM_059180.2

Coding sequence: 276.. 3740 

1 11 21 31 41 51

GCGGCGGCCG CGGAGTATCC TGGAGCTGCA GACAGTGCGG GCCTGCGCCC AGTCCCGGCT 60 

GTCCTCGCCG CGACCCCTCC TCAGCCCTGG GCGCGCGCAC GCTGGGGCCC CGCGGGGCTG 120 

GCCGCCTAGC GAGCCTGCCG GTCGACCCCA GCCAGCGCAG CGACGGGGCG CTGCCTGGCC 180 

CAGGCGCACA CGGAAGTGCG CTTCTCTGAA GTAGCTTTGG AAAGTAGAGA AGAAAATCCA 240

GTTTCCTTCT TGGAGAACAC TGGACAGCTG AATAAATGCA GTATCTAAAT ATAAAAGAGG 300 

ACTGCAATGC CATGGCTTTC TGTGCTAAAA TGAGGAGCTC CAAGAAGACT GAGGTGAACC 360 

TGGAGGCCCC TGAGCCAGGG GTGGAAGTGA TCTTCTATCT GTCGGACAGG GAGCCCCTCC 420 

GGCTGGGCAG TGGAGAGTAC ACAGCAGAGG AACTGTGCAT CAGGGCTGCA CAGGCATGCC 480 

GTATCTCTCC TCTTTGTCAC AACCTCTTTG CCCIGTATGA CGAGAACACC AAGCTCTGGT 540 

ATGCTCCAAA TCGCACCATC ACCGTTGATG ACAAGATGTC CCTCCG3CTC CACTACCGGA 600 

TGAGGTTCTA TTTCACCAAT TGGCATGGAA CCAACGACAA TGAGCAGTCA GTGTGGCGTC 660 

ATTCTCCAAA GAAGCAGAAA AATGGCTACG AGAAAAAAAA GATTCCAGAT GCAACCCCTC 720 

TCCTTGATGC CAGCTCACTG GAGTATCTGT TTGCTCAGGG ACAGTATGAT TTGGTGAAAT 780 

GCCTGGCTCC TATTCGAGAC CCCAAGACCG AGCAGGATGG ACATGATATT GAGAACGAGT 840 

GTCTAGGGAT GGCTGTCCTG GCCATCTCAC ACTATGCCAT GATGAAGAAG ATGCAGTTGC 900 

CAGAACTGCC CAAGGACATC AGCTACAAGC GATATATTCC AGAAACATTG AATAAGTCCA 960 

TCAGACAGAG GAACCTTCTC ACCAGGATGC GGATAAATAA TGTTTTCAAG GATTTCCTAA 1020 

AGGAATTTAA CAACAAGACC ATTTGTGACA GCAGCGTGTC CACGCATGAC CTGAAGGTGA 1080

AATACTTGGC TACCTTGGAA ACTTTGACAA AACATTACGG TGCTGAAATA TTTGAGACTT 1140 

CCATGTTACT GATTTCATCA GAAAATGAGA TGAATTGGTT TCATTCGAAT GACGGTGGAA 1200 

ACGTTCTCTA CTACGAAGTG ATGGTGACTG GGAATCTTGG AATCCAGTGG AGGCATAAAC 1260 

CAAATGTTGT TTCTGTTGAA AAGGAAAAAA ATAAACTGAA GCGGAAAAAA CTGGAAAATA 1320 

AACACAAGAA GGATGAGGAG AAAAACAAGA TCCGGGAAGA GTGGAACAAT TTTTCTTACT 1380 

T CAGCATTAAC AAGCAGGACA 1440 

TAGATGGCTA CTTCCGGCTC ACAGCAGATG CCCATCATTA CCTCTGCACC GACGTGGCCC 1560 

CCCCGTTGAT CGTCCACAAC ATACAGAATG GCTGTCATGG TCCAATCTGT ACAGAATACG 1620 

CCATCAATAA ATTGCGGCAA GAAGGAAGCG AGGAGGGGAT GTACGTGCTG AGGTGGAGCT 1680 

GCACCGACTT TGACAACATC CTCATGACCG TCACCTGCTT TGAGAAGTCT GAGCAGGTGC 1740 
AGGGTGCCCA GAAGCAGTTC AAGAACTTTC AGATCGAGGT GCAGAAGGGC C 
TGCACGGTTC GGACCGCAGC TTCCCCAGCT TGGGAGACCT CATGAGCCAC C 
AGATCCTGCG CACGGATAAC ATCAGCTTCA TGCTAAAACG CTGCTGCCAG C 
GAGAAATCTC CAACCTGCTG GTGGCTACTA AGAAAGCCCA GGAGTGGCAG CCCGTCTACC 



AAGGAACTTC TGAAGAGAAG A 

C TTCGAGGCAG CCAGCATGAT G AGACAGGT C TCCCACAAAC 2220 



3 GGGTCCTCTG GATCTC1 
CATGGAAATT CAAAGTTGCC AAACAGCTGG CCAGTGCCCT GAGCT 
ACCTGGTCCA TGGAAATGTG TGTACTf 

GTGAGTGTGG CCCATTCATC AAGCTCAGTG ACCCCGGCAT C 
GGCAAGAATG CATTGAACGA ATCCCATGGA TTGCTCCTGA GTGTGTTGAG GACTCCAAGA 2580 
ACCTGAGTGT GGCTGCTGAC AAGTGGAGCT TTGGAACCAC GCTCTGGGAA ATCTGCTACA 2640 
ATGGCGAGAT CCCCTTGAAA GACAAGACGC TGATTGAGAA A 
AAGAGCAGAA TCCAGATATT GTTTCAGAAA AAAAACCAGC AACTGAAGTG GACCCCACAC 2880

ATTTTGAAAA GCGCTTCCTA AAGAGGATCC GTGACTTGGG AGAGGGCCAC TTTGGGAAGG 2940

TTGAGCTCTG CAGGTATGAC CCCGAAGGGG ACAATACAGG GGAGCAGGTG GCTGTTAAAT 3000 

CTCTGAAGCC TGAGAGTGGA GGTAACCACA TAGCTGATCT GAAAAAGGAA ATCGAGATCT 3060 

TAAGGAACCT CTATCATGAG AACATTGTGA AGTACAAAGG AATCTGCACA GAAGACGGAG 3120 

GAAATGGTAT TAAGCTCATC ATGGAATTTC TGCCTTCGGG AAGCCTTAAG GAATATCTTC 3180 

CAAAGAATAA GAACAAAATA AACCTCAAAC AGCAGCTAAA ATATGCCGTT CAGATTTGTA 3240 

AGGGGATGGA CTATTTGGGT TCTCGGCAAT ACGTTCACCG GGACTTGGCA GCAAGAAATG 3300

TCCTTGTTGA GAGTGAACAC CAAGTGAAAA TTGGAGACTT CGGTTTAACC AAAGCAATTG 3360 

AAACCGATAA GGAGTATTAC ACCGTCAAGG ATGACCGGGA CAGCCCTGTG TTTTGGTATG 3420 

CTCCAGAATG TTTAATGCAA TCTAAATTTT ATATTGCCTC TGACGTCTGG TCTTTTGGAG 3480 

TCACTCTGCA TGAGCTGCTG ACTTACTGTG ATTCAGATTC TAGTCCCATG GCTTTGTTCC 3540 

TGAAAATGAT AGGCCCAACC CATGGCCAGA TGACAGTCAC AAGACTTGTG AATACGTTAA 3600

AAGAAGGAAA ACGCCTGCCG TGCCCACCTA ACTGTCCAGA TGAGGTTTAT CAACTTATGA 3660

GGAAATGCTG GGAATTCCAA CCATCCAATC GGACAAGCTT TCAGAACCTT ATTGAAGGAT 3720 
TTGAAGCACT TTTAAAATAA GAAGCATGAA TAACATTTAA ATTCCACAGA T 
CTTCTCCTGC AACAAATGCC CAAGTCATTT TTTAAAAATT TCTAATGAAA G 

TTCTGTCCAA AAAGTCACTG AACTCATACT TCAGTACATA TACATGTATA AGGCACACTG 3 900 

TAGTGCTTAA TATGTGTAAG GACTTCCTCT TTAAATTTGG TACCAGTAAC TTAGTGACAC 3 960 

ATAATGACAA CCAAAATATT TGAAAGCACT TAAGCACTCC TCCTTGTGGA AAGAATATAC 4020 

CACCATTTCA TCTGGCTAGT TCACCATCAC AACTGCATTA CCAAAAGGGG ATTTTTGAAA 4080 

ACGAGGAGTT GACCAAAATA ATATCTGAAG ATGATTGCTT TTCCCTGCIG CCAGCTGATC 4140 

TGAAATGTTT TGCTGGCACA TTAATCATAG ATAAAGAAAG ATTGATGGAC TTAGCCCTCA 4200 

AATTTCAGTA TCTATACAGT ACTAGACCAT GCATTCTTAA AATATTAGAT ACCAGGTAGT 4260 

ATATATTGTT TCTGTACAAA AATGACTGTA TTCTCTCACC AGTAGGACTT AAACTTTGTT 4320 

TCTCCAGTGG CTTAGCTCCT GTTCCTTTGG GTGATCACTA GCACCCATTT TTGAGAAAGC 4380 

\ TGGGGGGATA GCTGTGGAAT AGATAATTTG CTGCATGTTA ATT CTCAAGA 4440 

3 TGCCAGTGCT TTCCTAAGCA GTATACCTTT AATCAGAACT CATTCCCAGA 4500 

ACCTGGATGC TATTACACAT GCTTTTAAGA AACGT CAATG T ATAT CCTTT TATAACTCTA 4560 

CCACTTTGGG GCAAGCTATT CCAGCACTGG TTTTGAATGC TGTATGCAAC CAGTCTGAAT 4620 

ACCACATACG CTGCACTGTT CTTAGAGGGT TTCCATACTT ACCACCGATC TACAAGGGTT 4680 

GATCCCTGTT TTTACCATCA ATCATCACCC TGTGGTGCAA CACTTGAAAG ACCCGGCTAG 4740 

AGGCACTATG GACTTCAGGA TCCACTAGAC AGTTTTCAGT TTGCTTGGAG GTAGCTGGGT 4800 

AATCAAAAAT GTTTAGTCAT TGATTCAATG TGAACGATTA CGGTCTTTAT GACCAAGAGT 4860 

CTGAAAATCT TTTTGTTATG CTGTTTAGTA TTCGTTTGAT ATTGTTACTT TTCACCTGTT 4920 

CACCCCAAAT TCAGGATTGG TTCAGTGGCA GCAATGAAGT TGCCATTTAA ATTTGTTCAT 4 980 

AGCCTACATC ACCAAGGTCT CTGTGTCAAA CCTGTGGCCA CTCTATATGC ACTTTGTTTA 5 040 
CTCTTTATAC AAATAAATAT ACTAAAGACT TT 

Seq ID NO: B64 Protein sequence 



MQYLNIKEDC NAMAFCAKMR SSKKTEVNLE APEPGVSVIF YLSDREPLRL G 

CIEAAQACRI SPLCHNLFAL YDENTKLWYA PNRTITVDDK MSLRLHYRMR FYFTNWHGTN 120 

DNEQSWRHS PKKQKNGYEK KKIPDATPLL DASSLEYLFA QGQYDLVKCL APIEDPKTEQ 180 

DGHDIENECL GMAVLAISHY AMMKKMQLPE LPKDISYKRY IPETIMKSIR QRNLLTRMRI 240 

NNVFKDFLKE FNNKTICDSS VSTHDLKVKY IATLETLTKH YGAEIFETSM LLISSENEMN 300 

WFHSNDGGNV LYYEVMVTGN LGIQWRHKPN WSVEKEKNK LKRKKLENKD KKDEEKNKIR 360 

EEWNNFSFFP EITHIVIKES WSINKQDNK KMELKLSSHE EALSFVSLVD GYFRLTADAH 420 

HYLCTDVAPP LIVHNIQNGC HGPICTEYAI NKLRQEGSEE GKYVLRWSCT DFDNILMTVT 480 

CFEKSEQVQG AQKQFKNFQI EVQKGRYSLH GSDRSFPSLG DLMSKLKKQI LRTDNISFHL 540 

KRCCQPKPRE ISNLLVATKK AQEWQPVYPM SQLSFDRILK KDLVQGEHLG RGTRTHIYSG 600 

TLMDYKDDEG TSEEKKIKVI LKVLDPSHRD ISLAFFEAAS MMROVSHKHI VYLYGVCVRD 660 

VENIMVEEFV EGGPLDLFI4H RKSDVLTTPW KFKVAKQLAS ALSYLEDKDL VHGNVCTKNL 720 

LLAREGIDSE CGPFIKLSDP GIPITVLSRQ ECIERIPWIA PECVEDSKNL SVAADKWEFG 780 

TTLWEICYNG EIPLKDKTLI EKERFYESRC RPVTPSCKEL ADLKTRCMNY DPNQRPFFRA 840 

IMRDINKLEE QNPDIVSRKK NQPTEVDPTK FEKRFLKRIR DLGEGHFGKV ELCRYDPEDN 900 

TGEQVAVKSL KPESGGMHIA DLKKEIEILR NLYHENIVKY KGICTEDGGN GIKLIMEFLP 960 

SGSLKEYLPK NKNKINLKQQ LKYAVQICKG MDYLGSRQYV HRDLAARNVL VESEHQVKIG 1020 

DFGLTKAIET DKEYYTVKDD RDSPVFWYAP ECLMQSKFYI ASDVWSFGVT LHELLTYCDS 1080 

DSSPMALFLK MIGPTHGQMT VTRLVNTLKE GKRLPCPPNC PDEVYQLMRK CWEFQPSNRT 1140 
SFQNLIEGFE ALLK 

Seq ID NO: B65 DNA sequence 

Nucleic Acid Accession #: NM_004867.1

Coding sequence: 140..931

I I I I I I

GATCCCAGAC CTCGGCTTGC AGTAGTGTTA GACTGAAGAT AAAGTAAGTG CTGTTTGGGC 60 
TAACAGGATC TCCTCTTGCA GTCTGCAGCC CAGGACGCTG ATTCCAGCAG CGCCTTACCG 120
CGCAGCCCGA AGATTCACTA TGGTGAAAAT CGCCTTCAAT ACCCCTACCG CCGTGCAAAA 180 
GGAGGAGGCG CGGCAAGACG TGGAGGCCCT CCTGAGCCGC A 
GACCGGCAAG GAGCTCCGAG TTGCCACCCA GGAAAAAGAG GGCTCCTCTG G 
GCTTACTCTC TTAGG CCTTT CATTCATCTT GGCAGGACTT ATTGTTGGTG G 
TTACAAGTAC TTCATGCCCA AGAGCACCAT TTACCGTGGA GAGATGTGCT TTTTTGATTC 
TGAGGATCCT GCAAATTCCC TTCGTGGAGG AGAGCCTAAC TTCCTGCCTG TGACTGAGGA 
GGCTGACATT CGTGAGGATG ACAACATTGC AATCATTGAT GTGCCTGTCC CCAGTTTCTC 
TGATAGTGAC CCTGCAGCAA TTATTCATGA CTTTGAAAAG GGAATGACTG CTTACCTGGA 
CTTGTTGCTG GGGAACTGCT ATCTGATGCC CCTCAATACT T — ' 
AAATCTGGTA GAGCTCTTTG GCAAACTGGC GAGTGGCAGA TATCTGCCTC AAACTTATGT 720

GGTTCGAGAA GACCTAGTTG CTGTGGAGGA AATTCGTGAT GTTAGTAACC TTGGCATCTT 780 

TATTTACCAA CTTTGCAATA ACAGAAAGTC CTTCCGCCTT CGTCGCAGAG ACCTCTTGCT 840 

GGGTTTCAAC AAACGTGCCA TTGATAAATG CTGGAAGATT AGACACTTCC CCAACGAATT 900 

TATTGTTGAG ACCAAGATCT GTCAAGAGTA AGAGGCAACA GATAGAGTGT CCTTGGTAAT 960 

AAGAAGTCAG AGATTTACAA TATGACTTTA ACATTAAGGT TTATGGGATA CTCAAGATAT 1020

TTACTCATGC ATTTACTCTA TTGCTTATGC CGTAAAAAAA AAAAAAAAAA AAAAAAAAAA 1080 
AA 



TQILTGKELH VATQEXEGSS GRCMLTLLGL 
FFDSEDPANS LRGGEPNFLP VTSEADIRED 
IHDFEKGMT AYLDLLLGNC YLMPLNTSIV MPPKNLVSLF 
QTYWREDLV AVEEIRDVSM LGIFIYQLCN NRKSFRLRRR DLLLGFNKRA 



: B67 DNA sequen 



I I I I I 

ACGCGGGAGG TCTGAGCTGT GGGCTGAGGC AGCGCAGCCG CTGCGCCAGG 



CCTTGAACCT GGGAAACTAT 
ATGGGACAGC AAATTTCGGA „ . 

30 GCAAAACATG TTACGTTGGT TCGAGAGAGT G 

AGAGTAGCTG AGCTTAATGA TGTAACGGCT AAAGTGGCTT CTGGCCAGGA A 
CTCTTTGAGG TACAACC1GG GTCTGATTCC TCTGCTTTTT GGAAAGTGGT T 
GTCTGTACCA AGATTAACAA AAGCAGTGGC ATTGTGGAGG CATCACGGAT CATGAATTTA 
TACCAGTTTA TTCAACTTTA TAAAGATATC ACAAGTCAAG CAGCAGGAGT ATCGGCACAG 

35 AGCTCCACCT CTGAAGAACC TGATGAAAAC TCATCCTCTG TAACATCTTG TCAGGCTAGT 
CTTTGGATGG GAAGGGTGAA GCAGCTGACC GATGAGGAGG AGTGTTGTAT CTGTATGGAT 
GGGCGGGCTG ACCTCATCCT GCGTTGTGCT CACAGCTTTT GTCAGAAGTG TATTGATAAA 
TGGAGTGATC GACACAGGAA TTGCCCTATT TGTCGCCTAC AGATGACTGG AGCAAATGAA 
TCTTGGGTGG TAT CAGATG C ACCCACTGAA GATGATATGG CTAACTATAT TCTTAACATG 

40 GCTGATGAGG CAGGCCAGCC CCACAGGCCA TGACCTTGAA GTGAAAGTCT TCTGTTGCTA 

TTGTGGGCTC AAATATTTGG TCATGGGGGA AGAATGTAGG GTTGTGGCAC TGGCACAGAC 



45 



50 MGQQISDQTQ LVINKLPEKV AKHVTLWES GSLTYEEFLG RVAELNDVTA KVASGQEKHL 
LFEVQPGSDS SAFWKVWRV VCTKINKSSG IVEASRIMNL YQFIQLYKDI TSQAAGVLAQ 
SSTSEEPDEN SSSVTSCQAS LWMGRVKQLT DEEECCICMD GRADLILPCA HSFCQKCIDK 
WSDRHRNCPI CRLQMTGANE SWWSDAPTE DDMANYILNM ADEAGQPHRE 

55 Seq ID NO: B69 DNA sequence 



I I I I I 

3 ACCGGACAGA GACGAGGAGA GGAACAGGAA GAGAGAAGCT GGGAGAATCG 
G GGCTAGTGAC CTGCACACAG GGCAGGGGCA CTCGGCAGTT CCCAGAGGCC 
ACCCCTCCCA CCCCAGACAT CCAGACATCT GGAACTTTGG GTGCCAAGAG TCCAGCTTAA 
^_ TGCAGGCAGC CTGGCTTTTG GGGGCTTTGG TGGTCCCCCA GCTCTTGGGC TTTGGCCATG 
65 GGGCTCGGGG AGCAGAGAGG GAGTGGGAGG GAGGCTGGG3 AGGTGCCCAG GAGGAGGAGC 
GGGAGAGGGA GGCCCTGATG CTGAAGCATC TGCAGGAAGC CCTAGGACTG CCTGCTGGGA 
GGGGGGATGA GAATCCTGCC GGAACTGTTG AGGGAAAAGA GGACTGGGAG ATGGAGGAGG 
ACCAGGGGGA GGAAGAGGAG GAGGAAGCAA CGCCAACCCC ATCCTCCGGC CCCAGCCCCT 
CTCCCACCCC TGAGGACAT C GTCACTTACA TCCTGGGCCG CCTGGCCGGC CTGGACGCAG 
70 GCCTGCACCA GCTGCACGTC CGTCTGCACG CGTTGGACAC CCGCGTGGTC GAGCTGACCC 
AGGGGCTGCG GCAGCTGCGG AACGCGGCAG GCGACACCCG CGATGCCGTG CAAGCCCTGC 
AGGAGGCGCA GGGTCGCGCC GAGCGCGAGC ACGGCCGCTT GGAGGGCTGC CTGAAGGGGC 
TGCGCCTGGG CCACAAGT GC TTCCTGCTCT CGCGCGACTT CGAAGCTCAG GCG3a5C;CGC 
AGGCGCGGTG CACGGCGCGG GGCGGGAGCC TGGCGCAGCC GGCAGACCGC CAGCAGATGG 
75 AGGCGCTCAC TCGGTACCTG CGCGCGGCGC TCGCTCCCTA CAACTGGCCC GTGTGGCTGG 
GCGTGCACGA TCGGCGCGCC GAGGGCCTCT ACCTCTTCGA AAACGC-CCAG CGCGTGTCCT 
TCTTCGCCTG GCATCGCTCA CCCCGCCCCG AGCTCGGCGC CCAGCCCAGC GCCTCGCCGC 
G CCCGGACCAG CCCAACGGTG GCACGCTCGA GAACTGCGTG GCGCAGGCCT 
T GCCAGCGGCG T 



TCCCCTTCTA GCGGGGCCGG TACCCCGCCT C 
TGCGCCGTGC CCACCCTCCT CCGGAATCGC CCTTCCCTTC CTGGCCACGA ATGGCAGCGT 
CCTCCCCGAC CCCCAGTCTG GGCGCTTCTG GGAGGGCTCT TGCGGTGCCG GCACTCCTCC 
TTGTTAGTGT CTTTCCTTGA AGGGGCGGGC ACCAGGCTAG GTCCGGTGCC AATAAATCCT 
TGTGGAATCT GAAAAAAAAA AAAAAAA 
I I I I I I

MQAAWLLGAL WPQLLGFGH GARGAEREWE GGWGGAQEEE RER3ALMLKH LQEALGLPAG 
RGDENPAGTV EGKEDWEMEE DQGEEEEEEA TPTPSSGPSP SPTPEDIVTY ILGRLAGLDA 
GLHQLHVRLH ALDTRWELT QGLRQLRNAA GDTRDAVQAL OEAQGRAERE HGRLEGCLKG 
L SRDFEAQAAA QARCTARGGS LAQPADRQQM EALTRYLRAA LAPYNWPW1L 
L YLFENGQRVS FFAWHRSPRP ELGAQPSASP HPLSPDQPNG GTLENCVAQA 
D CQRRLYYVCE FPF 



CTGGCCAGGG CGACTGGCGG ATAAGGTCTT GTGCGTGGCC T 

TGGGGCTTTG TGAAGGACAA AATGGCGATG GCGGGCCGTG TAGGTCCCCC TTCCTATGAT 120 

GAGGACCTTT TCACAGACCT GTACTGAGCT CCGTGAGGAT AAGTAACTCT GAGGAGATGG 180

GCCCTGCAAG CCTCCTTCTT AGCCGTCTGT TCAGAAAATA GCGTTTTCGA AATGCCCTGA 240 

GTTGACCTAA TGTCTTATTG GGCTCCTGTC TGCAGGATTT ACGCGCACGT TGGAACCGAA 300 

GAGAGCTCTG TTGTTGCAAT GTTCAGCCCA CAAGAGCTTA CTGGTGAAGG AATGGGACAA 360 

GACCCATCTT TATGCAAAGC CAGCGTTACA GTAATGTTCC AGCATCTCAT AATCTATCCT 420 

GGGGAATTCA GCTGCCTCCC AGGGTGAATA CAGGTATTCC TGATGACAGT CTGCCTCTAT 480 

TAACCAATAA CCTGCTTGGC TCAAAGGGCC AGCACCTTCT CTCTAAAGCC CAAGAGGAGT 600 

TTGAGGAAAA CTAGGTGTCT GTGTTCACTC CAGGCTGAAG TTACAGGTCT GAGCAAATAA 660 

GGTGTATAAA AAATGGAATC TGTCTTGGAG GACATCAGAA GGTGAATTTT CCAAGTTCTT 720 

GGACAACCTA GCTGTTGAAA AGCTTTCTGG GTTTGGGGGG TATTTCAGAT GTACCTTAAA 780 

GTGTTAGCAG ATACAGATTA AGACACTGGG AGCCAATGAA ACAGCAGTTG AGGGTTTGCT 840 

GTGTATCACA TTTCTGTATT TTATCACCCC CTTCCTGCAA CATTATTTAT CTGGAATCTA 900 

CCTGCCCTTT TGTTTTTTAG ATACAAGGGC TTGGTTTTGT TACCCAGGCT GGTTTCAAGG 960 

CCATAGCTTT AAGAGATCCT CTCACCACAG ATTTCCAAAG TGCTGGGATT GCAGGTGTGA 1020 

TTCATGGCAC CCAGACTTTG CTGCCTTTCT TACATGATCC AGGCCCAGAA CCCAAACTCA 1080 

GGCACTGTAT AGATGACCAC TTTGGTAAAC TACTGACCTA GCTTGTTGCC AATTGTTGAT 1140 

TGAACTTCCC ATAACTCCAC TTCGTGTCTG TTCCTCTGTA TACAGCCACC TTCTGTTCCC 1200 

GTCATGAGCC TTTAGGTCTC CATTTGCATA TTGCAAATAC TATGTTCCAT GTAGGTAGCT 1260

CATTCAGGGC CTTGCTCTTC ACTTCAAAAA AGGTTCCCTT GAGGACTGGC TGTCAATTTG 1320



: CAT cluster 

1 11 21 31 41 51 

I I I I I I 

TGCTAGTAGG GCCTGGGTTA ATCGGCCGAG GGTGGCTTCG TGGTCTTTAT AGCTGTTACT 
50 CTTTGTACTT GTCTTTTTCT TTTATTTTCT TTTGAGCGAT TGTGCGAACA TAGCATAGCA 
CGCACTATGG CACGAGGCTC GTGCTGCCTG GCCAGGGCGA CTGGCGGATA AGGTCTTGTG 
CGTGGCCTCG AGGCTTAAAA GTAGCAGTGG GGCTTTGTGA AGGACAAAAT GGCGATGGCG 
GGCCGTGTAG GTCCCCCTTC CTATGATGAG GACCTTTTCA CAGACCTGTA CTGAGCTCCG 
TGAGGATAAA TAACTCTGAG GAGATGGGCC CTGCAAGCCT CTTGCTTAGC CGTCTGTTCA 
55 GAAAATAGCG TTTTCGAAAT GCCCTGAGTT GACCTAATGT CTTATTGGGC TCCTGTCTGC 
AGGATTTACG CGCACGTTGG AACCGAAGAG AGCTCTGTTG TTGCAATGTT CAGCCCACAA 
GAGCTTACTG GTGAAGGAAT GGGACAAGAC CCATCTTTAT GCAAAGCCAG CGTTACAGTA 
ATGTTCCAGC ATCTCATAAT CTATCCTGGG GAATTCAGCT GCCTCCCAGG GTGAATACAG 
GTATTCCTGA TGACAGTCTG CCTCTATCTT ACAGAGCAGC TTGTTGCTAT ATACCATTGA 
60 AAAGCCTTCA GAGCTGAGAG GTACTACTAA CCAATAACCT GCTTGGCTCA AAGGGCCAGC 
ACCTTCTCTC TAAAGCCCAA GAGGAGTTTG AGGAAAACTA GGTGTCTGTG TTCACTCCAG 
GCTGAAGTTA CAGGTCTGAG CAAATAAGGT GTATAAAAAA TGGAATCTGT CTTGGAGGAC 
AT CAG AAGGT GAATTTTCCA AGTTCTTGGA CAACCTAGCT GTTGAAAAGC TTTCTGGGTT 
TGGGGGGTAT T T CAGATGT A CCTTAAAGTG TTAGCAGACA CAGATTAAGA CACTGGGAGC 
65 CAATGAAACA GCAGTTGAGG GTTTGCTGTG TATCACATTT CTGTATTTTA TCACCCCCTT 
CCTGCAACAT TATTTATCTG GAATCTACCT GCCCTTTTGT TTTTTAGATA CAAGGGCTTG 
GTTTTGTTAC CCAGGCTGGT TTCAAGGCCA TAGCTTTAAG AGATCCTCTC ACCACAGATT 
TCCAAAGTGC TGGGATTGCA GGTGTGATTC ATGG CACCCA GACTTTGCTG CCTTTCTTAC 
ATGATCCAGG CCCAGAACCC AAACTCAGGC ACTGTATAGA TGACCACTTT CGTAAACTAC 
70 TGACCTAGCT TGTTGCCAAT TGTTGATTGA ACTTCCCATA ACTCCACTTC GTGTCTGTTC 
CTCTGTATAC AGCCACCTTC TGTTCCCGTC ATGAGCCTTT AGGTCTCCAT TTGCATATTG 
CAAATACTAT GTTCCATGTA GGTAGCTCAT TCAGGGCCTT GCTCTTCACT TCAAAAAAGG 
TTCCCTTGAG GACTGGCTGT CAATTTGTGT TGCTGTGTTG GTTGTTGATG AAAATAATAA 
AATGATTGAT TACATAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACAAA CACAAAAAAC 
75 CGCCGCGCTT TTTCCCGGGC GCACAAAGTT ATAAAACGCC GTCCATC 
CCGTCTCCAC CATCCATCCA TCCAGGAAAA TCAGACTTAA TAGTCCGCGT GGGCGACGAG 180

ATTAGGCTGT TATGCACTGA TCCGGGCTTT GTCAAATGGA CTTTTGAGAT CCTGGATGAA 240

ACGAATGAGA ATAAGCAGAA TGAATGGATC ACGGAAAAGG CAGAAGCCAC CAACACCGGC 300

AAATACACGT GCACCAACAA ACACGGCTTA AGCAATTCCA TTTATGTGTT TGITAGAGAT 360 

5 CCTGCCAAGC TTTTCCTTGT TGACCGCTCC TTGTATGGGA AAGAAGACAA CGACACGCTG 420 

GTCCGCIGTC CTCTCACAGA CCCAGAAGTG ACCAATTATT CCCTCAAGGG GTGCCAGGGG 480 

AAGCCTCTTC CCAAGGACTT GAGGTTTATT CCTGACCCCA AGGCGGGCAT CATGATCAAA 540 

AGTGTGAAAC GCGCCTACCA TCGGCTCTGT CTGCATTGTT CTGTGGACCA GGAGGGCAAG 600 

TCAGTGCTGT CGGAAAAATT CATCCTGAAA GTGAGGCCAG CCTTCAAAGC TGTGCCTGTT 660 

10 GTGTCTGTGT CCAAAGCAAG CTATCTTCTT AGGGAAGGGG AAGAATTCAC AGTGACGTGC 720 
ACAATAAAAG ATGTGTCTAG TTCTGTGTAC TCAACGTGGA AAAGAGAAAA CAGTCAGACT 780 
AAACTACAGG AGAAATATAA TAGCTGG CAT CACGGTGACT TCAATTATGA ACGTCAGGCA 840 
ACGTTGACTA TCAGTTCAGC GAGAGTTAAT GATTCTGGAG TGTTCATGTG TTATGCCAAT 900 
AATACTTTTG GAT CAGCAAA TGTCACAACA ACCTTGGAAG TAGTAGATAA AGGATTCATT 960 

15 AATATCTTCC CCATGATAAA CACTACAGTA TTTGTAAACG ATGGAGAAAA TGTAGATTTG 1020 

ATTGTTGAAT ATGAAGCATT CCCCAAACCT GAACACCAGC AGTGGATCTA TATGAACAGA 1080 

ACCTTCACTG ATAAATGGGA AGATTATCCC AAGTCTGAGA ATGAAAGTAA TAT CAGATAC 1140 

GTAAGTGAAC TTCATCTAAC GAGATTAAAA GGCACCGAAG GAGGCACTTA CACATTCCTA 1200 

GTGTCCAATT CTGACGTCAA TGCTGCCATA GCATTTAATG TTTATGTGAA TACAAAACCA 1260 

20 GAAATCCTGA CTTACGACAG GCTCGTGAAT GGCATGCTCC AATGTGTGGC AGCAGGATTC 1320 

CCAGAGCCCA CAATAGATTG GTATTTTTGT CCAGGAACTG AGCAGAGATG CTCTGCTTCT 1380 

GTACTGCCAG TGGATGTGCA GACACTAAAC TCATCTGGGC CACCGTTTGG AAAGCTAGTG 1440 

GTTCAGAGTT CTATAGATTC TAGTGCATTC AAGCACAATG GCACGGTTGA ATGTAAGGCT 1500

GAGCAAATCC ATCCCCACAC CCTGTTCACT CCTTTGCTGA TTGGTTTCGT AATCGTAGCT 1620
GGCATGATGT GCATTATTGT GATGATTCTG ACCTACAAAT ATTTACAGAA ACCCATGTAT 1680 
GAAGTACAGT GGAAGGTTGT TGAGGAGATA AATGGAAACA ATTATGTTTA CATAGACCCA 1740 
ACACAACTT C CTTATGATCA CAAATGGGAG TTTCCCAGAA ACAGGCIGAG TTTTGGGAAA 1800 
ACCCTGGGTG CTGGAGCTTT CGGGAAGGTT GTTGAGGCAA CTGCTTATGG CTTAATTAAG 1860 
30 TCAGATGCGG CCATGACTGT CGCTGTAAAG ATGCTCAAGC CGAGTGCCCA TTTGACAGAA 1920 
CGGGAAGCCC TCATGTCTGA ACTCAAAGTC CTGAGTTACC TTGGTAATCA CATGAATATT 1980 
GTGAATCTAC TTGGAGCCTG CACCATTGGA GGGCCCACCC TGGTCATTAC AGAATATTGT 2040 
TGCTATGGTG ATCTTTTGAA TTTTTTGAGA AGAAAACGTG ATTCATTTAT TTGTTCAAAG 2100 
CAGGAAGATC ATGCAGAAGC TGCACTTTAT AAGAATCTTC TGCATTCAAA GGAGTCTTCC 2160 
35 TGCAGCGATA GTACTAATGA GTACATCGAC ATGAAACCTG GAGTTTCTTA TGTTGTCCCA 2220 
ACCAAGGCCG ACAAAAGGAG ATCTGTGAGA ATAGGCTCAT ACATAGAAAG AGATGTGACT 2280 
CCCGCCATCA TGGAGGATGA CGAGTTGGCC CTAGACTTAG AAGACTTGCT GAGCTTTTCT 2340 
TACCAGGTGG CAAAGGGCAT GGCTTTCCTC GCCTCCAAGA ATTGTATTCA CAGAGACTTG 2400 
GCAGCCAGAA ATATCCTCCT TACTCATGGT CGG AT CACAA AGATTTGTGA TTTTGGTCTA 2460 
40 GCCAGAGACA TCAAGAATGA TTCTAATTAT GTGGTTAAAG GAAACGCTCG ACTACCTGTG 2520 

AAGTGGATGG CACCTGAAAG CATTTTCAAC TGTGTATACA CGTTTGAAAG TGACGTCTGG 2580 
TCCTATGGGA TTTTTCTTTG GGAGCTGTTC TCTTTAGGAA GCAGCCCCTA TCCTGGAATG 2640 
CGGGTCGATT CTAAGTTCTA CAAGATGATC AAGGAAGGCT TCCGGATGCT CAGCCCTGAA 2700 
CACGCACCTG CTGAAATGTA TGACATAATG AAGACTTGCT GGGATGCAGA TCCCCTAAAA 2760 
45 AGACCAACAT TCAAGCAAAT TGTTCAGCTA ATTGAGAAGC AGATTTCAGA GAGCACCAAT 2820 

CATATTTACT CCAACTTAGC AAACTGCAGC CCCAACCGAC AGAAGCCCGT GGTAGACCAT 2880 
TCTGTGCGGA TCAATTCTGT CGGCAGCACC GCTTCCTCCT CCCAGCCTCT GCTTGTGCAC 2940 
GACGATGTCT GAGCAGAATC AGTGTTTGGG TCACCCCTCC AGGAATGATC TCTTCTTTTG 3000 
GCTTCCATGA TGGTTATTTT CTTTTCTTTC AACTTGCATC CAACTCCAGG ATAGTGGGCA 3060 
50 CCCCACTGCA ATCCTGTCTT TCTGAGCACA , CTTTAGTGGC CGATGATTTT TGTCATCAGC 3120 
CACCATCCTA TTGCAAAGGT T CCAACTGT A TATATTCCCA MAGCAACGT AGCTTCTACC 3180 
ATGAACAGAA AACATTCTGA TTTGGAAAAA GAGAGGGAGG TATGGACTGG GGGCCAGAGT 3240 
CCTTTCCAAG GCTTCTCCAA TTCTGCCCAA AAATATGGTT GATAGTTTAC CTGAATAAAT 3300 
GGTAGTAATC ACAGTTGGCC TTCAGAACCA TCCATAGTAG TATGATGATA CAAGATTAGA 3360 
55 AGCTGAAAAC CTAAGTCCTT TATGTGGAAA ACAGAACATC ATTAGAACAA AGGACAGAGT 3420 
ATGAACACCT GGGCTTAAGA AATCTAGTAT TTCATGCTGG GAATGAGACA TAGGCCATGA 3480 
AAAAAATGAT CCCCAAGTGT GAACAAAAGA TGCTCTTCTG TGGACCACTG CATGAGCTTT 3540 
TATACTACCG ACCTGGTTTT TAAATAGAGT TTGCTATTAG AGCATTGAAT TGGAGAGAAG 3600 
GCCTCCCTAG CCAGCACTTG TATATACGCA TCTATAAATT GTCCGTGTTC ATACATTTGA 3660 
60 GGGGAAAACA CCATAAGGTT TCGTTTCTGT ATACAACCCT GGCATTATGT CCACTGTGTA 3720 
TAGAAGTAGA TTAAGAGCCA TATAAGTTTG AAGGAAACAG TTAATACCAT TTTTTAAGGA 3780 
AACAATATAA CCACAAAGCA CAGTTTGAAC AAAATCTCCT CTTTTAGCTG ATGAACTTAT 3840 
TCTGTAGATT CTGTGGAACA AGCCTATCAG CTTCAGAATG GCATTGTACT CAATGGATTT 3900 
GATGCTGTTT GACAAAGTTA CTGATTCACT GCATGGCTCC CACAGGAGTG GGAAAACACT 3960 
65 GCCATCTTAG TTTGGATTCT TATGTAGCAG GAAATAAAGT ATAGGTTTAG CCTCCTTCGC 4020 
AGGCATGTCC TGGACACCGG GCCAGTATCT ATATATGTGT ATGTACGTTT GTATGTGTGT 4080 
AGACAAATAT TTGGAGGGGT ATTTTTGCCC TGAGTCCAAG AGGGTCCTTT AGTACCTGAA 4140 
AAGTAACTTG GCTTTCATTA TTAGTACTGC TCTTGTTTCT TTTCACATAG CTGTCTAGAG 4200 
TAGCTTACCA GAAGCTTCCA TAGTGGTGCA GAGGAAGTGG AAGGCATCAG TCCCTATGTA 4260 
70 TTTGCAGTTC ACCTGCACTT AAGGCACTCT GTTATTTAGA CTCATCTTAC TGTACCTGTT 4320 
CCTTAGACCT TCCATAATGC TACTGTCTCA CTGAAACATT TAAATTTTAC CCTTTAGACT 4380
GTAGCCTGGA TATTATTCTT GTAGTTTACC TCTTTAAAAA CAAAACAAAA CAAAACAAAA 4440 
AACTCCCCTT CCTCACTGCC CAATATAAAA GC-CAAATGTG TACATGGCAG AGTTTGTGTG 4500 
TTGTCTTGAA AGATTCAGGT ATGTTGCCTT TATGGTTTCC CCCTTCTACA TTTCTTAGAC 4560 
75 TACATTTAGA GAACTGTGGC CGTTATCTGG AAGTAACCAT TTGCACTGGA GTTCTATGCT 4620 
CTCGCACCTT TCCAAAGTTA ACAGATTTTG GGGTTGTGTT GTCACCCAAG AGATTGTTGT 4680 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 4740 
AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4800 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 4860 
80 AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4920 

AACTTTATGT GTAAATACAT AAGCGGCGTA AGTTTAAAGG ATGTTGGTGT TCCACGTGTT 5040 
TTATTCCTGT ATGTTGTCCA ATTGTTGACA GTTCTGAAGA ATTC 
PGFVKWTFEI LDETNENKQN El

DRSLYGKEDN DTLVRCPLTD PEVTNYSLKG CQGKPLPKDL RFIPDPKAGI MIKSVKRAYH 180 

RLCLHCSVDQ EGKSVLSEKF ILKVRPAFKA VPWSVSKAS YLLREGEEFT VTCTIKDVSS 240 

51 SQTKLQEKYN SWHHGDFNYE RQATLTISSA RVNDSGVFMC YANNTFGSAN 300 

K GFINIFPMIN TTVFVNDGEN VDLIVEYEAF PKPEHQQWIY MNRTFTDKWE 360 

J IRYVSELHLT RLKGTEGGTY TFLVSNSDVN AAIAFNVYVN TKPEILTYDR 420 

LWGMLQCVA AGFPEPTIDW YFCPGTEQRC SASVLPVDVQ TLNSSGPPFG KLWQSSIDS 480 

SAFKHNGTVE CKAYNDVGKT SAYFNFAFKG NNKF.QIHPHT LFTPLLIGFV IVAGMMCIIV 540 

MILTYKYLQK PMYEVQWKW EEINGNNYVY IDPTQLPYDK KWEFPRNRLS FGKTLGAGAF 600 

GKWEATAYG LIKSDAAMTV AVKMLKPSAH LTEREALMSE LKVLSYLGNH MN I VNLLGAC 660 

TIGGPTLVIT EYCCYGDLLN FLRRKRDSFI CSKQEDHAEA ALYKNLLHSK ESSCSDSTNE 720 

YMDMKPGVSY WPTKADKRR SVRIGSYIER DVTPAIMEDD ELALDLEDLL SFSYQVAKGM 780 

AFLASKNCIH RDLAARNILL THGRITKICD FGLARDIKND SNYWKGNAR LPVKWMAPES 840 

IFNCVYTFES DVWSYGIFLW ELFSLGSSPY PGMPVDSKFY KKIKEGFRML SPEHAPAEMY 900 

DIMKTCWDAD PLKRPTFKQI VQLIEKQISE STNHIYSNLA NCSPNRQKPV VDHSVRINSV 960 
GSTASSSQPL LVHDDV 

Seq ID NO: B75 DNA sequence 
Nucleic Acid Accession #: NM_004456
Coding sequence: 58..2298

1 11 21 31 41 51

I I I I I I 

GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 60

GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGCGGAAGCG TGTAAAATCA 120 

GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180 

TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240 

CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 300 

3 TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 360 

3 CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420 

GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480 

GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 540 

GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGCCCT TGGTCAATAT 600 

AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 660

GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780 

GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840

CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 900 

CATACGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 1080 

CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140 

AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200 

AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260 

GAIGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 1320 

CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 1380 

GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 1440 

ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 1500 

GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACCGGTT GTGGGCTGCA 1560

CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTAICAA 1620 

CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT 1680 

TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC GGGATGCCGC 1740 

TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC 1800 

CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC 1860 

AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGC1GGCACC AICTGACGTG 1920 

GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980 

TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC 2040 

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT 2100 

AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG 2160 

GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 2220 

CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA 2280 

GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 2340 

CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG 2400 

AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460 

GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 2520 
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA 

Seq ID NO: B76 Protein sequence 



I I I I I I 

MGQTGKKSEK GPVCWRKRVK SEYMRLRQLK RFRRADEVKS MFSSNRQKIL ERTEILNQEW 
KQRRIQPVHI LTSVSSLRGT RECSVTSDLD FPTQVIPLKT LNAVASVPIM YSWSPLQQNF 
MVEDETVLHN IPYMGDEVLD QDGTFIEELI KNYDGKVHGD RECGFINDEI FVELVNALGQ 
YNDDDDDDDG DDPEEREEKQ KDLEDHRDDK ESRPPR%FPS CI> ILE M FP X" 
KEKYKELTEQ QLPGALPPEC TPNIDGPNAK SVQREQSLHS F ' 
ATPNTYKRKN TETALDNKPC GPQCYQHLEG K 
WO 03/042661 PCT/US02/36810



NSSRPSTPTI NVLESKDTDS DREAGTETGG ENNDKEESEK KDETSSSSEA NSRCQTPIKM 
KPNIEPPENV EWSGAEASMF RVLIGTYYDN FCAIARLIGT KTCRQVYEFR VKESSIIAPA 
PAEDVDTPPR KKKRKHRLWA AHCRKIQLKK DGSSNHVYNY QPCDHPRQPC DSSCPCVIAQ 



Seq ID NO: B77 DNA sequence 
Nucleic Acid Accession #: NM_007015
Coding sequence: 1..1005 

1 11 21 31 41 51 

I I I I I I 

ATGACAGAGA ACTCCGACAA AGTTCCCATT GCCCTGGTGG GACCTGATGA CGTGGAATTC 
TGCAGCCCCC CGGCGTACGC TACGCTGACG GTGAAGCCCT CCAGCCCCGC GCGGCTGCTC 
AAGGTGGGAG CCGTGGTCCT CATTTCGGGA GCTGTGCTGC TGCTCTTTGG GGCCATCGGG 
GCCTTCTACT TCTGGAAGGG GAGCGACAGT CACATTTACA ATGTCCATTA CACCATGAGT 
A AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 
3TGGAGC TGAAGAAGCA ATTGCAGTTA ATGATTTCCA GAATGGCATC 
C GTTTTGCTGG AGGAGAGAAG TGCTACATTA AAGCGCAAGT GAAGGCTCGT 
ATTCCTGAGG TGGGCGCCGT GACCAAACAG AGCATCTCCT CCAAACTGGA AGGCAAGATC



CTTAAAC CAA CCTATCCAAA AGAAATCCAG AGGGAAAGAA G 
GTTCCAACTA CCACAAAAAG ACCACACAGT GGACCACGGA GCAACCCAGG CGCTGGAAGA 720 
CTGAATAATG AAACCAGACC CAGTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 780 

ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960 

ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 

CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080 

ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 

TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 1260 

CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 1320 
AGTTTGCC 

Seq I! 

Protein Accession #:

1 11 21 31 41 51 

I I I I I I 

MTENSDKVPI ALVGPDDVEF CSPPAYATLT VKPSSPAELL KVGAWLIEG AVLLLFGAIG 
AFYFWKGSDS IIIYNVIIYTMS INGKLQDGSM EIDAGNKLET FKMGSGAEEA IAVNDFQNGI 
TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI HPVKYEENEL IWVAVDQPVK 
DNSFLSSKVL ELCGDLPIFW LKPTYPKEIQ RERREWRKI VPTTTKRPKS GPRSNPGAGR 
LNNETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 
GGYYPWPYNY QGCRSACRVI MPCSWHVARI LGMV 



Coding sequence: 66..1085

1 11 21 31 41 51 

I I I I I I

CCGAGACTCA CGGTCAAGCT AAGGCGAAGA GTGGGTGGCT GAAGCCATAC TATITTATAG 60 

AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 120 

GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 180 

AAAGACCTGT GCTTTTGCAT TTGCACCAAA CAGCCCATGC TGATGAATTT GACTGCCCTT 240 

CAGAACTTCA GCACACACAG GAACTCTTTC CACAGTGGCA CTTGCCAATT AAAATAGCTG 300 

CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAG GGAAGTAATT CACCCTTTAG 360

CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC AAAGTCTTGC 420 

CAATGGTTTC CATCACTCTC TTGGCATTGG TTTACCTGCC AGGTGTGATA GCAGCAATTG 480 

TCCAACTTCA TAATGGAACC AAGTATAAGA AGTTTCCACA TTGGTTGGAT AAGTGGATGT 540

TAACAAGAAA GCAGTTTGGG CTTCTCAGTT TCTTTTTTGC TGTACTGCAT GCAATTTATA 600 

GTCTGTCTTA CCCAATGAGG CGATCCTACA GATACAAGTT GCTAAACTGG GCATATCAAC 660 

AGGTCCAACA AAATAAAGAA GATGCCTGGA TTGAGCATGA TGTTTGGAGA ATGGAGATTT 720 

ATGTGTCTCT GGGAATTGTG GGATTGGCAA TACTGGCTCT GTTGGCTGTG ACATCTATTC 780 

3 TGACTCTTTG ACATGGAGAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 840 

r TCTACTGGGC ACAATACACG CATTGATTTT TGCCTGGAAT AAGTGGATAG 900 

\ ATTTGTATGG TATACACCTC CAACTTTTAT GATAGCTGTT TTCCTTCCAA 960 

I GATATTTAAA AGCATACTAT TCCTGCCATG CTTGAGGAAG AAGATACTGA 1020 

A TGGTTGGGAA GACGTCACCA AAATTAACAA AACTGAGATA TGTTCCCAGT 1080 

[TTACAC ACATTTTTGT TCAATATTGA TATATTTTAT C 
T ATTTGTTAAT AAAATGATTA TTCAAGGAAA A 
VSITLLALVY LPGVIAAIVQ LHNGTKYKKF PHWLDKWMLT RKQFGLLSFF F

SYPMRRSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL AnuBvioim 

VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 
L PCLRKKILKI RHGWEDVTKI NKTEICSQL 



Seq ID NO: B81 DNA sequence
Nucleic Acid Accession #: NM_000684
Coding sequence: 87..1520

1 11 21 31 41 51

I I I I I I 

TGCTACCCGC GCCCGGGCTT CTGGGGTGTT CCCCAACCAC GGCCCAGCCC TGCCACACCC 60

CCCGCCCCCG GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGCGCCTCCG 120

AGCCCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCGGCCACC GCGGCGCGGC 180

15 TGCTGGTGCC CGCGTCGCCG CCCGCCTCGT TGCTGCCTCC CGCCAGCGAA AGCCCCGAGC 240 

CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 300 

TCGTGGCGGG CAATGTGCTG GTGATCGTGG CCATCGCCAA GACGCCGCGG CTGCAGACGC 360 

TCACCAACCT CTTCATCATG TCCCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGTGG 420 

TGCCGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC TTCTTCTGCG 480 

20 AGCTGTGGAC CTCAGTGGAC GTGCTGTGCG TGACGGCCAG CATCGAGACC CTGTGTGTCA 540 

TTGCCCTGGA CCGCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAGC CTGCTGACGC 600 

GCGCGCGGGC GCGGGGCCTC GTGTGCACCG TGTGGGCCAT CTCGGCCCTG GTGTCCTTCC 660 

TGCCCATCCT CATGCACTGG TGGCGGGCGG AGAGGGACGA GGCGCGCCGC TGCTACAACG 720 

ACCCCAAGTG CTGCGACTTC GTCACCAACC GGGCCTACGC CATCGCCTCG TCCGTAGTCT 780 

25 CCTTCTACGT GCCCCTGTGC ATCATGGCCT TCGTGTACCT GCGGGTGTTC CGCGAGGCCC 840 

AGAAGCAGGT GAAGAAGATC GACAGCTGCG AGCGCCGTTT CCTCGGCGGC CCAGCGCGGC 900 

CGCCCTCGCC CTCGCCCTCG CCCGTCCCCG CGCCCGCGCC GCCGCCCGGA CCCCCGCGCC 960 

CCGCCGCCGC CGCCGCCACC GCCCCGCTGG CCAACGGGCG TGCGGGTAAG CGGCGGCCCT 1020 

CGCGCCTCGT GGCCCTACGC GAGCAGAAGG CGCTCAAGAC GCTGGGCATC ATGATGGGCG 1080 

30 TCTTCACGCT CTGCTGGCTG CCCTTCTTCC TGGCCAAGGT GGTGAAGGCC TTCCACCGCG 1140 

AGCTGGTGCC CGACCGCCTC TTCGTCTTCT TCAACTGGCT GGGCTACGCC AACTCGGCCT 1200 

TCAACCCCAT CATCTACTGC CGCAGCCCCG ACTTCCGCAA GGCCTTCCAG GGACTGCTCT 1260 

GCTGCGCGCG CAGGGCTGCC CGCCGGCGCC ACGCGACCCA GGGAGACCGG CCGCGCGCCT 1320 

CGGGCTGTCT GGCCCGGCCC GGACCCCCGC CATCGCCCGG GGCCGCCTCG GACGACGACG 1380 

35 ACGACGATGT CGTCGGGGCC ACGCCGCCCG CGCGCCTGCT GGAGCCCTGG GCCGGCTGCA 1440 

ACGGCGGGGC GGCGGCGGAC AGCGACTCGA GCCTGGACGA GCCGTGCCGC CCCGCCTTCG 1500 

CCTCGGAATC CAAGGTGTAG GGCCCGGCGC GGGGCGCGGA CTCCGGGCAC GGCTTCCCAG 1560 

GGGAACCAGG AGATCTGTGT TTACTTAAGA CCGATAGCAG GTGAACTCGA AGCCCACAAT 1620 

CCTCGTCTGA ATCATCCGAG GCAAAGAGAA AAGCCACGGA CCGTTGCACA AAAAGGAAAG 1680
40 TTTGGGAAGG GATGGGAGAG TGGCTTGCTG ATGTTCCTTG TTG 



1 11 21 31 41 51 

I I I I I I 

MGAGVLVLGA SEPGNLSSAA PLPDGAATAA RLLVPASPPA SLLPPASESP EPLSQQWTAG 
MGLLMALIVL LIVAGHVLVI VAIAKTPRLQ TLTKLFIMSL ASADLVMGLL WPFGATIW 
WGRWEYGSFF CELWTSVDVL CVTASIETLC VIALDRYLAI TSPFRYQSLL TRARARGLVC 
TVWAISALVS FLPILMHWWR AESDEARRCY NDPKCCDFVT NRAYAIASSV VSFYVPLCIM 
AFVYLRVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPSPV PAPAPPPGPP RPAAAAATAP 
LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPF FLANWKAFH RELVPDRLFV 
FFNWLGYANS AFNPIIYCRS PDFRKAFQGL LCCARRAARR RHATHGDRPR ASGCLARPGP 
PPSPGAASDD DDDDWGATP PARLLEPWAG CNGGAAADSD SSLDEPCRPG FASESKV 

Seq ID NO: B83 DNA sequence 



I I I I I I 

GGCTCAGCTG CCGGGCTGCT CCGGTTGGAA ACGCCAAGCC AGCTGCCGTC CTAATCCAAA 
AGCCATGAAC AGCGGCGTGT GCCTGTGCGT GCTGATGGCG GTACTGGCGG CTGGCGCCCT 
GACGCAGCCG GTGCCTCCCG CAGATCCCGC GGGCTCCGGG CTGCAGCGGG CAGAGGAGGC 
65 GCCCCGTAGG CAGCTGAGGG TATCGCAGAG AACGGATGGC GAGICCCGAG CGCACCTGGG 
CGCCCTGCTG GCAAGATACA TCCAGCAGGC CCGGAAAGCT CCTTCTGGAC GAATGTCCAT 
CGTTAAGAAC CTGCAGAACC TGGACCCCAG CCACAGGATA AGTGACCGGG ACTACATGGG 
CTGGATGGAT TTTGGCCGTC GCAGTGCCGA GGAGTATGAG TACCCCTCCT AGAGGACCCA 
„„ GCCGCCATCA GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA 
[■CATTGT CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC 
A AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCT CT 
T GAAGACAAAA CTGTTTT 



TCCTGTTCTG A 



LIiARYIQQAR kapsgrmsiv KNLQNLDPSH 



P RRQLRVSQRT DGESRAHLGA 
TABLE 74 Gene Sequences Up- or Down-Regulated in Cancer
Angiogenesis



1 DNA SEQUENCE 



endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 
Unigene number: Hs.154210



Probeset Accession #: M31210 
Nucleic Acid Accession #: M31210 
Coding sequence: 251-1396 

TCTAAAGGTC GGGGGCAGCA GCAAGATGCG AAGCGAGCCG TACAGATCCC GGGCTCTCCG 60 

AACGCAACTT CGCCCTGCTT GAGCGAGGCT GCGGTTTCCG AGGCCCTCTC CAGCCAAGGA 120 

AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180 

CTCGCCTCGC CCTCTAGCGT TCGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 240 

GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCACCGCA GCTCGGTCTC 300 

C AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 360 

C AAGGAGAACA GCATTAAACT GACCTCGGTG GTGTTCATTC TCATCTGCTG 420 

C CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 480 

,. „..G TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 540 

CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 

GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAGCC TCCGTGTTCA GTCTCCTCGC 660 

CATCGCCATT GAGCGCTATA TCACAATGCT GAAAATGAAA CTCCACAACG GGAGCAATAA 720 

CTTCCGCCTC TTCCTGCTAA TCAGCGCCTG CTGGGTCATC TCCCTCATCC TGGGTGGCCT 780 

GCCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT 840 

3 CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 900 

3 TACTGCAGAA TCTACTCCTT GGTCAGGACT CGGAGCCGCC GCCTGACGTT 960 

CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAT GTGGCGCTGC TCAAGACCGT 1020 

AATTATCGTC CTGAGCGTCT TCATCGCCTG CTGGGCACCG CTCTTCATCC TGCTCCTGCT 1080 

GGATGTGGGC TGCAAGGTGA AGACCTGTGA CATCCTCTTC AGAGCGGAGT ACTTCCTGGT 1140 

GTTAGCTGTG CTCAACTCCG GCACCAACCC CATCATTTAC ACTCTGACCA ACAAGGAGAT 1200 

GCGTCGGGCC TTCATCCGGA TCATGTCCTG CTGCAAGTGC CCGAGCGGAG ACTCTGCTGG 1260 
CAAATTCAAG CGACCCATCA TCGCCGGCAT GGAATTCAGC CGCAGCAAAT CGGACAATTC 
CTCCCACCCC CAGAAAGACG AAGGGGACAA CCCAGAGACC A "~™ —<•- - -•" 
CAACTCTTCT TCCTAGAACT GGAAGCTGTC CACCCACCGG A 
CTGGCCACCC CAGTGTTTGG AAAAAAATCT CTGGGCTTCG ACTGCTGCCA G 
GCTGCAAGCC AGAGGGAGGA AGGGGGAGAA TACGAACAGC CTGGTGGTGT C 

TGGGTAGACT TAGTTCCTGT GAACAATGCA CTGGGAAGGG TGGAGATCAG GTCCCGGCCT 1620 

GGAATATATA TTCTACCCCC CTGGAGCTTT GATTTTGCAC TGAGCCAAAG GTCTAGCATT 1680

GTCAAGCTCC TAAAGGGTTC ATTTGGCCCC TCCTCAAAGA CTAATGTCCC CATGTGAAAG 1740 

CGTCTCTTTG TCTGGAGCTT TGAGGAGATG TTTTCCTTCA CTTTAGTTTC AAACCCAAGT 1800 

GAGTGTGTGC ACTTCTGCTT CTTTAGGGAT GCCCTGTACA TCCCACACCC CACCCTCCCT 1860

TCCCTTCATA CCCCTCCTCA ACGTTCTTTT ACTTTATACT TTAACTACCT GAGAGTTATC 1920 

AGAGCTGGGG TTGTGGAATG ATCGATCATC TATAGCAAAT AGGCTATGTT GAGTACGTAG 1980

GCTGTGGGAA GATGAAGATG GTTTGGAGGT GTAAAACAAT GTCCTTCGCT GAGGCCAAAG 2040 

TTTCCATGTA AGCGGGATCC GTTTTTTGGA ATTTGGTTGA AGTCACTTTG ATTTCTTTAA 2100 

AAAACATCTT TTCAATGAAA TGTGTTACCA TTTCATATCC ATTGAAGCCG AAATCTGCAT 2160 

AAGGAAGCCC ACTTTATCTA AATGATATTA GCCAGGATCC TTGGTGTCCT AGGAGAAACA 2220

GACAAGCAAA ACAAAGTGAA AACCGAATGG ATTAACTTTT GCAAACCAAG GGAGATTTCT 2280 

TAGCAAATGA GTCTAACAAA TATGACATCC GTCTTTCCCA CTTTTGTTGA TGTTTATTTC 2340 

AGAATCTTGT GTGATTCATT TCAAGCAACA ACATGTTGTA TTTTGTTGTG TTAAAAGIAC 2400 

TTTTCTTGAT TTTTGAATGT ATTTGTTTCA GGAAGAAGTC ATTTTATGGA TTTTTCTAAC 2460

CCGTGTTAAC TTTTCTAGAA TCCACCCTCT TGTGCCCTTA AGCATTACTT TAACTGGTAG 2520 

GGAACGCCAG AACTTTTAAG TCCAGCTATT CATTAGATAG TAATTGAAGA TATGTATAAA 2580 

TATTACAAAG AATAAAAATA TATTACTGTC TCTTTAGTAT GGTTTTCAGT GCAATTAAAC 2640 

CGAGAGATGT CTTGTTTTTT TAAAAAGAAT AGTATTTAAT AGGTTTCTGA CTTTTGTGGA 2700 
TCATTTTGCA CATAGCTTTA TCAACTTTTA AACATTAATA AACTGATTTT TTTAAAG 



I I I I I 

(C AHRSSVSDYV NYDIIVRHYN YTGKLNISAD KENSIKLTSV VF] 

CKFHRPM YYFIGNLALS DLIjAGVAYTA NLLLSGATTY KLTPAQWFLR 
A SVFSLLAIAI ERYITMLKMK L 
GWNCISALSS CSTVLPLYHK HYILFCT 

ISKASRSSEN VALLKTVIIV LSVFIACWAP LFILLLLDVG CI 
LNSGTNPIIY TLTNKEMRRA FIRIMSCCKC PSGDSAGKFK R! 
QKDEGDNPET IMSSGNVNSS S 

A3 DNA SEQUENCE 

Gene name: G protein-c

Unigene number: Hs. 198612 

Probeset Accession #: AA452928 

Nucleic Acid Accession #: NM_004624.1 
correspond to start and stop codons)



1 11 21 31 41 51 

I I | I I I 

ATGGCTTCCC CGCGGAGGTC CGGGCAGCCA GGGCGGCCGC CGCCGCCGCC ACCGCCGCCC 60 

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCIGC CTCTGGCGCC CGGGGCCTGG 120 

GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240 

GTGGAACTGG CCA1CGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 

ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600 

CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 

3 AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 



G GAGCAGGTGC ACACGGAAGC CAACTCATCC 



CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 1260 

GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320

GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 

CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500 

TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560 

AACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 

GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740 

ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800 

GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGCC TGTGGACCCC 1860 

CTGCCAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920 

ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980

TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040 

CTCACCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG 2100 

ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220

GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280

CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 2340

AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 2400 

AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 

AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 

CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 



1 Protein sequence: 



G protein-coupled r 



Unigene number: 

Probeset Accession #: AA452928 

Protein Accession #: NP_005449.
Signal sequence: 



I I I I 

MASPRRSGQP GRPPPPPPPP ARLLLLLLLP LLI 
LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 
IKYGPNHLMV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFFRTVPSDN
AVNPAILKLL KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY GEDI3ISDTS SFSNDPCTSV 
KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQHI IPGWYEPSNW EQVHTEANSS 
RCLRKNLLAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER EYNNKRSGVG PSKFHGYAYD 
GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQVVFRNG
ERMGTIKFTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP
LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD
GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI
VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV
YAYKGLLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ
FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV
NQASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQNH YQELNDILNL
GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 
Probeset Accession #: AW043782
Nucleic Acid Accession #: none found 

Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 



10 



AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 
G AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC G 



I GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG

AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 
GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 
AAGTTCTCAA GAACCCGGCA G 



GAATGCGTCG 
TGCGTGGTAT GACCTTCCTC CACCGCCCTA 
CGACCTGCCC CCCTACCGCT 



TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640 

TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAAGGG GTAAAACGTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 
CCACTCCGGG CAGCTGTCAC CCATTCAGAA C 
AACCTGTTTG ACGCTAATTA A 
TGATCCTGTT CTGTAGACTT T 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 
AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA 



I TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 
C TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA A 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 
AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG T 
AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC C 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 
GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG C 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 
Unigene n
Protein Accession #:
Transmembrane domains: 



AW043782 



I I 1 I I I 1 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 
EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 
KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 
VI FVLWALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL WLDHPHHCN VTYNVKNGIQ 
YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT 



A7 DNA SEQUENCE 



Gene name: Integrin, alpha 5 (fibronectin receptor, alpha polypeptide)

Unigene number: Hs. 149609

Probeset Accession #: X06256

Nucleic Acid Accession #: NM_002205

Coding sequence: 1-3150 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCOSGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420

TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780

GC M3GCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCIGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 



TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 
Gene name: Integrin, alpha 5 (fibronectin receptor, alpha polypeptide)

Probeset Accession #: X06256
Protein Accession # : NP_002196 
Signal sequence: 1-42 

, 452-503, 1022-1036 



MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA V
GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL
LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST
DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ
IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY
GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ
EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET
QQGVVFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG
VDKAVVYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI
GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK
LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF
GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY
FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDVV
SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 
SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 
SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 



A9 DNA SEQUENCE 

Gene name: Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12)

Unigene number: Hs. 287797 

Coding sequence: 1-2397 (underlined sequences correspond to start and stop codons) 



I I I I I 

^AATTTT CTGGATTGGA CTGATCAGTT CAGTTTGCTG TGTGTTTGCT 
CAAACAGATG AAAATAGATG TTTAAAAGCA AATGCCAAAT CATGTGGAGA ATGTATACAA 
GCAGGGCCAA ATTGTGGGTG GTGCACAAAT TCAACATTTT TACAGGAAGG AATGCCTACT 
TCTGCACGAT GTGATGATTT AGAAGCCTTA AAAAAGAAGG GTTGCCCTCC AGATGACATA 
GAAAATCCCA GAGGCTCCAA AGATATAAAG AAAAATAAAA ATGTAACCAA CCGTAGCAAA 
GGAACAGCAG AGAAGCTCAA G 



TATCCCATTG ACCTCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAGA CGATTTGGAG 
AATGTAAAAA GTCTTGGAAC A

AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC C 

GCTAAGCTCA GGAACCCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT TAGCTACAAA 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACTTGTTGG AAAACAGCGC 

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 
TGTGGATCAC TGATTGGCTG GAGGAATGTT ACACGGCTGC TGGTGTTTTC CACAGATGCC

GGGTTTCACT TTGCTGGAGA TGGGAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 

TGTCACCTGG A 



G TTTACAAGGA GCTGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080

TCTGCAAATT CTAGCAATGT AATTCAGTTG ATCATTGATG CATACAATTC CCTTTCCTCA 1140 

GAAGTCATTT TGGAAAACGG CAAATTGTCA GAAGGAGTAA CAATAAGTTA CAAATCTTAC 1200 

TGCAAGAACG GGGTGAATGG AACAGGGGAA AATGGAAGAA AATGTTCCAA TATTTCCATT 1260 

GGAGATGAGG TTCAATTTGA AATTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATGCCA AAGCGAAGGC ATCCCTGAAA GTCCCAAGTG TCATGAAGGA 1440

AATGGGACAT TTGAGTGTGG CGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 1500 

GAATGCAGCA CAGATGAAGT TAACAGTGAA GACATGGATG CTTACTGCAG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAATGGAGAG TGCGTCTGCG GACAGTGTGT TTGTAGGAAG 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTGATAA TTTCAACTGT 1680

GATAGATCCA ATGGCTTAAT TTGTGGAGGA AATGGTGTTT GCAAGTGTCG TGTGTGTGAG 1740

TGCAACCCCA ACTACACTGG CAGTGCATGT GACTGTTCTT TGGATACTAG TACTTGTGAA 1800

GCCAGCAACG GACAGATCTG CAATGGCCGG GGCATCTGCG AGTGTGGTGT CTGTAAGTGT 1860

ACAGATCCGA AGTTTCAAGG GCAAACGTGT GAGATGTGTC AGACCTGCCT TGGTGTCTGT 1920

GCTGAGCATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAGAAAA GAAAGACACA 1980

TGCACACAGG AATGTTCCTA TTTTAACATT ACCAAGGTAG AAAGTCGGGA CAAATTACCC 2040

CAGCCGGTCC AACCTGATCC TGTGTCCCAT TGTAAGGAGA AGGATGTTGA CGACTGTTGG 2100

TTCTATTTTA CGTATTCAGT GAATGGGAAC AACGAGGTCA TGGTTCATGT TGTGGAGAAT 2160

CCAGAGTGTC CCACTGGTCC AGACATCATT CCAATTGTAG CTGGTGTGGT TGCTGGAATT 2220
CCTATTTATA AGAGTGCCGT AACAACTGTG GTCAATCCGA A
A10 Protein sequence:

Gene name: Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12)

Probeset Accession #: X07979
Protein Accession #: NP_002202.1 






MNLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT
SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTAEKLKPED ITQIQPQQLV
LRLRSGEPQT FTLKFKRAED YPIDLYYLMD LSYSMKDDLE NVKSLGTDLM NEMRRITSDF
RIGFGSFVEK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNELVGKQR
ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GFHFAGDGKL GGIVLPNDGQ
CHLENNMYTM SHYYDYESIA HLVQKLSENN IQTIFAVTEE FQPVYKELKN LIPKSAVGTL
S EGVTISYKSY CKNGVNGTGE NGRKCSNISI 



NGTFECGACR CNEGRVGRHC ECSTDEVNSE DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 
RDNTNEIYSG KFCECDNFNC DRSNGLICGG NGVCKCRVCE CNPNYTGSAC DCSLDTSTCE
ASNGQICNGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT
CTQECSYFNI TKVESRDKLP QPVQPDPVSH CKEKDVDDCW FYFTYSVNGN NEVMVHVVEN



A11 DNA SEQUENCE
Unigene number: 

M24736 

Nucleic Acid Accession #: NM_000450
Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons)



Gene name: selectin E (endothelial adhesion molecule 1)

Unigene number: 



TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT C 
TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 
GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 
GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA
GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 
TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 
ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 
AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 
ACCATCCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG C



AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTGA ACTAATGGGA 840 

C TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900

T GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960

CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGGCTTCATG 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

G CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 

:CGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 

GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260

GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320

GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 

TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 

AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 

CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620

TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



Selectin E (endothelial adhesion molecule 1 
NP_000441 



Signal sequence: 
Transmembrane domains: 555-573 
C-lectin domain: 23-139 
Cellular Localization: plasma ir



I I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120

DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180

TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TIQCMSSGEW SAPIPACNVV 240

ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 

A13 DNA SEQUENCE 

Gene name: G protein-coupled receptor 39

Unigene number: Hs. 85339 

• '■ AA3498S3 

1-1362 (underlined sequences correspond t

21 31 41 51 

I I I I 

C CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60 

CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120 

TTCGTGATGG GCCTTCTGGG GAACAGCGCC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180

I TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240 

T GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 

I ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

GCTACGCTGC TGCACGTGCT GACACTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 

GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600 

CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660

A TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

\ ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780 

Z CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840 

r TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960 

GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080

TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA

A14 Protein sequence: 



Unigene number: 
Probeset Accession #:

Protein Accession #: NM_001508, NP_001409

Pfam domains: 7tm_1 [72-172, 224-344]

Transmembrane domains: 32-54, 68-90, 111-133, 15 

Cellular Localization: plasma membrane 



I I I I I I 

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSA TIRVTQVLQK 

KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 

ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 

VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF

MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ

IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC

CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV
I I I I I I

GGCGICCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 
A CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 
3 CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT
I CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 
3 CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 
C TCGGAAGAGG 



TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 



AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC 

CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680

GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860

AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 

CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 



GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 

GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580

AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 

CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 

CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 

TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATK3AA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 

AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 

GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 



C TTCCTCTAGC : 

CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG : 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG : 
ACTCAGTTTC TCCACAGCCT T 
AGTTCTAAGC AGTGCTCGTG 
AGCACTTCTG GAGACAT 

A16 Protein sequence 

Unigene number: 
Probeset Accession 

Protein Accession #: CAB92285 

Signal sequence: 1-31 

Transmembrane domains: none

PFAM domains: EGF-like_domains [49-84, 132-167, 177-213, 286-321, 4C

CUB_domain [809-918] 

Cellular Localization: may be secreted 



I I I I I I

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60

LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 

HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQKTCIHR SEEGLSCMHK 180 

DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 

PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVMN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 



Unigene number: Hs. 293102 

, AW17003S, AL044891, 



sequence i 



U » 31 „ f 

A TAGATAAAAT AAATGGAAAA TTAGAAGAGT CTCCTGATAA TGATGGTTTT 

CTGAAGGCTC CCTGCAGAAT GAAAGTTTCT ATTCCAACTA AAGCCTTAGA ATTGATGGAC 

ATGCAAACTT TCAAAGCAGA GCCTCCCGAG AAGCCATCTG CCTTCGAGCC TGCCATTGAA

ATGCAAAAGT CTGTTCCAAA TAAAGCCTTG GAATTGAAGA ATGAACAAAC ATTGAGAGCA 

GATCAGATGT TCCCTTCAGA ATCAAAACAA AAGAASGTTG AAGAAAATTC TTGGGATTCT 

GAGAGTCTCC GTGAGACTGT TTCACAGAAG GATGTGTGTG TACCCAAGGC TACACATCAA 

AAAGAAATGG ATAAAATAAG TGGAAAATTA GAAGATTCAA CTAGCCTATC AAAAATCTTG 

GATACAGTTC ATTCTTGTGA AAGAACAAGG GAACTTCAAA AAGACCCCTG TGACCCACGT
TCAGGAAAA 

A18 Prot



Unigene number: Hs. 293102

Probeset Accession #: AI9S1118, AW17003S,

Protein Accession #: none found

Signal sequence: none 



Cellular Localization: 



QKEIDKINGK LEESPDNDGF LKAPCRMKVS IPTKALELMD MQTFKAEPPE KPSAFSPAIE
MQKSVPNKAL ELKNEQTLRA DQMFPSESKQ KXVEENSWDS ESLRETVSQK DVCVPXATHQ
KEMDKISGKL EDSTSLSKIL DTVHSCERTR ELQKDPCDPR SGK



Gene name: 
Unigene number: 
Probeset Accession #:



I I I I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 



: and stop codons) 






WO 03/042661 



PCT/US02/36810 






CAGCGGGGGG 



TTCCGGCACC AGTAGCAGGC CCGAAAGAGG 



CCTCAAATGA TGTGCCTGCT 



CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560

TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620

CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

CACCCTTCCA TCTGATTCCA GCACAACCTT 1740

GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 



CAGGTGCCCG C 
ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 
ACAGTGCTGG GATTACAGGC A 



CAAAGCCACC AACAGCCACT 



CCAAACCCAC CCTTTCTACT 
ATGACTCGTT TAAGGCCTAT 
TTGTCCTTTG ATTCCAAATA ATATGTTTCC 



TCTTTGTAGC ATTTGGTGCT 



TAATCTAGGA 



A20 Protein sequence: 



Gene name: TMPRSS3a
Unigene number: Hs.2S8241
Probeset Accession #: AI53S613
Protein Accession #: EAB20077
Signal S'



t determined 



1 11 21 31 41 51 

I I I I I I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA
AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT 

ESTs; opposite strand to TRPS1 



t and stop codons) 



GATTCTGTTG GGCTTCCTGG TAGAGCTGGA GACAGACTGC AACCCAAGAC AAAACCCAGA 
A CTAACCTCAG CCATTTGCCC AATTTCCACC TTGCTAGTTA TCAGAGTTCA 
A AGCTCCAGGG GTCATACTAT GGCAATCTAC TGGTGGAATT TATTCTGAGC 
G TTCAGCTTGC AGTATTAAAA CAAAAAAAAC ATTGCCTCCA GCTGCAAAGC 
AAGGGCATTG CCATTATGAA AGCCCCTCAA AGACTCTCTG CTATTTTCAA AACATGGAAA 
GAAAAAGGGA AAAAAGAAAA AAAAATAATA ATTAGAAGGA T 
CCCAAAAATG AGAAATG 
CAAGCAAATC TTCTGAAGCA A 

\ TGCAGGTGAG TCTTGCCAAC GTCCTTTCCT A 




CTTTTTACAC AAAATATAT C AGTGAGAGAG 



GATTTCACCA TTTCAATTCA GCAGAGCCTG TATATATATA T, 

TATATATATA TATATATATA TATATTACAA TGATCTGTAT TTCCTATTGC TAGAAGGATG 
AAAGTGAATC CATATAAACC ATACCAACGC CGTTATGTGT AACTGGTGGT A 



GA GTAGAAATGA ACATGCACTA TGAAAACAAA ATAAAATAAA 1440 

I TTCATGTGTT GTAAGAACAG AACTATTATA GCCAACATTC TAGTATTCAA 1500 

ATCAGGACTA CAAATTGAAT TCTTTTTCTT AGCAACATGA AATCATTCCA TATGAAAGAC 1560 

ATTTTCTGCT GGTGAATATT GCTGTAAGTT AAATTTTACA TTGGCATTTT GAGATGTTCC 1620 
CCCCTCATGC CTCCCCCAAA GTTTTCCATG TGGTTGTCAA ATAGTCCGC 



A22 Protein sequence: 
Gene name: ESTs; opposite s 

Probeset Accession #: AA428090 

Protein Accession #: none found 

Signal sequence: none found 

Transmembrane domains: 113-129 

Cellular Localization: not determined 



R QWGEEIHLQ WSVSYLVEtJ FSDTDDLMSI VEESDSCYHR 
DSVGLPGRAG DRLQPKTKPR GKTTNLSHLP NFHLASYQSS LLFKLQGSYY GNLLVEFILS 
KCFVQLAVLK QKKHCLQLQS KGIAIMKAPQ RLSAIFKTWK EKGKKEKKII IRRICSLIWA 



I I I I 

3 CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 

C GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 

CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 

GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 

GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 

CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 

CCTCACATTC CTGCTCTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 

GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 

GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960 

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCOCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 

GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 

GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 

TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 

TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 

CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 

AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 

ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 

ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860 

CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 

AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 




Protein Accession #: T43457 

Signal sequence: none found 

Transmembrane domains: 303-322 

Cellular Localization: not determined 



MSGAGVAAGT RPPSSPTPGS RERRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 

QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 

GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180 

GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 

MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 

AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 

LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG HHLSRASAPL 420 

GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 

SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTKLLQ 600 

TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

A25 DNA SEQUENCE 

Gene name: ESTs; prolactin receptor 

Unigene number: Hs. 25252 

Probeset Accession #: AA057193 
Nucleic Acid Accession #: NM_000949 

Coding sequence: 285-2153 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 

CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 

ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 

TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 

GTGAACTTCT GAIACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 

TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 

GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 

GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 

GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 

AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 

ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 

TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 



TACCTAGTGA CTTCACCATG A 
CTGTCATCTG TTTGATTATT G 

GCATCTTTCC GCCAGTTCCT G 

AGGQCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA A 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA A 
A TGTTCAACAT GGCCCTTACC A 

\CTGATG TGTGTGAGCT GGCTGTGGGC C 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT A 

A AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC C 
A TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 
CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG T 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC A 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 
T TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
A TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 
AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

A26 Protein sequence: 

Gene name: ESTs; prolactin receptor 

Unigene number: Hs. 25252 

Probeset Accession #: AA057193 

Protein Accession #: NP_000940 

Signal sequence: 1-23 

FN3 domains: 28-112, 127-215 

Cellular Localization: plasma membrane 



MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 

SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 

VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLYEI RLKPEKAAEW 180 

EIHFAGQQTE FKILSLHPGQ KYLVQTOCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 

VAVLSAVICL IIWAVALKG YSMUTCIFPP VPGPKIKGFD AHLLEKGKSE SLLSALGCQD 300 

FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 

SEKCEEPQAN PSTFYDPEVI EKPENPETTH TKDPQCISME GKIPYFHAGG SKCSIKPLPQ 420 
PSQHNPESSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 
SPHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 



Human neuropeptide Y 



203-1363 (underlined sequences correspond to start and stop codons) 



CATTCCCACC CTTCCTTCTT TAATAAGCAG GAGCGAAAAA GACAAATTCC AAAGAGGATT 
A TAATTTTGGT AAATGGATTC CAATATCGGG 
ATACTGTCCA TTTGTCTAAA 
TTATTTTCCC A 
GAATGCCCAG CTTCTGGCTT T 
TACCTTAGCT CTTGCTTATG GAGCTGTGAT 
GATCATAATC ATCTTGAAAC A 



TAATAGACAT GCTTATGTAG GTATTGCTGT GATTTGGGTC C 
GCCTTTCCTG ATCTACCAAG TAATGACTGA TGAGCCGTTC CAAAATGTAA CACTTGATGC 
GTACAAAGAC AAATACGTGT GCTTTGATCA ATTTCCA 
TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT T 
CTTCAAGATA TATATACGCC TAAAAAGGAG AAACAACATG ATGGACAAGA TGAGAGACAA 
TAAGTACAGG TCCAGTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTGTGGTAGC 



CATGTCCACG ATGCACACAG ATGTTTCCAA AACTTCTTTG AAGCAAGCAA GCCCAGTCGC 1320 

ATTTAAAAAA ATCAACAACA ATGATGATAA TGAAAAAATC TGAAACTACT TATAGCCTAT 1380 

GGTCCCGGAT GACATCTGTT TAAAAACAAG CACAACCTGC AACATACTTT GATTACCTGT 1440 

TCTCCCAAGG AATGGGGTTG AAATCATTTG AAAATGACTA AGATTTTCTT GTCTTGCTTT 1500 

TTACTGCTTT TGTTGTAGTT GTCATAATTA CATTTGGAAC AAAAGGTGTG GGCTTTGGGG 1560 

TCTTCTGGAA ATAGTTTTGA CCAGACATCT TTGAAGTGCT TTTTGTGAAT TTATGCATAT 1620 

AATATAAAGA CTTTTATACT GTACTTATTG GAATGAAATT TCTTTAAAGT ATTACGATGC 1680 

GCTGACTTCA GAAGTACCTG CCATCCAATA CGGTCATTAG ATTGGGTCAT CTTGATTAGA 1740 

TTAGATTAGA TTAGATTGTC AACAGATTGG GCCATCCTTA CTTTATGATA GGCATCATTT 1800 

TAGTGTGTTA CAATAGTAAC AGTATGCAAA AGCAGCATTC AGGAGCCGAA AGATAGTCTT 1860 

GAAGTCATTC AGAAGTGGTT TGAGGTTTCT GTTTTTTGGT GGTTTTTGTT TGTTTTTTTT 1920 

TTTTTTCACC TTAAGGGAGG CTTTCATTTC CTCCCGACTG ATTGTCACTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTGGGCACCC 2040 

ACAGGAATGA AGAGAGAAAG CAGCTCCCCA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAGCATTTT AGAGTAATTA ATTTAATAAA GTAAATTAGT ATTGCTGCAA ATAGCTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAGAGAT TTTCCATTTT TTTTACAGAC TGTTCAGTGT 2220 

TTGTCAAGCT TCTGGTCTAA TATGTACTCG AAAGACTTTC CGCTTACAAT TTGTAGAAAC 2280 

ACAAATATCG TTTTCCATAC AGCAGTGCCT ATATAGTGAC TGATTTTAAC TTTCAATGTC 2340 

CATCTTTCAA AGGAAGTAAC ACCAAGGTAC AATGTTAAAG GAATATTCAC TTTACCTAGC 2400 

AGGGABAAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

GTGTGACTTG TGGCGTCTTA TAAATAATGC ACTGTAAAGA TTACTGAATA GTTGTGTCAT 2520 

GTTAATGTGC CTAATTTCAT GTATCTTGTA ATCATGATTG AGCCTCAGAA TCATTTGGAG 2580 



L [57-91] 



Transmembrane domains: 

Cellular Localization: plasma 

1 11 21 31 41 51 

I I I I I I 

MHSTLFSQVE NHSVHSNFSE KNAQLLAFEN DDCHLPIiAMI FTLALAYGAV IILGVSGNLA 60 

LIIIILKQKE MENVTNILIV NLSFSDLLVA IMCLPFTFVY TLMDHWVFGE AMCKLNPFVQ 120 

CVSITVSIFS LVLIAVERHQ LIINPRGWEP NHRHAYVGIA VIWVLAVASS LPFLIYQVMT 180 

DEPFQNVTLD AYKDKYVCFD QFPSDSHRLS YTTLLLVLQY FGPLCFIFIC YFKIYIRLKR 240 

RNNMMDKMRD NKYRSSETKR INIMLLSIVV AFAVCWLPLT IFHTVFOWNH QIIATCNHNL 300 

LFLLCHLTAM ISTCVNPIFY GFLNKNFQRD LQFFFNFCDF RSRDDDYETI AMSTMHTDVS 360 
KTSLKQASPV AFKKINNNDD NEKI 



A29 DNA SEQUENCE 

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175) 
Unigene number: Hs. 208229 

Probeset Accession #: AI819198 
Nucleic Acid Accession #: NM_032551.1 

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons) 



ATGCACACCG TGGCTACGTC CGGACCCAAC GCGTCCTGGG GGGCACCGGC CAACGCCTCC 



CTGCTGTACC CGCTGCCCGG CTGGGTGCTG GGCGACTTCA TGTGCAAGTT C 
ATCCAGCAGG TCTCGGTGCA GGCCACGTGT G 
TGGTACGTGA CGGTGTTCCC GTTGCGCGCC CTGCACCGCC GCACGCCCCG CCTGGCGCTG 
GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCGCCGGT GCTCGCCCTG 
CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG C 
GAGCGCGCCT TCGCACTGTA C 



C TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 960 

:CCTGGG CTCGCACTTC CGACAGGCCT TCCGCCGCGT CTGCCCCTGC 1020 

GCGCCGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 1080 

GCGGAGCTGC ACCGCCTGGG GTCCCACCCG GCCCCCGCCA GGGCGCAGAA GCCAGGGAGC 1140 
AGTGGGCTGG CCGCGCGCGG GCTGTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA 

A30 Protein sequence 

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175), mRNA 

Unigene number: Hs. 208229 

Protein Accession # : AI819198 

Signal sequence: none found 



MHTVATSGPN ASWGAPANAS GCPGCGANAS DGPVPSPRAV DAWLVPLFFA ALMLLGLVGN 
SLVIYVICRH KPMRTVTNFY IANLAATDVT FLLCCVPFTA LLYPLPGWVL GDFMCKFVNY 
IQQVSVQATC ATLTAMSVDR WYVTVFPLRA LHRRTPRLAL AVSLSIWVGS AAVSAPVLAL 



ADSALQGQVL AERAGAVRAK VSRLVAAWL LFAACWGPIQ LFLVLQALGP AGSWHPRSYA 
AYALKTWAHC MSYSNSALNP LLYAFLGSHF RQAFRRVCPC APRRPRRPRR P 
AELHRLGSHP APARAQKPGS SGLAARGLCV L 



Unigene number: Hs. 252387 

Probeset Accession #: NM_014246 

Coding sequence: 1-9045 (underlined sequences correspond to start and stop codons) 





G TCGCCATCCC C 

G AAGCCCGGGC GGGGCCGGCG CGACGGGCCC GGCGGGGCAC GAGCGGCAGA 720 

GGGAGCCTGA AGTTTCCGAT GCCCAACTAC CAGGTGGCGT TGTTTGAGAA CGAACCGGCG 780 

GGCACCCTCA TCCTCCAGCT GCACGCGCAC TACACCAICG AGGGOGAGGA GGAGCGCGTG 840 

AGCTATTACA TGGAGGGGCT GTTCGACGAG CGCTCCCGGG GCTACTTCCG AATCGACTCT 900 

GCCACGGGCG CCGTGAGCAC GGACAGCGTA CTGGACCGCG AGACCAAGGA GACGCACGTC 960 

A CTACAGTACG CCGCCGCGCT CGGCCACCAC CTACATCACT 1020 

ACAGC CCGGTCTTCG AGCAGTCGGA GTACCGCGAG 1080 

CGCGTGCGGG AGAACCTGGA GGTGGGCTAC GAGGTGCTCA CCATCCGCGC CAGCGACCGC 1140 






GACTCGCCCA TCAACGCCAA CTTGCGTTAC CGCGTGTTGG GGGGCGCGTG GGACGTCTTC 
CAGCTCAACG AGAGCTCTGG CGTGGTGAGC ACACGGGCGG TGCTGGACCG GGAGGAGGCG 
GCCGAGTACC AGCTCCTGGT GGAGGCCAAC GACCAGGGGC GCAATCCGGG CCCGCTCAGT 
GCCACGGCCA CCGTGTACAT CGAGGTGGAG GACGAGAACG ACAACTACCC CCAGTTCAGC 
GAGCAGAACT ACGTGGTCCA GGTGCCCGAG GACGTGGGGC TCAACACGGC TGTGCTGCGA 
GTGCAGGCCA CGGACCGGGA CCAGGGCCAG AACGCGGCCA TTCACTACAG CATCCTCAGC 
GGGAACGTGG CCGGCCAGTT CTACCTGCAC TCGCTGAGCG GGATCCTGGA TGTGATCAAC 
CCCTTGGATT TCGAGGATGT CCAGAAATAC TCGCTGAGCA TTAAGGCCCA GGATGGGGGC 
CGGCCCCCGC TCATCAATTC TTCAGGGGTG GTGTCTGTGC AGGTGCTGGA TGTCAACGAC 
AACGAGCCTA TCTTTGTGAG CAGCCCCTTC CAGGCCACGG TGCTGGAGAA TGTGCCCCTG 
GGCTACCCCG TGGTGCACAT TCAGGCGGTG GACGCGGACT CTGGAGAGAA CGCCCGGCTG 
CACTATCGCC TGGTGGACAC GGCCTCCACC TTTCTGGGGG GCGGCAGCGC TGGGCCTAAG 
AATCCTGCCC CCACCCCTGA CTTCCCCTTC CAGATCCACA ACAGCTCCGG TTGGATCACA 
GTGTGTGCCG AGCTGGACCG CGAGGAGGTG GAGCACTACA GCTTCGGGGT GGAGGCGGTG 
GACCACGGCT CGCCCCCCAT GAGCTCCTCC ACCAGCGTGT CCATCACGGT GCTGGACGTG 
AATGACAACG ACCCGGTGTT CACGCAGCCC ACCTAOGAGC T 



CCAGAGAGGG 



GCGGTGACAG CATCCGACGG C 

GATGCCAACA CCCACAGGCC TGTCTTTCAG AGCTCCCATT ACACAGTGAG TGTCAGTGAG 
GACAGGCCTG TGGGCACCTC CATTGCTACC CTCAGTGCCA ACGATGAGGA CACAGGAGAG 
AATGCCCGCA TCACCTACGT GATTCAGGAC CCCGTGCCGC AGTTCCGCAT TGACCCCGAC 
AGTGGCACCA TGTACACCAT GATGGAGCTG GACTATGAGA ACCAGGTCGC CTACACGCTG 
ACCATCATGG CCCAGGACAA CGGCATCCCG CAGAAATCAG A 
CTCATCCTCG ATGCCAATGA CAATGCACCC CAGTTCCTGT GGGATTTCTA C 
ATCTTTGAGG ATGCTCCACC CTCGACCAGC ATCCTCCAGG TCTCTGCCAC G 

TCAGGTCCCA ATGGGCGTCT GCTGTACACC TTCCAGGGTG GGGACGACGG CGATGGGGAC 2820 

TTCTACATCG AGCCCACGTC CGGTGTGATT CGCACCCAGC GCCGGCTGGA CCGGGAGAAT 2880 

GTGGCCGTGT ACAACCTTTG GGCTCTGGCT GTGGATCGGG GCAGTCCCAC TCCCCTTAGC 2940 

GCCTCGGTAG AAATCCAGGT GACCATCTTG GACATTAATG ACAATGCCCC CATGTTTGAG 3000 

AAGGACGAAC TGGAGCTGTT TGTTGAGGAG AACAACCCAG TGGGGTCGGT GGTGGCAAAG 3060 

ATTCGTGCTA ACGACCCTGA TGAAGGCCCT AATGCCCAGA TCATGTATCA GATTGTGGAA 3120 

GGGGACATGC GGCATTTCTT CCAGCTGGAC CTGCTCAACG GGGACCTGCG TGCCATGGTG 3180 

GAGCTGGACT TTGAGGTCCG GCGGGAGTAT GTGCTGGTGG TGCAGGCCAC GTCGGCTCCG 3240 

CTGGTGAGCC GAGCCACGGT GCACATCCTT CTCGTGGACC AGAATGACAA CCCGCCTGTG 3300 

CTGCCCGACT TCCAGATCCT CTTCAACAAC TATGTCACCA ACAAGTCCAA CAGTTTCCCC 3360 

ACCGGCGTGA TCGGCTGCAT CCCGGCCCAT GACCCCGACG TGTCAGACAG CCTCAACTAC 3420 

ACCTTCGTGC AGGGCAACGA GCTGCGCCTG TTGCTGCTGG ACCCCGCCAC GGGCGAACTG 3480 

CAGCTCAGCC GCGACCTGGA CAACAACCGG CCGCTGGAGG CGCTCATGGA GGTGTCTGTG 3540 

TCTGATGGCA TCCACAGCGT CACGGCCTTC TGCACCCTGC GTGTCACCAT CATCACGGAC 3600 

GACATGCTGA CCAACAGCAT CACTGTCCGC CTGGAGAACA TGTCCCAGGA GAAGTTCCTG 3660 

TCCCCGCTGC TGGCCCTCTT CGTGGAGGGG GTGGCCGCCG TGCTGTCCAC CACCAAGGAC 3720 

GACGTCTTCG TCTTCAACGT CCAGAACGAC ACCGACGTCA GCTCCAACAT CCTGAACGTG 3780 

ACCTTCTCGG CGCTGCTGCC TGGCGGCGTC CGCGGCCAGT TCTTCCCGTC GGAGGACCTG 3840 

CAGGAGCAGA TCTACCTGAA TCGGACGCTG CTGACCACCA TCTCCACGCA GCGCGTGCTG 3900 

CCCTTCGACG ACAACATCTG CCTGCGCGAG CCCTGCGAGA ACTACATGAA GTGCGTGTCC 3960 

GTTCTGCGAT TCGACAGCTC CGCGCCCTTC CTCAGCTCCA CCACCGTGCT CTTCCGGCCC 4020 

ATCCACCCCA TCAACGGCCT GCGCTGCCGC TGCCCGCCCG GCTTCACCGG CGACTACTGC 4080 

GAGACGGAGA TCGACCTCTG CTACTCCGAC CCGTGCGGCG CCAACGGCCG CTGCCGCAGC 4140 

CGCGAGGGCG GCTACACCTG CGAGTGCTTC GAGGACTTCA CTGGAGAGCA CTGTGAGGTG 4200 

GATGCCCGCT CAGGCCGCTG TGCCAACGGG GTGTGCAAGA ACGGGGGCAC CTGCGTGAAC 4260 

CTGCTCATCG GCGGCTTCCA CTGCGTGTGT CCTCCTGGCG AGTATGAGAG GCCCTACTGT 4320 

GAGGTGACCA CCAGGAGCTT CCCGCCCCAG TCCTTCGTCA CCTTCCGGGG CCTGAGACAG 4380 

CGCTTCCACT TCACCATCTC CCTCACGTTT GCCACTCAGG AAAGGAACGG CTTGCTTCTC 4440 

TACAACGGCC GCTTCAATGA GAAGCACGAC TTCATCGCCC TGGAGATCGT GGACGAGCAG 4500 

GTGCAGCTCA CCTTCTCTGC AGGCGAGACA ACAACGACCG TGGCACCGAA GGTTCCCAGT 4560 

GGTGTGAGTG ACGGGCGGTG GCACTCTGTG CAGGTGCAGT ACTACAACAA GCCCAATATT 4620 

GGCCACCTGG G 



GATTGTGACA CAACCATGGC TGTGCGCTTT GGAAAGGACA TCGGGAACTA CAGCTGCGCT 4740 

GCCCAGGGCA CTCAGACCGG CTCCAAGAAG TCCCTGGATC TGACCGGCCC TCTACTCCTG 4800 

GGGGGTGTCC CCAACCTGCC AGAAGACTTC CCAGTGCACA ACCGGCAGTT CGTGGGCTGC 4860 

ATGCGGAACC TGTCAGTCGA CGGCAAAAAT GTGGACATGG CCGGATTCAT CGCCAACAAT 4920 

GGCACCCGGG AAGGCTGCGC TGCTCGGAGG AACTTCTGCG ATGGGAGGCG GTGTCAGAAT 4980 

GGAGGCACCT GTGTCAACAG GTGGAATATG TATCTGTGTG AGTGTCCACT CCGATTCGGC 5040 

GGGAAGAACT GTGAGCAAGC CATGCCTCAC CCCCAGCTCT TCAGCGGTGA GAGCGTCGTG 5100 

TCCTGGAGTG ACCTGAACAT CATCATCTCT GTGCCCTGGT ACCTGGGGCT CATGTTCCGG 5160 

ACCCGGAAGG AGGACAGCGT TCTGATGGAG GCCACCAGTG GTGGGCCCAC CAGCTTTCGC 5220 

' ' ~ IT GAGGTGTCCC ACGGCCCCTC CGATGTGGAG 5280 

T GCGGGTGACC GACGGGGAGT GGCACCACCT GCTGATCGAG 5340 

A CAGTGAGATG AAGCACCTGG TCACCATGAC CTTGGACTAT 5400 

GGGATGGACC AGAACAAGGC AGATATCGGG GGCATGCTTC CCGGGCTGAC GGTAAGGAGC 5460 

GTGGTGGTCG GAGGCGCCTC TGAAGACAAG GTCTCCGTGC GCCGTGGATT CCGAGGCTGC 5520 

ATGCAGGGAG TGAGGATGGG GGGGACGCCC ACCAACGTCG CCACCCTGAA CATGAACAAC 5580 

GCACTCAAGG TCAGGGTGAA GGACGGCTGT GATGTGGACG ACCCCTGTAC CTCGAGCCCC 5640 

TGTCCCCCCA ATAGCCGCTG CCACGACGCC TGGGAGGACT ACAGCTGCGT CTGTGACAAA 5700 

GGGTACCTTG GAATAAACTG TGTGGATGCC TGTCACCTGA ACCCCTGCGA GAACATGGGG 5760 

GCCTGCGTGC GCTCCCCCGG CTCCCCGCAG GGCTACGTGT GCGAGTGTGG GCCCAGTCAC 5820 

TACGGGCCGT ACTGTGAGAA CAAACTCGAC CTTCCGTGCC CCAGAGGCTG GTGGGGGAAC 5880 

CCCGTCTGTG GACCCTGCCA CTGTGCCGTC AGCAAAGGCT TTGATCCCGA CTGTAATAAG 5940 

ACCAACGGCC AGTGCCAATG CAAGGAGAAT TACTACAAGC TCCTAGCCCA GGACACCTGT 6000 

CTGCCCTGCG ACTGCTTCCC CCATGGCTCC CACAGCCGCA CTTGCGACAT GGCCACCGGG 6060 

CAGTGTGCCT GCAAGCCCGG CGTCATCGGC CGCCAGTGCA ACCGCTGCGA CAACCCGTTT 6120 

GCCGAGGTCA CCACGCTCGG CTGTGAAGTG ATCTACAATG GCTGTCCCAA AGCATTTGAG 6180 
A CGCAGGTGGA CGGCGCCAGG G 
GCTACACAGC ACACGGGCAC GCTCTTTGGC AATGACGTGC GCACGGCCTA CCAGCTGCTG 6480 
GGCCACGTCC TTCAGCACGA GAGCTGGCAG CAGGGCTTCG ACCTGGCAGC CACGCAGGAC 6540 
GCCGACTTTC ACGAGGACGT CATCCACTCG GGCAGCGCCC TCCTGGCCCC AGCCACCAGG 6600 
GCGGCGTGGG AGCAGATCCA GCGGAGCGAG GGCGGCACGG C 



ATCGTCACCG CCAACATGAT TCTTGCTGTC GACATCTTTG ACAAGTTCAA CTTTACGGGA 6780 
GCCAGGGTCC CGCGATTCGA CACCATCCAT GAAGAG1 
GTCTCCTTCC CAGCCGACTT CTTCAGACCA CCTGAAGAAA A 



GCCCCGATCA GCAGGCGGAG GCGACACCCT 

GTCATCATTT ACCGCACCCT GGGGCAGCTC CTGCCCGAGC GCTACGACCC 
AGCCTCCGGT TGCCTCACCG GCCCATCATT AATACCCCGA TGGTGAGCAC 
AGCGAGGGGG CTCCGCTCCC GAGACCCCTG GAGAGGCCCG 
CTGGAGGTGG AGGAGCGAAC CAAGCCTGTC TGCGTGTTCT 
GGTGGGACGG GAGGGTGGTC 
GTCGCCTGCC AGTGCAGCCA CACAGCCAGC 

GAGAACGGGG AGGTCCTGCC TCTGAAGATT GTCACCTATG CCGCTGTGTC 
T CGTCCTCCTG AGCCTGGTCC GCATGCTGCG 

T CGCCGTGGCG CTCTTCCTCT CTCAGCTGGT GTTCGTGATT 7560 

GGGATCAACC AGACGGAAAA CCCGTTTCTG TGCACAGTGG TTGCCATCCT CCTCCACTAC 7620 

ATCTACATGA GCACCTTTGC CTGGACCCTC GTGGAGAGCC TGCATGTCTA CCGCATGCTG 7680 

ACCGAGGTGC GCAACATCGA CACGGGGCCC ATGCGGTTCT ACTACGTCGT GGGCTGGGGC 7740 

ATCCCGGCCA TTGTCACAGG ACTGGCGGTC GGCCTGGACC CCCAGGGCTA CGGGAACCCC 7800 

GACTTCTGCT GGCTGTCGCT TCAAGACACC CTGATTTGGA GCTTTGCGGG GCCCATCGGA 7860 

GCTGTTATAA TCATCAACAC AGTCACTTCT GTCCTATCTG CAAAGGTTTC CTGCCAAAGA 7920 

AAGCACCATT ATTATGGGAA AAAAGGGATC GTCTCCCTGC TGAGGACCGC ATTCCTCCTG 7980 

CTGCTGCTCA TCAGCGCCAC CTGGCTGCTG GGGCTGCTGG CTGTGAACCG CGATGCACTG 8040 

AGCTTTCACT ACCTCTTCGC CATCTTCAGC GGCTTACAGG GCCCCTTCGT CCTCCTTTTC 8100 

CACTGCGTGC TCAACCAGGA GGTCCGGAAG CACCTGAAGG GCGTGCTCGG CGGGAGGAAG 8160 

CTGCACCTGG AGGACTCCGC CACCACCAGG GCCACCCTGC TGACGCGCTC CCTCAACTGC 8220 

AACACCACCT TCGGTGACGG GCCTGACATG CTGCGCACAG ACTTGGGCGA GTCCACCGCC 8280 

TCGCTGGACA GCATCGTCAG GGATGAAGGG ATCCAGAAGC TCGGCGTGTC CTCTGGGCTG 8340 

GTGAGGGGCA GCCACGGAGA GCCAGACGCG TCCCTCATGC CCAGGAGCTG CAAGGATCCC 8400 

CCTGGCCACG ATTCCGACTC AGATAGCGAG CTGTCCCTGG ATGAGCAGAG CAGCTCTTAC 8460 

GCCTCCTCAC ACTCGTCAGA CAGCGAGGAC GATGGGGTGG GAGCTGAGGA AAAATGGGAC 8520 

CCGGCCAGGG GCGCCGTCCA CAGCACCCCC AAAGGGGACG CTGTGGCCAA CCACGTTCCG 8580 

GCCGGCTGGC CCGACCAGAG CCTGGCTGAG AGTGACAGTG AGGACCCCAG CGGCAAGCCC 8640 

CGCCTGAAGG TGGAGACCAA GGTCAGCGTG GAGCTGCACC GCGAGGAGCA GGGCAGTCAC 8700 

CGTGGAGAGT ACCCCCCGGA CCAGGAGAGC GGGGGCGCAG CCAGGCTTGC TAGCAGCCAG 8760 

CCCCCAGAGC AGAGGAAAGG CATCTTGAAA AATAAAGTCA CCTACCCGCC GCCGCTGACG 8820 

CTGACGGAGC AGACGCTGAA GGGCCGGCTC CGGGAGAAGC TGGCCGACTG TGAGCAGAGC 8880 

CCCACATCCT CGCGCACGTC TTCCCTGGGC TCTGGCGGCC CCGACTGCGC CATCACAGTC 8940 

AAGAGCCCTG GGAGGGAGCC GGGGCGTGAC CACCTCAACG GGGTGGCCAT GAATGTGCGC 9000 

ACTGGGAGCG CCCAGGCCGA TGGCTCCGAC TCTGAGAAAC CGTGAGGCAA GCCCGTCACC 9060 

CCACACAGGC TGCGGCATCA CCCTCAGACC TTGGAGCCCA AGGGGCCACT GCCCTTGAAG 9120 

TGGAGTGGGC CCAGAGTGTG GCGGTCCCCA TGGTGGCAGC CCCCCGACTG ATCATCCAGA 9180 
CACAAAGGTC TTGGTTCTCC CAGGAGCTCA GGGCCTGTCA GACCTGGTGA C 
AGGCCACAGG CATGAGGGAG GCGTGGACCA CTGGGCCAGC A 
CAGTCAAAGC CAGAACTGAG AGGGGACCCC AGACTGGGCC C 

GGAACGCCGG GCACAGACCA AAGACCGCGG TCCAGCCCCG CCCAGGCGGG CATCTCATGG 9420 
CAGTGCGGAC C 

AATCACTTCG CTATGTGGGA AAGGTGGAGA T. 

GAGGTGCAAC CTGTATATAT ATTGCATTCG TGCTGACTTT GTTATCCCGA GAGATCCATG 9600 

CAATGATCTC TTGCTGTCTT CTCTGTCAAG ATTGCACAGT TGTACTTGAA TCTGGCATGT 9660 

GTTGACGAAA CIGGTGCCCC AGCAGATCAA AGGTGGGAAA TACGTCAGCA GTGGGGCTAA 9720 

AACCAAGCGG CTAGAAGCCC TACAGCTGCC TTCGGCCAGG AAGTGAGGAT GGTGTGGGCC 9780 

CTCCCCGCCG GCCCCCTGGG TCCCCAGTGT TCGCTGTGTG TGCGTTTGTC CTCTGCTGCC 9840 
ATCTGCCCCG GCTGTGTGAA TTCAAGACAG GGCAGTGCAG CACTAGGCAG G 
CCCTGCTGAG GTCACTGTGG GGCACGGTTG CCACACGGCT G 



C CAGGCCGTTA CATACCTCCT GCCAATTCTC 10140 
TAACTCTCTG AGACTGCGAG GATCTCCAGG CAGGGTTCTC CCCTCTGGAG TCTGACCAAT 10200 
TACTTCATTT TGCTTCAAAT GGCCAATTGT GCAGAGGGAC AAAGCCACAG CCACACTCTT 10260 
CAACGGTTAC CAAACTGTTT TTGGAAATTC ACACCAAGGT CGGGCCCACT GCAGGCAGCT 10320 
GGCACAGCGT GGCCCGAGGG GCTGTGGAAC GGGTCCCGGA ACTGTCAGAC ATGTTTGATT 10380 
TTAGCGTTTC CTTTGTTCTT CAAATCAGGT GCCCAAATAA GTGATCAGCA CAGCTGCTTC 10440 
CAAATAGGAG AAACCATAAA ATAGGATGAA AATCAAGTAA AATGCAAAGA TGTCCACACT 10500 
GTTTTAAACT TGACCCTGAT GAAAATGTGA GCACTGTTAG CAGATGCCTA TGGGAGAGGA 10560 
AAAGCGTATC TGAAAATGGT CCAGGACAGG AGGATGAAAT GAGATCCCAG AGTCCTCACA 10620 
CCTGAATGAA TTATACATGT GCCTTACCAG GTGAGTGGTC TTTCGAAGAT AAAAAACTCT 10680 
A AACGTTTGCC CCTGGCGTTT CCTAAGTACG AAAAGGTTTT TAAGTCTTCG 10740 
C TTTCATGACT TTAACAGGAT TCTGCCCCCT GAGGTG1AAT TTTTTTGTTC 10800 
TATTTTTTTC CACGTACTCC ACAGCCAACA TCACGAGGTG TAATTTTTAA TTTGATCAGA 10860 
ACTGTTACCA AAAAACAACT GTCAGTTTTA TTGAGATGGG AAAAATGTAA ACCTATTTTT 10920 
G ACTTTATGGG AGAGATTAGA CACTGGAGGT TTTTAACAGA ACGTGTATTT 10980 
C AAAACACTGG AATTACAAAT GAGAAGAGTC TACAATAAAT TAAGATTTTT 11040 
GAATTTGTAC TTCTGCGGTG CTGGTTTTTC TCCACAAACA CCCCCGCCCC TCCCCATGCC 11100 
CAGGGTGGCC GTGGAAGGGA CGGTTTACGG ACGTGCAGCT GAGCTGTCCG TGTCCCATGC 11160 
TCCCTCAGCC AGTGGAACGT GCCGGAACTT TTTGTCCATT CCCTAGTAGG CCTGCCACAG 11220 
CCTAGATGGG CAGTTTTTGT CTTTCACCAA ATTTGAGGAC TTTTTTTTTT TGCCATTATT 11280 
TCTTCAGTTT TCTTTTCTTG CACTGATCTT TCTCCTCTCC TTCTGTGACT CCAGTGACTC 11340 
AGACGTTAGA CCTCTTGATG TTTTCCCACT GGTCCCTGAG GCTCTGTTC 

A32 Protein Sequence 

Gene name: CH22_FGENES.706; cadherin, EGF LAG seven-pass G-type receptor 1, flamingo (Drosophila) homolog (CELSR1) 
Unigene number: Hs. 252387 

Protein Accession #: NP_055061 
Signal sequence: 1-20 

Cadherin domains: 250-344, 358-449, 464-556, 570-678, 692-780, 794-883, 897-990, 1004-1092, 1110-1199 

Pfam domains: Laminin_EGF [2003-2048], 7tm_2 [2465-2708] 

Latrophilin/GPS domains: 2407-2460 
Transmembrane domains: 1219-1238, 2473- 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MAPPPPPVLP VLLLLAAAAA LPAMGLRAAA WEPRVPGGTR AFALRPGCTY AVGAACTPRA 60 

PRELLDVGRD GRLAGRRRVS GAGRPLPLQV RLVARSAPTA LSRRLRARTH LPGCGARARL 120 

CGTGARLCGA LCFPVPGGCA AAQHSALAAP TTLPACRCPP RPRPRCPGRP ICLPPGGSVR 180 

LRLLCALRRA AGAVRVGLAL EAATAGTPSA SPSPSPPLPP NLPEARAGPA RRARRGTSGR 240 

GSLKFPMPNY QVALFENEPA GTLILQLHAH YTIEGEEERV SYYMEGLFDE RSRGYFRIDS 300 

ATGAVSTDSV LDRETKETHV LRVKAVDYST PPRSATTYIT VLVKDTNDHS PVFEQSEYRE 360 

RVRENLEVGY EVLTIRASDR DSPINANLRY RVLGGAWDVF QLNESSGVVS TRAVLDREEA 420 

AEYQLLVEAN DQGRNPGPLS ATATVYIEVE DENDNYPQFS EQNYVVQVPE DVGLNTAVLR 480 

VQATDRDQGQ NAAIHYSILS GNVAGQFYLH SLSGILDVIN PLDFEDVQKY SLSIKAQDGG 540 

RPPLIHSSGV VSVQVLDVND NEPIFVSSPF QATVLENVPL GYPVVHIQAV DADSGENARL 600 

HYRLVDTAST FLGGGSAGPK MPAPTPDFPF QIHNSSGHIT VCAELDREEV EHYSFGVEAV 660 

DHGSPPMSSS TSVSITVLDV NDNDPVFTQP TYELRLNEDA AVGSSVLTLQ ARDRDANSVI 720 

TYQLTGGNTR NRFALSSQRG GGLITLALPL DYKQEQQYVL AVTASDGTRS HTAHVLINVT 780 

DANTHRPVFQ SSHYTVSVSE DRPVGTSIAT LSANDEDTGE NARITYVIQD PVPQFRIDPD 840 

SGTMYTMMEL DYENQVAYTL TIMAQDNGIP QKSDTTTLEI LILDANDNAP QFLWDFYQGS 900 

IFEDAPPSTS ILQVSATDRD SGPNGRLLYT FQGGDDGDGD FYIEPTSGVI RTQRRLDREN 960 

VAVYNLWALA VDRGSPTPLS ASVEIQVTIL DINDKAPMFE KDELELFVEE NHPVGSVVAK 1020 

IRANDPDEGP NAQIMYQIVE GDMRHFFQLD LLNGDLRAMV SLDFEVRREY VLVVQATSAP 1080 

LVSRATVHIL LVDQNDNPPV LPDFQILFNN YVTNKSNSFP TGVIGCIPAH DPDVSDSLNY 1140 

TFVQGNELRL LLLDPATGEL QLSRDLDNNK PLEALMEVSV SDGIHSVTAF CTLRVTIITD 1200 

DMLTNSITVR LENMSQEKFL SPLLALFVEG VAAVLSTTKD DVFVFNVQND TDVSSNILKV 1260 

TFSALLPGGV RGQFFPSEDL QEQIYLNRTL LTTIETQRVL PFDDNICLRS PCENYMKCVS 1320 

VLRFDSSAPF LSSTTVLFRP IHPINGLRCR CPPGFTGDYC ETEIDLCYSD PCGANGRCRS 1380 

REGGYTCECF EDFTGEHCEV DARSGRCANG VCKNGGTCVN LLIGGFHCVC PPGEYERPYC 1440 

EVTTRSFPPQ SFVTFRGLRQ RFHFTISLTF ATQERNGLLL YNGRFNEKHD FIALEIVDEQ 1500 

VQLTFSAGET TTTVAPKVPS GVSDGRWHSV QVQYYNKPNI GHLGLPHGPS GEXMAWTVD 1560 

DCDTTMAVRF GKDIGNYSCA AQGTQTGSKK SLDLTGPLLL GGVPNLPEDF PVTMRQFVGC 1620 

MRNLSVDGKN VDMAGFIANN GTREGCAARR NFCDGRRCQN GGTCVNRWNM YLCECPLRFG 1680 

GKNCEQAMPH PQLFSGESVV SWSDLNIIIS VPWYLGLMFR TRKEDSVLME ATSGGPTSFR 1740 

LQILNNYLQF EVSHGPSDVE SVMLSGLRVT DGEWHHLLIE LKNVKEDSEM KHLVTHTLDY 1800 

GMDQNKRDIG GMLPGLTVRS VVVGGASEDK VSVRRGFRGC MQGVRKGGTP THVATLNMNM 1860 

ALKVRVKDGC DVDDPCTSSP CPPNSRCHDA WEDYSCVCDK GYLGINCVDA CHLNPCENMG 1920 

ACVRSPGSPQ GYVCECGPSH YGPYCENKLD LPCPRGWWGN PVCGPCHCAV SKGFDPDCNK 1980 

TNGQCQCKEN YYKLLAQDTC LPCDCFPHGS HSRTCDMATG QCACKPGVIG RQCNRCDNPF 2040 

AEVTTLGCEV IYNGCPKAFE AGIWWPQTKF GQPAAVPCPK GSVGNAVRHC SGEKGWLPPE 2100 

LFNCTTISFV DLRAMNEKLS RNETQVDGAR ALQLVRALRS ATQHTGTLFG NDVRTAYQLL 2160 

GHVLQHESWQ QGFDLAATQD ADFHEDVIHS GSALLAPATR AAWEQIQRSE GGTAQLLRRL 2220 

EGYFSNVARN VRRTYLRPFV IVTANMILAV DIFDKFNFTG ARVPRFDTIH EEFPRELESS 2280 

VSFPADFFRP PEEKEGPLLR PAGRRTTPQT TRPGPGTERE APISRRRRHP DDAGQFAVAL 2340 

VIIYRTLGQL LPERYDPDRR SLRLPHRPII NTPMVSTLVY SEGAPLPRPL ERPVLVEFAL 2400 

LEVEERTKPV CVFWNHSLAV GGTGGWSARG CELLSRNRTH VACQCSHTAS FAVLMDISRR 2460 

ENGEVLPLKI VTYAAVSLSL AALLVAFVLL SLVRMLRSNL HSIHKHLAVA LFLSQLVFVI 2520 

GINQTENPFL CTVVAILLHY IYMSTFAWTL VESLHVYRML TEVRNIDTGP MRFYYVVGWG 2580 

IPAIVTGLAV GLDPQGYGNP DFCWLSLQDT LIWSFAGPIG AVIIINTVTS VLSAKVSCQR 2640 

KHHYYGKKGI VSLLRTAFLL LLLISATWLL GLLAVNRDAL SFHYLFAIFS GLQGPFVLLF 2700 

HCVLNQEVRK HLKGVLGGRK LHLEDSATTR ATLLTRSLNC NTTFGEGPDM LRTDLGESTA 2760 

SLDSIVRDEG IQKLGVSSGL VRGSHGEPDA SLMPRSCKDP PGHDSDSDSE LSLDEQSSSY 2820 

ASSHSSDSED DGVGAEEKWD PARGAVHSTP KGDAVANHVP AGWPDQSLAE SDSEDPSGKP 2880 

RLKVETKVSV ELHREEQGSH RGEYPPDQES GGAARLASSQ PPEQRKGILK NKVTYPPPLT 2940 

LTEQTLKGRL REKLADCEQS PTSSRTSSLG SGGPDCAITV KSPGREPGRD HLMGVAMNVR 3000 

TGSAQADGSD SEKP 



A33 DNA SEQUENCE 

Gene name: CXCR3 (G protein-coupled receptor 9); chemokine (C-X-C) receptor 3 
CCCTCCTGGA GAACTTCAGC TCTTCCTATG ACTATGGAGA AAACGAGAGT GACTCGTGCT 180 
GTACCTCCCC GCCCTGCCCA CAGGACTTCA GCCTGAACTT CGACCGGGCC TTCCTGCCAG 240 
CCCTCTACAG CCTCCTCTTT CTGCTGGGGC TGCTGGGCAA CGGCGCGGTG GCAGCCGTGC 300 
TGCTGAGCCG GCGGACAGCC CTGAGCAGCA CCGACACCTT CCTGCTCCAC CTAGCTGTAG 360 
CAGACACGCT GCTGGTGCTG ACACTGCCGC TCTGGGCAGT GGACGCTGCC GTCCAGTGGG 420 
TCTTTGGCTC TGGCCTCTGC AAAGTGGCAG GTGCCCTCTT CAACATCAAC TTCTACGCAG 480 
GAGCCCTCCT GCTGGCCTGC ATCAGCTTTG ACCGCTACCT GAACATAGTT CATGCCACCC 540 
AGCTCTACCG CCGGGGGCCC CCGGCCCGCG TGACCCTCAC CTGCCTGGCT GTCTGGGGGC 600 
TCTGCCTGCT TTTCGCCCTC CCAGACTTCA TCTTCCTGTC GGCCCACCAC GACGAGCGCC 660 
TCAACGCCAC CCACTGCCAA TACAACTTCC CACAGGTGGG CCGCACGGCT CTGCGGGTGC 720 
TGCAGCTGGT GGCTGGCTTT CTGCTGCCCC TGCTGGTCAT GGCCTACTGC TATGCCCACA 780 
TCCTGGCCGT GCTGCTGGTT TCCAGGGGCC AGCGGCGCCT GCGGGCCATG CGGCTGGTGG 840 
TGGTGGTCGT GGTGGCCTTT GCCCTCTGCT GGACCCCCTA TCACCTGGTG GTGCTGGTGG 900 
ACATCCTCAT GGACCTGGGC GCTTTGGCCC GCAACTGTGG CCGAGAAAGC AGGGTAGACG 960 

TGGCCAAGTC GGTCACCTCA GGCCTGGGCT ACATGCACTG CTGCCTCAAC CCGCTGCTCT 1020 

ATGCCTTTGT AGGGGTCAAG TTCCGGGAGC GGATGTGGAT GCTGCTCTTG CGCCTGGGCT 1080 

CTGAGACCTC AGAGGCCTCC TACTCGGGCT TGTGAGGCCG GAATCCGGGC TCCCCTTTCG 1200 

CCCACAGTCT GACTTCCCCG CATTCCAGGC TCCTCCCTCC CTCTGCCGGC TCTGGCTCTC 1260 

CCCAATATCC TCGCTCCCGG GACTCACTGG CAGCCCCAGC ACCACCAGGT CTCCCGGGAA 1320 

GCCACCCTCC CAGCTCTGAG GACTGCACCA TTGCTGCTCC TTAGCTGCCA AGCCCCATCC 1380 

TGCCGCCCGA GGTGGCTGCC TGGAGCCCCA CTGCCCTTCT CATTTGGAAA CTAAAACTTC 1440 

ATCTTCCCCA AGTGCGGGGA GTACAAGGCA TGGCGTAGAG GGTGCTGCCC CATGAAGCCA 1500 

CAGCCCAGGC CTCCAGCTCA GCAGTGACTG TGGCCATGGT GCCCAAGACC TCTATATTTG 1560 

CTCTTTTATT TTTATGTCTA AAATCCTGCT TAAAACTTTT CAATAAACAA GATCGTCAGG 1620 

ACCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA BHCLITERAT 1680 

UREBHCBLAS TPBHCORTHO LOGUESMOUS ECXCRMYLEV SERQVLDASD FAFLLENSTS 1740 

PYDYGENESD PSDSPPCPQD FSLNFDRTFL PALYSLLFLL GLLGNGAVAA VLLSQRTALS 1800 

STDTFLLHLA VADVLLVLTL PLWAVDAAVQ WVFGPGLCKV AGALFNINFY AGAFLLACIS 1860 

FDRYLSIVHA TQIYRRDPRV RVALTCIWW GLCLLFALPD FIYLSANYDQ RLNATHCQYN 1920 

FPQVGRTALR VLQLVAGFLL PLLVMAYCYA HILAVLLVSR GQRRFRAMRL WVWAAFAV 1980 

CWTPYHLWL VDILMDVGVL ARNCGRESHV DVAKSVTSGM GYMHCCLNPL LYAFVGVKFR 2040 
EQMWMLFTRL GRSDQRGPQR QPSSSRRESS WSETTEASYL GL 



Unigene number: Hs. 198252 

Protein Accession #: P49682 

Signal sequence: none found 

Pfam domains: 7tm_1 [70-318] 



-coupled receptor 9) ;chemokine(C-X-C) receptor 



DRY box: 148-149 

Cellular Localization: plasma membrane 



SGLCKVAGAL FNINFYAGAL LLACISFDRY LNIVHATQLY RRGPPARVTL TCLAVWGLCL 
LFALPDFIFL SAHHDERLNA THCQYNFPQV GRTALRVLQL VAGFLLPLLV MAYCYAHILA 
VLLVSRGQRR LRAMRLVVVV VVAFALCWTP YHLVVLVDIL MDLGALARMC GRESRVDVAK 
SVTSGLGYMH CCLNPLLYAF VGVKFRERMW HLLLRLGCPN QRGLQRQPSS SRRDSSWSET 



A35 DNA SEQUENCE 

Gene name: Differentially expressed C 

Unigene number: Hs. 69517 

Probeset Accession #: 
Nucleic Acid Accession #: 



I I I 

G CGCGCTGACC CTCCCTGGGC ACCGCTGGGG A 

C GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG 

AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT AGAGTGTGGT GTCATGTTTG 180 

TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGCCATA 240 

CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGCTC 300 

CGCTGGTTGT GCAGCGATGG AGAGACCCAA GCCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360 

GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGGGGCC 420 

ACCTATCAAC XCATCAGTGT TCAAAGAATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480 

GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCICA GCCTGTCTTG 540 

AGCCACGGGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTCCA GACCGTTGTC 600

ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTG GTTTGACTTC CCAGGGTCTT 660 

GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720 

ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG GTGACAAGTT TTTCTCTTTG 780 

AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCTG 840 

CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG TACCACTCAT GGAGAGTATG 900 

TGCTGAGATG CTTCCGACCT TTCAGGTGAC GCAGGAACAC TGGGGGAGTC TGAATGATTG 960 

GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG GGGGCAGTGG GGCACACGTT 1020

AGGGCTGCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG CTGCTTTTCC TCAACCTTTC 1080 
CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTG AAGAAACTTA GACTTCACCC
ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA
ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTGTT AAGATGCAGC 



A36 Protein



Probeset Accession #:
Protein Accession #: 
Signal sequence: 
Transmembrane domain: 
Cellular localization: 




I I 

V ALPRVWTDAN LTARQRDPED SQRTDEGDNR V 
KWTEPYCVIA AVKIFPRFFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF FYLKCCKIRY 
CNLEGPPINS SVPKEYAGSM GESCGGLWLA I 



Coding sequence: 38-1075 (underlined sequence corresponds t



I I I I I I 

AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGC GGCCGCTGTG
CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC G 
GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGJ 
GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG A 
GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGC 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACTGCACAGC 360

AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 480

AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660 

GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720 

C ACCTACAACG T CAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780 

3 GAAGTAGGCT CCCCACCCTC CTACT CCGAG GCCTTGCTGG ACCAGAGGCC 840 

TGCGTGGTAT GACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080

AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGCA TTTGGGTTAG 1500

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860 

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 
GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA C 
AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT A 
TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC T 
AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA A 

„ CCCGCTGAGC CCCGTGATfiA CAAGTCACTC CAGACTAACC TGTGTGCCAG 



AGAGGGACT C CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT T 
TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAS 

CCACTCCGGG CAGCTGTCAC CCATT CAGAA CTTCTTTCCG CAGCTGAAGA A 
AACCTGTTTG ACGCTAATTA AAACAGAGCC TGCAGGAAGT GGGGCTAAAG T 
TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAACGA TGTTACAGAA
AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 
AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGITAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAACC GGCAGGGTTT 3420

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTIATTTTA GCATTAAATT 3600

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 

A38 Protein sequence:



Probeset Accession #: 
Protein Accession #:

Transmembrane domains:
LDLa domains:
Cellular Localization:



1 11 21 31 41 51 

I I I I | | 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGKFMCSNG RCIPGAWQCD GLPDCPDKSD 60 

EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 

KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180 

VIFVLVVALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL VVLDHPHHQT VTYNVNNGIQ 240

YVASQAEQNA SEVGSPPSYS EALLDQRPAK YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
NSASSQAASS LLSVEDTEHS PGQPGPQEGT AEPRDSSPSQ GTEEV 

A39 DNA SEQUENCE

Gene name: ESTs 
Unigene number: Hs.128899

Probeset Accession #: AA983251

Nucleic Acid Accession #: AA983251

Coding sequence: 1-1749 (underlined sequences correspond to star 

1 11 21 31 41 51 

I I I | | | 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 

CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 

CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

TCCGGACGAC AGCCCCGGGG GCCTTCIGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 

GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 

CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 

CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 

TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 

TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960

CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGCC TGCTGCCCCG TGCCAGACGC 1140

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGIGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGCAGACCAA GGCAGGGGTG 1800

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580



GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTAGTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

" ATTTTACTTA TTAATAAATA TTTTATGATG 3 660 



Signal sequence: 
Transmembrane domains: 
Cellular localization: 



1 11 21 31 41 51 

I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRSRESRPEA AGLLWDRAAA 60

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120

SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 

SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVEFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVHMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

A41 DNA SEQUENCE: 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs. 100686 
Probeset Accession #: AA487468 
Nucleic Acid Accession #: AA487468 

Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)

I I I I I I

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60

3 CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120

h AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA A 



Gene name: ESTs, Weakly s 

Unigene number: Hs. 100686 



Signal" 
Cellular Localization 
MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYSE GLFYAQKSKK 60
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 



A43 DNA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1

Nucleic Acid Accession #: AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



I I I I I 

G TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960 

ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020 

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080

GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA TGCTGTAATT 1200

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 

ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 1320

GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CAC03AACAC AGCAGGACAG ACCAGAGATT 1380 

TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 1440 

GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500

ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560

ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620

ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800 

TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTCGCTACA GAAGAACGGT 1860 

TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920

ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 

AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100



ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 
GATGTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 
CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 
GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 
TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 
GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 
CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG
GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 
AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 
CTTGAAGTAT GA

A44 Protein sequence: 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 



I I I I I I 

MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI

SQFKNPLIML LLASAVISVL MHQFDDAVSI TVAILIVVTV AFVQEYRSEK SLESLSKLVP

PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDES SLTGETTPCS

KVTAPQPAAT NGDLASRSNI AFMGTLVRCG KAKGVVIGTG ENSEFGEVFK MMQAEEAPKT

PLQKSHDLLG KQLSFYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIVVTV
TLRLGVMRMV KKEAIVKKLP IVETLGCCNV ICSDKTGTLT KNEMTVTEIF TSDGLHAEVT 360

GVGYNQFGEV IVDGDVVHGF YNPAVSRIVE AGCVCNDAVI RNNTLMGKPT EGALIALAMK 420

MGLDGLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK 480

GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIID PPRTGVKEAV 540

TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 600

FYRASPRHKM KIIKSLQKNG SVVAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660

LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMKFP NPLNAMQILW 720

INIIMDGPPA QSLGVEPVDK DVIRKPPKNW KDSILTKNLI LKILVSSIII VCGTLFVFWR 780

ELRDNVITPR DTTMTFTCFV FFDMFNALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840

LVIYFPPLQK VFQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSSSF 900






Unigene number: Hs.157601
Probeset Accession #: W07459
Nucleic Acid Accession #: AC005383

Coding sequence: 328-2751 (underlined sequences correspond to

t i 1 i 1 r t r 

™^ TCTTC GCGGCTGCAC CGCTOGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60 

TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC TATGCACCTC 120 

GTTCCTCCGA CCTCAGCCGG GTCGGGTCGT GCCGCCCTCI CCCAgS III 

GTCCCACGTG GCAGCCGCGC CCCGGGCGCC CCTCCTGTGA TCCCGTAGCG 240 

CCCCCTGGCC cgagccgcgc ccgggtctgt GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300 

TCGCCGCTCT CCTTCCGTTA TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360

GTTTTCCTGT TTTCCAGAGT GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 480 

ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540 

CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600

GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720 

CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840 

CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900 

GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGCAGGTG 960

GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020

^ C ^ ACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGACGCTGGA GATGGTCCGG 10 80 

GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200

AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTGTT 1260

5 A 5 AAGGAC TGGACGGCIA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 13 2 0 

TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380

G ^^ CA C^^ACGG CTTCCTGCGG GCCAAAGTCT TCGTG.SgcG G^T^TGC^ 1440 

GCCGTGCTGA GCGAGGACTC TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500

TGG «GTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560 

50 ^^ CCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620 

S G S IT0G aaB0C0OC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 

CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCCAG CGCGTCACGC AAGGGCGCGA 1740 

GAGCTGCICC TGCTGGGTGT AGG CAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800 

GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860 

GAG f^ GCAGG GOAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920

CTCGTCTICA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AG "™ TGA ^CTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100

fSSSS^ TGCGGGG CAT TAGCCAGGCC CCCTACCTAG GTGGGGTGGG CTCAGCCGGC 2160 

ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280

1^.^t GC TGAGGAACAA TGGCATCTCT GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA 2340 

AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400

GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

M CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520 

GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC CCCACTGCGA GAACCGTGAG 2580

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAkSc 2640 

ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700 

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCCTTA GAATGTCTGC 2760

TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820

A ™^° CTTA GAGACAAGAA AGCAGCTGAT GICACCCACA AACGATGTTG TTGAAAAGTT 2880 

llnnl^™ AGTAAATACC CACTTTCTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2 940 

rl^S TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGCGGCCTGA 300 0 

^o^Z 10 CACACASTCA ATGCTCGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

75 r^llZ?. TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120 

3 ° CAGC ! TTTC aC^CCCCA GAGACATTCT GGATGCATTT GCATTGAGIC TGAAAGC-GGG 3180 

rJZr^ AC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTOTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300 

I^ G f ATGGG CCCRGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTC-TCC 33 SO 
80 "CCTTGAAGG TCTTC 

A46 Protein sequence 
Unigene number: Hs.157601

Signal sequence: 1-17 

Transmembrane domains: none found 

VGW domains: 49-223; 341-518; 529-706 

EGF domains: 298-333; 715-748 

Cellular Localization: secreted



I I I I I I 

MPPFLLLEAV CVPLPSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 
SVGKGSFEES KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLS3SAICSS ATPDCRVEAH 
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY Q.DVPDLVWSL DGIPFRGGPT 
LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAWVLT GGRGAEDAAV PAQKLRNNGI 
SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCEKR EWSSCSVCVS QGKILETPLR HMAPVQEGSS 
RTPPSNYREG LGTEMVPTFW NVCAPGP 



A47 DNA SEQUENCE : 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Unigene number: Hs. 100686 

Probeset Accession #: AA487468

Nucleic Acid Accession #: AA487468

Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)



CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 
CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 
GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 
ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 
ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 
CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 
GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 
GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 
TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 
CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 
GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 
TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 
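
The coding-sequence coordinates given with each listing (for example, 55-555 for A47) are 1-based and inclusive, and the accompanying protein record is the translation of that range. As a minimal sketch, assuming the standard genetic code and assuming the listing text has been transcribed without gaps, the following Python fragment translates the first 120 nucleotides of A47 starting at position 55. The helper names `clean` and `translate_cds` are illustrative only and are not part of the application.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(listing: str) -> str:
    """Drop spaces, line breaks and position counters; keep only letters."""
    return "".join(ch for ch in listing.upper() if ch.isalpha())


def translate_cds(listing: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive range start..end, stopping at a stop codon."""
    cds = clean(listing)[start - 1:end]
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # "X" marks a codon that is not in the table (e.g. an ambiguity code).
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two rows (positions 1-120) of the A47 listing.
a47_head = """
CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG
CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT
"""
print(translate_cds(a47_head, 55, 120))  # MMLHSALGLCLLLVTVSSNLAI
```

The printed residues agree with the opening of the A48 protein record, which is one way to check that a transcribed listing and its stated coding range are consistent with each other.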



A48 Protein sequence: 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H.e 

Unigene number: Hs. 100686 

Probeset Accession #: AA487468

Protein Accession #: none found

Signal sequence: 1-23 



Cellular Localization:



MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIHLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 



ein-coupled receptor 



ined sequences correspond to start and stop codons) 



CGGCAGCAGG GTCTCGCTCT GTCACACAGG CTGGAGTGCA GTGGTGTGAT CTTGGCTCAT 60
CGTAACCTCC ACCTCCCGGG TTCAAGTGAT TCTCATGCCT CAGCCTCCCG AGTAGCTGGG 120
ATTACAGGTG GTGACTTCCA AGAGTGACTC CGTCGGAGGA AAATGACTCC CCAGTCGCTG 180 






WO 03/042661 



PCT/US02/36810 



CTGCAGACGA CACTGTTCCT GCTGAGTCTG CTCTTCCTGG TCCAAGGTGC CCACGGCAGG 240 

GGCCACAGGG AAGACTTTCG CTTCTGCAGC CAGCGGAACC AGACACACAG GAGCAGCCTC 300

CACTACAAAC CCACACCAGA CCTGCGCATC TCCATCGAGA ACTCCGAAGA GGCCCTCACA 360

GTCCATGCCC CTTTCCCTGC AGCCCACCCT GCTTCCCGAT CCTTCCCTGA CCCCAGGGGC 420
CTCTACCACT TCTGCCTCTA CTGGAACCGA CATGCTGGGA G.



AGCCAGTTCC TGAAGCATCC CCAGAAGGCC TCAAGGAGGC CCTCGGCTGC CCCCGCCAGC 780 

CAGCAGTTGC AGAGCCTGGA GTCGAAACTG ACCTCTGTGA GATTCATGGG GGACATGGTG 840 

TCCTTCGAGG AGGACCGGAT CAACGCCACG GTATGGAAGC TCCAGCCCAC AGCCGGCCTC 900 

CAGGACCTGC ACATCCACTC CCGGCAGGAG GAGGAGCAGA GCGAGATCAT GGAGTACTCG 960 

GTGCTGCTGC CTCGAACACT CTTCCAGAGG ACGAAAGGCC GGAGCGGGGA CGCTGAGAAG 1020

AGACTCCTCC TGGTGGACTT CAGCAGCCAA GCCCTGTTCC AGGACAAGAA TTCCAGCCAA 1080 

GTCCTGGGTG AGAAGGTCTT GGGGATTGTG GTACAGAACA CCAAAGTAGC CAACCTCACG 1140 

GAGCCCGTGG TGCTCACTTT CCAGCACCAG CTACAGCCGA AGAATGTGAC TCTGCAATGT 1200

GTGTTCTGGG TTGAAGACCC CACATTGAGC AGCCCGGGGC ATTGGAGCAG TGCTGGGTGT 1260

GAGACCGTCA GGAGAGAAAC CCAAACATCC TGCTTCTGCA ACCACTTGAC CTACTTTGCA 1320 

GTGCTGATGG TCTCCTCGGT GGAGGTGGAC GCCGTGCACA AGCACTACCT GAGCCTCCTC 1380 

TCCTACGTGG GCTGTGTCGT CTCTGCCCTG GCCTGCCTTG TCACCATTGC CGCCTACCTC 1440 

TGCTCCAGGG TGCCCCTGCC GTGCAGGAGG AAACCTCGGG ACTACACCAT CAAGGTGCAC 1500

ATGAACCTGC TGCTGGCCGT CTTCCTGCTG GACACGAGCT TCCTGCTCAG CGAGCCGGTG 1560 

GCCCTGACAG GCTCTGAGGC TGGCTGCCGA GCCAGTGCCA TCTTCCTGCA CTTCTCCCTG 1620

CTCACCTGCC TTTCCTGGAT GGGCCTCGAG GGGTACAACC TCTACCGACT CGTGGTGGAG 1680 

GTCTTTGGCA CCTATGTCCC TGGCTACCTA CTCAAGCTGA GCGCCATGGG CTGGGGCTTC 1740 

CCCATCTTTC TGGTGACGCT GGTGGCCCTG GTGGATGTGG ACAACTATGG CCCCATCATC 1800 

TTGGCTGTGC ATAGGACTCC AGAGGGCGTC ATCTACCCTT CCATGTGCTG GATCCGGGAC 1860 

TCCCTGGTCA GCTACATCAC CAACCTGGGC CTCTTCAGCC TGGTGTTTCT GTTCAACATG 1920 

GCCATGCTAG CCACCATGGT GGTGCAGATC CTGGGGCTGC GCCCCCACAC CCAAAAGTGG 1980 

TCACATGTGC TGACACTGCT GGGCCTCAGC CTGGTCCTTG GCCTGCCCTG GGCCTTGATC 2040

TTCTTCTCCT TTGCTTCTGG CACCTTCCAG CTTGTCGTCC TCTACCTTTT CAGCATCATC 2100

ACCTCCTTCC AAGGCTTCCT CATCTTCATC TGGTACTGGT CCATGCGGCT GCAGGCCCGG 2160 

GGTGGCCCCT CCCCTCTGAA GAGCAACTCA GACTGCGCCA GGCTCCCCAT CAGCTCGGGC 2220 

AGCACCTCGT CCAGCCGCAT CTAGGCCTCC AGCCCACCTG CCCATGTGAT GAAGCAGAGA 2280

TGCGGCCTCG TCGCACACTG CCTGTGGCCC CCGAGCCAGG CCCAGCCCCA GGCCAGTCAG 2340 

CCGCAGACTT TGGAAAGCCC AACGACCATG GAGAGATGGG CCGTTGCCAT GGTGGACGGA 2400 

CTCCCGGGGC TGGGGCTTTT GAATTGGCCT TGGGGACTAC TCGGCTCTCA CTCAGCTCCC 2460 

ACGGGACTCA GAAGTGCGCC GCCATGCTGC CTAGGGTACT GTCCCCACAT CTGTCCCAAC 2520

CCAGCTGGAG GCCTGGTCTC TCCTTACAAC CCCTGGGCCC AGCCTCATTG CTGGGGGCCA 2580

GGCCTTGGAT CTTGAGGGTC TGGCACATCC TTAATCCTGT GCCCCTGCCT GGGACAGAAA 2640 

TGTGGCTCCA GTTGCTCTGT CTCTCGTGGT CACCCTGAGG GCACTCTGCA TCCTCTGTCA 2700 
TTTTAACCTC A© 



A50 Protein sequence 
Gene name: 
Unigene number: 



G protein-coupled receptor 
NM_005682.1 



7-539, 575-597, 608-630, 638-659 



I I I I I I 

MTPQSLLQTT LFLLSLLFLV QGAHGRGHRE DFRFCSQRNQ THRSSH 
SEEALTVHAP FPAAHPASRS FPDPRGLYHF CLYWNRHAGR LHLLYGKRDF LLSDKASSLL 
CFOHQEESLA QGPPLLATSV TSWWSPQNIS LPSAASFTFS FKSPPETAAH NASVDMCELK 
RDLQLLSQFL KHPQKASRRP SAAPASQQLQ SLESKLTSVR FMGDMVSFEE DRINATVWKL 
QPTAGLQDLH IHSRQEEEQS EIMEYSVLLP RTLFQRTKGR SGEAEKRLLL VDFSSQALFQ 
DKNSSQVLGE KVLGIVVQNT KVANLTEPVV LTFQHQLQPK NVTLQCVFWV EDPTLSSPGH
WSSAGCETVR RETQTSCFCN HLTYFAVLMV SSVEVDAVHK HYLSLLSYVG CVVSALACLV
TIAAYLCSRV PLPCRRKPRD YTIKVHMNLL LAVFLLDTSF LLSEPVALTG SEAGCRASAI
FLHFSLLTCL SWMGLEGYNL YRLVVEVFGT YVPGYLLKLS AMGWGFPIFL VTLVALVDVD
NYGPIILAVH RTPEGVIYPS MCWIRDSLVS YITHLGLFSL VFLFNMAMLA TMVVQILRLR
PHTQKWSHVL TLLGLSLVLG LPWALIFFSF ASGTFQLVVL YLFSIITSFQ GFLIFIWYWS
MRLQARGGPS PLKSNSDCAR LPISSGSTSS SRI



Gene name: Hypothetical protein FLJ20063 

Unigene number: Hs.5940 

Probeset Accession #: AA053660 
Nucleic Acid Accession 



CCCCCATGAC TTTGTACAGC TACTTCACTG CTTCCCCCCA ATTAGTACAC ATAGTTCCTC 



start and stop codons) 






CACAATTCCT ACACCTGCTC CCCCCATAAT CAGTACACAT AGTTCCTCCA CAATTCCTAT 120 

ACCTACTGCT GCAGACAGTG AGTCAACCAC AAATGTAAAT TCATTAGCTA CCTCTGACAT 180 

AATCACCGCT TCATCTCCAA ATGATGGATT AATCACAATG GTTCCTTCTG AAACACAAAG 240 

TAACAATGAA ATGTCCCCCA CCACAGAAGA CAATCAATCA TCAGGGCCTC CCACTGGCAC 300

CGCTTTATTG GAGACCAGCA CCCTAAACAG CACAGGTCCC AGCAATCCTT GCCAAGATGA 360

TCCCTGTGCA GATAATTCGT TATGTGTTAA GCTGCATAAT ACAAGTTTTT GCCTGTGTTT 420 

AGAAGGGTAT TACTACAACT CTTCTACATG TAAGAAAGGA AAGGTATTCC CTGGGAAGAT 480

TTCAGTGACA GTATCAGAAA CATTTGACCC AGAAGAGAAA CATTCCATGG CCTATCAAGA 540 

CTTGCATAGT GAAATTACTA GCTTGTTTAA AGATGTATTT GGCACATCTG TTTATGGACA 600 

GACTGTAATT CTTACTGTAA GCACATCTCT GTCACCAAGA TCTGAAATGC GTGCTGATGA 660

CAAGTTTGTT GATGTAACAA TAGTAACAAT TTTGGCAGAA ACCACAAGTG ACAATGAGAA 720 

GACTGTGACT GAGAAAATTA ATAAAGCAAT TAGAAGTAGC TCAAGCAACT TTCTAAACTA 780 

TGATTTGACC CTTCGGTGTG ATTATTATGG CTGTAACCAG ACTGCGGATG ACTGCCTCAA 840 

TGGTTTAGCA TGCGATTGCA AATCTGACCT GCAAAGGCCT AACCCACAGA GCCCTTTCTG 900 

CGTTGCTTCC AGTCTCAAGT GTCCTGATGC CTGCAACGCA CAGCACAAGC AATGCTTAAT 960

AAAGAAGAGT GGTGGGGCCC CTGAGTGTGC GTGCGTGCCC GGCTACCAGG AAGATGCTAA 1020 

TGGGAACTGC CAAAAGTGTG CATTTGGCTA CAGTGGACTC GACTGTAAGG ACAAATTTCA 1080 

GCTGATCCTC ACTATTGTGG GCACCATCGC TGGCATTGTC ATTCTCAGCA TGATAATTGC 1140 

ATTGATTGTC ACAGCAAGAT CAAATAACAA AACGAAGCAT ATTGAAGAAG AGAACTTGAT 1200 

TGACGAAGAC TTTCAAAATC TAAAACTGCG GTCGACAGGC TTCACCAATC TTGGAGCAGA 1260 

AGGGAGCGTC TTTCCTAAGG TCAGGATAAC GGCCTCCAGA GACAGCCAGA TGCAAAATCC 1320 

CTATTCAAGA CACAGCAGCA TGCCCCGCCC TGACTATTAG AATCATAAGA ATGTGGAACC 1380 

CGCCATGGCC CCCAACCAAT GTACAAGCTA TTATTTAGAG TGTTTAGAAA GACTGATGGA 1440

GAAGTGAGCA CCAGTAAAGA TCTGGCCTCC GGGGTTTTTC TTCCATCTGA CATCTGCCAG 1500

CCTCTCTGAA TGGAAGTTGT GAATGTTTGC AACGAATCCA GCTCACTTGC TAAATAAGAA 1560

TCTATGACAT TAAATGTAGT AGATGCTATT AGCGCTTGTC AGAGAGGTGG TTTTCTTCAA 1620

TCAGTACAAA GTACTGAGAC AATGGTTAGG GTTGTTTTCT TAATTCTTTT CCTGGTAGGG 1680

CAACAAGAAC CATTTCCAAT CTAGAGGAAA GCTCCCCAGC ATTGCTTGCT CCTGGGCAAA 1740 

CATTGCTCTT GAGTTAAGTG ACCTAATTCC CCTGGGAGAC ATACGCATCA ACTGTGGAGG 1800 

TCCGAGGGGA TGAGAAGGGA TACCCACCAT CTTTCAAGGG TCACAAGCTC ACTCTCTGAC 1860 

AAGTCAGAAT AGGGACACTG CTTCTATCCC TCCAATGGAG AGATTCTGGC AACCTTTGAA 1920 

CAGCCCAGAG CTTGCAACCT AGCCTCACCC AAGAAGACTG GAAAGAGACA TATCTCTCAG 1980 

CTTTTTCAGG AGGCGTGCCT GGGAATCCAG GAACTTTTTG ATGCTAATTA GAAGGCCTGG 2040

ACTAAAAATG TCCACTATGG GGTGCACTCT ACAGTTTTTG AAATGCTAGG AGGCAGAAGG 2100 

GGCAGAGAGT AAAAAACATG ACCTGGTAGA AGGAAGAGAG GCAAAGGAAA CTGGGTGGGG 2160 

AGGATCAATT AGAGAGGAGG CACCTGGGAT CCACCTTCTT CCTTAGGTCC CCTCCTCCAT 2220

CAGCAAAGGA GCACTTCTCT AATCATGCCC TCCCGAAGAC TGGCTGGGAG AAGGTTTAAA 2280 

AACAAAAAAT CCAGGAGTAA GAGCCTTAGG TCAGTTTGAA ATTGGAGACA AACTGTCTGG 2340 

CAAAGGGTGC GAGAGGGAGC TTGTGCTCAG GAGTCCAGCC GTCCAGCCTC GGGGTGTAGG 2400 

TTTCTGAGGT GTGCCATTGG GGCCTCAGCC TTCTCTGGTG ACAGAGGCTC AGCTGTGGCC 2460

ACCAACACAC AACCACACAC ACACAACCAC ACACACAAAT GGGGGCAACC ACATCCAGTA 2520

CAAGCTTTTA CAAATGTTAT TAGTGTCCTT TTTTATTTCT AATGCCTTGT CCTCTTAAAA 2580 

GTTATTTTAT TTGTTATTAT TATTTGTTCT TGACTGTTAA TTGTGAATGG TAATGCAATA 2640 
AAGTGCCTTT GTTAGATGGT GAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 



A52 Protein sequence



Unigene 

Probeset Accession #: AA0S3660 
Protein Accession #: none found

Signal 



Hypothetical protein FLJ20063 



SEA domain: 

Cellular Localizations 



I I I I | | 

MVPSETQSNN EMSPTTEDNQ SSGPPTGTAL LETSTWSTG PSNPCQDDPC ADK 

NTSFCLCLEG YYYNSSTCKK GKVFPGKISV TVSETFDPEE KHSMAYQDLH SEITSLFKDV 
FGTSVYGQTV ILTVSTSLSP RSEMRADDKF VDVTIVTILA ETTSDNEKTV TEKINKAIRS 
SSSNFLNYDL TLRCDYYGCN QTADDCLNGL ACDCKSDLQR PNPQSPFCVA SSLKCPDACN
AQHKQCLIKK SGGAPECACV PGYQEDANGN CQKCAFGYSG LDCKDKFQLI LTIVGTIAGI 
VILSMTIALI VTARSNNKTK HIEEENLIDE DFQNLKLRST GFTNLGAEGS VFPKVRITAS 
RDSQMQNPYS RHSSMPRPDY 

A53 DNA SEQUENCE 

Gene name: TMPRSS3a 
Unigene number: Hs.298241

Probeset Accession #: AI538613
Nucleic Acid Accession #:

Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)



I I I | I | 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC






CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300

GTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 
GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 
GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440
AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 
TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 
ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 
TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 
TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 
CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG C

AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGCGATTCTC 1860 
TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920

TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100

AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340

ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 



Probeset Accession 
Protein Accession 

Transmembrane doma 
Tryp_SPc domain: 




MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA
AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT

A55 DNA SEQUENCE

Gene name: Putative G protein-coupled receptor GPCR150
Unigene number: Hs.97101

Probeset Accession #: AA215333
Nucleic Acid Accession #: NM_014373
Coding sequence: 322-1338 (underlined sequences correspond to start and stop codons)



GTGGCCTCGA GGTGGTGGCA GGGCCGCCCC CTGCAGTCCG GAGACGAACG CACGGACCGG 
GCCTCCGGAG GCAGGTTCGG CTGGAAGGAA CCGCTCTCGC TTCGTCCTAC ACTTGCGCAA 
ATGTCTCCGA GCTTACTCAC ATAGCATATT GGTATATCAA AATGAAATGC AAGGAACCAA 
AAATAACATA ATTGAAGGCA GTAAAAGTGA AATTAAATAG GAAGATCATC AGTCAAGGAA 
GACCCACTGG AGAGGACAGA AAATGAAGCA GTGTTTTATC ATGTGTATTT CAGCAGGTCT 
TCTTGAAATT TAACTAAAAA TATGACTGCT CTCTCTTCAG AGAACTGCTC TTTTCAGTAC 
CAGTTACGTC AAACAAACCA GCCCCTAGAC GTTAACTATC TGCTATTCTT GATCATACTT
GGGAAAATAT TATTAAATAT CCTTACACTA GGAATGAGAA GAAAAAACAC CTGTCAAAAT 
TTTATGGAAT ATTTTTGCAT TTCACTAGCA TTCGTTGATC TTTTACTTTT GGTAAACATT 




TCCATTATAT TGTATTTCAG GGATTTTGTA CTTTTAAGCA TTAGGTTCAC TAAATACCAC 
ATCTGCCTAT TTACTCAAAT TATTTCCTTT ACTTATGGCT TTTTGCATTA TCCAGTTTTC 
CTGACAGCTT GTATAGATTA TTGCCTGAAT TTCTCTAAAA CAACCAAGCT TTCATTTAAG 
TGTCAAAAAT TATTTTATTT CTTTACAGTA ATTTTAATTT GGATTTCAGT CCTTGCTTAT 






GTTTTGGGAG ACCCAGCCAT CTACCAAAGC CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

TGTCCTTTCT ATGTCAGCAT TCAGAGTTAC TGGCTGTCAT TTTTCATGGT GATGATTTTA 900 

TTTGTAGCTT TCATAACCTG TTGGGAAGAA GTTACTACTT TGGTACAGGC TATCAGGATA 960

ACTTCCTATA TGAATGAAAC TATCTTATAT TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

GTGAGATCTA AAAAAATATT CTTATCCAAG CTCATTGTCT GTTTTCTCAG TACCTGGTTA 1080 

CCATTTGTAC TACTTCAGGT AATCATTGTT TTACTTAAAG TTCAGATTCC AGCATATATT 1140 
GAGATGAATA TTCCCTGGTT ATACTTTGTC AATAGTTTTC TCATTGC 
TTTAATTGTC ACAAGCTTAA TTTAAAAGAC ATTGGATTAC CTTTGGATCC A
TGGAAGTGCT GCTTCATTCC ACTTACAATT CCTAATCTTG AGCAAATTGA A 
TCAATAATGA TTTGTTAATA TTATTAATTA AAAGTTACAG CTGTCA1 

TATGAACAGA AAGAACTCAG GACATATTAA AAAATAAACT GAACTAAAAC AACTTTTGCC 1440 

CCCTGACTGA TAGCATTTCA GAATGTGTCT TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500 

GTTTTATTTT AAAAACAAAA TAATTCCAAG AAGTTTTTAT AGTTATTCAG GGACACTATA 1560

TTACAAATAT TACTTTGTTA TTAACACAAA AAGTGATAAG AGTTAACATT TGGCTATACT 1620 

GATGTTTGTG TTACTCAAAA AAACTACTGG ATGCAAACTG TTATGTAAAT CTGAGATTTC 1680 

ACTGACAACT TTAAGATATC AACCTAAACA TTTTTATTAA ATGTTCAAAT GTAAGCAAGA 1740 



Gene name: Putative G protein-coupled receptor GPCR150 

Unigene number: Hs.97101

Protein Accession #: NP_055188

Signal sequence: none found

Transmembrane domains: 23-45, 59-81, 97-119, 138-160, 184-206, 241-2 

Cellular Localization: plasma membrane 



11 21 31 41 51 

I I I I I 

MTALSSENCS FQYQLRQTNQ PLDVNYLLFL IILGKILLNI LTLGMRRKNT CQNFMEYFCI
SLAFVDLLLL VNISIILYFR DFVLLSIRFT KYHICLFTQI ISFTYGFLHY PVFLTACIDY 
CLNFSKTTKL SFKCQKLFYF FTVILIWISV LAYVLGDPAI YQSLKAQNAY SRHCPFYVSI 
QSYWLSFFMV MILFVAFITC WEEVTTLVQA IRITSYMNET ILYFPFSSHS SYTVRSKKIF 
LSKLIVCFLS TWLPFVLLQV IIVLLKVQIP AYIEMNIPWL YFVNSFLIAT V 
LKDIGLPLDP FVNWKCCFIP LTIPNLEQIE KPISIMIC 



) chain precursor [H. sapiens]



I I I I I I 

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 
CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 
GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 
CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG
CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 
TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 
ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 
CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 
GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 
GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC
AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG
GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG
GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG
ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG
TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG
GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 
3 GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 
3 TGGCTGTAGC TCGGACGGAC GGAAGTAGAT G 
GTAAGCGGGG G 

K3GGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 
3 TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT G 
TTGCCCCACG T 

: AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 
T GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 
GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500
GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 
GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 
GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 
TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 
CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 
ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 
CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 
TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 
GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 
ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 
AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 
GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220
TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580

GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TOCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGCGTGG TGGTTCACGC 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 

A58 DNA sequence

Unigene number: Hs.157601

Probeset Accession #: W07459
Nucleic Acid Accession #: AC00S383

Coding Sequence: 328-2751 (underlined sequences correspond to start and stop codons)

i i 1 r f i 1 i 1 

GACAGTGTTC GCGGCTGCAC CGCTCGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60 

TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC TATGCACCTC 120

CCTGGCGGTA GTTCCTCCGA CCTCAGCCGG GTCGGGTCGT GCCGCCCTCT CCCAGGAGAG 180 

ACAAACAGGT GTCCCACGTG GCAGCCGCGC CCCGGGCGCC CCTCCTGTGA TCCCGTAGCG 240 

CCCCCTGGCC CGAGCCGCGC CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300 

TCGCCGCTCT CCTTCCGTTA TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360

GTTTTCCTGT TTTCCAGAGT GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 480

ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540 

CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600 

GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720

CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840

CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900

GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGCAGGTG 960 

GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020 

ACGCCAGACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGACGCTGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140 

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200 

AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTGTT 1260 

CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320 

TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380 

GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440 

GCCGTGCTGA GCGAGGACTC TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500 

CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560 

GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620

CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 

CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCCAG CGCGTCACGC AAGGGCGCGA 1740 

GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800 

GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860 

GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920

CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100

GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG GTGGGGTGGG CTCAGCCGGC 2160 

ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280 

GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA 2340 

AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400 

GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520 

GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC CCCACTGCGA GAACCGTGAG 2580 

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640 

ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCCTTA GAATGTCTGC 2760

TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820 

ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 2880 

TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940 

CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGCGGCCTGA 3000 

CGTTCCTTTG CACACAATCA ATGCTCGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

AGGCCTTTAC TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120

GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180 

CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300 

TGTGCATGGG CCCAGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360 
ACCTTGAAGG TCTTC
Unigene



i i i i i

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDEFST QQEVKARIKR
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV
FAVGVRFPRW EELHALASEP RGQHVLIAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR H 



A60 DNA sequence



Gene name: Homo sapiens type II membrane s 

Unigene number: Hs.63325
Probeset Accession #:
Nucleic Acid Accession #:
Coding sequence: 1 



I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 
AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 
CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 
TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 
CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 
GGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA
GGGAACTCGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT
AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG C
GATCTGGATG TTGTTGAAAT CACAGAAAAC A
GGGCCCTGTC TCTCAGGCTC CCTGGTC 
AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660
AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 
CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 
GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840
TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900
G GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960
C TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 
GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 
GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 
GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 
GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

A61 Protein sequence: 



LDLa domain: 54-9 
Cellular Localization: plas 



I I I I I I 



'KSFPEGP AVAVRLSKDR STLQVLDSAT 
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS
GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQAEVQV IDSTRCNADD
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC G
KVSAYLNWIY NVWKAEL
A62 DNA SEQUENCE

Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens]

Nucleic Acid Accession #: none found 

Coding sequence: 103-579 (underlined sequences correspond to start and stop codons) 



CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAAGCGCT AGTAAGGTCT 
CTGAGATCCT TGCACTAGCT ACATCCTCAG GGTAGG/ 
CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAC 
AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA A 
AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTC 
CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCf 
CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 420
TGGATTGATG GGGCCATGTA TCTGTACAGA TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 480 
AAGCACTGTG CTGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540 
AACAAGCGCC AACACTTCCT GTGCAAGTAC CGACCATAGA GCAAGAATCA AGATTCTGCT 600
AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 
TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 
GGCTTAGAGA CAGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 
CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 
GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 
AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 
ACACCCTTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020 
TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A63 Protein sequence:



ESTs ; Weakly s 
Hs. 105484 




k PRECURSOR [H. sapiens] 






MA RSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 



Unigene number: Hs.2877 
Probeset Accession # : X63629 
Nucleic Acid Accession #: X63629 
Coding sequence: 54-2543 (start ai 

1 11 21 31 41 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTC1 

TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 

CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 

C ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360

T CCCTGAAAAT GGCAAGGGTC 420 

A TAGAGACACC AAGATTTTCT 480 

T GTAGAGAAGG 540 

AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600

TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 

CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780

T CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 

AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140

GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260

AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320

CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380

CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500



CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC A

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGC 

AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 



C GTAGCAACTT GGCGGAGACA G 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA G 



TATTAAAGAA A 



Gene name: Cadherin 3, P-cadherin (placental) 

Unigene number: Hs.2877

Probeset Accession #: X63629

Protein Accession #: CAA45177

Signal sequence: 1-24

Transmembrane domain: 659-675 (second underlined sequence) 

Cellular localization: pla 



KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 
YELFGHAVSE NGASVEDPMH ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAKVPEN AVGHEVQRLT VTDLDAPNSP
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID
VHDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EHGSRFKKIA DHYGGGEDD 



A66 DNA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number: Hs.106778

Probeset Accession #: N51919
Nucleic Acid Accession #: AE189723

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 
GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAG
CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC C 
ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA A
ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 

ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 



G CTGTTAGCAG AATTGTTGAG GCGGGCTGTG T 
AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 
ATGGGTCTTG ATGGACTTCA ACAAGACTAC A 
GAGCAAAAGT GGATGGC 

TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 
GGGCAGACCT TGACACTTAC T 




GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520 

CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2580 

GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640 

AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 2700



A67 Protein sequence:

Gene name: ATPase, Ca++ transporting, type 2C, member 1

Unigene number: Hs.106778

Probeset Accession #: N51919

Protein Accession #: AAF27813

Signal sequence: none found

Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878

Pfam domains: Cation ATPase N [6-83], E1-E2 ATPase [89-324], Hydrolase [328-549], Cation A
[744-889]

Cellular Localization: not determined 

1 11 21 31 41 

MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSH — 
SQFKNPLIML LLASAVISVL MHQFDDAVSI TVAILIVVTV A
PECHCVHEGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF E
KVTAPQPAAT NGDLASRSNI AFMGTLVRCG KAKGVVI
PLQKSMDLLG KQLSFYSFGI IGIIMLVGWL LGKDILEMFT I
TLALGVMRMV KKRAIVKKLP IVETLGCCNV ICSDKTGTLT KNEHTVTHIF T
GVGYNQFGEV IVDGDWHGF YNPAVSRIVE AGCVCNDAVI K 
MGLDGLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK 480 
GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIID PPRTGVKEAV 540 
TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 600 
FYRASPRHKM KIIKSLQKNG SWAMTGDGV NDAVALKAAD IGVAKGQTGT DVCKEAADMI 660 
LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMNFP NPLNAMQILW 720 
INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKNLI LKILVSSIII VCGTLFVFWR 780 
ELRDNVITPR DTTMTFTCFV FFDHFNALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840 
LVIYFPPLQK VFQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSSSF 900 
LEV 
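
The coding-sequence coordinates given in these listings can be cross-checked against the protein listings. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the stated range includes the stop codon; the function names are illustrative and are not part of the application. It uses the standard genetic code.

```python
# Illustrative helpers (hypothetical names) for checking a listed
# "Coding sequence: start-end" range against a listed protein sequence.

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_protein_length(start, end):
    """Residues encoded by a CDS at 1-based inclusive positions start..end.

    Assumes the range includes the stop codon, so one codon is subtracted.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    return length // 3 - 1


def translate(dna):
    """Translate a DNA string, ignoring spaces, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unreadable codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Coordinates 1-2712 (A66) imply a 903-residue protein, matching A67.
print(cds_protein_length(1, 2712))
# The first 30 bases of A66 translate to the first block of A67.
print(translate("ATGATTCCTG TATTGACATC AAAAAAAGCA"))
```

Under these assumptions, the range 1-2712 corresponds to 903 residues, which agrees with the A67 listing (fifteen rows of 60 residues plus the final LEV), and the first 30 bases of A66 translate to MIPVLTSKKA.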

A68 DNA SEQUENCE 

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 

Unigene number: Hs.170195

Probeset Accession #: BE616633 
Nucleic Acid Accession #: NM_001719 

Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons) 



GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660
ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860
CAATAAAACG AATGAATG 



A69 Protein sequence:

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)



I I I I I I 

MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIKRRLRSQE RREMQREILS 
ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS
LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL EKIPEGEAVT AAEFRIYKDY
IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWVVNPR
HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS
QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE
GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY
RNMVVRACGC H



Gene name: bone morphogenetic protein receptor IB (ALK-6) 

Unigene number: Hs.87223

Probeset Accession #: AA250737

Nucleic Acid Accession #: NM_001203

Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons) 

i i 1 i 1 i 1 i 1 r 



GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT T 



CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC A 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG A 
TTGCGTTGTA AATGCCACCA GCATTGTCCA GAAGACTCAG T 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 
GGTTGCCTAG GACTAGAAGG CTCAGATTT 



CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA T 
TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT T 
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT A 
ATTGGAAAAG GTCGCTATGG GGAAGT1 

GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 
TATCTGAAGT CCACCACCCT AGACGCTAAA Tl 
AGTGGCTTAT GTCATTTACA CACAGAJ 

CAT CGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 
on GACCTGGGCC TGGCTGTTAA ATTTAT1 

G GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG C 

C AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTC T GA TAGGAGAG GAAAAGTAAG 1800 

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 




MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 

DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180 

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 



A72 DNA SEQUENCE 

Gene name: Homo sapiens type II membrane serine protease mRNA 

Unigene number: Hs. 63325 

Probeset Accession #: AA411502 
Nucleic Acid Accession #: NM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 



ATCTTACACC AT CCTG ACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG 03AGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TT CACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT G TAA 
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 
GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 
KVSAYLNWIY NVWKAEL 



Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H. sapiens] 
Unigene number: Hs. 19322 

Probeset Accession #: AA0884S8 



Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 




GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGC-AGGA AGAAGAAGGA 
CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 
CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 
TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG C 
A GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 
T GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT C 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA T 
TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA A 



ATTGCAGTAC GTTAGAGCAA GAAGAATTTT C 
GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 



TTCAACAAAT CCACTGATGC 



AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA 



ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA G 
TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 



\ GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT G 

IGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 672 0 

TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 

ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 

TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 

CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020 

AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 

TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 

GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 

AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 

GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 

TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 

AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 

TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 

CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 

ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 

TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 

GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTOVTC 7740 

CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAARAATAA 7800 

AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 

GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 

TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 

AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 

GTGAGTCTCA AGCAGGGATT TGGGT CAAT A ATT AACG AT C AGTCAOGAAC ATTTGCAAAG 8100 

CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 

A AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 



TAATACATGG AATGTTAAAC ATTTCTGATG G 

T AGGAATACCT 






ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 

AAGATCTAAG CATGATTGTG CTGCTGC 
CTTGCATCTA CAACTCTTCC TTCTACTGCC G 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 
TTTAGGGTAC 



9180 

TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA T 
TTACATGCAT TGGATAT CAA TTTGATGATA TTATTATGCA GCAATT1 

T ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 

T TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 

GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 

AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 

ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATi 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC A 

CCAAAATAAC GAATCTCCAC AT AGT CAATT CATTGTTAAG GTGTATTAGA GATGGACAGT 9780 

TAGTCATAT C AGTTTCTTTT TTCCATTTGT ATAGCTTGAA G AG AAACT CA CTGCTGAGAA 9840 

ATT GATG GAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 

TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 

GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 

ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 

TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 

TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 102 00 

CAGATTCT CA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 

GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA T CATTTAAT A CCGTGTCTTC TAAATTTGAA 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 



AATTTAGAGA TATATT AAT C ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 
I CCCAGAACAG GAGGAGCCTT 



.0380 
.0500 



TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 



Protein sequence: 

Gene name: SCCA2 or 2b, serine (or cysteine) proteinase i 

Unigene number: Hs . 2 2 7 94 a 

Signal sequence: 



1 11 21 31 41 51 

I I I I I I 

MNSLSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYS LKIANKLFGE KTYQFLQEYL 

DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 

YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 

LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME SSYDLKDTLR 

TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAVV VVELSSPSTN 






Gene name: hypothetical protein FLJ13459 
Unigene number: none found 
Probeset Accession #: XM_047266 
Nucleic Acid Accession #: XM_047266 

Coding sequence: 485-1471 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTGACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG CTGGGATTAC AGGTGTGAGC 60 

CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC TCATCTGCAA AACAGAGGTA 120 

ATAAGATCAG TATCTTCTTA ATGGAAGCAC CTGGACTACA TTTTTTTCAT TCATTGTTAT 180 

CATAAATGAG GACTAACCTG TCTCCCGTTG GGAGTTTTGA ACCTAGACCT CATGTCTTCA 240 

TGACGTCATC ACTGCCCCAG GCCCAGCTGT GTCCCTACAC CAGCCCCAGC TGACGCATCT 300 

ATCCTGGCGG G 

GTCAAGGTGG GCAGCAAAGT CTTCCTGACA C 

TCACATGTGG ATTGACATCT TTCCTCAAGA TGTGCCTGCT CCACCCCCAG TTGACATCAA 



TGCAGTGCTG GTCCTGCAGG TCTGGGACTA TGACCGCATC TCTGCCAATG ACTTCCTTGG 



GAGGGGCTGG TGGCCGGTAG TGAAGCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGCGCA 
GGAGGCTCAG GCTGGCAAGA AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC G 



GTTTGAGCTG CTGACTGTGG AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGCGGAAGCA 

GCCAGAGCCT CTGGAGAAAC CCAGCCGCCC CAAAACTTCC TTCAACTGGT TTGTGAACCC 

GCTGAAGACC TTTGTCTTCT TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT 

ACTGGTGCTG CTCACCGTCT TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAGATCAG 

CCAGGTCATC TTCCGTCCCC TCCACAA GTG A CTCTCGCTG ACCTTGGACA CTCACCCAGG 

GTGCCAACCC TTCAATGCCT GCTCCTGGAA GTCTTTCTTA CCCATGTGAG CTACCCCAGA 

GTCTAGTGCT TCCTCTGAAT AAACCTATCA CAGCCACTG 



A78 Protein sequence: 

Gene name: hypothetical protein FLJ13459 
I I I I I I 

MWIDIFPQDV PAPPPVDIKP RQPISYELRV VIWNTEDWL DDENPLTGEM S~ 

KGLEHDKQET DVHFNSLTGE GNFNWRFVFR FDYLPTEREV SVWRRSGPPA LEEAEPRQPA 

VLVLQVWDYD RISANDFLGS LELQLPDMVR GARGFELCSV QLARNGAGPR CNLFRCRRLR 

GWWPVVKLKE AEDVEREAQE AQAGKKKRKQ RRRKGRPEDL EFTDMGGNVY ILTGKVEAEF 

ELLTVEEAEK RPVGKGRKQP EPLEKPSRPK TSFNWFVNPL KTFVFFIWRR YWRTLVLLLL 
VLLTVFLLLV FYTIPGQISQ V 



Homo sapiens mRNA; cDNA DKFZp434K0322 (from clone DKFZp434K0322) 
Unigene number: Hs. 161031 

Probeset Accession #: AL137708 
Nucleic Acid Accession #: AL137708 

Coding sequence: 1315-1791 (underlined sequences correspond to start and stop codons) 
CCCCTGCAGC ACTGGGCAGA CATGCTGGCC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360 

CCCCTGCGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTGCCCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 3480 
GCACTTGCCC AGGCCGCCCT GCAGGACCAC TGCAATAAAC GCCTTCTCCT GCC 



; cDNA DKFZp434K0322 (from clone DKFZp434K0322 ) 



T PCELWDSQEG CGDNPAKWGL QLSTDALSLA 
I AGALAAGVLL VSCLLCAACC CCRRHRKKPH DKESVGLGSA KGTTTTHLVR 
SGSLLTQSRE GLKSRLQSPG QRGEPSPRDG L 



Unigene number: Hs.2877 
Probeset Accession #: X63629 

Nucleic Acid Accession #: X63629 

Coding sequence: 54-2543 (start and stop codons are underlined) 



GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 
CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 
AAAG AAGGT C ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 
GTCT CTT AG A GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 
CCACTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA C CAGCGTGGC 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG C 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG T 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 



CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC A 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CAT CACGG AC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCAC 
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC T 
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 






GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 



CTTCAAAAGT G 

GACCCCAATG C 
Transmembrane domain: 659-675 (second underlined sequence) 

Cellular localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGSED QDYDITQLHR 720 

GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 

A83 DNA SEQUENCE 

Gene name: putative G-protein coupled receptor 
Unigene number: Hs. 16085 

Probeset " 
Nucleic Acid 
Coding sequel 



GGGCCTGTGG 



CCATGTTTGA GCTCATCATC TTTGAAATCT TAGGAGTATT 
TCTCATGGCT CTTCTTTCTG G 
TGTCAACTGC CCATACACTT ACATGTCTTA CTTCCTCAGG AATGTGACTG ACACAGATAT 
TGCTGCAAAC CATGGATATG ATCATAJ 
CAATGTTCCA GAAGGGGGAA GTGCATAACA A 

TTTCTGGGGA ATGATAAAAA GTGTTACCAC TTCAGCATCA GGAAGTGAAA ATCTTACTCT 840 

TATTCAACAG GAAGTGGATG CTTTGGAAGA ATTAAGCAGG CAGCTTTTTC TGGAAACAGC 900 

TGATCTATAT GCTACCAAGG AGAGAATAGA ATACTCCAAA ACCTTCAAGG GGAAATATTT 960 

TAATTTTCTT GGTTACTTTT TCTCTATTTA CTGTGTTTGG AAAATTTTCA TGGCTACCAT 1020 

CAATATTGTT TTTGATCGAG TTGGGAAAAC GGATCCTGTC ACAAGAGGCA TTGAGATCAC 1080 

TGTGAATTAT CTGGGAATCC AATTTGATGT GAAGTTTTGG TCCCAACACA TTTCCTTCAT 1140 

TCTTGTTGGA ATAATCATCG TCACATCCAT CAGAGGATTG CTGATCACTC TTACCAAGTT 1200 

CTTTTATGCC ATCTCTAGCA GTAAGTCCTC CAATGTCATT GTCCTGCTAT TAGCACAGAT 1260 

AATGGGCATG TACTTTGTCT CCTCTGTGCT GCTGATCCGA ATGAGTATGC CTTTAGAATA 1320 

CCGCACCATA ATCACTGAAG TCCTTGGAGA ACTGCAGTTC AACTTCTATC ACCGTTGGTT 1380 

TGATGTGATC TTCCTGGTCA GCGCTCTCTC TAGCAT ACT C TTCCTCTATT TGGCTCACAA 1440 

ACAGGCACCA GAGAAGCAAA TGGCACCTTG AACTTAAGCC TACTACAGAC TGTTAGAGGC 1500 

CAGTGGTTTC AAAATTTAGA TATAAGAGGG GGGAAAAATG GAACCAGGGC CTGACATTTT 1560 

ATAAACAAAC AAAATGCTAT GGTAGCATTT TTCACCTTCA TAGCATACTC CTTCCCCCTC 1620 

AGGTGATACT ATGACCATGA GTAGCATCAG CCAGAACATG AGAGGGAGAA CTAACTCAAG 1680 

ACAATACTCA GCAGAGAGCA TCCCGTGTGG ATATGAGGCT GGTGTAGAGG CGGAGAGGAG 1740 

CCAAGAAACT AAAGGTGAAA AATACACTGG AACTCTGGGG CAAGACATGT CTATGGTAGC 1800 

TGAGCCAAAC ACGTAGGATT TCCGTTTTAA GGTTCACATG GAAAAGGTTA TAGCTTTGCC 1860 
TTGAGATTGA CTCATTAAAA TCAGAGACTG T 



Gene name: putative G-protein coupled receptor 

Unigene number: Hs. 16085 

Protein Accession #: NP_057418.1 

Signal sequence: none found 

Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 342-3 
MSFLIDSSIM ITSQILFFGF GWLFFMEQLF KDYEIRQYW QVIFSVTFAF SCTMFELIIF 60 

EILGVLNSSS RYFHKKMNLC VILLILVFMV PFYIGYFIVS NIRLLHKQRL LFSCLLWLTF 120 

MYFFWKLGDP FPILSPKHGI LSIEQLISRV GVIGVTLMAL LSGFGAVNCP YTYMSYFLRN 180 

VTDTDILALE RRLLQTMDMI ISKKKRMAMA RRTMFQKGEV H1IKPSGFWGH IKSVTTSASG 240 

SENLTLIQQE VDALEELSRQ LFLETADLYA TKERIEYSKT FKGKYFNFLG YFFSIYCVWK 300 

5KTDPVT RGIEITVNYL GIQFDVKFWS QHISFILVGI IIVTSIRGLL 360 

I SSSKSSNVIV LLLAQIMGMY FVSSVLLIRM SMPLEYRTII TEVLGELQFN 420 
ILSSILF LYLAHKQAPE KQMAP 

A85 DNA SEQUENCE: 

Gene name: TTK protein kinase 

Unigene number: Hs. 169840 

Probeset Accession #: M86699 

Nucleic Acid Accession #: NM_003318 

Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons) 



CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTCCCAC CACCCCTGGC T. 
CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 
ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC O 
TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTC: 
AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAAC C TCCACCTCCT GGTTTCAAGT 
GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 



CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATAT CT A AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 



A GAAATTTAAA GTCTGTTCAA AATAGTCATT 
A GTTCTGAACT TATTATTACT GATTCAATAA 
A TCAAGTCTTC TAGCTAAATT AGAAGAAACT A 
A GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAC 
GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 



AAAAGAAACA GATATATGCT ATAAAATATG 



GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 
AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT C 
AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT T 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAA ATG A TTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAG ATT AT CT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 

TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 

TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 



TAAATAAAGT TTTGTGGCTA AAATGA 

Gene name: TTK protein kinase 



Protein Accession #: NP_00330S 

Signal sequence: none found 

Transmembrane domains: none found 

Protein Kinase Domain: 510-775 

Cellular Localization: cytoplasmic and nuclear 



VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DARDYFQMAR 
ANCKKPAFVH ISPAQPELSQ GKVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS 
EEEKKNLSAS TVLTAQESFS GSLGHLQNRN NSCDSRGQTT KARPLYGENM PPQDAEIGYR 
NSLRQTNKTK QSCPFGRVPV NLLNSPDCDV KTDDSWPCF MKRQTSRSEC RDLVVPGSKP 

SGNDSCELRN LKSVQNSHFK EPLVSDEKSS ELIITDSITL KNKTESSLLA KLEETKEYQE 

PEVPESNQKQ WQAKRKSECI NQNPAASSNH WQIPELARKV NTEQKHTTFE QFVFSVSKQS 
PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPWKNDFP PACQLSTPYG QPACFQQQQH 
QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 
LEEADNQTLD SYRNEIAYLN KLQQH3DKI I RLYDYEITDQ YIYMVHECGN IDLNSWLKKK 

KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IANQMQPDTT 

SWKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ 
IINQISKLHA IIDPNHEIEP PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH 
PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL 




bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 
BE616S33 

Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons) 



| | | | | | 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

G TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 
GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 
TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 
TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 
ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG G 
AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG C 
AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA O 

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 
AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 



GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT G 
AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAf 
GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 
AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260 
AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320 
AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 
ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440 
TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC A 



ATGAACAAGA 



GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 
CAATAAAACG A 



B Protein sequence-. 



bone morphogenetic protein 7 (osteogenic protein 1) ( 
MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60 

ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120 

LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 180 

IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWVVNPR 240 

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 300 

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360 

GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 420 
RNMVVRACGC H 

A89 DNA SEQUENCE: 

Gene name: Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763) 



I I I I I 1 

ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGGCC CGAACCCGAG 120 

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 

ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTT CCAAAT 1020 

CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 10 8 0 

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 12 00 

GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 12 60 

ACCCATCCTG TCAGGTTGGC TCGTT CAG AT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 13 2 0 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGUCGGAG GGT CAGCGGG TACCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500 

GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1680 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 

GATGGCAGAT GCCAGAAGAT GGTCCTGATG T CTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 

TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT 1860 
T TCTGTCCAGG CCCTGA 



Unigene number 



Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763) 



I I I I I I 



AAGGSGSSSS NSSGDALVTR ISILLRDLPT LKAAVIVAFA FTTLLIACLL L 
KKTRKYDIIT TPAERVEMAP LNEEDDEDED STVFDIKYRV SLPAALRRQL P 
VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHFVESW SAATWGVKDW T 
ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 
LNLMEKLDSS ALRKNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 
STLGIiDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL K 
.THPVRIiARSD ARGQASLTGR RVFRRPRQSIi HGGGSAGTAT CLLVLK] 
ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAEGEPV KRGPSGQLTR HTCPGWGITH 
ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKKVLH SEEGPPSLTG 



EWING 

A91 DNA SEQUENCE 

Gene name: G protein-coupled receptor 64 
Unigene number: Hs. 184942 






WO 03/042661 



i sequences correspond 



I I I I I I 

AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 
TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 
GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 
AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 
GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 
GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 
CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 
GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 

TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA C 
TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA G 



TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900 

GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960 

CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020 

CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080 

CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 

AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 

CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 1260 

GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320 

GCTCAAAGAT TCCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380 

ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 

TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 

CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 

CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 

TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680 

GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAG CACAT C 1740 

AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 

GGCAGAGGAG GCTGGT CAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860 

ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 

GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 

TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 

TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 

CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 

TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2 340 

AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 

ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 

CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 

GCGAAA AACCAGTATT CAAGACCT CA GGAGTATCGC TGGCCTTACA 2 580 

G GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 

C TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 

TCTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 

CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820 

GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 

TCCACCACAC TGCTAGTGAA TAATGATTGC T CAGTACACG CAAGCGGGAA TGGAAATGCT 2940 

TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000 

TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCCGT 3060 

ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 

CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 

A GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 

A AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 

T TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 

ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 34 80 

TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 

AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780 

TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3 840 

TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 

GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 

CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 

AGAGGGATGA ACTGTCATCC AGACCATGTG T CAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 

ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 

TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 

GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 

TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 

TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 

ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 

CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT T. 
A92 Protein sequence 



Unigene number: 
Protein Accession #: 
Signal sequence: 
GPS domain: 



i- coupled receptor S 



3-B05, 828-B50, 858-880 



MVFSVRQCGH 



VCLADHPRGP 
PQFSAFIASS 
VQTDIVNTSS 
LKWDDIGLQ 



STVPQNQHIT 
HHTMNACAAI 
PPSSSQSIPV VPRATVLSQV 
PAIDMPPQSE TISSPMPQTH 



KPQRNICNLS S 
TLQTLSETYF I 
PCPSSPEELG KLQCDLQDPI 
SPIGEIQPLS 



SHLTSFGVLL 
ILIQLCAALL 
VKVFNTYIRK 
AVFYITWGY 
ITWGFAFPAW 
NSDWSKTATN 



FLLVSFTWMG 



MELASRVQFN FFETPALFQD 
VTLKHINPSQ DELTVRCVFW DLGRNGGRGG 
DLSRTSVLPA QMMALTFITY IGCGLSSIPL 
LLNLVFLLDS WIALYKMQGL 
YILKFCIVGW GVPAVWTII 

PCVIPLLNVS MPIWLVQLC RIKKKKQLGA QRKTSIQDLR S 
GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKQWRRY LCCGKLRIAE 
GLKKQTVNQG VSSSSNE" 
DVCLHDFTGK QHMFNEP 



A93 DNA SEQUENCE 

Gene name: EGF-like-domain, multiple 6 

Unigene number: Hs. 12844 

Probeset Accession #: N67551 

Nucleic Acid Accession #: NM_015507 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 



80 



ATCCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 
GGGAACGCGG CCAGTGCAAG GCAT CACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 
TGTGAAGCT A CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAA1G 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATT1GGAAG CTACTACTGC 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAA1TGCTT CAATACCCAA 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA A 



ATAGTTTCCA G. 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG A 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT O 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT : 
GACTGCAGCT TCAATCATGG GAT CTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC : 
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGIGA GGATGAAAAG 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT G 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC C 
TTAGAATTAC T 

A TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 



A AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 228 0 
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA A 
Cellular Localization: 



N67551 
none found 



I 



I 



I 



I 



MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV C. 

CEATCEPGCK FGECVGPMKC RCFPGYTGKT CSQDVHBOGM KPRPCQHRCV NTKGSYKCFC 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSK KXKAKIKNVT 
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS MFCLLFDYRL AGDKVGKLRV 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 
SGLCPDSLLS VDD 

GLIOBLASTOMA 

A95 DNA SEQUENCE 






Probeset Accession # : 
Nucleic Acid Accession # : none found 

Coding sequence: 38-1075 (underlined sequences correspond to start and stop codons) 



AGCAACGACG 
CCTGCTGCTG 
GTGCAACATA 



GTCGAAATGT 
CTTCCGGTGC 
AAACCCTCTG 
GAGCTTCATC 



I 

CCGGGCAGCG 
AGCAGCGCCG 
CCAGGCAACT 
CTGCCTGACT 
GGCCCAACCT 
AATGGGTTTG 



GCTTCGACAA 



TTACCCCAGC 
CCTGCTGGCA 
GCACCGGCTG 
CTGCAACGTC 
GAATGCGTCG 
TGCGTGGTAT 
CGACCTGCCC 
CAGCAGCCTC 
GGGCACTGCT 
AGTTATTCCA 
TGCTCATGGG 
AACTATCTCT 
TG ACATG AT C 
CACCCTCATT 



TGCGATGGAC 
GAACCCGGCA 
ATCACCTATG 



AGGACTGTCC 
CCGCCCGCTA 
AGAATAACTG 
GTGGGCAGGT 
CCATCATCGG 
ACCACCAGCG 
TGCTC 



GAGTGATGAG 
TGCCAGCGGC 
CGATGGCAGC 
CCACTGCAAG 
TCAAGACAAC 
GTTTGTGACT 
CAGCTCCGTC 



I 

TGGCTGCTGG GGCCGCTGTG 
GGGAACAACT TCACCAATGA 
TGCATCCCGG GCGCCTGGCA 
AAGGAGTGCC CCAAGGCTAA 
ATCCATTGCA TCATTGGTCG 
GATGAAGAGA ACTGCACAGC 



AGTGATGAGG AAAGCTGTGA 



GACCTTCCTC C 



G GAGTGCCAAC 



ACTCTGAGCC 
GGGTTAATCT 
AGCACCTGTA 



CAGCCAGGGC 
GCTCTGACTT 
AGGATGTCTC 
ACTTCAGAGA 



GCCTTGCTGG ACCAGAGGCC 
ACGGAATCTC TGAACCAAGC 
AGTGCCAGCT CCCAGGCAGC 
GGGCAGCCTG GCCCCCAGGA 
ACTGAAGAAG TATAAGTCCC 
GTTGCCATTC TAACAATTTG 
AAGTTACAGT TTGGGATATT 
TGTTTTTCTG GCGTCTCAGT 



TTTCACATTA 



CGCTGGACCC 
ATGAT CTAAC 
ATCAAAACCT 
AAGAAAACTT 



TTCTGTTTCT 
GCTGGGTAGT 



CTCATTCTGA 
GAGCCCCTCC 
TACACCTGCC 
ACCTGCCCGT 



AATTCTCTCT 
CAGGAGGCCA 
GCTTTGCACA 
TGGACGTGAG 
AAAC CATCTA 
GAGCTTTCCT CAGCAGCATA 



GTTGGAGAGA 
GCTATATTGG 
TACCTTATAG 



CAGCATATAA AACAGTATTG 
ATGCICAGAA GTGCAGGAGA 
CATTTGGGGA TTTGGGTTAG 
CAAAAAAATT CCATTTGAGC 



CTCCAAAGTT 
ACTGGTTTCT 
CTGCACTGTG 
GGTCAGGGTC 
AGACAATTTG 
TGAAACAGTG 
AGCTGTCTCT 
ACACCCTTGC 
ACATTTGTGC 
AGAGGGACTC 
TTCTCTGTGT 



CCCGCTGAGC 
ATTG TTGCAC 
CTCTCTCCCT 



TGAGGACCTA 
CCCAGCCTGT 
TTGCAAAGTC 
AGAGCCATGT 
TTCCCAAGGT 
CAACATCCCA 
TTTCCATTTG 
TTCCCTTCTA 
TTCCTTTAAC 



TATCATCAGC 
CAGCTCCTAA 
CTGGTTTCTG 
ACTTGAGTTG 
CTTGCTCATT 
CTTTTTACCT 
TCAATACCTC 
CCCAATACCA 
GTAGTTTCTC 
GATCTATTTT 



AACGTTATTT 
TAGAAATTTG 
CTCATCCTAA 
AATGCAGGCT 
GACTGTCACC 
GCCCAAAG1C 
CATGCAGCCT 



CCCAAGAATG 



TGACCTGGCT 



TTTGAGGTTA 



AAGGTCCAAA 
CAAGTCACTC 
TTATTTATCA 



GCACCTCTAG 
CTCTGAGACA 
AAATCTTTTA 
TATTTATATG 
GAAAGATGCA 



ACTTGAGGAC 
TCCTGGCTCC 
TTAGAGTTAG 



TGGCAAGAAA 
CAGCTGTCAC 
ACGCTAATTA 



CCACACTGAC 
CCATTCAGAA 
AAACAGAGCC 



TGATGAGGGG 
CTTCTTTCCG 
TGCAGGAAGT 



2220 
2340 



AGTTCTTGAA GGAAGCAGAA 
TGTGCTAGTT TTTCTTTTTT 
GGAATAAGGG GTAAAACGTT 
TAAAATGGAA CCAGGTAGAG 
CAGCTGAAGA AATGTTCAGT 
GGGGCTAAAG TGGCATTCAG 



TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTIAGAAC TCIGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 
AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 
AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 
TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3 900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGG3 ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4 080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 

A96 Protein sequence: 



Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 

Cellular Localization: 



I I I I I I 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAKQCD GLPDCFDKSD 

EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SCEENCTANP LLCSTARYHC 

KNGLCIDKSF I CDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 

VIFVLWALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL WLDHPHHCN VTYNWNGIQ 

YVASQAEQNA SEVGSPPSYS EALLDQRPAW YHLPPPPYSS DTESLNQADL PPYRSRSGSA 
HSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPEQ GTEEV 



A97 DNA SEQUENCE 

Gene name: putative GPCR, Weakly similar to (JJ36S012.1 
Unigene number: Hs. 256897 

Probeset Accession #: BE001836 
Nucleic Acid Accession #: BE001836 

Coding sequence: 1-2955 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGGCCGTCA AATCTTTTTC CTTCAACACT GGGAATGATG GGCTAGACCC AGACCCAATC 60 

ATGTTAAGTA TGATAGGGAT CAGTGGCCCA GCTGGTGTGG AAGACAAGTG GGTCATACIG 120 

GGTTTAGGCC CACATAGCAC TAAAGGTCTT CAACATGCAA TTAGAAGCTC AAATGTTGAC 180 

GGGAATATTG TTCACTCTAA AAAAGATGTT TCTATTAGAA TTTACTTACT CTTTCATGAA 240 

AACATAGATG CTTCCTCTTT CCTGATTAGT GATGGCCATC AGTTAACCCA AGTGCACTCA 3 00 

GAGAAGTCAA ATTCTGACAC AATCCAGCAA GTAACTATAA AAACTGATGG CCCAGTCGAA 360 

GAATATCAGC TGCTGCTTCA GGTGACCTAT AGAGATTCCA AGGAGAAAAG AGATTTGAGA 420 

AATTTTCTGA AGCTCTTGAA GCCTCCATTA TTATGGTCAC ATGGGCIAAT TAGAATTATC 4 80 

AGAGCAAAGG CTACCACAGA CTGCAACAGC CTGAATGGAG TCCTGCAGTG TACCTGTGAA 540 

GACAGCTACA CCTGGTTTCC TCCCTCATGC CTTGATCCCC AGAACTGCTA CCTTCACACG 600 

GCTGGAGCAC TCCCAAGCTG TGAATGTCAT CTCAACAACC TCAGCCAGAG TGTCAATTTC 660 

TGTGAGAGAA CAAAGATTTG GGGCACTTTC AAAATTAATG AAAGGTTTAC AAATGACCTT 720 

TTGAATTCAT CTTCTGCTAT ATACTCCAAA TATGCAAATG GAATTGAAAT TCAACTTAAA 780 

AAAGCATATG AAAGAATTCA AGGTTTTGAG TCGGTTCAGG TCACCCAATT TCGAAATGGA 840 

AGCATCGTTG CTGGGTATGA AGTTGTTGGC TCCAGCAGTG CATCTGAACT GCTGTCAGCC 900 

ATTGAACATG TTGCCGAGAA GGCTAAGACA GCCCTTCACA AGCTGTTTCC ATTAGAAGAC 960 

GGCTCTTTCA GAGTGTTCGG AAAAGCCCAG TGTAATGACA TTGTCTTTGG ATTTGGGTCC 1020 

AAGGATGATG AATATACCCT GCCCTGCAGC AGTGGCTACA GGGGAAACAT CACAGCCAAG 10 80 

TGTGAGTCCT CTGGGTGGCA GGTCATCAGG GAGACTTGTG TGCTCTCTCT GCTTGAAGAA 1140 

CTGAACAAGA ATTTCAGTAT GATTGTAGGC AATGCCACTG AGGCAGCTGT GTCATCCTTC 1200 

GTGCAAAATC TTTCTGTCAT CATTCGGCAA AACCCATCAA CCACAGTGGG GAATCTGGCT 1260 

TCGGTGGTGT CGATT CTGAG CAATATTTCA TCTCTGTCAC TGGCCAGCCA TTTCAGGGTG 1320 

TCCAATTCAA CAATGGAGGA TGTCATCAGT ATAGCTGACA ATATCCTTAA TTCAGCCTCA 1380 

GTAACCAACT GGACAGTCTT ACTGCGGGAA GAAAAG T AT G CCAGCTCACG GTTACTAGAG 144 0 

ACATTAGAAA ACATCAGCAC TCTGGTGCCT CCGACAGCTC TTCCTCTGAA TTTTTCTCGG 1500 

AAATTCATTG ACTGGAAAGG GATTCCAGTG AACAAAAGCC AACTCAAAAG GGGTTACAGC 1560 

TATCAGATTA AAATGTGTCC CCAAAATACA TCTATTCCCA TCAGA3C-CCG TGTC-TTAATT 1620 

GGGTCAGACC AATTCCAGAG ATCCCTTCCA GAAACTATTA TCAGCATGGC CTCGTTGACT 16 8 0 

CTGGGGAACA TTCTACCCGT TTCCAAAAAT GGAAATGCTC AGGTCAATGG ACCTGTGATA 1740 



TCCACGGTTA TTCAAAACTA TTCCATAAAT GAAGTTTTCC TATTTTTTTC CAAGATAGAG 1800 

TCAAACCTGA GCCAGCCTCA TTGTGTGTTT TGGGATTTCA GTCATTIGCA C-TGGAACGAT 1860 

GCAGGCTGCC ACCTAGTGAA TGAAACTCAA GACATCGTGA CGTGCCAATG TACTCACTTG 1920 

ACCTCCTTCT CCATATTGAT GTCACCTTTT GTCCCCTCTA CAATCTTCCC CGTTGTAAAA 1980 

TGGATCACCT ATGTGGGACT GGGTATCTCC ATTGGAAGTC TCATTTIATG CCTGATCATC 2040 

GAGGCTTTGT TTTGGAAGCA GATTAAAAAA AGCCAAACCT CTCACACACG TCGTATTTGC 2100 

ATGGTGAACA TAGCCCTGTC CCTCTTGATT GCTGATGTCT GGTTTATTGT TGGTGCCACA 2160 

GTGGACACCA CGGTGAACCC TTCTGGAGTC TGCACAGYTG CTGTGTICTT TACACACT1C 2220 

TTCTACCTCT CTTTGTTCTT CTGGATGCTC ATGCTTGGCA TCCTGCTGGC TTACCGGATC 2280 

ATCCTCGTGT TCCATCACAT GGCCCAGCAT TTGATGATGC- CTGTTGGATT TTGCCTGGGT 2340 

TATGGGTGCC CTCTCATTAT ATCTGTCATT ACCATTGCTG TCACGCAACC TAGCAATACC 240 0 

TACAAAAGGA AAGATGTGTG TTGGCTTAAC TGGTCCAATG GAAGCAAACC ACTCCTGGCT 2460 

TTTGTTGTCC CTGCACTGGC TATTGTGGCT GTGAACTTCG TTGTGGTGCT GCTAGTTCTC 2520 

ACAAAGCTCT GGAGGCCGAC TGTTGGGGAA AGACTGAGTC GGGATGACAA GGCCACCATC 2580 

ATCCGCGTGG GGAAGAGCCT CCTCATTCTG ACCCCTCTGC TAGGGCTCAC CTGGC-GCTTT 2640 

GGAATAGGAA CAATAGTGGA CAGCCAGAAT CTGGCTTGGC ATGTTATTTT TGCTTTACTC 270 0 

AATGCATTCC AGGGATTTTT TATCTTATGC TTTGGAATAC TCTTGGACAG TAAGCTGCGA 2760 

CAACTTCTGT TCAACAAGTT GTCTGCCTTA AGTTCTTGGA AGCAAACAGA AAAGCAAAAC 2820 

TCATCAGATT TATCTGCCAA ACCCAAATTC TCAAAGCCTT TCAACCCACT GCAAAACAAA 2880 

GGCCATTATG CATTTTCTCA TACTGGAGAT TCCTCCGACA ACATCATGCT AACTCAGTTT 2940 
GTCTCAAATG AATAA 
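
The relationship between a DNA record, its "Coding sequence" coordinates, and the protein record that follows can be sketched in a few lines of Python. This is an illustrative sketch only, not part of the original listing: the function names (clean_listing_lines, extract_cds, translate) are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the standard genetic code is assumed. Tokens containing digits or other non-nucleotide characters (position counters, stray marks) are simply dropped, so lines with damaged characters may still need manual review.

```python
# Sketch: rebuild a sequence from listing lines and translate its coding region.
# Assumptions (not stated in the listing): 1-based inclusive coordinates,
# standard genetic code, IUPAC nucleotide letters only.

IUPAC_NUCLEOTIDES = set("ACGTRYKMSWBDHVN")

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Standard codon table built in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_listing_lines(lines):
    """Join the letter groups of listing lines, dropping position counters."""
    groups = []
    for line in lines:
        for token in line.upper().split():
            # Keep only tokens made entirely of nucleotide letters.
            if set(token) <= IUPAC_NUCLEOTIDES:
                groups.append(token)
    return "".join(groups)


def extract_cds(sequence, start, end):
    """Return the coding region for 1-based inclusive coordinates."""
    return sequence[start - 1:end]


def translate(cds):
    """Translate complete codons; ambiguous codons become 'X', stops '*'."""
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    first_line = [
        "ATGGCCGTCA AATCTTTTTC CTTCAACACT GGGAATGATG GGCTAGACCC AGACCCAATC 60"
    ]
    seq = clean_listing_lines(first_line)
    print(translate(extract_cds(seq, 1, 60)))
```

Applied to a whole record, extract_cds would be called with the coordinates from that record's "Coding sequence" line; a translation ending in "*" indicates that the stated end coordinate falls on a stop codon.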

A98 PROTEIN SEQUENCE 

Unigene number: 

Protein Accession #: none found 

Signal sequence: 1-17 

Pfam domain: 7tm_2 [561-820] 

Transmembrane domains: 545-567, 571-593, 605-627, 641-663, 677-699, 721-743, 757-789 

Cellular Localization: plasma membrane 



I I I I I I 

MHALLLCPSV LNGASGLSLL QSPVEEYQLL LQVTYRDSKE KRDLRNFLKL LKPPLLWSHG 

LIRIIRAKAT TDCNSLNGVL QCTCEDSYTW FPPSCLDPQN CYLHTAGALP SCECHLNNLS 

QSVNFCERTK IWGTFKINER FTNDLLNSSS AIYSKYANGI EIQLKKAYER IQGFSSVQVT 

QFRNGSIVAG YEWGSSSAS ELLSAIEHVA EKAKTALHKL FPLEDGSFRV FGKAQCNDIV 

FGFGSKDDEY TLPCSSGYRG NITAKCESSG WQVIRETCVL SLLEELNKNF SKIVGt F 

AVSSFVQNLS VIIRQNPSTT VGHLASWSI LSNISSLSLA SHFRVSNSTM EDVISIADNI 

LNSASVTNWT VLLREEKYAS SRLLETLENI STLVPPTALP LNFSRKFIDW KGIPVNKSQL 

KRGYSYQIKM CPQNTSIPIR GRVLIGSDQF QRSLPETIIS MASLTLGNIL PVSKNGNAQV 

NGPVISTVIQ NYSINEVFLF FSKIESMLSQ PHCVFWDFSH LQWNDAGCIIL VNETQDIVTC 

QCTHLTSFSI LMSPFVPSTI FPWKWITYV GLGISIGSLI LCLIIEALFH KQIKKSQTSH 

TRRICMVNIA LSLLIADWF IVGATVDTTV NPSGVCTAAV FFTHFFYLSL FFWMLKLGIL 

LAYRIILVFH HMftQHLMMRV GFCLGYGCPL IISVITIAVT QPSNTYKRKD VOTLNWGNGS 

KPLLAFWPA LAIVAVNFW VLLVLTKLWR PTVGERLSRD DKATI IRVGK SLLILTPLLG 
LTWGFGIGTI VDSQNLAWHV I FALLNAFQV RTVTITYCIV K 



A99 DNA SEQUENCE 

Gene name: putative G-protein coupled receptor 
Unigene number: Hs. 16085 



Probeset Accession #: F07953 
Nucleic Acid Accession #: NM_oi6. 
Coding sequence: ...-1471 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

AGCACCTGGG AARAGGCAGA CCGTGTGAGG GGGCCTGTGG CCCCAGCG1G CTGTGGCCTC 60 

GGGGAGTGGG AAGTGGAGGC AGGAGCCTTC CTTACACTTC C-CCAIGAGTT TCCTGATCGA 120 

CTCCAGCATC ATGATTACCT CCCAAATACI ATTTTTTGGA TTTGGGTGGC TTTTCTTCAT 180 

GCGCCAATTG TTTAAAGACT ATGAGATACG TCAGTATGTT GTACAGGTGA TCTTCTCCGT 240 

GACGTTTGCA TTTTCTTGCA CCATGTTTGA GCICATCATC TTTGAAATCT TAGGAGTATT 300 

GAATAGCAGC TCCCGTTATT TTCACTGGAA AATGAACCTG TGCGTAATTC TGCTGATCCT 360 

GGTTTTCATG GTGCCTTTTT ACATTGGCTA TTTTATTGTG AGCAATATCC GACTACTGCA 420 

TAAACAACGA CTGCTTTTTT CCTGTCTCTT ATGGCTGACC TTTATGTATT TCTTCTGGAA 480 

ACTAGGAGAT CCCTTTCCCA 1TCTCAGCCC AAAACATGGG ATCTTATCCA TAGAACAGCT 540 

CATCAGCCGG GTTGGTGTGA TTGGAGTGAC TCTCATGGCT CTTCTTTCTG GATTTGGTGC 600 

TGTCAACTGC CCATACACTT ACATGTCTTA CTTCCTCAGG AATGTGACTG ACACAGATAT 660 

TCTAGCCCTG GARCGGCGAC 1GCTGCAAAC CATGG AT AT G ATCATAAGCA AAAAGAAAAG 720 

GATGGCAATG GCACGGAGAA CAATGTTCCA GAAGGGGGAA GTGCATAACA AACCATCAGG 780 

TTTCTGGGGA ATGATAAAAA GTGTTACCAC TTCAGCATCA GGAAGTGAAA ATCTTACTCT 840 

TATTCAACAG GAAGTGGATG CTTTGGAAGA ATTAAGCAGG CAGCTTTTTC TGGAAACAGC 900 

TGATCTATAT GCTACCAAGG AGAGAATAGA ATACTCCAAA ACCTTCAAGG GGAAATATTT 960 

TAATTTTCTT GGTTACTTTT TCTCTATTTA CTGTGTTTGG AAAATTTTCA TGGCTACCAT 1020 

CAATATTGTT TTTGATCGAG TTGGGAAAAC GGATCCTGTC ACAAGAGGCA TTGAGATCAC 1080 

TGTGAATTAT CTGGGAATCC AATTTGATGT GAAGTTTTGG TCCCAACACA TTTCCTTCAT 1140 

TCTTGTTGGA ATAATCATCG TCACATCCAT CAGAGGATTG CTGATCACTC TTACCAAGTT 12 00 

CTTTTATGCC ATCT CTAGCA GTAAGTCCTC CAATGTCATT GTCCTGCTAT TAGCACAGAT 1260 

AATGGGCATG TACTTTGTCT CCTCTGTGCT GCTGATCCGA ATGAGTATGC CTTTAGAATA 1320 

CCGCACCATA ATCACTGAAG TCCTTGGAGA ACTGCAGTTC AACTTCTATC ACCGTTGGTT 13 80 

TGATGTGATC TTCCTGGTCA GCGCTCTCTC TAGCATACTC TTCCTCTATT TGGCTCACAA 1440 
ACAGGCACCA GAGAAGCAAA TGGCACCTTG AACTTAAGCC T 



ATAAACAAAC AAAATGCTAT GGTAGCATTT TTCACCTTCA TAGCATACTC CTTCCCCCTC 
AGGTGATACT ATGACCATGA GTAGCATCAG CCAGAACATG AGAGGGAGAA CTAACTCAAG 
ACAATACT CA GCAGAGAGCA TCCCGTGTGG ATATGAGGCT GGTGTAGAGG CGGAGAGGAG 
C CAAG AAACT AAAGGTGAAA AATACACTGG AACTCTGGGG CAAGACATGT CTATGGTAGC 
TGAGCCAAAC ACGTAGGATT TCCGTTTTAA GGTTCACATG GAAAAGGTTA TAC-CTTTGCC 
TTGAGATTGA CTCATTAAAA TCAGAGACTG T 

A100 Protein sequence 

Gene name: putative G-protein coupled receptor 

Unigene number: Hs. 16085 

Protein Accession #: NP_057418.1 
Signal sequence: none found 

Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166, 28! 
Cellular Localization: plasma membrane 

1 11 21 31 41 51 

I I I I I I 

MSFLIDSSIM ITSQILFPGP GWLFFMRQLF KDYEIRQYW QVIFSVT 

EILGVLNSSS RYFHHKMNLC VILLILVFMV PFYIGYFIVS NIRLLHKQRL L 

MYFFWKLGDP FPILSPKHGI LSIEQLISRV GVIGVTLMAL LSGFGAVNCP YTYHSYFLRM 
VTDTDILALE RRLLQTMDMI ISKKKRMAMA RRTMFQKGEV HNKPSGFWGM IKSVTTSASG 
SENLTLIQQE VDALEELSRQ LFLETADLYA TKERIEYSKT FKGKYFNFLG YFFSIYCVWK 
IFMATINIVF DRVGKTDPVT RGIEITVNYL GIQFDVKFWS QHISFILVGI IIVTSIRGLL 
ITLTKFFYAI SSSKSSNVIV LLLAQIMGMY FVSSVLLIRM SMPLEYRTI I TEVLGELQFN 
FYHRWFDVIF LVSALSSILF LYLAHKQAPE KQMAP 
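
The residue ranges given on header lines such as "Transmembrane domains:" can be mapped back onto the printed protein sequence. The following is a minimal sketch under stated assumptions: the helper names (parse_ranges, slice_segments) are hypothetical, ranges are taken to be 1-based and inclusive, and incomplete entries (for example a range whose end coordinate was lost) are skipped rather than guessed.

```python
import re


def parse_ranges(field):
    """Extract complete 'start-end' ranges from a header line."""
    return [
        (int(start), int(end))
        for start, end in re.findall(r"(\d+)\s*-\s*(\d+)", field)
    ]


def slice_segments(protein, ranges):
    """Return the residues covered by each 1-based inclusive range."""
    residues = "".join(protein.split())  # drop the 10-residue group spacing
    return [
        residues[start - 1:end]
        for start, end in ranges
        if end <= len(residues)  # ignore ranges beyond the available text
    ]


if __name__ == "__main__":
    header = "Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166"
    n_terminus = "MSFLIDSSIM ITSQILFPGP GWLFFMRQLF"
    ranges = parse_ranges(header)
    print(ranges)
    print(slice_segments(n_terminus, ranges))
```

Because the sequence text here comes from a scanned listing, segment boundaries computed this way are only as reliable as the character count of the lines they are taken from.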

A101 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) ci 
Unigene number: Hs. 19322 

Nucleic Acid Accession #: AA088458 
Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 3 00 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 4 80 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AG CAGG ACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 12 00 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCICACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280 

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACIGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 24 00 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460 

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGC-TCTA GGTGGCTGGC AGAGGCACAT 2 520 

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAC-TCTCCG TGCAGGATGT 2580 

GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640 

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGCGTGG TGGTTCACGC 2820 



CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 30 0 0 

TAGACCCAGA TACTAGAATT AT CAG AG AG A AT AT AAAG T A ACAGTC-TTTT ATATATCTAA 30 60 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 



I 



I 



I 



GGTCATGCCC GTGTGAGCCA 



ACCGGGCACC GGACGGCTCG GGTACTTTCG T 
GGAAAGGGCT GTGTTTATGG GAAGCCJ 
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGGGGA GCCGGATGTC 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA A 
T CGCTTTTTGG CCTTGATGAT TTGAAAATAA G 

IACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 
ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 
T ACAGATGTCG CTCATCCTTT AAGTGTATCG A 



AATGCCGTGC 
AAGGGTCACT 
GATAACCTCA 
CACCTCTTGC 
TGTGCCTCTG 



CTTCAGTTCC 
ACTGCTGCAC 
CTAGTTTCCC 
AGCAAGTACA 




TGTTGGACAA 
AGCCAAAGAG 
ATGAAATGAT 



A TGACTTGTAC 



CCAGCCTGTG 



CTCCCCAAGT 
TCCCACTTGG 

GACATCGCCC 
TGCCTGCCCA 
GGGGCCACAG 



TCACGCCCCT 
CATGGACCAT 
TGGAGAAGAT 



ACTCTGAAGA 



GTGGATCATC 
CCAGGTGGGT 
TGTCTACCAC 
GGCCGGGCCA 
GAACTTCCCC 



AAGTTAGTGG 
TACACCCGTG 
ACCTGAAGAG 



GAGCGACCAG 



CTCCCCCTCC 
CAGOGGGGGG 
CTTTGGCATC 



3 ATCCACGAGC 



TTCCGGCACC 



GAAGGGGACA 
GACTCCCGTG TAGGAACCTG CACACGAGCA 
AGTAGCAGGC CCGAAAGAGG CACCCTTCCA 
TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT 
CGAAATCCCT GCTCACTGCA GCCTCCGCTT 



ACGCCTAGCC 



ACGAGATAAG 
GCACCAGCCC 
CCAAACCCAC 



AGTAGAGACA 
TGTGCCTGCT 
TCACGCTCCT 
TTCCCACTGG 
CAGTTATGTG 



CCTTTCTACT 
TAAGGCCTAT 
ATTCCAAATA 



ATATGTTTCC 



TTTCTCTCCA 
CAAAGCCACC 
ACTGCACGTT 
ATTTTCACAT 
TCTTTGTAGC 
TTCCCTCAAA 



CGGGCTACCT 
GTCAAGAGAG 
AGGTGAACAA 
AGATGGAGAG 
AGGTGATGAA 
GACACCCTTG 
TCTGATTCCA 
CGCTCTGTTG 
CCCTGGTTCA 
CCACCACACC 
GGCTGCTCTC 
GATTACAGGC 
AAAAGAAGCA 



GCACAACCTT 
CCCAGGCTGG 
AGCGATTCTC 



AACAGCCACT 
TTCATCTCTA 
GTGGGGAGGT 
ATTTGGTGCT 



Signal sequence: 
Transmembrane domains: 
Tryp_SPc domain: 
Cellular Localization: 



TMPRSS3a 
AI538613 



plasma membrane 



I 



I 



MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP I 
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 
VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVC-G NMSLLSQHPW QASLQFQGYH 
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDH PAPSHLVEKI VYHSKYKPKR 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCKT SGKGATEDGA GDASPVLNHA 
AVPLISNKIC NHRDVYGGII SPSHLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT 
A104 DNA SEQUENCE 

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175) 

Probeset Accession #: AI819198 
Nucleic Acid Accession #: NM_032551.1 

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons) 

I I I I I I 

ATGCACACCG TGGCTACGTC CGGACCCAAC GCGTCCTGGG GGGCACCGGC CAACGCCTCC 
GGCTGCCCGG GCTGTGGCGC CAACGCCTCG GACGGCCCAG TCCCTT C ;GG( ( 

GACGCCTGGC TCGTGCCGCT CTTCTTCGCG GCGCTGATGC TGCTGGGCCT GGIGGGC-AAC 
TCGCTGGTCA TCTACGTCAT CTGCCGCCAC AAGCCGATGC GGACCGTGAC CAACTTCTAC 
ATCGCCAACC TGGCGGCCAC GGACGTGACC TTCCTCCTGT C-CTGCGTCCC CTTCACGGCC 
CTGCTGTACC CGCTGCCCGG CTGGGTGCTG GGCGACTTCA TGTGCAAGTT CGICAACTAC 
ATCCAGCAGG TCTCGGTGCA GGCCACGTGT GCCACTCTGA CCGCCATGAC- TGTGGACCGC 
TGGTACGTGA CGGTGTTCCC GTTGCGCGCC CTGCACCGCC GCACGCCCCC- CCTGGCGCTG 
GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCGCCGGT GCTCGCCCTG 
CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG CCTTCCCCAG CCGCGCCCTG 
GAGCGCGCCT TCGCACTGTA CAACCTGCTG GCGCTGTACC TGCTGCCGCT GCTCGCCACC 
TGCGCCIGCT ATGCGGCCAT GCTGCGCCAC CTGGGCCGGG TCGCCGTGCG CCCCGCGCCC 
GCCGATAGCG CCCTGCAGGG GCAGGTGCTG GCAGAGCGCG CAGGCGCCGT GCGGGCCAAG 
GTCTCGCGGC TGGTGGCGGC CGTGGTCCTG CTCTTCGCCG CCTGCIGGGG CCCCATCCAG 
CTGTTCCTGG TGCTGCAGGC GCTGGGCCCC GCGGGCTCCT GGCACCCACG CAGCTACGCC 
GCCTACGCGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 
CTGCTCTACG CCTTCCTGGG CTCGCACTTC CGACAGGCCT TCCGCCGCGT CTGCCCCTGC 
GCGCCGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 
GCGGAGCTGC ACCGCCTGGG GTCCCACCCG GCCCCCGCCA GGGCGCAGAA GCCAGGGAGC 
' s,r ™ rrr-^-^r. QCTGTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA 



A105 PROTEIN SEQUENCE 

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175), mRNA 

Unigene number: Hs. 208229 

Probeset Accession #: AI819198 

Pfam domain: 7tm_1 [59-323] 



I I I I | | 

MHTVATSGPN ASWGAPAHAS GCPGCGANAS DGPVPSPRAV DAWLVPLFFA ALMLLGLVGN 

SLVIYVICHH KPMRTVTNFY IANLAATDVT FLLCCVPFTA LLYPLPGWVL GDFMCKFVm 

IQQVSVQATC ATLTAMSVDR WYVTVFPLRA LHRRTPRLAL AUSLSIKVGS AAVSAPVLAL 

IIRLSPGPRAY CSEAFPSRAL ERAFALYNLL ALYLLPLLAT CACYAANLRH LGRVAVRPAP 

ADSALQGQVJj AERAGAVRAK VSRLVAAWL LFAACWGPIQ LFLVLQALGP AGSWHPRSYA 

AYALKTWAHC MSYSNSALNP LLYAFLGSHF RQAFRRVCPC APRRPRRPRR PGPSDPAAPK 

AELHRLGSHP APARAQKPGS SGLAARGLCV LGEDNAPL 
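
A quick arithmetic check can flag records whose coding coordinates and protein length disagree. The sketch below is illustrative only; the function name expected_protein_length is hypothetical, and it assumes that the stated coding range is 1-based, inclusive, and ends on the stop codon, as the "start and stop codons" note in the headers suggests.

```python
def expected_protein_length(cds_start, cds_end, includes_stop=True):
    """Number of residues implied by 1-based inclusive CDS coordinates."""
    nucleotides = cds_end - cds_start + 1
    if nucleotides <= 0 or nucleotides % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    codons = nucleotides // 3
    # The stop codon does not contribute a residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # Coordinates quoted from record headers in this listing.
    for label, start, end in [("A104", 1, 1197), ("A93", 241, 1902)]:
        print(label, expected_protein_length(start, end))
```

For the 1-1197 range this gives 398 residues, which agrees with the protein record printed above as six 60-residue lines followed by a final line of 38 residues.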



A106 DNA SEQUENCE 



Unigene number: Hs. 52620 
Probeset Accession #: AA479726 



Coding 



3-2990 (underlined 



I I I I | 
-™„™^03 CCTCCCCCTG TTGCTGGCAT CCCGAGCTTC CTCCCTTGCC AGCCAGGACG 
CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGI 
GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTGAT TGATGCGCCA CAGACTTTTT 
TCCCCTCGAC CTCGCCGGCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 
TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 
CTAAGCTGAT TTATGCAGCA GAAGCCCCAC CGGCTGGAGA GAAACAAAAG CTCTTTTCTT 
TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC C-CCGGGCCCT 
TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCGTGCC GAGCCGGGAG 
GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCIGC ACCGCTTGCT 
CCGAGCCGCG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CTCGCCCGCC- 
GGrrrrrtaro TmprrYram /vi^nfln/in 



i^^v-^a lA^UiAfciCCi; GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 

GGGGCGGGCT GTTTTGCATT ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 72 0 

TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCCTTCCT CTGGGCAC-CC TGGGTGTTTT 780 

CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTCTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080 

ATTTTATGCT GAAAGTT CAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATG CAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 12 00 

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TC-CAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTC-TCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 



t and stop codons) 
GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGT CAAAT C GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGT CATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 174 0 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 192 0 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 198 0 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 204 0 

GTTTTAATGA AACCGCTAAA ATT CAT AT AC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 216 0 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTC-TGGGAAA TGTTCATGTC 228 0 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 234 0 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC AT AGTTA CAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 282 0 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATAT CAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 300 0 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



A107 Protein sequence: 

Gene name: Integrin, beta 8 

Unigene number: Hs. 52620 

Probeset Accession #: AA479726

Protein Accession #: NP_002205.1

Signal sequence: 1-39 

Transmembrane domains: 682-704 

EGF domain: 552-584 

INB domain: 54-469 

Cellular Localization: plasma membrane 



I I I I I 

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS N

LGPECGWCVQ EDPISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 
GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKI* 
SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHV 
VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL A 
VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW Y
TIAGEIESKA ANLNNLVVEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR
NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK
CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 
VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 
CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS
CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSSS
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 



21 31 41 51 

I I I I 

G GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240
CGCCTGCGTC CTGGACGTTC O



GCTCCTGGAC C 



CTTCTCGGAG 
TGTGGGGCGC 
TGCTGCTGGC 



G GCAGGCTCCC G 
CCGGGGGAGG G 
GCCTTCTGAC 
GGGGACCGGG 
GCGTCGTCGC 
GACAGTCAGT 



C CAGTGAAACC 



GCAGGACCCT 300 

TCCAGAGGCT 360 

AGCGAGTGCA 42 0 

TGGCTTAGGA 48 0 

3 TCGCGGCAGA GGGGAGTGGC 54 0 



TCGCCGCTCG 
TGCGCTGCTG 
CGCAGGGCGT 



TCCGTCTCCT C 



GGAGACGGCG 

CGCGGATTCC 

CTGGTGGCAG 
GCCCCAGGGG 
CGGGGGTCGT 

AACAACGTGT 



ACCAGGGCGG 
GAAGGCAGCT 

CCTGTTGCTG 

CCTCACGCCA 
CGCCCCCAAC 



CGGCAGCTGC 
CCGACGGCTC 



GGCTTCCAGT 
GCGTTGCGCT 
GACCGCCAGC 



GTCCCC-AGCG 
AGGGCGCTGG 



TTCCCGGGGC TGCTGCCCCG 



TGTTGGCTCC 
CAGATGTCTC 
GATGGAGACC 



CAGCGCCGAG 

CCCTGAAGGC 

CGCCTTGCCC 
CTTGGGGTCC 



TGGTGGGTTC C 



CCAGCTCCAG CGCCAACTCC 
GCTTGCCGGA AGGGACCATG 
TGAACTGTCA GCAGGCCACC 
ACGTGGGGTA CACGGTGCAG 
ACGGCCTGCA GCCTGGCTAC 
AGAAGATGTA CCCAGCGGTG 



GCTTCATTTG 
CATCCAGGGT 
TTGCTGATGG 
AAGAATTTTA 
TTGTTGTTGA 
TTTTTTTTTT 
TGGTGCGATC 
AGCCTCCCAC 
TGTAGAGATG 
TCTGCCTGTC 
TGAGCCTTTT 
ATTCTAAAAG 
GGCACACCTT 



CCCCCAGACT 
ATCATTGATT 
GTGTATAACA 



TATGATGGAA 



ATCTCCGAAT 



TGGCACGATT CCTTTGGATG 
TAGCATTTCT GGATATGTTT 
AGCTGGAGAT GACTGTGATG 



CTGCACAGGA 
TTTTTTTTAA 
TCGGCTCACT 



TGGGATGATT 
TTGATTGTAT 
TATATTGAAA 
TGGTTATGGT 
GAGTGCAAGT 
AAAAATTTTC 
TAGCTCCTCA 
GCAGATCATA 
GATTGTTAAG 
ATTTCCTCTC 
CAAATTCAAG 



GAAACCTGTT 
AATTTCATTG 
TGTGCTTTTT 
ACTGAAATAA 
TTAATCTTGA 

TCATAAACTA 
TTGGCGTTTC 
AATTACTATA 
TTGTCTTAGT 
GCCATAACCT 
AGGCTTTTTA 



AATTAAGGAT 
TCAAAATGCC 
TCAGACAGGG 
GCAACTTCAG 
TTACAGGTGC 
ATGTTGGCTG 
AAAGTGCTGG 
ATGCATCCAA 
TGAACTCTGT 
TAAAAAGATA 
TACCTTATGT 



TTTTATTATG 
TGTTATCTCC 
TCTTGCTCTG 

CCTCCTGGAT 



GATTACAGGC 
GGTTAAGGGG 
GAGATCAGTC 



ATCAGTCTCA 



AGTAGTAGGT G 



TCTTTCAGGC 
TGAATTTATT 
ACATTTCCAA 
GTCTTCTACA 
CTGAATAATG 



GAGACTTGGG 
TACTCTTGTT 
TTCAACGAAG 
CTCAACAGGC 



TTACAAAGTG 
ATGAAGACAT 
TAAAACCATT 
AGAGGTAATG 
TTACTTTTAA 
CCCCATGTCT 
TTTCTGCCTA 
TGTTACAAAG 



TAACAGGACT 
GTATTCCACA 

ATTTTCTACA 
TTCAGAACTC 
CATCCCCTTC 



GATTGCCTTT 
ACTGTATTCA 
TTACTTATAT 
AACCAGTATT 
TATAGATATC 



TTGTTATTAT TATTTCTTTT 
CTTTTACTGG GACTTTTTTT 
TTGCCCAGGC TGGAGTGCAG 
CTCCTGCCTC 
TTGTATTTTT 
ACTCTCCTGA CCTCAAGCAA 



GATTTACTTT 



2280 
2340 
2400 

:;460 

2520 
2580 



2760 
2940 
3120 



TATCGATCAT CTTTCTATTC 
GGTATCATCA ATGATACTCA 
TTAATAAATA TTTTATGATG 






I 



I I I I I 

S TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AG LLWDRAAA 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPBA 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSC- 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 
LVAACCCRCL RPKQDPQQSR APGGHRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 
GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 



ESTs, Weakly similar 
Unigen
Codinc 



cggcaccaag agcactggcc aagtcagctt cttctgagag agtctctaga agacatgatg 
ft ctcgtcacag tttcttccaa ccttgccatt 
:tctcaa gaggatgggg agatgacatc 
a agaaggtctc ttttatgctc aaaaaagtaa gaagccatta 



gaaaccactg ataagaattt atcacctgat 



TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 
cagtcagagc tataagagat gatagaaaaa agccttcact TCAAAGAAGT CAAATTTCAT 
GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 
TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



Weakly similar to JE03S0 Anterior gradient-2 [H. sapiens] 



Transmembrane domains: 
Cellular Localization: 



FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 
A112 DNA SEQUENCE 

Gene name: Homo sapiens type II membrane serine protease mRNA

Unigene number: Hs.63325
Probeset Accession #: AA411S02
Nucleic Acid Accession #: NM_01fi42S

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons)




Gene name : 

Probeset Accession (t: 
Protein Accession #: 



erine protease mRNA 



AA411S02 
NP_0S7509 
none found 
31-53 






I I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIP1IIAL LSLASIIIW VLIKVILDKY
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 
GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 
KVSAYLKWIY NVWKAEL 

A114 DNA SEQUENCE



il (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120

T CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

T TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

GGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

'GC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

MC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATCTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080

AACTAACCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCCTTAC AGTCAAGCAA 1260

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440

CACAAGGTAA TGTCAAAAAA ACTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340

GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAA ATG AT TTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

A AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720






TAAATAAAGT TTTGTGGCTA S 



Probes 



TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 
TGGATAGCAA GTATATTCTA AAAAACTTTG 



I I I I I I 

MNKVEDIKNK FKNEDLTDEL SLNKISADTT DNSGTVNQIM MMANNPEDWL SLLLKLEKNS
VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DARDYFQMAR
ANCKKFAFVH ISFAQFELSQ GNVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS
EEEKKNLSAS TVLTAQESFS GSLGHLQNRN NSCDSRGQTT KARFLYGENM PPQDAEIGYR 
NSLRQTNKTK QSCPFGRVPV NLLNSPDCDV KTDDSVVPCF MKRQTSRSEC RDLVVPGSKP
SGNDSCELRN LKSVQNSHFK EPLVSDEKSS ELIITDSITL KNKTESSLLA KLEETKEYQE 
PEVPESNQKQ WQAKRKSECI NQNPAASSNH WQIPELARKV NTEQKHTTFE QPVFSVSKQS
PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPVVKNDFP PACQLSTPYG QPACFQQQQH
QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 
LEEADNQTLD SYRNEIAYLN KLQQHSDKII RLYDYEITDQ YIYMVMECGN IDLNSWLKKK 
KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IANQMQPDTT
SVVKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ
IINQISKLHA IIDPNHEIEF PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH
PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL KAAKTLYEHY SGGESHNSSS S 



Unigene number:
Probeset Accession #:

Nucleic Acid Accession #: NM_001508

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | |

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60

CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120

TTCGTGATGG GCCTTCTGGG GAACAGCGCC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180

AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240

TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300

ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

GCTACGCTGC TGCACGTGCT GACACTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 

GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600

CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660 

CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780

ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840

ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960 

GCGTACATGA T 



!i CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

A GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

\ GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

2 AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTT T GA 

A117 Protein sequence: 

Gene name: G protein-coupled receptor 39 

Unigene number: Hs. 85339

Probeset Accession #: AA349893

Protein Accession #: NM_001508, NP_001409

Signal sequence: none found

Pfam domains: 7tm_1 [72-172, 224-344]

Transmembrane domains: 32-54, 68-90, 111-133, 151-173, 221-24 

Cellular Localization: plasma membrane 



MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSA TIRVTQVLQK
KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 
CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 



Gene name: 
Unigene nunr 
Probeset Ac 



I I I I I I 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 

GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120

GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 

CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 

AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540

AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 

CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 

ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200

AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260

CATCGAGATC TGAAAAGTAA AAACATTCTC GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320



T TCGACGAGAG CTTGAACAGA 1440

AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 

GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 

CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 

CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

ATCACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTC T GA TAGGAGAG GAAAAGTAAG 1800 

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920

- R GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
A CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



9 Protei 



Signal sequence : 
Transmembrane domains: 



irphogenetic protein receptor I 
UB9326 



Pfam domains: activin_receptor [30-111], protein kinase [204-491]

Cellular Localization: plasma membrane 



I I I I I I 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED
DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL P
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI I
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480
RLTALRVKKT LAKMSESQDI KL 

A120 DNA SEQUENCE 

Gene name: LIV-1 protein, estrogen regulated 
Unigene number: Hs. 79136 

Probeset Accession #: U41060 
Nucleic Acid Accession #: NM_012319.2

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons)






. _ - A ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 
GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 



AAATTAGTCC GAATTGGGAA TCTGGCATTA A' 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA T 
TCAGAAAATT ACTTCAAAAT ATAGGCA 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGC3TCAC T 
AGCATCACTC AGACCACGAG CATCACT. 

VAAGCTC TTTGCCCAGA CCATGACTCA G 
C TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG T 
3AGACAA TAGAGACTCC AAGACCTGGA A 
C ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 



AAAGCTACTG 



60 



G TGAAAAGAAG G 
A CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 
G GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT G 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT G 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680

AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800

TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTCATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400

TCTACTTAAG CTTTAAATCC TACAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640

G TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

GATTTTA GGTCTCIGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120

TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG G 
GATGTTTCTT TTTTACACAA TAAATT CCTT ATATCAGCTT G 



trogen regulated 



Signal sequence s 
Pfam domain; 
Transmembrane domains: 



EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 

TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS






RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 






Gene name: EGF-like-domain; multiple 6

Probeset Accession #: N67SS1
Nucleic Acid Accession #: NM_015507

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG COAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGGGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

CACTCCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGnCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCCATTCAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGACTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGAC T GA ATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TACCTCAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 
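
For illustration only, the coding-sequence coordinates given in the records above (for example 241-1902) can be read as 1-based, inclusive positions that run from the first base of the start codon through the last base of the stop codon. The following Python sketch is not part of the original listing; the function name `extract_cds` and the short example sequence are hypothetical. It shows one way to extract such a region and check its start and stop codons. Characters other than A, C, G, T and N are discarded, so layout spaces and position numbers may be left in the input; a base garbled into another character would, however, shift every later coordinate.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def extract_cds(sequence, start, end):
    """Return the coding region for 1-based, inclusive coordinates.

    Layout characters (spaces, line breaks, position numbers) are removed
    before the coordinates are applied.
    """
    # Keep only nucleotide letters; everything else is listing layout.
    bases = "".join(ch for ch in sequence.upper() if ch in "ACGTN")
    if not (1 <= start <= end <= len(bases)):
        raise ValueError("coordinates fall outside the sequence")
    # Convert 1-based inclusive coordinates to a Python slice.
    cds = bases[start - 1:end]
    if len(cds) % 3 != 0:
        raise ValueError("coding region length is not a multiple of three")
    if cds[:3] != "ATG":
        raise ValueError("coding region does not begin with ATG")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("coding region does not end with a stop codon")
    return cds


# Hypothetical 16-base example laid out like a listing line.
example = "GGCCA ATGAA ATGAC C 16"
print(extract_cds(example, 6, 14))
```

A check of this kind will reject a record whose stated coordinates no longer line up with an ATG and a stop codon, which is one way to flag lines where bases have been lost or garbled.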

A123 Protein sequence: 

Gene name: EGF-like-domain; multiple 6

Unigene number: Hs.12844



Cellular Localization: 



I I I I I | 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC Y~ 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV N 

LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR D 

GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS N 

FVKNSHHALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF E 
SGLCPDSLLS VDD



Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)

Unigene number: Hs. 170195

Probeset Accession #: BE616633

Nucleic Acid Accession #: NM_001719






Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 
CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 
CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 
CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 
GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGGGGCGGGA GATGCAGCGC GAGATCCTCT 
CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 
CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 



CGGTTTGATC 
TACAAGGACT 

TGGGCCTCGG 



3 AGACGCTGGA TGGGCAGAGC A 
AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 
TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC C 



AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG C
AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC C 

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860
CAATAAAACG AATGAATG 

A125 Protein sequence:

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)

Protein Accession #:
Signal sequence:



TGFb_propeptide [37-281] 



Cellular Localiza 



I I I I I I 

MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 
ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 
LQDSHFLTDA DHVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 
IRERFDNETF RTSVYQVLQE HLGRESDLFL LDSRTLKASE EGWLVFDITA TSNHWWNPR 
HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 
QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 
GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 



integrin, beta 8 
AA479726 



sequences correspond to start and stop codons) 



CCCAGAGCCG CCTCCCCCTG TTGCTGGCAT C 
CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT G 
GTTGGCCTCC CTGCCCACCT G 
TCCCCTCGAC CTCGCCGGCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 
TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT T 



A GAGCCCTCTC T 
TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG A 
G GCCCTGAGAT G 

T GCTAGGCCTG CGGAAAACGT CCTAGCGACA C 
GGCCCCGAGG TCGCCCGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 
GGGGCGGGCT GTTTTGCATT ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 720
TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 






CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 

AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500
AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC G 



ACAACAACAT TAATGTCATC T 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 
ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 
ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 



TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 
GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 
ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG T 




A CTGGAATTGT ATGCAATGCC T 
TTGATCAGTG CAAAACCTCA TGTGCTCTCA T 
CAGAATGTTT C 
TCTTGATTGG GTTGCT TAAA G' 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT A 

A GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940



GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780
GAATGTTAA 



Gene name: Integrin, beta

Protein Accession #: NP_002205.1

Signal sequence:

EGF domain:

INB domain:

Cellular Localization: plasma membrane



| | | | | |

AAFVCLQNDR



A WVFSLVLGLG CGEDNRCASS NAASCARCLA 
:svdsie YPSVHVIIPT ENEINTQVTP 
Y YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 
I SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 
M LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKIJM3IV 
M EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 
A YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 
5FNETAK IHIHRNCSCQ CEDNRGPKGK 
CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 
VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 
CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLKPHNLS QAILDQCKTS 
CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ NKSKKIKSSS 
DYRVSASKKD KLILQSVCTR A 



Gene name: G protein-coupled receptor 64

Unigene number: Hs. 184942

Probeset Accession #: AA435577

Nucleic Acid Accession #: NM_005756

Coding sequence: 73-3117 (underlined sequences correspond to start and stop codons)



AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT T CTCCACTTT G 
CTCGGGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 
rCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT T 
3 AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA A 




\ AAGATTCGAC CAATGGAACA C 

A GTGTGACCTG 

T CCACGTGGCC C 

TCCATCCCAG 1 

GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT A 
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA G 
CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC A 

CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC : 

AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 

CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 1260 

GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320 

GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380

ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440

TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 

CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560

CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 

TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680

GTTGCAAACC TCACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740 

AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 

GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860 

ATCTGTACCT GTACCCATCT AACAACCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920

GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980

TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 

TACCCTTCCA AAATCCTCAT CCACCTCTCT GCTGCTCTGC TTCTCCTCAA CCTGGTCTTC 2100

CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 

TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGA^ AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 

AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 

ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 

CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 

CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATCGC TGGCCTTACA 2580 

TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 

TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700

TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 



GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC
TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 
TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG C 
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG G

A GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 



CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT 
AAGCTAATTA AGGGCGATGA TTATTATTAC 



TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 

^TGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 

C TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600

TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660

AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780 

TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840

TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 

GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 

CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020

AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 

ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 

TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 

GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 

TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 

TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 

ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560
A129 Protein sequence

Gene name: G protein-coupled receptor 64 

Unigene number: Hs. 184942 

Protein Accession #: NP_005747.1 

Signal sequence: 1-38 

GPS domain: 564-615 

Transmembrane domains: 624-646, 660-682, 688-710, 733-755, 783-805, 828-8 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSL - 
PSSNEVETTS LNDVTLSLLP SNETEKTKIT IVKTFNASGV KPQRNICNLS SICMDSAFFR 
GEIMFQYDKE STVPQNQHIT NGTLTGVLSL SELKRSELNK TLQTLSSTYF IMCATAEAQS 
TLNCTFTIKL NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEELG KLQCDLQDPI
VCLADHPRGP PFSSSQSIPV VPRATVLSQV PKATSFAEPP DYSPVTHNVP SPIGEIQPLS 
PQPSAPIASS PAIDMPPQSE TISSPMPQTH VSGTPPPVKA SFSSPTVSAP ANVNTTSAPP 
VQTDIVNTSS ISDLENQVLQ MEKALSLGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL
LKVVDDIGLQ LNFSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE
NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVANL
TVRNLTRNVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLNETICTC 
SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIFL SVTLVTYIAF EKIRRDYPSK 
ILIQLCAALL LLNLVFLLDS WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 
VKVFNTYIRK YILKFCIVGW GVPAVVVTII LTISPDNYGL GSYGKFPNGS PDDFCWINNN
AVFYITVVGY FCVIFLLNVS MFIVVLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG
ITWGFAFFAW GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKQWRRY LCCGKLRLAE 
NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGNASTER 
NGVSFSVQNG DVCLHDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF IEQK 

Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5)

Probeset Accession #: AA460530

Nucleic Acid Accession #: NM_003667

Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons)



GTGGCGGCAA CCGGCACCTC AGTCCCCGCC GCGCTTCTCC TCGCCGCCCA CGCCGTGGGG 60 

TCAGGAACGC GGCGTCTGGC GCTGCAGACG CCCGCTGAGT TGCAGAAGCC CACGGAGCGG 120 

CGCCCGGCGC GCCACGGCCC GTAGCAGTCC GGTGCTGCTC TCCGCCCGCG TCCGGCTCGT 180 

GGCCCCCTAC TTCGGGCACC ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG 240 

TGCTGCTGCA GCTGGCGACC GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT 300 

GCCCCACACA CTGTCATTGC GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG 360

ACCTGGGGCT CTCGGAGCTG CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA 420 

GTATGAACAA CATCAGTCAG CTGCTCCCGA ATCCCCTGCC OAGTCTCCGC TTCCTGGAGG 480 

AGTTACGTCT TGCGGGAAAC GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT 540 

ACAGTCTTAA AGTTCTTATG CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC 600 

TGCAGAATTT GCGAAGCCTT CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC 660 

CCCCAAGCTG TTTCAGTGGC CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT 720 

TAACAGAAAT CCCCGTCCAG GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG 780

CCCTGAACAA AATACACCAC ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG 840 

TTCTACATCT CCATAACAAT AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC 900 

ACAGCCTAGA GACTTTAGAT TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA 960 

GGACACTCTC CAACCTTAAA GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG 1020

AGAAAGCATT TGTAGGCAAC CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC 1080 

AATTTGTTGG GAGATCTGCT TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG 1140 

GTGCCTCACA AATAACTGAA TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA 1200 

1CAGATC TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC 1260 

T AGATCTGTCT TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC 1320
AAAAGCTTCA GAAAATTGAC CTAAGACATA ATGAAATCTA CGAAATTAAA G' 



TGTCGTCTTT 



AAGCCCTTCA TTCAGTGCAG TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC 1860

TGCTTGATGG CTGGCTGATC AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT 1920 

GTAATGCTTT GGTGACTTCA ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC 1980 

TGTTAATTGG GGTCATCGCA GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG 2040 

CTGGTGTGGA TGCGTTCACT TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG 2100 

GGGTTGGTTG CCATGTCATT GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC 2160 

TGCTTACTCT GGCAGCCCTG GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA 2220

CGAAAGCTCC ATTTTCTAGC CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA 2280 

CCATGGCCGC AGTTCCCCTG CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC 2340 

CTTTGCCTTT TGGGGAGCCC AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT 2400 

CCCTTTGCTT CCTCATGATG ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG 2460
GAGACCTGGA GAATATTTGG GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA 2520

CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAG TCCTGAAGTA ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700 

TGAGAAAGCA AACCTACGTC TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT 2760

CTGATGATGT CGAAAAACAG TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820 

CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG 2880 

AGAGCTGCCA TCTTTCCTCT GTGGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940 

1 A AATGTTTTCA AAGGTTGAGA ACCTGAAAAT GTGAGATTGA GTATATCAGA GCAGTAATTA 3000 

ATAAGAAGAG CTGAGGTGAA ACTCGGTTTA AA



A131 Protein sequence 

Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 

Unigene number: Hs. 285529

Protein Accession #: NP_003658.1

Transmembrane domains: 557-579, 596-618, 641-663, 687-709, 724-746, 770-792, 803-825
Cellular Localization: plasma membrane 



SLPVLLQLAT GGSSPRSGVL 
LDLSMNNISQ LLPNPLPSLR 
TEALQNLRSL QSLRLDANHT 



I I 
EPDGRKLLRV DCSDLGLSEL 
ALTYIPKGAF TGLYSLKVLM 
LHSLRHLWLD DNALTEIPVQ 
RIHSLGKKCP DGLHSLETLD 
ELGPHSNNIR SIPEKAPVGN PSLITIHFYD NPIQPVGRSA 
'LNGASQITE FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS 
SVCQKLQKID LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 
SNLLSSPPIT GLHGLTHLKL TGNHALQSLI SSENFPELKV IEMPYAYQCC 
ISNQWNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 



AVLAGVDAFT FGSPARHGAW 



MDTSRLGVLL 
PSNLSVFTSY 
LQNNQLRHVP 
AFRSLSALQA 

FQHLPELRTL 

LPSLIKLDLS 

CSPSPGPFKP 
AVNMLTGVSS 

STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC
PVAFLSFSSL INLTFISPEV IKFILLVVVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS
VAFVPCL 



3-1795 (underlined s 



Gene name: parathyroid h 
unigene number: 
Probeset A 



Coding sequence: 



3 start and stop codons) 
GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2040

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160 

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520 

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640 



Gene name: parathyroid hormone receptor 2

Unigene number: Hs.159439

Protein Accession #: NP_005039.1

Signal sequence:

Pfam domain: 7tm_2 [141-420]

Transmembrane domains: 177-197, 228-250, 253-275, 280-302, 320-3, 363-385, 398-4

Cellular Localization: plasma membrane



MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 
GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD E 
NYSDCLRFLQ PDISIGKQEP P 

L ESLIMODDPQ NSIEATSVDK SQYIGCKIAV 
CYLWGFI LIGWGFPAAF VAAWAVARAT 
CWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 
V CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 
QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 
IASRQPDSHI TLPGYVWSNS Ei 
GCQGETEDVL 



Gene name: ESTs

Unigene number: Hs.120899

Probeset Accession #: AA983251

Nucleic Acid Accession #: AA983251

(underlined sequences correspond to start and stop codons)



ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA



CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC
A AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 



T GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 
AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT C 



TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TCTGCCTGTC TCAGCCTCCC AA

IGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

: ATTTTACTTA TTAATAAATA TTTTATGATG 3660 



Signal sequence: 1-11 

Transmembrane domains: 402-424 

Cellular Localization: not determined 



| | | | | |

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRFPSAEA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERPDG GDATICCGSC ALRYCCSSAK 300

ARLDQGGCDN DRQQGAGEPC RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360

SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPHIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ THCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 

HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV

A136 DNA SEQUENCE 

Gene name: selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs. 89546 

Probeset Accession #: M24736

Nucleic Acid Accession #: NM_000450

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 

| | | | | |

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 60 
GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 120 
CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 180 
TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT CAACAATGTG 240 
TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAJ

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA G. 
GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 



C TGGCTTCAGT GGACTCAAGT GTGAGCAAAT T 
ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 
Z TATCAGCTGT GATAGGGGTT ACCTGCC 
T GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG C 
GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 



h ACGAGAAGCC AACGTGTAAA 
GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 
I CAAATCATCC TGCAACTTCA CCTGTGAGGA A 



GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 
CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 
GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260
GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320
GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380
TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC T
TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCC1 

AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT
CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 
TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 
CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 
TGCTTACGGA A



GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



T GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 



Gene name: Selectin E (endothelial adhesion molecule 1)
Unigene number:
Probeset Accession #:
Protein Accession #:
Signal sequence:
Transmembrane domains:
C-lectin domain:
Cellular Localization:

MIASQFLSAL



K GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG N 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 
SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 
GSYQKPSYIL 
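
The "Coding sequence" coordinates given for each nucleic acid sequence can be related to the corresponding protein sequence by translation. The following is a minimal sketch only, not part of the original disclosure: it assumes the coordinates are 1-based and inclusive, uses the standard genetic code, and the helper names `cds_slice` and `translate` are hypothetical. It translates the first 60 bases of the selectin E sequence listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def cds_slice(seq, start, end):
    """Return seq[start..end], treating coordinates as 1-based, inclusive (assumption)."""
    return seq[start - 1:end]


def translate(cds):
    """Translate complete codons from the 5' end, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First listed line (positions 1-60) of the selectin E nucleic acid sequence.
first_line = ("ATGATTGCTT CACAGTTTCT CTCAGCTCTC "
              "ACTTTGGTGC TTCTCATTAA AGAGAGTGGA").replace(" ", "")

peptide = translate(cds_slice(first_line, 1, 60))
print(peptide)
```

Under these assumptions, the first ten residues of the result agree with the first listed block of the selectin E protein sequence (MIASQFLSAL).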



A SEQUENCE : 



Unigene number: 
Probeset Accession 

Coding sequence: 



0 Anterior gradient-2 



;rlined sequences 



d start and stop codons) 



CTACACTCAG C 
GCAATAAAAA A 
ACTTGGGTAC A 



CAAAATCAAG 
GAAACCACTG 
GACCCTTCTT 
TATGAGCCTC 
CAGTCAGAGC 
GAAGAAAACC 
TTACTATTTA 



ATAAGAATTT 
TAACAGTTAG 
GGGATTTACC 
TATAAGAGAT 
TCTGGCACAT 



ATCACCTGAT 
AGCTGACATA 
CCTATTGATA 
GATAGAAAAA 
TGACAAATAC 



CTCGTCACAG 
ACACTCTCAA 
TTTTATGCTC 
TACTCTCAAG 
AATAAGTTCA 
GGGCAATATG 
GCTGGAAGAT 
GAAAACATGA 
AGCCTTCACT 
TAAATGTGCA 
ATAGTCTTAT 



AGACATGATG 



AAAAAAGTAA 
CACTAAAGAA 
TCATGCTAAA 
TGCCTAGAAT 
ACTCTAACAG 
AGAAAGCATT 
TCAAAGAAGT 
AGTATATAGA 
TAAAATAAAT 



AGATGACATC 
GAAGCCATTA 
AGTATTTGCC 
CCTTATGCAT 



A139 Protein sequence

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2
Probeset Accession #: AA487468
Signal sequence:
Transmembrane domains:
Cellular Localization:



P PQTLSRGWGD DITHVQTYE3 GLFYAQKSKK 
D CQYSQALKKV FAQNEEIQEM AQNKFIHLNL MKETTDKNLS PDGQYVPRIM 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSSL 



Probeset Accession #: AI538613

Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)



ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA G 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC C 
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG T 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCT
TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900

CTTCAGTTCC AGGGCTACCA CCTGTCCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

3 AGGTGAACAA GCCTGGGGTG 1500 



ACCTGAAGAG GAAGGGGACA 



CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 
AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA Ac 
TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG C 
TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 
CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC
ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 
AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTGCA 
ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 
CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 
ATGACTCGTT TAAGGCCTAT TTTCATCATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 
TTGTCCTTTG ATTCCAAATA ATATGTTTCC 



I'MPRSS 
Kg. 298241 
AI538613 
BAB20077 

43-65, 239-261 



1 Protein sequence: 



MGENDPPAVE APFSFRSLFG



V TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH

I DLYLPKSWTI Q " 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE N 



A142 DNA SEQUENCE 

Gene name: ESTs; Weakly similar to LITHOSTATHINE 1

Probeset Accession #: AA314779



t and stop codons) 



CCAAACAGAT T 

3 TCCTGGGTGA TATCATCATG 
C CACAAGTCCA ATTGCTATGG TTACTTCAGG 
A TGCCGAGCTC GAGTGTCAGT CTTACGGAAA CGGAGCCCAC 
CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAGTGGCTAT 
CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 

_ , - - R TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 

G CTGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 
C AACACTTCCT GTGCAAGTAC CGACCATAGA GCAAGAATCA AGATTCTGCT 
AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 
TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 
GGCTTAGAGA CAGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 

GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 

AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 

ACACCCTTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020

TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 






ESTs; Weakly similar t 

Hs. 105484 

AA314779 



Signal sequence: 1-22 

domains: none found 



LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQHICG AvffLYRSWSG 
EMSSNNNFLT WSSNECKKRQ HFLCKYRP 

4 DNA SEQUENCE 



Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession # 
Coding sequence: 



TCTCTCTGCA GCCATGGGGC 



CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT G 

CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300

AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360

GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420

CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 

660 
720 

GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 

AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 

AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320

CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500

CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG C 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC C 



TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT C 
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA A 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA : 
GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT C 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTT 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



Gene name: Cadherin 3, P-cadherin (placental)
Unigene number: Hs.2877
Probeset Accession #:
Protein Accession #: CAA45177
Signal sequence: 1-24
Cellular localization:



MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF



A146 DNA SEQUENCE:

Gene name: TTK protein kinase

Unigene number: Hs.169840

Probeset Accession #: M86699

Nucleic Acid Accession #: NM_003318

Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons) 



CTCACCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 



3 ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC T 
AGGCTGGAGT GCAGTGGTGC GATCTTC 
GATCCTCCCA CTTTAGCCTC CTGAGTAGCT G 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC T 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 
AGTCACCATG CCTGACCTTA TAATTC] 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT G 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 
A GATACAGTAA AAATGGGGTA TGGGAAGGTA G 



T GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC A 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTX3GCA AGCTAAGAGA AAGTCAGAGT 2160 

h GAATCCTGCT GCATCTTCAA ATCACTGGCA GAITCCGGAG TTAGCCCGAA 2220

C AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

C AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 
GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATCCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTAGGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAAATG ATTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 

TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 

TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTGTGGCTA AAATGA 






I I I I I I 

MNKVRDIKNK FKMEDLTDEL SLNKISADTT DNSG'fVNQIM MMANNPEDWL S 
VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DAEDYFQMAR 
ANCKKFAFVH ISFAQFELSQ GNVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS 
EEEKKNLSAS TVLTAQESFS GSLGHLQNRN NSCDSRGQTT KARFLYGEKM PPQDAEIGYR 
SSLRQTNKTK USCPFGRVPV NLLNSPDCDV KTDDSWPCF MKRQTSRSEC RDLWPGSKP 
SGMDSCELRN LKSVQNSHFK EPLVSDEKSS ELIITDSITL KNICTESSLLA KLEETKEYQE 
PEVPESNQKQ WQAKRKSECI NQNPAASSNH WQIPELARKV NTEQKHTTFE QPVFSVSKQS 
PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPWKNDFP PACQLSTPYG QPACFQQQQH 
QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 
LEEADNQTLD SYRHEIAYLN KLQQHSDKII RLYDYEITDQ YIYMVKECGN IDLNSWLKKK 
KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IAMQMQPDTT 
SWKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ 
IINQISKLHA IIDPNHEIEF PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH 
PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL K~ - 



Gene name: endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor
Unigene number: Hs.258583

Probeset Accession #: NM_012152
Nucleic Acid Accession #: NM_012152

Coding sequence: 43-1104 (underlined sequences correspond to start and stop codons) 

I I' I' I' I' f 
CTTCTTTAAA TTTCTTTCTA GGATGTT CAC T ' 

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 
GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 
TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AAITTCA1TT CCCCTTCTAC 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 
GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGT1AT CGCCGTGGAG 
AGG CACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 
CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 
TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 
TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 
CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 
TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAG3G 
GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 
AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 
GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 
ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 
GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTAITAG CCAAGGTGCA 
GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140
GTCTTAGG 



A149 Protein sequence: 

Gene name: endothelial differentiation, lysophosphatidic acid G-protein-coupled r 

Unigene number: Hs.258583
Probeset Accession #: NM_012152

Protein Accession #: NP_036284
Signal sequence: none found 

Transmembrane domains: 
Cellular Localization: 

1 11 21 31 41 51 

I I I I I I 

MNECHYDKHM DFFYNRSNTD TVEDWTGTKL VIVLCVGTPP CLFIFFSNSL VIAAVIKHRK 
FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLL3 SSLTASLTNL 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGKNCL OJISACSSLA 
PIYSRSYLVP WTVSNLMAFL IMVWYLRIY VYVKRKTNVL SPKTSGSISR RRTPMKLMKT 
VMTVLGAFW CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWN? IIYSYKDEDM 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 



A150 DNA SEQUENCE

Unigene number: Hs.293616

Probeset Accession #: AW043782 



AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCCCGCCATG TGGCTCCTGG GGCCGCTGTG 60 




GTGCAACATA CCAGGCAACT T 
GTGTGACGGG CTGCCTGACT GCTTCGACAA G 
GTCGAAATGT GGCCCAACCT T 
CTTCCGGTGC AATGGGTTTG A 

3 CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 
C TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG A 
AAGTTCTCAA GAACCCGGCA G 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC A 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA C 
GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720 
CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780 
GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840 
TGCGTGGTAT GACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 
CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080 

AGTTATTCCA AAGTCCATAT GGGTTAAT CT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT 1CCCTTGGGA CCCGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800 

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACT C CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640 
TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAAGGG G 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC Ti 

CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 
& AAACAGAGCC TGCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880

TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AARAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGITAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GKMCTAMARM 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
C AGGAGGACCC AAGGGAACCC G. 






Unigene 

Signal sequence: 
Transmembrane dc 

Cellular Localiz 



I I I I I 

L LSSA^KSOLL PGNNFTNECN IPGNFMCSNG HCIPGAWQCD GLPDCFDKSD 60 

EKECPKAKSK CGPTFFPCAS GIHCIICRFR CNCFEDCPDC SDEENCTAN? LLCSTARYHC 120

KNGLCIDKSP ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQIjVYYP SITYAIIGSS 180 

VIFVLWALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL TOLDHPHHCN VTYNVIJNGIQ 240 

YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYES DTESLNQADL PPYRSRSGSA 300 

NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV 

A152 DNA SEQUENCE

Gene name: Transmembrane protease, serine 2 (TMPRSS2)

Unigene number: Hs.105807



1 11 21 31 41 51 

I I I I I I 

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGICCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGC-GAGCT GCGCTGGCCG 360

CTGGCCTACT CTGGAAGT T C ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAG C AGTGGTTTCT TTACGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCC AG AG CAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 
GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 
ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC A 
CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCIGGGGCCA CTTGGTAGTG 1980

TCCCCAGCCT ACTT CACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460
AGCCATCTT 



A153 Protein sequence: 




I I I I I I 

MALNSGSPPA IGPYYEHHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 
SNPWCTQPK SPSGTVCTSK TKKALCITLT LGTPLVGAAL AAGLLWKFMG SKCSNSGIEC 
DSSGTCINPS NMCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY 
GRAACRDMGY KNNFYSSQGI VDDSGSTSPM KLNTSAGNVD IYKKLYHSDA CSSKAWSLR 
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 
PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 
VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 



A154 DNA SEQUENCE 

Gene name: Homo sapiens cDNA FLJ13S81 fis, clone PLACE1009039 

Probeset Accession #: AIS947S7
Nucleic Acid Accession #: 



CAGAC A K3CT GTATTTCAGT GCAGCCTGCC AGACCTCTTC TGGAGGAAGA CTGGACAAAG 
GGGGTCACAC A1TCCTTCCA TACGGTTGAG CCTCTACCTG CCTGGTGCTG GTCACAGTTC 
AGCTTCTTC A TGA TGGTGGA TCCCAATGGC AATGAAT CCA GTGCTACATA CTTCATCCTA 
ATAGGCCTCC CTGGTTTAGA AGAGGCTCAG TTCTGGTTGG CCTTCCCATT GTGCTCCCTC 
TACCTTATTG CTGTGCTAGG TAACTTGACA ATCATCTACA TTGTGCGGAC TGAGCACAGC 
CTGCATGAGC CCATGTATAT ATTTCTTTGC ATGCTTTCAG GCATTGACAT CCTCATCTCC 
ACCTCATCCA TGCCCAAAAT GCTGGCCATC TTCTGGTTCA ATTCCACTAC CATCCAGTTT 
GATGCTTGTC TGCTACAGAT GTTTGCCATC CACTCCTTAT CTGGCATGGA ATCCACAGTG 
CTGCTGGCCA TGGCTTTTGA CCGCTATGTG GCCATCTGTC ACCCACTGCG CCATGCCACA 
GTACTTACGT TGCCTCGTGT CACCAAAATT GGTGTGGCTG CTGTGGTGCG GGGGGCTGCA 
CTGATGGCAC CCCTTCCTGT CTTCATCAAG CAGCTGCCCT TCTGCCGCTC CAATATCCTT 
TCCCATTCCT ACTGCCTACA CCAAGATGTC ATGAAGCTGG CCTGTGATGA TATCCGGGTC 
AATGTCGTCT ATGGCCTTAT CGTCATCAT C TCCGCCATTG GCCTGGACTC ACTTCTCATC 
TCCTTCTCAT ATCTGCTTAT TCTTAAGACT GTGTTGGGCT TGACACGTGA AGCCCAGGCC 
AAGGCATTTG GCACTTGCGT CTCTCATGTG TGTGCTGTGT TCATATTCTA TGTACCTTTC 
ATTGGATTGT CCATGGTGCA TCGCTTTAGC AAGCGGCGTG ACTCTCCACT GCCCGTCATC 



\ CTTTTCCATG TGGCCACACA CGCTTCAGAG 1080 
CCCTAGGTGT CAGTGAT CAA ACTTCTTTTC CATTCAGAGT CCTCTGATTC AGATTTTAAT 1140 
A TACAACTCAG 1200 



GTTTTCTTGC TACATATAAT TATTAATACC CTGACTAGGT TGTGGTTGGA G 



T GACACACTGT TCTGAGAGTT TTCACAGCAT 

ATGGACCCTG TTTTTC 

TTAGTACCCT CATTGTAGCC ATGGGAAAAT TGATGTTCAG T 
GGGGTCATAC A 
GGAAGAACTG T 



GCAACAGAAC TCATGGCTTT AAT CCCACT A GCTATTGCTT ATTGTCCTGG TCCAATTGCC 
AATTACCTGT GTCTTGGAAG AAGTGATTTC TAGGTTCACC ATTA.TGGAAG A 
AGAAAGTCTC CATAGGGCTT ATAGCAAGTT ATTTATTTTT AAAAGTl 
1GATAGGCAG TGAGGTTAGG GAGCCACCAG TTATGATGGG AAGTATGGPA TGGCAGGTGT 
G AATCTTCAGG 
ACCATGCTTT ATTTGGGGCT TTGTGCAGTA TGGAACAGGG ACTTTGAGAC CGGGAAAGCA 2340

ATCTGACTTA GGCATGGGAA TCAGGCATTT TTGCTTCTGA GGGGCTATTA CCAAGGGTTA 2400 

ATAGGTTTCA TCTTCAACAG GATATGACAA CAGTCTTAAC CAAGAAACTC AAATTACATA 2460 

TACTAAAACA TGTGATCATA TATGTGGTAA GTTTCATTTT CTTTTTCAAT CCTCAGGTTC 2520 

CCTGATATGG ATTCCTATNA CATGCTTTCA TCCCCTTTTG TAATGGATAT CATATTTGGA 2580

AATGCCTATT TAATACTTGT ATTTGCTGCT GGACTGTAAG CCCATGAGGG CACTGTTTAT 2640 

TATTGAATGT CATCTCTGTT CATCATTGAC TGCTCTTTGC TCATCATTGA ATCCCCCAGC 2700 

AAAGTGCCTA GAACATAATA GTGCTTATGC TTGACACCGG TTATTTTTCA TCAAACCTGA 2760 

TTCCTTCTGT GCTGAACACA TAGCCAGGCA ATTTTCCAGC CTTCTTTGAG TTGGGTATTA 2820 

TTAAATTTTA GCCATTACTT CCAATGTGAG TGGAAGTGAC ATGTGCAATT TTTATACCTG 2880 

GCTCATAAAA CCCTCCCATG TGCAGCCTTT CATGTTGACA TTAAATGTGA CTTGGGAAGC 2940 

TATGTGTTAC ACAGAGTTAA TTAACCNGAA AGGCCTGGNA ATTTTTTGNN AANNAAACTG 3000 
TGGCOWGAG GCCCNCAACC CT 



, clone PLACE1009039 



nains: 29-51, 57-79, 82-104, 203-225, 239-261, 273-295 

Cellular Localization: not determined 



I I I I I I 

MVDPNGNESS ATYFILIGLP GLEEAQFWLA FPLCSLYLIA VLGNLTIIYI VRTEHSLHEP 

MYIFLCMLSG IDILISTSSM PKMLAIFWFN STTIQFDACL LQMFAIHSLS GMESTVLLAM 

AFDRYVAICH PLRHATVLTL PRVTKIGVAA VVRGAALMAP LPVFIKQLPF CRSNILSHSY 

CLHQDVMKLA CDDIRVNWY GLIVIISAIG LDSLLISFSY LLILKTVLGL TREAQAKAFG 

TCVSHVCAVF IFYVPFIGLS MVHRFSKRRD SPLPVILANI Y 

RQRILRLFHV ATHASEP 

A156 DNA SEQUENCE



Gene name: vasoactive intestinal peptide receptor 1 

Unigene number: Hs.198726

Probeset Accession #: X77777

Nucleic Acid Accession #: NM_004624.1

Coding sequence: 57-1544 (underlined sequences correspond to start and stop codons)



TCGGAGCCTG CGGAGGGTGG TGGTGGTGGT GGTGGTGGCC CTCGCCCGCC TCACTCATGC 60

CTCCTCCTCC TCTGCTCTCG CTCAGGCGCC TCGGTGGCGG TTGGTCGGCG CTTACCCGGC 120
TGGTGGTCGC GGCGGCCGGG GCTCGCTCTC GGGGAGGCCG GGGCGGATCT CGCGGCGCAG 180 



CGCTCTTGGG CTCCTCGCTG CAGGAGGAGT GTGACTATGT GCAGATGATC GAGGTGCAGC 300 

ACAAGCAGTG CCTGGAGGAG GCCCAGCTGG AGAATGAGAC AATAGGCTGC AGCAAGATGT 360 

GGGACAACCT CACCTGCTGG CCAGCCACCC CTCGGGGCCA GGTAGTTGTC TTGGCCTGTC 420

CCCTCATCTT CAAGCTCTTC TCCTCCATTC AAGGCCGCAA TGTAAGCCGC AGCTGCACCG 480 

ACGAAGGCTG GACGCACCTG GAGCCTGGCC CGTACCCCAT TGCCTGTGGT TTGGATGACA 540 

AGGCAGCGAG TTTGGATGAG CAGCAGACCA TGTTCTACGG TTCTGTGAAG ACCGGCTACA 600 

CCATTGGCTA CGGCCTGTCC CTCGCCACCC TTCTGGTCGC CACAGCTATC CTGAGCCTGT 660

TCAGGAAGCT CCACTGCACG CGGAACTACA TCCACATGCA CCTCTTCATA TCCTTCATCC 720

TGAGGGCTGC CGCTGTCTTC ATCAAAGACT TGGCCCTCTT CGACAGCGGG GAGTCGGACC 780 

AGTGCTCCGA GGGCTCGGTG GGCTGTAAGG CAGCCATGGT CTTTTTCCAA TATTGTGTCA 840 

TGGCTAACTT CTTCTGGCTG CTGGTGGAGG GCCTCTACCT GTACACCCTG CTTGCCGTCT 900 

CCTTCTTCTC TGAGCGGAAG TACTTCTGGG GGTACATACT CATCGGCTGG GGGGTACCCA 960

GCACATTCAC CATGGTGTGG ACCATCGCCA GGATCCATTT TGAGGATTAT GGTCTGCTCA 1020

GGTGCTGGGA CACCATCAAC TCCTCACTGT GGTGGATCAT AAAGGGCCCC ATCCTCACCT 1080 

CCATCTTGGT AAACTTCATC CTGTTTATTT GCATCATCCG AATCCTGCTT CAGAAACTGC 1140 

GGCCCCCAGA TAT CAGGAAG AGTGACAGCA GTCCATACTC AAGGCTAGCC AGGTCCACAC 1200 

3GAGTACACT ACATCATGTT CGCCTTCTTT CCGGACAATT 1260

3TCTTTGAGC TCGTCGTGGG GTCTTTCCAG GGTTTTGTGG 1320 

TGGCTATCCT CTACTGCTTC CTCAATGGTG AGGTGCAGGC GGAGCTGAGG CGGAAGTGGC 1380 



GGGCGCGCCA GCCCCGGCCC TGGGCTCGGA GGCTGC 

C CTGCCTGGAG CGTTTCTAGC AAGTGAGAGA 
3 TCATTAGACT CCTCCTCCAA 

AGGCCCCCTA CGCCAATCAA GGGCAAAAAG T 

TGCTGGCTCT TCTGCCCAAT TGGAGGAAAG CAACCGGTGG ATCCTCAAAC AACACTGGTG 
TGACCTGAGG GCAGAAAGGT TCTGCCCGGG AAGGTCACCA G 
CCTGAAATTT CACCATTGCT GTCAAGTTCC TTTGGGTTAA G 

T GCAGCTCACT ACCCTATTCT CTCTTTACGC TTAGTTATCA GCTTTTTAAA 2100 
T TGTTTGGAGA GCACACCTAT CTTAGTGGTT CCCCACCGAA 2160 
A GTCTGGTGGG AGGACGGTGC AACCCAAGGA CTGAGGGACT 2220 
CTGAAGCCTC TGGGAAATGA GAAGGCAGCC ACCAGCGAAT GCTAGGTCTC GGACTAAGCC 2280 
TACCTGCTCT CCAAGTCTCA GTGGCTTCAT CTGTCAAGTG GGACTCTGTC ACACCAGCCA 2340 
TTCTTATCTC TCTGTGCTGT GGAAGCAACA GGAATCAAGA GACTGCCCTC CTTGTCCACC 2400
CACCTATGTG CCAACTGTTG TAACTAGGCT CAGAGATGTG CACCCATGGG CTCTGACAGA 2460 
AAGCAGATCC TCACCCTGCT ACACATACAG GATTTGAACT CAGATCTGTC TGATAGGAAT 2520

GTGAAAGCAC GGACTCTTAC TGCTAACTTT TGTGTATCGT AACCAGCCAG ATCCTCTTGG 2580

TTATTTGTTT ACCACTTGTA TTATTAATGC CATTATCCCT GAATTCCCCT TGCCACCCCA 2640 

CCCTCCCTGG AGTGTGGCTG AGGAGGCCTC CATCTCATGT ATCATCTGGA TAGGAGCCTG 2700 

CTGGTCACAG CCTCCTCTGT CTGCCCTTCA CCCCAGTGGC CACTCAGCTT CCTACCCACA 2760 

CCTCTGCCAG AAGATCCCCT CAGGACTGCA ACAGGCTTGT GCAACAATAA ATGTTGGCTT 2820
GGAAAAAAAA AAAA 

A157 Protein sequence: 

Gene name: vasoactive intestinal peptide receptor 1 

Unigene number: Hs.198726

Probeset Accession #: X77777

Protein Accession #: JC2195

Signal sequence: none found 

Transmembrane domains: 181-202, 214-236, 255-277, 290-311, 332-3 

Cellular Localization: plasma membrane 



I I I I I I 

MPPPPLLSLR RLGGGWSAVT RLWAAAGAR SRGGRGGSRG AGGGGRGGVA Rr" 

RSLLGSSLQE ECDYVQMIEV QHKQCLEEAQ LENETIGCSK MWDNLTCWPA TPRGQVWLA 

CPLIFKLFSS IQGRNVSRSC TDEGWTHLEP GPYPIACGLD DKAASLDEQQ TMFYGSVKTG 

YTIGYGLSLA TLLVATAILS LFRKLHCTRN YIHMHLFISF ILRAAAVFIK DLALFDSGES 

DQCSEGSVGC KAAMVFFQYC VMANFFWLLV EGLYLYTLLA VSFFSERKYF KGYILIGWGV 

PSTFTMVWTI ARIHFEDYGL LRCWDTINSS LWWIIKGPIL TSILVHFILF ICIIRILLQK 
LRPPDIRKSD SSPYSRLARS TLLLI PLFGV HYIMFAFFPD NFKPEVKMVF ELWGSFQGF 

WAILYCFLN GEVQAELRRK WRRWHLQGVL GWNPKYRHPS GGSNGATCST QVSMLTRVSP 



A158 DNA SEQUENCE



Accession #: AW207206 
AL133619 
ined sequence 



ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA G 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC G 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC C 

CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240

GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCCC TCCCCCACCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

iCGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 



CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660

CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 

GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 

GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGC-GCCTC TGCTCCCTTG 1260



AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 

GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 

TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 

TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560

A GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620

K3TATGG GGAACAGCCA GCACCAGGGC 1680



CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 

AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCC-TGCCAT CCTGCCCGCA 1980 

CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 



Probeset Accession #: 
Protein Accession #: 

Transmembrane domains: 
Cellular Localization: 
I I I I I I

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRP WGTHTDAATS SRGWTMLCSQ AQHULLSGSP 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLPPCHLSK ALPHPDSGPH PAQDPGLWSQ 
AHPPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGG? FPSRCGNSSE 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRS? ADSLSMSSFQ 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 

TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST K 

LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

A161 PROTEIN SEQUENCE

Protein Accession #: 
Signal sequence: 
Pfam domain: 



Cellular Localization: 



I I I I I I 

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNK ESGINVDLAI STRQYHLQQL 
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE 
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 

S HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 

tC KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAKPQEVYNE YVPRGCXNKC 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 



2 DNA sequence 



Nucleic Acid Accession #: NM_001203

Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons) 



GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATA 



CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300

AATGTGGGCA CCAAGA»J>.GA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACT CAG TCAACAATAT TTGCAGCACA 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 

AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600

CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840

TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATAT CAG 1020 

ACAGTGTTGA TGAGG CATG A AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGG CTT AT GTCATTTACA CACAGAAATC TTTAGTACTC A 
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAG; 

GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 

3 CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 




Cellular Localization: 
I I I I I I

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVKNICSTD GYCFTMIEED 
DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECMKDLHPTL PPLKNRDFVD 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGSKVAV KVFFTTESAS 
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 
EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 
RLTALRVKKT LAKMSESQDI KL 






Unigene number: Hs.157601

Probeset Accession #: W074S9
Nucleic Acid Accession #: AC005383



3-2751 (underlined sequences correspond to start and stop codons) 



GAGTAGAGCC 



S GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 
ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG G 
CACTTTGCCA TCACAGTCTG TGACGGTCTG G 
GCATTCCAGT TCAGTTCCAC TCCTCA1 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG G 
CTTGCTCTGA AATACCTTCT GCACAGAGGG T 
CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG A 
CAG CTGAAGG AAAGGGGTGT CACTGTGTTT G 
GAGCTGCATG CACTGGCCAG CGAGCC1 

GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT C 
ACGCCAGACT GCAGGGTCGA CGCTCACCCC TGTCAGCACA GGACGCTGGA GATGGTCCGG 1080
GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140 
GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200 
AGGACCACCT CCCCAGGCCC CTGTGACTCG CACCCCTGCC AGAATGGAGG CACATGTGTT 1260 
CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320 
TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380 
GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440 
GCCGTGCTGA GCGAGGACT C TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500 
CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560 
GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620 
CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 
CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCCAG CGCGTCACGC AAGGGCGCGA 1740 
GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800 
GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860
GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920 
CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 
AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 
CTGGTGGTGT ATGGCAGCCA GGTGCAGACT G 
GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG G 
ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 
GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280 
GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA 2340 
AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400 
GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 
2 CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520 



TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 
ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 
TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTGCTGTG CCTTCn 
CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT A 
CGTTCCTTTG CACACAATCA ATGCTCGCCA GAATGTTGTT GACACAC 
AGGCCTTTAC TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120 
GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180 
CTIGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 
GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGC-GGGA GGGGCTGAGT 3300 
- G CCCM3GTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360 

A165 Protein sequence

Unigene number: 
Protein Accession #:
Signal sequence:
Transmembrane domains:

EGF domains:

Cellular Localization: 



P SLPLQEVHVS K 

" """3PERVRV GAPQPSSTPH LEPPLDSPET QQEVXARIKR 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKSRGVTV 
P RGQHVLLAEQ V 






SQPCQNGGTC VPEGLDGYQC LCPLAPGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGPL 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 

LTGSALRQAA ERGPGSATRT GQDRPRRWV LLTESHSEDE VAGPARHARA RELLLLGVGS 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVPMLDTSA 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAWVLT GGRGAEDAAV PAQKLRNNGI 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EKSSCSVCVS QGWILETPLR H 



aceptor mRNA, 3 ' UTR 



Coding Sequence: 87-1520 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

TGCTACCCGC GCCCGGGCTT CTGGGGTGTT CCCCAACCAC GGCCCAGCCC TGCCACACCC 60

CCCGCCCCCG GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGOGCCTCCG 120 

AGCCCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCGGCCACC GCGGCGCGGC 180 

TGCTGGTGCC CGCGTCGCCG CCCGCCTCGT TGCTGCCTCC CGCCAGCGAA AGCCCCGAGC 240 

CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 300

TCGTGGCGGG CAATGTGCTG GTGATCGTGG CCATCGCCAA GACGCCGCGG CTGCAGACGC 360

TCACCAACCT CTTCATCATG TCCCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGTGG 420 

TGCCGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC TTCTTCTGCG 480 

AGCTGTGGAC CTCAGTGGAC GTGCTGTGCG TGACGGCCAG CATCGAGACC CTGTGTGTCA 540 

TTGCCCTGGA CCGCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAGC CTGCTGACGC 600 

GCGCGCGGGC GCGGGGCCTC GTGTGCACCG TGTGGGCCAT CTCGGCCCTG GTGTCCTTCC 660

TGCCCATCCT CATGCACTGG TGGCGGGCGG AGAGCGACGA GGCGCGCCGC TGCTACAACG 720 

ACCCCAAGTG CTGCGACTTC GTCACCAACC GGGCCTACGC CATCGCCTCG TCCGTAGTCT 780 

CCTTCTACGT GCCCCTGTGC ATCATGGCCT TCGTGTACCT GCGGGTGTTC CGCGAGGCCC 840 
AGAAGCAGGT GAAGAAGATC GACAGCTGCG AGCGCCGTTT C



CGGGCTGTCT GGCCCGGCCC GGACCCCCGC C 



GTGAACTCGA AGCCCACAAT 
WGRWEYGSFF CELWTSVDVL CVTASIETLC VIALDRYLAI TSPPRYQSLL TRARARGLVC 180

TWAISALVS PLPILMHWWR AESDEARRCY NDPKCCDFVT NRAYAIASSV VSPYVPLCIM 240 

APVYLRVPRE AQKQVKKIDS CERRFLGGPA RPPSPSPSPV PAPAPPPGPP RPAAAAATAP 300 

LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPP PLANWKAFH RELVPDRLFV 360 

J FFNWLGYANS AFNPIIYCRS PDPRKAPQGL LCCARRAARR RHATHGDRPR AEGCLARPGP 420 
PPSPGAASDD DDDDWGAT P PARLLEPWAG CNGGAAADSD SSLDEPCRPG FASESKV 






Probeset Accession #: AA2S6485 
Nucleic Acid Accession #: AJ400877 

Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 

CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420

TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540

GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600

GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 

GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720

TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 

AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900

ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 

ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 

GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200

GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260
CCCTGTATGG CTTCACCCAC TGTGGAGACA C 



AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC A 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 

GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC G 



k TGATGGAGCA CGAGAACGCT 2040 

ft ATGAGGAAGG ACAAATGACT TGTGAACCAT 210 0 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220

GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 

GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 

GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580

AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700

CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 

CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 

TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880

GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000

AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060

TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 

GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 

CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300

CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC T 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC T 

CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600

ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660

AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 



Unigene number: 
Signal sequence: 1-31
Transmembrane domains: none
PFAM domains: EGF-like_domains [49-84,132-167,1 
Cellular Localization: may be 



B_domain [809-9 



MGVAGRNRPG 
LCQNTPTSYK 
HDGHNCLDVD 
DHGCSHICKE 
PQYKMHTDGR 



AAWAVLLLLL 

CSCKPGYQGE 
ECLENNGGCQ 
APRGSVACEC 



STGVHCSCPV G 



TVGSYECQCH P( 



SSDVTTIRTS 
PGAPGRPSTP 
FHLQLSGMNL 



VTFKLNEGKC 



LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 
GRQCEDIDEC GNELNGGCVH DCMIPGNYR CTCFDGFMLA 
HTCVNVMGSY ECCCKEGFFL SDHQHTCIHR SEEGLSCMNK 
RPGFELAKNQ RDCILTCNKG NGGCQHSCDD TADGPECSCH 
EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 
CKDIDECQTR NGGCDHPCKi! IVGSPDCGCK KGPKLLTDEK 
HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 
DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 
SLKNAELFPE GLRPALPEKH SSVKESPRYV NLTCSSGKQV 
LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 



KNNCVSCPGN TTTDFE 



> GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 
) GATSFODCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEPG 
I ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 
) DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLN 
' YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 
I VSRFLRPYK 






L-172S (underlin 



s correspond to stop codon) 



I I 

AAGGAGGCGG CCTCCGGGAA 
CTGGACCAGT GCATGACCGC 
ACCTACCTCA AGCCCAGAAC 
CTGGAGATGC AGGCCATGAT 
ATGAAGGAGG CACAGATGCT 
TTCAGCAGCC TGGTGAACCG 
ATGCAGAGTG 



GAGCTGGACA GCCTTGTTCA 
GAAGGAGGAG TGAAGCTTGG 
AGGATCCTGA GGCTGTTGGA 
CAGCTGGAGG AGGGAGCGTC 
CTGTGCTACC ACACCTTCCT 
GCCGAGAAGC TCTTGAAGCC 



GTGTCAGAGG CACCGGAGGA 
CCCCACGCTG GGCCAATTCA 
CCTGCTGCAG CGAGCAGCCC 
AGGCGGCATC AAAGTTCGAA 
GTCCTCACAA TACTGCAAGG 
TGTAGGGGCC TTCAACCTGA 



I I 

- AGAGCAGCCT CCATGAGGCC 
ACCAGTTCTC 
CACTGACATA 
TCCTGCTTGC 
AGTCTTCTGT 
CTGAAGAAGA 
TGACCTTCCT 



AATCCACGCT 3 60 



GACCTACAAG 
CCCGCACTTT 
GCTTCCTACT 



GAGTGCTGTG 
ATGTGGTGCT 
AGCAAGGAGA 
' ATGTTTGGGA 
GTGCCAGGCC 
CGGAAGTCCC 
ATGATGTACA 
ATACTTGAGA 
TCAGTGGATG 



AGGCCCAGCA 
TCACCTACAA 
ACTGCTGGTC 
AGGAGGACCA 
TGAAGCTCAA 
GGCGCTACTT 
TCTGGAACGG 



GGAACGTCAA CATCGAGGAG 
CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 
AACATTGATG CAGCCATCCG GCGTTTCGAG 
CAGTTTCACC ACATGTGCTA CTGGGAGCTG 
AAGATGTCCT ACTTCTACGC CGACCTGCTC 
C TACATTTACA TGAAGGCCGC CTACCTCAGC 
C GGGGACGACG AAGTGGAATT ATTTCGAGCT 



ACGAGTGCTT 



TATGACCACT ACTTGATCCC 
GACAGAAACG AAGAGGCCAT 
TCCATGGAGT CAAGGACACA 
CTAGAGAACA GCAG CAGATC 
CCGGGCTGGA AGACAGAGAC 
CCCCCTGCCC TGCCCTGCCT 



CTACGCCGTG 
GGCTGAAGAG 
GGTGAAATTG 
GAATTTTAGG 
AAACGCCCTG 
CAAACTTTTG 
CTTTCGAATC 
CATGGTCTCA 
AGCTGGACAG 



ATTGGGAAGC 
ATGCTGGAGA 
TTGAAAGGCC 
AGCATCTCTG 
CTGGAGCTGG 



TGCCAGTGCC TGCTCTGGAA 
AGCCGAAACT CACGGATGGG 
AAGGCCCAGA GAACGAGTAC 
TGTGTCTGAA ATACCTGGGC 
CCAATGAAAA GAAGATTAAA 
CCCTGCTGCT ' 



1380 
1500 



CACAACATAG 
TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 
ACGGGTACTT GCTTTTCACT 



C TTTAAAAACC AACAACAATC AACCACCTGT 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC . 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC , 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCOAGTCTG TATCTTTGCT GTATGCCAAA 
A TAAAAATCTT CTGTACTTCA CTCCAAGGTA 
A ACTGTTATTC ' 



A171 Protein sequence
Gene name: DEME-6 protein (KIAA0452)

Unigene number: Hs.125783

Probeset Accession #: AL039402 

Protein Accession #: AAC39582 

Signal sequence: none 

Transmembrane domains: 210-22S

Cellular Localization: plasma membrane 



I I I I I I 



3VTDSFSSL VNRPTLGQPT EEEIHAEVCY AECLLQRAAL TFLQDEUMVS 12 0 

FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180 

LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVMIEEAEKL 240 

LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHNKQFHH KCYWELMHCF 300 

TYKGOWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFC-K SDHKPFGDDE VELFRAVPGL 360 

KLKIAGKSLP TEKFAIRKSR RYFSSMPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420 

ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEE1IFRSISA NEKKIKYDHY 480 

LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 



t and stop codons) 



I 

A AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC 60 

G GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120

GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 

T ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT 240 

T ACCTCACCGT GTCATGGCTC 300 

G CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360

3 ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420

GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 480

ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG 540

CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600

CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 

TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC 720 

TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 780

ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 

GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA 900 

TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960

ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT 1020 

GTGACCAGTG AAGTTCICAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 

TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 

ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260

CTCCAGGAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320 

TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380

GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440 

CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1680 

GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 

CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGCGGGGCCT CAACCTCTCT 1800 

GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 

CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980 

TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 
GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 
AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 
CAGCTCACAC AGGAGGAGGA GATGGM 
TACATCCAGG CAGCTGGAGG TTACATGGTC T 

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 

GGGACCAATA GCAGCCGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 

CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTTCTTTGA CACCATCCCA 2640 

ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 

AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 



T TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 
GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA G 
CTGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGf 
CTTGTGACCT TGGCTGTTGC CCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120 
TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CCAGCTTCCA GGCCACTGCC 3180

CGGATTGGCT TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240

AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGC-TGG 3300

CCACAGCATG GGGAAATCAT ATTTCAGGAT TATCACATGA AATACAGAGA CAACACACCC 3360

ACCGTGCTTC ACGGCATCAA CCTGACCATC CGCGGCCACG AAGTGGTGGG CATCGTGGGA 3420

AGGACGGGCT CTGGGAAGTC CTCCTTGGGC ATGGCTCTCT TCCGCCTGGT GGAGCCCATG 3480 

GCAGGCCGGA TTCTCATTGA CGGCGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540 

TCCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600

CTAGATCCCT TTGACCGTCA CACTGACCAG CAGATCTGGG ATGCCTTGGA GAGGACATTC 3660

CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAACGGT 3720

GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGC-GCTC-T GCTTCGCAAC 3780 

TCCAAGATCA TCCTTATCGA TGAAGCCACA GCCTCCATTG ACATGGAGAC AGACACCCTG 3840

ATCCAGCGCA CAATCCGTGA AGCCTTCCAG GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900

GTCACCACTG TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGGGAA GGTGGTAGAA 3960

TTTGATCGGC CGGAGGTACT GCGGAAGAAG CCTGGGTCAT TGTTCGCAGC CCTCATGGCC 4020

ACAGCCACTT CTTCACTGAG ATAAGGAGAT GTGGAGACTT CATGGAGGCT GGCAGCTGAG 4080 

CTCAGAGGTT CACACAGGTG CAGCTTCGAG GCCCACAGTC TGCGACCTTC TTGTTTGGAG 4140 

ATGAGAACTT CTCCTGGAAG CAGGGGTAAA TGTAGGGGGG GTGGGGATTG CTGGATGGAA 4200 

ACCCTGGAAT AGGCTACTTG ATGGCTCTCA AGACCTTAGA ACCCCAGAAC CATCTAAGAC 4260 

ATGG3ATTCA GTGATCATGT GGTTCTCCTT TTAACTTACA TGCTGAATAA TTTTATAATA 4320

AGGTAAAAGC TTATAGTTTT CTGATCTGTG TTAGAAGTGY TGCAAATGCT GTACTGACTT 4380 



PFAM domains: ABC_transporter [502-673], ABC_membrane_region [163-432, 771-1060]

ATP-binding_domains [508-516, 1139-1147]
Cellular Localization: plasma membrane 

1 11 21 31 41 51

3DDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 

P PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL 120 

V QRLHRLWEEE VSRRGIEKAS VliVMLRFQR TRLIFDALLG ICFCIASVLG 180 

B YSEEQLGNW HGVGLCFALF LSECVKSLSF SSSWIINQRT AIRFRAAVSS 240 

FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 

YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSSVSDQRIR VTSEVLTCIK LIKMYTWEKP 360

FAKIIEGMES LTFCSKPGDG MAFSMLASLN LLRLSVFFVP I AVKGLTNS K SAVtfRFKKFF 420 

LQESPVPYVQ TLQDPSKALV FEEATLSWQQ TCPGIVNGAL 3LERNGKASE GHTRPRDALG 480 

PEEEGNSLGP ELIIKINLWS KGMMLGVCGN TGSGKSSLLS AILESMHLLE GSVGVQGSIA 540 

YVPQQAWIVS GNIRENILMG GAYDKARYLQ VLHCCSLNRD LELLPFGDMT EIGERGLNLS 600 

GGQKQRISLA RAVYSDRQIY LLDDPLSAVD AHVGKHIFEE CIKKTLRGKT WLVTHQLQY 660 

LEFCGQI1LL ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 

SQALATSLEE SLNGNAVPEH QLTQEEEMEE GSLSWRVYHH YIQAAGGYMV SCIIFFFWL 780 

IVFLTIFSFW WLSYWLEQGS GTWSSRESNG TMADLGNIAD NPQLSFYQLV YGLNALLLIC 840 

VGVCSSGIFT KVTRKASTAL HNKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQLLP 900 

IFSEQFLVLS LMVIAVLLIV SVLSPYILLM GAIIMVICFI YYMMFKKAIG VFKRLENYSR 960

SPLFSHILNS LQGLSS IHVY GKTEDFISQF KRLTDAQNNY LLLFLSSTRW MALRLEIMTN 1020 

LVTLAVALFV AFGISSTPYS FKVMAVNIVL QLASSFQATA RIGLETEAQF TAVERILQYM 1080 

KMCVSEAPLH MEGTSCPQGW PQHGEIIFQD YHMKYRDNTP TVLHGINLTI RGHEWGIVG 1140 

RTGSGKSSLG MALFRLVEPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGTIRFN 1200 

LDPFDRHTDQ QIWDALERTF LTKAISKFPK KLHTDWENG GNFSVGERQL LCIARAVLRN 1260 

SKI ILIDEAT AS IDMETDTL IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVMGNGKWE 1320 
FDRPEVLRKK PGSLFAALMA TATSSLR 



d to start and stop codons) 



TCCGGACGAC A 

ACT CAT AAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA Ti 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 



CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTC-T CCGCGCCCAG GGTGGTGCCA 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGA3GGCCCA G 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG C

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960

CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAC-CTCCAG CC-CCAACTCC 1440 



AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 

GATTCTCGAG 



TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 



AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT T 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC A 
TCTGCCTGTC TCAGCCI 
TGAGCCTTTT TTTTTTTCTA A 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 



k ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 

TGGGATGATT T 



^ TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT A 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA G 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTAO 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180

GCAGATCATA AGG MTTTl i TT( TT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600



TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A175 Protein sequence 



Unigene number: 
Signal sequence: 



Cellular Localization: 



I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESF 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR V 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRASRRR LLGVAAEGSG 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGAIAARPSP HPGTPLRSCS 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSESANS 
GARAPPTRSQ TNCCLPEGTM tJNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 
P PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 



Gene name: ESTs, Weakly similar to CGHU7L collagen alpha l(III) chain precursor [H. sapiens] 
Nucleic Acid Accession #: AA088458

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 



GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAAT GG AG 
CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTC-CCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATGGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAC- CTACTACTGG CCGCTGTCAG 840

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTC- CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGAGGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAC-CGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 
GGCAGGTCCC CTTGGGTGTC A 
GGAGTACGCA CTGGTGGGGG G 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 
TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG T 






ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 
CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACf 
TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA G 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580

GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640 

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGCGTGG TGGTTCACGC 2820 
CTCCAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG C 
CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG G 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3 000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3 060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 

A177 DNA SEQUENCE 

Gene name: Cadherin 3, P-cadherin (placental) 

Unigene number: Hs.2877 
Probeset Accession #: X63629 
Nucleic Acid Accession #: X63629

Coding sequence: 54-2543 (start and stop codons are underlined)



GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCITCACCCC TCTCTCTGCA GCCATGGGGC 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 
CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAG AT CTT CCCATCCAAA CGTATCTTAC 
GAAGACACAA GAGAGATTGG G 

CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA T 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCC-CT GTAGAGAAGG 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA G 

TCATCGTGAC CGAC CAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC C 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC A 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAC- 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTG3GC ACCCTCGACC 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 
ACCATGGCCC AGT CCCTG AG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 
T CACAGATG A CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAC-GT C-ACACAGTGG 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 
ACCATGGCAA CAAAGAG CAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCITC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
A GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
S GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520
Z TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
C CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
A GTTTCCAGAA GCCTCTTACC T 



CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCC CAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



Signal sequence: 



cadherin (placental)



6S9-67S 

Cellular localization: plasma membrane 



I I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPG"" " ~ 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRD 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLL 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 
DEDDAIYTYH GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 
TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT V 
3 GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV T 

njEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISY 
P DSGQVTAVGT LDREDEQFVR NNIYEVMVIiA MDNGSPt 
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ. AQLTDDSDIY KTAEVHEEGD 
L KQDTYDVHLS LSDKGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 
jLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 
7APTTIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 
A SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 



CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGT CACTAT 
GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 
GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 
TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 
TACCTGTTGG CTAATT TAG C TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 
ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTi 



AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 
CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 
TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 
TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGC-T TC-TGGTGTAC 
CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 
TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC T 
GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



endothelial differentiation, lysophosphatidic acid G-protein-coupled r 



M DFFYNRSMTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL V 
FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL OJISACSSLA 



t and stop codons) 



I I I I I 

ATGACCTACA GTTACTCATT TTTCAGGCCT GAGTTGATCG TTAAT CATCT T 

CATTCTGAAG CCAACAGGAG AACCAAGACC AAAACTTTAT TGTCTCTGCT T 

GATGAAACCT CTGGACTAAG CACACATCTT CCTTGTTTAT CTCTCTCAAA G 

GTGCTTCATC TGGACATCCA CGGGAAGAAG GAAGACATGA GAATCACCCA ACAGTCTTCC 

ATACCCAGAG GGAACAAACG CTCCCCAAAA AGAGTTACAG AAACCATCCT GAGAGATTTT 

AAGCAGAAGC AAAGTTCAAA GATCCAAGAG GAGAGACGAA GAGAGTCTGC AGGACCAAAC 

CTCTCTTCAT TCTGGTTTGT GGGGAATGCI GGAAGAGGAG ACAGGCCCCA GATTTGGGCA 

GGAAGTAAAC AGTTTTCAGG CTGAGGCCAA TCTGAGCAGG AACATTCCAA TATTTCTTCA 

GCTACGTTGT CCCAGCACTT CACTGGTTAA CCTTTTATGT CCACCATTTG TGGATTTCAC 

AGCTACTTGT CAATGGTGAA TATTGATCAT CATCATTATC TACTGAGCTG CTACCATATC 

CCAGCTACTC CTTGCATGTT GTTCATTATT TTCTCAACAC TCAGCATATT TGCAATATGT 

TATGTAATAT CACAGACAAG GAAACTGAAC GCAGAAATGT TTTATTTCTT GCCAAACATC 
ACATGAGGAT GAACAATGAA AC 



Probeset A 

Signal sequence: 
Transmembrane domains 
Cellular Localization 



ELIVNHLNYV HSEANRRTKT KTLLSLLSFL DETSGLSTHL pi 
EDMRITQQSS QLYLWDMGGF TIFKNLWMSL IERGNKRSPK RVTETILRDF 
LSSFWFVGNA GRGDRPQIWA GSKQFSG 



TCACACGTGC CAAGGGGCTG GCTCAGCGGA ACCAC-CCTC-C ACGCGCTGGC TCCGGGTGAC 
AGCCGCGCGC CTCGGCCAGG ATCTGAGTGA TGAGACGTC-T CCCCACTGAG GTGCCCCACA 
GCAGCAGGTG TTGAGCATGG GCTGAGAAGC TGGACCGGCA CCAAAGGGCT GGCAGAAATG 
GGCGCCTGGC TGATTCCTAG GCAGTTGGCG GCAGCAAGC-A GGAGAGGCCG CAGCTTCTGG 
AGCAGAGCCG AGACGAAGCA GTTCTGGAGT GCCTGAACGG CCCCCTGAGC CCTACCCGCC 
TGGCCCACTA TGGTCCAGAG GCTGTGGGTG A

CTCTTGCTGG TCAACCTGCT AACCTTTGGC CTGGAGGTGT GTTTGGCCGC A 
TATGTGCCGC CTCTGCTGCT GGAAGTGGGG GTAGAGGAGA AGTTCATGAC C 

GGCATTGGTC CAGTGCTGGG CCTGGTCTGT GTCCCGCTCC TAGGCTCAGC CAGTGACCAC 540

TGGCGTGGAC GCTATGGCCG CCGCCGGCCC TTCATCTGGG CACTGTCCTT GGGCATCCTG 600

CTGAGCCTCT TTCTCATCCC AAGGGCCGGC TGGCTAGCAG GGCTGCTGTG CCCGGATCCC 660 

AGGCCCCTGG AGCTGGCACT GCTCATCCTG GGCGTGGGGC TGCTGGACTT CTGTGGCCAG 720 

GTGTGCTTCA CTCCACTGGA GGCCCTGCTC TCTGACCTCT TCCGGGACCC GGACCACTGT 780

CGCCAGGCCT ACTCTGTCTA TGCCTTCATG ATCAGTCTTG GGGGCTGCCT GGGCTACCTC 840

CTGCCTGCCA TTGACTGGGA CACCAGTGCC CTGGCCCCCT ACCTGGGCAC CCAGGAGGAG 900

TCCCTCTTTG GCCTGCTCAC CCTCATCTTC CTCACCTGCG TAGCAGCCAC ACTGCTGGTG 960 

GCTGAGGAGG CAGCGCTGGG CCCCACCGAG CCAGCAGAAG GGCTGTCGGC CCCCTCCTTG 1020 

TCGCCCCACT GCTGTCCATG CCGGGCCCGC TTGGCTTTCC GGAACCTGGG CGCCCTGCTT 1080 

CCCCGGCTGC ACCAGCTGTG CTGCCGCATG CCCCGCACCC TGCGCCGGCT CTTCGTGGCT 1140 

GAGCTGTGCA GCTGGATGGC ACTCATGACC TTCACGCTGT TTTACACGGA TTTCGTGGGC 1200

GAGGGGCTGT ACCAGGGCGT GCCCAGAGCT GAGCCGGGCA CCGAGGCCCG GAGACACTAT 1260 

GATGAAGGCG TTCGGATGGG CAGCCTGGGG CTGTTCCTGC AGTGCGCCAT CTCCCTGGTC 1320

TTCTCTCTGG TCATGGACCG GCTGGTGCAG CGATTCGGCA CTCGAGCAGT CTATTTGGCC 1380

AGTGTGGCAG CTTTCCCTGT GGCTGCCGGT GCCACATGCC TGTCCCACAG TGTGGCCGTG 1440

GTGACAGCTT CAGCCGCCCT CACCGGGTTC ACCTTCTCAG CCCTGCAGAT CCTGCCCTAC 1500

ACACTGGCCT CCCTCTACCA CCGGGAGAAG CAGGTGTTCC TGCCCAAATA CCGAGGGGAC 1560 

ACTGGAGGTG CTAGCAGTGA GGACAGCCTG ATGACCAGCT TCCTGCCAGG CCCTAAGCCT 1620

GGAGCTCCCT TCCCTAATGG ACACGTGGGT GCTGGAGGCA GTGGCCTGCT CCCACCTCCA 1680 

CCCGCGCTCT GCGGGGCCTC TGCCTGTGAT GTCTCCGTAC GTGTGGTGGT GGGTGAGCCC 1740 

ACCGAGGCCA GGGTGGTTCC GGGCCGGGGC ATCTGCCTGG ACCTCGCCAT CCTGGATAGT 1800

GCCTTCCTGC TGTCCCAGGT GGCCCCATCC CTGTTTATGG GCTCCATTGT CCAGCTCAGC 1860 

CAGTCTGTCA CTGCCTATAT GGTGTCTGCC GCAGGCCTGG GTCTGGTCGC CATTTACTTT 1920 

GCTACACAGG TAGTATTTGA CAAGAGCGAC TTGGCCAAAT ACTCAGCG TA G AAAACTTCC 1980 



ATGGGGCTGC CGGGCTGGCC GCCAGTTTCT GTTGCTGCCA AAGTAATGTG GCTCTCTGCT 2100 

GCCACCCTGT GCTGCTGAGG TGCGTAGCTG CACAGCTGGG GGCTGGGGCG TCCCTCTCCT 2160 

CTCTCCCCAG TCTCTAGGGC TGCCTGACTG GAGGCCTTCC AAGGGGGTTT CAGTCTGGAC 2220 

TTATACAGGG AGGCCAGAAG GGCTCCATGC ACTGGAATGC GGGGACTCTG CAGGTGGATT 2280 
ACCCAGGCTC Al 
GGGAGCTGAA T. 

CGTTTAATGT AGCTCTTGCA TGGGAGTTTC TAGGATGAAA CACTCCTCCA T 

ACATATGAAA GTTATTTGTA GGGGAAGAGT CCTGAGGGGC AACACACAAG AACCAGGTCC 2520

CCTCAGCCCC ACAGGCACTG GTCTTTTTTG CTNGANTCCA CCCCCCCCCT CTTTACCCTT 2580 



Hs. 179809 
XE_050197 



I 



I 



I 



MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 
PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL FLIPRAGWLA GLLCPDPRPL 
ELALLILGVG LLDFCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 
IDWDTSALAP YLGTQEECLF GLLTLIFLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 
CCPCRARLAF RNLGALLPRL HQLCCRMPRT LRRLFVAELC SWKALMTFTL FYTDFVGEGL 
YQGVPRAEPG TEARRHYDEG VRMGSLGLFL QCAISLVPSL VMDRLVQRFG TRAVYLASVA 
AFPVAAGATC LSHSVAWTA 3AALTGFTF3 ALQILPYTLA SLYHREKQVF LPKYRGDTGG 
ASSEDSLMTS FLPGPKPGAP FPNGHVGAGG SGLLPPPPAL CGASACDVSV RVWGEPTEA 
RWPGRGICL DLAILDSAFL LSQVAPSLFM GSIVQLSOSV TAYMVSAAGL GLVAIYFATQ 



Unigene number: i 
Probeset Accession #: I 
Nucleic Acid Accession # 
Coding sequence: : 
CCTTACACGG CCTCTGATTT CATAGAAGGG ATCTACCGAA CAGAAAGGGA CAAAGGGACA 660

TTGTATGAGC TCACCTTCAA AGGGGACCAC AAACACGAAT TCAAACGGCT CATCTTATTT 720

CGACCATTCG GCCCCATCAT GAAAGTGAAA AATGAAAAGC TCAACATGGC CAACACGCTT 780 

ATCAATGTTA TCGTGCCTCT AGCAAAAAGG GTGGACAAGT TCCGGCAGTT CATGCAGAAT 840 

TTCAGGGAGA TGTGCATTGA GCAGGATGGG AGAGTCCATC TCACTGTTGT TTACTTTGGG 900

AAAGAAGAAA TAAATGAAGT CAAAGGAATA GTTGAAAACA CTTCCAAAGC TGCCAACTTC 960

AGGAACTTTA CCTTCATCCA GCTGAATGGA GAATTTTCTC GGGGAAAGGG ACTTGATGTT 1020 

GGAGCCCGCT TCTGGAAGGG AAGCAACGTC CTTCTCTTTT TCTGTGATGT GGACATCTAC 1080 

TTCACATCTG AATTCCTCAA TACGTGTAGG CTGAATACAC AGCCAGGGAA GAAGGTATTT 1140 

TATCCAGTTC TTTTCAGTCA GTACAATCCT GGCATAATAT ACGGCCACCA TGATGCAGTC 1200

CCTCCCTTGG AACAGCAGCT GGTCATAAAG AAGGAAACTG GATTTTGGAG AGACTTTGGA 1260

TTTGGGATGA CGTGTCAGTA TCGGTCAGAC TTCATCAATA TAGGTGGGTT TGATCTGGAC 1320 

ATCAAAGGCT GGGGCGGAGA GGATGTGCAC CTTTATCGCA AGTATCTCCA CAGCAACCTC 1380 

C GGACGCCTGT GCGAGGACTC TTCCACCTCT GGCATGAGAA GCGCTGCATG 1440



Signal sequence 



I I 

MVRRGLLAWI SRWVLLVLL C 

EWEEQHRNYV SSLKRQIAQL KEELQERSEQ LENGQYQASD AAGLGLDRSP PEKTQADLLA 

FLHSQVDKAE VNAGVKLATE YAAVPFDSPT LQKWQLETG LTRHPEEKPV RKDKRDELVE 

AIE5ALETLN NPAENSPNHR PYTASDFIEG IYRTERDKGT LYELTFKGDH KHEFKRLILF 

[VPLAKR VDKFRQFMQN FREMCIEQDG R 

F RNFTFIOLNG E 

FTSEFLNTCR LNTQPGKKVF YPVLFSQYWP GIIYGHHDAV PPLEQQLVIK KETGFWRDFG 



DELTPEQYKM CMQSKAMHEA SHGQLGMLVF 



QKQKTSSKKT

Gene name: ATPase, Ca++ transporting, type 2C, member 1
Probeset Accession #: K51919
Nucleic Acid Accession #: AF189723
Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons)



I I I I I I 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960

ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080 

GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140 
TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA 1' 
AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT T 
ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 



ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680 

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740

GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800

TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTCGCTACA GAAGAACGGT 1860

TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920

ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGG3ATTTAT 2040 






AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100

ATCTCATTGG CTACATTAAT GAACTTTCCT AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160 

ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 2220 

GATGTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340 

GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATGGCAGA CCAAGTCTGT GTTTGAGATT 2460 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520

CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2580

GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640

AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 2700
CTTGAAGTA TGA

A188 Protein sequence:

Gene name: ATPase, Ca++ transporting, type 2C, member 1

Unigene number: Hs.106778

Probeset Accession #: NS1919
Protein Accession #: AAP27813
Signal sequence: none found

Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878

Pfam domains: Cation ATPase N [6-83], E1-E2 ATPase [89-324], Hydrolase [328-649],

[744-889] 

Cellular Localization: not determined 

1 11 21 31 41 51 

I I I I I I 

MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA F" 

SQFKNPLIML LLASAVISVL MHQPDDAVSI TVAILIWTV A 
PECHCVREGK LEHTIiARDLV PGDTVCLSVG DRVPADLRLF E 

KVTAPQPAAT NGDLASRSNI AFMGTLVRCG KAKGWIGTG ENSEPGEVPK MMQAEEAPKT 
PLQKSMDLLG KQLSPYSPGI IGIIMLVGWL LGKDILEMFT I 
TLALGVMRMV KKRAIVKKLP IVETLGCCNV ICSDKTGTLT K 
GVGYNQFGEV IVDGDWHGF YNPAVSRIVE AGCVCNDAVI RNNTLMGKPT E 
MGLDGLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK 
CQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIID PPRTGVKEAV 
TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 
FYRASPRHKM KIIKSLQKNG SWAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 
LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMNFP NPLNAMQILW 
INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKKLI LK1LVSSIII VCGTLFVFWE 
ELRDNVITPR DTTMTFTCFV FFDMFNALSS RSQTKSVFSI GLCSNRMFCY AVLGSIMGQL 
LVIYFPPLQK VFQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSSSF 



Probeset Accession #:
Nucleic Acid Accessio 

Coding sequence: 1-1284 (underlined sequences correspond to start and stop codons) 



I I I 

A GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 60 

TTCCTTT GGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 120

TTTTCCCTTG TTTTATTGAT AAAAGGAGGG GCCCTCTCTG GAACAGATAC CTACCAGTCT 180

TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAGTTTTTG 240

TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 300 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT 420

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATGCCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT TTATTTGCCA CCATAACTCC 600 

TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT 660 

ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA 720

TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780

ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 840 

GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 

ATTGTTGTAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 960

CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 

CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATG 1080 

TCTTGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 1140 

ACAAATACTC AAGACTGCAC CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200

TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1260
ATTAGTATCT TTCAACTCGA G TAA 



Probeset Accession #: N62096

Protein Accession #: none found

Signal sequence: none found

Transmembrane domains: 28-50, 66-88
I I I I I I

MGYQRQEPVI PPQRGLPYSM KQAGFPLGIL LLFWVSYVTD FSLVLLIKGG ALSGTDTYQS
LVNKTFGFPG YLLLSVLQFL YPFIAMISYN IIAGDTLSKV FQRIPGVDPE NVFIGRHFII
GLSTVTFTLP LSLYRNIAKL GKVSLISTGL TTLILGIVMA RAISLGPHIP KTEDAWVFAK
PNAIQAVGVM SFAFICHHNS FLVYSSLEEP TVAKWSRLIH MSIVISVFIC IFFATCGYLT
FTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH
IVVTVMVITV ATLVSLLIDC LGIVLELNGV LCATPLIFII PSACYLKLSE EPRTHSDKIM
SCVMLPIGAV VMVFGFVMAI TNTQDCTHGQ EMFYCFPDNF SLTNTSESHV QQTTQLSTLN 
ISIFQLE 

A SEQUENCE 



Gene name: 

Unigene number: Hs. 293185 

Probeset Accession #: N62096
Nucleic Acid Accession #: N62096

Coding sequence: 1-1203 (underlined sequences correspond to start and stop codons)



ATGGGCTACC AGAGGCAGGA GCCTGTCATC C 
AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 



T TGGTCGCCAC TTCATTATTG 
ACAGGTTTAA CAAGTCTGAT 

CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 
GGGGTTATGT CTTTTGCATT T 
GAAGAACCCA CAGTAGCTAA G 

TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT T 
GACTTATTTG AAAATTACTG CAGAAATGAT G 

GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA G 
AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 
ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900 
AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 
CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 
GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080 
CATGGGCAGG AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 
TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT T 
TAA 

A192 Protein sequence:
Gene name: ESTs
Unigene number: Hs. 293185

Probeset Accession #: N62096
Protein Accession #: none found

Signal sequence: 1-26

Transmembrane domains: 45-61, 92-108, 117-132, 191-2 

Cellular Localization: 



I I I I I I 

MGYQRQEPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLLSVL QFLYPFIAMI 
SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 
TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAFICH HNSFLVYSSL 
EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY
GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIVVTVMV ITVATLVSLL IDCLGIVLEL
NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAVVMVFGFV MAITNTQDCT
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS T 



A SEQUENCE 



s correspond to start and stop codons) 



ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 



TACAATATAA T 

CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 
CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 
GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 
ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA A 
GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480

GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 



N62096 



I 



I 



I 



MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD
F IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH
NAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKWSRL IHMSIVISVF
ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN
VFFGGNLSSV FHIVVTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKL
SEEPRTHSDK IMSCVMLPIG AVVMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES
HVQQTTQLST LNISIFQLE 



A195 DNA SEQUENCE:



Probeset Accession #:
Nucleic Acid Accession #:
Coding sequence: 



N62096 

1-1389 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TGACAGAGAA 
ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 
GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 
GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACAGA CTTTTCCCTT 
GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 
AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 
ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 
ATCCCAGGAG TTGATCCTGA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 
ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 
TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTG 
TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG T 
ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 
TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 
GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 
TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 
AGATTTTGTT ATGGTGTCAC TGTCATTTTG ACATACCCTA TGGAATGCTT TGTGACAAGA 
GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 
ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 
GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC



A TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT T 
CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 

AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380



Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



MGYQRQEPVI PPQRDLDDRE TLVSEHEYKE KTCQSAALFN VVNSIIGSGI IGLPYSMKQA 60
GFPLGILLLF WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFPGYLL LSVLQFLYPF

IAMISYNIIA GDTLSKVFQR IPGVDPENVF IGRHFIIGLS TVTFTLPLSL YRNIAKLGKV 

SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA FICHHNSFLV 

YSSLEEPTVA KWSRLIHMSI VISVFICIFF ATCGYLTFTG FTQGDLFENY CRNDDLVTFG 

RFCYGVTVIL TYPMECFVTR EVIANVFFGG NLSSVFHIVV TVMVITVATL VSLLIDCLGI



A197 DNA SEQUENCE

Gene name: ESTs

Unigene number: Hs. 31608

Probeset Accession #: H18836

Nucleic Acid Accession #: NM_017636

Coding sequence: 1-3501 (underlined sequences correspond to start and stop codons)



ACAGCGATGC ACACACCACG 
GGGCCGGCCG CAAGCACAGC 
TTTATAGTCT GGTCACACGC 
TGGGGGGATC 



TGGGGGTCTG CACACGGGCA T 
GGGGGCACCA A 

ACCCCAAGGG CTCGTTCCCT 



T TCTTCCTGGT GGACGACGGC A 
TTCCGCTTGC G 
30 ATTGACATCC C 

GAGAACGCCA CCCAGGCTCA GCTCCCATGT CTCCTCGTGG CTGGCTCAGG G 
GACTGCCTGG CGGAGACCCT GGAAGACACT CTGGCCCCAG GGAGTGGGGG AGCCAGGCAA 
GGCGAAGCCC GAGATCGAAT CAGGCGTTTC T 

GCCCAGGTGG AGAGGATTAT GACCCGGAAG GAGCTCCTGA CAGTCTATTC T 

GGGTCTGAGG AATTCGAGAC CATAGTTTTG AAGGCCCTTG TGAAGGCCTG TGGGAGCTCG 1020

GAGGCCTCAG CCTACCTGGA TGAGCTGCGT TTGGCTGTGG CTTGGAACCG CGTGGACATT 1080 

GCCCAGAGTG AACTCTTTCG GGGGGACATC CAATGGCGGT CCTTCCATCT CGAAGCTTCC 1140 

CTCATCCACC CCCTCCTGAA TGACCGGCCT GAGTTCGTGC GCTTGCTCAT TTCCCACGGC 1200 

CTCAGCCTGG GCCACTTCCT GACCCCGATG CGCCTGGCCC AACTCTACAG CGCGGCGCCC 1260

TCCAACTCGC TCATCCGCAA CCTTTTGGAC CAGGCGTCCC ACAGCGCAGG CACCAAAGCC 1320

CCAGCCCTAA AAGGGGGAGC TGCGGAGCTC CGGCCCCCTG ACGTGGGGCA TGTGCTGAGG 1380 

ATGCTGCTGG GGAAGATGTG CGCGCCGAGG TACCCCTCCG GGGGCGCCTG GGACCCTCAC 1440

CCAGGCCAGG GCTTCGGGGA GAGCATGTAT CTGCTCTCGG ACAAGGCCAC CTCGCCGCTC 1500 

TCGCTGGATG CTGGCCTCGG GCAGGCCCCC TGGAGCGACC TGCTTCTTTG GGCACTGTTG 1560

CTGAACAGGG CACAGATGGC CATGTACTTC TGGGAGATGG GTTCCAATGC AGTTTCCTCA 1620

GCTCTTGGGG CCTGTTTGCT GCTCCGGGTG ATGGCACGCC TGGAGCCTGA CGCTGAGGAG 1680

GCAGCACGGA GGAAAGACCT GGCGTTCAAG TTTGAGGGGA TGGGCGTTGA CCTCTTTGGC 1740 

GAGTGCTATC GCAGCAGTGA GGTGAGGGCT GCCCGCCTCC TCCTCCGTCG CTGCCCGCTC 1800 

TGGGGGGATG CCACTTGCCT CCAGCTGGCC ATGCAAGCTG ACGCCCGTGC CTTCTTTGCC 1860

CAGGATGGGG TACAGTCTCT GCTGACACAG AAGTGGTGGG GAGATATGGC CAGCACTACA 1920

CCCATCTGGG CCCTGGTTCT CGCCTTCTTT TGCCCTCCAC TCATCTACAC CCGCCTCATC 1980 

ACCTTCAGGA AATCAGAAGA GGAGCCCACA CGGGAGGAGC TAGAGTTTGA CATGGATAGT 2040 

GTCATTAATG GGGAAGGGCC TGTCGGGACG GCGGACCCAG CCGAGAAGAC GCCGCTGGGG 2100

GTCCCGCGCC AGTCGGGCCG TCCGGGTTGC TGCGGGGGCC GCTGCGGGGG GCGCCGGTGC 2160

CTACGCCGCT GGTTCCACTT CTGGGGCGCG CCGGTGACCA TCTTCATGGG CAACGTGGTC 2220

AGCTACCTGC TGTTCCTGCT GCTTTTCTCG CGGGTGCTGC TCGTGGATTT CCAGCCGGCG 2280

CCGCCCGGCT CCCTGGAGCT GCTGCTCTAT TTCTGGGCTT TCACGCTGCT GTGCGAGGAA 2340

CTGCGCCAGG GCCTGAGCGG AGGCGGGGGC AGCCTCGCCA GCGGGGGCCC CGGGCCTGGC 2400

CATGCCTCAC TGAGCCAGCG CCTGCGCCTC TACCTCGCCG ACAGCTGGAA CCAGTGCGAC 2460

CTAGTGGCTC TCACCTGCTT CCTCCTGGGC GTGGGCTGCC GGCTGACCCC GGGTTTGTAC 2520

C GCACTGTCCT CTGCATCGAC TTCATGGTTT TCACGGTGCG GCTGCTTCAC 2580 
A GATGATGAAG 

GAGGGGCTCC TGAGGCCACG GGACAGTGAC TTCCCAAGTA TCCTGCGCCG C 
3 ACATGGACGT G 

3 GGCCCAGGCG 

GGCACCTGCG TCTCCCAGTA TGCCAACTGG CTGGTGGTGC TGCTCCTCGT CATCTTCCTG 
G CTCATTGCCA TGTTCAGTTA CACATTCGGC 



TTCCACTCTC GGCCCGCGCT GGCCCCGCCC TTTATCGTCA TCTCCCACTT G 
CTCAGGCAAT TGTGCAGGCG ACCCCGGAGC CCCCAGCCGT CCTCCCCGGC CCTCGAGCAT . 
TTCCGGGTTT ACCTTTCTAA GGAAG CCGAG CGGAAGCTGC TAACGTGGGA ATCGGTGCAT 
AAGGAGAACT TTCTGCTGGC ACGCGCTAGG GACAAGCGGG AGAGCGACTC CGAGCGTCTG 
AAGCGCACGT CCCAGAAGGT GGACTTGGCA CTGAAACAGC TGGGACACAT CCGCGAGTAC 
GAACAGCGCC TGAAAGTGCT GGAGCGGGAG GTCCAGCAGT GTAGCCGCGT CCTGGGGTGG 
GTGGCCGAGG CCCTGAGCCG C 
CTGCCTGGGT CCAAAGA CTG A 

A198 Protein sequence: 

Gene name: ESTs 

Unigene number: Hs. 31608 

Probeset Accession #: H18836 

Protein Accession #: none found
Signal sequence: none found

Transmembrane domains: 214-230, 537-556, 642-662, 730-752, 760-782, 8

Cellular Localization: plasma membrane

1 11 21 31 41 51 

I I I I I I 

MEDAFGAAW TVWDSDAHTT EKPTDAYGEL DFTGAGRKHS NFLRLSDRTD PAAVYSLVTR 60 

TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL HTGIGRHVGV 120

AVRDHQMAST GGTKWAMGV APWGWRNRD TLINPKGSFP ARYRWRGDPE DGVQFPLDYN 180 

YSAFFLVDDG THGCLGGEHR FRLRLESYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 240 

ENATQAQLPC LLVAGSGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 300

AQVERIMTRK ELLT T i E 3SSEF TI\L KALVKACGSS EASAYLDELR LAVAWNRVDI 360 

SNSLIRHLLD QASHSAGTKA PALKGGAAEL RPPDVGHVLR MLLGKMCAPR YPSGGAWDPH 480 

PGQGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLWALL LNRAQMAKYF WEMGSNAVSS 540 



WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWWGDMASTT P 
TFRKSEEEPT REELEFDMDS VINGEGPVGT ADPAEKTPLG VPRQSGRPGC CGGRCGGRRC 
5 RVLLVDFQPA P 
L YLADSWNQCD L 
i IFTVNKQLGP KIVIVSKMMK DVFFFLFFLG VWLVAYGVAT 
Y RPYLQIFGQI PQEDMDVALM EHSNCSSEPG FWAHPPGAQA 
GTCVSQYANW LVVLLLVIFL LVANILLWL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE
j LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTWESVH
R DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSRVLGW



Unigene number-. 
Probeset Accessio 
Nucleic Acid Acce 
Coding sequence: 1-894 (underlined sequences correspond to st 

1 11 21 31 41 51 

I I I I I I 

ATGGAGCCCC CGGCCCTCCT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 
CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GCGCCACAAG 
GC1 x G( GCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 

CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCCCGCTGG GGCGCCGGCT GCTCCCGGGC 
GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTCGCTCC TGGGGCTCGG CGGGCTGGAC
GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 
CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 
GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC
AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 
CTCGGCATGG CCGTAGCCGT CCTTCTCTGC GGCTGCATTG TCGCCACAGT CAGTTTCTTC 
TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT 
TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 
AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC 
GCCTGGTGCA GTTTAGGCTT TATTGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 

A200 Protein



Probeset Acce 
Signal sequen 



11 21 31 41 51 

I I I I I 

.SLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM
PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF
LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDEWHL LHLRRITAGF
17ATVSFF WEESLTQHVA GLLFLMTGIF CTISLCTYAA SISYDLNRLP 
V EHGYSWSIFC AWCSLGFIVA AGGLCIAYPF ISRTKIAQLK SGRDSTV 
TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG 180

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420

CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480

GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540

GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 

CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 

GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA T TGA 



CDA14 

Unigene number: Hs.26813

Probeset Accession #: N32912
Protein Accession #: NP_0S76S4 



F SSKLRINIDI TVAMKCQYVG
KEWQRMLQLI QSRLQEEHSL QDVIFKSAFK STSTALPPRE
VAGNFHITVG KAIPHPRGHA 



IDHNQMFQYF 
MVTVTEEHMP FWQFFVRLCG
" P LLENNTH 



I HAAGSHGVSG 



dipeptidylpeptidase IV (CD26, adenosine deaminase complexing protein 2)
Hs.44926
S79876

76-2301 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I 

CGCGCGTCTC CGCCGCCCGC GTGACTTCTG CCTGCGCTCC TTCTCTGAAC GCTCACTTCC 
GAGGAGACGC CGACGATGAA GACACCGTGG AAGATTCTTC TGGGACTGCT GGGTGCTGCT 
GCGCTTGTCA CCATCATCAC CGTGCCCGTG GTTCTGCTGA ACAAAGGCAC AGATGATGCT 
ACAGCTGACA GTCGCAAAAC TTACACTCTA ACTGATTACT TAAAAAATAC TTATAGACTG 
AAGTTATACT CCTTAAGATG GATTTCAGAT CATGAATATC TCTACAAACA AGAAAATAAT 
ATCTTGGTAT TCAATGCTGA ATATGGAAAC AGCTCAGTTT TCTTGGAGAA CAGTACATTT 
GATGAGTTTG GACATTCTAT CAATGATTAT TCAATATCTC CTGATGGGCA GTTTATTCTC 
TTAGAATACA ACTACGTGAA GCAATGGAGG CATTCCTACA CAGCTTCATA TGACATTTAT 
GATTTAAATA AAAGGCAGCT GATTACAGAA GAGAGGATTC CAAACAACAC ACAGTGGGTC 
ACATGGTCAC CAGTGGGTCA TAAATTGGCA TATGTTTGGA ACAATGACAT TTATGTTAAA 
ATTGAACCAA ATTTACCAAG TTACAGAATC ACATGGACGG GGAAAGAAGA TATAATATAT 
AATGGAATAA CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTGTGG 
TGGTCTCCAA ACGGCACTTT TTTAGCATAT GCCCAATTTA ACGACACAGA AGTCCCACTT 
ATTGAATACT CCTTCTACTC TGATGAGTCA CTGCAGTACC CAAAGACTGT ACGGGTTCCA 
TATCCAAAGG CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT 
CTCAGCTCAG TCACCAATGC AACTTCCATA CAAATCACTG CTCCTGCTTC TATGTTGATA 
GGGGATCACT ACTTGTGTGA TGTGACATGG GCAACACAAG AAAGAATTTC TTTGCAGTGG 
CTCAGGAGGA TTCAGAACTA TTCGGTCATG G 
AGATGGAACT GCTTAGTGGC ACGGCAACAC A 
AGATTTAGGC CTTCAGAACC TCATTH 

AGCAATGAAG AAGGTTACAG ACACATTTGC TATTTCCAAA TAGATAAAAA AGACTGCACA 
TTTATTACAA AAGGCACCTG GGAAGTCATC GGGATAGAAG CTCTAACCAG TGATTATCTA 1320 
TACTACATTA GTAATGAATA TAAAGGAATG CCAGGAGGAA GGAATCTTTA TAAAATCCAA 1380 
CTTATTGACT ATACAAAAGT GACATGCCTC AGTTGTGAGC TGAATCCGGA AAGGTGTCAG 1440 
TACTATTCTG TGTCATTCAG TAAAGAGGCG AAGTATTATC AGCTGAGATG TTCCGGTCCT 1500
GGTCTGCCCC TCTATACTCT ACACAGCAGC GTGAATGATA AAGGGCTGAG AGTCCTGGAA 1560 
GACAATTCAG CTTTGGATAA AATGCTGCAG AATGTCCAGA TGCCCTCCAA AAAACTGGAC 1620 
~~" T TGAATGAAAC AAAATTTTGG TATCAGATGA TCTTGCCTCC TCATTTTGAT 1680 

A AATATCCTCT ACTATTAGAT GTGTATGCAG GCCCATGTAG TCAAAAAGCA 1740 
GACACTGTCT TCAGACTGAA CTGGGCCACT TACCTTGCAA GCACAGAAAA CATTATAGTA 1800 
GCTAGCTTTG ATGGCAGAGG AAGTGGTTAC CAAGGAGATA AGATCATGCA TGCAATCAAC 1860 
AGAAGACTGG GAACATTTGA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA 1920
ATGGGATTTG TGGACAACAA ACGAATTGCA ATTTGGGGCT O 
ACCTCAATGG TCCTGGGATC GGGAAGTGGC GTGTTCAAGT G' 
TCAGCTCAGA TCTCCAAAGC CCTGGTCGAT GTTGGAGTGG ATTTCCAGGC AATGTGGTAT 2280
ACTGATGAAG ACCATGGAA T AG CTAGCAGC ACAGCACACC AACATATATA T. 
AGCCACTTCA TAAAACAATG TTTCTCTTTA CCTTAGCACC TCAAAATACC A 
AAGCTTATTA AAACTCATTT TTGTTTTCAT TATCTCAAAA CTGCACTGTC A 

TGATCTTTAA AATACACACT CAAATCAAGA AACTTAAGGT TACCTTTGTT CCCAAATTTC 2520

ATACCTATCA TCTTAAGTAG GGACTTCTGT CTTCACAACA GATTATTACC TTACAGAAGT 2580 

TTGAATTATC CGGTCGGGTT TTATTGTTTA AAATCATTTC TGCATCAGCT GCTGAAACAA 2640 

CAAATAGGAA TTGTTTTTAT GGAGGCTTTG CATAGATTCC CTGAGCAGGA TTTTAATCTT 2700 

TTTCTAACTG GACTGGTTCA AATGTTGTTC TCTTCTTTAA AGGGATGGCA AGATGTGGGC 2760

AGTGATGTCA CTAGGGCAGG GACAGGATAA GAGGGATTAG GGAGAGAAGA TAGCAGGGCA 2820 

TGGCTGGGAA CCCAAGTCCA AGCATACCAA CACGAGCAGG CTACTGTCAG CTCCCCTCGG 2880 

AGAAGAGCTG TTCACCACGA GACTGGCACA GTTTTCTGAG AAAGACTATT CAAACAGTCT 2940 

CAGGAAATCA AATATCGAAA GCACTGACTT CTAAGTAAAC CACAGCAGTT GAAAGACTCC 3000 

AAAGAAATGT AAGGGAAACT GCCAGCAACG CAGCCCCCAG GTGCCAGTTA TGGCTATAGG 3060

4 ACACAGCAAG GGTGATGGGA AAGCATIGTA AATGTGCTTT TAAAAAAAAA 3120 

T CCTAGTGAAA GAGGCAGCTT GAAACTGAGA TGTGAACACA TCAGCTTGCC 3180 

3 ATGAAAATAT TTGTATCACA AATCTTAACT TGAAGGAGTC CTTGCATCAA 3240 

r TTCATTTCTT TGAGTGTCTT AATTAAAAGA ATATTTTAAC TTCCTTGGAC 3300 

A TGTTATGTAT TATTATTCCC ATTCTACATA 3360 
CTATGGAATT TCTCCCAGTC ATTTAATAAA TGTGCCTTCA TTTTTTC 



dipeptidylpeptidase IV (CD26, adenosine deaminase complexing protein 2)
Hs.44926
S79876 
NP_001926.1 



Signal sequence 

Transmembrane domains: 6-28

DPPIV_N_term domain: 43-557 

Peptidase_S9 domain: 558-635 

Cellular Localization: plasma membrane 



MKTPWKILLG LLGAAALVTI ITVPVVLLNK GTDDATADSR KTYTLTDYLK NTYRLKLYSL 60

RWISDHEYLY KQENNILVFN AEYGNSSVFL ENSTFDEFGH SINDYSISPD GQFILLEYNY 120

VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAYVWNN DIYVKIEPNL 180

PSYRITWTGK EDIIYNGITD WVYEEEVFSA YSALWWSPNG TFLAYAQFND TEVPLIEYSF 240

YSDESLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT NATSIQITAP ASMLIGDHYL 300

CDVTWATQER ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGRFRPS 360

EPHFTLDGNS FYKIISNEEG YRHICYFQID KKDCTFITKG TWEVIGIEAL TSDYLYYISN 420

EYKGMPGGRN LYKIQLIDYT KVTCLSCELN PERCQYYSVS FSKEAKYYQL RCSGPGLPLY 480

TLHSSVNDKG LRVLEDNSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY 540

PLLLDVYAGP CSQKADTVFR LNWATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT 600 

- 3 IAVAPVSRWE 660 

X VHFQQSAQIS 720 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP 

A205 DNA SEQUENCE

Gene name: predicted exon

Unigene number: none found

Probeset Accession #: none found

Nucleic Acid A 



ATGGCTCTGG CGAAGGTGAG GGAGCCAAAC GCAAATGACA ATGCCATCAG AGTTGACAAC
AGAAGTGTGA TTAAAGTGCG TGCTAACCAG TGTTCCCTGC ATGAGGCAGA AAGTGAATCC
AGAAACCCTC AGGAGCTCTG GATGGGCCTG CTCCTCTTGA TGGGGGTCCT AGAAGCATGT
GTGGAAATGA GGCCTCTGTC AGTCTGGTCC CTGAGAGATG ACAAGGAGCA GAGCCCCCAC
CAGCCCACAC TGGATGTC TAA
Gene name:
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: none found 

Coding sequence: 1-462 (underlined sequences correspond to start and stop codons) 






3 CTGAGTTAGA AGCAAAGAGC CTTGGAAGCA GTAAATGTTT AAAAACTGCT 

CTCATACTTG CTGTATGTTG TGGATCAGCA AATATAGTCA GCCCTCTACT TGAGCAAAAT
ATTGATGTAT CTTCTCAAGA TCTGGACAGA CGGCCAGAGA GTATGCTGTT TCTAGTCATC
ATCATGTGGA CCAGTTTTGT GGAAGACAAT CTTTCCATGG GCTGGGGGAA GCTAGAAGAT
TTTATGGCTA TTGAAGAAGA AATGAAGAAG CACGGAAGTA CTCATGTGGG ATTCCCAGAA 
AACCTGACTA ATGGTGCCGC TGCTGGCAAT GGTGATGATG GATTAATTCC TCCAAGGAAG 



A208 Protein s 



SRTPESQQFP DTEMEEYHRF VKDQIWDMR R 
A209 DNA SEQUENCE



odz (odd Oz/ten-m, Drosophila) homolog 1 
NM_014253 

lences correspond to start and stop codons) 



C TTCCTCATCC TTTTTTTCAT GAAACTGAGC TTGCTTAATC 60 

AGAGATGGAG CAAACTGACT GCAAACCCTA CCAGCCTCTA CCAAAAGTCA AGCATGAAAT 120 

GGATCTAGCT TACACCAGTT CTTCTGATGA GAGTGAAGAT GGAAGAAAAC CAAGACAGTC 180 

ATACAACTCC AGGGAGACCC TGCACGAGTA TAACCAGGAG CTGAGGATGA ATTACAATAG 240 

CCAGAGTAGA AAGAGGAAAG AAGTAGAAAA ATCTACTCAA GAGATGGAAT TCTGTGAAAC 300 

CTCTCACACT CTGTGCTCTG GCTACCAAAC AGACATGCAC AGCGTTTCTC GGCATGGCTA 360 

CCAGCTAGAG ATGGGATCTG ATGTGGACAC AGAGACAGAA GGTGCTGCCT CACCTGACCA 420

TGCACTAAGA ATGTGGATAA GGGGAATGAA ATCAGAGCAT AGTTCCTGTT TGTCCAGCCG 480 

GGCCAACTCT GCATTATCCT TGACTGACAC TGACCATGAA AGGAAGTCTG ATGGGGAAAA 540 

TGGTTTCAAA TTCTCTCCTG TTTGTTGTGA CATGGAGGCT CAAGCTGGGT CTACTCAAGA 600

TGTGCAGAGC AGCCCACACA ACCAGTTCAC CTTCAGACCC CTCCCACCGC CACCTCCGCC 660 

TCCTCATGCC TGCACCTGTG CCAGGAAGCC ACCCCCTGCA GCGGACTCTC TTCAGAGGAG 720 

ATCAATGACT ACCCGCAGCC AGCCCAGCCC AGCTGCTCCA GCTCCCCCAA CCAGCACGCA 780

GGATTCAGTC CATCTGCATA ACAGCTGGGT CCTGAACAGC AACATACCAT TGGAGACCAG 840 

GCATTCCCTG TTCAAACATG GATCTGGTTC CTCTGCGATC TTCAGTGCAG CCAGTCAGAA 900 

CTACCCTCTG ACATCCAATA CCGTGTACTC GCCCCCTCCC AGGCCTCTTC CTCGAAGCAC 960 

CTTTTCCCGA CCTGCCTTTA CCTTTAACAA ACCTTACAGG TGCTGCAACT GGAAGTGCAC 1020 

AGCATTGAGC GCCACTGCAA TCACAGTGAC TTTGGCCTTG TTACTAGCCT ATGTGATTGC 1080 

AGTGCATTTG TTCGGCCTGA CTTGGCAGTT GCAACCAGTT GAAGGAGAGC TGTATGCAAA 1140 

TGGAGTTAGC AAAGGGAACA GGGGGACCGA GTCCATGGAC ACTACTTACT CTCCAATTGG 1200 

AGGAAAAGTT TCTGATAAAT CAGAGAAAAA AGTGTTTCAG AAGGGACGGG CGATAGACAC 1260 

TGGAGAAGTT GACATTGGTG CACAGGTCAT GCAGACCATT CCACCTGGTT TATTCTGGCG 1320 

T ACTATCCACC ATCCAATATA TCTGAAGTTC AATATTTCTT TAGCCAAGGA 1380 

1440 



TCCTGTATGT CAAGAGCAGT GCTCAGGACA CGGAACTTTT CTTCTGGACG C
CAGCTGTGAT CCCAAGTGGA CAGGATCTGA CTGCTCAACA GAGCTGTGTA CCATGGAGTG
TGGTAGCCAT GGAGTCTGCT CAAGAGGAAT TTGCCAGTGT GAACAAGGCT GGGTAGGACC
AACATGTGAG GAACGCTCCT GTCATTCTCA TTGTACTGAG CATGGCCAAT GCAAAGATGG
AAAATGTGAG TGTAGCCCTG GATGGGAGGG CGACCACTGC ACAATTGCTC ACTACTTAGA 2340

TGCTGTCCGA GATGGCTGCC CAGGGCTCTG CTTTGGAAAT GGACGATGTA CCCTGGATCA 2400

AAATGGTTGG CACTGTGTGT GTCAGGTGGG TTGGAGTGGG ACAGGCTGCA ATGTTGTCAT 2460 

GGAAATGCTT TGTGGAGATA ACTTGGACAA TGATGGAGAT GGTTTAACCG ACTGTGTGGA 2520 

TCCTGACTGT TGTCAACAAA GCAACTGTTA TATAAGTCCT CTCTGCCAGG GCTCACCAGA 2580

TCCTCTTGAC CTCATTCAGC AAAGCCAAAC TCTCTTCTCT CAGCACACTT CAAGACTTTT 2640 

TTATGATCGA ATCAAATTCC TCATTGGCAA GGACAGTACT CATGTCATTC CTCCTGAGGT 2700 

GTCATTTGAC AGCAGGCGTG CCTGTGTGAT TCGAGGCCAA CTGGTGGCCA TAGATGCAAC 2760

TCCTCTAGTG GGAGTGAATG TCAGTTTCTT GCACCACAGT GATTATGGGT TTACCATCAG 2820 

CCGGCAAGAT GGAAGCTTTG ACCTCGTGGC CATCGGTGGC ATCTCTGTCA TCTTAATCTT 2880

CGACCGATCC CCTTTCCTGC CTGAGAAGAG AACACTCTGG TTGCCTTGGA ATCAGTTTAT 2940

TGTGGTAGAG AAAGTCACCA TGCAGAGAGT TGTATCAGAC CCGCCATCCT GCGATATCTC 3000 

CAACTTTATC AGCCCAAACC CTATTGTGCT TCCTTCACCG CTCACATCAT TTGGAGGGTC 3060 

CTGTCCAGAG AGGGGAACTA TTGTTCCTGA GCTGCAGGTT GTACAGGAGG AAATTCCCAT 3120 

TCCCTCCAGC TTTGTGAGGC TGAGTTACCT GAGCAGCCGC ACCCCTGGGT ATAAAACCCT 3180

AGTAGCTGTG GAAGGGCGAC TCACACAGAA GTGGTTTCCC GCCGCAATTA ATCTTGTCTA 3300 
CACATTTGCT TGGAACAAGA CCGATATCTA TGGACAGAAG GTTTGGGGCC TGGCAGAGGC 3360 
TTTGGTATCT GTGGGATATG AATATGAAAC GTGCCCTGAC TTTATTCTCT GGGAGCAAAG 3420 
GACAGTCGTT TTACAAGGTT TTGAGATGGA TGCTTCTAAC CTAGGAGACT GGTCTTTGAA 3480
TAAGCATCAC ATTTTGAATC CTCAAAGTGG AATCATACAT AAAGGGAATG GAGAAAATAT 3540
GTTCATTTCC CAGCAGCCCC CAGTCATATC AACCATAATG GGTAATGGAC ACCAAAGGAG 3600
TGTAGCCTGC ACCAACTGCA ATGGCCCAGC CCACAACAAC AAACTCTTTG CTCCTGTCGC 3660 
CTTAGCTTCT GGCCCTGATG GCAGTGTGTA TGTTGGCGAC TTCAATTTTG TAAGGAGAAT 3720 
ATTTCCCTCG GGAAACTCCG TTAGTATTTT GGAATTAAGC ACAAGTCCTG CTCACAAATA 3780
CTATCTGGCT ATGGACCCTG TGTCTGAATC ACTCTATCTA TCAGACACCA ATACTCGCAA 3840
AGTCTACAAG TTGAAATCTC TTGTGGAGAC GAAAGATCTG TCCAAGAATT TTGAAGTGGT 3900
GGCAGGAACT GGTGATCAGT GCCTTCCCTT TGACCAGAGT CATTGTGGAG ATGGTGGGAG 3960 
AGCATCGGAA GCTTCACTGA ATAGCCCTCG AGGCATCACA GTTGATAGGC ATGGATTTAT 4020 
TTACTTTGTG GATGGGACTA TGATTCGCAA AATTGATGAG AATGCTGTGA TCACAACTGT 4080
AATCGGCTCA AATGGTCTGA CTTCCACACA ACCACTGAGC TGTGACTCAG GAATGGACAT 4140
CACTCAGGTG CGATTAGAGT GGCCAACAGA CCTTGCAGTA AATCCTATGG ACAATTCATT 4200 
GTATGTCTTG GATAACAACA TTGTGCTGCA AATTTCTGAG AACAGGCGTG TTCGGATCAT 4260 
CGCAGGACGC CCCATTCACT GCCAGGTGCC AGGCATCGAT CATTTCCTGG TCAGCAAGGT 4320 
AGCAATTCAC TCCACTCTAG AGTCAGCGAG GGCCATCAGT GTCTCCCAGA GCGGGCTGCT 4380
CTTCATAGCT GAAACAGACG AGAGGAAAGT AAACCGCATT CAGCAAGTAA CCACCAATGG 4440 
GGAGATCTAC ATCATCGCTG GTGCCCCCAC TGACTGTGAC TGCAAAATTG ATCCAAACTG 4500 
TGACTGTTTT TCAGGTGATG GTGGCTATGC CAAAGATGCA AAGATGAAAG CCCCTTCCTC 4560
CTTAGCAGTG TCGCCTGATG GAACCCTCTA TGTGGCAGAC CTCGGAAATG TTCGAATTCG 4620
TACCATCAGC AGGAACCAAG CCCACCTGAA TGACATGAAC ATTTATGAGA TTGCTTCACC 4680

CGCTGATCAG GAACTGTACC AGTTCACTGT AAATGGAACC CACCTACACA CCCTGAACTT 4740 
GATAACAAGG GACTATGTTT ATAACTTCAC CTACAATTCT GAAGGTGACT TGGGCGCGAT 4800
TACCAGCAGC AATGGCAATT CAGTGCACAT TCGCCGTGAT GCAGGCGGAA TGCCGCTATG 4860
GCTTGTGGTG CCTGGCGGAC AAGTATACTG GCTGACTATA AGCAGCAATG GAGTCCTGAA 4920
AAGAGTGTCA GCCCAAGGCT ATAATCCGGC CTTAATGACC TATCCAGGAA ACACAGGGCT 4980

TCTGGCTACC AAAAGTAACG AAAATGGATG GACAACCGTT TATGAGTATG ACCCCGAGGG 5040
ACACCTGACC AATGCAACGT TTCCCACTGG AGAGGTCAGC AGCTTCCACA GTGACCTGGA 5100 
GAAGCTGACA AAAGTGGAGC TAGATACTTC CAACCGTGAA AATGTCCTCA TGTCAACCAA 5160 
CTTGACGGCA ACTAGTACCA TATATATTTT AAAACAAGAA AATACTCAAA GTACCTATCG 5220 
GGTGAATCCA GATGGTTCCC TGCGTGTCAC TTTTGCCAGC GGGATGGAGA TCGGCCTCAG 5280
CTCAGAGCCC CACATCCTGG CAGGGGCAGT CAACCCTACC CTGGGCAAAT GCAACATCTC 5340
ATTGCCCGGA GAGCACAATG CAAACCTCAT CGAGTGGCGG CAGAGGAAGG AGCAAAACAA 5400
AGGCAATGTT TCGGCTTTTG AAAGGAGGCT GAGGGCCCAC AACAGAAACC TACTCTCCAT 5460 
AGATTTTGAT CATATAACCC GCACAGGAAA GATCTATGAT GACCATCGAA AATTCACCCT 5520 
TCGAATTCTT TATGACCAGA CTGGGCGACC CATTCTGTGG TCTCCTGTAA GCAGATATAA 5580
TGAAGTGAAC ATCACATATT CACCTTCGGG ATTGGTGACG TTTATTCAAA GAGGAACGTG 5640
GAATGAAAAA ATGGAATATG ACCAGAGTGG GAAAATTATT TCAAGAACTT GGGCTGATGG 5700
GAAAATTTGG AGCTATACCT ACTTAGAAAA ATCTGTGATG CTTCTCCTAC ACAGCCAGCG 5760
GCGTTACATC TTTGAGTATG ACCAATCAGA TTGCCTGCTG TCAGTTACCA TGCCTAGCAT 5820 
GGTGCGCCAC AGCTTACAAA CCATGCTTTC AGTGGGCTAC TACCGTAATA TCTACACCCC 5880
ACCGGACAGT AGCACTTCTT TTATCCAAGA CTATAGTCGA GATGGCCGAT TGCTACAGAC 5940 
CCTGCATCTG GGGACAGGGC GCAGAGTCTT ATACAAGTAC ACCAAGCAAG CAAGGCTTTC 6000 
TGAGGTTCTC TATGATACCA CTCAGGTCAC ATTAACATAT GAAGAGTCTT CTGGAGTGAT 6060 
TAAGACAATA CACCTGATGC ATGACGGATT CATCTGCACA ATCAGATACA GGCAAACAGG 6120 
ACCTCTTATT GGACGCCAGA TTTTCAGATT CAGTGAAGAA GGCCTTGTGA ATGCACGGTT 6180
CGACTACAGC TACAACAATT TCCGAGTCAC AAGCATGCAA GCTGTAATCA ATGAAACCCC 6240 
TTTGCCTATA GATCTTTACC GATATGTTGA TGTCTCTGGC AGAACAGAGC AGTTTGGAAA 6300 
ATTCAGTGTA ATTAATTACG ATTTAAATCA GGTCATAACT ACTACAGTGA TGAAACACAC 6360
CAAAATCTTC AGTGCCAATG GACAAGTCAT TGAAGTCCAA TATGAAATCC TAAAGGCAAT 6420 
TGCCTACTGG ATGACCATTC AATATGATAA TGTGGGCCGA CATGGTAATA TGTGCATAAG 6480
GGTAGGAGTA GATGCCAATA TAACAAGGTA CTTCTATGAA TACGATGCTG ATGGGCAACT 6540 
TCAGACTGTT TCTGTAAATG ACAAAACCCA GTGGCGTTAT AGTTACGATC TGAATGGAGA 6600 
CATCAACCTC TTAAGCCATG GGAAGAGTGC TCGTCTTACI CCTCTCCGAT ATGACCTCCG 6660 
AGACCGCATC ACCAGATTAG GAGAAATTCA GTATAAAATG GATGAAGATG GCTTTCTGAG 6720
GCAGAGGGGA AATGATATTT TTGAATATAA TTCTAATGGC CTCCTGCAGA AAGCCTACAA 6780
TAAGGCTTCT GGCTGGACTG TGCAGTATTA CTATCATGGG CTTGGGCGAC GTGTCGCGAG 6840
TAAGTCCAGC CTAGGGCAGC ACCTTCAGTT CTTTGTCGAC GCGACCGCGA ACCCCATAAG 6900 
AGTTACTCAT TTGTACAACC ACACAAGCTC GGAGATTACA TCTCTGTATT ATGATCTCCA 6960 
AGGTCACCTT ATTGCCATGG AGTTAAGCAG TGGTGAAGAA TATTATGTAG CCTGTGATAA 7020 
TACAGGTACC CCACTAGCTG TGTTCAGCAG CCGAGGTCAG GTCATAAAGG AGATACTATA 7080
CACACCTTAT GGCGATATCT ATCATGACAC TTACCCTGAC TTTCAGGTCA TAATTGGTTT 7140
TCATGGAGGA CTCTATGATT TCCTTACTAA ATTAGTGCAC CTGGGGCAAA GGGATTATGA 7200 
TGTTGTTGCT GGCAGATGGA CAACGGCCTA TCATCACATA TGGAAACAGT TGAACCTCCT 7260
TCCTAAACCA TTCAACCTCT ACTCCTTTGA AAATAACTAC CCAGTTGGCA AAATTCAAGA 7320 
TGTTGCAAAG TATACCACAG ACATCAGAAG TTGGTTGGAG CTATTTGGTT TCCAATTACA 7380
CAATGTACTA CCTGGATTTC CCAAACCTGA ATTAGAAAAT TTAGAATTAA CTTACGAGCT 7440 
TCTACGGCTT CAGACAAAAA CTCAAGAGTG GGATCCTGGA AAGACTATCC TGGGCATTCA 7500
GTGTGAACTC CAGAAACAGC TCAGGAATTT CATTTCCTTG GACCAACTAC CTATGACTCC 7560 
CCGATACAAT GATGGACGGT GCCTTGAAGG AGGGAAGCAA CCAAGGTTTG CTGCTGTCCC 7620
TTCTGTTTTT GGGAAAGGTA TAAAATTTGC CATCAAGGAT GGCATAGTAA CAGCTGATAT 7680 
TATAGGAGTA GCCAATGAAG ATAGCAGGCG GCTTGCTGCC ATTCTCAATA ATGCCCATTA 7740 
CCTGGAAAAC CTACATTTTA CCATAGAGGG GAGGGACACT CACTACTTCA TTAAGCTTGG 7800 
GTCTCTGGAG GAAGACCTGG TGCTCATCGG TAACACTGGG GGGAGGCGGA TTCTGGAGAA 7860 
TGGTGTCAAT GTCACTGTGT CCCAGATGAC TTCTCTGTTG AATGGGAGGA CTAGACGGTT 7920
TGCAGATATT CAGCTCCAGC ATGGAGCCCT GTGCTTCAAC ATCCGGTATG GGACAACTGT 7980
CGAAGAGGAA AAGAATCACG TGTTGGAGAT TGCCAGACAG CGCGCAGTGG CCCAGGCCTG 8040
GACTAAGGAA CAAAGAAGGC TGCAAGAGGG GGAAGAGGGG ATTAGGGCAT GGACAGAAGG 8100 
GGAAAAGCAG CAGCTTTTGA GCACTGGGCG GGTACAAGGT TACGATGGGT ATTTTGTTTT 8160 
GTCTGTTGAG CAGTATTTAG AACTTTCTGA CAGTGCCAAT AATATTCACT TTATGAGACA 8220
GAGCGAAATA GGCAGGAGGT AACAAAAATA TCTCTGCCTT TGCGTCACCA AAGACTGCCT 8280
GTTTTTAAAA CATAAAATGG TTTATTGTAT TGGTTTTCTA GATCAGAACT CTGTATATGT 8340 
AAATATGGAG GAAAAACATA TCCAACTGCC TTTCAATGTG ACGGAAGATG GTATTTTAAT 8400 
ATTGTTTGTT TAAACTCTTT AAGAAATGAC AGAGATTTTT AGTTCTTGTG TGGCAGTATT 8460 
CAAAATAACA CAAGTAGAAC TCAAACAGCT AAAAACAGTT TTCAGAAAGC ACCACTTTCA 8520
ATTTGCCGAG CCATGCATAT GTTCCAATAT CCAGAAAGAA CCCAAGGTTC TCTATCTCTA 8580
TTGTGAGAAG CAGTTTCATC CTTAACTGTT GGCAGAACTT ACGGGCTATT TGAATAGGTG 8640
GTGCAATAGT ATCTGAAACT TGCCTTTCGA AAGACTGCCA GCCCTTTGAC GTTTTCCAGA 8700 
TCTGTTATAG GAAACTTAAA AACAGGTGTA AAATGTCTTC AGCCACCATC TCCTAGAGTG 8760 
AGGACCCAAT TGCCCTTCCT TCTTGATTAT TCCTCCTTGC TTGTTAAAGT AAATGCCATA 8820
TTGTTGTGCT GTGTTTTGGC GTGTGGTGGC TGGGTTCTGT CTACCATGCT TCCCTGTGGG 8880 
TGTGGTAACC AGACTGTATA GCCGCTATTT GCTCGTGTGT ACATGATACC AAAGCAGCTG 8940 
GCCAGCGTGA CCTCTCTCAC ACGACCTGTT TTGACTCAAT TTTTTACTAA AAGTTGTTCA 9000 
GCTGTATTGG TATCATGTAA ACATAGCTTT TATTAACCTG GGTAGGAATT TCTCATTTAT 9060 
ATATAGGATG TGTTTTGGTC ATAGTTTCAC ATTAGTGATT CAGTATCTAT ACACTGACCC 9120
AATGGTTTTG TGCACATGAA CGGTAATTTA CTTAAAAGTA TGATTCTGGT ACAAAAACAA 9180
ACAAAGGCTT TAGCAGGCAT ACGTGTCTGG GATGCCGATA CATACATTAA CTACTACTGC 9240
AGAAATTCAT AAGAGCCAAA ACCTTAAAAA AATAGACCTG GTACTTAAGT GAAAGTACTA 9300 
AAGGGAAGAC CAGACCAAAC ATCACAGCAG TTGCTGCCAC ATTGTTTCAG CCCACTTAGA 9360 
TTTATCTTTC AAATGTACAA TTCTGTATTG AACATCTCCC AGCCATCTTC AGGAAATCGA 9420
ATCAAGTAAA TCCTTTCCAA CCGAAAACAT TTCAACTAAC TATAGAGAGG CAGACTCATT 9480 
TTTACTAAAA TAATTTATAC AGTTAGTTAT TTTCGTTCTC CGTACTTACC CATTTATCTT 9540 
TATTTAATCG TCTCTACTGC CTAGGAAAAT AACTATTTTC CAGGACGGGT TATTTGTTCT 9600
GCGATCATTT AAAATTTGGA GAAAGGTCAG GATTAGTGTT AATATCAGCT GCAGTTTCTC 9660
AATCTCTAGG AATCCTGCAG TAAAACAAGC CCCTTGGTGA GCTGGAAGAT TTGTGCCCAG 9720

TGACAAAGAG ATAGTTTGTA AAATGCTGTG TAATTGTAAG TTACCACAAA TGAAAATACA 9780
TGACAGCACA ATGTGGCCCG TAGAAAATTC CCCTGAGCCA GCTTCTGCAC TTTCATCACC 9840
GAATCTGAAC ATTTGCTATG TCTGAAGGCA AATTTATGAT GGAATGTTAG TTTGGATTCT 9900
TTCCAGATGC TACCTAAATG CAGTGTGGGG TCATTGCCTT GCTTTGCGAT GACAGTTTCT 9960
TTGAAAATAT GCAAAGTCAT AAGCTCATGT TAAGGTTTTT CAAGAGTCTG CCTCCTACTA 10020

CACAAAGGAA AGCAAGGGAA AGGAAATGAC CCTGGCAAAC AGTAGGGAAG GGTGTATTCA 10080 
AACATTTCAT TTTCAAAACC TTCGGGTTAG AATACCACTT ACACATGTAT TCTGAGAGAC 10140
AGAATTCATG AGGAACTCAT CTCTCTTTAT AACTGGAAAC ACACCAGCTT GATATATTGC 10200
TAATCCATAC TAAAATCATA TTATTGGGTT TTTTCTGAAT CAGGCCTGTA TTAATGGTAC 10260
AGTATTTATT CAGAATGGAA TTCTAAAATT ACTAACAAAC TTGTTGAAAA TTTGAATACC 10320
TCCACACCAA CCTAAAAATG GACCTTAAGT TCCTAGAACC TCTGATGTTC TTTTAAATTA 10380 
ATGGAAAAAT AATTTGTGAA CTGTATATAG AGAGTGCATT CATAAATGTG ATTATGTATT 10440 
TTATCACAAA TCCAAAATGT CAATATTAGA GTCTATTTTG CTTATATTTT AAGCAATTAT 10500 
ACGTTTTTGC AATTCATTGA TGATGTATCA TTTTCAAACT GCTTTAAATA TCCATTAGAA 10560
ACAAATATTT GAAGCTTTTA CTTAATAGTG ATTACCTTGA ACTGTGCATT TCTAGTTTGT 10620
AATACGTATT TGGTTGGTTC GTGCCTTTAG TTTGTTAAAG TTACATTTGT ATTATATTCA 10680 
GGAAATGCAC TTTTTATTAC TTACAGCTGT GGTTTTAATA CTGCCTTGAA CTATTATTAT 10740 
TCTTTTTACA ACTCCTAAAG CTTGAGGGAG GAAAGAAAAA AAAAACAAAA CTACTAATCA 10800 
GTAGTAAATC GAAGAGAAAC ATTTTGGCAT TTCTTAAGAA GAAGATGGAG ATATTGAGTA 10860 
TATCACTTCC TATTCAGCTG AATAGAAAGA ATGCCTTCAT TCACTTGCAG TTCTGCAGTT 10920
TAAATTATTG AAAGAACAAT TCGTTTGCAT TTCCTGATGA AAGTAAAAGC ATTTTTCAGA 10980 
GAAACATATG AATTTCTCAT ACCCAGCAGA CAGATGGCTG ACACTGCACA GCCACACACC 11040 
ATTCGAGTAA GTTAAAGTGA GAGCATAGTA GTTGGACTCT CCTATGAAGA ACATTCTGGG 11100 
CTGGAGGCAG GGAATACTCC ATGGTTGTTT CTTTTTCCTA CTTAAGCCCA TTTTGTTTGT 11160
GCTTTTCTGT TTTGTTTTGT TTTCACTCTT GCACTACAGT CTAGAGATCC AAATGAACTG 11220
AAAAGTTCAA AGTTTAACAC ATTTAAATAT GTTTACTTTT AGTTGTCATT CTAATCGTTA 11280 
TTGATTAGAA GCATGACTCC TGAAGGAAAG GGAAATAAAT CTCAATTCAT ACTAACTTGC 11340
AACAAAACAC TTTTACCATA TAAATAAGTA TATGATTTAT TTTTAACCCA AAAAATGTAT 11400
AAAATAAGTG TGTCCTTTAC TGTCAATTTA TCGAGAAGAT CTATAATATA TACACTACAT 11460
ATATATAATA TATACAACAT AGCCAAATGT ATGAAAACTT GACAATGTAT AATTTGGAAT 11520
TCACATGCTA CCTATGTAGA CAGGTATGAA ATTAAGTTAT AATTTTCATG AGACATTTTC 11580 
ATCACTGTTG ACACAGTTTC AAGGCATTCC ATCATGTTAT TTTGACTCTT TTTCTTTTTT 11640 
TTTTCTTTAA AAATATATTT TTAACTAGAC CAGGCCCCAC TATAATATCA CTTAACAGAG 11700
TCAGGGCAAA GTTTTTGCAT TTATGAAGAT GTGTTCATGT AAGGGTGATT GTAATGGAGT 11760
TCATTGGTAA TAGAAGCAAA AGTACAGTAA CGAAGTATTG AAAAGAAAAT TTTGGAGACA 11820
TTGGAGCATA TTATATATAG CTTGTGGAAA GACATAAGGC TACAGATGGA ATGGAACATT 11880 
CCTGTTTTCT TGAAGAAATT CACATACACA TAGCTGACCT GACTAGTACT TCAGCTCTTC 11940
CACAGCCTTC TATAAAGGTT CTTTCTTCTG CAAAGAAAAC AAAACAAAAC AAAACAAAAC 12000 
AAAAAAAAAC AAAAAAAGCG CAAAAAACAA AAAAACAAAA AAAAGCAAAG TAAAATTTAA 12060 
AAATACAGAA AACAAACAAC AAAAAAGAAT TCAACCATAA ATAGTGACTA TTATTTTCAG 12120
TGTGTCCTTC ATGTGAAAGC TATTAAGGAC CAAATATACT ACTGTTCATA AGAACAAATT 12180
ACTTTCTAAA CAGTAACTGA AAATACTTAG AGTTAAACTT GCTGTGGATT TTGTCTTGGC 12240 
AGTTGTCATC TTACATTATT TGTCAAAGGA AATGTGTTTG GCAGTTAAAA ATCTTTCCTT 12300 
AGATTTAGTG GTGGACTTTA ACCTCTTAAA TAAATGTTAG TATATCAGAT TGTGTCCTTG 12360 
AAAAATATTT TACTTGTATG AATCATGACA ACGTCTAAAT CTTTACTATT CTTCTGGCAA 12420
AAGCATCAGT AAGAAAGAAG GCGAAAAAGA GAAGTATAGC CTTTATGTCA GAAAAACATT 12480 
CTTTTTAGCT GCTTACTTTC TCATGAAAAG TAAAGATGTT TACAGTGTAT GCCAAGTTTT 12540 
CAGTTTCTGT ATAACAACAG GTAGAGGTTC TAATCATATT GAAAATTGTG TTATAATGGT 12600
CTGAGCCATG TTGCTAGGAA ACAATAGGTT CCAATTTTGT ATTCCTGCIC TCCTGTGCTG 12660
AAAAGTGACT GGATACTGTA CAGGTTCATG TTCTCTGGCT GCAGTTAAAT GGTCTTTTGC 12720
ATTTTGCTCT GGCTTTCAGG CCAGAAGCAT GCATTTTTCT ACAAGAGCAT CACAACAACA 12780
TGCTGTAAAT ATTTAAAGTT AAACATTATG TGTTGATATT TGAAAGAAAA GTACTTTGAA 12840 
TATTTCATTT TTAAAAAATA AAATTGCCAA TGAAAAAAAA 

odz (odd Oz/ten-m, Drosophila) homolog 1 

NM_014253 
NP_055068

31S-340 



LRMWIRGMKS EHSSCLSSRA NSALSLTDTD HERKSDGENG FKFSPVCCDM EAQAGSTQDV 

QSSPHNQPTP RPLPPPPPPP HACTCARKPP PAADSLQRRS MTTRSQPSPA APAPPTSTQD 

EPLETRH SLFKHGSGSS AIPSAASQNY PLTSNTVYSP PPRPLPRSTF 

P YRCCNWKCTA LSATAITVTL ALLLAYVIAV HLFGLTWQLQ FVEGELYANG 



PPTHTQFDFV KLMDGKQLVK QDSKGSDDTQ 



CFGHGTCIMG VCICVPGYKG EICEEEDCLD PMCSNHG 

VCQEQCSGHG TFLLDAGVCS CDPKWTGSDC STELCTMECG SHGVCSRGIC QCEEGWVGPT 720

CEERSCHSHC TEHGQCKDGK CECSPGWEGD HCTIAHYLDA VRDGCPGLCF GNGRCTLDQN 780 

GWHCVCQVGW SGTGCNVVME MLCGDNLDND GDGLTDCVDP DCCQQSNCYI SPLCQGSPDP 840

LDLIQQSQTL FSQHTSRLFY DRIKFLIGKD STHVIPPEVS FDSRRACVIR GOWAIDGTP 900 

LVGVNVSFLH HSDYGFTISR QDGSFDLVAI GGISVILIFD RSPFLPEKRT LWLPWNQFIV 960 

VEKVTMQRVV SDPPSCDISN FISPNPIVLP SPLTSFGGSC PERGTIVPEL QVVQEEIPIP 1020

SSFVRLSYLS SRTPGYKTLL RILLTIISTIP VGMIKVHLTV AVEGRLTQKW FPAAINLVYT 1080 

FAWNKTDIYG QKVWGLAEAL VSVGYEYETC PDFILWEQRT VVLQGFEMDA SNLGDWSLNK 1140

HHILNPQSGI IHKGNGENMF ISQQPPVIST IMGNGHQRSV ACTNCNGPAH NNKLFAPVAL 1200

ASGPDGSVYV GDFNFVRRIF PSGNSVSILE LSTSPAHKYY LAMDPVSESL YLSDTNTRKV 1260

YKLKSLVETK DLSKNFEVVA GTGDQCLPFD QSHCGDGGRA SEASLNSPRG ITVDRHGFIY 1320

FVDGTMIRKI DENAVITTVI GSNGLTSTQP LSCDSGMDIT QVRLEWPTDL AVNPMDNSLY 1380

VLDNNIVLQI SENRRVRIIA GRPIHCQVPG IDHFLVSKVA IHSTLESARA ISVSQSGLLF 1440

IAETDERKVN RIQQVTTNGE IYIIAGAPTD CDCKIDPNCD CFSGDGGYAK DAKMKAPSSL 1500 

AVSPDGTLYV ADLGNVRIRT ISRNQAHLND MNIYEIASPA DQELYQFTVN GTHLHTLNLI 1560

TRDYVYNFTY NSEGDLGAIT SSNGNSVHIR RDAGGMPLWL VVPGGQVYWL TISSNGVLKR 1620

VSAQGYNPAL MTYPGNTGLL ATKSNENGWT TVYEYDPEGH LTNATFPTGE VSSFHSDLEK 1680

LTKVELDTSN RENVLMSTNL TATSTIYILK QENTQSTYRV NPDGSLRVTF ASGMEIGLSS 1740 

EPHILAGAVN PTLGKCNISL PGEHNANLIE WRQRKEQNKG NVSAFERRLR AHNRNLLSID 1800

FDHITRTGKI YDDHRKFTLR ILYDQTGRPI LWSPVSRYNE VNITYSPSGL VTFIQRGTWN 1860

EKMEYDQSGK IISRTWADGK IWSYTYLEKS VMLLLHSQRR YIFEYDQSDC LLSVTMPSMV 1920 

RHSLQTMLSV GYYRNIYTPP DSSTSFIQDY SRDGRLLQTL HLGTGRRVLY KYTKQARLSE 1980 

VLYDTTQVTL TYEESSGVIK TIHLMHDGFI CTIRYRQTGP LIGRQIFRFS EEGLVNARFD 2040 

YSYNNFRVTS MQAVINETPL PIDLYRYVDV SGRTEQFGKF SVINYDLNQV ITTTVMKHTK 2100

IFSANGQVIE VQYEILKAIA YWMTIQYDNV GRHGNMCIRV GVDANITRYF YEYDADGQLQ 2160 

TVSVNDKTQW RYSYDLNGDI NLLSHGKSAR LTPLRYDLRD RITRLGEIQY KMDEDGFLRQ 2220

RGNDIFEYNS NGLLQKAYNK ASGWTVQYYY HGLGRRVASK SSLGQHLQFF VDATANPIRV 2280

THLYNHTSSE ITSLYYDLQG HLIAMELSSG EEYYVACDNT GTPLAVFSSR GQVIKEILYT 2340

PYGDIYHDTY PDFQVIIGFH GGLYDFLTKL VHLGQRDYDV VAGRWTTAYH HIWKQLNLLP 2400

KPFNLYSFEN NYPVGKIQDV AKYTTDIRSW LELFGFQLHN VLPGFPKPEL ENLELTYELL 2460 

RLQTKTQEWD PGKTILGIQC ELQKQLRNFI SLDQLPMTPR YNDGRCLEGG KQPRFAAVPS 2520



N TGGRRILENG VNVTVSQMTS LLNGRTRRFA DIQLQHGALC FNIRYGTTVE 2640 
EEKNHVLEIA RQRAVAQAWT KEQRRLQEGE EGIRAWTEGE KQQLLSTGRV Q 
VEQYLELSDS ANNIHFMRQS EIGRR 



Gene name: Solute carrier family 26, 

Probeset Accession #: AF030830 



Coding sequence: 225-2567 (underlined sequences correspond to start and stop codons)



C CCGGTTCGGG AAAGGGGAAG AATGCAGGAG GGGTAGGATT TCTTTCCTGA 
3AAAGAC CGCAGCCTGT GTGTGTCTTT CCCTTCGACC AAGGTGTCTG 
TTGCTCCGTA AATAAAACGT CCCACTGCCT TCTGAGAGCG CTATAAAGGC AGCGGAAGGG 
TAGTCCGCGG GGCATTCCGG GCGGGGCGCG AGCAGAGACA GGICATGGCA GCGCCAGGCG 
GCAGGTCGGA GCCGCCGCAG CT 
G TGCTGCAGTT GTTCAAGAAA GAGAGCCTTT O

G GAGT6GCTCC CCAAATACCG AGTCAAGGAA TGGCTGCTTA 480 

T TTCGGGAGTT AGTACTGGGC TAGTGGCCAC GCTGCAAGGG ATGSCATATG 540 

CCCTACTAGC TGCAGTTCCT GTCGGATATG GTCTCTACTC TGCTTTTTTC CCTATCCTGA 600 

CATACTTTAT CTTTGGAACA TCAAGACATA TCTCAGTTGG ACCTTTTCCA GTGGTGAGTT 660 

TAATGGTGGG ATCTGTTGTT CTGAGCATGG CCCCCGACGA ACACTTTCTC GTATCCAGCA 720 

GCAATGGAAC TGTATTAAAT ACTACTATGA TAGACACTGC AGCTAGAGAT ACAGCTAGAG 780 

TCCTGATTGC CAGTGCCCTG ACTCTGCTGG TTGGAATTAT ACAGTTGATA TTTGGTGGCT 840 

TGCAGATTGG ATTCATAGTG AGGTACTTGG CAGATCCTTT GGTTGGTGGC TTCACAACAG 900 

CTGCTGCCTT CCAAGTGCTG GTCTCACAGC TAAAGATTGT CCTCAATGTT TCAACCAAAA 960 

ACTACAATGG AGTTCTCTCT ATTATCTATA CGCTGGTTGA GATTTTTCAA AATATTGGTG 1020 

ATACCAATCT TGCTGATTTC ACTGCTGGAT TGCTCACCAT TGTCGTCTGT ATGGCAGTTA 1080

AGGAATTAAA TGATCGGTTT AGACACAAAA TCCCAGTCCC TATTCCTATA GAAGTAATTG 1140 

TGACGATAAT TGCTACTGCC ATTTCATATG GAGCCAACCT GGAAAAAAAT TACAATGCTG 1200 

GCATTGTTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTGA ACTTCCACCT GTGAGCTTGT 1260 

TCTCGGAGAT GCTGGCTGCA TCATTTTCCA TCGCTGTGGT GGCTTATGCT ATIGCAGTGT 1320 

CAGTAGGAAA AGTATATGCC ACCAAGTATG ATTACACCAT CGATGGGAAC CAGGAATTCA 1380 

TTGCCTTTGG GATCAGCAAC ATCTTCTCAG GATTCTTCTC TTGTTTTGTG GCCACCACTG 1440 

CACGGCCGTC CAGGAGAGCA CTGGAGGAAA GACACAGGTT GCTGGCATCA 1500 



T CCTATTTTCT ATGGCAATGT CGATGGTTIT A 
GTGATGCTGT TTCAACAAAT AATGCTTTTG AGCCTGATGA GGATATTGAA GATCTGGAGG 



CTTTCCTGGA CGTTGTTGGA GTGAGATCAC TGCGGGTGAT TGTCAAAGAA TTCCAAAGAA 
TTGATGTGAA TGTGTAT 
GCGGGTTCTT TGACGACAAC A 

TAGTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT T 
TCACTCTCAT TCAGGATTGT AAAGATACCC TTGAATTAAT AGAAACAGAG CTGACGGAAG : 
AAGAACTTGA TGTCCAGGAT GAGGCTATGC GTACACTTGC ATCCTGAAAG TGGGTTCGGG ; 

GACTCAAAAC ACTCATTCTT TTTTCTATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 
TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TGGGCAAAAT GGCTAGAATT 



TCACAGATTT GCTAATAATG TTCACGTGGG CCCTGGCATA TCTCTGTTCA GTTAGAGTGA 2880

GTGCTGACCC AACAGCCTCT GTGGTCAAGC GAGTCACGAA TGATTAATCA TAAAGAAAAA 2940

TCAGTTTTTG ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGGTCACATT 3000

TAGTAAATGC TGAAATAAAA TGATTAATGC ATTTATCAAT AAAAGCCTTT GAAAATACTT 3060 

TGGATAATAA ATTGGAGTTT TAAAAATGCA AATTTGCTTA GTATCTAATA ATGAAGTGTT 3120 



CAGAACCAGG CCAATATATT TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT 3240

TTCATGGTTT ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3300

AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT AGTC1ATAGA 3360 

AATGATCATT GCATGGAGGC ATGTATAGGT ATGATCTGTG TAAAATCTGA CATAAAAACA 3420 

GTGCTATTCT GAGTGAAAAT TTTTTTGATG TGCTTACATA ACCATGGTGA TTAAAATGAG 3480 

TTTATATTTT TTCTCAAAAA TTTTAGCAGT GTGTAAAGTA AGTAATCTTT AACTGAACIC 3540 

TGACCACTTA AAAAAAAATC TAAAAATTGA ACTACCTATA GTAGTCTGTG TTTAAAGTGA 3600 

ATTTTTAAAG ACAAAGCATT CTAAATGAAC TCAATATAAA AACATTCATT TGGAATGTAC 3660 
ATACTGAAAA ATACAGGTTT TTTTGACCAA AAGTTTTTAT ATCTTTTCTT T 
TTTTTCCTAA GTGCCAACAA TTTTCTAGAT A 



CTTGATT GAAAACGACA TCATCCCTTG 3900 

GTATACTCCA GGGATTGGTT TCAGGACCCC TGCATTTACC AAAATTTGTG CACACTCAAG 3960 

TCCTGCAGTC ACCCCTGCCT AAAGATAGAA TGGCTTCTCT GTTTTTCTTC TGAAATACAA 4020 

CCAGAAACAA TGTGTCTATT TCTGAAAGAA TAGGATTAAT GATCATACAA ATGGGTTAAT 4080 

CCTGAATTCT GGTTGTAAAT CTGGTTACAG CATAACTAGG ATTATAATGC TGCCTCATTT 4140

TCACAGCACT ACTTGCTTAT ATTGACAACA AATCATCTCG CTAAAGAGTG AATGTAGGCC 4200 

AGGCGCGGTG GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGTGGATCAC 4260 

GAGGTCAGGA GATCGAGACC ATCCTGGCTA ACATGGTAAA ACCCCCTCTC TACTAAAAAT 4320

AGAAAAAAAG AAATTAGCCT AGCGTGGTGG CTGGCGGGCG CCTGTAGTCC CAGCTATTTG 4380 

GGAGGCTAAG GCAGGAGAAT GGCGTGAACC CGGGAGGCGG AGCTTGCAGT GAGCCGAGGT 4440 

CGTGCCACTG CACTCCAGCC TGGGCGACAG AGCAAGACTC CGTCTCAAAA AAAAAAAAAA 4500 

AAAAAAAAAA AGAGTGAATG TAATAGTCTT GCAGAAAATG AATGAATACC TTIGTTCAAT 4560

GGCTAGAGTT TGTAAATTCT GGGTTCATTT GTGATGACAT AAGTCAGCAA ACTGCGGGAA 4680 
TACTGTCTCT T 



TCCAGTATTG TATATGAGTT TTAACAAATT AAAAAATCAA ATCATGTACA TTTGAAAATA
TTTGCACACA TTTAAAAATA AATGTAAAGT TGTCTTTTAA ACTACTCGGA TGTGTCCTTT 
CTGAACAAAA 



Unigene number: 
Probeset Accession #:

Signal sequence:
Cellular Localization:



RDTARVLIAS 



YAIAVSVGKV YATKYDYTID GNQEFIAFGI SNIFSGFFSC FVATTALSRT AVQESTGGKT

QVAGIISAAI VMIAILALGK LLEPLQKSVL AAWIANLKG MPMQLCDIPR LWRQNKIDAV 

IWVFTCIVSI ILGLDLGLLA GLIFGLLTW LRVQFPSNNG LGSIPSTDIY KSTKMYKNIE 

EPQGVKILRF SSPIPYGNVD GFKKCIKSTV GFDAIRVYNK RLKALRKIQK LIKSGQLRAT 

KNGIISDAVS TNNAFEPDED IEDLEELDIP TKEIEIQVDW NSELPVKVNV PKVPIHSLVL 

DCGAISFLDV VGVRSLRVIV KEFQRIDVNV YFASLQDYVI EKLEQCGFFD DNIRKDTFFL 



Gene name: ESTs, Highly similar to calcium-activated potassium channel rSK2 [R.norvegicus] 

Unigene number: Hs.98280

Probeset Accession #: AA418000 

Nucleic Acid Accession #: NM_021614

Coding sequence: 45S-2197 (underlined sequences correspond to start and stop codons) 




Gene name: ESTs, Highly similar to calcium-activated potassium channel rSK2 [R.norvegicus]



2-364, 373-394 
Calmodulin binding domain: 412-488
Cellular Localization: plasma membrane 



I I I I I I 



IQNIGYK 120 

LGHRRALPEK RKRLSDYALI FGMFGIWMV IETELSWGAY DKASLYSLAL KCLISLSTII 180 

LLGLIIVYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCAIHPIP C-NYTFTWTAR 240 

LAFSYAPSTT TADVDIILSI PMFLRLYLIA RVMLLHSKLF TDASSRSIGA LNKINFNTRF 300 

VMKTLMTICP GTVLLVFSIS LWIIAAWTVR ACERYHDQQD VTSNFLGAMW LISITFLSIG 360 

YGDMVPNTYC GKGVCLLTGI MGAGCTALW AWARKLELT KAEKHVHNFM MDTQLTKRVK 420 

NAAANVLRET WLIYKNTKIA7 KKIDHAKVRK HQRKFLQAIH QLRSVKMBQR KLNDQANTLV 480

DLAKTQNIMY DMISDLNERS EDFEKRIVTL ETKLETLIGS IHALPGLISQ TIRQQQRDFI 540 
EAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSESS 

A215 DNA SEQUENCE:

Gene name: CGI-86 protein

Unigene number: Hs.109201

Probeset Accession #: AW161450 

Nucleic Acid Accession #: NM_016029

Coding sequence: 228-1097 (underlined sequences correspond to start and stop codons)



C TGGTGCGGGC CGTCTTCTTC CCCCCGAGCT 60
GGGCGTGCGC GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTGTGCGCGC 120 



A CGACGCCCAG AATGGGAGCT GACTGATATG G 
TGACTGGAGC CTCGAGTGGA A 
TTTCTCTTGT GCTGTCAGCC A 

TAGAGAATGG CAATTTAAAA GAAAAAGATA TACTTGTTTT GCCCCTTGAC C 
CTGGTTCCCA TGAAGCGGCT A 
TGGTCAACAA TGGTGGJ 

ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 600 
CTCACATGAT CGAGAGGAAG CAAGGAAAGA TTGTTACTGT GAATAGCATC CTGGGTATCA 660 
TATCTGTACC TCTTTCCATT GGATACTGTG CTAGCAAGCA TGCTCTCCGG GGTTTTTTTA 720 
ATGGCCTTCG AACAGAACTT GCCACATACC CAGGTATAAT AGTTTCTAAC ATTTGCCCAG 780 
GACCTGTGCA ATCAAATATT GTGGAGAATT CCCTAGCTGG AGAAGTCACA AAGACTATAG 840
GCAATAATGG AGACCAGTCC CACAAGATGA CAACCAGTCG TTGTGTGCGG CTGATGTTAA 900 
TCAGCATGGC CAATGATTTG AAAGAAGTTT GGATCTCAGA ACAACCTTTC TTGTTAGTAA 960 
CATATTTGTG GCAATACATG CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020 
AAAGGATTGA GAACTTTAAG AGTGGTGTGG ATGCAGACTC TTCTTATTTT AAAATCTTTA 1080
AGACAAAACA TGACTGAAAA GAGCACCTGT ACTTTTCAAG CCACTGGAGG GAGAAATGGA 1140 
AAACATGAAA ACAGCAATCT TCTTATGCTT CTGAATAATC AAAGACTAAT TTGTGATTTT 1200 
ACTTTTTAAT AGATATGACT TTGCTTCCAA CATGGAATGA AATAAAAAAT AAATAATAAA 1260
AGATTGCCAT GAATCTTGCA AA 

A216 Protein sequence:



Hs. 109201 

NP_057113 
1-26 

183-206, 221-243 

1 11 21 31 41 

I I I I I I 

MNWELLLWLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWE " ' 

GIGEELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGNL KEKDILVLPL DLTDTGSHEA 120

ATKAVLQEFG RIDILVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMIER 180

KQGKIVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN 240

IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 300

MPTWAWWITN KMGKKRIENF KSGVDADSSY FKIFKTKHD 

A217 DNA SEQUENCE: 

Gene name: Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763 ) 



ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCC CGCCGTGGCC CGAACCCGAG

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCCACGCCTT GGTGACCCGC
ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGSCA' CGTTCGCC 300

TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGCA AATGGCGCCA 420

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCICCAT CGTGCGTGTC AGGCATCTGC 780

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900

CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTC CTGAAAGTGA CCTTCCAAAT 1020

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

& AACAGCTGCG CTGGAAGCCA 1200 



ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA 
GCGAACCTGC AGACAATTCC AGATACCCAA 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 

GATGGCAGAT GCCAGAAGAT GGTCCTGATC TCTGAGGAAG GGCCACCTAG T 

TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT C 

TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG C 




ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PHWHFSATGS PIKTLYTQTH 360

STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPLS 420 
THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR R 



A219 DNA SEQUENCE 

Gene name: selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs.89546

Probeset Accession #: M24736

Nucleic Acid Accession #: NM_000450 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 



TCAGAAAAGT CAACAATGTG 



GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT G 
TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 
ACAGCCCTGG AATCCCCTGA GCATCGAAGC CTGGTTTGCA G 
AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG C 
ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG C 



CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTCAGGA AGGCTTCATG 1020
TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 
GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG A 
CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT G 
GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260
GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320

GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 

TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 

AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 

CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620

TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



A220 Protein sequence: 

Gene name: Selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs.89546

Probeset Accession #: M24736

Protein Accession #: NP_000441 

Signal sequence: 1-22 

Transmembrane domains: 555-573 

C-lectin domain: 23-139 

Cellular Localization: plasma membrane 



MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDSD CVEIYIKREK 120 

DVGMWNDERC SKXKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180 

TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNW 240

ECDAVTNPAN GFVECFQNPG SFPWHTTCTP DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480 

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540 

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 



A221 DNA SEQUENCE 

Gene name: ESTs; Liprin A2 

Unigene number: Hs. 306480 

Probeset Accession #: N51002
Nucleic Acid Accession #: N51002

Coding sequence: 1-3793 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

ATGATGTGTG AAGTGATGCC CACGATTAAT GAGGACACCC CAATGAGCCA AAGGGGGTCC 60 

CAAAGCAGTG GCTCGGACTC AGACTCCCAT TTTGAGCAGC TGATGGTGAA TATGCTAGAT 120 

GAAAGGGATC GTCTTCTAGA CACCCTTCGG GAGACCCAGG AAAGCCTCTC ACTTCCCCAG 180 

CAAAGACTTC AGGATGTCAT CTATGACCGA GACTCACTCC AGAGACAGCT CAATTCAGCC 240 

CTGCCACAGG ATATCGAATC CCTAACAGGA GGGCTGGCTG GTTCTAAGGG GGCTGATCCA 300 

CCGGAATTTG CTGCACTGAC AAAAGAATTA AATGCCTGCA GGGAACAACT TCTAGAAAAG 360

GAAGAAGAAA TCTCTGAACT TAAAGCTGAA AGAAACAACA CAAGACTATT ACTGGAGCAT 420 

TTGGAGTGCC TTGTGTCACG ACATGAAAGA TCACTAAGAA TGACGGTGGT AAAACGGCAA 480 

GCCCAGTCTC CCTCAGGAGT ATCCAGTGAA GTTGAAGTTC TCAAGGCACT GAAATCTTTG 540 

TTTGAGCACC ACAAGGCCTT GGATGAAAAG GTAAGGGAGC GACTGAGGGT TTCTTTAGAA 600 

AGAGTCTCTG CACTGGAAGA AGAACTAGCT GCTGCTAATC AGGAGATTGT TGCCTTGCGT 660 

GAACAAAATG TTCATATACA AAGAAAAATG GCATCAAGCG AGGGATCCAC AGAGTCAGAA 720 

CATCTTGAAG GGATGGAACC TGGACAGAAA GTCCATGAGA AGCGTTTGTC CAATGGTTCT 780

ATAGACTCAA CCGATGAAAC TAGTCAAATA GTTGAACTAC AAGAATTGCT TGAAAAGCAA 840 

AACTATGAAA TGGCCCAGAT GAAAGAACGT TTAGCAGCCC TTTCTTCCCG AGTGGGAGAG 900 

GTGGAACAGG AAGCAGAGAC AGCAAGAAAG GATCTCATTA AAACAGAAGA AATGAACACC 960 

AAGTATCAAA GGGACATTAG GGAGGCCATG GCACAAAAGG AAGATATGGA AGAAAGAATT 1020

ACAACCCTTG AAAAGCGTTA CCTCAGTGCT CAGAGAGAAT CTACCTCCAT ACATGACATG 1080 

AATGATAAAC TAGAAAATGA GTTAGCAAAT AAAGAAGCTA TCCTACGGCA GATGGAAGAG 1140 

AAAAACAGAC AGTTACAAGA ACGTCTTGAG CTAGCTGAAC AAAAGTTGCA GCAGACCATG 1200

AGAAAGGCTG AAACCTTGCC TGAAGTAGAG GCTGAACTGG CTCAGAGAAT TGCAGCCCTA 1260

ACCAAGGCTG AAGAGAGACA TGGAAATATT GAAGAACGTA TGAGACATTT AGAGGGTCAA 1320 

CTTGAAGAGA AGAATCAAGA ACTTCAAAGA GCTAGGCAAA GAGAGAAAAT GAATGAGGAG 1380 

CATAACAAGA GATTATCGGA TACGGTTGAT AGACTTCTGA CTGAATCCAA TGAACGCCTA 1440 

CAACTACACT TAAAGGAAAG AATGGCTGCT CTAGAAGAAA AGAATGTTTT AATTCAAGAA 1500 

TCAGAAACTT TCAGAAAGAA TCTTGAAGAA TCTTTACATG ATAAGGAAAG ATTAGCAGAA 1560 

GAAATTGAAA AGCTGAGATC TGAACTTGAC CAATTGAAAA TGAGAACTGG CTCTTTAATT 1620 

GAACCCACAA TACCAAGAAC TCATCTAGAC ACCTCAGCTG AGTTGCGGTA CTCAGTGGGA 1680 

TCCCTAGTGG ACAGCCAGTC TGATTACAGA ACAACTAAAG TAATAAGAAG ACCAAGGAGA 1740 

GGCCGCATGG GTGTGCGAAG AGATGAGCCA AAGGTGAAAT CICTTGGGGA TCACGAGTGG 1800 

AATAGAACTC AACAGATTGG AGTACTAAGC AGCCACCCTT TTGAAAGTGA CACTGAAATG 1860 

TCTGATATTG ATGATGATGA CAGAGAAACA ATTTTTAGCT CAATGGATCT TCTCTCTCCA 1920

AGTGGTCATT CCGATGCCCA GACGCTAGCC ATGATGCTTC AGGAACAATT GGATGCCATC 1980 

AACAAAGAAA TCAGGCTAAT TCAGGAAGAA AAAGAATCTA CAGAGTTGCG TGCTGAAGAA 2040 

ATTGAAAATA GAGTGGCTAG TGTGAGCCTC GAAGGCCTGA ATTTGGCAAG GGTCCACCCA 2100 
GGTACCTCCA TTACTGCCTC TGTTACAGCT TCATCGCTGG CCAGTTCATC TCCCCCCAGT 2160

GGACACTCAA CTCCAAAGCT CACCCCTCGA AGCCCTGCCA GGGAAATGGA TCGGATGGGA 2220 

GTCATGACAC TGCCAAGTGA TCTGAGGAAA CATOSGAGAA AGATTGCAGT TGTGGAAGAA 2280 

GATGGTCGAG AGGACAAAGC AACAATTAAA TGTGAAACTT CTCCTCCTCC TACCCCIAGA 2340 

GCCCTCAGAA TGACTCACAC TCTCCCTTCT TCCTACCACA ATGATGCTCG AAGTAGTTTA 2400 

TCTGTCTCTC TTGAGCCAGA AAGCCTCGGG CTTGGTAGTG CCAACAGCAG CCAAGACTCT 2460 

AAAGAAAAAG CTCGACTTGG GCAGCTCCGA GGCTTTATGG AGACTGAAGC TGCAGCTCAG 2580

T CAAGCTGAGA AGGATCGAAG ACTAAAGAAA 2640



T AGAGCTTTGG TTGGGAATGC CTGCGTGGTA CGTGGCAGCC 2760 

TGCCGAGCCA ACGTGAAGAG TGGTGCCATC ATGTCTGCTT TATCTGACAC TGAGATCCAG 2820

A TCCACTGCAT CGCTTAAAAC TTCGATTAGC AATCCAGGAG 2880 

2 TTCAGCTCCT CCAACATCTC GAACTCC1TC AGGCAACGTT 2940 

TGGGTGACTC A 
GAAGGAAGCT GGGCCCAGTG T 

CATGAGTGGA TTGGAAATGA ATGGCTTCCC AGCTTGGGGT TACCTCAGTA CAGAAGTTAC 3120 

TTTATGGAAT GCTTGGTAGA TGCAAGAATG TTAGATCACC TAACAAAAAA AGATCTCCGT 3180 



A CTAGAAAGAA GACGGGAAGC AAGCCAACAT 3300 
GAAATAAAAG ACGTGTTGGT GTGGAGCAAT GACCGAATTA TTCGCTGGAT ACAAGCAATT 3360 
GGACTTCGAG AATATGCAAA TAATATACTT GAGAGCGGTG TGCATGGCTC ACTTATAGCC 3420



C TCTTGGCCCT G 

AGGCGACTGG ATGAAAGTGA TGACAAGAAC TTCAGACGTG GATCAACCTG GAGAAGGCAG 3600 

; ATGATGCCTG GGTCCTCAGA AACATTACCA 3660 

A GGTTAACCAC AACCTCTGGG CAATCAAGAA AAATGACAAC AGATGTTGCT 372 0 

TCATCAAGAC TGCAGAGGTT AGACAACTCC ACTGTTCGCA CATACTCATG TCTCGAGTAA 3780 



Gene name: ESTs; Liprin A

Unigene number: Hs.306480

Probeset Accession #: N51002

Protein Accession #: none found

Signal sequence: none found

Transmembrane domains: none found



Cellular Localization: not determined 

1 11 21 31 41 51 

I I I I I I 

MMCEVMPTIN EDTPMSQRGS QSSGSDSDSH FEQLMVNMLD ERDRLLDTLR ETQESLSLAQ 
QRLQDVIYDR DSLQRQLNSA LPQDIESLTG GLAGSKGADP PEFAALTKSL NACREQLLEK 
EEEISELKAE RKNTRLLLEH LECLVSRIIER SLRMTWKRQ AQSPSGVSSE VEVLKALKSL 
FEHHKALDEK VRERLRVSLE RVSALEEELA AANQEIVALR EQNVHIQRKM ASSEGSTESE 
HEKRLSNGS IDSTDETSQI VELQELLEKQ NYEMAQMKER IiAAL S SRVGE 
LIKTEEMNT KYQRDIREAM AQKEDMEERI TTLEKRYLSA QRESTSIHDM 
» KEAILRQMEE KNRQLQERLE LAEQKLQQTM RKAETLPEVE AELAQRIAAL 
I EERMRHLEGQ LEEKNQELQR ARQREKMNEE HNKRLSDTVD RLLTESNERL 
QLHLKERMAA LEEKNVLIQE SETFRKHLEE SLHDKERLAE EIEKLRSELD QLKMRTGSLI 
EPTIPRTHLD TSAELRYSVG SLVDSQSDYR TTKVIRRPRR GRMGVRRDEP KVKSLGDHEW 
P SGHSDAQTLA MMLQEQLDAI 



G VMTLPSDLRK HRRKIAWEE EH 

jEPESLG LGSANSSQDS LHKAPKKKGI KSSIGRLFGK 840 

KEKARLgQLR GFMETEAAAQ ESLGLGKLGT QAEKDRRLKK KHELLEEARR KGLPFAQWDG 900 

PTWAWLELW LGMPAWYVAA CRANVKSGAI MSALSDTEIQ REIGISNPLH RLKLRLAIQE 960 

MVSLTSPSAP PTSRTPSGNV WVTHEEMENL AAPAKTKESS EGSWAQCPVF LQTLAYGDI1N 1020 

HEWIGNEWLP SLGLPQYRSY FMECLVDARM LDHLTKKDLR VHLKMVDSFK RTSLQYGIMC 1080 

LKRLNYDRKE LERRREASQH EIKDVLVWSN DRIIRWIQAI GLREYANNIL ESGVHGSLIA 1140 

LDENFDYSSL TLLLQIPTQN TQARQ I LERE YNHLLALGTE RRLDESDDKN FRRGSTWRRQ 1200 
FPPREVHGIS MMPGSSETLP AGFRLTTTSG QSRKMTTDVA SSRLQRLDNS TVRTYSCLE 

A223 DNA 



o start and stop codons) 



I I I I I I 

ATGAGGCGAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 
AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 
TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG
AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 
ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 
GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 
AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 
CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 
GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA
GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 
CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 
TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 
ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 
TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 
CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 
ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 
ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 
GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



I 



2 PTVFDLSPQQ 

KEWQRMLQLI QSRLQEEHSL QDVIPKSAPK STSTALPPRE DDSSQSPNAC RIHGHLYVNK 
VAGNPHITVG KAIPHPRGHA HLAALVNHES YNFSHRIDHL SPGELVPAII NPLDGTEKIA 
IDHNQMFQYF ITWPTKLHT YKISADTHQF SVTERERIIN HAAGSHGVSG IFMKYDLSSL 
MVTVTEEHMP FWQFFVRLCG IVGGIPSTTG MLHGIGKPIV EIICCRPRLG SYKPVNSVPP 
EDGHTDNHLP LLENNTH 



A225 DNA SEQUENCE:

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]

Unigene number: Hs.100686

Probeset Accession #: AA487468

Nucleic Acid Accession #: AA487468
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 
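
The coding-sequence coordinates given with each DNA record (for example 55-555 for the A225 sequence above) are 1-based and inclusive. The following is a minimal sketch, assuming the standard genetic code, of how such a range might be cut out of a printed sequence and translated. The helper names `clean`, `extract_cds`, and `translate` are illustrative only and do not come from this document; the fragment used is the first 84 nucleotides of the A225 sequence as listed.

```python
from itertools import product

# Standard genetic code with bases ordered T, C, A, G (an assumption of this sketch).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and position numbers used in the printed listing."""
    return "".join(ch for ch in seq.upper() if ch.isalpha())


def extract_cds(seq, start, end):
    """Return nucleotides start..end (1-based, inclusive) of a listed sequence."""
    return clean(seq)[start - 1:end]


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing unreadable characters are reported as 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 84 nucleotides of the A225 listing, including a printed position number.
fragment = (
    "CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 "
    "CTACACTCAG CTTTGGGTCT CTGC"
)
cds_start = extract_cds(fragment, 55, 84)
peptide = translate(cds_start)
print(peptide)
```

Under these assumptions the translated fragment agrees with the first ten residues of the A226 protein sequence listed below.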



A226 Protein sequence 



Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 



none found 



MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL



coupled receptor 49 (GPR49) (HG38) (LGR5) 

(underlined sequences correspond to start and stop codons)
GTGGCGGCAA CCGGCACCTC AGTCCCCGCC GCGCTTCTCC TCGCCGCCCA CGCCGTGGGG

TCAGGAACGC GGCGTCTGGC GCTGCAGACG CCCGCTGAGT TGCAGAAGCC CACGGAGCGG 

CGCCCGGCGC GCCACGGCCC GTAGCAGTCC GGTGCTGCTC TCCGCCCGCG TCCGGCTCGT 

GGCCCCCTAC TTCGGGCACC ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG 

TGCTGCTGCA GCTGGCGACC GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT 

ACCTGGGGCT CTCGGAGCTG CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA 

GTATGAACAA CATCAGTCAG CTGCTCCCGA ATCCCCTGCC CAGTCTCGGC T 






ACAGTCTTAA AGTTCTTATG CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC
TGCAGAATTT GCGAAGCCTT CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC 
CCCCAAGCTG TTTCAGTGGC CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT 



CCCTGAACAA 



GGACACTCTC CAACCTTAAA GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG 1020 

AGAAAGCATT TGTAGGCAAC CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC 1080

AATTTGTTGG GAGATCTGCT TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG 1140 

GTGCCTCACA AATAACTGAA TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA 1200

CTTTAACTGG AGCACAGATC TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC 1260 

TCCAAGTGCT AGATCTGTCT TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC 1320 

AAAAGCTTCA GAAAATTGAC CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT 1380

TCCAGCAGTT GCTTAGCCTC CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC 1440 

ACCCCAATGC ATTTTCCACT TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC 1500 

TGTCGTCTTT TCCTATAACT GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC 1560 

ATGCCTTACA GAGCTTGATA TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC 1620 

CTTATGCTTA CCAGTGCTGT GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC 1680

AATGGAATAA AGGTGACAAC AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT 1740 

TTCAGGCTCA AGATGAACGT GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA 1800 

AAGCCCTTCA TTCAGTGCAG TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC 1860 

TGCTTGATGG CTGGCTGATC AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT 1920 

GTAATGCTTT GGTGACTTCA ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC 1980 

TGTTAATTGG GGTCATCGCA GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG 2040

CTGGTGTGGA TGCGTTCACT TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG 2100 

GGGTTGGTTG CCATGTCATT GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC 2160 

TGCTTACTCT GGCAGCCCTG GAGCGTGGCT TCTCTGTGAA ATATTCTGCA AAATTTGAAA 2220 

CGAAAGCTCC ATTTTCTAGC CTGAAAGTAA TCATXXTGCT CTGTGCCCTG CTGGCCTTGA 2280 

CCATGGCCGC AGTTCCCCTG CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC 2340 

CTTTGCCTTT TGGGGAGCCC AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT 2400 

CCCTTTGCTT CCTCATGATG ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG 2460

GAGACCTGGA GAATATTTGG GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA 2520

CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAG TCCTGAAGTA ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700 

TGAGAAAGCA AACCTACGTC TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT 2760 

CTGATGATGT CGAAAAACAG TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820

CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAGCTIAT CCAGTGACTG 2880 

T GTGGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940 

A AAGGTTGAGA ACCTGAAAAT GTGAGATTGA GTATATCAGA GCAGTAATTA 3000 
h ACTCGGTTTA AA 




Signal sequence: 
Transmembrane domains:



I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL L ' 

PSNBSVFTSY LDLSMNHISQ LLPNPLPSLR FLEELRLAGN A 
LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LKSLRHLWLD DNALTEIPVQ 
AFRSLSALQA MTIiALNKIHH IPDYAFGNLS SLWLHLHNW RIHSLGKKCF DGLHSLETLD 
IMYNNLDEFP TAIRTLSNLK ELGFHSNNIR SIPEKAFVGN PSLITIHFYD NPIQFVGRSA 
FQHLPELRTL TLNGASQITE FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS 
YNLLEDLPSF SVCQKLQKID LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 
LPSLIKLDLS SNLLSSFPIT GLHGLTHLKL TGNHALQSLI SSENFPELXV IEMPYAYQCC 
AFGVCENAYK ISNQWNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 
CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTOJALVTS TVFRSPLYIS PIKLLIGVIA 
AVHMLTGVSS AVLAGVDAFT FGSFARHGAW WENGVGCHVI C-FLSIFASES SVFLLTIAAL 
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 
STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTHCILNC 
PVAFLSFSSL INLTFISPEV IKFILLWVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL P! 
VAFVPCL 

Table 75: See Table 1 
Table 76A depicts Seq ID No; Unigene ID; Unigene Title; Pkey; Predicted Cellular Localization; and Exemplar Accession for each of the sequences in Table 78. The information in Table 76A is linked by Seq ID No to Table 78.



Unigene ID: Unigene number

Unigene Title: Unigene gene title 

Pred Subcell Loc: 



Seq ID No 


Pkey 




UnigenelD 


Unigene Title 


Pred Subcell Loc 


Seq ID1&2 


425023 


AW956889 


Hs.154210 


endothelial differentiation, sphingolipi 




Seq ID3&4 


424503 


NM_002205


Hs.149609 


integrin, alpha 5 (fibronectin receptor,




Seq ID 5 & 6


429423 


AI016712 


Hs.287797 


integrin, beta 1 (fibronectin receptor,




Seq ID7&8 


400289 


X07820 


Hs.2258 


matrix metalloproteinase 10 (stromelysin 




Seq ID 9 & 10


419172 




Hs.22120 


T nil r to TRANSMEMBRANE 4 SUPERF 




Seq ID 11 & 12 


418007 


M13509 


Hs.83169 


matrix metalloproteinase 1 (interstitial


Seq ID 13 & 14 


418007 


M13509 


Hs.83169 


matrix metalloproteinase 1 (interstitial 




Seq ID 15& 16 


407836 


T79340 


Hs.200272 


B-cell CLL/lymphoma 6, member B, zinc fi 




Seq ID 17 & 18


414577 


AI056548 


Hs.72116 


hypotheti rotcin FLJ20992 sin 




Seq ID19&20 


418738 


AW388633 


Hs.6682 




"membrane 


Seq ID 21 & 22 


428368 


BE440042 




matrix metalloproteinase 3 (stromelysin 


secreted 


Seq ID 23 & 24 




C18356 


HS35944 


tissue factor pathway inhibitor 2 




Seq ID 25 & 26 


429276 


AF056085 


Hs.198612 


G protein-coupled receptor 51 




Seq ID 27 & 28


418994 


AA296520 


Hs.89546 


selectin E (endothelial adhesion molecul
endothelial cell-specific molecule 1 




Seq ID 29 & 30 


407975 


X89426 


Hs.41716 




Seq ID 31 & 32 


429113 


D28235 


Hs.196384 






Seq ID 33 & 34


418506 


AA08424S 


Hs.85339 


C-ft >li i roupled receptor 39 




Seq ID 35 & 36 


423961 


D13666 


Hs.136348


periostin (OSF-2CS) 




Seq ID 37 & 38 


414812 


X72755 


Hs.77367 


monokine induced by gamma interferon 




Seq ID 39 & 40 


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 




Seq ID 41 & 42


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 




Seq ID 43 & 44


424399 


AI905687 




A 05 7 H DTI r l 0 011 FTC L H ii ( 




Seq ID 45 & 46 


422867 


L32137 


Hs.1584 


cartilage oligomeric matrix protein (pse




Seq ID 47 & 48


428227 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


secreted 


Seq ID 49 & 50 


444381 


BE387335 




ESTs, Weakly similar to SG4054 hypotheti 




Seq ID 51 & 52 


439569 


AW602166 


Hs.222399 


CEGP1 protein 




Seq ID 53 & 54 


411558 


AA102670


Hs.70725 


gamma-aminobutyric acid (GABA) A recepto
LIV-1 protein, estrogen regulated 




Seq ID 55 & 56 


400303 


AA242758 


Hs.79136 




Seq ID 57 & 58 


411789 


AF245505 


Hs.72157 


Adlican




Seq ID 59 & 60 


428698 


AA852773 


Hs.334838 


KIAA1866 protein 


plasma membrane 


Seq ID 61 & 62 


450098 


W27249 


Hs.8109 


hypothetical protein FLJ21080 




Seq ID 63 & 64


421552 


AF026692 


Hs.105700 


secreted frizzled-related protein 4 




Seq ID 65 & 66 


452747 


BE153855


Hs.61460 


Ig superfamily receptor LNIR 
BMP-RIB 




Seq ID 67 & 68 


415539 


AI733881 


Hs.72472 


Seq ID 69 & 70 




N32536 


Hs.42645 


solute carrier family 16 (monocarboxylic


plasma membrane 


Seq ID 71 & 72


416636 


N32536 


Hs.42645 


solute carrier family 16 (monocarboxylic




Seq ID 73 & 74 


409079 


W87707 


Hs.82065 






Seq ID 75 & 76


442082 


R41823 


Hs.7413 






Seq ID 77 & 78 


400297 


AI127076 


Hs.306201 


hypothetical protein DKFZp55401 278 




Seq ID 79 & 80


451398 


AI793124 


Hs.144479 






Seq ID 81 & 82 


429220 


AW207206 




ESTs 




Seq ID 83 & 84 






Hs.105445 


GDNF family receptor alpha 1 




Seq ID 85 & 86


423242 


AL039402 


Hs.125783 


DEME-6 protein 




Seq ID 87 & 88


423242 


AL039402 


Hs.125783 


DEME-6 protein 




Seq ID 89 & 90


423242 


AL039402 


Hs.125783 


DEME-6 protein 


intracell 


Seq ID 91 & 92 




H26735 


Hs.91668 


Homo sapiens clone PP1498 unknown mRNA 


intracell 


Seq ID 93 & 94


452190 


H26735 


Hs.91668 


Homo sapiens clone PP1498 unknown mRNA 




Seq ID 95 & 96 


325372 






Phase 2 & 3 Exons 


cytoplasmic 


Seq ID 97 & 98 




AA009647 




a disintegrin and metalloproteinase doma 


Seq ID 99 & 100 


426215 


AW963419 


Hs.155223 


stanniocalcin 2 


secreted 


Seq ID 101 & 102 


425247 


NM_005940 Hs.155324


matrix metalloproteinase 11 (stromelysin 


secreted 


Seq ID 103 & 104 


429353 


AL117406


Hs.335891


ATP-binding cassette transporter MRP8 


plasma membrane 


Seq ID 105 & 106 


429353 


AL117406


Hs.335891 


ATP-binding cassette transporter MRP8 


plasma membrane 


Seq ID 107 & 108 


432201 


AI538613 


Hs.298241 


Transmembrane protease, serine 3 


plasma membrane 
secreted 


Seq ID 109 & 110 


427585 


D31152 


Hs.179729


collagen, type X, alpha 1 (Schmid metaph


Seq ID 111 & 112


446163 


M026880 


Hs.25252 


prolactin receptor 


plasma membrane 


Seq ID 113 & 114 


442117 


AW664964 


Hs.128899


ESTs; hypothetical protein for IMAGE:447 


plasma membrane 


Seq ID 115 & 116


428179 


AI127772 


Hs.279696 


serum/glucocorticoid regulated kinase-li




Seq ID 117 & 118 
Seq ID 119 & 120 


431211 


M86849 




gap junction protein, beta 2, 26kD (conn 


plasma membrane 


447033 


AI357412 




ESTs 


secreted 


Seq ID 121 & 122 


447033 


AI357412 


Hs.157601


ESTs 




Seq ID 123 & 124 




AI357412 


Hs.157601


ESTs 




Seq ID 125 & 126 




BE614387 


Hs.333893 


c-Myc target JP01 




Seq ID 127 & 128 


452679 


Z42387 


Hs.83883 


transmembrane, prostate androgen induced




Seq ID 129 & 130 








ephrin-A3 


plasma membrane 


Seq ID 131 & 132 


422048 


NM_012445 Hs.288126


spondin 2, extracellular matrix protein 
transmembrane protease, serine 4 




Seq ID 133 & 134 


410418 


D31382 


Hs.63325 




Seq ID 135 & 136 


446342 


BE298665 


Hs.14846 


Homo sapiens mRNA; cDNA DKFZp564D016(fr 




Seq ID 137 & 138 


422260 


AA315993 


Hs.105484


regenerating gene type IV 
Seq ID 139 & 140
Seq ID 141 & 142
Seq ID 143 & 144
Seq ID 145 & 146
Seq ID 147 & 148










Predicted ca 
LFAP forlysophosphatidic acid phosphata 
ESTs, Weakly similar to MAT8_HUMAN CHLOR 
ESTs, Weakly similar to MAT8_HUMAN CHLOR 
ESTs, Weakly similar to MAT8_HUMAN CHLOR



C15000305:gi|3806122|gb|AAC69198.1|(AFO
C15000305:gi|3806122|gb|AAC69198.1|(AFO
putative chemokine receptor; GTP-binding
putative chemokine receptor; GTP-binding
putative chemokine receptor; GTP-binding
similar to lysosome-associated membrane
small inducible cytokine subfamily B (Cy
interleukin 1 receptor antagonist



Seq ID 253 & 254 
Seq ID 255 & 256 
Seq ID 257 & 258 
Seq ID 259 & 260
Seq ID 261 & 262
Seq ID 263 & 264
Seq ID 265 & 266
Seq ID 267 & 268 
Seq ID 269 & 270 
Seq ID 271 & 272 
Seq ID 273 
Seq ID 274 
Seq ID 275 & 276 
Seq ID 277 & 278 
Seq ID 279 & 280 
Seq ID 281 & 282
Seq ID 283 & 284 
Seq ID 285 & 286 
Seq ID 287 & 288 
Seq ID 289 & 290 
Seq ID 291 & 292 
Seq ID 293 & 294 
Seq ID 295 & 296 
Seq ID 297 & 298



BE259150 Hs.127792 
Hs.12827 
Hs.12827 
NM_001650 Hs.288650
U84540 

Hs.75379 



delta (Drosophila)-like 3



T06430 
AF026547 
AA429104 

U46745 Hs.336678 dystrobrevin, alpha 
AL035737 Hs.75184 
AI034361 Hs.135150 
Hs.144121 
Hs.144121 



424343 AW956360 Hs.47 



U58515 Hs.154138 
AL022327 Hs.74518 
Y15065 HS4975 



lung type-l cell membrane-associated gly 
ESTs, Weakly similar to T46916 hypotheti
ESTs, Weakly similar to T46916 hypotheti 
adenylate cyclase activating polypeptide 



412709 
435615 
404049 

418932 L34059 
404029 
436480 
452401 



Hs.89484 

AJ271643 Hs.87469 

NM_007115 Hs.29352 

NM_007115 Hs.29352 

AF037335 Hs.5338 

U90545 Hs.327179 



NMjj18937":Homosai 
cadherin 4, type 1, R-cadherin (retinal)
NM_018936*:Homo sapiens protocadherin be 



solute carrier family 17 (sodium phospha 

Seq ID 299 & 300
Seq ID 301 & 302
Seq ID 303 & 304
Seq ID 305 & 306
Seq ID 307 & 308
Seq ID 309 & 310 
Seq ID 311 & 312 
Seq ID 313 & 314 
Seq ID 315 & 316 
Seq ID 317 & 318 
Seq ID 319 & 320 
Seq ID 321 & 322 
Seq ID 323 & 324 
Seq ID 325 & 326 
Seq ID 327 & 328 
Seq ID 329 & 330 
Seq ID 331 & 332 
Seq ID 333 & 334 
Seq ID 335 & 336 
Seq ID 337 & 338 
Seq ID 339 & 340 
Seq ID 341 & 342 
Seq ID 343 & 344 
Seq ID 345 & 346 
Seq ID 347 & 348 
Seq ID 349 & 350 
Seq ID 351 & 352
Seq ID 353 & 354 
Seq ID 355 & 356 
Seq ID 357 & 358 
Seq ID 359 & 360 
Seq ID 361 & 362 
Seq ID 363 & 364 
Seq ID 365 & 366 
Seq ID 367 & 368 
Seq ID 369 & 370
Seq ID 371 & 372
Seq ID 373 & 374 
Seq ID 375 & 376 
Seq ID 377 & 378 
Seq ID 379 & 380 
Seq ID 381 & 382 
Seq ID 383 & 384 
Seq ID 385 & 386 
Seq ID 387 & 388 
Seq ID 389 & 390 
Seq ID 391 & 392 
Seq ID 393 & 394 
Seq ID 395 & 396 
Seq ID 397 & 398
Seq ID 399 & 400 
Seq ID 401 & 402 
Seq ID 403 & 404 
Seq ID 405 & 406 
Seq ID 407 & 408 
Seq ID 409 & 410 
Seq ID 411 & 412
Seq ID 413 & 414 
Seq ID 415 & 416 
Seq ID 417 & 418 
Seq ID 419 & 420 
Seq ID 421 & 422 
Seq ID 423 & 424
Seq ID 425 & 426
Seq ID 427 & 428 
Seq ID 429 & 430 
Seq ID 431 & 432 
Seq ID 433 & 434 
Seq ID 435 & 436 
Seq ID 437 & 438 
Seq ID 439 & 440



428296 NM_003058 Hs.183572

423508 AW604297 Hs.129711

450001 NM_001044 Hs.406

410407 X66839 Hs.63287 

453496 AA442103 Hs.33084 

420737 L08096 Hs.99899 

309931 AW341683 

412719 AW016610 Hs.816 

417034 NM_006183 Hs.80962

430486 BE062109 Hs.241551 

413753 U17760 Hs.75517 

425650 NM_001944 Hs.1925



418663 AK001100 

418663 AK001100 

429610 AB024937 

406690 M29540 

431846 BE019924 

422158 L10343 

431958 X63629 

437044 AL035864 

428484 AF104032 

429211 AF052693 

417389 BE260964 

431009 BE149762

417542 J04129 



418462 BE001596 
410274 AA381307 
439606 W79123 



Hs.41690



carbonic anhydrase IX 
solute carrier family 2 (facilitated glu
CD70; tumor necrosis facto- (ligandjs 
gb:hd13d01.x1 Soares_NFLT_GBC_S1 



chloride channel, calcium activated, fam 
laminin, beta 3 (nicein (125kD), kalinin 
desmoglein 3 (pemphigus vulgaris antigen 
matrix metalioproteinai '"' 



protease inhibitor 3, skin-derived (SKAL 
cadherin 3, type 1, P-cadherin (placenta 
differentially expressed in Fanconi's an 




NM_014400 Hs.11950 
NM_005329 Hs.85962 
U88967 Hs.78867 



G protein-coupled receptor 87 
NM_005365:Homo sapiens me 
GPI-anchored metastasis-associated | 
hyaluronan synthase 3
protein tyrosine phosphatase, 
protein tyrosine phosphatase, 
protein tyrosine phosphatase, 



Seq ID 
Seq ID 44: 
Seq ID 445 & 446 
Seq ID 447 & 448
Seq ID 449 & 450
Seq ID 451 & 452
Seq ID 453 & 454 
Seq ID 455 & 456 
Seq ID 457 & 458 
Seq ID 459 & 460 
Seq ID 461 & 462
Seq ID 463 & 464 



415817 
421817 
418678 
418678 
409420 
332180 



407242 
407242 
432596 



U88967 Hs.78867 

AF146074 Hs.108660 

NM_001327 Hs.87225

NM_001327 Hs.87225 

Z15008 Hs.54451 

AF134160 Hs.7327 

AW580227 Hs.47860

AW580227 Hs.47860

AW238299 Hs.250618 

NM_001898 Hs.123114 

AF120274 Hs.194689 

AF120274 Hs.194689 

AF120274 Hs.194689 

AF120274 Hs.194689 

H39960 Hs.288467 

H39960 Hs.288467 

X02419 Hs.77274 

R34008 Hs.239727 

R34008 Hs.239727 

AW583497 Hs.184604

AI693815 Hs.127179 

W94322 Hs.279651 

AW391342 Hs.199460 

AW391342 Hs.199460 



protein tyrosine phosphatase, re 
protein tyrosine phosphata: 
ATP-binding cassette, sub-family C(CFTR 
cancer/testis antigen (NY-ESO-1) 
cancer/testis antigen (NY-ESO-1)
laminin, gamma 2 (nicein (100kD), kalini 

neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
UL16 binding protein 2




cryptic gene 

melanoma inhibitory activity 
DPCR1 protein 
DPCR1 protein 

C9001188*:gi|12738842|ref|NP_073725.1| p



ESTs 



ier family G(neurotransmi:te 



M18728 
M18728 

AJ224741 Hs.278461 

BE395085 Hs.10086 

BE350494 Hs.49753 

H10233 Hs.2265 

NM_003816 Hs.2442

N30714 Hs.325960 

S73265 Hs.1473 

AW470411 Hs.288433 

Z45051 Hs.22920 

AF067797 Hs.176658

W95642 Hs.82961 

AU076718 Hs.164021 



uveal autoantigen with coiled coil domai
secretory granule, neuroendocrine protei
a disintegrin and metalloproteinase doma
membrane-spanning 4-domains, subfamily A
gastrin-releasing peptide
neurotrimin 
similar to S68401 
Seq ID 465 & 466
Seq ID 467 & 468
Seq ID 469 & 470
Seq ID 471 & 472
Seq ID 473 & 474 
Seq ID 475 & 476 






AI694413 Hs.332649 

J05068 Hs.2012 

Y18198 Hs.194725

AW583672 Hs.256311 

AW369771 Hs.52620 

M86699 Hs.169840

AI687303 Hs.285529 

NM_005756 Hs.184942 

NM_005756 Hs.184942 

NM_005756 Hs.184942

NM_005756 Hs.184942

AJ245671 Hs.12844



interferon, alpha-inducible protein (clo
secretogranin II (chromogranin C)
Ubiq 3 rF Flj 
Ubiquitin like protein FAT10??? 



Seq ID 539 & 540 
Seq ID 541 & 542 
Seq ID 543 & 544
Seq ID 545 & 546 
Seq ID 547 & 548 
Seq ID 549 & 550 
Seq ID 551 & 552
Seq ID 553 & 554
Seq ID 555 & 556
Seq ID 557 & 558



NM_006103 Hs.2719 

NM_002407 Hs.97644 

NM_014791 Hs.184339

AU076643 Hs.313

AB018305 Hs.5378 

U23752 Hs.32964 

AW972870 Hs.291069 

AL360204 Hs.283853 

BE393948 Hs.50915 

BE616633 Hs.170195 

AI683243 Hs.97258 

AI683243 Hs.97258 

U25128 Hs.159499

U25128 Hs.159499 

AB002364 Hs.27916 

U62801 Hs.79361 

NM_012152 Hs.258583 

U33635 Hs.90572 

; X69699 Hs.73149 

; X69699 Hs.73149 

I NM 006799 Hs.72026 

! AI077715 Hs.39384 

i AA508552 Hs.1 95839 

! AB037765 Hs.30652 

I N95796 Hs.278695 

I N62096 Hs.293185 

I N62096 Hs.293185 

I N62096 Hs.293185 

I N62096 Hs.293185 

I H18836 Hs.31608 

410001 AB041036 Hs.57771 

418396 AI765805 Hs.26691 

451027 AW519204 Hs.40808 

446057 AI420227 Hs.149358 

433466 AA508353 Hs.105314 

453370 AI470523 Hs.139336

453370 AI470523 Hs.139336

414569 AF109298 Hs.118258 

413435 X51405 Hs.75360 

426501 AW043782 Hs.293616 



integrin, betas 
TTK protein kinase 
G protein-coupled receptor 
G protein-coupled receptor 
G protein-coupled receptor 
G protein-coupled receptor 
G protein-coupled receptor 



frizzled (Drosophila) homolog 6
kallikrein 7 (chymotryptic, stratum corn
kallikrein 8 (neuropsin/ovasin) (KLK8)
HE4; epididymis-specific, whey-acidic pr
mammaglobin 2
KIAA0175 gene product
secreted phosphoprotein 1 (osteopontin,
spondin 1, (f-spondin) extracellular mat
SRY (sex determining region Y)-box 11
ESTs 

Homo sapiens mRNA full length insert cDN 
kallikrein 5 

bone morphogenetic protein 7 (osteogen! 
ESTs, Moderately similar to S29539 ribos
ESTs, Moderately similar to S29539 ribos 
parathyroid hormone receptor 2 
|il hormone rec 
a disintegrin-like and metalloprotease (
kallikrein 6 (neurosin, zyme)
endothelial differentiation, lysophospha 
PTK7 protein tyrosine kinase 7 



protease, serine, 21 (testisin) 

ESTs, Weakly similar to I38022 hypotheti
KIAA1344 protein

Homo sapiens prostein mRNA, complete cds
ESTs, Weakly similar to JC7328 amino aci
ESTs, Weakly similar to JC7328 amino aci
ESTs, Weakly similar to JC7328 amino aci
ESTs, Weakly similar to JC7328 amino aci
hypothetical protein FLJ20041
kallikrein 11
ESTs 



408430 S79876 Hs.44926

445413 AA151342 Hs.12677 

451982 F13036 Hs.27373 

427958 AA418000 Hs.98280 

421887 AW161450 Hs.109201 

425071 NM_013989 Hs.154424

432101 AI918950 Hs.123642

407786 AA687538 Hs.38972 

416836 D54745 Hs.80247 

416539 Y07909 Hs.79368 

131083 Y09763 Hs.22785 

131083 Y09763 Hs.22785

131083 Y09763 Hs.22785 

131083 Y09763 Hs.22785 

422424 AI186431 Hs.296638

428970 BE276891 Hs.194691 

428330 L22524 Hs.2256 

439018 AW300887 Hs.26638 

420610 AI683183 Hs.99348 

425723 NM_014420 Hs.159311



ESTs 

" with EGF-like and 
15 (H777 transport 
hypothetical protein MGC2648 
NM_005656*:Homo sapiens transmembrane pr
NM_005656*:Homo sapiens transmembrane pr
dipeptidylpeptidase IV (CD26, adenosine 
CGI-147 protein 

Homo sapiens mRNA; cDNA DKFZp56401763 (f 



CGI-86 protein 

deiodinase, iodothyronine, type II 
EphA3 



gamma-aminobutyric acid (GABA) A recepto 
gamma-aminobutyric acid (GABA) A recepto 
gamma-aminobutyric acid (GABA) A recepto 
gamma-aminobutyric acid (GABA) A recepto 
prostate differentiation factor 



dickkopf (Xenopus laevis) homolog 4 

Seq ID 629 & 630 456662 NM_002448 Hs.1494
Seq ID 631 & 632 418281 U09550 Hs.1154
Seq ID 633 & 634 429903 AL134197 Hs.93597 



Pkey: Unique Eos probeset identifier number



424399 238961_1 AI906687 AI905624 AI905837 AI905623 AA340069 R75793 W72837 BE074512 AI905633 W72838 BE092421 AI127172 BE186013 AW070916

AI139456 AW176044 AW291950
429220 301384_1 AW207206 AW341473 AA448195 AI951341

450375 83327_1 AA009647 AA131254 M374293 AW954405 H04410 AW606284 AA151166 BE157467 BE157601 H04384 W46291 AW663674 H04021 H01532

AA190993 H03231 H59605 H01642 AA852876 AA113758 AA626915 AA746952 AI161014 AA099554 R69067



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402075 8117407 Plus 

402230 9966312 Minus 

403047 3540153 Minus 

404029 7671252 Plus 108716-111112 

404049 3688074 Minus 75765-78155 

404682 9797231 Minus 40977-41150 

404875 9801324 

404877 1519284 



1553-1712,1878-2140,4252-4385,5922-6077
Table 77 provides Pkey; Seq ID No; Disease Indications; and Preferred Utility for sequences in Table 78. Seq ID No links the information in Table 77 to Table 78.

Seq ID No: Sequence ID No for sequences in table 

Pkey: Unique Eos probeset identifier number 

Disease Indications: Diseases designated for coverage as described in Table 1

Preferred Utility: Preference of utility, based partly upon predicted localization (Ab is antibody; sm is small molecule target; CTL is vaccine target) 
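
Because Seq ID No is the key shared between Table 76A, Table 77, and Table 78, the annotation for a given sequence can be assembled by a simple lookup on that key. The following is a minimal sketch, assuming each table has already been parsed into a Python dictionary keyed by Seq ID No. The field names and the `join_by_seq_id` helper are illustrative only; the two example rows use values that appear in Tables 76A and 77 of this document.

```python
# Rows keyed by Seq ID No; field names are hypothetical labels for the table columns.
table_76a = {
    "1 & 2": {"pkey": "425023", "accession": "AW956889", "unigene_id": "Hs.154210"},
    "17 & 18": {"pkey": "414577", "accession": "AI056548", "unigene_id": "Hs.72116"},
}
table_77 = {
    "1 & 2": {"pkey": "425023", "disease_indications": ["angiogenesis"]},
    "17 & 18": {"pkey": "414577", "disease_indications": ["angiogenesis"]},
}


def join_by_seq_id(left, right):
    """Merge rows that share a Seq ID No; rows present in only one table are skipped."""
    joined = {}
    for seq_id in left.keys() & right.keys():
        row = dict(left[seq_id])
        row.update(right[seq_id])
        joined[seq_id] = row
    return joined


merged = join_by_seq_id(table_76a, table_77)
for seq_id in sorted(merged):
    print(seq_id, merged[seq_id])
```

An inner join is used here so that a row missing from either table is not silently filled with empty fields.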



Pkey Disease Indications 

425023 angiogenesis 

424503 pancreas, prosta' 

429423 angiogenesis 



419172 angiogenesis, renal 

418007 angiogenesis, lung, bladder, fibrosis, head & neck, pancreas, stomach, colon, ov
418007 angiogenesis, lung, bladder, fibrosis, head & neck, pancreas, stomach, colon, ov



Seq ID No 

Seq ID 1 & 2
Seq ID 3 & 4
Seq ID 5 & 6

Seq ID 9 & 10
Seq ID 11 & 12
Seq ID 13 & 14
Seq ID 15 & 16

Seq ID 17 & 18 414577 angiogenesis

Seq ID 19 & 20 418738 angiogenesis, lung, ovarian, bladder & stomach, pancreas, uterine

Seq ID 21 & 22 428368 angiogenesis, head & neck, stomach

Seq ID 23 & 24 415138 angiogenesis, pancreas, stomach, lung, uterine

Seq ID 25 & 26 429276 angiogenesis, bladder, glioblastoma

Seq ID 27 & 28 418994 prostate, angiogenesis

Seq ID 29 & 30 407975 angiogenesis, renal

Seq ID 31 & 32 429113 angiogenesis, bladder and stomach

Seq ID 33 & 34 418506 angiogenesis, ovarian, glioblastoma, uterine, lung, bladder, pancreas

Seq ID 35 & 36 423961 breast, colon, bladder, lung, fibrosis pancreas, head and neck, ovarian

Seq ID 37 & 38 414812 breast, bladder, lung, fibrosis, pancreas, colon, head and neck, cervical, stomach, rt

Seq ID 39 & 40 417433 pancreas, breast, bladder, lung, head & neck, cervical, uterine, ovarian, stomach, rei

Seq ID 41 & 42 417433 pancreas, breast, bladder, lung, head & neck, cervical, uterine, ovarian, stomach, rei



Seq ID 45 & 46 422867
Seq ID 47 & 48 428227
Seq ID 49 & 50 444381
Seq ID 51 & 52 439569
Seq ID 53 & 54 411558
Seq ID 55 & 56 400303
Seq ID 57 & 58 411789
Seq ID 59 & 60 428698
Seq ID 61 & 62 450098
Seq ID 63 & 64 421552
Seq ID 65 & 66 452747 
Seq ID 67 & 68 415539 
Seq ID 69 & 70 416636 
Seq ID 71 & 72 416636 
Seq ID 73 & 74 409079 
Seq ID 75 & 76
Seq ID 77 & 78 400297 
Seq ID 79 & 80 451398 
Seq ID 81 & 82 429220 
Seq ID 83 & 84 421524 
Seq ID 85 & 86 423242 
Seq ID 87 & 88 423242 
Seq ID 89 & 90 423242 
Seq ID 91 & 92 452190 
Seq ID 93 & 94 452190 
Seq ID 95 & 96 325372 
Seq ID 97 & 98 450375 
Seq ID 99 & 100 426215 
Seq ID 101 & 102 425247 
Seq ID 103 & 104 429353
Seq ID 105 & 106 429353
Seq ID 107 & 108 432201
Seq ID 109 & 110 427585
Seq ID 111 & 112 446163
Seq ID 113 & 114 442117
Seq ID 115 & 116 428179
Seq ID 117 & 118 431211 
Seq ID 119 & 120 447033 
Seq ID 121 & 122 447033 
Seq ID 123 & 124 447033
Seq ID 125 & 126 115522
Seq ID 127 & 128 452679 
Seq ID 129 & 130 446051 
Seq ID 131 & 132 422048 
Seq ID 133 & 134 410418 
Seq ID 135 & 136 446342 
Seq ID 137 & 138 422260 
Seq ID 139 & 140 409041 
Seq ID 141 & 142 409041 
Seq ID 143 & 144 111929 
Seq ID 145 & 146 111929 
Seq ID 147 & 148 111929 



breast o 



in, prostate, pancreas, lung, colon, uterine 
I, bladder, ovarian, head & neck, fibrosis, colon, stomach, cervical 
in, bladder, lung, pancreas, head & neck, ovarian, stc



pancreas, prostate, stomach, breast, uterine, cervical, ovarian 
breast, ovarian, pros, stomach, uterine, bladder lung head & neck
pancreas, lung, breast, stomach, head & neck, ovarian, uterine 



breast, lung, stomach, uterine 
breast, ovarian, pancreas, cervical, uterine, prostate, lung, stomach, head & neck
breast, bladder, lung, head & neck, ov; 



442082 breast, prostate, o\ 



j lung, colon 
breast, lung, renal, colon, ovarian, uterine 
breast, ovarian, lung, colon, pancreas, head & neck, stomach,



lo, ovarian, stomach, pancreas, uterine, cei 



breast, lung, bladder, pancreas, head & neck, sloi 

colon, bladder, lung, pancreas, head & neck
colon, prostate, fibrosis, breast, lung, pancreas
colon, prostate, fibrosis, breast, lung, pancreas
colon, prostate, fibrosis, breast, lung, pancreas
colon, lung, bladder, pancreas 
prostate, colon, pancreas, Taxol prostate 
colon, breast 



colon, ovarian n 

uterine, ovarian, lung, colon, stomach, head & neck, breast, pancreas
uterine, ovarian, lung, colon, stomach, head & neck, breast, pancreas
colon, breast, fibrosis
colon, breast, fibrosis
colon, breast, fibrosis



Ab, sm, imaging 
Ab, sm, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, imaging
Ab, CTL, imaging
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, sm, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic 
Ab, sm, imaging 
CTL 

Ab, sm, CTL, diagnostic 
Ab, imaging 
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, imaging 
Ab, sm, CTL, imaging
CTL 

Ab, CTL, imaging 
Ab, sm, CTL, imaging
Ab, CTL, imaging



Ab, sm, CTL, imaging 
Ab, sm, CTL, imaging 
Ab, sm, CTL, imaging 



Ab, sm, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, sm, imaging
Seq ID 1£
Seq ID H 
Seq ID H 

Seq ID 1f 
Seq ID1E 



Seq ID 18' 
Seq ID 18. 
Seq ID W 
Seq ID 18 
Seq ID 18 
Seq ID " 
Seq ID 
Seq ID 
Seq ID 
Seq ID 



i 152 420159 
1 154 422330 
56 452461 




Ab, sm, CTL, diagnostic 



1 95 404875 
1 97 425883 
1 99 404977 



bladder, lung 

bladder, lung, head & neck, cervical 
bladder, pancreas, stomach, ovarian, lung 
bladder, ovarian, lung 



bladder, lung, head & neck
bladder, lung, head & neck
bladder, lung, cervical 



Seq ID 200 & 201 420876 
Seq ID 202 & 203 427747 
Seq ID 204 & 205 420281 
Seq ID 206 & 207 446673 
Seq ID 208 437553 
Seq ID 209 & 210 437553 
Seq ID 211 & 212 437553 
Seq ID 213 & 214 426900 
Seq ID 215 & 216 426900 
Seq ID 217 & 218 426900 
Seq ID 219 & 220 405932 
Seq ID 221 & 222 405932 
Seq ID 223 & 224 424008 
Seq ID 225 & 226 424008 
Seq ID 227 & 228 424008 
Seq ID 229 & 230 444342 
Seq ID 231 & 232 421379 
Seq ID 233 & 234 417079 
Seq ID 235 & 236 412986 
Seq ID 237 & 238 412986 
Seq ID 239 & 240 412986 
Seq ID 241 & 242 447072 
Seq ID 243 & 244 419723 
Seq ID 245 & 246 419723 
Seq ID 247 & 248 430890 
Seq ID 249 & 250 456759 
Seq ID 251 429466 
Seq ID 252 429466 
Seq ID 253 & 254 419721 
Seq ID 255 & 256 407034 
Seq ID 257 & 258 413472 
Seq ID 259 & 260 438380 
Seq ID 261 & 262 426271 
Seq ID 263 & 264 419704 
Seq ID 265 & 266 444471 
Seq ID 267 & 268 409395 
Seq ID 269 & 270 413063 
Seq ID 271 & 272 433800 
Seq ID 273 458435 
Seq ID 274 458435 
Seq ID 275 & 276 424343 
Seq ID 277 & 278 424998
Seq ID 279 & 280 412709 
Seq ID 281 & 282 435615 
Seq ID 283 & 284 404049 
Seq ID 285 & 286 418932 
Seq ID 287 & 288 404029 
Seq ID 289 & 290 436480 
Seq ID 291 & 292 452401 
Seq ID 293 & 294 452401 
Seq ID 295 & 296 436895 
Seq ID 297 & 298 421471 
Seq ID 299 & 300 428296 
Seq ID 301 & 302 423508
Seq ID 303 & 304 450001 
Seq ID 305 & 306 410407 
Seq ID 307 & 308 453496 



bladder, lung, he: 
bladder, lung, hei 

bladder, head & neck, stomach, cervical
bladder, head & neck, stomach, cervical
bladder, head & neck, stomach, cervical 
hepatitis C, lung, fibrosis, bladder 
breast, pancreas, head & neck, lung, stomach, bladder, ce
bladder, lung, her ' " ~ ' 



glioblastoma, lung, cervical, bladder 



glioblastoma 
glioblastoma, lung, colon 
glioblastoma 

glioblastoma, ovarian, bladder, lung
glioblastoma, lung 

glioblastoma 



glioblastoma 
glioblastoma 

bladder, breast, pancreas, head & neck, stomach, lung, art
' \ neck, stomach, lung, art 



renal, lung 
renal, lung, colon, 
renal, prostate 



CTL, diagnostic 

Ab, sm, CTL, imaging 

Ab, sm, CTL, imaging 

sm, CTL

Ab, CTL, imaging

Ab, sm, CTL, diagnostic

Ab, sm, CTL, imaging

sm, CTL

Ab, sm, imaging

sm, CTL

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging



Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
sm, CTL

Ab, CTL, diagnostic 
Ab, sm, CTL, imaging
Ab, CTL, imaging, diagnostic
Ab, sm, CTL, imaging



Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, CTL, diagnostic, imaging 
Ab, CTL, diagnostic 

Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, imaging



Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
sm, CTL 



Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Seq ID 309 & 310 420737
Seq ID 311 & 312 309931
Seq ID 313 & 314 412719
Seq ID 315 & 316 417034
Seq ID 317 & 318 430486
Seq ID 319 & 320 413753
Seq ID 321 & 322 425650 
Seq ID 323 & 324 423673 
Seq ID 325 & 326 418663 
Seq ID 327 & 328 418663 
Seq ID 329 & 330 429610 
Seq ID 331 & 332 406690 
Seq ID 333 & 334 431846 
Seq ID 335 & 336 422158 
Seq ID 337 & 338 431958 
Seq ID 339 & 340 437044 
Seq ID 341 & 342 428484 
Seq ID 343 & 344 429211 
Seq ID 345 & 346 417389 
Seq ID 347 & 348 431009
Seq ID 349 & 350 417542 
Seq ID 351 & 352 449230 
Seq ID 353 & 354 410555 
Seq ID 355 & 356 410555 
Seq ID 357 & 358 424687 
Seq ID 359 & 360 418462 
Seq ID 361 & 362 410274 
Seq ID 363 & 364 439606 
Seq ID 365 & 366 404877 
Seq ID 367 & 368 444781 
Seq ID 369 & 370 109424 
Seq ID 371 & 372 415817
Seq ID 373 & 374 415817 
Seq ID 375 & 376 415817 
Seq ID 377 & 378 415817 
Seq ID 379 & 380 415817 
Seq ID 381 & 382 415817 
Seq ID 383 & 384 421817 
Seq ID 385 & 386 418678
Seq ID 387 & 388 418678 
Seq ID 389 & 390 409420 
Seq ID 391 & 392 332180 
Seq ID 393 & 394 408790 
Seq ID 395 & 396 408790 
Seq ID 397 & 398 439223 
Seq ID 399 & 400 409757 
Seq ID 401 & 402 428969 
Seq ID 403 & 404 428969 
Seq ID 405 & 406 428969 
Seq ID 407 & 408 428969 
Seq ID 409 & 410 450701 
Seq ID 411 & 412 450701
Seq ID 413 & 414 414774 
Seq ID 415 & 416 407944 
Seq ID 417 & 418 407944
Seq ID 419 & 420 428486 
Seq ID 421 & 422 457489 
Seq ID 423 & 424 432874 
Seq ID 425 & 426 445891
Seq ID 427 & 428 445891 
Seq ID 429 & 430 404682 
Seq ID 431 & 432 429547 
Seq ID 433 & 434 425921 
Seq ID 435 & 436 407242 
Seq ID 437 & 438 407242 
Seq ID 439 & 440 407242 
Seq ID 441 & 442 432596 
Seq ID 443 & 444 444006 
Seq ID 445 & 446 423685 
Seq ID 447 & 448 428392 
Seq ID 449 & 450 429597 
Seq ID 451 & 452 448030 
Seq ID 453 & 454 422109 
Seq ID 455 & 456 419235 
Seq ID 457 & 458 449048 
Seq ID 459 & 460 427333 
Seq ID 461 & 462 417931 
Seq ID 463 & 464 419216 
Seq ID 465 & 466 431629 
Seq ID 467 & 468 413554 
Seq ID 469 & 470 452194 
Seq ID 471 & 472 452194 
Seq ID 473 & 474 426322 



& neck, bladder, glioblastoma, cervical 




ad & neck, pancreas, stomach, bladder, colon, cervical 
bladder, head & neck, uterine, cervical, stomach, ovarian 
I & neck, bladder, lung, cervical, stomach 



in, lung, bladder, uterine, cervical, pancreas, stomach 
lung, bladder, head & neck 

lung, cervical, head & neck, bladder, ovarian, colon

head & neck, pancreas, lung, uterine, cervical, colon, stomach 



bladder, head & neck, cervical
bladder, head & neck, cervical



glioblastoma, head & neck, cervical, fibrosis
glioblastoma, head & neck, cervical, fibrosis
glioblastoma, head & neck, cervical, fibrosis
glioblastoma, head & neck, cervical, fibrosis
lung, glioblastoma, head & neck, cervical, fibrosis 
lung, glioblastoma, head & neck, cervical, fibrosis 
lung, cervical, head & neck & bladder 
lung, bladder, stomach, ovarian, pancreas & cervical 
lung, bladder, stomach, ovarian, pancreas & cervical 
lung, head & neck, pancreas, stomach, cervical, bladder 



pancreas, stomach, lung, bladder, sto 



lung, head & neck 

pancreas, prostate, lung 
pancreas, stomach 
stomach, pancreas, ovarian 
stomach, pancreas, ovarian 
pancreas 

pancreas, head & neck, lung, ovarian 
stomach, pancreas 

pancreas, colon, bladder, head & neck, stomach, lung, ovarian, ce 



pancreas, lung, colon 
pancreas, fibrosis, head & neck & lung 
pancreas, ovarian, uterine, glioblastoma, head & neck & lung 
pancreas, colon 

ovarian, pancreas, stomach, colon, uterine, prostate 
pancreas, lung, stomach, cervical, prostate, head & neck 



CTL, diagnostic 

Ab, imaging 

Ab, CTL, diagnostic 



m, imaging 
. m, diagnostic 
Ab, imaging 



CTL 

Ab, imaging 

Ab, sm, CTL, imaging 

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

CTL 

CTL 

CTL, diagnostic 
Ab, sm, imaging 
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, imaging 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 



Ab, sm, imaging
Ab, sm, imaging
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, imaging 
Ab, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
CTL 

Ab, CTL, imaging 



Ab, CTL, diagnostic 
Ab, CTL, imaging 
Ab, CTL, imaging 



Ab, CTL, diagnostic 
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Seq ID 475 & 476 429010
Seq ID 477 & 478 431462
Seq ID 479 & 480 448243
Seq ID 481 & 482 426427
Seq ID 483 & 484 428187
Seq ID 485 & 486 428579
Seq ID 487 & 488 428579
Seq ID 489 & 490 428579
Seq ID 491 & 492 428579
Seq ID 493 & 494 445537
Seq ID 495 & 496 422278
Seq ID 497 & 498 424620
Seq ID 499 & 500 406400
Seq ID 501 & 502 431130
Seq ID 503 & 504 420440
Seq ID 505 & 506 428450
Seq ID 507 & 508 446619
Seq ID 509 & 510 436982
Seq ID 511 & 512 453392
431989 



pancreas, lung, glioblastoma 

ovarian, uterine, lung, stomach, head & neck, glioblastoma, pancreas 
ovarian, lung, head & neck, cervical, colon, uterine, stomach



ovarian, Ewing, ut 
ovarian, Ewing, uterine, pancreas 

ovarian, bladder, uterine, breast, lung, head & neck, renal, fibrosis, pancreas, ct
ovarian, head & neck, bladder, cervical, lung 



ovarian, uterine 

ovarian, uterine, cervical 

ovarian, cervical, pancreas, lung 

ovarian, fibrosis, pancreas, head & neck, lung, colon 



SeqID 
Seq ID 515 & 516 409178
Seq ID 517 & 518 426514
Seq ID 519 & 520 421478
Seq ID 521 & 522 421478
Seq ID 523 & 524 425776
Seq ID 525 & 526 425776
Seq ID 527 & 528 452097
Seq ID 529 & 530 416530
Seq ID 531 & 532 431515
Seq ID 533 & 534 419452
Seq ID 535 & 536 412078
Seq ID 537 & 538 412078
Seq ID 539 & 540 411773
Seq ID 541 & 542 407792
Seq ID 543 & 544 431616
Seq ID 545 & 546 452792
Seq ID 547 & 548 400294
Seq ID 549 & 550 432653
Seq ID 551 & 552 432653
Seq ID 553 & 554 432653
Seq ID 555 & 556 432653
Seq ID 557 & 558 400290
Seq ID 559 & 560 410001
Seq ID 561 & 562 418396
Seq ID 563 & 564 451027
Seq ID 565 & 566 446057
Seq ID 567 & 568 433466
Seq ID 569 & 570 453370
Seq ID 571 & 572 453370
Seq ID 573 & 574 414569
Seq ID 575 & 576 413435
Seq ID 577 & 578 426501
Seq ID 579 & 580 448999
Seq ID 581 & 582 408369
Seq ID 583 & 584 412628
Seq ID 585 & 586 403047
Seq ID 587 & 588 403047
Seq ID 589 & 590 408430
Seq ID 591 & 592 445413
Seq ID 593 & 594 451982
Seq ID 595 & 596 427958
Seq ID 597 & 598 421887
Seq ID 599 & 600 425071
Seq ID 601 & 602 432101
Seq ID 603 & 604 407786
Seq ID 605 & 606 416836
Seq ID 607 & 608 416539
Seq ID 609 & 610 131083
Seq ID 611 & 612 131083
Seq ID 613 & 614 131083
Seq ID 615 & 616 131083
Seq ID 617 & 618 422424
Seq ID 619 & 620 428970
Seq ID 621 & 622 428330
Seq ID 623 & 624 439018
Seq ID 625 & 626 420610
Seq ID 627 & 628 425723
Seq ID 629 & 630 456662 
Seq ID 631 & 632 418281 
Seq ID 633 & 634 429903 



in, lung, glioblastoma 

in, uterine, cervical, breast, prostate 

ovarian, colon, bladder, lung, cervical 
ovarian, bladder 
ovarian, bladder 



prostate, lung 
prostate, lung 
prostate, lung 
prostate, colon 

ovarian, prostate, uterine, cervical, lung 
prostate, uterine, glioblastoma 



prostate, lung, fibrosis, uterine, glioblastoma, cervical, ovarian 




prostate, colon, stomach, uterine, cervical, head & neck, pa
prostate, pancreas 

prostate, colon, utr~' — ' '-" ' 



ZD1839 resistant cai 
androgen withdrawal prostate 
androgen withdrawal prostate 
androgen withdrawal prostate 
androgen withdrawal prostate 



>n, stomach, lung 
m, fibrosis, prostate, pancreas, lung, bladder, head & neck



sm, CTL
Ab, diagnostic
Ab, sm, imaging
sm, CTL

Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic 



Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, imaging 
Ab, CTL, imaging 
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, diagnostic
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
CTL 
CTL 

Ab, sm, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, diagnostic
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
diagnostic 
CTL 

Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, diagnostic
Ab, sm, imaging
Ab, sm, imaging
Ab, CTL, diagnostic 
Ab, sm, CTL, imaging 
Ab, sm, CTL, imaging 
Ab, sm, CTL, imaging 
Ab, sm, CTL, imaging 
Ab, sm, CTL, imaging 
Ab, CTL, diagnostic 
Ab, sm, imaging



an endometrioid, uterine, colon 



i Lggcagca gcaagatgcg Lgcgagccg Lcagatccc gggctctccg 

r cgccctgctt gagcgaggct gcggtttccg aggccctctc cagccaagga 

G GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 

CTCGCCTCGC CCTCTAGCGT TCGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 

GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCACGGCA GCTCGGTCTC 

TGACTACGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 
CAGCGCGGAC AAGGAGAACA G 

T TGGAAAAOCA A 

CCGACCCATG TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 




I GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG T 
CTACCACAAG CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 

CGTCATTCTG TACTGCAGAA TCTACTCCTT GGTCAGGACT CGGAGCCGCC G 

CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAG TCGCTGGCGC T 

CGTAATTATC GTCCTGAGCG TCTTCATCGC CTGCTGGGCA CCGCTCTTCA TCCTGCTCCT 1080 

GCTGGATGTG GGCTGCAAGG TGAAGACCTG TGACATCCTC TTCAGAGGGG AGTACTTCCT 1140 

GGTGTTAGCT GTGCTCAACT CCGGCACCAA CCCCATCATT TACACTCTGA CCAACAAGGA 1200

GATGCGTOGG GCCTTCATCC GGATCATGTC CTGCTGCAAG TGCCCGAGCG GAGACTCTGC 1260

TGGCAAATTC AAGCGACCCA TCATCGCCGG CATGGAATTC AGCCGCAGCA AATCGGACAA 1320 

TTCCTCCCAC CCCCAGAAAG ACGAAGGGGA CAACCCAGAG ACCATTATGT CTTCTGGAAA 1380

CGTCAACTCT TCTTCCTAGA ACTGGAAGCT GTCCACCCAC CGGAAGCGCT CTTTACTTGG 1440 

TCGCTGGCCA CCCCAGTGTT TGGAAAAAAA TCTCTGGGCT TCGACTGCTG CCAGGGAGGA 1500

GCTGCTCCAA GCCAGAGGGA GGAAGGGGGA GAATACGAAC AGCCTGGTGG TGTCGGGTGT 1560

TGGTGGGTAG AGTTAGTTCC TGTGAACAAT GCACTGGGAA GGGTGGAGAT CAGGTCCCGG 1620 

CCTGGAATAT ATATTCTACC CCCCTGGAGC TTTGATTTTG CACTGAGCCA AAGGTCIAGC 1680

ATTGTCAAGC TCCTAAAGGG TTCATTTGGC CCCTCCTCAA AGACTAATGT CCCCATGTGA 1740 

AAGCGTCTCT TTGTCTGGAG CTTTGAGGAG ATGTTTTCCT TCACTTTAGT TTCAAACCCA 1800 

AGTGAGTGTG TGCACTTCTG CTTCTTTAGG GATGCCCTGT ACATCCCACA CCCCACCCTC 1860



TAGGCTGTGG GAAGATGAAG ATGGTTTGGA GGTGTAAAAC AATGTCCTTC GCTGAGGCCA 2040

T GTAAGCGGGA TCCGTTTTTT GGAATTTGGT TGAAGTCACT TTGATTTCTT 2100 

HATGTGTTA CCATTTCATA TCCATTGAAG CCGAAATCTG 2160 

CATAAGGAAG CCCACTTTAT CTAAATGATA TTAGCCAGGA TCCTTGGTGT CCTAGGAGAA 2220 

T GAAAACCGAA TGGATTAACT TTTGCAAACC AAGGGAGATT 2280 

\ TCCGTCTTTC CCACTTTTGT TGATGTTTAT 2340 

TTCAGAATCT TGTGTGATTC ATTTCAAGCA ACAACATGTC GTATTTTGTT GTGTTAAA^G 2400 

TACTTTTCTT GATTTTTGAA TGTATTTGTT TCAGGAAGAA GTCATTTTAT GGATTTTTCT 2460 

AACCCGTGTT AACTTTTCTA GAATCCACCC TCTTGTGCCC TTAAGCATTA CTTTAACTGG 2520 

TAGGGAACGC CAGAACTTTT AAGTCCAGCT ATTCATTAGA TAGTAATTGA AGATATGTAT 2580 

AAATATTACA AAGAATAAAA ATATATTACT GTCTCTTTAG TATGGTTTTC AGTGCAATTA 2640 

AACCGAGAGA TGTCTTGTTT TTTTAAAAAG AATAGTATTT AATAGGTTTC TGACTTTTGT 2700 
T TTAAACATTA ATAAACTGAT TTTTTTAAAG 



Protein Accession # 



[jFILLLLDV GCKVKTCDIL 



3 DNA sequence 
3149 

1 11 21 31 41 51 

I I I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 
CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 
GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 
GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA
CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 
TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 
CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 
TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 
AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 
GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 
GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960

rGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

; ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080
Pl GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 



TTTCCATGCC 2040



GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 
CTCAAGCCTC C 



Protein Accession #: NP_002196.1

1 11 21 31 41 51 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF
GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL
LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST
DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ
IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 
GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 
EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 
QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 
VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN A 
GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY L 
LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 
GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 
FAVNQSRLLV CDLGHPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNMSQSDW 
SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI
SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 
SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 
YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL
YKLGFFKRSL PYGTAMEKAQ L 

Seq ID NO: 5 DNA sequence
AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC CTTACATTAG CACAACACCA 600

GCTAAGCTCA GGAACCCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT TAGCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACTTGTTGG AAAACAGCGC 720

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG GAGGAATGTT ACACGGCTGC TGGTGTTTTC CACAGATGCC 840 

GGGTTTCACT TTGCTGGAGA TGGGAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 900

TGTCACCTGG AAAATAATAT GTACACAATG AGCCATTATT ATGATTATCC TTCTATTGCT 960 

CACCTTGTCC AGAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTGCAGT TACTGAAGAA 1020

TTTCAGCCTG TTTACAAGGA GCTGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080 

TCTGCAAATT CTAGCAATGT AATTCAGTTG ATCATTGATG CATACAATTC CCTTTCCTCA 1140 

T TGGAAAACGG CAAATTGTCA GAAGGAGTAA CAATAAGTTA CAAATCTTAC 1200
3 GGGTGAATGG A 



C TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 
AATGGGACAT TTGAGTGTGG CGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 



GTGTGATAA TTTCAACTGT 



GCCAGCAACG GACAGATCTG CAATGGCCGG GGCATCTGCG AGTGTGGTGT CTGTAAGTGT 



GCTGAGCATA A 

TGCACACAGG AATGTTCCTA TTTTAACATT A 
CAGCCGGTCC AACCTGATCC TGTGTCCCAT TGTAAGGAGA AGGATGTTGA CGACTGTTGG 2100 
TTCTATTTTA CGTATTCAGT GAATGGGAAC AACGAGGTCA TGGTTCATGT TGTGGAGAAT 2160 
CCAGAGTGTC CCACTGGTCC AGACATCATT CCAATTGTAG CTGGTGTGGT TGCTGGAATT 2220 
GTTCTTATTG GCCTTGCATT ACTGCTGATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 
AGGGAGTTTG CTAAATTTGA AAAGGAGAAA ATGAATGCCA AATGGGACAC GGGTGAAAAT 2340 
CCTATTTATA AGAGTGCCGT AACAACTGTG GTCAATCCGA AGTATGAGGG AAAATGA 

Seq Ii 



Protein Accession #: NP_002202.1 

1 11 21 31 41 51 

MNLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQECMPT 
SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTAEKLKPED ITQIQPQQLV 
LRLRSGEPQT FTLKFKRAED YPIDLYYLMD LSYSMKDDLE NVKSLGTDLM NGMRRITSDF
RIGFGSFVEK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNELVGKQR 
ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GFHFAGDGKL GGIVLPNDGQ 
CHLENNMYTM SHYYDYPSIA HLVQKLSENN IQTIFAVTEE FQPVYKELKN LIPKSAVGTL
SANSSNVIQL IIDAYNSLSS EVILENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 



E DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 



PIYKSAVTTV VNPKYEGK 






AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC C 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 



CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA 



TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG A 

;gggatc tgagatgcca gccaagtgtg a 
l ctctgagggg agaatatctg t 
ttggcgaaga tcccactgga accctgaacc tgaatttcat ttgatttctg cattttggcc 
ctctcttcca tcatatttgg atgctgcata tgaagttaac a 
ttttaaagga aatgagtt ct gggccatcag aggaaa1 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 
1 ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 



T GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 

Seq ID NO: 8 Protein sequence



DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK



CTCATAACCC TCAGTGGGCT TCAAGTGATC ATCTGCCTCA TCAGAGTAGT CATGCAACTA 720 

TCCAAGATAC TGTGTGGAAG CTATTCAGTG ATCTTCCAGC CTGGAATCAT TTGAATAAGG 780

ACAAAATGTT TTCCATTATC AAGACATGGC CATCTATCTA AATATTATAT CAACTGTGTA 840 

GACTTGAGGG CAATATTGAA ATGATGGTGC TTTCTGCATT TGGTGTTTAT TTGTAAAAAA 900

TTTGCAGTCC TCACTGCACA TGCAAGTATA CCACCCTTCC ATTTAGTATG TTTTTTAAGT 960 

AATATGCATC AGAAACTTCA GAAATACTTC TGCCCTTTGA TCAAACAAAT CCATTTCCAA 1020 

GAATCTGTAC TAGGGAAGTA AATAAGAATA TGAGAGAAAC CTTTATGCAA ATATGTATAT 1080

TGCAACATTA TTTAATATTC TGGAAAATTG GAAACACCCC AAAATTCTAA ACTCAGAGGA 1140 

AGGATTAAGT AAAGAGTGGT ACATACTGTA AATGTTTTCT GATATTAAAA AAAAAATTAA 1200 
ATAAAAAATA AAGAGTACTA CATGGTTGTA AAA 

Seq ID NO: 10 Protein 



MGSRKCGGCL SCLLIPLALW SIIVNILLYF PNGQTSYASS NKLTNYVKYF EGICFSGIMM 

LIVTTVLLVL ENNNNYKCCQ SENCSKKYVT LLSIIFSSLG IAFSGYCLVI SALGLVQGPY 

CRTLDGWEYA FBGTAGRFLT DSSIWIQCLE PAHWBWNII LFSILITLSG LQVIICLIRV
VMQLSKILCG SYSVIFQPGI I 

Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



G GAAACCAGAT 



G ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCI 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC T 
CTTGCTCATG CTTTTCAACC AGGCCCAGGT AT TGGAGGGG A 
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC G 
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 



I I I 1 I I 

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEB

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 

LAHAFQPGPG IGGDAEFDED ERWTMNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 



GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 
Seq ID NO: 13 DNA sequence
Coding sequence: 1..1409 

1 11 21 31 41 51 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380
AATAGCTGGT TCAACTGCAG GAAAAATTAG 



I I I I I 1 

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV
VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 
YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 
LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 
TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 
FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 
PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Nucleic Acid Accession #: FGENESH predicted



GCGGGATCCT CACTGACGTC ACGCTGCTGG TTGGCGGGCA ACCCCTCAGA GCACACAAGG 
CAGTTCTCAT CGCCTGCAGT GGCTTCTTCT ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 
GGGTGGACGT GCTCTCTCTG CCCGGGGGTC CCGA^GCGAG AGC-CTTCGCC CCTCTATTGG 
ACTTCATGTA CACTTCGCGC CTGCGCCTCT CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 
CCGCCACCTA TTTGCAGATG GAGCACGTGG TCCAGGCATG CCACCGCTTC ATCCAGGCCA 
GCTATGAACC TCTGGGCATC TCCCTGCGCC CCCTGGAAGC AGAACCCCCA ACACCCCCAA 
CGGCCCCTCC ACCAGGTAGT CCCAGGCGCT CCGAAGGACA CCCAGACCCA CCTACTGAAT 
CTCGAAGCTG CAGTCAAGGC CCCCCCAGTC CAGCCAGCCC TGACCCCAAG GCCTGCAACT 
GGAAAAAGTA CAAGTACATC GTGCTAAACT CTCAGGCCTC CCAAGCAGGG AGCCTGGTCG 
GAAAGAGGAC TGCTAAGCCT CGCATTCCAT CCCAATTACA AGAAAAATGG AAAGTTGTAT 900

GTGTCCTATA CCACCAACCA AGAACGGTGG GCTATCGGGC CTCATGACCA CATTCTTAGG 960 

GTTGTGGAAT ACACAGTATC CAGAAAAAAT CCACACCAAG TTGATTTGAG AACAGCCAGA 1020

GTCTTTCTTG AAGTTGCAGA ACTCCACAGA AAGCATCTGG GAGGACAACT GCTCTTTGGC 1080 

CCTGACGGCT TTTTGTACAT CATTCTTGGT GATGGGATGA TTACACTGGA TGATATGGAA 1140

GAAATGGATG GGTTAAGTGA TTTCACAGGC TCAGTGCTAC GGCTGGATGT GGACACAGAC 1200

ATGTGTAACG TGCCTTATTC CATACCAAGG AGCAACCCAC ACTTCAACAG CACCAACCAG 1260 

CCCCCCGAAG TGTTTGCTCA TGGGCTCCAC GATCCAGGCA GATGTGCTGT GGATAGACAT 1320 




GGAAAGTGCT GCTGCAGTCC AGGCTGGGAG GGGGACTTCT GCAGAACTGC AAAATGTGAG 1920 

CCAGCATGTC GTCATGGAGG TGTCTGTGTT AGACCGAACA AGTGCCTCTG TAAAAAAGGA 1980 

TATCTTGGTC CTCAATGTGA ACAAGTGGAC AGAAACATCC GCAGAGTGAC CAGGGCAGAC 2040

ATCACCTAG 



SLPHSPERBV LERDLVLPLL CKDYCKEPPY TCRGHIPGFL QTTADEFCFY YARKDGGLCF 180 



VSYTTNQERW AIGPHDHILR VVEYTVSRKN PHQVDLRTAR VFLEVAELHR KHLGGQLLFG
PDGFLYIILG DGMITLDDME EMDGLSDFTG SVLRLDVDTD MCNVPYSIPR SNPHFNSTNQ
PPEVFAHGLH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARIDQIIK GKDYESEPSL
LEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFLT LQQSPVTKQW QEKPLCLGTS 
GSCRGYFSG1I ILGFGEDELG EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PEECRATVQP 
AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 
YLGPQCEQVD RNIRRVTRAD IT 

Seq ID NO: 19 DNA sequence

Nucleic Acid Accession #: NM_014331.2

Coding sequence: 1..1506



ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 
AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 
AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 
GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 
ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 



T CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT
GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 
CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 
AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 
GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 
TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 
TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 
AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 
TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT G 



ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 
TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA T 



TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 
GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 
TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 
CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 
GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 



TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC 
CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA



TGAAGACTGA GATTTTTCTG 1860



TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG A 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 
TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580



A CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC A
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT G 
GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 
CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 
GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120
CCATCTCCAA AAAAAAAAAA AAAA 

Seq ID NO: 20 Protein sequence
Protein Accession #: NP_055146.1


I I 1 I 1 I 1 I I 

MVRKPVVSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60
GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH Y 
LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI P

SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN E 

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS
NAVAVTFSER LLGNFSIAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV
RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP
FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM
SEKITRTLQI ILEWPEEDK L 



Seq ID NO: 21 DNA sequence 
Nucleic Acid Accession #: NM_002422.2
Coding sequence: 64..1497



ACAAGGAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60

GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180 

AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300

GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCTGA TGTTGGTCAC 360

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

ATGATCTCTT TTGCAGTTAG AGAACATGGA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600 

GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTGGTACC CACGGAACCT 900 

GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

TTCTGGGCCA TCAGAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440 

CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500 

AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740 

AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800 
AGACGATTTG TCAGTTGTTT T



MKSLPILLLL CVAVCSAYPL DGAARGEDTS MNLVQKYLEN YYDLEKDVKQ FVRRKDSGPV 60 

VKKIREMQKF LGLEVTGKLD SDTLEVMRKP RCGVPDVGHF RTFPGIPKWR KTHLTYRIVN 120

YTPDLPKDAV DSAVEKALKV WEEVTPLTFS RLYEGEADIM ISFAVREHGD FYPFDGPGNV 180 

LAHAYAPGPG INGDAHFDDD EQWTKDTTGT NLFLVAAHEI GHSLGLFHSA NTEALMYPLY 240

HSLTDLTRFR LSQDDINGIQ SLYGPPPDSP ETPLVPTEPV PPEPGTPANC DPALSFDAVS 300 

TLRGEILIFK DRHFWRKSLR KLEPELHLIS SFWPSIPSGV DAAYEVTSKD LVFIFKGNQF 360

WAIRGNEVRA GYPRGIHTLG FPPTVRKIDA AISDKEKNKT YFFVEDKYWR FDEKRNSMEP 420 
GFPKQIAEDF PGIDSKIDAV FEEFGFFYFF TGSSQLEFDP NAKKVTHTLK SNSWLNC
GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60

ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 
AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG C 
GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC T 
ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAfl 

GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080 

AAACAACA1A AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140 



MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS
CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM
TCEKFFSGGC HRNRIEKRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 

Seq ID NO:
Nucleic Acid
Coding sequence:



C CGCGGAGGTC CGGGCAGCCA GGGCGGCCGC CGCCGCCGCC ACCGCCGCCC 60 

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG 120 

GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240 

GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 

ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600

CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 

AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 780 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 

ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900

CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960 

CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 1020 

GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT 1080

GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140 

CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 1260

GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 

GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 

CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500

TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560

AACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620

GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740 

ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800 

GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860 

CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920

ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980
TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA C
GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTCT
ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 
TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 



GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 2400
AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460

GAAAAGACCA CCTACATTAR ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 

AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 

CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760

CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820

Seq ID NO: 26 Protein sequence



1 11 21 31 41 51 

I I I 1 I I 

MASPRRSGQP GRPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPRPP PSSPPLSIMG
LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA
VCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFFRTVPSDN
[, KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY G 
[ ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI 

fIGVDFE PLSSKQIKTI SGKTPQQYER E 
GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQVVFRNG 420
ERMGTIKFTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 480
LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 540
GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 600
VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV 660
YAYKGLLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 720
FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 780
NQASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQNH YQELNDILNL 840
GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 900
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 

Seq ID NO: 27 DNA sequence

Nucleic Acid Accession #: NM_000450.1

Coding sequence: 117..1949

1 11 21 31 41 51 

I I I I I I 

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 60 
CCAAAACGGA AAGTATTTCA AGGCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTCATGA 120
TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180
GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 
AAAGGTACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 
TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 
TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGGGCT CCAGGTGAAC 420
CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 480 
TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TGCCCTATGC TACACAGCTG 540 
CCTGTACCAA TACATCCTGC AGTGGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 600
CTTGCAAGTG TGACCCTGGC TTCAGTGGAC TCAAGTGTGA GCAAATTCTG AACTGTACAG 660 
CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 720 
ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 780 
TGCAGTGTAT GTCCTCTGGA GAATGGAGTG CTCCTA 

GTGATGCTGT GACAAATCCA GCCAATGGGT TCGTCGAATG TTTCCAAAAC C 
TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 
AGAGCCTTCA GTGTACCTCA TCTGGGAATT GGGACAACGA GAAGCCAACG TGTAAAGCTG 
TGACATGCAG GGCCGTCCGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTG 
CTGGAGAGTT CACCTTCAAA TCATCCTGCA ACTTCACCTG TGAGGAAGGC TTCATGTTGC 
AGGGACCAGC CCAGGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAGTTT 



TTGTGTTGAA 

GGTGTGCTCA TTCCCCTATT 

AGGAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG C 
CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAG CCTGGCAGTT CCGGGAAAGA 
TCAACATGAG CTGCAGTGGG GAGCCCGTGT TTGGCACTGT GTGCAAGTTC GCCTGTCCTG 
AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG
GCCTGCTACC TACCTGTGAA GCTCCCACTG A 



TACGGAAAGC AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACGGAA 1920 

GCTACCAAAA GCCTTCTTAC ATCCTTTAAG TTCAAAAGAA TCAGAAACAG GTGCATCTGG 1980 

CTTCTTGCCT ACTATGCCAG ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA AGTCCAGCAG GCAAGGACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 2220 

CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2280 

CGCTGTAAAA TCTTGGCACA GAAACACAAT ATTTTGTGGC TTTCTTTCTT TTGCCCTTCA 2340 

CAGTGTTTCG ACAGCTGATT ACACAGTTGC TGTCATAAGA ATGAATAATA ATTATCCAGA 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGCATGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 2520 

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATGC- GATTCAGTGC TTCTTAAAGA 2580 



A CAAGTGTGGT AGGGACTTAA AAAACTTGTA AATGCTGTCA ACTATGATAT 2700 
GGTAAAAGTT ACTTATTCTA GATTACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 2760 
A AAGGGAAACT ATTGTCCCCT AGCAAGG
r TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATGGT AGAATTGGAG 2880
AGTAAAAACT GAATGGAAGG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 2940 
A AAACTTCCAT GAGGCCAAAC GTTTTGAACT AATAAAAGCA TAAATGCAAA 3000
3 TATAATTTTA TGAATGTCTT TGTTGGAAAA GAATACAGAA AGATGGATGT 3060
GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120 
TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 3180 
TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240 
GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300
AACAATTCCA AAGGAATCTC CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360 
AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAP 

r AGCATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGGCT G 

3600 
3660 
3720 
3730 

ATGTGCTTAT GTTT 



1 I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 
SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK
DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC
TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNVV
ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 
AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 
VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 
EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ
WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 
SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 
GSYQKPSYIL 

Seq ID NO: 29 DNA sequence



CTTCCCAGCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 
GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 



CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 
AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 

TCCCTACGGC ACCTTCGGGA TGGATTGCAG AGAGACCTGC AACTGCCAGT CAGGCATCTG 
TGACAGGGGG ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC
; TCACGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 
A GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 
TCCACGCTGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTCGTGA TTGTTCAACA
h CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 



ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCATG T

TAGGTAA TCTTACATGG 

A ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT
3 AAGACTTAAG AGCAGGAGAA

C CTxCCTTGCT AGGTAGCTTC

GTAAATATTT ATATATTTTT ATAAATAAAT GTGTTAGTGC A

ATTTATCATC CTCTTGAGGA AAGAAATCTA G 

AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCATG A 

TATAGGAGTC ACTCTGGATT T 

TAAACATAAG TGCTGTGACT TCGGTGAATT T 
GGAGGTTTGT AAAAGAAGAA TCAATTTTCA G

TGGAATTAGG AGTATATTTG AAAGAATCTT A 

CTTAGGAAAT ATCTCAGAAG TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

AGTATTTACC TGTATTTTAT TCTTGAAGTT GGCCAACAGA GTTGTGAATG TGTGTGGAAG 1920 

GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTGTT TTAAAAGGAC ATGTTTATTA 1980
TTGTTCAATA AAAAAGAACA AGATAC




MKSVLLLTTL LVPAHLVAAW SNNYAVDCPQ HCDSSECKSS PRCKRTVLDD CGCCRVCAAG 
RGETCYRTVS GMDGMKCGPG LRCQPSNGED PFGEEFGICK DCPYGTFGMD CRETCNCQSG
ICDRGTGKCL KFPFFQYSVT K



Protein Accession #: NP_000954 

1 11 21 31 41 

MLARALLLCA VLALSHTANF CCSHPCQNRG VCMSVGFDQY KCDCTRTGFY G 
TRIKLFLKPT PNTVHYILTH FKGFWNVVNN IPFLRNAIMS YVLTSRSHLI DSPPTYNADY
GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NEIVEKLLLR RKFIPDPQGS
NMMFAFFAQH FTHQFFKTDH KRGPAFTNGL GHGVDLNHIY GETLARQRKL RLFKDGKMKY
QIIDGEMYPP TVKDTQAEMI YPPQVPEHLR FAVGQEVFGL VPGLMMYATI WLREHNRVCD
VLKQEHPEWG DEQLFQTSRL ILIGETIKIV IEDYVQHLSG YHFKLKFDPE LLFNKQFQYQ 
NRIAAEFNTL YHWHPLLPDT FQIHDQKYNY QQFIYNNSIL LEHGITQFVE SFTRQIAGRV 
AGGRNVPPAV QKVSQASIDQ SRQMKYQSFN EYRKRFMLKP YESFEELTGE KEMSAELEAL 
YGDIDAVELY PALLVEKPRP DAIFGETMVE VGAPFSLKGL MGNVICSPAY WKPSTFGGEV 
GFQIINTASI QSLICNNVKG CPFTSFSVPD PELIKTVTIN ASSSRSGLDD INPTVLLKER



ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA T 



AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 
TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 
GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 
TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 
GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 



CAGCCCGAGA CCTCCAATAT GTCCATCTGT
CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780

ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840 

ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGC 960

GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080

TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TOGOGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 



MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSV TIRVTQVLQK 
KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 



AACAGAACTG 

\ GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA



TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA 



GCCCTTGGAA GAGACGGTCA
T AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 
GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT

ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 



AAGGATTTGA ACCTGGTGTT 



AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 

A TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340

h GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400

GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 

ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 

GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 

ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 



CACCTTACAC GCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 

AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940

TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 

TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 

CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 

TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA



Protein sequence



Protein Accession # 



MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK
NGMIIPSMYN NLGLFINHYP NGVVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF



HKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD W
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG S
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST



I CAGCACCAAC CAAGGGACTA TCCACCTACA A 
C AAGCCCTTCC TGCGAGAAAA T 
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG A 

G CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 



540 

TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 

TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 

TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840

TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900

CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960

ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020

TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 

AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1140 

CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 
AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 
CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 
GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 
GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC
CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 
T AACAAGATAG CCACCGAGAT CCTTATCGAA ACT CAT! 






TAGTGGAAGC 



T GCTCACACTG C 
TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT T 

[TGAGAG TCTGTGACCC ACTTACCTTG C 
C AACCAAAGAC T. 

^TGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 
\ CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 



MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120 

Seq ID NO: 39 DNA sequence 
Nucleic Acid Accession #: NM_006670
Coding sequence: 85..1347



CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGCCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600 

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660

CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840 

TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 
G TGCTGACCTG GACTGTGACC CGATTCTTCC 



AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 
TTTATCCCTA CTAGGCTTGC TCCACTI 
GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG T 

ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 
CAACACGTAT GGAGGGATTT TTCAGGTTTC A( 
TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA G' 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740

TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 



ATTCTTAAAA GAA 



Protein Accession 



MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA FARRPPLAEL
AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPF AFSGSNASVS APSPLVELIL
NHIVPPEDER QNRSFEGMVV AALLAGRALQ GLRRLELASN HFLYLPRDVL AQLPSLRHLD



LSNNSLVSLT YVSFRNLTHL ESLHLEDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300 
HMADMVTWLK ETEVVQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360
IVLALIGAIF LLVLYLNRKG IKKWMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 

Seq ID NO: 41 DNA sequence

Nucleic Acid Accession #: Eos sequence



GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 



AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 



AATGGCACCC 
GTGCAGGGCA 



A AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASHHFLY LPRDVLAQLP 120 

rHLESLH LEDNALKVLH NGTLAED3GL PHIRVFLDNN 180

17 VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 

SYVFLGIVLA LIGAIFLLVL YLNRKGIKKN MHNIRDACRD HMSGYHYRYE INADPRLTNL 300 



Seq ID NO: 43 DNA sequence 



A GTTTCCATCT TTCTGGTCTC 120 
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 
TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA C 



GCGAGCTAAC AT 



YPATGPADDE APDAETTAAA TTATTAAPTT 



Seq ID NO: 45 DNA sequence
Coding sequence: 26..2299



I I I I 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCC 



TTGGGCTCAG ACCTGGGCCC 

GAAAAACACG GTGATGGAGT GTGACGCGTG
ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG G 



CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC G 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTG1 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 
C GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG
A TGTGGACCGC GATGGCATCG GAGACGCCTG
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC O 
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGGG A 
GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA




G ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 
T TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 
T TCTATGAGGG CCCTGAGCTG GTGGCGGACA GCAACGTGGT 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
2AGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 



1 11 21 31 41 51 

I I I I I I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFSGTFHVNT VTDDDYAGFI
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT
ESQVRLLWKD PHNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELW " 

RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA



TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 



C TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA 
TAAAACCAGA GGGGAGCAAA 
CCTCTCCCAT CACTTCCCTA

GTTACACTAA AAGGTGACCA ATGATGGTCA 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780

ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 

TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080

TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 



Protein Accession #: NP_001556.1 

1 11 21 31 41 51 

I I I I I I 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEHSKRSP 



Coding sequence:
GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 

CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG OCGTGGAGCG CCTCTGAGAT 240 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 

A AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGI 

iTAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 
T CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA G 



GGATTAGTGG 



A TTTTGCTTCA A 

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA A 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
AATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 



MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWGTCSD YPKGDASTGW NSVSRIIIEE 



1 11 21 31 41 51 

I I I I I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 
GCAGCGTOGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 
3 AAACGTGTGC T 



CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 




TGAGCTGCAT 



CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA 
GCATTTTATG I 
GCCCAAGACC A 

GTGGAGGTCT GTGT CAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT T 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC Tl 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340
GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460
3 CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA O 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580

AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700

CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 

CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820

TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000

AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 

TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120

GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 

rCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

T TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 

:TGTGGG CGGATGTCTT GGATAGATCA OGGGCTGGCT GAGCTGGACT 3360

TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 

TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 

CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600

ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 

AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720
AGCACTTCTG GAGACAT 



I I I I I I 

MGVAGRNHPG AAWAVLLLLL LLEPLLLLAG AVPPGRGRAA GPQEDVDECA Q" 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPC 
HDGHNCLDVD ECLENNGGCQ HTCVHVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVH 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH GGKSGGGDGC FLRCHSGIHL 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RSRCILCPNG 
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN
PPPKRRILIV VPEIFLPIED DCGDYLVMKK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 53 DNA sequence

Coding sequence: 157..1479

1 11 21 31 41 51 

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 
CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 
CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACARGAGCTT CACTCTGGAT 



A TGGATCTGTC TAAATACCCC

GAGTTCACCT G 



3 CTTTGTGTTT G 

TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260



CAAAACCCCA 



G GTATTTTAGG CCAAGTGTGC 1560
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620 



TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1800 

AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 1860

TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 

GAGTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATGAAA TATGGGCTTA TGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA GATGAGTTTT TAAATAATGA ATATTATTTA 2040

ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100

AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 

GGATTCCCCA TACTGGAAGG ACTCTGAGGC TTTATTCCCC CACTATGCAT ATCTTATCAT 2220

TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 

GGAAATGGAA GATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 



AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 

T CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520

\ GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 

GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 

G AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700

CCAGCCAGAG 

TTAATTTCTT 
ACCAAGGCT C 

C ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 

A AATGTGGTTG 3120

TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 

AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240
AATATACCAT ATTAGCTACC CACC 



I I I I I I 

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 

VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 

YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 180 

WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240 

NVLYFILETY VPSTFLWLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 

CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360

SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420 



Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: XM_084007

Coding sequence: 138..2405



CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120

GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420

C CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480
2ACTCTG ACCATGATCA TCACTCCCAC C 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA G. 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780

CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA



TCAACCAAAT 

T GTCTCTGCTG GGGGTTATCT 



TTCTTGTTGA 



GAGCAGACTC ACAAGAGCCC TCCCACTTTG 



CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 

GTCATCTGTC T 
TGGAAGGGTC TAACAGCTCT A 



TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 
TCATGACTAC CATCATATTC TCCATCATCA C



ATGACCGTTA : 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580

^ ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

?GTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

T TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 






TCTGAGAATT GGGGAGGCAT AGATTCTTAT 



GTAGTGAGCA CTCTCATATA CTAATTAGTG 



TATTGCCAAG TTATATATCA CCAAAAGCTG T



Protein Acces 



I I I I I I 

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKE GPLFSHLSSQ NIEESAYFDS 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY



Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM_015419.1

Coding sequence: 1..8487

i I 1 f V T i 1 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCGC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240

3 ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300

T CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

A ACGTCTCTGA GGCTACTOCA TTTGGAAGGA 480 




ATAGCATACT ACAGTGAAGT T 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT G 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 
CTGAACCGAC GTCAGAG 

T CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT A 
3 CTGTG CAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 



CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 
: ATCTGACTCA GGCTTGTACC AGTGCATTGC T 
T ATATAGGGTA CTTGTGCAGT CTCCCTCCAC T 
T TGGCAAGAAC CCAGGGGAGT CGGTGACATT G 



GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 
GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860

3 GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920

C ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

C TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

Pl TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

T TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220







CCTTCTCCAG GAACCCTCCA G 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 

Seq ID NO: 58 Protein sequence

1 11 21 31 41 51 

I I I I I I 

MPKRAHWGAL SVVLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60

NLGFNSIQAL SETSFAGLTK LELLMIKGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 120

QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180 

RLSTIRHLYL AENMVRTLPA SMLKNMPLLE NIiYLQGNPWT CDCEMRHFLE WDAKSRGILK 240 

CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMICLKPSI ESPLRQNRSR SIEEEQEQEE 300 

DGGSQLILEK FQLPQWSISL NMTDEHGNMV HLVCDIKKPM DVYKIHLNQT DPPDIDINAT 360

VALDFECPMT REMYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 420 

VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 480 

PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 540 

RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 600

ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660

HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 720

EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 900 

TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 1020 

STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NKLEGDPTHS 1080 

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 1200 

V HTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 1260 
3TILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

V ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFK3ESS PVGFPGTPTW 1380
NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEEAGSS 1440 

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN H ------ 

MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE ATPVNNEGTO. H 
PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA K 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST T 
PSIPSKFTDR RTDQFNGYSK V 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 1920
QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV 1 
QISWIFPDRR VWQTVSPVES R 

HVAALPPVIH QEKLENI SLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 
VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAAHARITG T 



M KPAKIEHKEE NDHKVFYGGD L 
DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG DYTCFAEHQV G 
VVTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400 
GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT WIHVNVQPP KINGNPNPIT TVREIAAGGS 2460 
RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 2520
RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 2580

GTDLQSGQQL QHFYHKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 2640

KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 2700 

ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 2760 

PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 2820 
KTTYIHVF 

Seq ID NO: 59 DNA sequence



CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 
AACAGAAGAA Ac 



GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA 
ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA
AAACCTACAG 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC 
TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA 
CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG ^^^^^^ 
TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG
TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 
ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 
CCATGTTTTC TTTTCTACTT CCTCACATTT A 



CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGCTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCOG GOGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 
GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC At 
GGAGACGAGG AGGATGAGAA G 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 
GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG G 




GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTT GGCAO ( CATG~CACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC rGC nr GAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 
GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA OCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 
A TCAGTAACTC ACTTGCCCAT T 






A GTGACAACCT G 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 



T GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 
CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 
TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG T 
CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT O 

AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 
TGCTACATGC TTTCTGTTTT TCTCAT1 

TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA G 
ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880 
TTATTTGTAA ATTCTCAATT TTGATATATA T 



MFGTKLTRTG AEADYRVILK TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKWA 
SRQYTVRYRE KGELARWDYK QIANRRVLIE NLIPDTVYEF AVRISQGERD GKWSTSVFQR 
TPESAPTTAP ENLNVWPVNG KPTVVAASWD ALPETEGKVK VCLKDTGLFS VSSFQFSAKS 
FQNTFFHTPR LSNHLEQSPS PILETLLLPW WMVCSLGNAI FSXSGPQTGE AWDLTEKPSL 
SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF MLDIGGFSFI 
MCYEDPVSSL TGNSLKSVAA SKADVQQNTE DNGKPEKPSF SSPSPRAFAS SQHPSVFASP 
QGRNAKDLLL DLKNKILANG GAPRKPQLRA KKAEELDLQS TEITGEEELG SREDSEMSPS 
DTQDQKRTLR PPSRIIGIISW APGRTAVRAR MPALPRREGV DKPGFSLATQ PRPGAPPSAE 
ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDSRAVGSL HPKGAFAQPR PALSPSRQSF 
SSVLRDRSSV HPGAKEASPA RRTPHSGAAE EDSSASAPPS RLSEPHGGSS RLLPTQPHLS 
SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHLSSRTQV SEGAEASDGB SHGDGDREDG 
GRQAEATAQT LRARPASGHF HLLRHKPFAA NGRSPSRFSI GRGPRLQESS SEQSTVPSRA 
HPRVPSHSDS HPKLSSGIHG DEEDEKELPA TWNDHVPSS SRQPISRGWE DLRRSPQRGA 
SLHRKEPIPE NPKSTGADTH PQGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS 
FARFPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSRDAGRS PSQPRLSLTC 
AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDDSTEVE AQDVRAPAHA 
L PKHQQVESPT GAGAGGDHRS QRGHAASPAR PSRPGGEQSR ARVPSRAAPG 
L SSKSQQSVSA EDEEEEDAGF FKGGKEDLLS SSVPKWPSSS TPRGGKDADG 
KRFLPPP PGSSPRASHV PSRPPPRSAA T 
S LRORMMHARF RNPLSRQEAR P 
SNGKPNGQRI INGPQGTKWV VDLDRGLVLN AEGRYLQDSH G 
TPWSPDGLP LFGQGRHGTP LANAQDKPIL SLGGKPLVGL EVIKKTTHPP TTTMQPTTTT 

TPKPTTPIPT CPPGTLERHD DDGNLIMSSN GIPECYAEED EFSGLETDTA VPTEEAYVIY 

j PPXTTEPSTT ATTPRVIPEE GAISSFPEEE FDLAGRKRFV APYVTYLNKD 

\ LDHFQVDSLD EIIPNDLKKS DLEFQHAFRN ITWAVEGCH SFVIVDWDKA 

1 RNKFSTQASS VTHLP IENLK PNTRYYFKVQ AQNPHGYGPI 

SPSVSFVTES DNPLLWRPP GGELSGSHSL SNMIPATRTA MDGNM 



AAAGCTGATG CGATGCTCTC AGTGCCGCGT CGCCAAATAC TGTAGTGCTA AGTGTCAGAA 180



AGAAATACAG GATGCCTCTC AGCTGCCACC TGCCTTTGAC C 
AGTGATCTGC AACTCTTTCA CCATCTGTAA TGCGGAGATG CAGGAAGTTG G 
ATATCCCAGT ATCTCTTTGC TCAATCACAG CTGTGACCCC AACTGTTCGA TTGTGTTCAA 
TGGGCCCCAC CTCTTACTGC GAGCAGTCCG AGACATCGAG GTGGGAGAGG A 
CTGCTACCTG GATATGCTGA TGACCAGTGA GGAGCGCCGG AAGCAGCTGA G 
CTGCTTTGAA TGTGACTGTT TCCGTTGCCA AACCCAGGAC AAGGATGCTG ATATGCTAAC 
TGGTGATGAG CAAGTATGGA AGGAAGTTCA AGAATCCCTG AAAAAAATTG AAGAACTGAA 840

GGCACACTGG AAGTGGGAGC AGGTTCTGGC CATGTGCCAG GCGATCATAA GCAGCAATTC 900

TGAACGGCTT CCCGATATCA ACATCTACCA GCTGAAGGTG CTCGACTGCG CCATGGATGC 960 

CTGCATCAAC CTCGGCCTGT TGGAGGAAGC CTTGTTCTAT GGTACTCGGA CCATGGAGCC 1020 

ATACAGGATT TTTTTCCCAG GAAGCCATCC CGTCAGAGGG GTTCAAGTGA TGAAAGTTGG 1080 

CAAACTGCAG CTACATCAAG GCATGTTTCC CCAAGCAATG AAGAATCTGA GACTGGCTTT 1140 

TGATATTATG AGAGTGACAC ATGGCAGAGA ACACAGCCTG ATTGAAGATT TGATTCTACT 1200 

TTTAGAAGAA TGCGACGCCA ACATCAGAGC ATCCTAAGGG AACGCAGTCA GAGGGAAATA 1260

CGGCGTGTGT CTTTGTTGAA TGCCTTATTG AGGTCACACA CTCTATGCTT TGTTAGCTGT 1320 

GTGAACCTCT CTTATTGGAA ATTCTGTTCC GTGTTTGTGT AGGTAAATAA AGGCAGACAT 1380 

GGTTTGCAAA CCACAAGAAT CATTAGTTGT AGAGAAGCAC GATTATAATA AATTCAAAAC 1440
ATTTGGTTGA GGATGCCAAA A 



MRCSQCRVAK 






CNSFTICNAE MQEVGVGLYP



LDMLMTSEER RKQLRDQYCF ECDCFRCQTQ DKDADMLTGD EQVWKEVQES L 
WKWEQVLAMC QAIISSNSER LPDINIYQLK VLDCAMDACI N 



GVQVMKVGKL QLHQGMFPQA MKNLRLAFDI MRVTHGREHS LIEDLILLLE

Seq ID NO: 63 DNA sequence 
Nucleic Acid Accession #: NM_003014.2
Coding sequence: 238..648

1 11 21 31 41 51 

I I I I I I 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGC 
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA A
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG C 

TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300

GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360

ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420

GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480

GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540

CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600

AGGCTGGCCT GCGACGAGCT GCCTGTCTAT GACGGTGGCG TGTGCATTTC GCCTGAAGCC 660

ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720

CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780

AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 

GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020

CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 

AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140

AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320

GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380
ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT T
GTTTTTCTTT GTAAGCCATC A 
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC A 



ATTAGAGAAG T

AAACAAAAAT 



CTGGCTA ATGCTCAAGT 

CATCTTACTT C 



TTAAATATTT TCTTTGCCTA AATACATGTG 

3 TTGGGATACT 



ATTTATTTTA AAAAACAATT TTATTGGCCT T 

AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520

ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580 

GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2640 

CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700 

TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 
I I I I I I

MFLSILVALC LWLHLALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAILAIEQ
YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP
ESLACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRLSPDRCKC
KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTVVDV KEIFKSSSPI PRTQVPLITN
SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV

Seq ID NO: 65 DNA sequence 

Nucleic Acid Accession #: BC010423 

Coding sequence: 248..1780



CACAGCGTGG GAAGCAGCTC TGGGGGAGCT 



CGTGGTAACT GTGGTGCTGG G 

C TCGGGTGGAC GCGGGCGAAG GCGCCCAGGA 



GCAGCTTCCA 



C ACAGCTGAGG GCAGCCCAGC 780 

\ AGGCACAACG TCCAGCCGTT CCTTCAAGCA 840 

r CCACTTGGTG CCTAGCCGCA GCATGAATGG 900 

GCAGCCACTG ACTTGTGTGG TGTCCCATCC TGGCCTGCTC CAGGACCAAA GGATCACCCA 960

CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080

CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140

CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200

CAATGAGTTC TCCTCAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGGAAGA 1260

CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCGCCGC 1320

ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT GGTGCTCATG TCCCGATACC ATCGGCGCAA 1380

GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

CCGGAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740

CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 



T GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220

CAGACCCCAG AGCAGTATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 2280 

CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340 

TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520 

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640 



I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL P~ 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNI 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 
FCLLWWVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 67 DNA sequence
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA

W3AAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 

A AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 

AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA
1 TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 



T TAGAACAGGA 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC 



GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG 
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC 

& GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 
»AACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



1 11 21 31 41 51 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 
DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRFRYS IGLSQDETYI PPGESLRDLI

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 

45 EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 
RLTALRVKKT LAKMSESQDI KL 

T Eos sequence 



CTTCTAACTT GACAGGGTCT TGCTCTGTCA TTCAGGCAAG



TCCTCCCACC TCAGCCACCC AAGTGGCTGG GACTACAGAT T. 
TTAAAGCTTT GTTCCAAAGC CAATGTGTAT ACTGAAGTGC CTGATGGAGG ATGGGGCTGG 
GCGGTAGCTG TTTCATTTTT CTTCGTTGAA GTCITCACCT A 



TCTGTGTGTT 



GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAGACC C 

3 CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG
:TCAATA GACTCCATTG ACTCAGGAGT AGAACTAACT
ACCTCACCTA AAAATGTGCC TACTCACACT AACCTGGAAC TGGAGCCGAA GGCCGACATG
C CAGCCCCAGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC
[■TTTATT TGTTATGCAT TATTTGGTCT CTTTGCAACA
CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT GGGCATTGAC
CAGGACCGCG CTGCTTTTTT ATTATCTACG ATGGCCATTG CAGAAGTTTT CGGAAGGATC
GGAGCTGGTT TTGTCCTCAA CAGGGAGCCC ATTCGTAAGA TTTACATTGA G

GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG CTACTG7 

ATGTCATG CA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGGAC TCACATTCCA 1380 

CTGCTTGCTG AGGATGATGT CGTGGGCATT GAGAAGATGT CTTCTGCAC-C TGGGGTCTAC 1440 

ATCTTCATTC AGAGCATAGC AGGACTGGCT GGACCGCCCC TTGCAGGTTT GTTGGTGGAC 1500 

CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGCGCAG CTG3CATGC-C CCTGGCTGCT 1560 

GTGTGCCTCG CCCTGGTGAG ACCGTGTAAG ATGGGACTGT GCCAGCATCA TCACTCAGGT 1620

GAAACAAAGG TAGTGAGCCA TCGTGGGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 1680 

G AAATGGAT C TTGCAAAAAA TGAG CACAGA GTTCACGTGC AAATGGAGCC GGTATGACAC 1740 

ACTTTCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGC-GGTGGG CCCAACGGGG 1800 

ACACAAGGAG GCAGAGGAGC TAACCCCTCT ACTCCACTTT CAAAACTACA TTTTAAAGGG 1860 
AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT T
TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT CTTCCATATA 
TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 
TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT TCACGTGGGA 
T AAATGAT TT TCTTTTTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT C 



MTQNKLKLCS KANVYTEVPD GGWGWAVAVS F 

jSNHFGH rlwmlggll vstgmvaasf sqevshmyva 
L sqyfgkrrsi vtavastgec pavpapapai malkerigwr 

YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLEKEK TRTSIDSIDS 
3 PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGPV LNREPIRKIY 
I VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG G 



TTGGGGGTTT ATTCTCTTCC CTTCTAACTT GACAGGGTCT 



3 GACTACAGAT T 
; CAATGTGTAT ACTGAAGTGC CTG 
GCGGTAGCTG TTTCATTTTT CTTCGTTGAA GTCTTCACCT A 
GGTGTCTTCT TTAATGACTT AATGGACAGT TTTAATGAAT CCAATAGCAG G 

T TGTCTTAACA TTTTCAGCTC CCCTCGCCAC AGTCCTGAGC 420 

G TTGGGGGGGC TACTTGTCAG CACCGGGATG 480

GTGGCCGCCT CCTTCTCACA AGAGGTTTCT CATATGTACG TCGCCATCGG CATCATCTCT 540 

GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA TCCTATCACA ATATTTTGGC 600 

AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC TGTGTTTGCT 660

TTCGCACCAG CAATCATGGC TCTGAAGGAG CGCATTGGCT GGAGATACAG CCTCCTCTTC 720

GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAGACC CATCATTATC 780

AGAGGACCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG 840

CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG ACTCAGGAGT AGAACTAACT 900 

ACCTCACCTA AAAATCTGCC TACTCACACT AACCTGGAAC TGGAGCCGAA GGCCGACATG 960 

CAGCAGGTCC TGGTGAAGAC CAGCCCCBGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC 1020

TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATGCAT TATTTGGTCT CTTTGCAACA 1080

CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT GGGCATTGAC 1140 

CAGGACCGCG CTGCTTTTTT ATTATCTACG ATGGCCATTG CAGAAGTTTT CGGAAGGATC 1200 

GGAGCTGGTT TTGTCCTCAA CAGGGAGCCC ATTCGTAAGA TTTACATTGA GCTCATCTGC 1260 

GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG CTACTGAATT CTGGGGTCTA 1320

ATGTCATGCA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGACT CACATTCCAC 1380

TGCTTGCTGA AGATGATGTC GTGGGCATTG CAGAAGATGT CTTCTGCAGC TGGGGTCTAC 1440 

ATCTTCATTC AGAGCATAGC AGGACTGGCT GGACCGCCCC TTGCAGGTTT GTTGGTGGAC 1500 

CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGCGCAG CTGGCATGGC CCTGGCTGCT 1560 

GTGTGCCTCG CCCTGGTGAG ACCGTGTAAG ATGGGACTGT GCCAGCGTCA TCACTCAGGT 1620 

GAAACAAAGG TAGTGAGCCA TCGTGGGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 1680 

G AAATGGAT C TTGCAAAAAA TGAGCACAGA GTTCACGTGC AAATGGAGCC GGTATGACAC 1740 

ACTTTCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGGGGTGGG CCCAACGGGG 1800






C TACCAACATC GCTTTTGTTT TGTTTTGTTT TGTTTTAAGC 1920

TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT CTTCCATATA 1980

TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 2040 

TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT TCACGTGGGA 2100 

TAAATGATTT TCTTTTTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 2160 
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT C 



I I I I I 



L ATVLSNRFGH RLWMLGGLL VSTGMVAASF SQEVSHMYVA 
L SQYFGKRRSI VTAVAST ~ " 
YSLLFVGLLQ LNIVIFGALL RPIIIRGPAS PKIVIQENRK E 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK A 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGPV LNREPIRKIY 
IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GLTFHCLLKM MSWALQKMSS 
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAI, V 
RHHSGETKW SHRGKTLQDI PEDFLEMDLA KNEHRVBVQM ET~ 

Seq ID NO: 73 DNA sequence 
Nucleic Acid Accession #: NM_002184.1
Coding sequence: 256..3012
1 11 21 31 41 51

I I I I I.I 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 

CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120

AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 

TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 

CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420

: ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 

riATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 

\ TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 



CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 

T GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 

]AAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 

GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 
T GAAGGAAGAT G 



ATAGATCCAT 

\ AATCTTGGAT 



ATCCCTGCCT GTGACTTTCA A 

h ATGGACTACT CCAAGGGAAT C 
A TAAAGCACCC TGTATCACAG ACTGGCAACA A 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA T 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT C 
CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA A 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100
GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTC 2160
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280
TTTAATTCAA AAGATCAAAT GTATTCACAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 



TCTTCTAGGC CAACCATTTC TACCACTCAT CAAAATCAAT CTTCACAAAA CACTTCGAGC 2520 

ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 

GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640

CTACAMTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 

AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 

CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2820 

ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 

GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940

GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 

ATGCCTCAGT GAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060
CTAAAATGAT TTTATCTGTG AATTC 

Seq ID NO: 74 Protein sequence 



MLTLQTWWQ ALFIFLTTES TGELLDPCGY ISPESPWQL KSNFTAVCVL K 

N HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 

:IVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 
i FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 
IILKYNIQ YRTKDASTKS QIPPEDTAST RSSFTVQDLK P 






^ILEWCVL S 
I KAYLKQAPPS K 
QLPVDVQNGF IRNYTIFYRT I IGNETAVNV DSSHTEY 
KDGPEFTFTT PKFAQGEIEA IWPVCLAFL LTTLLGVLFC F 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSWEIEAND KKPFPEDLKS LDLFKKEKIN 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRKQ VPSVQVFSRS 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 
NEEDFVRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG T3GQVERFET VGMEAATDEG 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 75 DNA sequence 



TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240

AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 

CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 

GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 

CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 

TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGAGG GAGGGCAAGA TCTATGACAG 540

CATTCTGCAG GTGGAGG CCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 

CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 

CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 

CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 

CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 

GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 

CGT CACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 

GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 

TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 

GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1140 

GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 



; TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680

AAAGT AT CAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 

CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 

TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860

TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 

CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 

CCTCTTCCCT GATATCAAGA TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 

GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 

TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160

CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220

CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 

GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 

CTACACTAGC AVTGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 

TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460

TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 

CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 

CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 2640 

CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700

GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGA^GCCGAG GAAGAAATGA GCTCCAGCAG 2760

TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820

GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACTAGTG 2880 

CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 

CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 

TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060

AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120

CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 

TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 

GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 

CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360

TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 

CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 

CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGARA AGTCCCAAftC CAAGAATAGG 3540 

CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC A

CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA G 
GGACAACAAG GACACAACAC AACACACARC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 
T GAGGTCAAAC G 

GCTGGGCTCC CCCAGGACAG AGGGGACCCT G«v J «nv_ f v^rt nvjuviviwiv w^ww^j*,^* 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG




CTGTGATCTT TTGTGGCCAA G 

T TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
^TCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA A 



I I I I I I 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKV1IKH KPKIETSYHG VII 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAWLNKT SGEGRLRAKS PIDCELQKEY 
TFIIQAYDCG AGPHETAWKK SHKAWHIQV KDVNEFAPTF KEPAYKAWT EGKIYDSILQ 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RHGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSHPLF PSIHLETCDG AVSSLQIVTE 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG
RQGAKIPDGI VPKNLTDQPT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPWTLYM DGATYEPYLV 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 
CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 
LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 
DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 
HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSSLNGRYTS 
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWIIIS 
VCMLVFWAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEX HEGPGHGEDE 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDO STLPY 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 482..3007



AACTGAGCTA ACAAGAAATA CTAGAAAAGG 



ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT C 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG 



I CTGAAATAAG 

TGCAGATATT GAGATAGGTG CATTTAATGG C 



TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATT1 
ACTAAAGAAG GAATCTATTT GCCCTACTCC A' 

AGGATCATTA CATCTCCCAC CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 

ATCAGGACTT CTAATACATT CTCAGGACCC CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 

TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 

AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT AT CTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800

ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860

ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 

GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 

AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280

TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 
AAAGAAACAA GTAGATGAGC AAATGAGAGA C 

GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC T 
GCACATGGTG AGCCCCATGG TTCATGTCTA T AGAAGT CCA TCCTTTGGTC C 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 
TCTTTTGGAA CAGGAAAATC A 



AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 



TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 



A TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA T 



I I I I I I 



VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFTOII ADIEIGAFNG LGLLKQLHIN 
HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKUJ RLKVLILNDN AIESLPPNIF 
RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 
IIGDWCHSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 
TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER N 
PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNN3IE VLEEGSFMNL TRLQKLYLNG 420

NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 

GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 

TDDILCTSPG HLDKKELKATj NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 

LTDAVPLSVL ILGLLIMFIT IVFCAAGIW LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660

YGHKTTHHTT ERPSASLYEQ HMVSPMUHVY RSPSFGPKHL SEEESRNEKE GSDAKHLQRS 720

LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 

QLQPDHEAHY PGAHEELKLH ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 






TTATGAAGTA 

ACTTCACCAA 

AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC 



ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG




AAATTAAGAA 



G CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 

AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1140 

CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 

CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 

ATGATGTGAA AGGTTTTAAT CATCATCTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 

GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380

TGTGGGAATA TTTAAATTTT ACIGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAAAAA AAAAAAAAAA AA 

Seq ID NO: 80 Protein sequence 
Protein Accession #: NP_057724.1



MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS NTADSKAARL 60 

RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 

PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180 

VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNHQ 240 

IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEK HIFVGSKTAD PCCYGHTQFH 300 

LLPDKLRRER LLRQNCADQ I EWFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 

ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFMDDVLLQ 420 



= CTCTGTGGGC GTCCAGTCCT ■ TGAGGCOGCA GAGCCCGCAG 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 



AACCCAGTGG GTACAGGTTC TGGGGGACCT G 
GGACGATGTT ATGCAGCCAA GCACAGCACG T 

GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 






GACATGGAGA AGGGGGTT G A GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA C 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 

GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
CC TCTCCATGTC AAGCTTCCAG
TTCAGCCAAC TCTCAAGGCA AGGCCAGGO 

TCCTTCAACA AGCAAGATTC 



CGCGAGCTGT 

CAGGGACCAA 



AAGAGCCTCT CCAAGAAATG 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Protein Accee 



MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60

QQQHSEMLAK LHEEIEHLKR EKKGEPARGP RPALPPQAHS TLPLPQKRNT AINSSTRLGS 120

GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTHTDAATS SRGWTMLCSQ AQHVLLSGSP 180 

GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 

MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLPPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 

AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 

LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 

GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480

SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 

EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTOLLQ 600 

TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 83 DNA sequence

Nucleic Acid Accession #: NM_005264.1

Coding sequence: 557..1954

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 

ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAAAT CGTCCTCGCC 120

CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 180 

TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCAITGCC 240 

CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 

TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCICC CGCCCTCGCC 360 

ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420

CTGGATGGAG CTGAACTTTG GGCGGCCACA GCACCACAGC TCTCCGGGGA TCGCTGCACG 480

CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GOGGGGACCA 540 

GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 

GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 

GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720



CATGGAGGCC C 

GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG A 

G TCAGATATAT T 

A TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
: CGCCGCAAGT GCCACAAGGC 1140 



A TTTGCAGGAC TCCTGCAAGA O 

C CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 

CAGCTGTCTA AAGGAAAACT AGGCTGACTG CCTCCTCGCC TACTCGGGGC TTAITGGCAC 1440 

AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 

CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 

TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620

GCCAGCCITC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT T 



TAACCGCTCT 



G ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC T 

GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TITTCCTTTT 2100 

AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 

AAACTCAGAA GGCITTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 

AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 

AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGT T AA TATTTCTAAG GTAACTCCCA 2340 

TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 

CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 

ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 



Seq ID NO: 84 Protein sequence
Protein Accession #:






WO 03/042661 



PCT/US02/36810 



I I I I I I 

MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYP.TLR QCVAGKETKF 
SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLL3DSP 
YEPVHSP.LSD IFRWPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICI R£ YITP TT 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 
PNCLNLQDSC KTNYICRSRL ADPFTNCQPE SRSVSSCLKE MYADCLLAYS GLIGTVMTPN 
YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL WTALSTLLS LTETS 



CCGCCTCAGG GCTCGGTGCA CAGTGGACAT TTGGGGAGCG TTGTGGGTGA 



GTTGGCTTTG GATTCAGGAG GGATGGAT1C CAGTCCTAGC 
TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC 
CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 
CTCACTGACA T ATG CCACCA TCCTGGAGAT GCAGGCC 

CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG CTGTGTCAGA G 



CCTGACCTTC CTGCAGGGTT CCTCACACGG AGGGGCAGTC AGGCCCAGAG C 

TCCCTCTCAC GCCTGCAGCT GCCCACC1GG G - 

GGACGAGAAC A

CTACAAGGAG CTGGACAGCC TTGTTCAGTC CTCACAATAC T< 
GCACTTTGAA GGAGGAGTGA AGCTTGGTGT AGGGGCCTTC AACCTGACAC T( 
TCCTACTAGG ATCCTGAGGC TGTTGGAGTT TGTGGGGTTT TCAGGAAACA AGGACTATGG 
GCTGCTGCAG CTGGAGGAGG G 
GCTCCTGCTG TGCTACCACA CCTTCCTCAC CTTCGTGCTC GGTACTGGGA A
CGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG T 
CCTGTTCTTT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC ATTGATGCAG Ti 

GGCTCCGGGC CGGGCCTGGG CATCCCTCGG GGTCTCCCAG ACCAGCAGGA AGTCAGGCAC 1200

ATGTGACATA CTCAGGGACA GGATAGACTG GGGGCGGGGG GGGGGCCAAG AGAGAACCAA 1260

CCAGAGAGCA GGGGCAGGAG AGGCCCTTCT GGCAGAGCAG CCTGGGAAGA CAAGGGAGGA 1320

GGAGGCATTT CTGGTGCCTG GGATTITGAC TGGGAGATAT AGGACTGCAG CATTGCAGTG 1380

GAGGGAGGTG GAGGGAGGTG CITGAGGGAG GCAGAGGTTA GGAAAGCCCA TCTGTTTACC 1440 

GCATGACGAT T AGGCT GG AG TCTGGTACCT CCCCTCCATT ATAGCTCTCT CCTGCTCTTT 1500 

CATTTTGTTA CTAAAAACCA GAGTCCTAGG CGGGGGCTGT ATTTGAGCCC AAOGTCATGT 1560 

AAGACTTAGG AGGTAAAACC AGGACTGGAG GCCAGATCTC CTGGCTCCTG GGGCCCCACC 1620

TGAGCCTAGC ACAGGGCTGG ACCACTATGC CCTGGAGGAG TCCCGGTCTG CTGTGGTGTT 1680 

GGGAGGTTCG GAGGATGCAG AGGGGTTGGG GCXGGGTGGG CACCCGTCAG GCTGACCAGA 1740 

AGGTGCCTGC AGGCCATCCG GCGTTTCGAG GAGTGCTGTG AGGCCCAGCA GCACTGGAAG 1800 

CAGTTCCACC ACATGTGCTA CTGGGAGCTG ATGTGGTGCT TCACCTACAA GGGCCAGTGG 1860

AAGATGTCCT ACTTCTACGC CGACCTGCTC AGCAAGGAGA ACTGCTGGTC CAAGGTGGGC 1920

TGATGCCACG TGTTAGGGGC ATTGGGTGAC CAGGGCTGAC TGTGTGCCTC CAGACCACGG 1980

GCCAAATCCC TAACTGAACA CAGATGTCTC AGCTGGAATC TAAACATAAC CTTAAATTCT 2040 

AAT AGGACT C AGGCTTGGAA GGAACTAAAG ACCACAAGAG AAACTTCTGA ACCACAATGT 2100 

GTCACAAAGA GATTTCTAGT CACAAAGGAC AGAAACATGG CTCCCTCTGT CCAGTAGAAC 2160 

TGTTTCTGTG GTAGAAATGT TCTACTTATG CACTGCCCAA ATATGGTAGT CACCAGCCAC 2220

ATGTGCACAA TGAGCACATG AAATGTGCCT AGTGCAACTG GGAAACTGAT TGTTTTCAGT 2280 

TTTATTTAAT TTTAATTAAC TAAATGTTAA ATTTAAATAG CCATGTAGGG CTTGTGGCCA 2340 

CTATATTGGA CTATGCAGGT CCAAAACACA AAAGGCTCAT ATAACTGAAC ATTCTTGGCA 2400 

CATCCGACTT CAGGTAGGGC TGGATCCAGG AATTCAAATG ATGTCGTCTG GCTTGGTCTT 2460 

TCCATTTGTG GCTGTGCTCT CTCCTATGAC ATCTTTGTTT CTGCTGCATC TTTGCAGGAA 2520

GGTTCTCTCC ATGTGACAGG CAAGGTGGCC ACGGGCTGCT TCTACTCATA TCCTCCCCTT 2580 

GGTTTCAACC AGAGTGCCAC AGTTTGAGCC TCACTGGTCT GACTTGTCAC CTGCCAATGC 2640

CTGGAACAGA GGGGTGGGAA GATTCCTCAT GGGCTGAGAG TAGGAGAGGG GTGGTTCCAC 2700 

AGAAGAAAAT GATGCACCCA GAAATAGTGG AGGGATTAAC AAGATGCCAT ACAGGCAAAA 2760 

CAAAGCCAAA CAGATGCCCG CCTACCAAAG ATGAAATTTA TCATGGIAAG IATTGAAATA 2820

AGTGTTAGCT TGTACCATCA TAGTAATGAT AGTGCAGAAA TTGGAACCAA GAGTCTTACA 2880 

ACCACCTAGC TCAGCAAACG TCTAATCIGT TATTTGTAAA TACACAGGAC ATGTGTCTTC 2940 

ATGGCTTCAT CCCCTGCCCA TAGCAGACAT TGCTAATCAA TCCTCTGCCA TGAGCCTAGC 3000 

TGTGACCTTA GATACCTTCC CTGCAAAGCT CCAGGCAGCC GTTAGAACTG ACCCACATTG 3060 

GCACCTGTTA GGAGACCTGT CATCTGCATC TGGTCCCTCC CCCTCCTCTG CATGGAGGTC 3120
TCTTCCGTAG CCCTGCCTGG GGACCAAGCT GAAGGGGAAT T 
GGGGCTGGGG 

GATGCTAAGC 

TGAGGAGTCG 

CTCCCATGGC CCCCACACCC CCTCCTCATC ACCTCCTACT CCCAAAAAGG A 
T TTTTCTTTTT TTAGAGACAG G 

T CTTCCCACCT CTGCCTCCTA AAGTGCTGGG ATTCTGGGCC 




GGCTGGGGCT 



TCTGTGAAAA ATGGGGGTTG TAACCCTCTG AGTTCTGGGG CCCCACCCAT TTCTAATCTC 3960 
CRGAACTTGG CCACCACAGA CTCCACCAAC TTCTAGTCCT GGGGCCTGGG
TTGCCAIAGG CACCACCTGC TCTGTGCAGG CAGCGCCCCC CTCTGCCAGG 
GTCAGCTGCT GGGTCTGACC CGCAGACCCT GGCTGAGCGA CGGATGAACG GAGTATGCAG 
ACACAGGCTT GCCTGTCAGC AGATGGGGGA CCCCTGCCAG AGTCAGCAGC GGCCCCCATA 
AGCCTGCCAC GCTTGCATTT ATTTAGTACA GATGTAATGA CAAAGGCCTA AAGCAAACTC 
CATTTGTGGG TAATTAACAT TGTCGCCCCC CCAGAAAGAG CAGTCCTCCG CATGATGATT 
AAAGG CCAGG TTCCGAGGCC TAAGTAAACC AACTTATCTA GATCAATTCC CTTACTTCTT 
GTTATCTACT CTGAGAGAAT TCAGCTGCCT TCAGCCAAAT CCTTTCCCGA AGCTTT1GCA 
AAACCTCCGA GCCTTCCAAG GTTTGCTTCT TTCTGTAATT TTTCTCACCA CCCTGACCTA 
TCTCCTGCAG TCAGCCCTGT GGAGGCCTTT GTGTTTCCCC CAGT3CTGGC AGCCTAGAGG 
CTGAGATGGC CAGAAACAAG GTGGTGACAG TGGCGTGCTC AGGGCTTGGG AAACCCAAGG 
AGCTAAAGGC ATGCCCAGGC AACCAAAGAG GACAGGAAGG CTTCTGAGGA GAGACCTCTG 
1 TGGAGAGGAA GGACTTAGGG AGGCAGAGTG GAGGAAGTGA GAGGACACCC 
3 ACCAAAGGCT CAGAAGCCAG GGCGCTGCAG AGGGGCTGTG 
TGCCACAGGG TGAAGAGTTT GTGTGGCAGA AGGGCAGGGG GCTTGCATCA GGGGTGACAG 
CTGCTCTTTT GTCCCAGCAT AGCCCCTGTA CATCCCTGGA GAGCTGGGGC GTCCACAACT 
rCCTGGT GGTGCAGTGA GGGTGAGCTG TCTGTGGGCA 
A TGAGCCTGGT GAAGGGATAA TGGCATCCCG GGCCGAGGAG 
CAGCACAGGC AGAGGCCTGG GGAGAGTTTA AGGAGTGTAG GGGAGGAAAT GGCAGAAGAT 
GAGCCAGAAA AAGAAAGGTT AGGGCAGGTC CTGGAGGACA TGAGTGGCTG TTTGGGCTTT 
ATCCAGCAGT GGGGGAGCCT TGGCAGGCTT GTGGCTTAGA TAGGTGCTTT AGAAAGCCCA 
CCAGCAGTTG CTGGGCCACC CCGCTGGCTG GGTCCTGTTC TAAGGCAGGA AATACAAGCA 
TGAGCAGGAA AAGACCCCCT CAAGGCTCAC GTCCTAGTGG 



K TGGGAGGGGC TTTGTAAGGG GAGGTCGGGG GAAGCCTGTC TCAGAGGACA 
\ GCGCAGGAGC AGCACGTGGC AGTCACATGG CAGGCCGTTA GGGCAGAGGG 
G GGCAGGAGTG TGTTTGATGT GTCCTGGGAA C 
GCCGTCGTGT GGCTGGAGTG CTGCAGGTGT C 

G AACTGACATG A 
T GGGAGCAGGG AGATAGGAGA C 
CAGGACAGCA GAAAGACATC CCGGGTAGCC TGGAACAGGG AGACGGTGTG G 
A ATGAGAGCCG TAGGGCAAGG CCAGCAGGAT CCTAGAGTGA G 
3 TCTCCACGTC A 
GACCCGGGAG GTTAAAGATG GGACCGGGGC CAGACGCAGT G 
GCACTTTGGG AGGCTGAGGC GGGCGGATAG CTTGAGGTCA GGAGTTTGAA ACCAGCCTGG 6060
CCAACATGGT GAAACCCCGT CTCTACTAAA ATATACAAAA ATTAGCCTGG CGTGGTGGTG 6120
CATGCCTGCA GTCCCAGCTA TTCAGGAGGC TGAGGCAACA AGAATCGCTT GAACCTGGGA 6180
CCCGGAGGTT CCACTCAGCC GAGATCGCGC CATAGCACTC CAGCCTTAGC CTGGGCGACA 6240 
GAGCGAGACC ACATC 
MDSSPSLPLI RTPESSLHEA LDQCMTALDL FLTNQFSEAL SYLKPRTKES MYHSLTYATI
LEMQAMMTFD PQDILLAGNM HKEAQHLCQR HRRKSSVTDS FSSLVMRPTL GQFTEEEIHA 
EVCYAECLLQ RAALTFLQGS SHGGATOPRA LHDPSHACSC PPGPGRQHLF LLQDENMVSF 
IKGGIKVRNS YQTYKELDSL VQSSQYCKGE NHPHFEGGVK I/3VGAFNLTL SMLPTRILRL 
LEFVGFSGNK DYGLLQLEEG ASGHSFRSVL CVMLLLCYHT FLTFVLGTGN VNIEEAEKLL 
KPYLNRYPKG AIFLFFAGRI EVIKGKIDAV SDGGPGRGWG SLGVSQTSRK SGTCDILRDR 
IDWGRGGGQE RTNQRAGAGE ALLAEQPGKT REEEAFWPG ILTGRYRTAA LQWREVEGGA 



1 11 21 31 41 51

I I I I I I 

GGTGGAGACA CCGCCTCAGG GCTCGGTGCA CAGTGGACAT TTGGGGAGCG T 
CCCCCACACA GGCACTGGGA ATGCAGGGGA GAGGGGGCCA AGGGGGAAAG GGGCCAGAGT 
GTTGGCTTTG GAT TCAGG AG GGATGGATTC CAGTCCTAGC TTGCCACTTA TTAGGACTCC 
TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC GCCCTGGACC TCTTCCTCAC 
CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 
CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG ATGACCTTTG ACCCTCAGGA 
CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG CTGTGTCAGA GGCACCGGAG 
GAAGTCTTCT GTAACAGATT CCTTCAGCAG CCTGGTGAAC CGCCCCACGC TGGGCCAATT 
CACTGAAGAG GAAATCCACG CTGAGGTCTG CTATGCAGAG TGCCTGCTGC AGCGAGCAGC 
CCTGACCTTC CTGCAGGGTT CCTCACACGG AGGGGCAGTC AGGCCCAGAG CCTTGCATGA 
Z GCCTGCAGCT GCCCACCTGG GCCAGGCCGT CAGCATCTTT TCCTCCTGCA 
C ATGGTGAGCT TCATCAAAGG CGGCATCAAA GTTCGAAACA GCTACCAGAC 
3 CTGGACAGCC TTGTTCAGTC CTCACAATAC TGCAAGGGTG AGAACCACCC 
GCACTTTGAA GGAGGAGTGA AGCTTGGTGT AGGGGCCTTC AACCTGACAC TGTCCATGCT 
TCCTACTAGG ATCCTGAGGC TGTTGGAGTT TGTGGGGTTT TCAGGAAACA AGGACTATGG 
GCTGCTGCAG CTGGAGGAGG GAGCGTCAGG GCACAGCTTC CGCTCTGTGC TCTGTGTCAT
GCTCCTGCTG TGCTACCACA CCTTCCTCAC CTTCGTGCTC GGTACTGGGA ACGTCAACAT 
OGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG TACCCTAAGG GTGCCATCTT 
CCTGTTCTTT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC ATTGATGCAG TGAGTGATGG 
GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGTCTCCCAG ACCAGCAGGA AGTC^GGCAC 
ATGTGACATA CTCAGGGACA GGATAGACTG GGGGCGGGGC- GGGGGGCCAA GAGAGAACCA
ACCAGAGAGC AGGGGCAGGA GAGGCCCTTC TGGCAGAGCA GCCTGGGAAG ACAAGGGAGG 
AGGAGGCATT TGTGGTGCCT GGGATTTTGA CTGGGAGATA TAGC-ACTGCA GCATTGCAGT 
GGAGGGAGGT GGAGGGAGGT GCTTGAGGGA GGCAGAGGTT AGGAAAGCCC ATCTGTTTAG 
GGCATGACGA TTAGGCTGGA GTCTGGTACC TCCCCTCCAT TATAGCTCTC TCCTGCTCTT 
TCATTTTGTT ACTAAAAACC AGAGTCCTAG GCGGGGGCTG TATTTGAGCC CAACGTCATG
TAAGACTTAG GAGGTAAAAC CAGGACTGGA GGCCAGATCT CCTGGCTCCT GGGGCCCCAC 
CTGAGCCTAG CACAGGGCTG GACCACTATG CCCTGGAGGA GTCCCGGTCT GCTGTGGTGT 
TGGGAGGTTC GGAGGATGCA GAGGGGTTGG GGCTGGGTGG GCACCCGTCA GGCTGACCAG 
G CAGGCCATCC GGCGTTTCGA GGAGTGCTGT GAGGCCCAGC AGCACTGGAA 
C CACATGTGCT ACTGGGAGCT GATGTGGTGC TTCACCTACA AGGC-CCAGTG 
C TACTTCTACG CCGACCTGCT CAGCAAGGAG AACTGCTGGI CCAAGGTGGG 
C GTGTTAGGGG CATTGGGTGA CCAGGGCTGA CTGTGTGCCT CCAGACCACG 

TGAAC ACAGATGTCT CAGCTGGAAT CTAAACATAA CCTTAAATTC 2040 
T CAGGCTTGGA AGGAACTAAA GACCACAAGA GAAACTTCTG AACCACAATG 2100 

G AGATTTCTAG TCACAAAGGA CAGAAACATG GCTCCCTCTG TCCAGTAGAA 2160

CTGTTTCTGT GGTAGAAATG TTCTACTTAT GCACTGCCCA AATAIGGTAG TCACCAGCCA 2220 
CATGTGCACA CTGAGCACAT GAAATGTGCC TAGTGCAACT GGGAAACTGA TTGTTTTCAG 2280 
TTTTATTTAA TTTTAATTAA CTAAATGTTA AATTTAAATA GCCATGTAGG GCTTGTGGCC 2340 
ACTATATTGG ACTATGCAGG TCCAAAACAC AAAAGGCTCA TATAACTGAA CATTCTTGGC 2400 
ACATCCGACT TCAGGTAGGG CTGGATCCAG GAATTCAAAT GATGTCGTCT GGCTTGGTCT 2460 
TTCCATTTGT GGCTGTGCTC TCTCCTATGA CATCTTTGTT TCTGCTGCAT CTTTGCAGGA 2520 
AGGTTCTCTC CATGTGACAG GCAAGGTGGC CACGGGCTGC TTCTACTCAT ATCCTCCCCT 2580 
TGGTTTCAAC CAGAGTCCCA CAGTTTGAGC CTCACTGGTC TGACTTGTCA CCTGCCAATG 2640 
CCTGGAACAG AGGGGTGGGA AGATTCCTCA TGGGCTGAGA GTAGGAGAGG GGTGGTTCCA 2700 
CAGAAGAAAA TGATGCACCC AGAAATAGTG GAGGGATTAA CAAGATGCCA TACAGGCAAA 2760 
ACAAAGCCAA ACAGATGCCC GCCTACCAAA GATGAAATTT ATCATGGTAA GTATTGAAAT 2820 
AAGTGTTAGC TTGTACCATC ATAGTAATGA TAGTGCAGAA ATTC-GAACCA AGAGTCTTAC 2880 
AACCACCTAG CTCAGCAAAC GTCTAATCTG TTATTTGTAA ATACACAGGA CATGTGTCTT 2940 
CATGGCTTCA TCCCCTGCCC ATGGCAGACA TTGCTAATCA ATCCTCTGCC ATGAGCCTAG 3000 
CTGTGACCTT AGATACCTTC CCTGCAAAGC TCCAGGCAGC CGTTAGAACT GACCCACATT 3060 
GGCACCTGTT AGGAGACCTG TCATCTGCAT CTGGTCCCTC CCCCTCCTCT GCATGGAGGT 3120 
CTCTTCCGTA GCCCTGCCTG GGGACCAAGC TGAAGGGGAA TTCTCTGGGT CTGGGGCAGG 3180 
TGGGGCTGGG GAAGGGAGCA GGCTTCTACT GAGCTCTCAA CATGTCTGGC ACCTGTCATG 3240 
TCCTCCACAC CACACCCAGT GAGGAGAAGT TATTATCTCC ATTGGGCACA TGAGGGCCAC 3300 
TGATGCTAAG CGAGGACTGT GACTTTCCCA CCCACATGGC TGTTGGTGAC AGACACCGGA 3360 
TTTGAGGACA GTCCTGTGTG ACTCAGAGGC CTCTGCTTCT CTCTTGCTCC TAGAGTCCTC 3420 
CTGAGGAGTC GGGGCTTGCC CTGAGCCCAC CCTGCTGTTG AAGGTGCTTC CTCAGGCCCA 3480 
GCTCCCATGG CCCCCACACC CCCTCCTCAT CACCTCCTAC TCCCAAAAAG GACAAAGCCT 3540 
CAGGGAACCT t Tttt ctTTT TTTAGAGACG GGGTCTTGCT ATGTTGGTCA GGCTGGTCTT 3600 
GAACTCCTGG GCCCAGGCAA TCTTCCCGCC TCTGCCTCCT AAAGTGCTGG GATTCTGGGC 3660 
CTTAGGGAAC CTTTTTGAAC TGAAAGTGAC GCTCGAAGCC TTCCTCTAGA GTGAGGTGGG 3720 
TGGCTGGGGC TAACCATATG GGGAAGGAGA GACCTGGTGG GGGCACACAG CTGCTATATA 3780 
CACCAACAGA AGGTGGGCCA GGCCTCCCAG CTGTCAGGAA ATCTGGCTCA GTCCCCAGAC 3840 
CGCTGTGTGG CTTTGGGGTG GCCAGTCCCT CTCTCTCTGC AACACTCTCC TCATGAGGAC 3900 
TTCTGTGAAA AATGGGGGTT GTAACCCTCT GAGTTCTGGG GCCCCACCCA TTTCTAATCT 3960 
CCAGAACTTG GCCACCACAG ACTCCACCAA CTTCTAGTCC TGGGGCCTGG GCCTCTGGCC 4020 
G GCACCACCTG CTCTCTCCAG CCAGCCCCCC CCTCTGCCAG GATCCTCCGA 4080 

TCTGAC CCGCAGACCC TGGCTGAGCG ACGGATGAAC GGACTATCCA 4140 

T TGCTTGTCAG CAGATGGGGG ACCCCTGCCA GAGTCAGCAG CGGCCCCCAT 4200 
AAGCCTGCCA CGCTTGCATT TATTTAGTAC AGATGTAATG ACAAAGGCCT AAAGCAAACT 4260 
CCATTTGTGG GTAATTAACA TTGTCGCCCC CCCAGAAAGA GCAGTCCTCC GCATGATGAT 4320 
TAAAGGCCAG GTTCCGAGGC CTAAGTAAAC CAACTTATCT AGATCAATTC CCTTACTTCT 4380 
TGTTATCTAC TCTGAGAGAA TTCAGCTGCC TTCAGCCAAA TCCTTTCCCG AAGCTTTTGC 4440 
AAAACCTCCG AGCCTTCCAA GGTTTGCTTC TTTCTGTAAT TTTTCTCACC ACCCTGACCT 4500 
ATCTCCTGCA GTCAGCCCTG TGGAGGCCTT TGTGTTTCCC CCAGTGCTGG CAGCCTAGAG 4560 
GCTGAGATGG CCAGAAACAA GGTGGTGACA GTGGCGTGCT CAGGGCTTGG GAAACCCAAG 4620 
GAGCTAAAGG CATGCCCAGG CAACCAAAGA GGACAGGAAG GCTTCTGAGG AGAGACCTCT 4680 
GAGGTGGGTC TTGGAGAGGA AGGACTTAGG GAGGCAGAGT GGAGGAAGTG AGAGGACACC 4740 
CCAAGCCAAG AGGGCGGCAG GACCAAAGGC TCAGAAGCCA GGGCGCTGCA GAGGGGCTGT 4800 
GTGCCACAGG GTGAAGAGTT TGTGTGGCAG AAGGGCAGGG GGCTTGCATC AGGGGTGACA 4860 
GCTGCTCTTT TGTCCCAGCA TAGCCCCTGT ACATCCCTGG AGAGCTGGGG CGTCCACAAC 4920 
TCTAAGTCAC AGCCCCCATC CTAACCCTGG TGGIGCAGTG AGGGTGAGCT GTCTGTGGGC 4980 
AGGAGGGAAG ACTCTTGGAG ATGAGCCTGG TGAAGGGATA ATGGCATCCC GGGCCGAGGA 5040 
G CAGAGGCCTG GGGAGAGTTT AAGGAGTGTA GGGGAGGAAA TGGCAGAAGA 5100 
3AAAGGT TAGGGCAGGT CCTGGAGGAC ATGAGTGGCT GTTTGGGCTT 5160 
G TGGGGGAGCC TTGGCAGGCT TGTGGCTTAG ATAGGTGCTT TAGAAAGCCC 5220 
ACCAGCAGTT GCTGGGCCAC CCCGCTGGCT GGGTCCTGTT CTAAGGCAGG AAATACAAGA 5280 
ATGAGCAGGA AAAGACCCCC TCAAGGCTCA CGTCCTAGTG GGGAGACAAG AAACACAGAT 5340 
GGGCAATATA ACACGATGTC TGGTTCCAGT AAGTGCAGTG AAGAACAAGC GAGGCTGGAT 5400 
GCAGGGAGTG ATGGGAGGGG CTTTGTAAGG GGAGGTCGGG GGAAGCCTGT CTCAGAGGAC 5460 
ACCAGAATGG AGCGCAGGAG CAGCACGTGG CAGTCACATG C-CAGC-CCGTT AGGGCAGAGG 5520 



A TGGCTTGCAG GGGACAGGAG TGGGAGCAGG GAGATAGGAG ACAATGTGTA 
CCAGGACAGC AGAAAGACAT CCCGGGTAGC CTGGAACAGG GAGACGGTGT GGAGATGGTG 
GCAGTCCGAT AATGAGAGCC GTAGGGCAAG GCCAGCAGGA TCCTAGAGTG AGACGGGAGG 
TAAAGTCACC GGGACTTGGT GTCTCCACGT CAGGGGCAGG GGAAAGGGAG AGGACAAGGG 
TGACCCGGGA GGTTAAAGAT GGGACCGGGG CCAGACGCAG TGGCTCATGC CIGTAATCCT 
AGCACTTTGG GAGGCTGAGG CGGGCGGATG GCTTGAGGTC AGGAGTTTGA AACCGGCCTG 
GCCAACATGG TGAAACCCCG TCTCTACTAA AATATACAAA AATTAGCCTG GCGTGGTGGT 
GCATGCCTGC AGTCCCAGCT ATTCAGGAGG CTGAGGCGAC AAGAATCGCT TGAACCTGGG 
AGGCGGAGGT TGCAGTGAGC CGAGATCGCG CCATAGCACT CCAGCCTTAG CCTGGGCGAC 
AGAGCGAGAC CACATC 
VETPPQGSVH SGHLGSWGD PHTGTGNAGE RGPRGKGARV LALDSGGKDS SPSLPLIRTP 
ESSLHEALDQ CMTALDLFLT NQFSEALSYL KPRTKESMYH SLTYATILEH QAMMTFDPQD 
ILLAGNMMKE AQMLCQRHRR KSSVTDSFSS LVNRPTLGQF TEEEIHAEVC YAECLLQRAA 
LTFLQGSSHG GAVRPRALHD PSHACSCPPG PGRQHLFLLQ DENMVSFIKG GIKVRNSYQT 
YKELDSLVQS SQYCKGENHP HFEGGVKLGV GAFNLTLSML PTRILRLLEF VGFSGNKDYG 
LLQLEEGASG HSFRSVLCVM LLLCYHTFLT FVLGTGNVNI ESAEKLLKPY LNRYPKGAIF 
LFFAGRIEVI KGNIDAVSDG GPGRGWGSLG VSQTSRKSGT CDILRDRIDW GRGGGPRENQ 
PESRGRRGPS GRAAWEDKGG GGICGAWDFD WEI 



CCTCCGGGAA AAGCGACCGC 



G CCTGCTGCAG CGAGCAGCCC TGACCTTCCT G 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 
A GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 
3 TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT G 

CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGT 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT C 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG G 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA C 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC O 

S. ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 

A AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 

GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TCCCACTCCC TGCTCTGGAA 1260 

ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 

ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 

TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGCCC TGTGTCTGAA ATACCTGGGC 1440 

CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 

TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 

GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTCCCA AGCAAAACTA CAAGAATTAC 1620 

TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 

CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 

CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 

CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 

TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 

GGCACACCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 

GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 

CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 

AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 

CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 

AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCIACC 2400 

AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 

AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 

CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 



1 11 21 31 41 51 

KEAASGKSDR RTPESSLHEA LDQCMTALDL FLTNQFSEAL SYLKPRTKES MYHSLTYATI 60 

LEMQAMMTFD PQDILLAGNM MKEAQMLCQR HRRKSSVTDS FSSLVNRPTL GQFTEEEIHA 120 

EVCYAECLLQ RAALTFLQDE NMVSFIKGGI KVRHSYQTYK ELDSLVQSSQ YCKGENHPHF 180 

EGGVKLGVGA FNLTLSMLPT RILRLLEFVG FSGNKDYGLL QLEEGASGHS FRSVLCVMLL 240 

LCYHTFLTFV LGTGNVNIEE AEKLLKPYLN RYPKGAIFLF FAGRIEVIKG KIDAAIRRFE 300 

ECCEAQQHWK QFHHMCYWEL MWCFTYKGQW KMSYFYADLL SKENCWSKAT YIYMKAAYLS 360 

MFGKEDHKPF GDDEVELFRA VPGLKLKIAG KSLPTEKFAI RKSRRYFSSN PISLPVPALE 420 

MMYIWKGYAV IGKQPKLTDG ILEIITKAEE MLEKGPENEY SVDDECLVKL LKGLCLKYLG 480 

RVQEAEENFR SISANEKKIK YDHYLIPNAL LELALLLMEQ DRNEEAIKLL ESAKQNYKNY 540 
SMESRTHFRI QAATLQAKSS LENSSRSMVS SVSL 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 31..906 
CGGGTCGACC CACGCGTCCG GC-GAGAAAGG ATGGCCCCCC TCCCGGCGCG GTTGGTCCTG 60 

CTAGCTGGGG CAGCGGCGCT GGCGAGCGGC TCCCAGGGCG ACCGTC-AGCC GGTC-TACCGC 120 

GACTGCGTAC TGCAGTGCGA AGAGCRGAAC TGCTCTGGGG GCGCTCTGAA TCACTTCCGC 180 

TCCCGCCAGC CAATCTACAT GAGTCTAGCA GGCTGGACCT GTCGGC-ACGA CTGTAAGTAT 240 

GAGTGTATGT GGGTCACCGT TCGGCTCTAC CTCCAGGAAG GTCACAAAGT GCCTCAGTTC 300 

CATGGCAAGT GGCCCTTCTC CCGGTTCCTG TTCTTTCAAG AGCCGGCATC GGCCGTGGCC 360 

TCGTTTCTCA ATGGCCTGGC CAGCCTGGTG ATGCTCTGCC GCTACCGCAC CT1CGTGCCA 420 

GCCTCCTCCC CCATGTACCA CACCTGTGTG GCCTTCGCCT GGGTC-TCCCT CAATGCATGG 480 

TTCTGGTCCA CAGTYTTCCA CACCAGGGAC ACTGACCTCA CAGAC-AAAAT GGACTACTTC 540 

TGTGCCTCCA CTGTCATCCT ACACTCAATC TACCTGTGCT GCGTCAGCCT CATCCGCTTC 600 

GACTATGGCT ACAACCTGGT GGCCAACGTG GCTATTGGCC TGGTCAACGT GGTGTGGTGG 660 

CTGGCCTGGT GCCTGTGGAA CCAGCGGCGG CTGCCTCACG TGCGCAAGTG CGTGGTGGTG 720 

C TGCAGGGGCT GTCCCTGCTC GAGCTGCTTG ACTTCCCACC GCTCTTCTGG 780 

G CCCATGCCAT CTGGCACATC AGCACCATCC CTGTCCACGT CCTCTTTTTC 840 

AGCTTTCTGG AAGATGACAG CCTGTACCTG CTGAAGGAAT CAGAGGACAA GTTCAAGCTG 900 

GACTGAAGAC CTTGGAGCGA GTCTGCCCCA GTGGGGATCC TGCCCCCGCC CTGCTGGCCT 960 

CCCTTCTCCC CTCAACCCTT GAGATGATTT TCTCTTTTCA ACTTCTTGAA CTTGGACATG 1020 

AAGGATGTGG GCCCAGAATC ATGTGGCCAG CCCACCCCCT GTTGGCCCTC ACCAGCCTTG 1080 

GAGTCTGTTC TAGGGAAGGC CTCCCAGCAT CTGGGACTCG AGAGTGGGCA GCCCCTCTAC 1140 

CTCCTGGAGC TGAACTGGGG TGGAACTGAG TGTGCTCTTA GCTCTACCGG GAGGACAGCT 1200 

GCCTGTTTCC TCCCCATCAG CCTCCTCCCC ACATCCCCAG CTGCCTGGCT GGGTCCTGAA 1260 

GCCCTCTGTC TACCTGGGAG ACCAGGGACC ACAG3CCTTA GGGATACAGG GGGTCCCCTT 1320 

CTGTTACCAC CCCCCACCCT CCTCCAGGAC ACCACTAGGT GGTGCTGGAT GCTTGTTCTT 1380 

TGGCCAGCCA AGGTTCACGG CGATTCTCCC CATGGGATCT TGAGGGACCA AGCTGCTGGG 1440 

ATTGGGAAGG AGTTTCACCC TGACCRTTGC CCTAGCCAGG TTCCCAGGAG GCCTCACCAT 1500 

ACTCCCTTTC AGGGCCAGGG CTCCAGCAAG CCCAGGGCAA GGATCCTGTG CTGCTGTCTG 1560 

GTTGAGAGCC TGCCACCGTG TGTCGGGAGT GTGGGCCAGG CTGAGTGCAT AGGTGACAGG 1620 

GCCGTGAGCA TGGGCCTGGG TGTGTGTGAG CTCAGGCACT AGGTGCGCAG TGTGGAGACG 1680 

GGTGTTGTCG GGGAAGAGGT GTGGCTTCAA AGTGTTGTGT GTGCAGGGGG TKGGTGTGTT 1740 
AAGCGTGGGT TAGGGGAACG TGTGTGCGCG TGCTGGTGGG CATGTGAGAT G 
CGGTGAATGT GTCCACAGTT GAGAGGTTGG AGCAGGATGA GGGAATCCTG T 
TAATCACTTG TGGAGCGCCA CTTGGCCCAA GACGCCACCT GGGCGGJ 
CCATGGCCAG GCTGCCTGTG TGCATGTTCC CTGTCTGGTG CCCCTTl 

AAACCTCACA GGGTCCCCAC ACAACAGTGC CCTCCAGAAG CAGCCCCTCG GAGGCAGAGG 2040 

AAGGAAAATG GGGATGGCTG GGGCTCTCTC CATCCTCCTT TTCTCCTTGC CTTCGCATGG 2100 

CTGGCCTTCC OCTCCAAAAC CTCCATTCCC CTGCTGCCAG CCCCTTTGCC ATAGCCTGAT 2160 

TTTGGGGAGG AGGAAGGGGC GATTTGAGGG AGAAGGGGAG AAAGCTTATG GCTGGGTCTG 2220 

GTTTCTTCCC TTCCCAGAGG GTCTTACTGT TCCAGGGTGG CCCCAGGGCA GGCAGGGGCC 2280 

ACACTATGCC TGCGCCCTGG TAAAGGTGAC CCCTGCCATT TACCAGCAGC CCTGGCATGT 2340 

TCCTGCCCCA CAGGAATAGA ATGGAGCGAG CTCCAGAAAC TTTCCATCCC AAAGGCAGTC 2400 

TCCGTGGTTG AAGCAGACTG GATTTTTGCT CTGCCCCTGA CCCCTTGTCC CTCTTTGAGG 2460 

GAGGGGAGCT ATGCTAGGAC TCCAACCTCA GGGACTCGGG TGGCCTGCGC TAGCTTCTTT 2520 

TGATACTGAA AACTTTTAAG GTGGGAGGGT GGCAAGGGAT GTGCTTAATA AATCAATTCC 2580 
AAGCCTCAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 92 Protein sequence 
MAGLAARLVL IiAGAAALASG SQGDREPVYR DCVLQCEEQN CSGGALNHFR SRQPIYMSLA 

GWTCRDDCKY ECMWVTVGLY LQEGHKVPQF HGKWPFSRFL FFQEPASAVA SFLNGLASLV 

MLCRYRTFVP ASSPHYHTCV AFAWVSLNAW FWSTVFHTRD TDLTEKMDYF CASTVILHSI 

YLCCVRTVGL QKPAWSAFR ALLLLMLTVH VSYLSLIRFD YGYKLVANVA IGLVNVWWL 

AWCLNMQRRL PHVRKCWW LLLOGLSLLE LLDFPPLFWV LDAEAIWHIS TIPVHV1,FFS 
FLEDDSLYLL KESEDKFKLD 




A GCAGAACTGC TCTGGGGGCG CTCTGAATCA C 
G TCTAGCAGGC TGGACCTGTC GGGACGACTG T 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 
CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 
TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 
T CTCCTGCTGC TCATGCTGAC CGTGCACGTC T 

ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 
AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC A^ 
ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA 0 ~ 
ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 
TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA C 

GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTC-CT CTTAGCTCTA 1260 

CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 

GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 

CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 

GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCAIGGG ATCTTGAGGG 1500 

ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 

GGAGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 

TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 

GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 

CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 

GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTC-GTGG GCATGTGAGA 1860 

TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAC-GATG AGGGAATCCT 1920 

GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 

AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTC-TCTG GTGCCCCTTT 2040 

GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 

3 AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 

\ TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 

GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 2280 

ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 

GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 

AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 

CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 

TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 

CGCTAGCTTC TTTTGATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 94 Protein sequence 



MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRHCWMLV 
LWPAICVHGDS PKGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGP KVCVCRGWVC 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: XM_090469 

Coding sequence: 

1 11 21 31 41 51 

| | | | | | 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AGATAGCTCC 
TGCCGCGTGC TGGACAGGAC GCAGGTGAAC AACCCCTGCC TGCTGGAGGC TGCCAAGCAC 
AGCCCACACC CCTTGCATGG ACAGTCGGTC AACAGCCACC CATCAGGTGC TCATCAGAAA 
CCACCTGAAG TGAAAATGTT TGGAGCCAGT CAAGGTTTGC TGACAATGGA AACAAACCAG 
TCCCTGGCAC AACGCACAGG CTGCTCAGTG TGCTGGGAGG TGAACGGCCC AGGGTTGGAA 
ATGTCTCTTC CTGGAGTGCT CAGTGCTGAT GCTGGCCAGG TGGAC-CACAG AAGACAAATG 
AACTCCGCAG ATACTGACTG GGGAACTGAG GAGGGACCAG GAAGTCTAGA TTTCGCCGTG 
AC TTTCTTTTTG AAAGACATCA AG CACACAAG TACGTTCAGG 



CTCAGCCGCA GCCCGGCGCC GTCCCCACGT ACACCCATGG CGCCCTTCCC GACCTCTGAC 
CGCGAGCTAG ACGCGCCAGG CCCGCCCCCC GGGCTGCGGA GCTCGGCAGC AGCCCCTCAC 
TGCCTGCCCG CGGCTCCAGA AGCCCAGAAA CTCTTCTTCC TGCTGCCCCT TTATCCAGAT 
GGCAGCCCAC CACCTAAGGA CATCTTGCAG ACGCTCCAGC ACAAGGCACC TGGGAAGAGA 
CTGCATCAAG AATCACCAGG TTCATTTCCA ATAGGTTTAT GTAACAGAAC AGCTGAACCA 
G GGGCTCTGGG TGTGGCTGCT GAAGCAGCAT TTCACTTACA GTTTTCCAGT 



MGPGDQGTVE GSLGTSKDSS CRVLDRTQVN KPCLLEAAKH SPHPLHGQSV NSHPSGAHQK 
PPEVKMPGAS QGLLTMETNQ SLAQGTGCSV CWEVNGPGLE MSLPGVLSAD AGQVEHRRQM 
NSADTDWGTE EGPGSLDFAV AAHQEDTFFL KDIKHTSTFR QSVQQQNCIY SPREKPCGNV 
RAPCAPPRRE APLALSRPWR LSRSPAPSPR TPMAPFPTSD RELDAPGPPP GLRSSAAAPH 
CLPAAPEAQK LFFLLPLYPD GSPPPKDILQ TLQHKAPGKR LHQESPGSFP I 
C I KGALGV AA EAAFHLQFSS E 



| | | | | | 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGICGC GACGCCCGGC CCCGCTCGGC 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA C 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 

GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CC-AAGGAAGA 420 

GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AC-TGAAGAGC 480 

TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 

CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC C-GAAACCCAC 600 

TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 

TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAC-TCAGTCT CAGCACGTGT 720 

TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 

AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 

TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 

CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 

GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 

ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 

CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TC-AAGCTTCT ACCTCGCAAA 1200 

TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 

GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 

AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 

ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 

GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 

AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 

GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 

AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 

GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 

AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 

GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 

CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 

TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 

AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 

GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 

CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 

GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 

GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 

GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 

TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 

TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 

CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAC 2640 

CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 

GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 

CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 

AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 

CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 

CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 

GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 

TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 

GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 

GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 

CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 

ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 

CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 

CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 

TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 

ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 

AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 

TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 

GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 

CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 

CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 

CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 

TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 

AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 

CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 

CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 

CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 

CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 

TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACIAAA C-CCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA C-AGGATATC-A 5040 

ACCAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 98 Protein sequence 
Protein Accession #: NP_003465 









| | | | | | 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVP.SG Dm 
SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QBGTDVSLAR NYTVILGHCY 
YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAX KLKSVRGSCG 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 
LIEIAMWDK FYRPLNIRIV LVGVEVKNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFKLPEVRE 
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 



IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVPGVHEC 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 
LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG PQPCQAHLGH LGKGLMRKPP 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 



| | | | | | 

GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180 

TGGCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

ACAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC ACCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGCGCGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGCGG CTGGATTCCA GGACAGGAGT 1500 

GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620 

CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680 

TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TAITTTAACC 1740 

ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220 

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 2280 

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 2340 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTOGTGCC 



1 11 21 31 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSL 

GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK S 
A HARGRVGGLG AQC-PSGSSEW ED3QSEYSDI 300 
AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60 

CSCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCC-CTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCCCAA 300 

CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGC-G ATGTCCACTT 600 

CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTC-C TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAC-ATGACT GCAGGGGCGT 780 

TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 

CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 

CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 

CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500 

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TCTCTCCTGC AGGGGGATGG 1620 

IACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 



GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860 
TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

C CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

3 GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTCIATA AACCITCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 

Seq ID NO: 102 Protein sequence 
Protein Accession #: NP_005931 

1 11 21 31 41 51 
HAPAAWLRSA AARALLPPML LLLLQPPPLL AHALPPDVHH LHAERRGPQP WHAALPSSPA 60 
PAPATQEATR PASSLRPPRC GVPDPSDGLS ARMRQKRFVL SGGRWEKTDL TYRILRFPNQ 120 
LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWKGDDLPF DGPGGILAHA 180 
FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGJjQHTTAAK h" ~ " 

YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN E " 

VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRIIWQGLP SPVDAAFEDA QGHIWFFQGA 
QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 
PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 
AEPANTFL 

Seq ID NO: 103 DNA sequence 



ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 

TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 

GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCIGT 300 

CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 

A CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 

VTTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 

GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCC- TGGGGGAAGT 540 

ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 

AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 

TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCICCACIG TCAGTCCATG 720 

ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 

:TTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 

: ATCTGCTTCT GCATTC-CCAG TGTACTCGGG CCAATATTGA 900 

TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 

GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 

ATCAGGT TCCGACCA3C T( TT r i C rTTGCCTTTG 1080 

T CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGC-CC ATCAGCTTCT 1140 

TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 

CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACIT CAITATTGGA IACACTGCAT 1260 

TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 

CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 

AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGC-A GAAGTGCGGG CTTGTCCAGA 1500 

GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 

TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTCCAGCATG CTGGCCTCCT 1620 

TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 

CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 

TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 

AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT f GCATGCTT 1860 

CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 

GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAC-GG GATGATGTTA GGGGTCTGCG 1980 

GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 

TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 

TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160

TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 

TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 

TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340

TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 

AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 

TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 

GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 

ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640

AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 

AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 

TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 

TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 

ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 

TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 

TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 

TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 

T GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180 

3ATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 

VATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 

T CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 

ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 

AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 

GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540

TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 

TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 

AGTTCACGGC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720

TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG A " 
TCCGCATGCC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC 
TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTC 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 
ACCACCACAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 



TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 

ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 

AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 

GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 

TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 

GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 

TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 

TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 

CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC

MTRKRTYWVP NSSGGLVNEG IDIGDDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP
WGKYDAALRT MIPPRPKPRF PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL 
SVHDASDKNV QRLHRLWEEE VSRRGIEKAS VLLVMLRFQR TRLIFDALLG ICFCIASVLG 

PILIIPKILE YSEEQLGNW HGVGLCFALF LSECVKSLSF SSSWIINQRT A 

FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCAE 
YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIX L 
FAKIIEDLRR KERKLLEKCG LVQSLTSITL FIIPTVATAV WVLIHTf 
LASLNLLRLS VFFVPIAVKG LTNSKSAVMR FKKFFLQESP VFYVQTLQDP S 
LSWQQTCPGI VNGALELERN GHASEGMTRP RDALGPEEEG NSLGPELHKI NLVVSKGMML 540
GVCGNTGSGK SSLLSAILEE MHLLEGSVGV QGSLAYVPQQ AWIVSGNIRE NILMGGAYDK 600 
ARYLQVLHCC SLNRDLELLP FGDMTEIGER GLNLSGGQKQ RISLARAVYS DRQIYLLDDP 660 
LSAVDAHVGK HIFEECIKKT LRGKTVVLVT HQLQYLEFCG QIILLENGKI CENGTHSELM 720
QKKGKYAQLI QKMHKEATSD MLQDTAKIAE KPKVESQALA ISLSESLNGN AVPEHQLTQE 780 
EEMEEGSLSW RVYHHYIQAA GGYMVSCIIF FFVVLIVFLT IFSFWWLSYW LEQGSGTNSS 840
RESNGTMADL GNIADNPQLS FYQLVYGLKA LLLICVGVCS SGIFTKVTRK ASTALHKKLF 900 
NKVFRCPMSF FDTIPIGRLL NCFAGDLEQL DQLLPIFSEQ FLVLSLMVIA VLLIVSVLSP 960 
YILLMGAIIM VICFIYYMMF KKAIGVFKRL ENYSRSPLFS KILNSLQGLS SIHVYGKTED 1020 
FISQFKRLTD AQNNYLLLFL SSTRWMALRL EIMTNLVTLA VALFVAFGIS STPYSFKVMA 1080 
VNIVLQLASS FQATARIGLE TEAQFTAVER ILQYMKMCVS EAPLHMEGTS CPQGWPQHGE 1140
IIFQDYHMKY RDNTPTVLHG IKLTIRGHEV VGIVGRTGSG KSSLGMALFR LVEPMAGRIL 1200
IDGVDICSIG LEDLRSKLSV IPQDPVLLSG TIRFNLDPFD EHTDQQIWDA LERTFLTKAI
SKFPKKLHTD VVENGGNFSV GERQLLCIAR AVLRNSKIIL IDEATASIDM ETDTLIQRTI
REAFQGCTVL VIAHRVTTVL NCDHILVMGN GKVVEFDRPE VLRKKPGSLF AALMATATSS



Seq ID NO: 105 DNA sequence 

Eos sequence 

1 11 21 31 41 51
A CTGGGTGCCC AACTCTTCTG G""

ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 
GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 
TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT
CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 
3 CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 
CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 
GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 
C ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG 



3CCCTTTTT CTCTCCGAAT GTGTGAAGTC 



CTGTTTGAAG 

CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT C

CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 



GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA 
G TATGGAAAGT 



ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260

CTCCAGGAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320

TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 

GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440

CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGCGGCAAC AGGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1680

GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740

CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGCGGGGCCT CAACCTCTCT 1800 

GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 

CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980

TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 2220

CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGCT CTACCACCAC 2280 

TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT CGTGGTGCTG 2340

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 

GGGACCAATA GCAGCCGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 

CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTTCTTTGA CACCATCCCA 2640

ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 

AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 

TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2880 

TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940

GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA GAATAACTAC 3000 

CTGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 

3 GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120
3180 

T TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240

AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 



AAGTGGTGGG CATCGTGGGA 



GCAGGCCGGA TTCTCATTGA CGGCGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540

TCCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600

CTAGATCCCT TTGACCGTCA CACTGACCAG CAGATCTGGG ATGCCTTGGA GAGGACATTC 3660 

CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAACGGT 3720 

GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTCGCAAC 3780 

TCCAAGATCA TCCTTATCGA TGAAGCCACA GCCTCCATTG ACATGGAGAC AGACACCCTG 3840 

A AGCCTTCCAG GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900

G LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 

WGKYDAALRT MIPPRPKPRP PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL 120

SVHDASDKNV QRLHRLWEEE VSRRGIEKAS VLLVMLRFQR TRLIFDALLG ICFCIASVLG 180

PILIIPKILE YSEEQLGNVV HGVGLCFALF LSECVKSLSF SSSWIINQRT AIRFRAAVSS 240

FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 

YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIK LIKMYTWEKP 360

FAKIIEGMES LTFCSKPGDG MAFSMLASLN LLRLSVFFVP IAVKGLTNSK SAVMRFKKFF 420 

LQESPVFYVQ TLQDPSKALV FEEATLSWQQ TCPGIVNGAL ELERNGHASE GMTRPRDALG 480 

PEEEGNSLGP ELHKINLVVS KGMMLGVCGN TGSGKSSLLS AILEEMHLLE GSVGVQGSLA 540

YVPQQAWIVS GNIRENILMG GAYDKARYLQ VLHCCSLNRD LELLPFGDMT EIGERGLNLS 600 

GGQKQRISLA RAVYSDRQIY LLDDPLSAVD AHVGKHIFEE CIKKTLRGKT VVLVTHQLQY 660

LEFCGQIILL ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 

SQALATSLEE SLNGNAVPEH QLTQEEEMEE GSLSWRVYHH YIQAAGGYMV SCIIFFFVVL 780

IVFLTIFSFW WLSYWLEQGS GTNSSRESNG TMADLGNIAD NPQLSFYQLV YGLNALLLIC 840 

VGVCSSGIFT KVTRKASTAL HNKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQLLP 900 

IFSEQFLVLS LMVIAVLLIV SVLSPYILLM GAIIMVICFI YYMMFKKAIG VFKRLENYSR 960

SPLFSHILNS LQGLSSIHVY GKTEDFISQF KRLTDAQNNY LLLFLSSTRW MALRLEIMTN 1020 

LVTLAVALFV AFGISSTPYS FKVMAVNIVL QLASSFQATA RIGLETEAQF TAVERILQYM 1080

KMCVSEAPLH MEGTSCPQGW PQHGEIIFQD YHMKYRDNTP TVLHGIKLTI RGHEVVGIVG 1140

RTGSGKSSLG MALFRLVEPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGTIRFN 1200 

LDPFDRHTDQ QIWDALERTF LTKAISKFPK KLHTDVVENG GNFSVGERQL LCIARAVLRN 1260

SKIILIDEAT ASIDMETDTL IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVMGNGKVVE 1320
FDRPEVLRKK PGSLFAALMA TATSSLR 



| | | | | |

ACCGGGCACC GGACGGCTCG GGTACTTTGG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTCCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960

ACTGCTGCAC ACTGTCTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCSiA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 



MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDSYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180

VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240
I QVGLVSLLDN PAPSHLVEKI
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR L'"" 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 



1 11 21 31 41 51 

| | | | | |

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 
CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 
CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180 
ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 
AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300
GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 360 
GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420 
GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 
GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA 540 
CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600 
CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660 
GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720 
GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780 
GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840 
CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 
GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 
ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020 
riGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 
3 GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140 
___iTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 
CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 
GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 
AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 
GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 
GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500
CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560 
ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA 5 K i Hi, TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680
GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740
i TGT CTGCTT1 1 ATTC T(_ T( ( AAAGCTT ACCCAGCAAT A ~ " " 

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCC" 
TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT G 
CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980
ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 
CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 
CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160 
TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 
AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280
AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 
AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400
CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 
TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520
ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 
AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640
TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 
CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2760 
CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 
CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880 
GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940 
AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000 
CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3060
ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120
TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 
AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG 

Seq ID NO: 110 Protein sequence
MLPQIPFLLL VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG
TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 
VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTC- - P E IGLE 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV KPEGFIKAGQ RPSLSGTPLV 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ KYDPRTGIFT CQIPGIYYFS 

YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN
GLYSSEYVHS SFSGFLVAPM

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCK3GGC TTTCTGCTTA CTCACTCCTT 60

CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 

ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 

TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 

GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300

CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360

TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 

GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480

GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540

GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600

T GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660

C TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 

I TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 

TCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 

GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

\ ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 

A CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 

3ATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 

GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140

AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260

TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 

ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380

AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 

CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 

CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT C
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGC 

CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1800 

CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860

AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 

ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980

CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040

ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100

TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 

ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 

AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280

TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 

AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 

AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 

QCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 

AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580

AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 

AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

Seq ID NO: 112 Protein sequence
Protein Accession #: NP_000940 

1 11 21 31 41 51
MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60

SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNAXNQMG SSFSDELYVD 120 

VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLY2I RLXPEKAAEW 180 

EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 

VAVLSAVICL IIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 

FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360

SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420

PSQHKPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 

SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 

ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600

KCRLQLGGLD YLDPACFTHS FH

Seq ID NO: 113 DNA sequence 



ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC T 
CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA G 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 
CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 
GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC
AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG C
ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCT 

rCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 
A CCAACTGTTG CTTGCCGGAA GGGAC CATG A A 
CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC A 
C ATCCCCCATA CGTGGGGTAC ACGGTGCAGC A 



ACAGTGAACA GAAGATGTAC CCAGCGGTGA 



Protein Accession #: XP_062S11 

1 11 21 31 41 51
MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG
GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 
VGSVFVAFII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 
SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 
QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTOSEQKMY PAVTV 

Seq ID NO: 115 DNA sequence 



1 11 21 31 41 51
GGTGTGCTCT TGAGGGATTA AATGCAAAGA GATCACACCA TGGACTACAA GGAAAGCTGC
CCAAGTGTAA GCATTCCCAG CTCCGATGAA CACAGAGAGA AAAAGAAGAG GTTTACTGTT 
TATAAAGTTC TGGTTTCAGT GGGAAGAAGT GAATGGTTTG TCTTCAGGAG ATATGCAGAG

TTTGATAAAC TTTATAACAC TTTAAAAAAA CAGTTTCCTG CTATGGCCCT GAAGATTCCT

GCCAAGAGAA TATTTGGTGA TAATTTTGAT CCAGATTTTA TTAAACAAAG ACGAGCAGGA
CTAAACGAAT TCATTCAGAA CCTAGTTAGG TATCCAGAAC TTTATAACCA TCCAGATGTC
AGAGCATTCC TTCAAATGGA CAGTCCAAAA CACCAGTCAG A 
GAAAGAAGTT CTCAGAAGCT ACACTCTACC TCACAGAACA TCAACCTGGG A 

AATCCTCATG CCAAACCAAC TGACTTTGAT TTCTTAfi
GGCAAGGTTC TTCTTGCAAA ACGG AAACTG GATGGAJ 
CAGAAAAAAA TAGTTCTCAA CAGAAAAGAG CAAAAACATA T 
CTCTTGAAAA ATGTGAAACA TCCGTTTTTG GTTGGATTGC ATTATTCCTT CCAAACAACT 



AGAGAACGGT CCTTTCCTGA GCACAGAGCT AGGTTTTACG CTGCTGAAAT TGCTAGTGCA



ATTTCTGACA CCACTACCAC ATTTTGTGGG 
AGAAAACAGC CCTATGACAA TACTGTAGAT TGGTGGTGCC . 
ATGCTGTATG GATTGCCTCC TTTTTATTGC CGAGATGTTG C

CTTCACAAAC CCCTAAGTTT GAGGCCAGGA GTGAGTCTTA CAGCCTGGTC C 

GAACTCCTAG AAAAAGACAG GCAAAATCGA CTTGGTGCCA AGGAAGACTT TCTTGAAATT 1260

CAGAATCATC CTTTTTTTGA ATCACTCAGC TGGGCTGACC TTGTACAAAA GAAGATTCCA 1320 

CCACCATTTA ATCCTAATGT GGCTGGACCA GATGATATCA GAAACTTTGA CACAGCATTT 1380 

ACAGAAGAAA CAGTTCCATA TTCTGTGTGT GTATCTTCTG ACTATTCTAT AGTGAATGCC 1440

AGTGTATTGG AGGCAGATGA TGCATTCGTT GGTTTCTCTT ATGCACCTCC TTCAGAAGAC 1500 

TTATTTTTGT GAGCAGTTTG CCATTCAGAA ACCATTGAGC AAAATAAGTC TATAGATGGG 1560 

ACTGAAACTT CTATTTGTGT GAATATATTC AAATATGTAT AACTAGTGCC TCATTTTTAT 1620 

ATGTAATGAT GAAAACTATG AAAAAATGTA TTTTCTTCTA TGTGCAAGAA AAATAGGGCA 1680 

TTTCAAAGAG CTGTTTTGAT TAAAATTTAT ATTCTTGTTT AATAAGCTTA TTTTTAAACA 1740

ATTTAAAAGC TATTATTCTT AGCATTAACC TATTTTTAAA GAAACCTTTT TTGCTATTGA 1800 

CTGTTTTTTC CCTCTAAGTT TACACTAACA TCTACCCAAG ATAGACTGTT TTTTAACAGT 1860 

CAATTTCAGT TCAGCTAACA TATATTAATA CCTTTGTAAC TCTTTGCTAT GGCTTTTGTT 1920

ATCACACCAA AACTATGCAA TTGGTACATG GTTGTTTAAG AAGAAACCGT ATTTTTCCAT 1980 

GATAAATCAC TGTTTGAAAT ATTTGGTTCA TGGTATGATC GAAATGTAAA AGCATAATTA 2040

ACACATTGGC TGCTAGTTAA CAATTGGAAT AACTTTATTC TGCAGATCAT TTAAGAAGTA 2100 

ACAGGCCGGG CGCGGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGCGGGC 2160 

AGATCACCTG AGGTCAGGAG TTGGAGACCA GCCTGACCAA CATGGACAAA CCCCGTCTCT 2220 

ACTAAAAATA CAAAATTGGC AGGGTGTGGT GGCACATGCC TATAATCCCA GCTACTTGGG 2280 

AGGCTAAGGC AGGAGAATCG CTTGAACCCG GGAGGCGGAG GTTGCAGTGA GCCGAGATCX3 2340
CACCATTGCA CTCCTGCCTG GGCAACAAGA GTGAAACTCC ATCTCCAAAA A 



1 11 21 31 41 51

I I I I I 

■ -JFDPDFI KQRKAGLNEF IQNLVRYPEL YNHPDVRAFL QMESPKHQSD 

PSEDEDERSS QKLHSTSQNI NLGPSGNPHA KPTDFDFLKV IGKGSFGKVL LAKRKLDGKF 
YAVKVLQKKI VLNRKEQKHI MAERNVLLKM VKHPFLVGLH YSFQTTEKLY FVLDFVNGGE 
LFFHLQRERS FPEHRARFYA AEIASALGYL HSIKIVYRDL KPENILLDSV GHWLTDFGL 
CKEGIAISDT TTTFCGTPEY LAPEVIRKQP YDNTVDWWCL GAVLYEMLYG LPPFYCRDVA 
EMYDNILHKP LSLRPGVSLT AWSILEELLE KDRQNRLGAK EDFLEIQNHP FFESLSWADL
VQKKIPPPFN PMVAGPDDIR NFDTAFTEET VPYSVCVSSD YSIVNASVLE A 
APPSEDLFL 

Seq ID NO: 117 DNA sequence 
Nucleic Acid Accession #: NM_004004.1

| | | | | |

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC "
T CCTCTTCATT TTTCGCATTA TGATCCTCGT 
A GCAGGCCGAC TTTGTCTGCA ACACCCTGCA 
AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG 
CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA C< 
GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAJ

ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC A 
TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 
TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 
TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 
ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG



MDWGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILWAA KETOGDEQAD FVCNTLQPGC 
KNVCYDHYPP ISHIRLWALQ LIFVSSPALL VAMHVAYRKH EKKEKFIKGE IKSEFKDIEE 
IKTQKVRIEG SLWWTYTSSI FFRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC
L CYLLIRYCSG KSKKPV 



Seq ID NO: 119 DNA sequence
Nucleic Acid Accession #: XM
Coding sequence: 1..2431



GAAATTTGGC TTCTCCAAGA GCCGCCCGGG 



CCCGGGTACC 

3 AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 
jVTGTAA GCAAAGAAAC CATCGGGAAG 
C TCGGCTGCAG TGGACATCAT GTTTCTGTTA 
C TTTGAAAGGT CCAAGCACTT TGCCATCACA 
G AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 

TCCACTCCTC ATCTGGAATT C 

AGAATCAAGA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC T 
CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 
GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 
GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCATGCACTG 
AG( ,\3C CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 
GGCCTCTTCA GCACCCTCAG CAGCTCGGCC ATCTGCTCCA GCGCCACGCC A 
CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC C 
CAGCCCTGCC AGAATGGAGG CACATGTGTT CCAGAAGGAC TGGACGGCTA CCAGTGCCTC 
TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 
GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 
GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 
G CCACATACAG CAGGGAGCTG CTGGTGGCGG TGCCTGTGGG G 



GATGTGCCTG ACCTGGTCTG G 



GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GGTCTACTCG 1560

GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG 1620

CGGCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGCCTCA 1680

GTAGGGCCCG AGAATTTTGC TCAGATGCAG AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT 1740 

GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800 

GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1860 

CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920

C AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGGTGGT GCTCACAGGC 1980 

3 CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040

3 TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC 2100 

CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220 

ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280 

TGGGAGGGCC CCCACTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCAG 2340 

GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400 

ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460 
GTCTGTGCCC CAGGTCCTTA G 



DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPELEFPLD SFSTQQEVKA
RIKRMVFKGG RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 
C ASEPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 
•I GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR
JVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 
h TGSALRQAAE RGFGSATRTG QDRPRRWVL LTESHSEDEV

LLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 

RPGCRTQALD LVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVNPDVTQVG LVVYGSQVQT
AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAVVVLTG
GRGAEDAAVP AQKLRNNGIS VLVVGVGPVL SEGLRRLAGP RDSLIHVAAY ADLRYHQDVL
IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP 

Seq I 



Nucleic Acid Accession #: Eos sequence



1 11 21 31 41 51

| | | | | |

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGG3GAGTAC 1200

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGC< &CGTA J I GTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGI TC < 1C 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340 

CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 2400 
AATGTCTGTG CCCCAGGTCC TTAG 

Seq ID NO: 122 Protein sequence 



I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 
RTPPSNYREG LGTEMVPTFW NVCAPGP 

Seq ID NO: 123 DNA sequence 

Eos sequence 



TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA
AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCGG CTGCAGTGGA
CATCATGTTT CTGTTAGATG GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 
GCACTTTGCC ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG TCAGAGTGGG 
AGCATTCCAG TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCAT TTTCAACCCA
ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA
ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG CTTCTGTGCC 
CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGG GATGTGGCAC TGCCATCCAA 
GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTTTC CCAGGTGGGA 
GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 
GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGCGC 
CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC AGGACGCTGG AGATGGTCCG 






CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020 

TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG GCCTTTGGAG GGGAGGCTAA 1080

TGCGGGCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC GGTTTGTGCG 1200 

GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

T GACCTGGTCT GGAGCCTCGA 1320 



GCGTGGCTTC G 

GCTCACTGAG TCACACTCCG AGGATGAGGT TGCGGGCCCA GCGCGTCACG CAAGGGCGCG 1500

AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560 

AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620

TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCCTGGA 1680 

: GAGAATTTTG CTCAGATGCA 1740 



TGTCCCCAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG C 
TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGCG 

CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG TGGCAGCTTA 



GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280 

TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG AGAACCGATT 2340 

CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 2400 

AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460

CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC ACTGAGGGAG 2520 

GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580

AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG GGGTCCTGAA 2700 

2760 



Protein Accession #: Eos sequence



MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660

SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYKQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 



Seq ID NO: 125 DNA sequence 
Coding sequence: 145..1260






WO 03/042661 



PCT/US02/36810 






AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 540 

ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 600 

GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 660

CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 780 

ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900

GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 960

TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 1020 

C CGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 1140

G ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 1200 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTGAG TTABAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920

AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 

Seq ID NO: 126 Protein sequence 
Protein Accession # : NP_11414a.l 

1 11 21 31 41 51 

I I I I I | 

MDARRVPQKD LRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDNPANTR LQSVHEGCRT 60

RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120

NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 

TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

PVEEITEEEL ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300

LRNRYGEEVR DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 360
KSLKQEFEMQ A 

Seq ID NO: 127 DNA sequence

Nucleic Acid Accession #: AF305616.1 

Coding sequence: 1..863 

i i 1 T T T T 



1 i i i i i 

ATGCACCGCT TGATGGGGGT CAACAGCACC GCCGCCGCCG CCGCCGGGCA GCCCAATGTC
TCCTGCACGT GCAACTGCAA ACGCTCTTTG TTCCAGAGCA TGGAGATCAC GGAGCTGGAG 
TTTGTTCAGA TCATCATCAT CGTGGTGGTG ATGATGGTGA TGGTGGTGGT GATCACGTGC 
CTGCTGAGCC ACTACAAGCT GTCTGCACGG TCCTTCATCA GCCGGCACAG CCAGGGGCGG 
AGGAGAGAAG ATGCCCTGTC CTCAGAAGGA TGCCTGTGGC CCTCGGAGAG CACAGTGTCA 

GGCAACGGAA TCCCAGAGCC GCAGGTCTAC GCCCCGCCTC GGCCCACCGA CCGCCTGGCC
GTGCCGCCCT TCGCCCAGCG GGAGCGCTTC CACCGCTTCC AGCCCACCTA TCCGTACCTG 
CAGCACGAGA TCGACCTGCC ACCCACCATC TCGCTGTCAG ACGGGGAGGA GCCCCCACCC 
TACCAGGGCC CCTGCACCCT CCAGCTTCGG GACCCCGAGC AGCAGCTGGA ACTGAACCGG 

GAGTCGGTGC GCGCACCCCC AAACAGAACC ATCTTCGACA GTGACCTGAT GGATAGTGCC

AGGCTGGGCG GCCCCTGCCC CCCCAGCAGT AACTCGGGCA TCAGCGCCAC GTGCTACGGC
AGCGGCGGGC GCATGGAGGG GCCGCCGCCC ACCTACAGCG AGGTCATCGG CCACTACCCG
GGGTCCTCCT TCCAGCACCA GCAGAGCAGT GGGCCGCCCT CCTTGCTGGA GGGGACCCGG
CTCCACCACA CACACATCGC GCCCCTAGAG AGCGCAGCCA TCTGGAGCAA AGAGAAGGAT

AAACAGAAAG GACACCCTCT CTAG

Seq ID NO: 128 Protein sequence 
Protein Accession #: AAL09357.1 

75 t i 1 r r r t 

MHRLMGVNST AAAAAGQPNV SCTCNCKRSL FQSMEITELE FVQIIIIVVV MMVMVVVITC
LLSHYKLSAR SFISRHSQGR RREDALSSEG CLWPSESTVS GNGIPEPQVY APPRPTDRLA
VPPFAQRERF HRFQPTYPYL QHEIDLPPTI SLSDGEEPPP YQGPCTLQLR DPEQQLELNR
ESVRAPPNRT IFDSDLMDSA RLGGPCPPSS NSGISATCYG SGGRMEGPPP TYSEVIGHYP
GSSFQHQQSS GPPSLLEGTR LHHTHIAPLE SAAIWSKEKD KQKGHPL



1 11 21 31 41 51

I I I I I I 

ATGGCGGCGG CTCCGCTGCT GCTGCTGCTG CTGCTCGTGC CCGTGCCGCT GCTGCCGCTG 
CTGGCCCAAG GGCCCGGAGG GGCGCTGGGA AACCGGCATG CGGTGTACTG GAACAGCTCC 
AACCAGCACC TGCGGCGAGA GGGCTACACC GTGCAGGTGA ACGTGAACGA CTATCTGGAT 
ATTTACTGCC CGCACTACAA CAGCTCGGGG GTGGGCCCCG GGCCGGGACC GGGGCCCGGA
GGCGGGGCAG AGCAGTACGT GCTGTACATG GTGAGCCGCA ACGGCTACCG CACCTGCAAC 
GCCAGCCAGG GCTTCAAGCG CTGGGAGTGC AACCGGCCGC ACGCCCCGCA CAGCCCCATC 
AAGTTCTCGG AGAAGTTCCA GCGCTACAGC GCCTTCTCTC TGGGCTACGA GTTCCACGCC 
GGCCACGAGT ACTACTACAT CTCCACGCCC ACTCACAACC TGCACTGGAA GTGTCTGAGG 
ATGAAGGTGT TCGTCTGCTG CGCCTCCACA TCGCACTCCG GGGAGAAGCC GGTCCCCACT
CTCCCCCAGT TCACCATGGG CCCCAATGTG AAGATCAACG TGCTGGAAGA CTTTGAGGGA 
GAGAACCCTC AGGTGCCCAA GCTTGAGAAG AGCATCAGCG GGACCAGCCC CAAACGGGAA 
CACCTGCCCC TGGCCGTGGG CATCGCCTTC TTCCTCATGA CGTTCTTGGC CTCCTAG 

Seq ID NO: 130 Protein sequence



L LAQGPGGALG NRHAVYWNSS NQHLRREGYT VQVNVNDYLD 60 

G GGAEQYVLYM VSRNGYRTCN ASQGFKRWEC NRPHAPHSPI 120 

KFSEKFQRYS AFSLGYEFHA GHEYYYISTP THNLHWKCLR MKVFVCCAST SHSGEKPVPT 180 
LPQFTMGPNV KINVLEDFEG ENPQVPKLEK SISGTSPKRE HLPLAVGIAF FLMTFLAS

Seq ID NO: 131 DNA sequence

Nucleic Acid Accession #: NM_012445.1

Coding sequence: 27S..1271 

1 11 21 31 41 51 

I I I I I I 

GCACGAGGGA AGAGGGTGAT CCGACCCGGG GAAGGTCGCT GGGCAGGGCG AGTTGGGAAA 60 

GCGGCAGCCC CCGCCGCCCC CGCAGCCCCT TCTCCTCCTT TCTCCCACGT CCTATCTGCC 120 

TCTCGCTGGA GGCCAGGCCG TGCAGCATCG AAGACAGGAG GAACTGGAGC CTCATTGGCC 180 

GGCCCGGGGC GCCGGCCTCG GGCTTAAATA GGAGCTCCGG GCTCTGGCTG GGACCCGACC 240

GCTGCCGGCC GCGCTCCCGC TGCTCCTGCC GGGTGATGGA AAACCCCAGC CCGGCCGCCG 300 

CCCTGGGCAA GGCCCTCTGC GCTCTCCTCC TGGCCACTCT CGGCGCCGCC GGCCAGCCTC 360 

TTGGGGGAGA GTCCATCTGT TCCGCCAGAG CCCCGGCCAA ATACAGCATC ACCTTCACGG 420

GCAAGTGGAG CCAGACGGCC TTCCCCAAGC AGTACCCCCT GTTCCGCCCC CCTGCGCAGT 480 

GGTCTTCGCT GCTGGGGGCC GCGCATAGCT CCGACTACAG CATGTGGAGG AAGAACCAGT 540

ACGTCAGTAA CGGGCTGCGC GACTTTGCGG AGCGCGGCGA GGCCTGGGCC CTGATGAAGG 600

AGATCGAGGC GGCGGGGCAG GCGCTGCAGA GCGTGCACGC GGTGTTTTCG GCGCCCGCCG 660 

TCCCCAGCGG CACCGGGCAG ACGTCGGCGG AGCTGGAGGT GCAGCGCAGG CACTCGCTGG 720 

TCTCGTTTGT GGTGCGCATC GTGCCCAGCC CCGACTGGTT CGTGGGCGTG GACAGCCTGG 780 

ACCTGTGCGA CGGGGACCGT TGGCGGGAAC AGGCGGCGCT GGACCTGTAC CCCTACGACG 840 

CCGGGACGGA CAGCGGCTTC ACCTTCTCCT CCCCCAACTT CGCCACCATC CCGCAGGACA 900 

CGGTGACCGA GATAACGTCC TCCTCTCCCA GCCACCCGGC CAACTCCTTC TACTACCCGC 960

GGCTGAAGGC CCTGCCTCCC ATCGCCAGGG TGACACTGGT GCGGCTGCGA CAGAGCCCCA 1020 

GGGCCTTCAT CCCTCCCGCC CCAGTCCTGC CCAGCAGGGA CAATGAGATT GTAGACAGCG 1080 

CCTCAGTTCC AGAAACGCCG CTGGACTGCG AGGTCTCCCT GTGGTCGTCC TGGGGACTGT 1140 

GCGGAGGCCA CTGTGGGAGG CTCGGGACCA AGAGCAGGAC TCGCTACGTC CGGGTCCAGC 1200 

CCGCCAACAA CGGGAGCCCC TGCCCCGAGC TCGAAGAAGA GGCTGAGTGC GTCCCTGATA 1260

ACTGCGTCTA AGACCAGAGC CCCGCAGCCC CTGGGGCCCC CGGAGCCATG GGGTGTCGGG 1320 

GGCTCCTGTG CAGGCTCATG CTGCAGGCGG CCGAGGCACA GGGGGTTTCG CGCTGCTCCT 1380 

GACCGCGGTG AGGCCGCGCC GACCATCTCT GCACTGAAGG GCCCTCTGGT GGCCGGCACG 1440 

GGCATTGGGA AACAGCCTCC TCCTTTCCCA ACCTTGCTTC TTAGGGGCCC CCGTGTCCCG 1500 

TCTGCTCTCA GCCTCCTCCT CCTGCAGGAT AAAGTCATCC CCAAGGCTCC AGCTACTCTA 1560 

AATTATGGTC TCCTTATAAG TTATTGCTGC TCCAGGAGAT TGTCCTTCAT CGTCCAGGGG 1620 

CCTGGCTCCC ACGTGGTTGC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTGAGCCCA 1680 

CTCTCCCGAG GGCGCATCCA AGCGGGGGCC ACTTGAGAAG TGAATAAATG GGGCGGTTTC 1740 

GGAAGCGTCA GTGTTTCCAT GTTATGGATC TCTCTGCGTT TGAATAAAGA CTATCTCTGT 1800 



I I I I I 

h GKALCALLLA TLGAAGQPLG GESICSARAP AKYSITFTGK WSQTAFPKQY 

PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDFAER GEAWALMKEI EAAGQALQSV

HAVFSAPAVP SGTGQTSAEL EVQRRHSLVS FVVRIVPSPD WFVGVDSLDL CDGDRWREQA

ALDLYPYDAG TDSGFTFSSP NFATIPQDTV TEITSSSPSH PANSFYYPRL KALPPIARVT 

LVRLRQSPRA FIPPAPVLPS RDNEIVDSAS VPETPLDCEV SLWSSWGLCG GHCGRLGTKS
RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 

Seq I! 



Nucleic Acid Accession #: NM_019894 



ATGTTACAGG ATCCTGACAG TGATCAACCT 
AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AG6TGA1 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG T 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC C 

;CCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 

T TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT

3 GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 






C CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 
3 GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 
T GGAGGGAGCA TCCTGGACCC CCACTGGGTC 
CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 
GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT C 



I CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

C T CTGG ATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA



1 11 21 31 41 51 

I I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS
GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPXDND IALMKLQFPL
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 

Seq ID NO: 135 DNA sequence 
Nucleic Acid Accession #: NM_003045
Coding sequence: 148..2037



CGATCCTGCC GGAGCCCCGC CGCCGCCGGC TTGGATTCTG AAACCTTCCT TGTATCCCTC 
CTGAGACATC TTTGCTGCAA GATCGAGGCT GTCCTCTGGT GAGAAGCTGG TGAGGCTTCC
CGTCATATTC CAGCTCTGAA CAGCAACATG GGGTGCAAAG TCCTGCTCAA CATTGGGCAG 
C GGCGGAAGGT GGTGGACTGT AGCCGGGAGG AGACGCGGCT GTCTCGCTGC 



ATTCTCATCT TGACAGGACT TTTAACTCTT GGTGTGAAAG AGTCGGCCAT GGTCAACAAA 
ATATTCACTT GTATTAACGT CCTGGTCCTG GGCTTCATAA TGGTGTCAGG ATTTGTGAAA 
GGATCGGTTA AAAACTGGCA GCTCACGGAG GAGGATTTTG GGAACACATC AGGCCGTCTC 

TTCTCTGGTG TCCTGTCGGG GGCAGCGACT TGCTTCTATG CCTTCGTGGG CTTTGACTGC 
ATCGCCACCA CAGGTGAAGA GGTGAAGAAC C 
GCGTCCCTCT TGATCTGCTT CATCGCC 

ATGCCCTACT TCTGCCTGGA CAATAACAGC CCCCTGCCCG ACGCCTTTAA GCACGTGGGC 1140 

TGGGAAGGTG CCAAGTACGC AGTGGCCGTG GGCTCCCTCT GCGCTCTTTC CGCCAGTCTT 1200 

CTAGGTTCCA TGTTTCCCAT GCCTCGGGTT ATCTATGCCA TGGCTGAGGA TGGACTGCTA 1260

TTTAAATTCT TAGCCAACGT CAATGATAGG ACCAAAACAC CAATAATCGC CACATTAGCC 1320 

TCGGGTGCCG TTGCTGCTGT GATGGCCTTC CTCTTTGACC TGAAGGACTT GGTGGACCTC 1380

ATGTCCATTG GCACTCTCCT GGCTTACTCG TTGGTGGCTG CCTGTGTGTT GGTCTTACGG 1440

TACCAGCCAG AGCAGCCTAA CCTGGTATAC CAGATGGCCA GTACTTCCGA CGAGTTAGAT 1500 

CCAGCAGACC AAAATGAATT GGCAAGCACC AATGATTCCC AGCTGGGGTT TTTACCAGAG 1560

GCAGAGATGT TCTCTTTGAA AACCATACTC TCACCCAAAA ACATGGAGCC TTCCAAAATC 1620 

TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATCAT CACCTTCTGC 1680

ATTGTGACCG TGCTTGGAAG GGAGGCTCTC ACCAAAGGGG CGCTGTGGGC AGTCTTTCTG 1740 

CTCGCAGGGT CTGCCCTCCT CTGTGCCGTG GTCACGGC-CG TCATCTGGAG GCAGCCCGAG 1800 

AGCAAGACCA AGCTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT CCTGAGCATC 1860 

TTCGTGAACG TCTATCTCAT GATGCAGCTG GACCAGGGCA CCTGGGTCCG GTTTGCTGTG 1920 

TGGATGCTGA TAGGCTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAG CGAGGAGGCG 1980 

TCCCTGGATG CCGACCAAGC AAGGACTCCT GACGGCAACT TGGACCAGTG CAAGTGACGC 2040 

ACAGCCCCGC CCCCCGGAGG TGGCAGCAGC CCCGAGGGAC GCCCCCAGAG GACCGGGAGG 2100 
CACCCCACCC TCCCCACCAG TGCAACAGAA ACCACCTGCG TCCACACCCT CACTGCA 

Seq ID NO: 136 Protein sequence 
MGCKVLLNIG QQMLRRKVVD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE

NAGPAIVISF LIAALASVLA GLCYGEFGAR VPKTGSAYLY SYVTVGELWA FITGWNLILS 

YIIGTSSVAR AWSATFDELI GRPIGEFSRT HMTLNAPGVL AENPDIFAVI IILILTGLLT 

LGVKESAMVN KIFTCINVLV LGFIMVSGFV KGSVKNWQLT EEDFGNTSGR LCLNNDTKEG

KPGVGGFMPF GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIPVGI VASLLICFIA 

YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GWEGAKYAVA VGSLCALSAS LLGSMFPMPR

VIYAMAEDGL LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD LMSIGTLLAY 

SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADQNELAS TNDSQLGFLP EAEMFSLKTI 

LSPKNMEPSK ISGLIVNIST SLIAVLIITF CIVTVLGREA LTKGALWAVF LLAGSALLCA 

VVTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ LDQGTWVRFA VWMLIGFIIY
FGYGLWHSEE ASLDADQART PDGNLDQCK 



I i i 1 r r r 

AAGATATAAA AGCTCCAGAA ACGTTGACTG GGACCACTGG AGACACTGAA GAAGGCAGGG 60 

GCCCTTAGAG TCTTGGTTGC CAAACAGATT TGCAGATCAA GGAGAACCCA GGAGTTTCAA 120

AGAAGCGCTA GTAAGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCAGG GTAGGAGGAA 180

GATGGCTTCC AGAAGCATGC GGCTGCTCCT ATTGCTGAGC TGCCTGGCCA AAACAGGAGT 240 

CCTGGGTGAT ATCATCATGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 

TTGCTATGGT TACTTCAGGA AGCTGAGGAA CTGGTCTGAT GCCGAGCTCG AGTGTCAGTC 360 

TTACGGAAAC GGAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA GCACCATAGC 420

AGAGTACATA AGTGGCTATC AGAGAAGCCA GCCGATATGG ATTGGCCTGC ACGACCCACA 480 

GAAGAGGCAG CAGTGGCAGT GGATTGATGG GGCCATGTAT CTGTACAGAT CCTGGTCTGG 540 

CAAGTCCATG GGTGGGAACA AGCACTGTGC TGAGATGAGC TCCAATAACA ACTTTTTAAC 600 

TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTCCTG TGCAAGTACC GACCATAGAG 660 

CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 720 

GCTAAATCTG CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 780 

ACTACACTGG CTTTTTTAGG CTTAGAGACA GAAACTTTAG CATTGGCCCA GTAGTGGCTT 840

CTAGCTCTAA ATGTTTGCCC CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 900 

GTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 960 

CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAAGCTTCT 1020 

AGACCCCTTC AGCTTCTACA CCCTTCTGCC CTCTCTCCAT TGCCTGCACC CCACCCCAGC 1080 

CACTCAACTC CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 1140 
TAGGTTGATG TGGGCCATAC ATTCCTTTAA TAAACCATTG TGTACATAAG AAAAAAAAAA 

Seq ID NO: 138 Protein sequence
Protein Accession #: ""



MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP



1 11 21 31 41 51 

I I I I I I 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 

GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 

GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 

CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 

AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 

TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 

GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 

GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480 

TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 

CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 

CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080 

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680 

CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 
SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSHLR MKIIKNDFPS

HPLYLEGALT ESTHYQQYQP WTLQKGYTI HWDQTAFAEL AIWLINFNKG DWIRVGLCYP 

RGTTPSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYKDEDS GLLFLKLKAQ 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPK PKKLFGSQLK 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQWVIDGNQ GRWSHTSFR 

NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGFWTR VLEKLGADRG LKLKEQMAFV 

GFKGSFRPIH VTLDTEDHKA KIFQWPIPV VKKKKL 



Seq ID NO: 141 DNA sequence
Nucleic Acid Accessn 
Coding sequence: 261 



i r i i i 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT QAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATCGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

CCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 
ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520

AGCCCCGGGA GCCGGCCA TC TCTGCCAGAT ACASCCCTCA CCAGGACGCC GACCCGCTGA 2580 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTOACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120

TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300 

AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360 

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420 

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840 


TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960 

GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG
TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 




AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAQAGCCAAT ATCTAGGCAT 4260

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320 

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380 

TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 
TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA A 
AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 142 Protein sequence



21 31 41 51 

I I I I 

L TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 

A TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 

KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 

IQATTMDGVN LSTEVVYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN

STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 

KDVVGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 

LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 

IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 
FSPPCRCLLK LVTTGSPFAH VSLAHS 

Seq ID NO: 143 DNA sequence
Nucleic Acid Accession #: NM_006017
Coding sequence: 38..2635

I I I I I I

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 
GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATCC 
TCCTAAGGCT TGGAATTATG AATTCCCTCC AACAAATTAT GAGACCCAAG ACTCCCATAA 
ACCTCCACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 
GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT
TGATTATGAC AAGCCAGAAA CTGTAATCTT AGGTCTAAAG ATTGTCTACT ATGAAGCAGG 
GATTATTCTA TGCTGTGTCC TGGGGCTGCT GTTTATTATT CTGATGCCTC TGGTGGGGTA 
TTTCTTTTGT ATGTGTCGTT GCTGTAACAA ATGTGGTGGA GAAATGCACC AGCGACAGAA 
GGAAAATGGG CCCTTCCTGA GGAAATGCTT TGCAATCTCC CTGTTGGTGA TTTGTATAAT 
AATAAGCATT GGCATCTTCT ATGGTTTTGT GGCAAATCAC CAGGTAAGAA CCCGGATCAA 
AAGGAGTCGG AAACTGGCAG ATAGCAATTT CAAGGACTTG CGAACTCTCT TGAATGAAAC 
TCCAGAGCAA ATCAAWATA TATTGGCCCA GTACAACACT ACCAAGGACA AGGCGTTCAC 
AGATCTGAAC AGTATCAATT CAGTGCTAGG AGGCGGAATT CTTGACCGAC TGAGACCCAA
CATCATCCCT GTTCTTGATG AGATTAAGTC CATGGCAACA GCGATCAAGG AGACCAAAGA
GGCGTTGGAG AACATGAACA GCACCTTGAA GAGCTTCCAC CAACAAAGTA CACAGCTTAG
CAGCAGTCTG ACCAGCGTGA AAACTAGCCT GCGGTCATCT CTCAATGACC C 
GGTGCATCCA TCAAGTGAAA CCTGCAACAG CATCAGATTG TCTCTAAGCC A 
CAACCCTGAA CTGAGGCAGC TTCCACCCGT GGATGCAGAA CTTGACAACG TTAATAACGT 
TCTTAGGACA GATTTGGATG GCCTGGTCCA ACAGGGCTAT CAATCCCTTA ATGATATACC 
TGACAGAGTA CAACGCCAAA CCACGACTGT CGTAGCAGGT ATCAAAAGGG TCTTGAATTC 
CATTGGTTCA GATATCGACA ATGTAACTCA GCGTCTTCCT ATTCAGGATA TACTCTCAGC 
ATTCTCTGTT TATGTTAATA ACACTGAAAG TTACATCCAC AGAAATTTAC CTACATTGGA 
AGAGTATGAT TCATACTGGT GGCTGGGTGG CCTGGTCATC TGCTCTCTGC TGACCCTCAT 
CGTGATTTTT TACTACCTGG GCTTACTGTG TGGCGTGTGC GGCTATGACA GGCATGCCAC 
CCCGACCACC CGAGGCTGTG TCTCCAACAC CGGAGGCGTC TTCCTCATGG TTGGAGTTGG 
ATTAAGTTTC CTCTTTTGCT GGATATTGAT GATCATTGTG GTTCTTACCT TTGTCTTTGG 
TGCAAATGTG GAAAAACTGA TCTGTGAACC TTACACGAGC AAGGAATTAT TCCGGGTTTT 
GGATACACCC TACTTACTAA ATGAAGACTG GGAATACTAT CTCTCTGGGA AGCTATTTAA 
TAAATCAAAA ATGAAGCTCA CTTTTGAACA AGTTTACAGT GACTGCAAAA AAAATAGAGG 
CACTTACGGC ACTCTTCACC TGCAGAACAG CTTCAATATC AGTGAACATC TCAACATTAA 
TGAGCATACT GGAAGCATAA GCAGTGAATT GGAAAGTCTG AAGGTAAATC TTAATATCTT 1860 
TCTGTTGGGT GCAGCAGGAA GAAAAAACCT TCAGGATTTT GCTGCTTGTG GAATAGACAG 1920 
AATGAATTAT GACAGCTACT TGGCTCAGAC TGGTAAATCC CCCGCAGGAG TGAATCTTTT 
ATCATTTGCA TATGATCTAG AAGCAAAAGC AAACAGTTTG CCCCCAGGAA ATTTGAGGAA 
CTCCCTGAAA AGAGATGCAC AAACTATTAA AACAATTCAC CAGCAACGAG TCCTTCCTAT 2100 
AGAACAATCA CTGAGCACTC TATACCAAAG CGTCAAGATA CTTCAACGCA CAGGGAATGG 2160 
ATTGTTGGAG AGAGTAACTA GGATTCTAGC TTCTCTGGAT TTTGCTCAGA ACTTCATCAC 2220 
AAACAATACT TCCTCTGTTA TTATTGAGGA AACTAAGAAG TATGGGAGAA CAATAATAGG 2280 
ATATTTTGAA CATTATCTGC AGTGGATCGA GTTCTCTATC AGTGAGAAAG TGGCATCGTG 2340 
CAAACCTGTG GCCACCGCTC TAGATACTGC TGTTGATGTC TTTCTGTGTA GCTACATTAT 2400 
CGACCCCTTG AATTTGTTTT GGTTTGGCAT AGGAAAAGCT ACTGTATTTT TACTTCCGGC 2460 
TCTAATTTTT GCGGTAAAAC TGGCTAAGTA CTATCGTCGA ATGGATTCGG AGGACGTGTA 2520 
CGATGATGTT GAAACTATAC CCATGAAAAA TATGGAAAAT GGTAATAATG GTTATCATAA 2580 
AGATCATGTA TATGGTATTC ACAATCCTGT TATGACAAGC CCATCACAAC ATTGATAGCT 2640 
GATGTTGAAA CTGCTTGAGC ATCAGGATAC TCAAAGTGGA AAGGATCACA GATTTTTGGT 2700 
AGTTTCTGGG TCTACAAGGA CTTTCCAAAT CCAGGAGCAA CGCCAGTGGC A 
CTCAGGCGGG CACCAAGGCA ACGGCACCAT TGGTCTCTGG GTAGTGCTTT A 
T TATAGTCCAT GGTCCATCAC TATTCAAGGA TGACTCCCTC CCTTCCTGTC 2880 

TATTTTTGTT TTTTACTTTT TTACACTGAG TTTCTATTTA GACACTACAA CATATGGGGT 2940 

GTTTGTTCCC ATTGGATGCA TTTCTATCAA AACTCTATCA AATGTGATGG CTAGATTCTA 3000 

ACATATTGCC ATGTGTGGAG TGTGCTGAAC ACACACCAGT TTACAGGAAA GATGCATTTT 3060 

TTATAGGACT TTCTTCTAAA TGAGCTAAAT AAGTCACCAT TGACTTCTTG GTGCTGTTGA 3180 

AAATAATCCA TTTTCACTAA AAGTGTGTGA AACCTACAGC ATATTCTTCA CGCAGAGATT 3240 

TTCATCTATT ATACTTTATC AAAGATTGGC CATGTTCCAC TTGGAAATGG CATGCAAAAG 3300 

CCATCATAGA GAAACCTGCG TAACTCCATC TGACAAATTC AAAAGAGAGA GAGAGATCTT 3360 

GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG AGTTGTTTTA ACAGATGCCA ATTACGGTGT 3420 

ACAGTTTAAC AGAGTTTTCT GTTGCATTAG GATAAACATT AATTGGAGTG CAGCTAACAT 3480 

GAGTATCATC AGACTAGTAT CAAGTGTTCT AAAATGAAAT ATGAGAAGAT CCTGTCACAA 3540 

TTCTTAGATC TGGTGTCCAG CATGGATGAA ACCTTTGAGT TTGGTCCCTA AATTTGCATG 3600 

AAAGCACAAG GTAAATATTC ATTTGCTTCA GGAGTTTCAT GTTGGATCTG TCATTATCAA 3660 

AAGTGATCAG CAATGAAGAA CTGGTCGGAC AAAATTTAAC GTTGATGTAA TGGAATTCCA 3720 

GATGTAGGCA TTCCCCCCAG GTCTTTTCAT GTGCAGATTG CAGTTCTGAT TCATTTGAAT 3780 
AAAAAGGAAC TTGG 

Seq ID NO: 144 Protein sequence 



1 11 21 31 41 51 

I I I I I I 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 
IFLYVVQPRD FPEDTLRKFL QKAYESKIDY DKPETVILGL KIVYYEAGII LCCVLGLLFI 
ILMPLVGYFF CMCRCCNKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIFYGFVAN 
HQVRTRIKRS RKLADSNFKD LRTLLNETPE QIKYILAQYN TTKDKAFTDL NSINSVLGGG 
ILDRLRPNII PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS 
SLNDPLCLVH PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG 
YQSLNDIPDR VQRQTTTVVA GIKRVLNSIG SDIDNVTQRL PIQDILSAFS VYVNNTESYI 
HRNLPTLEEY DSYWWLGGLV ICSLLTLIVI FYYLGLLCGV CGYDRHATPT TRGCVSNTGG 
VFLMVGVGLS FLFCWILMII VVLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWEY 
YLSGKLFNKS KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES 
LKVNLNIFLL GAAGRKNLQD FAACGIDRMN YDSYLAQTGK SPAGVNLLSF A 
LPPGNLRNSL KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL E 
DFAQNFITNN TSSVIIEETK KYGRTIIGYF EHYLQWIEFS ISEKVASCKP V 
VFLCSYIIDP LNLFWFGIGK ATVFLLPALI FAVKLAKYYR RMDSEDVYDD V 
NGNNGYHKDH VYGIHNPVMT SPSQH 



Seq ID NO: 145 DNA sequence 




I I I I I I 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 
G GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 
T TGGAATTATG AATTGCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 
AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG T 
GCGTGATTTC CCAGAAGATA C 

TGATTATGAC ATTGTCTACT ATGAAGCAGG GATTATTCTA T( 

GTTTATTATT CTGATGCCTC TGGTGGGGTA TTTCTTTTGT ATGTGTCGTT GCTGTAACAA 
ATGTGGTGGA GAAATGCACC A 

A AAGGAGTCGG AAACTGGCAG ATAGCAATTT 600 

A ATCAAATATA TATTGGCCCA 660 

C AGTATCAATT CAGTGCTAGG 720 

A CATCATCCCT GTTCTTGATG AGATTAAGTC 780 

3 AGACCAAAGA GGCGTTGGAG AACATGAACA GCACCTTGAA 840 

3 CAGCAGTCTG ACCAGGGTGA AAACTAGCCT 900 

GCGGTCATCT CTCAATGACC CTCTGTGCTT GGTGCATCCA TCAAGTGAAA CCTGCAACAG 960 

CATCAGATTG TCTCTAAGCC AGCTGAATAG CAACCCTGAA CTGAGGCAGC TTCCACCCGT 1020 

GGATGCAGAA CTTGACAACG TTAATAACGT TCTTAGGACA GATTTGGATG GCCTGGTCCA 1080 

ACAGGGCTAT CAATCCCTTA ATGATATACC TGACAGAGTA CAACGCCAAA CCACGACTGT 1140 

CGTAGCAGGT ATCAAAAGGG TCTTGAATTC CATTGGTTCA GATATCGACA ATGTAACTCA 1200 

GCGTCTTCCT ATTCAGGATA TACTCTCAGC ATTCTCTGTT TATGTTAATA ACACTGAAAG 1260 

TTACATCCAC AGAAATTTAC CTACATTGGA AGAGTATGAT TCATACTGGT GGCTGGGTGG 1320 

CCTGGTCATC TGCTCTCTGC TGACCCTCAT CGTGATTTTT TACTACCTGG GCTTACTGTG 1380 

TGGCGTCTGC GGCTATGACA GGCATGCCAC CCCGACCACC CGAGGCTGTG TCTCCAACAC 1440 

CGGAGGCGTC TTCCTCATGG TTGGAGTTGG ATTAAGTTTC CTCTTTTGCT GGATATTGAT 1500 

GATCATTGTG GTTCTTACCT TTGTCTTTGG TGCAAATGTG GAAAAACTGA TCTGTGAACC 1560 

TTACACGAGC AAGGAATTAT TCCGGGTTTT GGATACACCC TACTTACTAA ATGAAGACTG 1620 

GGAATACTAT CTCTCTGGGA AGCTATTTAA TAAATCAAAA ATGAAGCTCA CTTTTGAACA 1680 

AGTTTACAGT GACTGCAAAA AAAATAGAGG CACTTACGGC ACTCTTCACC TGCAGAACAG 1740 

CTTCAATATC AGTGAACATC TCAACATTAA TGAGCATACT GGAAGCATAA GCAGTGAATT 1800 

GGAAAGTCTG AAGGTAAATC TTAATATCTT TCTGTTGGGT GCAGCAGGAA GAAAAAACCT 1860 

TCAGGATTTT GCTGCTTGTG GAATAGACAG AATGAATTAT GACAGCTACT TGGCTCAGAC 1920 

TGGTAAATCC CCCGCAGGAG TGAATCTTTT ATCATTTGCA TATGATCTAG AAGCAAAAGC 1980 
AAACAGTTTG CCCCCAGGAA ATTTGAGGAA C 
AACAATTCAC CAGCAACGAG TCCTTCCTAT A 

CGTCAAGATA CTTCAACGCA CAGGGAATGG ATTGTTGGAG AGAGTAACTA G 

TTCTCTGGAT TTTGCTCAGA ACTTCATCAC AAACAATACT TCCTCTGTTA TTATTGAGGA 2220 

AACTAAGAAG TATGGGAGAA CAATAATAGG ATATTTTGAA CATTATCTGC AGTGGATCGA 2280 

GTTCTCTATC AGTGAGAAAG TGGCATCGTG CAAACCTGTG GCCACCGCTC TAGATACTGC 2340 
C TTTCTGTGTA GCTACATTAT CGACCCCTTG AATTTGTTTT GGTT 
AGGAAAAGCT ACTGTATTTT TACTTCCGGC TCTAATTTTT G 
CTATCGTCGA ATGGATTCGG AGGACGTGTA CGATGATGTT GAAACTATAC CCATGAAAAA 
TATGGAAAAT GGTAATAATG GTTATCATAA AGATCATGTA TATGGTATTC ACAATCCTGT 
TATGACAAGC CCATCACAAC ATTGATAGCT GATGTTGAAA CTGCTTGAGC ATCAGGATAC 
TCAAAGTGGA AAGGATCACA GATTTTTGGT AGTTTCTGGG TCTACAAGGA CTTTCCAAAT 
CCAGGAGCAA CGCCAGTGGC A 






TATTCAAGGA TGACTCCCTC CCTTCCTGTC TATTTTTGTT TTTTACTTTT TTACACTGAG 2880 

TTTCTATTTA GACACTACAA CATATGGGGT GTTTGTTCCC ATTGGATGCA TTTCTATCAA 2940 

AACTCTATCA AATGTGATGG CTAGATTCTA ACATATTGCC ATGTGTGGAG TGTGCTGAAC 3000 

ACACACCAGT TTACAGGAAA GATGCATTTT GTGTACAGTA AACGGTGTAT ATACCTTTTG 3060 

TTACCACAGA GTTTTTTAAA CAAATGAGTA TTATAGGACT TTCTTCTAAA TGAGCTAAAT 3120 

AAGTCACCAT TGACTTCTTG GTGCTGTTGA AAATAATCCA TTTTCACTAA AAGTGTGTGA 3180 

AACCTACAGC ATATTCTTCA CGCAGAGATT TTCATCTATT ATACTTTATC AAAGATTGGC 3240 

CATGTTCCAC TTGGAAATGG CATGCAAAAG CCATCATAGA GAAACCTGCG TAACTCCATC 3300 

TGACAAATTC AAAAGAGAGA GAGAGATCTT GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG 3360 

AGTTGTTTTA ACAGATGCCA ATTACGGTGT ACAGTTTAAC AGAGTTTTCT GTTGCATTAG 3420 

GATAAACATT AATTGGAGTG CAGCTAACAT GAGTATCATC AGACTAGTAT CAAGTGTTCT 3480 

AAAATGAAAT ATGAGAAGAT CCTGTCACAA TTCTTAGATC TGGTGTCCAG CATGGATGAA 3540 

ACCTTTGAGT TTGGTCCCTA AATTTGCATG AAAGCACAAG GTAAATATTC ATTTGCTTCA 3600 

GGAGTTTCAT GTTGGATCTG TCATTATCAA AAGTGATCAG CAATGAAGAA CTGGTCGGAC 3660 

AAAATTTAAC GTTGATGTAA TGGAATTCCA GATGTAGGCA TTCCCCCCAG GTCTTTTCAT 3720 
GTGCAGATTG CAGTTCTGAT TCATTTGAAT AAAAAGGAAC TTGG 

Seq ID NO: 146 Protein sequence 
Protein Accession #: 

1 11 21 31 41 51 

| | | | | | 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

IFLYVVQPRD FPEDTLRKFL QKAYESKIDY DIVYYEAGII LCCVLGLLFI ILMPLVGYFF 120 

CMCRCCNKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIFYGFVAN HQVRTRIKRS 180 

RKLADSNFKD LRTLLNETPE QIKYILAQYN TTKDKAFTDL NSINSVLGGG ILDRLRPNII 240 

PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS SLNDPLCLVH 300 

PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG YQSLNDIPDR 360 

VQRQTTTVVA GIKRVLNSIG SDIDNVTQRL PIQDILSAFS VYVNNTESYI HRNLPTLEEY 420 

DSYWWLGGLV ICSLLTLIVI FYYLGLLCGV CGYDRHATPT TRGCVSNTGG VFLMVGVGLS 480 

FLFCWILMII VVLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWEY YLSGKLFNKS 540 

KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES LKVNLNIFLL 600 

GAAGRKNLQD FAACGIDRMN YDSYLAQTGK SPAGVNLLSF AYDLEAKANS LPPGNLRNSL 660 

KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL ERVTRILASL DFAQNFITNN 720 

TSSVIIEETK KYGRTIIGYF EHYLQWIEFS ISEKVASCKP VATALDTAVD VFLCSYIIDP 780 

LNLFWFGIGK ATVFLLPALI FAVKLAKYYR RMDSEDVYDD VETIPMKNME NGNNGYHKDH 840 
VYGIHNPVMT SPSQH 

Seq ID NO: 147 DNA sequence 
Nucleic Acid Accession #: BC012089.1 
Coding sequence: 236..2806 

1 11 21 31 41 51 

| | | | | | 

ATGCTCTCAG CTCTCCCGCC GCGGGATGGT GCCTTGAGTG AATGACCCCC TTGGAGAACA 60 
TTCTTCCGCA TCCCTCGCCT CAAGCCAGCC TCAGACAGAA AACTGAAGAT T 
CAGTGCTTCC TGCTCCTCTT CTGCCCAGGA ACACGCTTGC CTTCCCCAAG G 

GCTCTGAGGC AGGAGGCACC AAGTTCTACC TCATGTTTGG AGGATCTTGC TAGCTATGGC 
CCTCGTACTC GGCTCCCTGT TGCTGCTGGG GCTGTGCGGG AACTCCTTTT CAGGAGGGCA 
GCCTTCATCC ACAGATGCTC CTAAGGCTTG GAATTATGAA TTGCCTGCAA CAAATTATGA 
GACCCAAGAC TCCCATAAAG CTGGACCCAT TGGCATTCTC TTTGAACTAG TGCATATCTT 
~ " " ^AGCCGC GTGATTTCCC AGAAGATACT TTGAGAf 

A TCCAAAATTG ATTATGACAA GATTGTCTAC TATGAAGCAG G 

ATGCTGTGTC CTGGGGCTGC TGTTTATTAT TCTGATGCCT CTGGTGGGGT ATTTCTTTTG 600 

TATGTGTCGT TGCTGTAACA AATGTGGTGG AGAAATGCAC CAGCGACAGA AGGAAAATGG 660 

GCCCTTCCTG AGGAAATGCT TTGCAATCTC CCTGTTGGTG ATTTGTATAA TAATAAGCAT 720 

TGGCATCTTC TATGGTTTTG TGGCAAATCA CCAGGTAAGA ACCCGGATCA AAAGGAGTCG 780 

GAAACTGGCA GATAGCAATT TCAAGGACTT GCGAACTCTC TTGAATGAAA CTCCAGAGCA 840 

AATCAAATAT ATATTGGCCC AGTACAACAC TACCAAGGAC AAGGCGTTCA CAGATCTGAA 900 

CAGTATCAAT TCAGTGCTAG GAGGCGGAAT TCTTGACCGA CTGAGACCCA ACATCATCCC 960 

TGTTCTTGAT GAGATTAAGT CCATGGCAAC AGCGATCAAG GAGACCAAAG AGGCGTTGGA 1020 

GAACATGAAC AGCACCTTGA AGAGCTTGCA CCAACAAAGT ACACAGCTTA GCAGCAGTCT 1080 

GACCAGCGTG AAAACTAGCC TGCGGTCATC TCTCAATGAC CCTCTGTGCT TGGTGCATCC 1140 

ATCAAGTGAA ACCTGCAACA GCATCAGATT GTCTCTAAGC CAGCTGAATA GCAACCCTGA 1200 

ACTGAGGCAG CTTCCACCCG TGGATGCAGA ACTTGACAAC GTTAATAACG TTCTTAGGAC 1260 

AGATTTGGAT GGCCTGGTCC AACAGGGCTA TCAATCCCTT AATGATATAC CTGACAGAGT 1320 

ACAACGCCAA ACCACGACTG TCGTAGCAGG TATCAAAAGG GTCTTGAATT CCATTGGTTC 1380 

AGATATCGAC AATGTAACTC AGCGTCTTCC TATTCAGGAT ATACTCTCAG CATTCTCTGT 1440 

TTATGTTAAT AACACTGAAA GTTACATCCA CAGAAATTTA CCTACATTGG AAGAGTATGA 1500 

TTCATACTGG TGGCTGGGTG GCCTGGTCAT CTGCTCTCTG CTGACCCTCA TCGTGATTTT 1560 

TTACTACCTG GGCTTACTGT GTGGCGTGTG CGGCTATGAC AGGCATGCCA CCCCGACCAC 1620 

CCGAGGCTGT GTCTCCAACA CCGGAGGCGT CTTCCTCATG GTTGGAGTTG GATTAAGTTT 1680 

CCTCTTTTGC TGGATATTGA TGATCATTGT GGTTCTTACC TTTGTCTTTG GTGCAAATGT 1740 

GGAAAAACTG ATCTGTGAAC CTTACACGAG CAAGGAATTA TTCCGGGTTT TGGATACACC 1800 

CTACTTACTA AATGAAGACT GGGAATACTA TCTCTCTGGG AAGCTATTTA ATAAATCAAA 1860 

AATGAAGCTC ACTTTTGAAC AAGTTTACAG TGACTGCAAA AAAAATAGAG GCACTTACGG 1920 
TGACAGCTAC TTGGCTCAGA CTGGTAAATC CCCCGCAGGA GTGAATCTTT TATCATTTGC 2160 

ATATGATCTA GAAGCAAAAG CAAACAGTTT GCCCCCAGGA AATTTGAGGA ACTCCCTGAA 2220 

AAGAGATGCA CAAACTATTA AAACAATTCA CCAGCAACGA GTCCTTCCTA TAGAACAATC 2280 

ACTGAGCACT CTATACCAAA GCGTCAAGAT ACTTCAACGC ACAGGGAATG GATTGTTGGA 2340 

GAGAGTAACT AGGATTCTAG CTTCTCTGGA TTTTGCTCAG AACTTCATCA CAAACAATAC 2400 

TTCCTCTGTT ATTATTGAGG AAACTAAGAA GTATGGGAGA ACAATAATAG GATATTTTGA 2460 

ACATTATCTG CAGTGGATCG AGTTCTCTAT CAGTGAGAAA GTGGCATCGT GCAAACCTGT 2520 

GGCCACCGCT CTAGATACTG CTGTTGATGT CTTTCTGTGT AGCTACATTA TCGACCCCTT 2580 

GAATTTGTTT TGGTTTGGCA TAGGAAAAGC TACTGTATTT TTACTTCCGG CTCTAATTTT 2640 

TGCGGTAAAA CTGGCTAAGT ACTATCGTCG AATGGATTCG GAGGACGTGT ACGATGATGT 2700 

TGAAACTATA CCCATGAAAA ATATGGAAAA TGGTAATAAT GGTTATCATA AAGATCATGT 2760 

ATATGGTATT CACAATCCTG TTATGACAAG CCCATCACAA CATTGATAGC TGATGTTGAA 2820 

ACTGCTTGAG CATCAGGATA CTCAAAGTGG AAAGGATCAC AGATTTTTGG TAGTTTCTGG 2880 

GTCTACAAGG ACTTTCCAAA TCCAGGAGCA ACGCCAGTGG CAACGTAGTG ACTCAGGCGG 2940 

GCACCAAGGC AACGGCACCA TTGGTCTCTG GGTAGTGCTT TAAGAATGAA CACAATCACG 3000 

TTATAGTCCA TGGTCCATCA CTATTCAAGG ATGACTCCCT CCCTTCCTGT CTATTTTTGT 3060 

TTTTTACTTT TTTACACTGA GTTTCTATTT AGACACTACA ACATATGGGG TGTTTGTTCC 3120 

CATTGGATGC ATTTCTATCA AAACTCTATC AAATGTGATG GCTAGATTCT AACATATTGC 3180 

CATGTGTGGA GTGTGCTGAA CACACACCAG TTTACAGGAA AGATGCATTT TGTGTACAGT 3240 

AAACGGTGTA TATACCTTTT GTTACCACAG AGTTTTTTAA ACAAATGAGT ATTATAGGAC 3300 

TTTCTTCTAA ATGAGCTAAA TAAGTCACCA TTGACTTCTT GGTGCTGTTG AAAATAATCC 3360 

ATTTTCACTA AAAGTGTGTG AAACCTACAG CATATTCTTC ACGCAGAGAT TTTCATCTAT 3420 

TATACTTTAT CAAAGATTGG CCATGTTCCA CTTGGAAATG GCATGCAAAA GCCATCATAG 3480 

AGAAACCTGC GTAACTCCAT CTGACAAATT CAAAAGAGAG AGAGAGATCT TGAGAGAGAA 3540 

ATGCTGTTCG TTCAAAAGTG GAGTTGTTTT AACAGATGCC AATTACGGTG TACAGTTTAA 3600 
CAGAGTTTTC TGTTGCATTA GGATAAACAT TAATTGGAGT GCAGCT? 
CAGACTAGTA TCAAGTGTTC TAAAATGAAA TATGAGAAGA TCCTGTCACA A 

CTGGTGTCCA GCATGGATGA AACCTTTGAG TTTGGTCCCT AAATTTGCAT GAAAGCACAA 3780 

GGTAAATATT CATTTGCTTC AGGAGTTTCA TGTTGGATCT GTCATTATCA AAAGTGATCA 3840 

iATTTAA CGTTGATGTA ATGGAATTCC AGATGTAGGC 3900 

A TGTGCAGATT GCAGTTCTGA TTCATTTGAA TAAAAAGGAA 3960 



Seq ID NO: 148 Protein sequence 



1 11 21 31 41 51 

I I I I I 

L LGLCGNSFSG GQPSSTDAPK AKNYELPATN YETQDSHKAG P 
IFLYVVQPRD FPEDTLRKFL QKAYESKIDY DKIVYYEAGI ILCCVLGLLF IILMPLVGYF 
FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 
SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 
IPVLDEIKSM ATAIKETKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 
HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 
RVQRQTTTVV AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 
I GL VICSLLTLIV IFYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 
SFLFCWILMI IVVLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 
SKMKLTFEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 
LGAAGRKNLQ DFAACGIDRM NYDSYLAQTG KSPAGVNLLS FAYDLEAKAN SLPPGNLRNS 
LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 
NTSSVIIEET KKYGRTIIGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 
PLNLFWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM E 



Seq ID NO: 149 DNA sequence 



CCACGCGTCC GAGCAAGAAC AGCTAAAATG AAAGCCATCA TTCATCTTAC TCTTCTTGCT 
CTCCTTTCTG TAAACACAGC CACCAACCAA GGCAACTCAG CTGATGCTGT AACAACCACA 
GAAACTGCGA CTAGTGGTCC TACAGTAGCT GCAGCTGATA CCACTGAAAC TAATTTCCCT 
GAAACTGCTA GCACCACAGC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC 
CCCATAATTA GTACACATAG TTCCTCCACA ATTCCTACAC CTGCTCCCCC C 
ACACATAGTT CCTCCACAAT TCCTATACCT ACTGCTGCAG ACAGTGf 
GTAAATTCAT TAGCTACCTC TGACATAATC ACCGCTTCAT CTCCAAATGA T 
ACAATGGTTC CTTCTGAAAC ACAAAGTAAC AATGAAATGT CCCCCAC 
CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 
GGTCCCAGCA ATCCTTGCCA AGATGATCCC TGTGCAGATA ATTCGTTATG TGTTAAGCTG 
CATAATACAA GTTTTTGCCT GTGTTTAGAA GGGTATTACT ACAACTCTTC TACATGTAAG 
AAAGGAAAGG TATTCCCTGG GAAGATTTCA GTGACAGTAT CAGAAACATT TGACCCAGAA 
GAGAAACATT CCATGGCCTA TCAAGACTTG CATAGTGAAA TTACTAGCTT GTTTAAAGAT 
GTATTTGGCA CATCTGTTTA TGGACAGACT GTAATTCTTA CTGTAAGCAC ATCTCTGTCA 



AGTAGCTCAA GCAACTTTCT AAACTATGAT TTGACCCTTC GGTGTGATTA TTATGGCTGT 
AACCAGACTG CGGATGACTG CCTCAATGGT TTAGCATGCG ATTGCAAATC TGACCTGCAA 
AGGCCTAACC CACAGAGCCC TTTCTGCGTT GCTTCCAGTC TCAAGTGTCC TGATGCCTGC 
AACGCACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCCCTGA GTGTGCGTGC 
GTGCCCGGCT ACCAGGAAGA TGCTAATGGG AACTGCCAAA AGTGTGCATT TGGCTACAGT 
GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGGC 1320 

ATTGTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACG 1380 

AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 1440 

ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC 1500 

TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAGACACA GCAGCATGCC CCGCCCTGAC 1560 

TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 1620 

TTAGAGTGTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 1680 

TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AGTTGTGAAT GTTTGCAACG 1740 

AATCCAGCTC ACTTGCTAAA TAAGAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGCG 1800 

CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG 1860 

TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 1920 

CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 1980 

GGAGACATAC GCATCAACTG TGGAGGTCCG AGGGGATGAG AAGGGATACC CACCACCTTT 2040 

CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA 2100 

ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA 2160 

AGACTGGAAA GAGACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC 2220 

TTTTTGATGC TAATTAGAAG GCCTGGACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 2280 

TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 2340 

AGAGAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC 2400 

CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCAC TTCTCTAATC ATGCCCTCCC 2460 

GAAGACTGGC TGGGAGAAGG TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTAGGTCAG 2520 

TTTGAAATTG GAGACAAACT GTCTGGCAAA GGGTGCGAGA GGGAGCTTGT GCTCAGGAGT 2580 

CCAGCCGTCC AGCCTCGGGG TGTAGGTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 2640 

CTGGTGACAG AGGCTCAGCT GTGGCCACCA ACACACAACC ACACACACAC AACCACACAC 2700 

ACAAATGGGG GCAACCACAT CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 2760 

ATTTCTAATG CCTTGTCCTC TTAAAAGTTA TTTTATTTGT TATTATTATT TGTTCTTGAC 2820 

TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA AAAAAAAAAA 2880 

Seq ID NO: 150 Protein sequence 
Protein Accession #: NP_149038.1 

1 11 21 31 41 51 

I I I I I I 

MKAIIHLTLL ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADHSLCVK LHNTSFCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 

LHSEITSLFK DVFGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI LAETTSDNEK 300 

TVTEKINKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360 

VASSLKCPDA CNAQHKQCLI KKSGGAPECA CVPGYQEDAN GNCQKCAFGY SGLECKDKFQ 420 

LILTIVGTIA GIVILSMIIA LIVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 480 




I 

3 CTCTTCTCAG CTGCGCGAGC 60 

AGCTGCTCCA ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG 120 

CTGGTAGCTC GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG 180 

AGTTCTACAA CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT 240 

TGTTTCGTCA GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG 300 

TTGTAGTGGG AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC 360 

TTGATGAACT TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA 420 

GGTATCTACC AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC 480 

'T TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA 540 

G AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA 600 

T TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT 660 

GGATCAGTGA CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT 720 

GCATGTGGCA CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT 780 

TTGATGCTGC CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA 840 

AATGGGCCTT CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG 900 

TCAAGATCAC GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT 960 

GGGCTTCCTT TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT 1020 

TTTCAAAAAT CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAGGGATT 1080 

T CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTCCCA TTTGTCTTTC 1140 

A TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA 1200 

TCTTTTCTTT AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT 1260 

AATGTGTCTG TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC 1320 

CAAAGATGGA AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA 1380 

GATGCAGTGA CTTGTTGGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA 1440 

TCCAGCCCTG TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT 1500 

TCCCAGGCGC CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC 1560 

CCTCTGACTC TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA 1620 

AAACCAAACA GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC 1680 

ATTTCTCAAC TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTC 1740 

GAGAGATGGA GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA 1800 

CCTACACTCC ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA 1860 

TTTTTAAATA CTGACATCAC TTCCTCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT 1920 

CTACAGAGAA GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG 1980 

ACCTCTGTAA GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT 2040 
TGGAGCAAAC ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC 2100 

TATTCTGGAA ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA 2160 

TCATGTTCTT CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC 2220 

TCACCTTTCT CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG 2280 

CAGCAGCGCA AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT 2340 

GAATCTTCTC TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG 2400 

GGAATTATCA ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC 2460 

CATTGTTGGG AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC 2520 

CTACATATGT AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA 2580 

GTTAGAATTT GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA 2640 

AAAATGAATG ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA 2700 

GAACGCTGTT TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT 2760 

TATTAAATTT ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA 2820 

AATCCAAATT AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA 2880 

TATGTTGGTT TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG 2940 

GTCTTGCTCT ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT 3000 

GTCTCCCAGG CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG 3060 

CGTGCCTCCA CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT 3120 

GCCCAGGCTG GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT 3180 

GCTGGGAATA GGCATGAGCC GTCACGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG 3240 

AATTAAATAT TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA 3300 

" GCTG r ttagaTTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA 3420 



T GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA 



A AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA T, 

?GCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT 3660 

ACCCATCTTA TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC 3720 

A ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT 3780 

5TTAATA TTTTGGTTTG GTCTCCTCTT TTTTTCCCCC 3840 

. CCTTTCA GAGTTGACAT TTATCCTGTA ' GCTTGAATGG 3900 

CATGTAAATG CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGTTA 3960 

GCTATATGCT TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA 4020 

ATCAAGCCTA ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG 4080 

CCTCTGCACT TGGCTTGGTG AGTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT 4140 

TTTGATATGT TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA 4200 
GAATCTCCCT CC 

Seq ID NO: 152 Protein sequence 



1 11 21 31 41 51 

I I I I I I 

MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LFFILPPICM CLFRQYATCF 

NSGIYLIWTL LVVVGIGSVY FHATLSFLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 

DRGEFKVVVS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVFKLGLF 

SGLWWTLALF CWISDRAFCE LLSSFNFPYL HCMWHILICL AAYLGCVCFA YFDAASEIPE 

QGPVIKFWPN EKWAFIGVPY VSLLCANKKS SVKIT 



Seq ID NO: 153 DNA sequence 
Nucleic Acid Accession #: NM_001432.1 

1 11 21 31 41 51 

| | | | | | 

TCACTTGCCT GATATTTCCA GTGTCAGAGG GACACAGCCA ACGTGGGGTC CCTTCTAGGC 
TGACAGCCGC TCTCCAGCCA CTGCCGCGAG CCCGTCTGCT CCCGCCCTGC CCGTGCACTC 
TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG CCCGCTCCCA TCGCCGATGA CCGCGGGGAG 
GAGGATGGAG ATGCTCTGTG CCGGCAGGGT CCCTGCGCTG CTGCTCTGCC TGGGTTTCCA 

TCTTCTACAG GCAGTCCTCA GTACAACTGT GATTCCATCA TGTATCCCAG GAGAGTCCAG 
TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA CGTGTGGCTC AAGTGTCAAT 
AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT GGACAGTGCA TCTATCTGGT 
GGACATGAGT CAAAACTACT GCAGGTGTGA AGTGGGTTAT ACTGGTGTCC GATGTGAACA 
CTTCTTTTTA ACCGTCCACC AACCTTTAAG CAAAGAGTAT GTGGCTTTGA CCGTGATTCT 

TATTATTTTG TTTCTTATCA CAGTCGTCGG TTCCACATAT TATTTCTGCA GATGGTACAG 
AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA GTTACCTCAG GGGATCCAGA 
GTTGCCGCAA GTCTGAATGG CGCCATCAAA CTTATGGGCA GGGATAACAG TGTGCCTGGT 
TAATATTAAT ATTCCATTTT ATTAATAATA TTTATGTTGG GTCAAGTGTT AGGTCAATAA 
CACTGTATTT TAATGTACTT GAAAAATGTT TTTATTTTTG TTTTATTTTT GACAGACTAT 



C CTGAGCTAAA TGCTTCATTG AAAGCTTCAA AGTTTATATG CCTGGTGCAC 960 

AGTGCTTAGA AGTAAGCAAT TCCCAGGTCA TAGCTCAAGA ATTGTTAGCA AATGACAGAT 1020 

TTCTGTAAGC CTATATATAT AGTCAAATCG ATTTAGTAAG TATGTTTTTT ATC-TTCCTCA 1080 

AATCAGTGAT AATTGGTTTG ACTGTACCAT GGTTTGATAT GTAGTTGGCA CCATGGTATC 1140 

ATATATTAAA ACAATAATGC AATTAGAATT TGGGAGAAGC AAATATAGGT CCTGTGTTAA 1200 

ACACTACACA TTTGAAACAA GCTAACCCTG GGGAGTCTAT GGTCTCTTCA CTCAGGTCTC 1260 

AGCTATAATT CTGTTATATG AGGGGCAGTG GACAGTTCCC TATGCCAACT CACGACTCCT 1320 

ACAGGTACTA GTCACTCATC TACCAGATTC TGCCTATGTA AAATGAATTG AAAAACAATT 1380 

TTCTGTAATC TTTTATTTAA GTAGTGGGCA TTTCATAGCT TCACAATGTT CCTTTTTTGT 1440 

ATATTACAAC ATTTATGTGA GGTAATTATT GCTCAACAGA CAATTAGAAA AAAGTCCACA 1500 

CTTGAAGCCT AAATTTGTGC TTTTTAAGAA TATTTTTAGA CTATTTCTTT TTATAGGGGC 1560 

TTTGCTGAAT TCTAACATTA AATCACAGCC CAAAATTTGA TGGACTAATT ATTATTTTAA 1620 

AATATATGAA GACAATAATT CTACATGTTG TCTTAAGATG GAAATACAGT TATTTCATCT 1680 

TTTATTCAAG GAAGTTTTAA CTTTAATACA GCTCAGTAAA TGGCTTCTTC TAGAATGTAA 1740 
r TTAAAGTTGT ATCTTGACAC AGGAAATGGG AAAAAACTTA A 

TGGTGTATTT TTCCAAATGA AAAATCTCA& TTGAAAGCTT TTAAAATGTA GAAACTTAAA 1860 

CACACCTTCC TGTGGAGGCT GAGATGAAAA CTAGGGCTCA TTTTCCTGAC ATTTGTTTAT 1920 

TTTTTGGAAG AGACAAAGAT TTCTTCTGCA CTCTGAGCCC ATAGGTCTCA GAGAGTTAAT 1980 

AGGAGTATTT TTGGGCTATT GCATAAGGAG CCACTGCTGC CACCACTTTT GGATTTTATG 2040 

GGAGGCTCCT TCATCGAATG CTAAACCTTT GAGTAGAGTC TCCCTGGATC ACATACCAGG 2100 

TCAGGGAGGA TCTGTTCTTC CTCTACGTTT ATCCTGGCAT GTGCTAGGGT AAACGAAGGC 2160 

C ATGGCTGACC TCTGGAGCAC CAGGTGCCAG GACTTGTCTC CATGTGTATC 2220 

:CCTGGT GCAATCACAC GACTGTCATC TAAAGTCCTG GCCCTGGCCC 2280 

TTACTATTAG GAAAATAAAC AGACAAAAAC AAGTAAATAT ATATGGTCCT ATACATATTG 2340 

TATATATATT CATATACAAA CATGTATGTA TACATGACCT TAATGGATCA TAGAATTGCA 2400 

GTCATTTGGT GCTCTGCTAA CCATTTATAT AAAACTTAAA AACAAGAGAA AAGAAAAATC 2460 

AATTAGATCT AAACAGTTAT TTCTGTTTCC TATTTAATAT AGCTGAAGTC AAAATATGTA 2520 

AGAACACATT TTAAATACTC TACTTACAGT TGGCCCTCTG TGGTTAGTTC CACATCTGTG 2580 

GATTCAACCA ACCAAGGACG GAAAATGCTT AAAAAATAAT ACAACAACAA CAAAAAATAC 2640 

ATTATAACAA CTATTTACTT TTTTTTTTTT CTTTTTGAGA TGGAGTCTCG CTCTGTTGCC 2700 

CAGGTTGGAG TGCAGTGGCA CGATCTCGGC TCACTGCAAC CTCACCTCCC GGGTTCAAGA 2760 

GATCCTCCTG CCTCAGCCTC CTGAGCAGCT GGGACTACAG GCGCATGCCA CCATGCCCAG 2820 

CTAATTTTTG TATTTTTAGT AGAGGCGGGG TTTCACCATG TTGGCCAGGA TGGTCTCAAT 2880 

CTCCTAACCT TGAGATCCAC CCTCCACAGC CTCCCAAACT GCTGGGATTA CAGGCGTGAG 2940 

CCACCGCACG TAGCATTTAC ATTAGGTATT ACAAGTAATG TAAAGATGAT TTAAGTATAC 3000 

AGGAGGATGT GAATAGGTTA TATGCAAGCA CTATGCCCTT TTATATAAGT GACTTGAACA 3060 

TCTGTGCCCG ATTTTAGTAT GTGCAGGGGG GCGATCTGGG AATCAGTCCC CTGTGGATAC 3120 

CAAGGTACAA CTGTATTTAT TAACGCTTAC TAGATGTGAG GAGAGTCTGA ATATTTTCAG 3180 

TGATCTTGGC TGTTTCAAAA AAATCTATTG ACTTTTCAAT AAATCAGCTG CAATCCATTT 3240 

ATTTCATTTA CAAAAGATTT ATTGTAAGCC TCTCAATCTT GGTTTTTCAG TTGATCTTAA 3300 

GCATGTCAAT TCATAAAAAC AAGTCATTTT TGTATTTTTC ATCTTTAAGA ATGCTTAAAA 3360 
AAGCTAATCC CTAAAATAGT TAGATCTTTG TAAATGCATA TTAAATAATA A 
CACATTAGTT TTTATGGGTG AAAATAAGAC AAAAATAATA GTTTTAG 
GAGTAAACAT AAAAACTGAT TTGCTCTCAG CTGATGTGTC CTGTACA 
TTAGTTCACA CTTAGTCTAA CTCCCCCATT TTACAGATTT C 
AAGGGGCTAT GCATATTCAA TGTATTGAGA ACCAAAGCAA C 
AATTTATGGT CTTCAACCAA GGCCACATAA TAACCCAGTT AACTTACTCT T 

ATATTAAGTT CTATAACTAG TACTCAAGGT TTAACCTTAA AATTAAGATT TCCTTAACCT 3780 

TAACCTTAAA ATTGATATTA TATTAAACAT ACATAATACA ATGTAACTCC ACTGTTCTCC 3840 

TGAATATTTT TTGCTCTAAT CTCTCTGCCG AAAGTCAAAG TGATGGGAGA ATTGGTATAC 3900 

TGGTATGACT ACGTCTTAAG TCAGATTTTT ATTTATGAGT CTTTGAGACT AAATTCAATC 3960 

ACCACCAGGT ATCAAATCAA CTTTTATGCA GCAAATATAT GATTCTAGTG TCTGACTTTT 4020 

GTTAAATTCA GTAATGCAGT TTTTAAAAAC CTGTATCTGA CCCACTTTGT AATTTTTGCT 4080 

CCAATATCCA TTCTGTAGAC TTTTGAAAAA AAAGTTTTTA ATTTGATGCC CAATATATTC 4140 

TGACCGTTAA AAAATTCTTG TTCATATGGG AGAAGGGGGA GTAATGACTT GTACAAACAG 4200 

TATTTCTGGT GTATATTTTA ATGTTTTTAA AAAGAGTAAT TTCATTTAAA TATCTGTTAT 4260 

TCAAATTTGA TGATGTTAAA TGTAATATAA TGTATTTTCT TTTTATTTTG CACTCTGTAA 4320 

TTGCACTTTT TAAGTTTGAA GAGCCATTTT GGTAAACGGT TTTTATTAAA GATGCTATGG 4380 

AACATAAAGT TGTATTGCAT GCAATTTAAA GTAACTTATT TGACTATGAA TATTATCGGA 4440 

TTACTGAATT GTATCAATTT GTTTGTGTTC AATATCAGCT TTGATAATTG TGTACCTTAA 4500 

GATATTGAAG GAGAAAATAG ATAATTTACA AGATATTATT AATTTTTATT TATTTTTCTT 4560 

GGGAATTGAA AAAAATTGAA ATAAATAAAA ATGCATTGAA CATCTTGCAT TCAAAATCTT 4620 



AQVSITKCSS DMNGYCLHGQ CIYLVDMSQN Y 



L ITVVGSTYYF CRWYRNRKSK EPKKEYERVT SGDPELPQV 

Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession #: NM_013282.2 



ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 180 

CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 240 

GAGGACGGCC ATACCCTCTT CGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 300 

GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 360 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCAC CCACGGCGAG 420 

GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 480 

GGGCTGTACA AGGTCAATGA GTACGTCGAT GCTCGGGACA CGAACATGGG GGCGTGGTTT 540 

GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGACGAGCC CTGCAGCTCC 600 

ACGTCCAGGC CGGCGCTGGA GGAGGACGTC ATTTACCACG TGAAATACGA CGACTACCCG 660 

GAGAACGGCG TGGTCCAGAT GAACTCCAGG GACGTCCGAG CGCGCGCCCG CACCATCATC 720 

AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC CGACAACCCC 780 

AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGCGAGAC CAGGACGGCG 840 

CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 900 

TTCGTGGACG AAGTCTTCAA GATTGAGCGG CCGGGTGAAG GGAGCCCCAT GGTTGACAAC 960 

CCCATGAGAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAACAGACTC 1020 

TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG ACCCCGACAA GCAGCTCATG 1080 

TGCGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAGCAGTGTT 1140 
G ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA G 

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260 

3 ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320 

ZATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380 

CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

3 GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500 

CAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560 

A GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 



AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CGGAGAAGGG GAAGTCCGGG 1800 

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920 

GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 



CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520 

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

T GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

3 GTTCTGGCCC CCAAGGTCAG 2880 

3 CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940 

3G CAGCCCGTGG CATGGOGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 



CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 



GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 
GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 
CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 



Seq ID NO: 156 Protein sequence



MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP GLQRLFYRGK QMEDGHTLFD 
YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAEIDSR 
PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQVVRVT RKAPSRDEPC SSTSRPALEE
DVIYHVKYDD YPENGVVQMN SRDVRARART IIKWQDLEVG QVVMLNYNPD NPKERGFWYD



RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNFFTYTG 
SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE GAEAKDWRSG KPVRWRWVK 
GGKNSKYAPA EGNRYDGIYK VVKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK
LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG
SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT



I I I I I I 

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGC ' ' 

GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGC 
TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 
CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT



GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG C

TGCTGCTGCC TCGGCGCTCG CTCGACAGCC CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 600 

ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 660 

TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCGAGC 720 

AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 780
GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT
TCTGTACCAT AGCGCTGCIC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 
GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 
GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 
TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 
AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTC1 CTTGGGTGGG 
TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 
ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 
TAATAAAAGT TGCAAAGGT 

Seq I 



Protein Accession #: NP_000747.1 



MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 
ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 
""""""" "' "™"'A LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 



I I I I I I 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCG~.~„ . 
TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 
TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 
GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 
AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 
CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 
TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 
TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 
TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 
CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 
CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 
ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720 
TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780
TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840
TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900
ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960
TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020
TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080
ATGAACACAG CTGGTCACAC ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140
GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 
AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 
GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320 
TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380
ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 
ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500
GTGGGTGTCG CTAGTACAGC AAAATTAAAT A 



Protein Accession #: NP_0 



MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL SEFELRLLSM 

FGLKQRPTPS RDAVVPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SFHHEESLEE

LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR INIYEIIKPA 
TANSKFPVTR LLDT 

Seq ID NO: 161 DNA sequence 

Nucleic Acid Accession #: NM_001999.2

Coding sequence: 1..8736 

1 11 21 31 41 51 

I I I I I I 

ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG C
CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC C 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT A 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 
GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 
TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 
GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 
TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 
GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 
GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG
TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC
AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC
ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 
CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 
GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC
AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960
TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 
TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 
AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 
AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200
CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 
CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380 
CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 
GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500
GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560
AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 
AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 
AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800
TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 
ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 
ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 
TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 
TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100
ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 
GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 
CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340
GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400
GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 
GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 
GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 
GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640 
AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700
ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760
GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 
CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 
TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940
TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000
CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 
ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 
AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 
GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 
GGGATGTGCA CTTATGCGAA GTCCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300

AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360
TCTCCTGACC TCTGTGGL'AG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 
TGCTTCGAAG CCTATCAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 
TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 
45 CAGTGT !ACT G( OA. I G vCA G G< TG rCA CATCCC GTGAGGACTG TGTGGATATT 3600 

AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660
ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 
GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 
GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 
GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900
ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960
ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 
GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 
GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200
AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260
AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 
GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 
CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500
GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 
GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620
TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680
TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 
AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800
AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 
CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 
AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 
CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 
TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100
GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 
GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 
CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280
TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340

CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 

AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 

TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940






GGTCAGGTGT GCAGAAATGG ACGTTGTTTT A 

AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 
G GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC C
H AGTAAAAAGC G 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG C 
CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300
AGCTTCTGCT TCACAAATTT TGAAAA1 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA C 
CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG C 

G CCCAGGCATT 



TACAACCTTG ACTACACTGG A 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900

TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 

CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 



GGCTACATCC AGCACTACCA GTGGAATCAG T 
AATGCCTGTG GCTCTGCTTC CTGCTAC 
TCGGGGTTCT CCTTCGACCA GTTCTCCAGT G 



CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 
\ CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 



\ TCCCTCTCTA CAAGAAGAAG 8640 
GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT T 



. ACCTAAATGT 

TCAAGATATC AGCAT ATGG C ACTAAATGCA C 



CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 

CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 

TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 

ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 



ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA G, 
TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020
CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 
CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA A 



Protein Accession #s NP_o 

1 11 21 31 41 51 

MGRRRRLCLQ LYFLWLGCVV LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA
PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 
DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC
ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 
TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN CINTVGSFEC 
RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 
RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE E
3 FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ
GPIITGLTIL NQTIDICKHH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 
NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI
CNAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSFKC ICKPGFVLAP M 
CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK K 
AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG I 



VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGLICI DSLKGTCWLN 900 

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020 

MDACCCAVGA AWGTECEECP KPGTKEYETL CPRGAGFANR GDVLTGRPFY KDINECKAFP 1080 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140 

CFEGYESGFM MMKNCMDIDG CERNPLLCRG GTCVNTEGSF QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGCT DIDECMIMNG GCDTQCTNSE 1260

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320 

TCIDVNECDL NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380

ASCLNIPGSF KCSCREGWIG NGIKCIDLDE CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1500 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 1620 

PPVNSTEYYT LCPGGEGFRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQCICHN 1920 

GFKASQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100 

SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAQNP LLCALRCMNT FGSYECTCPI 2280

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2400 

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460

NGQCINTMGS FRCFCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580

CGGKGICQNT PGSFSCECQR GFSLDATGLN CEDVDECDGN ERCQHGCQNI LGGYRCGCPQ 2640

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSFDQFSS AQIDVNECSS 2700

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760

CYECKINGYP KKDSRQKHSI HEPDPTAVEQ ISLESVDMDS PVNXKFNLSH LGSKEHILEL 2820

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLI3TAKK KLMPGTYTLE I1SIPLYKKK 2880 
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y

Seq ID NO: 163 DNA sequence

Nucleic Acid Accession #: NM_013372.1

Coding sequence: 63..617



C TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 
GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 
CGGCTGCTGA AGGGAAAAAG A 



GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480
CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540
AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 
CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 
AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720
AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 
ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 
CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900
CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 
GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020
GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080
CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140
ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200
TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 
TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG G1CCAGCAAA AGTCAGTAGG 1320 
GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC T 
TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG C 
ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 
ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACXA AACCAACTCC TCTGCTTTGT 
CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA T, 
AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 
GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 
TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 
RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 
GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT 
AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC 
CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA ITTACTGGTA GGCACTGTCC



TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 
AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 
TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 
AGTCCTATGT AATATGGAAA fli 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTGGTT T 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020
TAAATTAAAC CTATTCTTTC A 



MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSBQTQSP QQPGSBKEGR 
GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 
QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS



Nucleic Acid Accession #: CAT e 



I I I I I I 

GAATTGCATC GGACAGAGCT TCGCCATGGC CGAGTTGCGC GTGGTTGTGG CACTAACACT 
GCTACGTTTC CGCCTGAGCG TGGACCGAAC GCGCAAGGTG CGGCGGAAGC CGGAGCTCAT 
ACTGCGCACG GAGAACGGGC TCTGGCTCAA G 



I I I I I I 

ATGCCACCTC TGCCATCCAG AATGTCTCGG GGGCCCTCCT TGGTGACAGG CAGGATGCTG 60 

CCCATCACAG ACCGCCTGCT GCACCTCCTG GGGCTGGAGA AGACGGCGTT CCGCATATAC 120 

GCGGTGTCCA CCCTTCTCCT CTTCCTGCTC TTCTTCCTGT TCCGCCTGCT GCTGCGGTTC 180 

CTGAGGCTCT GCAGGAGCTT CTACATCACC TGCCGCCGGC TGCGCTGCTT CCCCCAGCCT 240 

CCCCGGCGCA ACTGGCTGCT GGGCCACCTG GGCATGTACC TTCCAAATGA GGCGGGCCTT 300

CAAGATGAGA AGAAGGTACT GGACAACATG CACCATGTAC TCTTGGTATG GATGGGACCT 360 

GTCCTGCCGC TGTTGGTTCT GGTGCACCCT GATTACATCA AACCCCTTTT GGGAGCCTCA 420

GCTGCCATCG CCCCCAAGGA TGACCTCTTC TATGGCTTCC TAAAACCTTG GCTAGGGGAT 480 

GGGCTGCTGC TCAGCAAAGG TGACAAGTGG AGCCGGCACC GTCGCCTGCT GACACCCGCC 540 

TTCCACTTTG ACATCCTGAA GCCTTACATG AAGATCTTCA ACCAGAGCGC TGACATTATG 600

CATGCTAAAT GGCGGCATCT GGCAGAGGGC TCAGCGGTCT CCCTTGATAT GTTTGAGCAT 660

ATCAGCCTCA TGACCCTGGA CAGTCTTCAG AAATGTGTCT TCAGCTACAA CAGCAACTGC 720 

CAAGAGAAGA TGAGTGATTA TATCTCCGCT ATCATTGAAC TGAGCGCTCT GTCTGTCCGG 780 

CGCCAGTATC GCTTGCACCA CTACCTCGAC TTCATTTACT ACCGCTCGGC GGATGGGCGG 840 

AGGTTCCGGC AGGCCTGTGA CATGGTGCAC CACTTCACCA CTGAAGTCAT CCAGGAACGG 900 

CGGCGGGCAC TGCGTCAGCA GGGGGCCGAG GCCTGGCTTA AGGCCAAGCA GGGGAAGACC 960 

TTGGACTTTA TTGATGTGCT GCTCCTGGCC AGGGATGAAG ATGGAAAGGA ACTGTCAGAC 1020

GAGGATATCC GAGCCGAAGC AGACACCTTC ATGTTTGAGG GTCACGACAC AACATCCAGT 1080 

GGGATCTCTT GGATGCTGTT CAATTTGGCA AAGTATCCGG AATACCAGGA GAAATGCCGA 1140 

GAAGAGATTC AGGAAGTCAT GAAAGGCCGG GAGCTGGAGG AGCTGGAGTG GGACGATCTG 1200

ACTCAGCTGC CCTTTACAAC TATGTGCATT AAGGAGAGCC TGCGCCAGTA CCCACCTGTC 1260

ACTCTTGTCT CTCGCCAATG CACGGAGGAC ATCAAGCTCC CAGATGGGCG CATCATCCCC 1320

AAAGGAATCA TCTGCTTGGT CAGCATCTAT GGAACCCACC ACAACCCCAC AGTGTGGCCT 1380 

GACTCCAAGG TGTACAACCC CTACCGCTTT GACCCGGACA ACCCACAGCA GCGCTCTCCA 1440
CTGGCCTATG TGCCCTTCTC TGCAGGACCC AGGAATTGCA TCGGACAGAG CTTCGCCATG 1500

GCCGAGTTGC GCGTGGTTGT GGCACTAACA CTGCTACGTT TCCGCCTGAG CGTGGACCGA 1560

ACGCGCAAGG TGCGGCGGAA GCCGGAGCTC ATACTGCGCA CGGAGAACGG GCTCTGGCTC 1620
AAGGTGGAGC CGCTGCCTCC GCGGGCCTGA 



FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MPPLPSRMSR GPSLVTGRML PITDRLLHLL GLEKTAFRIY AVSTLLLFLL FFLFRLLLRF

LRLCRSFYIT CRRLRCFPQP PRRNWLLGHL GMYLPNEAGL QDEKKVLDNM HHVLLVWMGP

VLPLLVLVHP DYIKPLLGAS AAIAPKDDLF YGFLKPWLGD GLLLSKGDKW SRHRRLLTPA

FHFDILKPYM KIFNQSADIM HAKWRHLAEG SAVSLDMFEH ISLMTLDSLQ KCVFSYNSNC 

QEKMSDYISA IIELSALSVR RQYRLHHYLD FIYYRSADGR RFRQACDMVH HFTTEVIQER 
RRALRQQGAE AWLKAKQGKT LDFIDVLLLA RDEDGKELSD EDIRAEADTF MFEGHDTTSS 

GISWMLFNLA KYPEYQEKCR EEIQEVMKGR ELEELEWDDL TQLPFTTMCI KESLRQYPPV 

TLVSRQCTED IKLPDGRIIP KGIICLVSIY GTHHNPTVWP DSKVYNPYRF DPDNPQQRSP 

LAYVPFSAGP RNCIGQSFAM AELRVVVALT LLRFRLSVDR TRKVRRKPEL ILRTENGLWL
KVEPLPPRA 



I I I I I 

A GGCAAGCTCC CCCTGTCAAA GCACCTTGGC CCATAAGAAG AAAAGGGGGA 60 

G TGATGAGCGC TTCCAGGCTT CAGGCTCAGA AGGCGCCCCC AGCTCTCCTG 120

TAACTCAGAG GCCAGTGTGA TGGGAGTTCC TCCACTCAGC ACACTTCCCC TGTAAACACG 180 

CCTGTGGTGG GCAAAAGGGC TTTGGAACGG TTGCTTGTCT TTTCTCTCCT GCGTAATTTC 240 

CACTTTGATT CATGATAATG TCGAACACGC ACAAAGCTCG GCTGGAACGC CGGGTCACTG 300

GCTCAACCAA CCGGTGGCGT TTGCCCAAAC AGCCTTTCTC TGGGGACCTG CTCTCACTTT 360 

CCCAGATGTG CAAGGCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCAGACAGGT 420 

TATGCATTTC ACAAATCCAG AAGTTTTTCT TTGAGAATTT CAAGAACAAG GACATCCAAA 480

GTGGGGAAGC AGATGTGATT CTCGAGTGCC TGGGCTTCAA ATGGGAGCTC CATCAGCCCC 540

GGCTTTTTCA GTCTGAGACC TTGGCCAAGC TCTACCTGAA AGCCCTGGCG CAGGGCACCA 600

CACACCCCCT GAGGGAGCTG GAGGAGCTTC TGCGAGCTCA ATCACCTAAG AAGACCAAAG 660

AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA CTGGTCACTA 720 

AAGTCGCCTT CGCCACGGCC CTGAAGAACC TCTACATGAG TGAGGTGGAG ATTAACTTGG 780

AAGACCTACT GGGAGTGCTG GCTTCCGCCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 840 

GGTGCGTGGA TGTGATGATA GCCAGACTCA AGCCAAGCAC CATCAAGAAA TTCTACGAGG 900 

CCGGCTGCAA GTACAAGGAA GAGCAGCTCA CCACCGGCTG CGAGAAGTGG CTGGAAATGA 960 

ACTTGGTTCC TCTAGGGGGG ACGCAGATCC ACCTCCACAA AATCCCACAG GACCTGCTCC 1020 

ACAAAGTGCT GAAGTCCCCC AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 1080

TGCTTTTGTG GGTCTTCTTG CAACTGAACT ACAAGATTCA GGCAATTCCG ACTTATGAAA 1140

CCGTGATGAC ATTTTTTAAG AGCTTTCCTG AGAACTGTTG CTTTCTGGAC CGGGACATAG 1200

GACGGAGCTT GAGGCCGCTC TTCCTCTGCT TGCGTCTGCA CGGCATCACC AAAGGCAAGG 1260

ATCTGGAGGT GCTGCGGCAC CTTAACTTCT TCCCAGAGTC ATGGCTCGAC CAGGTTACAG 1320

TCAACCATTA CCACGCACTG GAGAATGGGG GCGACATGGT CCACCTGAAA GATCTTAACA 1380

CCCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCAGGAGAA TACAACTTAT TCGAAAACGA 1440 

TTGCTCTATA TGGATTCTTC TTTAAGATAA AGGGACTCAA ACATGATACT ACCTCTTATA 1500 

GTTTTTACAT GCAGAGAATA AAGCACACAG ACCTGGAATC TCCCTCTGCG GTCTACGAGC 1560 

ACAACCACGT CAGCCTGCGA GCGGCACGCC TGGTGAAGTA TGAGATCAGA GCAGAGGCCC 1620 

TGGTTGACGG CAAGTGGCAG GAGTTCAGGA CAAACCAGAT CAAGCAGAAG TTTGGGTTGA 1680

CCACGTCATC CTGCAAAAGC CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 1740 

TAAGTTTTGC ATTCATCTTC CCAGCATCTT GACAGTTTCC AGAAGAATCT ATGGGATTTT 1800 

CCCCCCACTG GTCTGCATAA AAGAAAATAA AATGACATAA AAGGGAGC 

Seq I 



Protein Accession #: BAB71658.1

I I' I' I' 

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDI 
QIQKFFFENF KNKDIQSGEA DVILECLGFK WELHQPRLFQ SETLAKLYLK ALAQGTTHPL 
RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 
GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKFYEAGCK YKEEQLTTGC EKWLEMNLVP 
LGGTQIHLHK IPQDLLHKVL KSPRLFTFSE FHLLKTMLLW VFLQLNYKIQ AIPTYETVMT 
FFKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY
HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GFFFKIKGLK HDTTSYSFYM 
QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KWQEFRTNQI KQKFGLTTSS 
CKSHTLKIQT VGIPIYVSFA FIFPAS 



G CGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GGGCCTGCTA 
A ATATCATTAT TCTGCTGTCA GGCCTGTCCC TGTTTGCTGA GACCATATGG 
GTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 
TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT
GGTGTGGGTG CCGCACTCTG CCGCCGCCGG TCCATGGTCC TCACGTACCT GGTGCTCATG
CTCATCGTCT ACATCTTCGA GTGCGCCTCC TGCATCACGT CCTACACCCA CCGTGACTAC 
ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATGCTGA CCTTCTACAG CGCGGACACC 
GACCAGGGCC AGGAGCTGAC CCGCCTCTGG GACCGCGTCA TGATTGAGCA AGAATGCTGT 
GGCACATCTG GTCCCATGGA CTGGGTGAAC TTCACGTCAG CCTTCCGGGC GGCCACTCCG
GAGGTGGTGT TCCCCTGGCC CCCACTGTGC TGTCGCCGGA CGGGAAACTT CATCCCCCTC
AACGAGGAGG GCTGCCGCCT GGGGCACATG GACTACCTGT TCACCAAGGG CTGCTTCGAA
CACATCGGCC ACGCCATCGA CAGCTACACG TGGGGTATCT CGTGGTTTGG GTTTGCCATC 
CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCAT GCTCTGA
l saa; \EAE KGSPVVVGLL WGNIIILLS GLSLFAETIW VTADQYRVYP LMGVSGKDDV
FAGAWIAIFC GFSFFMVASF GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHRDY 
MVSNPSLITK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSGPMDWVN FTSAFRAATP 
EVVFPWPPLC CRRTGNFIPL NEEGCRLGHM DYLFTKGCFE HIGHAIDSYT WGISWFGFAI
LMWTLPVMLI AMYFYTML
GGACCTTGCC TCCACCTCCC AGCCCAGCAT GGCACCCCTG CTGCCCATCC




G ATTCTGCTGG CTCTGCTGTC CCCAGGGGCI GCAGACTTCA 

3 CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 
A GGAGGCAATG CCACACTGAT GGTCCGGAGA GCCAATGACA 

- C TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 

A CAGTGGTGCT GGCTTCACAG TCACTCGGCT CAGTGCATAC CAGGTGACAA 
C AGGAACCAAA TTCTACATTT CCTACCTAGT GAAGAAGGGG ACAGCCACTG 
^o^^u^G AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 
TGGGTATGGC CCGCACAGGG GGCATGGTGG TCATCACGGT GCTGCTCTCT GTCGCCATGT 
rCCTGCTGGT GC ,( TTC ATCATTGCCC TGGCACTGGG CTCCCGCAAG TAAGGAGGTC 
r :c i CGGAGC AGCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CAGCCCACCT 
GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 
CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCAGTGCC TCACCTTCCA 
ACACTCCATT ATTCCTCTCA CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACCA 
CTTTAAACAC CCCCATAACA ATTCCCCCAT CCTTCAGTGA ACTAAGTCCC TATAATAAAG 
GCTGAGGCTG CATCTGCCAA AAAAAAAAAA AA 

Seq ID NO: 173 Protein sequence 
Protein Accession # : NP_006751.1 



MAPLLPIRTL FLILILLALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL
MVRRANDSKV VTSSFVVPPC RGRRELVSVV DSGAGFTVTR LSAYQVTNLV PGTKFYISYL
VKKGTATESS REIPMSTLPR RNMESIGLGM ARTGGMVVIT VLLSVAMFLL VLGFIIALAL
GSRK 

Seq ID NO: 174
ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60

TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 

GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCACAT 240 

GGGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 360

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420

AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480

AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540

ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 

TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780

GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900 

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

GCAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020 

ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200

AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380 

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 
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AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT I T TT rTTCAGT 1620

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 
TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 

1U GTGTTCTTTA CACACTTCTT C~" 



CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATT7TTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 



Seq ID NO: 175 Protein sequence

Protein Accession #: Eos sequence

i i 1 r r r r 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK
EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ
NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG
IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEVVGSSSA SELLSAIEHV AEKAKTALHK
LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV
LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNLASVVS ILSNISSLSL
ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL
PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII
SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS
HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPVVKWITY VGLGISIGSL
ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA
VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV
TQPSNTYKRK DVCWLNWSNG SKPLLAFVVP ALAIVAVNFV VVLLVLTKLW RPTVGERLSR
DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL
LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN
IMLTQFVSNE

Seq ID NO: 176 DNA 
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GGGCATGCAG CCAT CGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 

GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATOACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680

TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740

TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
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AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACOATGAAAA TAACACTAGA 2040 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 
GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAAC^A C CCCTCTCCCT GGCCACAGAC 2400 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 
ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 
ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 
CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480

TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 
AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380

CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCA=J>AG AGAACAACAT 4440 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 
AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680

TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 
AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 
GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 
TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 
AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820
AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 
ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 6360 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420
TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 
CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020 
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AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCC1TGA 7980
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120

CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTT A ' A TA j^AAA GGGAATACAA 9360 

ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420

GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720

CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCIACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520
TCCC 




MNSLSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60
QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120 
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DAIKKPYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN D 
YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK V 
LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 
TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAVV VVELSSPSTN
EEFCCNHPFL FFIRQNKTNS I 



1 11 21 31 41 51 

I I I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 
CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 
GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 
GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 
GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 
TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 
CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 
GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT
GTCCGGGATC ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 
GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 
TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 
GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 
AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 
-CAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 



\ CTTGACATCC ACCCTCCAGC 1140 
3 GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 
C CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260
CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 
C AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 
T ACACACACAC ACACTTCACA CATACACACC 1440 
ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCATATTTTG TATTGATTTT 1500 
TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA A' 
GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC A
CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 
GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 
TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC A 
CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT A 

T CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTG 
C GAGGTGTCAT TTGACATGGT TCTCTGAACT TATC 
T GGAGGTCTGA TTTTGTGGCA AAAATAC 

T CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

Seq ID NO: 179 Protein sequence



I I I I I I 

MKTLLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKLRARS QLSEFWKSHN LDMIQFTESC
SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS NLWVPSVYCT SPACKTHSRF 
QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 
FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH
FSGSIiNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 
IGAAPVDGEY AVECANLNVM PDV1FTINGV PYTLSPTAYT LLDFVDGMQF CSSGFQGLDI 
HPPAGPLWIL GDVFIRQFYS VFDRGNNRVG LAPAVP 

Seq ID NO: 180 DNA sequence 

Nucleic Acid Accession #: NM_018058.1 

Coding sequence: 319..1575

1 11 21 31 41 51 

I I I I I I 

TACGCGCTGC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 
GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG
TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 



AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG T
CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG
CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC
GTGGGCCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT
AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT
GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 
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25 



GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC TTCCGGGGCA ATCAGGGCTT CAACAACAAC 
TGGCTGCGAG TGGTGCCACG CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 
CTCTACACCA AGAAGAGTGG GGCCCACCTG AGGATCATCG ACGGGGGCTC AGGCTACCTG
TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 
GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGAACGTGG CCAGCGGGGA GATGAACTCA
GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 
ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 
ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 
ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 

CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG
CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 
CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 

CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC
ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 
AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 

GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT



I I I I I 

MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPILS SSASDIFCDN E 
RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY L 
FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFFNNIAYRS SSANRLFRVI R 
ELNPGDALEP EGRGTGGVVT DFDGDGMLDL ILSHGESMAQ PLSVFRGNQG FNNNWLRVVP
RTRVGAFARG AKVVLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG
KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 



I I 
; TGCCCATCAC TGAGGGGTCC 



CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 

CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540

GGACGCTACT CTATCTACAT TCCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720 

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 

GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080

CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140

ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 
GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG A






GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 

GATGGCAAGA TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1560

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680

GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 

AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 1860

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920 

CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980

CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340

AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

A AGACACTTGG CACAAAACCT GGCACATAGT 2460 

AAAGGCTCAA TAAAAACAAG Ti 
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Protein Accession « : CAC08451 

1 11 21 31 41 51 

I I I I I I 

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD 
FEIVVAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 
GRYSIYIANY AYGNVGPDAL IEMDPEASDL SRGILALRDV AAEAGVSKYT GGRGVSVGPI 
LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 
VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV ITADFDNDQE LEIFFNNIAY
RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV VTDFDGDGML DLILSHGESM
AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKVVLYTK KSGAHLRIID GGSGYLCEME
PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI LYPRDEDTLQ DPAPLECGQG
FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACVGT
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVLVDGDLN LGSVVKESCE PSC




K GGTACAATGG A 

C CCAGAAGCGG C 

3 GGGAACGCCA TCGGGGTCAC A 

ATCGACGGGG A 

CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 
A CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC T 

TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 
GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA A 
CTGAGCGATG A 

GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC 720

GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 780 

GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 840 

TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 900 

GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGCCAACTG 960

TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 1020 

CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 1080 

TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 1140 

GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 1200 

CCCCTTGTCA CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC CCGGBGTGTC 1260 

CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC 1320 

CTGATGGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 1380 
G GGGCAGGCCA TGTCCAGATG TGCACTCAGG 
A CAGCACCTGC CTGCTAGAGA GCTGTATGAC 

CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGGAG AAGGGACTCG 1560

CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 1620

GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 1680 

CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 

GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 

CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 1860

TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920

CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 1980 

TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 2040 

ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 2100

TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 2160 

GCCTTCATCG nCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 2220 

CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 2280

CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 2340 
C CACCGGAGGA CACTCAGCCT CCAGGGTTCT 




G GAAAAGGGGC TACGGGGTCC 
GCTACGGGCT CCAATCACTA CCAGGAAAAG G 
AGGGGCTACG GGCTCCAATC ACTACCAGGA A 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT 
CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 
ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 
AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG G 
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GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 3840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3960

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 



P SGSSPASPPH SSSRYNGPNL VLKYDRAQKR 
SPYYALEDRQ GNAI GVTACD IDGDGREEIY FLNTNNAFSG HSSSAQVPSG 
PPTTPAGLLG LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE P 
GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV ACVDRKGSGR YSIYIANYAY 240 



GGDPEEADEE HSGDGSTSQL CRLGWKDGQF KEEAAALVEE QREAGAAGVP RGRVRTALQT 360 

SKSHLADKNL FGPPCYYSVC APSPAHPFPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420

PHPRAPGKDP KCKGRHAEPG LMAEALGAWP ALSTTWPGG LRSWEESRQK GQAMSRCALR 480 

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PKVTQECHLV ATMPALGGLE 540 

GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAALPGHP GNWVLDMAKA 600

laAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL LQFPSGLRGS PVLQVGLGLA 660 

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL FHHRGDGTFV DAAASAERRL 720 

AFIVHLKYHL CRDFPIISLdl LAETGPSSSC CPWHARLLQA PHCHHGLSMS FTRTGSRFYS 780 

FLTQGLASSA HRRTLSLQGS QGAPPCLLAR APCVLGSLIP TAYYIVLWSA IPSSLMTHSY 840 

LSSERVNVGV DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRLYLQMST HGKVRFRDIA 900

SPKFSMPSPV RTVITADFDN DQELEIFFNN IAYRSSSANK LFRCSILARG SSSLTAGGRN 960 

GOGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AKEPASAIAG 1020 

KGKGNVAQ S V PRTQAPQDTK PHYHKKGLQG PITTRKRGYG VQSLPGKGAT GSNHYQSKGL 1080 

RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK RGYGLQSLPG KGATGSNHYH 1140 

RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200

NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNKYQEKGL RGPITTRKRG YGLQSLPGKE 1260 

AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320 

GRGTGGWTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNNWLRWPR TRFGAFARGA 1380 

KWLYTKKSG AHLRI IDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440 

MNSVLEILYP RDEDTLQDPA PLECGQGFSQ QENGHCMDTN ECIQFPFVCP RDKPVCVNTY 1500

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTHTWKPR PKKELQLSQG ICTPVWSFFL 1560 
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE 



Seq ID NO: 186 DNA sequence
Nucleic Acid Accession #: 
Coding sequence: 75..374



I AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 
CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 
AGTTTTGC CAAGGAGTGC TAAAGAACTT A"" 






no«<»»«.C ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 
TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG
CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 
GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT G 
CCAGATGCAA TACAAGATT C CTGGTTAAAT TTGAATTTCA GTAAACAATG A 
CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA G' 
CAAAAAACAA CAAATAATTT TTAAATATAA GGATTT1
TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAAT; 
GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA A 
TTTGCCATAA AGT CAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA A 
CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT T 
TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC A
TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT T 
CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT C-TGCAAGMT TTGGAAAAAT 
AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA 



h ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 1260 

T TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 1320 

AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 1380 

AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 1440
TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA A 
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I I I I I I 

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SXPFHPKFIK ELRVIESGPH 60 
L SDGRELCLDP KENWVQRWE KFLKRAENS 



AGGGCAGAGG 



GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGG CTGCTGCTGA ACTGCCCAGG AATGAGGCAG ATGAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360

TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTGA GGATAACATA 420 

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATQ3CC 480 

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT 540 

CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTACGGAAA GAAGTGGTGG 660 

ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720 

GAGTTTTTGG GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 780 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

AOGGATGTGG CCCCTGTAAG CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 

CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140 
CAAGAACTGT GA 



I I I I I I

MSALFLGVGV RAEEAGARVQ ONVPSGTDTG DPQSKPLGDW AAGTMDPESS IFISDAIKYF

KEKVSTQNLL LLLTDNEAWN GKVAAAELPR NEADELRKAL DNLARQMIMK DKNWHDKGQQ 
YRNWFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NWSGSLSIS SGILTLVGMG 
LAPFTEGGSL VLLEPGMELG ITAALTGITS STKDYGKKWW TQAQAHDLVI KSLDKLKEVR 
EFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG 
EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDWYLVYE SKHLHEGAKS ETAEELKKVA

QELEEKLNIL NNNYKILQAD QEL 

Seq ID NO: 190 DNA sequence 
Nucleic Acid Accession #: NM_014452.1
Coding sequence: 1..1968

1 11 21 31 41 51 

I I I I I I 

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 
GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA
G AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 
CAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 
ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 
AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 
TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 
AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 
ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 
TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 
AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC
ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 
GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 
TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 
TCAGCAAGGG GGAAGGAAGA CGTGAAC 
CAGCAAGGCC CCCACCACAG ACACATCCTG A 
GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG G 



.TC CATG ACT CCA 



CTTGTAGCAG CCCAAGTGGG AAGCCAGT GG A 
AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 
GCAGCTCTGC AGCACTGGAC CATCC3GGGC CCCGAGGCCA GCCTCGCCCA GCTAATTAGC 
GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 
ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 
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AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 
TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 
GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 
AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 
GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATIGAAGA GATTCCCCAG 
GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 



I I I I I I 

MGTSPSSSTA LASCSRIARH ATATMIAGSL LLLGFLSTTT AQPEQKASNL IG1 
TGQVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIEKCHDCS QPCPNPMIEK 
LPCAALTDRE CTCPPGMFQS NATCAPHTVC PVGWGVRKKG TETEDVRCKQ CARGTFSDVP 
SSVMKCKAYT DCLSQNLWI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF PRPEHMETHE 
VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QEGTVPDNTS SARGKEDVNK TLPNLQWNH 
QQGPHHRHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PWMIVLFLLL 
VLWIWCSI RKSSRTLKKG PRQDPSAIVE KAGLKKSMTP TQNREKWIYY CNGHGIDILK 
LVAAQVGSQW KDIYQFLOJA SEREVAAFSN GYTADHERAY AALQHWTIRG PEASLAQLIS 
ALRQHRRNDV VEKIRGLMED TTQLETDKLA LPMSPSPLSP SPIPSPNAKL ENSALLTVEP 
SPQDKKKGFF VDESEPLLRC DSTSSGSSAL SRNGSFITKE KKDTVLRQVR LDPCDLQPIF 
E ELRVIEEIPQ AEDKLDRLFE IIGVKSQEAS QTLLDSVYSH LPDLL 



30 i 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840

CTCTACACTG GAACAGT CAG CAGCTTCCAA GGGAATGACC OGGCCATCTC GCGGAGCCAA 900 
AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

C GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

[■GGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

A TCATTGAGGA GCTGCAGAT C TTCTCATCGG GACAGCCCGT GCAGAAT CTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

•V ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCC1CGC CCGGGACCCC 1860 

T GGAGCGGCTC CAGCTGCAAG CACG1CAGCC TCTACCAGCC T CAGCTGGCC 192 0 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 
TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA A 



TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACG3 GGTGGCAGAC 2280

CAAACAGATG AGGGTGG CAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATC-CT CGGAGAGGGT CAACTGGACC 2820

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGIT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 
GAGAGGGAAG AGATAGCATG GCATGCAGCA C 



GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 
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T GCCACCGGCT GCCCTGTCTC ACTGCAC-ATT 3240 

C AGCCGAGGAT GTAGTTGTTG 3300 

T CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3 3 60 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 
CAGGGAAGAG A 

C CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3S60 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 



1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGS3ERPFL 
RFBAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSPL PGGEYQELLW GADA3KKQQC 
SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENFTLAB DEKGNVLLED 
GKGRCPFDPN FKSTALWDG ELYTGTOSSF QGNDPAISRS QSLRPTKTES SLNNLQDPAF 
VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 
TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 
FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 
NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 
PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 
PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 
FQPNTVKTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 
EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 
MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PUJGLGPPST 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 

Seq ID NO: 194 DNA sequence 



ATGGCAGATG GGGCAAAGGC CAACCCCAAA GGGTTCAAAA AGAAGGTGCT GGATAGATGC 



AGCCTGGGTA TGAAGAAGTT CTTCACCGTG GCCATCCTTG CTGGCAGCGT T 

\ CCTGAAGGCC ATGGTGGAGG CCGTCACAGG GAGGAGCGCC 
\ CGGTTGCTAC TGTGGGCTGG GGGGCCGTGG CCAGCCCAAG 
3 ACTGGTGCTG CCACGCCCAC GACTGCTGCT ACCAGGAACT CTTTGACCAA 
k CCACTATGAT CACACCATCG AGAACAACAC TGAGATAGTC 
TGCAGTGACC TCAACAAGAC AGAGTGTGAC AAGCAGACAT GCATGTGTGA CAAGAACATG 
5. GACGTACCGA GAGGAGTACC GTGGCTTCCT CAATGTCTAC 
C CCACGCCCAA CTGCAGCATC TATGAACCGC CCCCTGAGGA GGTCACCTGC 
AGTCACCAAT CCCCAGCGCC CCCCGCCCCT CCCTAG 

Seq ID NO: 195 Protein sequence 



MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFFTV AILAGSVLST 

AHGSLLNLKA MVEAVTGRSA ILSFVGYGCY CGLGGRGQPK DEVDWCCHAH DCCYQELFDQ 

GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCLKNQTYR EEYRGFLNVY 

CQGPTPKCSI YEPPPEEVTC SHQSPAPPAP P 






| | | | | |

GGCATTGATG CTGTGTGCGC GTGCGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 
GTGTGTGTGT GTCTGGAGTC ATGGCAGGGT CCCTTTCTGT CTGTCTCCTT GCTCTGCCCC 
AGACTGGGGG GCTGCAGAGG TGAGGGTATC TGGCCTCAAC AGCIGCTTAT TCCCGATGGG 
ATGGCCTGGG CTGGGCCCCT GAGG CCAGGC TGACTTGGAC ATGGCAAGAG GGGTCCCAGG 

CTCTTGTGGG CAAAGCAGGG GAGGCGCCAA TGTGGAGGAA CAGAGTCTCC TGGCTGGCTG
CTGCTCTCTG GAGCGGGTGG AGTCAGGGAA GAGCTGAGCT GGGGAGTCAC CCTGGGCCTG 
GGGTCACCGT AGGCCCCATG TAGCACCCTG GTTCCCCTGC CTGTAGGTGA CAGGAGCCAG 
CCCAGCCAGG TGTGCTCCCT CCCCAGGCCC TAGGCAGGCG GGTACAGGGG CCAGCAGCTG 
CGCCCGCCCC ACCTTCCTTC CCACCCACAT GCCGAAGGGT GGCCAGGCAG GCAGGTGGAC 

GAGTCCAGGC AGCGGCTGAG TCAGTGTGTG TGGAATGTTC TGGCCGCICC CAGCTGCACC

CTGCCCCTAC CTGCCACCAC CTCACCTTCA TCCTCAGGCG CTGCGGCCCT GAGCCCCTGC 
CAGGAATGCA CCTTTAGCCC AGGCCTGCTC AGTGAGCTCC GCCGACAGCC AGCCCTGCTC 
CTCCCGCCAT GACCCTGCAG ACCCCTCIGG GCTTCCAAGT TCCTGGGGGC TGCAGTGAAC 
ATGCTCCACC TGCATGGCTG GCAAACCATG GTGGGCCCCA GCIGTGGTGC GTGCTGGGGT 

AGAGGCAAGG AAGTGATGGG ACCGCAGAGA TGAGACCCCC AGGGATGAGA IGGGACCCCC
AGGCAGGGCC CAGGGTC CAG GGCC CAGGAG AGAGAAGCAG GGAGGGAGAG AGCTTCCTGG 
TGGAGGACGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTCCGGTGAA GGCAGGGACT 
AGGCTGCCCA GGCCCTGCCT GCTTGGCIGG GGCTGGGGGC TGCTGGGAGG TGGCTGGGAG 
GCTGGGCCTG GGCAGCTAAG CTGGAGCTTT GGCCAGGGTC CAGAGCCTCX: C 
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C TCGCCCCTGG CCACCCCGTG CTGCCTCCTT GCCCTGGCAG 1200 

T AGTCAGATGG GGTAGCGGGC AGGGC-CCGGA GGGGCCACCC 1260

TCCCAGCTGA CCCAGCCTCC TGGGCCGCTT CTTCCAAACC AGCAGGGTAG AAAGATGGGG 1320 

CACCCACCAG TCTCTCCCAG TGCCCCGGCC CCAGCTGGCA CCACAGCTAT ACCTGGGCTT 1380 

ATT CCAGACC TTGTCGCCGG GACCCCCTGT GAGTTGTC-GG ATTCCCAAGA GGGGTGTGGG 1440 

GATAACCCAG CCAAGTGGGG GCTGCAGCTG TCCACAGATG CACTCAGCCI GGCCTCTACC 1500 

CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG CGGGCGTCCT CCTCGTCTCC 1560

TGCCTCCTCT GTGCTGCCTG CTGCTGCTGC CGCCGCCACA GGAAGAAGCC CAGGGACAAG 1620 

GAGTCCGTGG GTCTGGGCAG TGCCCGCGGC ACCACCACCA CCCACCTGGT GAC-GAGCGGC 1680 

TCCTTGCTCA CTCAGTCCAG AGAGGGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGCGA 1740 

GGCGAGTTCA GCCCCAGGGA TGGTTTAACC CCCACAGAGG CAGGGCGTTG AGGACCTTCC 1800 

TGGCAGGGAA AGTGGGTGAA CAGAGGTGAG AAGGAGGCCA IGCAACAGGG GCTGCCCCAT 1860 

GGGCCCGAGG GAGCCACAGC GGGTTCTTGA GGAAGGCAGG GGGTACCCCA GATGCCACGT 1920 

TTTGGGTGGG TTTGGCCGGT CTCACAGAGC GAAGCCGACG ATTTGTGCCT GTTGGGTGGC 1980 

CTGGCCTGGA GGCGGGGGGT CTTGACCCAT GTCATGCAAG GGCTGCCCGG GAGCCCAGGG 2040 

CTCTGATGAG GCATGATGTC AGCACCACCT GCCCCTTGTC CCAACTCACT CCAGGTGCAA 2100 

CCT GATGTGG ATGGCCTGGA GTCCAGCCCG GGGGATGCTC AGCAATGGGG GTGCCTGCAG 2160 

CTCTCCCTGG AGTTCGACTT TGGAAGCCAG GAGGTGAAGG GCCCCGCTGC GCAGGACCAG 2220 

CGGTTCTGCG AGTTTCCGGA AAGGGTGACG GGGOAAGGGC AGACCCCATG CCCTGGGTGG 2280 

TGGGGAGCTG ACAGGGCAGG GGCCCTTGGC TGAGCCCACC CCGCTGGCTC CCAGATCAGG 2340



GCTGGGTGGG CCTGAGCTAG GGCAGCAGGG CCTGGCTCAC GCCGCTGCCT CAGATCCCGC 2640 

AGGCGGAGCT GCCAGGGGCC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG CGCTTCTCGG 2700 

GGCATGAGCC CCTGGGTGAG CTCCGTCTGC CACTGGGCAC CGTGGATCTG CAGCATGTTC 2760 

TGGAGCACTG GTACCTGCTG GGCCCGCCGG CTGCCACTCA GGTGAGGTGC TGGTCACCAG 2820 

GCCACAGCCC AAGGCAGAGC TGGCAGGGAC CCTGCCCTAT GGGCCATGGG AAAGACAGGC 2880 

CTGATGGGCA GCATTTTCGG GGGTCTGAGC CCCAACTCGG CCAGAATCAC CCTCCCGGGC 2940

TGAAGCCCCT CTTGCTGCCC ACAGCCCGAG CAGGTCGGGG AGCTGTGCTT CTCTCTCCGG 3000 

TACGTGCCCA GCTCAGGCCG GCTGACCGTG GTGGTGCTGG AGGCTCGAGG CCTGCGTCCA 3060 

GGACTTGCAG AGCCCTACGT GAAGGTCCAG CTCATGCTGA ACCAGAGGAA GTGGAAGAAG 3120 

AGAAAGACAG CCACCAAAAA GGGCACGGCG GCCCCCTACT TCAATGAGGC CTTCACCTTC 3180 

CTGGTGCCCT TCAGCCAGGT CCAGAATGTG GACCTGGTGC TGGCTGTCTG GGACCGCAGC 3240 

CTGCCGCTCC GAACTGAGCC CGTAGGCAAG GTGCACCTGG GTGCCCGGGC CTCGGGGCAG 3300

CCCCTGCAGC ACTGGGCAGA CATGCTGGCC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360 

CCCCTGCGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTCCCCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 3480 
GCACTTGCCC AGGCCGCCCT GCAGGACCAC TGCAATAAAC GCCTTCTCCT GCC 

Seq ID NO: 197 Protein sequence



k PAPAGTTAIP GLIPDLVAGT PCELWDSQEG CGDNPAKWGL QLSTDALSLA 
STPGPRWALI AGALAAGVXiL VSCLLCAACC CCRRHRKKPR DKESVGLGSA RGTTTTHLVR 
SGSLLTQSRE GLKSRLQSPG QRGEFSPRDG L 



1 11 21 31 41 51 

I I I I I I 

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCCGACTCC 60 

CTCCCCCCCT CACGCCCGCC CTCTCGCCTT CGCCGAACCA AAGTGGATTA ATTACACGCT 120 

TTCTGTTTCT CTCCGTGCTG TTCTCTCCCG CTGTGCGCCT GCCCGCCTCT CGCTGTCCTC 180 

TCTCCCCCTC GCCCTCTCTT GGGCCCCCCC CTTTCACGTT CACTCTGTCT CTCCCACTAT 240 

CTCTGCCCCC CTCTATCCTT GATACAACAG CTGACCTCAT TTCCCGATAC CTTTTCCCCC 300 

CCGAAAAGTA CAACATCTGG CCCGCCCCAG CCCGAAGACA GCCCGTCCTC CCTGGACAAT 360 

CAGACGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCCCCGCTCT GCCCCGTCGC 420 

ACATTCGGCC CCCGCGACTC GGCCAGAGCG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 480 

CCAACGCCCG CTGTTCGGTT TGCGACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCCGGC 540 

TTCCAGACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CACCTTCTTG 600 

GCCTTCGCCT CGTGCTGCAT TGCTGCTTAC CGCCCCAGTG AGACCCTGTG CGGCGGGGAG 660 

CTGGTGGACA CCCTCCAGTT CGTCTGTGGG GACCGCGGCT TCTACTTCAG CAGGCCCGCA 720 

AGCCGTGTGA GCCGTCGCAG CCGTGGCATC GTTGAGGAGT GCTGTTTCCG CAGCTGTGAC 780

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCCGCCAAGT CCGAGAGGGA CGTGTCGACC 840 

CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCAAGTT CTTCCAATAT 900 

GACACCTGGA AGCAGTCCAC CCAGCGGCTG CGCAGGGGCC TGCCTGCCCT CCTGCGTGCC 960 

CGCCGGGGTC ACGTGCTCGC CAAGGAGCTC GAGGCGTTCA GGGAGGCCAA ACGTCACCGT 1020

CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 1080 

AGCAATCGGA AGTGAGCAAA ACTGCCGCAA GTCTGCAGCC CGGCGCCACC ATCCTGCAGC 1140

CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCCG AAAATCTCTC GGTTCCACGT 1200 

CCCCCTGGGG CTTCTCCTGA CCCAGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTCGGCCCC CTCCATCGGG CTGAGGAAGC ACAGCAGCAT CTTCAAACAT GTACAAAATC 1320 
GATTGGCTTT AAACACCCTT CACATACCCT CCCCCC 
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I I I I I I 

MGIPMGKSML VLLTFIAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGF YFSRPASRVS 

RRSRGIVEEC CFRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 

QSTORLRRGL PALLRARRGH VLAKELEAFR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 



I I I I I I 

AGTCTGGGCG TTTAGGTCAG AACTACCCCG GTAGCCTGAC AGCAGGAGCT CGAGAGAAGC 
ATGGCTCAGC GGTGCGTTTG CGTGCTGGCC CTGGTGGCTA TGCTGCTCCT AGTTTTCCCT 
ACCGTCTCCA GATCGATGGG CCCGAGGAGC GGGGAGCATC AAAGGGCGTC GCGAATCCCT 
TCTCAGTTCA GCAAAGAGGA ACGCGTCGCG ATGAAAGAGG CGCTGAAAGG TGCCATCCAG 
ATTCCAACAG TGACTTTTAG CTCTGAGAAG TCCAATACTA CAGCCCTGGC TGAGTTCGGA 
AAATACATTC ATAAAGTCTT TCCTACAGTG GTCAGCACCA GCTTTATCCA GCATGAAGTC 
GTGGAAGAGT ATAGCCACCT GTTCACTATC CAAGGCTCGG ACCCCAGCTT GCAGCCCTAC 
CTGCTGATGG CTCACTTTGA TGTGGTGCCT GCCCCTGAAG AAGGCTGGGA GGTGCCCCCA 
TTCTCTGGGT TGGAGCGTGA TGGCGTCATC TATGGTTGGG GCACACTGGA CGACAAGAAC
TCTGTGATGG CATTACTGCA GGCCTTGGAG CTCCTGCTGA TCAGGAAGTA CATCCCCCGA 

ATCTCAGCCC TGCTACAGTC AAGGGGCGTC CAGCTAGCCT TCATTGTGGA CGAGGGGGGC 
C AAGAAGCCCA TCGCCTTGAT T 



CCTCCAAAGG AGACAAGCAT TGGCATCCTT G 
CCAATGCCTA TCATATTTGG AAGCGGGACA GTGGTGACTG TATTGCAGCA ACTGGCAAAT 
: CCTTAACCAA TGCAATAATC AGGACCACCA CGGCACTCAC 
\ AGTTCAATGT CATCCCCCCA G 
CTTCCGGATT CACCCTGGAC AGACAGTCCA A 

r TCCATGTGTT GAGTGCCTTT GACCCCCTCC CCGTCAGCCC 



GATTCTTTAC 1320 

ACAGGTTCTA C 



ACTGTGAGGT CAAGGGGCCT GCTGGGTTAG GCATGCCCGA CCCCGGGACA G 

G TAAAGTCAGC 16 80 

>\ TCTGCATCCC TTGATCCACT GGCATTTGCT 1740 

I TCCAGGCTTG 1800 

A TGTAGATTTG AGAAATCTCA ACCAGTTGTT ACCTGATAGG AGTCTTTAAT 1860

TTAGGGCACT CTTGCTGGGA TGCTTTCTCC AGAGCTTATA TATTTCTTCT TACTAGAACT 1920 

TTCTTCCCCC TTTTATTCCC CTCTCTTCTT GGACTCATGA GCTGTCTCTT CATCTCTCCT 1980 

CTCTCTCCTG CATCTCTCCC CTTACTCTTC AATTTATTCT ACTTCTGGAC CTGGACTTAC 2040 

CCAAACTGTG ATACTACCAT AATTGTCACC ATAATCAGTC AAATAAAGTG ATCTGTGCAT 2100 



1 11 21 31 41 51 

I I I I I I 

MAQRCVCVLA LVAMLLLVFP TVSRSMGPRS GEHQRASRIP SQFSKEERVA MKEALKGAIQ 
IPTVTFSSEK SNTTALAEFG KYIHKVFPTV VSTSFIQHEV VEEYSHLFTI QGSDPSLQPY 
LLMAHFDWP APEEGWEVPP FSGLERDGVI YGWGTLDDKN SVMALLQALE LLLIRKYIPR 
RSFFISLGHD EESSGTGAQR ISALLQSRGV QLAFIVDEGG FILDDFIPNF KKPIALIAVS 
EKGSMNLMLQ VNMTSGHSSA PPKETSIGIL AAAVSRLEQT PMPIIFGSGT WTVLQQLAN 
EVYGEKSLNQ CNNQDHHGTH HIQSRGQVQC HPPSGPGHSQ LPDSPWTDSP RGPRTHEEHC 
G 

Seq ID NO: 202 DNA sequence 
Nucleic Acid Accession #: NM_004217.1 
Coding sequence: 58..1092

1 11 21 31 41 51 

I I I I I I 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 
GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 
AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 
ITGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 
C CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 
CCTCTGGGCA AAGGCAAGT T TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCX 
AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 
3 AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC C 
G CAGATGCTCT AATGTACTGC CATGGGAAGA A 
AAGCCAGAAA ATCTGCTCTT AGGGCTCRAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720
3 CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCIGGA CTACCTGCCC 780 
A TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 
CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900
TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 
GACCTCATCT CCAAACTGCT C 
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TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCITTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 



Protein Accession #: 



MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMEN 
SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAHEKKSH FIVALKVLFK SQIEKEGVEH 
QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG ELYKELQKSC TFDEQRTATI 
MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG WSVHAPSLRR KTMCGTLDYL 
PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KFPASVPTGA 
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA 



I I I I I I 

AGAACGGCTT CCGGCGGGAG CTGTGCAGCT CCTTATCATG GGGACAATTC ATCTCTTTCG 
AAAACCACAA AGATCCTTTT TTGGCAAGTT G 
CCGAAGGTCC T 

GCTTATGTGG TGCAGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 
TTTTGATCTT TTTAGTTTAA TGACATGTTT A 
TAGCCCTGTC TATTCATTTG G 

AGTCTTGGCA CAGTTGGGAG CTCTCTTTAT ATTAAAAGAA AGTGCAGAAC GCTTTTTGGA 
ACAGCCCGAG ATACACACGG GAAGATTATT AGTTGGTACT TTTGTGGCTC TTTGTTTCAA 
CCTGTTCACG ATGCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAG AAGCTGCTAG 
TACGAGCTGG CTTCAAGAGC ATGTTGCAGA TCTTAGTCGA AGCTTGTGTG GAATTATTCC 
GGGACTTAGC AGTATCTTCC TTCCCCGAAT GAATCCATTT GTTTTGATTG ATCTTGCTGG 
AGCATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TTGCCGTAGA 
CACTGCCTCT GCTATAGCTA TTGCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 
GTACAGTGCG AAAGTCTTAC TCCAGACAAC A 
ACTCATCAGA GAGGTATCTA CCTTAGATGG A 

GACCCTAGGT TTTGGCTCAT TGGCTGGATC AGTGCATGTA AGAATTCGAC GAGATGCCAA 960

TGAACAAATG GTTCTTGCTC ATGTGACCAA CAGGCTGTAC ACTCTAGTGT CTACTCTAAC 1020 

\AGGATG ACTGGATTAG GCCTGCCTTA TTGTCTGGGC CTGTTGCAGC 10 80 

A AACTTTTCAG AT CATCACGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 1140 

TGATTTGAAC CCAGTTACAT CAACTCCAGC TAAACCTAGT AGTCCACCTC CAGAATTTTC 1200

3GGAAAA ATGTGAACCC AGTTATTCTT CTAAACACAC AAACAAGGCC 12 60 

T GGTCTCAATC ATGGACACAC ACCTTACAGC AGCATGCTTA ATCAAGGACT 1320 

TGGAGTTCCA GGAVTTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAG 1380

TAGATATGGA ACTAATAATA GAATTGGACA ACCAAGACCA TGATAGACTC TAACTTATTT 1440 

TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAG TAATCCAACT TTGCATTGAC 1500 

TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCATGAAACC 1560

TATGAAACTA TATTTTTGTA AAATGTATTT GTGACAGTGA AATCCTCGTA AATGTTAAAG 1620 

GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG 1680 

GTTTTGTAGT TGACTGCAGT GTGATGTGAC CTTACCTTTA TAAGAGCCAC TTGATGGAGT 1740 

AGATCTGTCA CATTACTAAG ATACGATATT TCTTTTTTTT TCCGAGACGG AGTCTTGCTC 1800 

TGCCACTGTG CCCGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 1860

CTTAGTTTTT GTTTTGTTTT GTTTTTTGAG ATGGAGTCTC ACTCTGTCGC CCAGGCTGGA 1920 

ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGAGTTCAA ATGATTCTCC 1980 

TGCCTCAGCC TCCCGAGTAG CTGGGATTAC AGGCACCTGC CACCACGCCC AGCTAATTTT 2040

TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 2100 

ACCTCATGAT CCACCCACCT TAGCCTCCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 2160 

CCTGGCCGAT ATTTTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 2220

GGGAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 2280 

AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGCAAAAA 2340 

TTTTTAGCAG AAATTTTGGA ATACATTCTA TCTAGCACAA TTTGAATITT TAATTATCAA 2400 
GATTTTTGTT AAAGTTTCTC TCCITTAAAA ATTTTAGTAC ATTTGTAAAT 

Seq ID NO: 205 Protein sequence



I I I 

QRSFFGKLLR EFRLVAP 

LFSLMTCLIS YWTLRKPSP VYSFGFERLE 
ESAERFLEQP EIHTGRLLVG TFVALCFNLF TMLSIRNKPF 
RSLCGIIPGL SSIFLPRMNP FVLIDLAGAF ALCITYMLIE I 
GTMYPMSVYS GKVLLQTTPP HVIGQLDKLI REVSTLDGVL EVRNEHFWTL G 
VRIRRDANEQ MVLAHVTNRL YTLVSTLTVQ IFKDDWIRPA LLSGPVAANV LNFSDHHVIP 
MPLLKGTDDL NPVTSTPAKP SSPPPEFSFN TPGKNTOPVI LLNTQTRPYG FGLNHGHTPY 
SSMLNQGLGV PGIGATQGLR TGFTHIPSRY GTHNRIGQPR P 

Seq ID NO: 206 DNA sequence 
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GGAACTCAGG GCCGGCTCCT GTTCCTTCAA GAGTGCTGGA GGCCAAACTT GAAATACAAG 60 

TTTAATGTTC CTCGTCGGGC AAAAGATAAG GATCCGATCT CCCCCGGCCC GGTGTGCAGC 120 

AGGAGCGACC AACCCCGACC CGGGTTAAAA CTCCCAGGGA CTCTTCGCTG CTGCCACCTC 180 

TTGTTCTCTC CCCCGTTCCC ACTCGGGGTC TCCCTCAGGG CCGGGAGGCA CAGCGGTCCC 240 

TGCTTGCTGA AGGGCTGGAT GTACGCATCC GCAGGTTCCC GCGGACTTGG GGGCGCCCGC 300 

TGAGCCCCGG CGCCCGCAGA AGACTTGTGT TTGCCTCCTG CAGCCTCAAC CCGGAGGCAG 360 

CGAGGGCCTA CCACCATGAT CACTGGTGTG TTCAGCATGC GCITGTGGAC CCCAGTGGGC 420 

GTCCTGACCT CGCTGGCGTA CTGCCTGCAC CAGCGGCGGG TGGCCCTGGC CGAGCTGCAG 480 

GAGGCCGATG GCCAGTGTCC GGTCGACCGC AGCCTGCTGA AGTTGAAAAT GGTGCAGGTC 540 

GTGTTTCGAC ACGGGGCTCG GAGTCCTCTC AAGCCGCTCC CGCTGGAGGA GCAGGTAGAG 600

TGGAACCCCC AGCTATTAGA GGTCCCACCC CAAACTCAGT TTGATTACAC AGTCACCAAT 660 

CTAGCTGGTG GTCCGAAACC ATATTCTCCT TACGACTCTC AATACCATGA GACCACCCTG 720 

AAGGGGGGCA TGTTTGCTGG GCAGCTGACC AAGGTGG3CA TGCAGCAAAT GTTTGCCTTG 780 

GGAGAGAGAC TGAGGAAGAA CTATGTGGAA GACATTCCCT TTCTTTCACC AACCTTCAAC 840 

CCACAGGAGG TCTTTATTCG TTCCACTAAC ATTTTTCGGA ATCTGGAGTC CACCCGTTGT 900 

GCAGATTCAG AAGTCTTGTA TCCCAACTAC CAAAGCTGCT G3AGCCTGAG GCAGAGAACC 1020 

AGAGGCCGGA GGCAGACTGC CTCTTTACAG CCAGGAATCT CAGAGGATTT GAAAAAGGTG 1080 

AAGGACAGGA TGGGCATTGA CAGTAGTGAT AAAGTGGACT TCTTCATCCT CCTGGACAAC 1140 

ATGATCGAAC AGAGAGCTGT GGACACATCC TTGTACATAC TGCCCAAGGA AGACAGGGAA 1260 

AGTCTTCAGA TGGCAGTAGG CCCATTCCTC CACATCCTAG AGAGCAACCT GCTGAAAGCC 1320 

ATGGACTCTG CCACTGCCCC CGACAAGATC AGAAAGCTGT ATCTCTATGC GGCTCATGAT 1380 

GTGACCTTCA TACCGCTCTT AATGACCCTG GGGATTTTTG ACCACAAATG GCCACCGTTT 1440

GCTGTTGACC TGACCATGGA ACTTTACCAG CACCTGGAAT CTAAGGAGTG GTTTGTGCAG 1500 

A GCAGGTGCCG AGAGGTTGCC CTGATGGGCT CTGCCCGCTG 1560 

T GTCAGTTTAT ACCTTAAGCC CAGAAAAATA CCATGCACTC 1620 

T GGAAGTTGGA AATGAAGAGT AACTGATTTA TAAAAGCAGG 1680 
A AGTGCCTTTA TACAAAAAAA AAAAAAAAAA A 



Seq ID NO: 207 Protein sequence
Protein Accession #: NP_057445.1

1 11 21 31 41 51 

MRLWTPVGVL TSLAYCLHOR RVALAELQEA DGQCPVDRSL LKLKMVQWF RHGARSPLKP 
LPLEEQVEWN PQLLEVPPQT QFDYTVTOLA GGPKPYSPYD SQYHETTLKG GMFAGQLTKV 
CMQQMFALCE RLRKNYVEDI PFLSPTFNPQ EVFIRS7NIF RNLESTRCLL AGLFQCQKEG 
PIIIHTDEAD SEVLYPNYQS CWSLRQRTRG RRQTASLQPG ISEDLKKVKD RMGIDSSDKV

DFFILLDNVA AEQAHNLPSC PMLKRFARMI EQRAVDTSLY ILPKEDRESL QKJWGPFLHI 
LESNLLKAMD SATAPDKIRK LYLYAAHDVT FIPLLMTLGI FDHKWPPFAV DLTMELYQHL 
ESKEWFVQLY YIIGKEQVPRG CPDGLCPLDM FLNAMSVYTL SPEKYHALCS QTQVMEVGME 



1 11 21 31 41 51 

| | | | | |

TTTGAGGGGG TGGTGGGGCG AGTTTAATTC ATAAAGAAGC CTCCTGATCA GAAAGGGGCC 
TAACAGCCTG CCCTTGGAGA GAAGTCCTTC CT1GAGGATA AGGCCTCCCA GGGGAGGAGG 
TGCTGGGGGC CAGTGTTAGG CTTCAGGCCA TCCCTGGAGG CCAGTCCTGT GCTCAGCAAG 
TAGTGGCAGA GCCTGGAGTG ATGAGTGGGA TGGCCTTCTC AGGTACAGGA CTGTGCTGCT 
TCTGGCTGCT CTTGCATTTG CATTTGCCAC TCAGAACTGC CGCGATCCCA GCAATGGCCA
GGAGCCCTCC GCAGATCAGT CCGCTCAGCT GCAGGTTTTT CCAGTCATAG TAGAAGGGAT 
CGTCTTTATT GGCAAATGGG TCATTGGCTT CCAAGGCAGT CAGGCCAACT GTGTGACTCT 
GCAGGTTCCT CACTGCTCCT TCACCAGTGT CCTGCGAGGT CACCTTGGCG AGGGCTCACC 
TGAGCTGGCA GCGCAG 




I I I I I I 

ATGGAGCCCT GGGCGTGGCT GCAGGGTTTA AAGAGCCGAC CCACGTGCCC AGCAGCCTCC 
TCAGATCCGT TCTCTGCGCT GCCAGCTCAG GACACTGGTG AAGGAGCAGT GAGGAACCTG 
CAGAGTCACA CAGTTGGCCT GACTGCCTTG GAAGCCAATG ACCCATTTGC CAATAAAGAC 
GATCCCTTCT ACTATGACTG GAAAAACCTG CAGCTGAGCG GACTGATCTG CGGAGGGCTC 
CTGGCCATTG CTGGGATCGC GGCAGTTCTG AGTGGCAAAT GCAAATGCAA GAGCAGCCAG 
AAGCAGCACA GTCCTGTACC TGAGAAGGCC ATCCCACTCA TCACTCCAGG CAGATTTCTC 
ACCTTGGCCA AATCAAATAA ACCTTTATCT CCAAGCACCT TTGTCTTGGT GTTTGGCATC 
TCTTCCG AGTGCCTCTT TCTGCGTCCC TGTACCCTGC CATTCCTGGT 
G CCCTCACATC AGGCCATCCA AGCATGCAGA ACATAAGCAT GCAGAACACT 
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TLAKSNKPLS PSTFVLVFGI SYTSVFRVPL SASLYPAIPG DAAALTSGHP SMQNISMQNT 




I I I I | 

ATGCCCGGCC ACCCCGTCTG TGAAGTGAGG AGCACCTCTG CCCGSCTGCC CCGTCTGGGA 
AGTGAGGAGC GCCTCTGCCC GGCTGCCACC CCGTCTGTGA GTGCCTGCTG CGCTGGGCCC 




AGGCCGCCCG TGCCCTGCCA GGCCCTCCGG CCCCCCACCT TCCACCCCAG GGCCTGCTCC 
TCACCCCAGG GTTCCATCTC CCTAGTTTCC ACCAGAGACT GGGTCTTCAT TCTCACCCTG 



CTACACAGCC CCTACCAGAA CGTTCTGAAA TGCAAACCTA ACAACTGTCT CACCCCAGCA 
GGAAACTCCC CAGGGTCCCG GGCCCCCTGC GGGGTTGCAG GCCTCACTCT TCGCGCCCAT 
CCCTCCGCCC TGACCGCCCT GAGCTCGCCC CCAGTGCTGO CCCTTCACGT CCAGTTATCC 
CTCCCAGCCT CCAAGGTCCC CGTTACCGAA GACCGCCACC ATCACGACAT AGCGCAGCAC 
ATATGGGACA CTGGTGAAGG AGCAGTGAGG AACCTGCAGA GTCACACAGT TGGCCTGACT 
GCCTTGGAAG CCAATGACCC ATTTGCCAAT AAAGACG AT C CCTTCTACTA TGACTGGAAA 
AACCTGCAGC TGAGCGGACT GATCTGCGGA GGGCTCCTGG CCATTGCTGG GATCGCGGCA 
GTTCTGAGTG GCAAATGCAA ATGCAAGAGC AGCCAGAAGC AGCACAGTCC TGTACCTGAG 
AAGGCCATCC CACTCATCAC TCCAGGCAGA TTTCTCACCT TGGCCAAATC AAATAAACCT 
TTATCTCCAA GCACCTTTGT CTTGGTGTTT GGCATCAGCT ACACATCAGT CTTCCGAGTG 
CCTCTTTCTG CGTCCCTGTA CCCTGCCATT CCTGGTGATG CTGCTGCCCT CACATCAGGC 
CATCCAAGCA TGCAGAACAT AAGCATGCAG AACACTGGAA CGAAGGGCTG TACCTAA 



Seq I 



I I I I I I

MPGHPVCEVR STSARLPRLG SEERLCPAAT PSVSACCAGP RPPVPCQALR PPTFHPRACS 
SPQGSISLVS TRDWVFILTL LHSPYQNVLK CKPNNCLTPA GNSPGSRAPC GVAGLTLRAH 
PSALTALSSP PVLALKVQLS LPASKVPVTE DRHHHDIAQH IWDTGEGAVR NLQSHTVGLT 
ALEANDPFAN KDDPFYYDWK NLQLSGLICG GLLAIAGIAA VLSGKCKCKS SQKQHSPVPE 
KAIPLITPGR FLTLAKSNKP LSPSTFVLVF GISYTSVFRV PLSASLYPAI PGDAAALTSG 
HPSMQNISMQ NTGTKGCT 



Seq ID NO: 213 DNA sequence



ATGATGGGGT CTCATGTTGC



TGGGCTCGAG TGACCCTCCT

GCCTTGGCCT CCGAAAGTGC TGGGATTACA GGACTGTTAT TACAGGAATC CATAACACTG
GAGGATGTGG CTGTGGACTT CACTTGGGAG GAGTGGCAAC TCCTGGGCGC TGCTCAGAAG 
GACCTGTACC GGGATGTGAT GTTGGAGAAC TACAGCAACC TGGTGGCAGT GGGGTATCAA 
GCCAGCAAAC CGGATGCACT CTTCAAGTTG GAACAAGGAG AACAACTGTG GACAATTGAA 
GATGGAATCC ACAGTGGAGC CTGTTCAGGT TCTCCAAAGG TCCCGTTCTC CATTTTCTCA
TCTGTGCCCT TCACTCTTCA AAATTGCCTT CATTCTAACA TATGGAAAGT TGATCATGTG 
CTGGAGCGCT TGCAGAGTGA AAGCCTGGTG AACAGAAGGA AACCATGTCA TGAACATGAT 
GCATTTGAAA ATATTGTTCA TTGCAGCAAA AGTCAGTTTC TGTTAGGGCA AAATCATGAT 
ATATTTGACT TACGTGGAAA AAGTTTGAAA TCCAATTTAA CTTTAGTTAA CCAGAGCAAA
GGCTATGAAA TAAAGAACTC TGTTGAGTTT ACTGGAAATG GGGACTCCTT TCTTCATGCT 
AACCATGAAC GACTTCATAC TGCAATTAAA TTCCCTGCAA GTCAAAAACT CATCAGCACT 
AAGTCCCAAT TCATCAGTCC CAAGCATCAG AAAACACGAA AATTAGAGAA GCATCATGTG 
TGCAGTGAAT GTGGGAAAGC CTTCATCAAG AAGTCTTGGC TAACTGATCA CCAGGTAATG 
CATACAGGAG AGAAACCCCA CAGATGTAGT CTATGTGAGA AAGCCTTCTC CAGAAAGTTC 
ATGCTTACTG AACATCAGCG AACTCATACA GGAGAAAAAC CTTATGAATG CCCTGAATGT
GGCAAAGCCT TTCTCAAGAA ATCACGGCTC AACATACATC AGAAAACACA TACCGGAGAG 
AAACCCTATA TATGCAGTGA ATGTGGAAAA GGCTTCATCC AGAAAGGAAA TCTCATTGTA 
CACCAGCGAA TTCATACAGG TGAGAAACCT TATATATGCA ATGAATGTGG AAAAGGCTTC
ATTCAGAAGA CGTGTCTCAT AGCACATCAG AGATTTCACA CAGGAAAGAC GCCCTTTGTG
TGCAGTGAAT GTGGAAAATC CTGTTCTCAG AAATCAGGTC TCATTAAACA TCAAAGAATT
CACACAGGAG AGAAACCCTT TGAATGTAGT GAATGTGGGA AAGCCTTTAG CACAAAGCAA 
AAGCTCATTG TCCATCAAAG GACTCATACA GGAGAGAGAC CCTATGGCTG TAACGAGTGT 
GGGAAAGCGT TTGCGTATAT GTCGTGTCTG GTTAAGCATA AGAGAATACA CACAAGGGAG 
AAACAAGAGG CAGCCAAGGT GGAAAATCCT CCTGCAGAGA GGCACAGCTC ATTACACACC
AGTGATGTCA TGCAGGAGAA AAACTCTGCT AACGGGGCGA CTACACAAGT GCCTTCTGTG
GCCCCTCAGA CATCATTAAA CATCAGCGGC CTCCTCGCAA ACAGGAACGT AGTCCTTGTG
GGACAGCCAG TGGTCAGATG TGCAGCCTCA GGAGATAACA GAGGATTTGC ACAGGACAGA 
AACCTTGTGA ATGCAGTGAA TGTGGTTGTG CCTTCCGTGA TCAATTATGT CTTATTTTAT 
GTTACAGAAA 



Seq ID NO: 214 Protein sequence 
Protein Accession #: FGENESH predicted



I I I I | |

HMGSHVAQAG LELLGSSDPP ALASESAGIT GLLLQESITL EDVAVDFTWE EWQLLGAAQK
DLYRDVMLEN YSNLVAVGYQ ASKPDALFKL EQGEQLWTIS DGIHSGACSG SPKVPFSIFS
SVPFTLQNCL HSNIWKVDHV LERLQSESLV NRRKPCHEHD AFENIVHCSK SQFLLGQNHD
IFDLRGKSLK SNLTLVNQSK GYEIKNSVEF TGNGDSFLHA NHERLHTAIK FPASQKLIST
KSQFISPKHQ KTRKLEKHHV CSECGKAFIK KSWLTDHQVM HTGSKPHRCS
MLTEHQRTHT GEKPYECPEC GKAFLKXSRL NII-Q THTGS KPYICSECGf IQKGKI 
HQRIHTGEKP YICNECGKGF IQKTCLIAHQ RFHTGKTPFV CSECGKSCSQ KSGLIKHQRI
HTGEKPFECS ECGKAFSTKQ KLIVHQRTHT GERPYGCNEC GKAFAYMSCL VKHKRIHTRE 
KQEAAKVENP PAERHSSLHT SDVMQEKNSA NGATTQVPSV APQTSLNISG LLANRNWLV 
GQPWRCAAS GDNRGFAQDR NLVNAVNVW PSVINYVLFY VTENP

Seq ID NO: 215 DNA sequence



Nucleic Acid Accession # : NM_032190.1 
Coding sequence: 502..1332

1 11 21 31 41 51

I I I I I I 

TCTTGGCCAT GTTAGCCATA ATATCCTGTG CAGTATGTTT TTCCTGTGCA 60 

CATATTGGGC ATATGTTCCC AAGCCCCCAG CAGTATGACC CATACTTTGG 120 

AGTGACACTC CTCCTAAGAT TTATCATGAT TAAGGAGCAT GGGCTCCAGG ACCCCTAACT 180

T AGACTCTCAG AATAATGTCA TTAATTATAC CGCTCCATTG 240 

T CACCACAAAG ACATCACTCA GCCATAGCTG TCTTACAGTT 300 

CAAGCTCACA CATGGTTGAG TCACTATGGG AAAATCATGT ACTTATTAAG TCTTGGTTAT 360 

T AACCAACCAT TCCTGGCCCA ATCGCCTTCA TTGTGCTGAC 420

CCACTGGCTA GAAAACAATC TATGTTAACT GGAGACATTG TGGATTGGGG A 
CAATTAGATG GAAAAGAAGA AAATCAGAAA TCGTGGCACA AACTTTGCTG G 
CAAGCTTTTA ATGCTTCTTC TTTATATAAC ACTGGGATCC AATCCCAGTC GGCCGCCCAG
ATTGCTTGGC ATGGAGCAGG CTTTAGCCCG CCTCTTC 

AAAGGACCAA TTCAAAAGAT GATATGGAAG GCAGCATTCC CATTTATGAA T 
TGGGTTGCCA TAATACTATC CAATAATAGC AATAGTAAGC AACACAGTCT TAATGTTACA 
A ATATCACCAC TCAATTTACA GTTTGTGTTT T 
A AGGACCAGCT CCAGGTAAAC AATACCCAAT T 
TTATATCACT GCATTAATCA TAGCACATTG CAAACACATA ATATCTCTAC TTTGATGATT 
TTAGGTTGCA TCCCTGGGCT ATGGATTCCT GTTAATCTGT C " 
ATTGCTTTAC ATTTTGTGAA ACTTCTTCTA ACTCAGTTTA CTCATTGTGT C 

TTAGGCATGA TAATTTTTGC TATTGTTTAC TTGGTCACAC TAATAATTTC TGTTGTGATG 1200 
3 CTTTGCATAG T TCT ATTCAA ACAGCTCAGT ATGTGGAGAA CTGGACACGC 1260
C AAGGGTGGCT ACTTGAGAAT AAAATTAACA CTGAGTTACA AACTGAAGTG 1320 
GCAGTGTTAT AATCCACGAT TCTATGGTTA GGGGAACAAG TACAAAGCTT GCAATTGCAG 1380
CAGTAATTGT GTTGTCATTT TAATCACACT CATATTTGTG TAACCAACTT AGAATATAAC 1440
CAAAGTGAGT ATCCATGGGA TCTTGTGAAA GCCCATTTGC AGGGAGCTTT CACATCCGAC 1500
ATCACCTTTG ATATTGGTGA ATTACAAAAC AAAATTCTTG ATTTAAATAA ACAAATTCCA 1560
GAGTTTCAGC CTTCTTTAGA ACACTCCACT GAATTCCAGC AAGGCCTGGA GAGCGTCAAC 1620 
CCTTGGACCT ATCTAAAGCA CCACATTAAC ATCTTATATA TAGTTCTTGG AATAATGTTG 1680 
TTTTGTCTCT GTCTTCTGTT CATAGTCTGT AAAATCGGAT GGACTGCCAA TCGGAGAATG 1740
AAACCTACCC AGCCTGGCCT TACATTCTTT CACTTAATAC ATAAACAAGA AGGGGGAAAT 1800 
GTTGGGAGCC AAAAAGGCCA AAGGGATGGT GACCAACTCA GCATTCCACT GGAGGCTACA 1860 
TGATCAAACA GCAAACTGTT TATCATGAAT ACAGAATGTG GGCAAACTCG CTTCTGTGCC 1920 
TGCCCAGAAG GTTTGCTGAG GGCCATCGCT CCCTGGCCCC GGCTCCTTGA GGTTATCTAC 1980 
TGGGACATCT AGAGCCTATT GTTCGAGGAA TGCAGTCTTG CAAGCCTACT CTGGACCGAG 2040 
CAGCTGACCT CTTCTTCCAC ACCCCTTCTC ACTATCTCTT TTGCCTAATA AATATGGAGG 2100 
GCTGTGTAAA GCTCAGGGCC CTTGTCCACT AGAGGCAAGG TGTCCCCTGA CCCTTCTTCC 2160
AAACAT 



I I I I I I 

MLTGDIVDWG PKGQLDGKEE NQKSWHKLCW HWWQAFNASS LYNTGIQSQS AAQIAWHGAG 
FSPPLPQWHY LGRKGPIQKM IWKAAFPFMN GNIWVAIILS NNSNSKQHSL NVTFVKNITT 
QFTVCVFNPY VFLAAKKDQL QVNNTQLTCK SCQLYHCINH STLQTHNIST LMILGCIPGL 
WIPVNLSEPW AATIALHFVK LLLTQFTHCV ERGLGMIIFA IVYLVTLIIS WMSSVALHS
SIQTAQYVEN WTRTVNQGWL LENKINTEM TEVAVL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..1566



,-.,-.^ivauu_C CCAAATCCAC TTCCTCCCTC TTCAGGTTAT GTTTTTTGCT C^.„„ - 
CAGAACCTGT GGGTTGAAGA GCAAATTCAA TGCAAAAACA TATTGGGCAT ATGTTCCCAA 120
TCCCCCAGCA GTATGGCCTA TACTTTGGAG CTCACTCCTC CTGAGATTTA TCACGATCAG 



A ATTATACCAC TCCACTGGAA GGACTCCCTT 
TCGCTCAGCC ATAGCTGTCT TGCAATTCAA GCTCAAACAT GGTTGAGTCA 

ATTATGTACT TATTAGGTCT TGGTTCTATT AATGTAACTG GTGTGCTAAC CAATCATTCC 420

CAGTCCAGTC ACCCTAATTG TGCTGATTAT ACAGAATGGA TTCCATTCAA TAGTTCCTAC 480 

CCCACTCTGT GGACCCAGTG TCTTGATCCA CTGGCTAGTA AACAATATAT GTCAACTGAA 540 

GACACTGTGG ATTGGGAACC TAAAGGTCAA TTAGATGGAA AAGGTGAAAG TCAGAAATCA 600 

TGGCACAAAC TTCACTGGCA TTGGCGGCAA GCTTTTAATG CTTCTTCTTT ATACAACAGC 660 

AGAATCCAAT CCCAGTCTGC TGCTCAGATT GCTTGGCATG GAGCAGGCTT TAGCCCACCT 720

T TGCATTATCT GGGGAGGAAA GGACCAATTC AAGAAACTAT ATGGAAGGCA 780 

:GAATGG CAACATCTGG ATTGGAACAC TGTCTAATAA TAGCAATAGT 840 

AAGCAACACA GTCTTAATGT TGCATTTGTA AAGAATATCA CCACTCAGTT TACAGTTTGT 900 
GTTTTTAATC CTTATGCCTT TTTGGCAGCT AAGAAGAACC AGCTTCAGGT Q 
ACACGCACAG CTGACCAAGC GAGGCTACTT CAGAATAAAA TTAACACTGA GTTACAAACT
GAAGTGGCAA TGTTGAAATC CATGGTTCTG TGGTTAGGAG AACAGGTACA AAGCTTGCAG
TTGCAGCAGC AATTGCGTCA TCATTTTAAT CACATTCATA TTTGCGTAAC TAACTCAGAA
TATAACCAAA GTGAGTATCC GTGGGACCTT GTGAAAGCCC ATTTGCAAGG AGCTTTCACA
TCCAACATCA CCTTTGATAT TGGTGAATTA CAAAACAAAA TTATTGATTT AAATAGGCAA
ACTCAAGAAT TTCAGCCTTC TTTAGAAGAC TGGACCGAAT TCCAGGAAGG CCTGGAGAGC
CTCAACCCTT GGACCTATCT AAGGCACCAC ATTAACATCT TATATGTAGT TCTTGGAATA
ATGTTGTTTT GTCTCTGTCT TCGGTTTATA GTCTGTAAAA TCGGATGGAC CACCAATTGG 
AGAATGAGAG CCTCCCAGCC CAGCCTTACA TTCTTTCAAT TAATACATAA ACAGAAAGGG 
GGATATGCAG GGAGCCAAAG GCCTGTGGGA CGTGACCAAC TCAGCATTCT GCTGGAGGCT 
ATATGA 

Seq ID NO: 218 Protein sequence 



I I I I I I 

MVNPKSTSSL FRLCFLLLRS QNLWVEEQIQ CKNILGICSQ SPSSMAYTLE LTPPEIYHDQ 
GEWAPGPLTP RDIEKLDSQN NVINYTTPLE GLPLFITTKT SLSHSCLAIQ AQTWLSHYGK
IMYLLGLGSI NVTGVLTNHS QSSHPNCADY TEWIPFNSSY PTLWTQCLDP LASKQYMSTE 
DTVDWEPKGQ LDGKGESQKS WHKLHWHWRQ AFNASSLYNS RIQSQSAAQI AWHGAGFSPP 
LPQLHYLGRK GPIQETIWKA ALPFMNGNIW IGTLSNNSNS KQHSLNVAFV KNITTQFTVC 
VFNPYAFLAA KKNQLQVENW TRTADQARLL QNKINTELQT EVAKLKSMVL WLGEQVQSLQ 
LQQQLRHHFN HIHICVTNSE YNQSEYPWDL VKAHLQGAFT SNITFDIGEL QNKIIDLNRQ 
TQEFQPSLED WTEFQEGLES LNPWTYLRHH INILYWLGI MLFCLCLRFI VCKIGWTTNW 
RMRASQPSLT FFQLIHKQKG GYAGSQRPVG RDQLSILLEA I 




ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 
CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 
GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA A 
CGGCCCACTG CGCTGGACAC CTTCTCTGGT ACGTACGTTC AATCCCCCCT G 

G GGGCTGTGCA CCGGGGAGCT GGGGCGGGCG TCTCGGCGGG AGGGCGCAGA 



\GATCCCCA CGCACAACCC CCAGGCGCCT 
GTGCTGCTGG TGGGCACCCA GGCCGACCTG AGGGACGATG TCAACGTACT AATTCAGCTG 
GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 
ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA 
TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 



I I I I I I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 
RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 
LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSWQPSS 
FQNITEKWLP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 
IRACCYLECS ALTQKNLKEV FDSAILSAIE H 

Seq ID NO: 221 DNA sequence 



I I I I I 

C GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 
CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT C- 
GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCI 
CGGCCCACTG CGCTGGACAC CTTCTCTGTG CAAGTCCTGG T 
ATTGAGCTCT GGGACACAGC GGGACAGGAG GATTTTGACC GACTTCGTTC C 
CCGGATACCG ATGTCTTCCT GGCGTGCTTC AGCGTGGTGC AGCCCAGCTC CTTTCAAAAC 

GTGGGCACCC AGGCCGACCT GAGGGACGAT GTCAACGTAC TAATTCAGCT GGACCAGGGG 
GGCCGGGAGG GCCCCGTGCC CCAACCCCAG GCTCAGGGTC TGGCCGAGAA GATCCGAGCC 
TGCTGCTACC TTGAGTGCTC AGCCTTGACG CAGAAGAACT TGAAGGAAGT ATTTGACTCG 
GCTATTCTCA GTGCCATTGA GCACAAAGCC CGGCTGGAGA AGAAACTGAA TGCCAAAGGT 
GTGCGCACCC TCTCCCGCTG CCGCTGGAAG AAGTTCTTCT GCTTCGTTTG A 
MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY
RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVFLACF SWQPSSFQN 
ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV 






Coding sequence: 1..1161

1 11 21 31 41 51 

I I I I I I 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTCTG 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA G 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA G 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 

C ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 

A TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA G 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA I 



TCGGGCACGC AGAATTGTGA AGTGTACCGC T 
AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT A 
TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 
CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 
GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 
ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140 
CAGTTGGGAT GTTGCATCGA G 



1 11 21 31 41 51 

I I I I I I 

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW I

SSRIFLFNLA VADFLLIICL PFVMDYYVRR SDWKFGDIPC 
VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVGLTV K 
SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI T 
VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 
FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP
TSNNHSKKGH CHQEPASLEK QLGCCIE

Seq ID NO: 225 DNA sequence 



1 11 21 31 41 51 

I I I I I I 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGT CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTC 120 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTGGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCCTTCCTGA TGGACAACTA TGTGAGGCGT TGGGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGATGC TCTTCATGTT GGCTATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATCGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTATTGG CCTGACAGTC 480

CACCTCCTGA AGAAGAAGAT GCCGATCCAG AATGGCGGTG CAAATTTGTG CAGCAGCTTC 540 

AGCATCTGCC ATACCTTCCA GTGGCACGAA GCCATGTTCC TCCTGGAGTT CTTCCTGCCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 



I I I I I I 

MNRHHLQDHF LEIDKKNCCV FRDDFIVKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 
SSRIFLFNLA VADFLLIICL PFLMDNYVRR WDWKFGDIPC RLMLFMLAMN Ri 
VAVDRYFRW HPHHALNKIS NRTAAIISCL LWGITIGLTV HLLKKKKPIQ N' 
SICHTFQWHE AMFLLEFFLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI T 
VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 
FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 
Seq ID NO: 227 DNA sequence
Nucleic Acid Accession #: NM_006018
Coding sequence: 61..1224



TTTTCCTGTT 

TCAGACTGGA 



GTGGCGGTAG ACAGGTATTT 
AATTGGACAG 

A AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG 



CTGGGCATCA T 

GACCGGCATG CCAAGATCAA GAGAGCCATC A 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT C 

VTTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 
T ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 



T TAGGGAAACT GTGGCAGATG AGTGGGAGAC TGGTTGCAAG 1320 

GTGTGACCAC AGGAATCCTG GAGGAACAGA GAGTAAAGCT TCTAGGCATC TGAAACTTGC 1380 

TTCATCTCTG ACGCTCGCAG GACTGAAGAT GGGCAAATTG TAGGCGTTTC TGCTGAGCAG 1440 

AGTTGGAGCC AGAGATCTAC TTGTGACTTG TTGGCCTTCT TCCCfiCATCT GCCTCAGACT 1500 

GGGGGGGGCT CAGCTCCTCG GGTGATATCT AGCCTGCTTG TGAGCICTAG CAGGGATAAG 1560 

GAGAGCTGAG ATTGGAGGGA ATTGTGTTGC TCCTGGAGGA AGCCCAGGCA TCATTAAACA 1620 

AGCCAGTAGG TCACCTGGCT TCCGTGGACC AATTCATCTT TCAGACAAGC TTTAGAGAAA 1680 

TGGACTCAGG GAAGAGACTC ACATGCTTTG GTTAGTATCT GTGTTTCCGG TGGGTGTAAT 1740 

AGGGGATTAG CCCCAGAAGG GACTGAGCTA AACAGTGTTA TTATGGGAAA GGAAATGGCA 1800 

TTGCTGCTTT CAACCAGCGA CTAATGCAAT CCATTCCTCT CTTGTTTATA GTAATCTAAG 1860 

GGTTGAGCAG TTAAAACGGC TTCAGGATAG AAAGCTGTTT CCCACCTGTT TCGTTTTACC 1920 

ATTAAAAGCG AAACGTGCCT CTCCCCCACG GGTAGACGGG GTGCACGTTC CTCCTGGTTC 1980 

CTTCGCTTGT GTTTCTGTAC TTACCAAAAA TCTACCACTT CAATAAATTT TGATAGGAGA 2040 

Seq ID NO: 228 Protein sequence



I I I I I I 

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 
SSRIFLFNLA VADFLLIICL PFVMDYYVRR SDWNFGDIPC RLVLFMFAMN RQGSIIFLTV 
VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 
SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF
VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS
FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NO: 229 DNA sequence



C AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 
ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGC 
ATTACCTACA CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC A 
GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCC 
70 CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC T 
A GTAACCAGAC CACCCTTCCA GCAACTTTAT CGATAGCACT G 
C AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC A 
AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 
CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT C 
/ J GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTT1 

TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTGTGGCAC CCGAAAATCC 
T TGAATTTTCA GGGCGGATTT GTGAATCTCA CATTTACCAA G 



A GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 
T TTGATTTTGA AGATGACCAC TTTGGAAATG T 

TACACAATTG T 

GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT CTAATTGTTG 1320
CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT CATCCCTTCC AGGATGGATG 1380
TTGGGAAATT CCCTCAGAGT GTGGGTCCTT CAAACAATGT AAACCACCAT CTTCTATTCA
AATGAAGTGA GTCATGTGTG ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT 
GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA GTTTCCTTTA GAATATTTTA 
GCCACTCAAA GTCAACATTT GAGATATGTT GAATTAACAT AATATATGTA AAGTAGAATA 
A AGGGTCAATT GTAACTAATA CTACTGTGTG TGCATTGAAG 
ICAAAGC CTTTGCTTTG TTATCAAATG GACTTTCAGT 
T ATGGTTTCAT GTAACATACA TATTCCTGGT GTAGCACTTA 
ACTCCTTTTC CACTTTAAAT TTGTTTTTGT TTTTTGAGAC GGAGTTTCAC TCTTGTCACC 
rCTCGGC TTATGGCAAC CTCCGCCTCC CGGGTTCAAG 
T GCTTCAGCTT CCCGAGTAGC TGGGATTACA GGCACACACT ACCACGCCTG 
T GTATTTTTAT TATAGACGGG TTTCACCATG TTGGCCAGAC TGGTCTTGAA 
T CAGGTGAT CC ACCCACCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCATG 
AGCCATTGCG CCCGGCCTTA AATGTTTTTT TTAATCATCA AAAAGAACAA CATATCTCAG 
GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA AATCAGCTTA TATTTTTTAT 



C ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAAGT GAATGAGTAA 
ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG GCTAACTGAT GGCAGAGCCA 
GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 
CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA CACCTCATAC C 
TCACCTTACA GGGAAATGGG TTTATCCAGG ATCATGAGAC ATTAGGC 

^CAAAAT AGCCTATCCT TAATAAATCC TCCACTCTCT GGAAGGAGAC 
T TGTAAAACAT TAGTCAGTTG CTCATTTTTA TGGGATTGCT T 
GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTGA T 
ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC ATCTTGTGGT AACTTGCTGC 2880 
TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 2940 
GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATGTTAAC 3000
AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 3060 
CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC AATAAAAACT TACTTTGG 

Seq ID NO: 230 Protein sequence 
Protein Accession #: NP_0SS213.1

1 11 21 31 41 51 

I I I I I I 

MPRQLSAAAA LFASLAVILH DGSQMEAKAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP 60 
IIQTLAARFMD GHITFQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PKNSHTAPPV 120
TEVTVGPSLA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIAL1IKST 180
TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTLAPQP SSVKTGIYQV LNGSRLCIKA 240
EMGIQLIVQD KESVFSPRRY FNIDPNATQA SGNCGTRKSN LLLNFQGGFV NLTFTKDEES 300
YYISEVGAYL TVSDPETVYQ GIKHAWMFQ TAVGHSFKCV SEQSLQLSAH LQVKTTDVQL 360
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI WGLCLMGMG VYKIRLRCQS SGYQRI 

Seq ID NO: 231 DNA sequence 



1 11 21 31 41 51 

I I I I I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 
CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG G 
ITAACT GTGACTACTG AAATGACAAG AATTCTACAG T 
GACTTTCAAC TTTTGTACAG TT AT GTGAAG G 
GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 
GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA T 
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT A

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT T



Seq I 
1 11 21 31 41 51

I I I I I I 

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60

CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120 

GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 

CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240 

T CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

G ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420

CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480

CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540 
TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 




Protein Accession #: NP_000S68.1



AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE 



TCCCGAGGAC CAGGGGATGC GAAGGGATTG CCCCCTGTGG GTCACTTTCT CAGTCATTTT 
GAGCTCAGCC TAATCAAAGA CTGAGGTTAT GAAGTCGATC CTAGATGGCC TTGCAGATAC 
CACCTTCCGC ACCATCACCA CTGACCTCCT GTACGTGGGC TCAAATGACA TTCAGTACGA 
AGACATCAAA GGTGACATGG CATCCAAATT AGGGTACTTC CCACAGAAAT TCCCTTTAAC 
TTCCTTTAGG GGAAGTCCCT TCCAAGAGAA GATGACTGCG GGAGACAACC CCCAGCTAGT 



GCCTGATGTG GACCATAGCC ATTGTGATCG CCGTGCTGCC 
TGGAACTGCG AGAAACTGCA ATCTGTTTGC TCAGACATTT TCCCACACAT 
TACCTGATGT TCTGGATCGG GGTCACCAGC GTACTGCTTC TGTTCATCGT



GCCAGACCAA GCCCGCATGG ACATTAGGTT AGCCAAGACC CTGGTCCTGA TCCTGGTGGT 1200



TGTTTGCATT CTGCAGTATG CTCTGCCTGC T 1 
CTCTGAGGAG TAAGGACCTG CGACACGCTT TCCGGAGCAT
CTGCGCAGCC T 



Protein Access 



I I I I I I

MKSILDGLAD TTFRTITTDL LYVGSNDIQY EDIKGDMASK LGYFPQKFPL TSFRGSPFQE 
KMTAGDNPQL VPADQVNITE FYNKSLSSFK ENEENIQCGE NFMDIECFMV LNPSQQLAIA 
VLSLTLGTFT VLENLLVLCV ILHSRSLRCR PSYHFIGSLA VADLLGSVIF VYSFIDFHVF 
HRKDSRNVFL FKLGGVTASF TASVGSLFLT AIDRYISIHR PIAYKRIVTR PKAWAFCLM 
WTIAIVIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK 
AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDIR LAKTLVLILV VLIICWGPLL 
AIMVYDVFGK MNKLIKTVFA FCSMLCLLNS TVNPIIYALR SKDLRHAFRS MFPSCEGTAQ
PLDNSMGDSD CLHKHANNAA SVHRAAESCI KSTVKIAKVT MSVSTDTSAE AL 

Seq ID NO: 237 DNA sequence



Nucleic Acid Accession #:



I I I I I I 

GATTGCCCCC TGTGGGTCAC TTTCTCAGTC ATTTTGAGCT CAGCCTAATC " ' ' 
GTTATGAAGT CGATCCTAGA TGGCCTTGCA GATACCACCT TCCGCACCAT 
CTCCTGTACG TGGGCTCAAA TGACATTCAG TACGAAGACA TCAAAGGTGA C 

T ACTTCCCACA GAAATTCCCT TTAACTTCCT TTAGGGGAAG TCCCTTCCAA 
A CTGCGGGAGA CAACCCCCAG CTAGTCCCAG C 
T CTCGTCCTTC A 



GTCATCCTCC A 
GCGGTGGCAG ACCTCCTGGG G 

TTCCACCGCA AAGATAGCCG CAACGTGTTT CTGTTCAAAC TGGGTGGGGT CACGGCCTCC 
CTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATCG A 

AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG CCGTGGTGGC GTTTTGC 
ATGTGGACCA TAGCCATTGC GATCGCCGTG C 
CTGCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG A 

ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG CGTACATGTA TATTCTCTGG 960 

AAGGCTCACA GCCACGCCGT CCGCATGATT CAGCGTGGCA CCCAGAAGAG CATCATCATC 1020 

CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG CATGGACATT 1080 

AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG GGGCCCTTTG 1140 

CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA GACGGTGTTT 1200 

" ~ ~A GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT CTATGCTCTG 1260 

^ ACCTGCGACA CGCTTTCCGG AGCATGTTTC CCTCTTGTGA AGGCACTGCG 1320

CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC AAACAATGCA 1380

GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT TGCCAAGGTA 1440 

ACCATGTCTG TGTCCACAGA CACGTCTGCC GAGGCTCTGT GAGCCTGATG CCTCCCTGGC 1500 

AGCACAGGAA AAGAATTTTT TTTTTTAAGC TCAAAATCTA GAAGAGTCTA TTGTCTCCTT 1560

GGTTATATTT TTTTAACTTT ACCATGCTCA ATGAAAAGGT GATTGTCACC ATGATCACTT 1620

ATCAGTTTGC TAATGTTTCC ATAGTTTAGG TACTCAAACT CCATTCTCCA GGGGTTTACA 1680 

GTGAAGAAAG CCTGTTGTTT AAGTGACTGA ACGATCCTTC AAAGTCTCAA TGAAATAGGA 1740

GGGAAACCTT TGGCTACACA ATTGGAAGTC TAAGAACCCA TGGAAAAATG CCATCAAATG 1800 

AATAATGCCT TTGTAACCAC AACTTTCACT ATAATGTGAA ATGTAACTGT CCGTAGTATC 1860 

AGAGATGTCC ATTTTTACAA GTTATAGTAC TAGAGATATT TTGTAAAATG TATTATGTCC 1920 

TGTGAGATGT GTATCAGTGT TTATGTGCTA TTAATATTTG TTTAGTTCAG CAAAACTGAA 1980 

AGGTAGACTT TTATGAGAAC AATGGACAAG CAGTGGATAC GTGTCAATGT GTGCACTTTT 2040

TTTCTATATT ATTGCCCATG ATATAACTTT AGAAATAAAC CTTAATATTT CTTCAAATAT 2100 

CTCTATTTAA TTTTGACACT GAAATAACCG TAAAGGTTTA TTTTTCTGTT ACCTCAACAA 2160

GAAGAATTTG AAGACTTCAA AATATTGAGC AGAATTCATT CATACTTAAA AATTTATTAG 2220 

CCCTGCATTT TCATAGGAAG ACACATTATC TTCTGGACTA TAGCTGTTCT AATGGATTAT 2280 

AATCAGAATG GAAGAGAGAA AGCATATTGA CTTTTTTTGA GCGACATCTC TGACTTTCTT 2340 

TAGTCTTTAG CTATTACTGG ATCTCTTAAG ACAGCATGTG TTAATCTTAA TGTATATCGT 2400 

TATCACTGTG CAGTTGCTGT TTACTTGAAT AGTATTGTGT TCCTATATTC CAGGTTTAAG 2460

TAGATTTCAT GCCTGGGTGG CCAAACAACA GTCTTCATTT TTTTTAATTG AAAAGAAGTA 2520 

GTGTCTGGAT CAGTAAAATT ATACTGTGTG TGAGTGTGAA TATAAATGTG TGTATGTGTG 2580 

TTTCTGTCCG TAACTGTTAC AGTAATGTCA TAAAGTGAGA AAACTGTGAC CAAGTATAAA 2640 

CTTTTACCAC TTGCTGCACT CTTGCACATG GATTCAGTTT CTAAAATTGA GTTCTTCCTG 2700 

TAATCTTGTT GATAAAAATA CTGACTCCAA CCATTCAAAA ATTTCACCCC ATCCCTCCTT 2760 

AAGAGATTGG ATCAAGTATT ACTAAATTGA CCTTTAGGTA TTACACAAGA CCAGTGCTTA 2820 

GCAAAAAATA ATGACAGGCA TCCAAGGAAG GGATGTATTT GTAGTGTTAT TGCCAGGAAA 2880 

GGAGAGTACT TTGGTTTCTG AGCACCGAAT ATTGAGCAAT ATGTCAGTCA CTAAAAGGAA 2940 

GACAGTTCTA CAGAAAAACA AATGGTAACA TTTTTCAATA GCGTGTGTAG ATAGTATGCA 3000 
CTATATACAT CACGTTAAAG TAGGACTATC ACACCCAGCC CATGTGGCTA A, 
ATCAGACAGT GGATGAGACA CACAACGGCA GTGAAGAACC G 

CTAGCTATGC TGTATCTGTG CTTTGCCCAC ATGCCCTTGG TGACAGCTGA GCACCCAGCT 



AATGGCGCCA TGGGTGCTTG TTGGGCCTTT TTCCAGTAAG GAATGATATT GCTGAAGAAT 
CTACTTAACC CTGACAAATT TTAATTATAA TCTCTTCTTA T. 
TACAAGGCCC CAAGGTTTAC A 
I CTCGAGATAC C 

3ATGTGG CTTATGAGAT ACTTTGCATT GTCTGTCTGC A 
TCTGCCTGCT GGCTCCCTTA CTTTACCTCT C 
TAGAGGATTA GTAAGATCTC TTTCTAAAGA CAGGAGAGAT TATTTACAAG AAGAACTCAC 3720 
CAGGGTTTAG TTTGCATTTA AGAATTGCCA GTCTTTTGTC CT SCAT CATC TTGAACATTA 3780 
ATCCACATGT TTCAGAGCTC ACCAGGCAGT ACCAATGCTC TTTTCACAGC TATGAAGAGC 3840 
TAGAGAAATT CTTGTTATGG TAGAAAAATT TCACGGTTCA TTTTTGAAAC TGCATTTGTG 3900 
CGTATGCAGT GTAGATTTTA TAGTGTGTTG TGCTTTCAAG ATCTAAATCA TATATAATAA 3960 
ATTAAGGGAC AATGGGGCTG ACAGCACTAA ACTTGGTGCT TATTGATATT CTAAGAAATA 4020

TCTGTGAAAT ATCATCACGT ATGTTATACA ACCTTCATTT AAAAAGGTTT AAAACTAGTT 4080

AGATTCACTT TGACACTTTT CATATCATTT CTTAACCCAA GTGACGAAAA CATTCTCCCC 4140

AATGAATATA CTCATTAGAA TTACCATTTG TTAATATCAC TCATTAATTA ACCCCATAAT 4200

TAGATCCATT AATTTAAATG ATTTAAATTT AAGTAAGTTT TATAAGCTCT GACATCAGAG 4260

GTATCTTACT TTCCTCTGAG GATGATGTAC TTGCCCTGAC CATGCATTTT ACCATCACAC 4320

ATGTTCAGAA AGGGCCAAAT TCCCAACCTG CTCATTTTTT TTTTTATCAG AGTCATGATG 4380

AATCAGTCCT AGAATGTTTC ATTTGCACAA GTAGGGCTGC CTCCAAGAGG AACCTCTGAT 4440 

TTATTTTGTA TGAAATATAT GTGAAAGGAT ATGAATCTGA GAGATGCTGT AGACATCTGT 4500 

CCTACACTTG AGATGATTTC CAAGCCTCTC TGGCACTTTG AGTTAAGTCT ATCTGGTATT 4560 

AAATGCCAAG GACCTTTTGC TGCCTAAATC CACTCTGCAG GAAATAGGCC CAACCACCAG 4620 

ATGAGAATTA GGCCCTGGAT GAGTAGCGCT ATAGTTACTG TCCTGTTGAT TAATTTCTGC 4680 






CTCCTAAATG CTGGGCAATG GCTATTAACA 

TTGCACTGGC C 



\ TTTTCGGGTG G 

I TAAAAGCTAT CTTCATGTAA ATTGACTGTA CTAGGCCTAC TGGGGATCAG 5040 
C TTTTCTTGTA TCTGGATTCA A 
AGCGGGAAAC ATGACTCTTT ATTGTCTGTA AATCTAA 

GAATATTGTA TTGTTAGATG TTTGTTGAGC TGGTAACATC GTTGCAACCA C 
TTCGTTAGTA ATCTGTATAA TACTTTGTAT ACAAGTACTG GTAAGATTGT TATTAAATGT 5280
AGCTTCAGTC ATTAAATTAC TATAGCAAAG TAGTACTTCT TCTGTAATAT TTACAATGTA 5340 
TTAAGCCCAC AGTATATTTT ATTTCAATGT AATTAAACTG TTAACTTATT CAAAGAGAAA 5400 
ACATCTCATC ATGTCTATTG TCCAAAGTTA CCTGGAATCA AATAAAAATT CTAGATTACC 5460 
ATGAAGAACA TA 

Seq ID NO: 238 Protein sequence 
MKSILDGLAD TTFRTITTDL LYVGSNDIQY EDIKGDMASK LGYFPQKFPL TSI
KMTAGDNPQL VPADQVNITE FYNKSLSSFK ENEENIQCGE NFMDIECFMV LNPSQQLAIA
VLSLTLGTFT VLENLLVLCV ILHSRSLRCR PSYHFIGSLA VADLLGSVIF VYSFIDFHVF
HRKDSRNVFL FKLGGVTASL TASVGSLFLT AIDRYVSIHR PLAYKRIVTR PKAWAFCLM
WTIAIAIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK
AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDIR LAKTLVLILV VLIICWGPLL
AIMVYDVFGK MNKLIKTVFA FCSMLCLLNS TVNPIIYALR SKDLRHAFRS MFPSCEGTAQ
PLDNSMGDSD CLHKHANNAA SVHRAAESCI KSTVKIAKVT MSVSTDTSAE AL 



Seq ID NO: 239 DNA sequence
Coding sequence: 17..1252

1 11 21 31 41 51 

I I I I I I 

ATGAAGTCGA TCCTAGATGG CCTTGCAGAT ACCACCTTCC GCACCATCAC CACTGACCTC 
CTGTACGTGG GCTCAAATGA CATTCAGTAC GAAGACATCA AAGGAGAATG AGGAGAACAT 
CCAGTGTGGG GAGAACTTCA TGGACATAGA GTGTTTCATG GTCCTGAACC CCAGCCAGCA 
GCTGGCCATT GCAGTCCTGT CCCTCACGCT GGGCACCTTC ACGGTCCTGG AGAACCTCCT 
GGTGCTGTGC GTCATCCTCC ACTCCCGCAG CCTCCGCTGC AGGCCTTCCT ACCACTTCAT 
CGGCAGCCTG GCGGTGGCAG ACCTCCTGGG G 



, usiwjuw. fti^uvjuui^n ^jitio.iC ATCGTGTATG ^j^w.^*,. 

TATTCTCTGG AAGGCTCACA GCCACGCCGT CCGCATGATT CAGCGTGGCA CCCAGAAGAG
CATCATCATC CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG 
CATGGACATT AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG 
GGGCCCTCTG CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA 
GACGGTGTTT GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT 
CTATGCTCTG AGGAGTAAGG ACCTGCGACA CGCTTTCCGG AGCATGTTTC CCTCTTGTGA
AGGCACTGCG CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC 
AAACAATGCA GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT 
TGCCAAGGTA ACCATGTCTG TGTCCACAGA CACGTCTGCC GAGGCTCTGT GA 

Seq ID NO: 240 Protein sequence



I I I I I I 

MALQIPPSAP SPLTSCTWAQ MTFSTKTSKE NEENIQCGEN FMDIECFMVL NPSQQLAIAV
LSLTLGTFTV LENLLVLCVI LHSRSLRCRP SYHFIGSLAV ADLLGSVIFV YSFIDFHVFH 
RKDSRNVFLF KLGGVTASFT ASVGSLFLTA IDRYISIHRP LAYKRIVTRP KAWAFCLMW 
TIAIVIAVLP LLGWNCEKLQ SVCSDIFPHI DETYLMFWIG VTSVLLLFIV YAYKYILWKA 
HSHAVRMIQR GTQKSIIIHT SEDGKVQVTR PDQARMDIRL AKTLVLILW LIICWGPLLA 
IMVYDVFGKM NKLIKTVFAF CSMLCLLNST VNPIIYALRS KDLRHAFRSM FPSCEGTAQP 
LDNSMGDSDC LHKHANNAAS VHRAAESCIK STVKIAKVTM ST 
Coding sequence: 82..1194

| | | | | |

GTAGACTGTC CATGGCCTGA ACATTTTCCG AAAATCATTT TGAGCAAAAT ATCTGTTTAA 
TAACAAGATA ACCACATCAA GATGGTTGGA AAGCTGAAGC AGAACTTACT ATTGGCATGT 
CTGGTGATTA GTTCTGTGAC TGTGTTTTAC CTGGGCCAGC ATGCCATGGA ATGCCATCAC 
CGGATAGAGG AACGTAGCCA GCCAGTCAAA TTGGAGAGCA CAAGGACCAC TGTGAGAACT 
GGCCTGGACC TCAAAGCCAA CAAAACCTTT GCCTATCACA AAGATATGCC TTTAATATTT 
ATTGGRGGTG TGCCTCGGAG TGGAACCACA CTCATGAGGG CCATGCTGGA CGCACATCCT 
GACATTCGCT GTGGAGAGGA AACCAGGGTC ATTCCCCGAA TCCTGGCCCT GAAGCAGATG 
TGGTCACGGT CAAGTAAAGA GAAGATCCGC CTGGATGAGG CTGGTGTTAC TGATGAAGTG 
CTGGATTCTG CCATGCAAGC CTTCTTACTA GAAATTATCG TTAAGCATGG GGAGCCAGCC 
CCTTATTTAT GTAATAAAGA TCCTTTTGCC CTGAAATCTT TAACTTACCT TTCTAGGTTA 
TTCCCCAATG CCAAATTTCT CCTGATGGTC CGAGATGGCC GGGCATCAGT ACATTCAATG 
ATTTCTCGAA AAGTTACTAT AGCTGGATTT GATCTGAACA GCTATAGGGA CTGTTTGACA 
AAGTGGAATC GTGCTATAGA GACCATGTAT AACCAGTGTA TGGAGGTTGG TTATAAAAAG 
TGCATGTTGG TTCACTATGA ACAACTTGTC TTACATCCTG AACGGTGGAT GAGAACACTC 
TTAAAGTTCC TCCAGATTCC ATGGAACCAC TCAGTATTGC ACCATGAAGA GATGATTGGG 
AAAGCTGGGG GAGTGTCTCT GTCAAAAGTG GAGAGATCTA CAGACCAAGT AATCAAGCCA 
GTCAATGTAG GAGCTCTATC AAAATGGGTT GGGAAGATAC CGCCAGATGT TTTACAAGAC
ATGGCAGTGA TTGCTCCTAT GCTTGCCAAG CTTGGATATG ACCCATATGC CAACCCACCT 
AACTACGGAA AACCTGATCC CAAAATTATT GAAAACACTC GAAGGGTCTA TAAGGGAGAA 
TTCCAACTAC CTGACTTTCT TAAAGAAAAA CCACAGACTG AGCAAGTGGA GTAGCAGAAC 
CAGGAGCCTC TTCCATACAT GAGGAAAGAT TGCTGCCTTT TCAGCAGAAG GGAAATTCCT 
AGGATTGGCT GTCCCCTGCC AAGCTTGGTG GAGCGTCTGC ACCTTGGCTG CGCCGCCTGT 
GCATTTGCCA GTTTCCTCCC ACTGAGAGGA TGGAGGTGTC CGCACAGCTT TGGGCCTCGT 
GAGGGATCTG CCTCCTGAGC AAAGAGCTCT TGATCCCGAT TTCATGCACA GCCCTGCAGT 
AAGGAGCCCA GAAGGAACAT GTGTTTCCTG TTAAAACTCC TCTTGTTCTC TTTTCTTACA 
TTATGACGTT TGTTTTCAAG GAGAGGGTTT AAAAATGGGA TCCTGTAAGC AGACTTGGGC 
AGTCTCCTTT TGAAATAGGT TGTCTGTACA TGTTCTAATG TTTTGTAGAA CACGTGTGCC 



I I I I 

MVGKLKQNLL LACLVISSVT VFYLGQHAME CHHRIEERSQ PVKLESTRTT VRTGLDLKAN
KTFAYHKDMP LIFIGCVPRS GTTLMRAMLD AHPDIRCGEE TRVIPRILAL KQMWSRSSKE
" T DEVLDSAMQA FLLEIIVKHG EPAPYLCNKD PFALKSLTYL SRLFPNAKFL

7 HSMISRKVTI AGFDLNSYRD CLTKWNRAIE TMYNQCMEVG YKKCMLVHYE
>I RTLLKFLQIP WNHSVLHHEE MIGKAGGVSL SKVERSTDQV IKPVNVGALS
KWVGKIPPDV LQDMAVIAPM LAKLGYDPYA NPPNYGKPDP KIIENTRRVY KGEFQLPDFL
KEKPQTEQVE 

Seq i: 



ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG



CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 
TGCACGGACT GCGGCTGGGG GCTGGCGCGT CGCGGCCTGG CTGAGCACGC GCACCTGGCG 
CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 
GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 
GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 
GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 
GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 
AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTCG 
GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTCCTC CTACGCCTTC 
CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT
GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 
GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 
TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 
GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC
TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 
CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 
GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 
CCCCGTGGAC CCGGCCCCAC CCCGAATACC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 
TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 
CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 
GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 
CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC
TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 
CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 
GGCCGGTTCC CCCGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 
CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 
GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC O 
AGCCTGTCTC GGCCGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC C 
TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGACCT GCGTTTCGCG GCGGCGGCGG 1860

CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920 

CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 1980 

CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCCGCCTGG 2040 

CGCTGGCGCT ACGCCCCCGG GCCCCTGCCG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 2100

TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCC-GCCGCGG CGCGACGCCG 2160 

AACCCGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGCGC GCGGCGGCTG TACGTGAGCT 2220 

TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 
TGGTGCTGCG 

CGGGCAGGGA CGCGGGCCCA ACAATAAATG 



Seq i: 



I I I I I I 

MPPPQQGPCG HHLLLLLALL LPSLPLTRAP VPPGPAAALL QALGLRDEPQ GAI 
VMWRLFRRRD PQETRSGSRR TSPGVTLQPC HVEELGVAGN IVRHIPDRGA PTRASEPVSA 
AGHCPEWTVV FDLSAVEPAE RPSRARLELR FAAAAAAAPE GGWELSVAQA GQGAGADPGP
VLLRQLVPAL GPPVRAELLG AAWARNASWP RSLRLALALR PRAPAACARL AEASLLLVTL
DPRLCHPLAR PRRDAEPVLG GGPGGACRAR RLYVSFREVG WHRWVIAPRG FLANYCQGQC
ALPVALSGSG GPPALNHAVL RALMHAAAPG AADLPCCVPA RLSPISVLFF DNSDNVVLRQ
YEDMVVDECG CR



Nucleic Acid Accession #: NM_021267.1
Coding sequence: 17..1125

1 11 21 31 41 51 

I I I I I I 

ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

CGAGCGGGCG GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGGGGCC CGAGCCCATG 120

CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGCGCGT CGCGGCCTGG CTGAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 360

GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420

GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 540

AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTCG 600 

GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTCCTC CTACGCCTTC 660 

CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 780 

GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320

GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440

TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

GGCCGGTTCC CCCGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620

CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCCGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTCTCGGCTG 1800

TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GCGGCGGCGG 1860 

CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920
CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 






TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280 

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400 

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 



TGGTGCTGCG GCAGTATGAG GACATGGTGG TGGACGAGTG CGGCTGCCGC TAACCCGGGG

1 11 21 31
I I I I 
MAAAGPAAGP TGPEPMPSYA QLVQRGWGSA LAAARGCTDC 
LLLALGALGW TALRSAATAR LFRPLAKRCC LQPRDAAKMP ESAWKFLFYL 
FGTDYPFFHD PPSVFYDWTP GMAVPRDIAA AYLLQGSFYG ' 



I VSSYAFRYHN VGILVLFLHD ISDVQLEFTK LNIYFKSRGG SYHRLHALAA 240
DLGCLSFGFS WFWFRLYWFP LKVLYATSHC SLRTVPDIPF YFFFNALLLL LTLMNLYWFL 300
L TGQVHELKDL REYDTAEAQS LKPSKAEKPL RNGLVKDKRF 




3 AGCACCTGCG G 

3 CAGCCATGCC G, 

G CTGCGCAGCT 

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG G 
CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 
C TGCACCTGGA G 



CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 
CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG G 
ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGAC 
CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG G 

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCACCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTOT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCICT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CAACCCGCTG GACCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCICCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCftT GCGCAGATGA 2580

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700
C TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG 



TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 

CCTCCTTCCC TCCTGTGCCC CAGCIGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

CTGCAGGGGG AGGAGGACTI GGAGGGTCTG GGGGCAGCTG 3240 



TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC
G ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 






CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC G 
CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 
TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGC-C AGTGGCCAAG CCTGCTGTGT 
CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 248 Protein sequence 
Protein Accession #: NP_002072.1

1 11 21 31 41 51 

I I I I I I 

MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 
ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 
QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDVTOKVAQV 
PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 
TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 
Seq ID NO: 249 DNA sequence






GTCTCCCCAC GGATGTCCGG GCTCCTCTCC CAGACTGTGA TCCTAGCGCT 
CGTCTTCGAG CTGCAGATCC ACTCTTTCGG 
CCCCTGCAGC GCCCGGCTCC CCTGCCGCCT 
GTCTGCCTGA AGCCTGGGCT CTCAGAGGAG GCCGCCGAGT CCCCGTGCGC 
CGCGCGGACC GGTCTACACC GAGCAGCCCG GAGCGCCCGC 
ACGGGCTCTT G( 
CACCTTCTCT TTCATCATCG AAACCTGGAG Al 
CGCCTGGAGC CTGCTGGCGC GCGTGGCTGG CAGGCGGCGC TTGGCAGCCG GAGGCCCGTG 
GGCCCGGGAC ATTCAGCGCG C 

GACGAATGTG A 

GTCTCCACCA G 
CGGATGCCTT GTCCCTGGGC CTGGGCCCTG 

TGACGGGAAC CCGTGTGCCA 



GTCCTCGCTG O 

CGTGTGTGGA GGGCGGCGGC GCGCACCGCT G 
GCTGGGCTTC GGCGGCCGCG A 

TGCTCACGGC GGCCGCTGCT ACGCCCACTT CTCCGGCCTC GTCTGCGCTT G 

CTACATGGGA GCGCGGTGTG AGTTCCCAGT GCACCCCGAC GGCGCAAGCG CCTTGCCCGC 1440 

: CCGGGGACCC TCAGCGCTAC CTTTTGCCTC CGGCTCTGGG 1500

I TGGCCGGCGC TGCGCTCTTG CTGGTCCACG TGCGCCGCCG 1560 

TGGCCACTCC CAGGATGCTG GGTCTCGCTT CCTGCCTGGG ACCCCGGAGC CGTCAGTCCA 1620 

CGCACTCCCG GATGCACTCA ACAACCTAAG GACGCAGGAG GGTTCCGGGG ATGGTCCGAG 1680

CTCGTCCGTA GATTGGAATC GCCCTGAAGA TGTAGACCCT CAAGGGATTT ATGTCATATC 1740

TGCTCCTTCC ATCTACGCTC GGGAGGTAGC GACGCCCCTT TTCCCCCCGC TACACACTGG 1800

GCGCGCTGGG CAGAGGCAGC ACCTGCTTTT TCCCTACCCT TCCTCGATTC TGTCCGTGAA 1860
I TAGAGTCTCT GGAAGGTTTT AAGCCCATTT TCAGTTCTAA CTTACTTTCA 



1 11 21 31 41 51 

I I I I I I 

MVSPRMSGLL SQTVILALIF LPQTRPAGVF ELQIHSFGPG PGPGAPRSPC SARLPCRLFF 60 

RVCLKPGLSE EAAESPCALG AALSARGPVY TEQPGAPAPD LPLPDGLLQV PFRDAWPGTF 120 

SFIIETWREE LGDQIGGPAW SLLARVAGRR RIAAGGPNAR DIQRAGAWEL RFSYRARCEP 180

PAVGTACTRL CRPRSAPSRC GPGLRPCAPL EDECEAPLVC RAGCSPEHGF CEQPGECRCL 240 

EGWTGPLCTV PVSTSSCLSP RGPSSATTGC LVPGPGPCDG NPCANGGSCS ETPRSFECTC 300 

PRGFYGLRCE VSGVTCADGP CFNGGLCVGG ADPDSAYICH CPPGFQGSNC EKRVDRCSLQ 360 

PCRNGGLCLD LGHALRCRCR AGFAGPRCEH DLDDCAGRAC ANGGTCVEGG GAHRCSCALG 420 

FGGRDCP.ERA DPCAARPCAH GGRCYAHFSG LVCACAPGYM GARCEFPVHP DGASALPAAP 480 

PGLRPGDPQR YLLPPALGLL VAAGVAGAAL LLVHVRRRGH SQDAGSRLLA GTPEPSVHAL 540 

PDALNNLRTQ EGSGDGPSSS VDWNRPEDVD PQGIYVISAP SIYAREVATP LFPPLHTGRA 600
GQRQHLLFPY PSSILSVK 



I I I I I I 

GAAATATAAC CATTGCAATT AGAAAATATC CAAAATAGCC TGTTATCTTC CAC 
AGATTATTGA CAATCCCAAA TATACAATTT TTCTTTAAAA GTAGTACAAT TTCCTTTGTA 
GCTTCAATTC CTTATATGAC TTCAGACTGG AGAAGCCTGT TAAACCACTG TTAGTTTCAG 
ICATAAA TTCTCAATTT GGCTGCTGTA CACGTGCCAG 
h CTGTAGTGAC CGTTGAGAAG A 
1 TGTTAAGTGC C 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



AGGTACTGCC AGAAAGGATC AGGACCTGGA GTCTGGCAAG AGGAAGACAG AGGCCTGTGT 
GGGAAGCGAG TTGTTATCTT TGGTTATCTA GCTGTATGAG TGTATTGGTC TTCATAAAGC
TAGATAACCG AAAGTAAAAA CTTCTTCAAG ATCGCCGGGG AGCGTGTGAG AATGAAAGAC 
TACAGCCGAG AGACAGTAAA AACCAGAAAG GTCAGGAATA CTTATTGAAT CTAACTTTGT 
TTTTGTTTTG TTTTTTTCCT TATGATTAAA GGTGGGATGA GAGAAAATTA AATGACACAC 
ACATGCTAAA ATATCAAGGT TCCAGATATG TCTTGAGAGG GGTTGTTGCA GCTGCAAAGG 
AGAAGTGTAT AGTGATAATG AGTAAAGATG CATGTGOAGT TTGTTCTATT TTAAGGCAAA 
AGTTATATCA GGGATTTTTT TCTTAGAAAG GTGTTGCAGA GATGTCTGGT ACCTAGTTTA 
AAAATGATTC CATAATATGT AGACTTGGGC AGTTCCTTTG GGAGGCACCT CCCTCTCAAA 
ATTTGAAGAT TGTGCTTGGA AATTACTTTA CATGTATTTG GGCTGTATGT CAATTTGTAC 
GAAATTAGAC TTTCAGAAAA GTTTATACTG GAAGGTTAAT AATTTGTATC TACTGAGGAC 
TTAGAGCTAG CAGGCAAAAT GAAAAAAAAA AAAAAAAARG CAAGGGCTGA TTTTTATTCT 
TCTATTCAAA ATACAAGGAC AGATGCTTCT CTGTTCCAAG AGGGTTTCCT TGAGGAAGCT 
ACTGAAGCAG AAAGACATGA TGGAGACGAG ATCGCCTCCC CTCITGTCAA A 
AAATGTTCTG TCTTACTCTG CGCCTAGCAT T 



CATACTCATC TCCATCGCTT 

\ATCTGG CCATAATTAA GTTTGGCTTT CTTCGTTGTT 
3 GCAAGCTAGT AGAAAGCAAT 
AATGTCAAAA TCTCAGTCCA 










ACAATTTCCT TTGTAGCTTC A 

CACTGTTAGT TTCAGTTAGA AAGTCTGAGA GACTTTATAC ATAAATTCTC AATTTGGCTG 
CTGTACACGT GCCAGAGTTT T 
C ACTGTTTGTG C 

G TCCAGGGGTG CTATTTACCT AGAACCATTG TCTACTACAA TTAACATTTA 2280 
CATTACAAAG TGTGTGGTTT TCTTTTTCAA GGAGGTTCAA TTAAGC-CAAT AAGATGTTTG 2340 
CTGGAGAAAC CTATTGTTTA CK3AAAGCAC TCRATGAAGT CAAATTACTG AAGCTTTTGC 2400 
CTACATCTTG GTCTTTTATG TAAATATGTT AAATATAACA TCTAAGGAAA ATAAACAATA 2460
TTATAATTAT GTGTTTGCCA TTGTCATATC AAACTTGCTT TGTATCATAC TAATGTTACA 2520 
TAACTTATCG ATCAATAAAA ATACATTTCA ATGTT 



Coding sequence: 40..1011

1 11 21 31 41 51 

I I I I I I 

GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 

AGGCGGTGGG GTAAGTGTGG ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 

GGGGTCTGGA CTCAAGCTTT CTGGAAAGCA GTCACAGCGG AATTTCTGGC CATGCTTATT 

TTTGTTCTCC TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA GCCTTTACCG 

GTCGACATGG TTCTCATCTC CCTTTGCTTT GGACTCAGCA TTGCAACCAT GGTGCAGTGC 

TTTGGCCATA TCAGCGGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 

AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG CCCAGTGCCT GGGGGCCATC 

ATTGGAGCAG GAATCCTCTA TCTGGTCACA CCTCCCAGTG TGGTGGGAGG CCTGGGAGTC 

ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC TGGTTGAGTT GATAATCACA 



TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC ATTTATTTGC 
ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG 
GAAAACCATT GGATATATTG GGTTGGGCCC 
TATGAGTATG TCTTCTGTCC AGATGTTGAA 
AAAGCTGCCC AGCAAACAAA AGGAAGCTAC ATGGAGGTGG AGGACAACAG GAGTCAGGTA 
GAGACGGATG ACCTGATTCT A 



A CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 
GAAACAGATT TGTTATAAAT T. 



AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320
CTTATTCCTT CTCTACTGGA ATATTGGTAT 1380
TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C 



I I I I 

MSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA E
GTEKPLPVDM VLISLCFGLS IATMVQCFGH ISGGHINPAV TVAMVCTRKI SIAKSVFYIA
AQCLGAIIGA GILYLVTPPS VVGGLGVTMV HGNLTAGHGL LVELIITFQL VFTIFASCDS
KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG
AVLAGGLYEY VFCPDVEFKR RFKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGVVH
VIDVDRGEEK KGKDQSGEVL SSV







CAGCACCAAA TGAAAGAGTA CACGTCATGG 
AAATCACCTG CTAAGAAGCT GACTAATGCA TTAAGCAAGT CCCTGAGCTG TGCTTCCAGC 
CGTGAACCTT TGCACCCCAT GTTCCCAGAT CAGCCTGAGA AGCCACTCAA CTTGGCTCAC
ATCGTTGATA CTTGGCCTCC CAGACCTGTA ACCAGCATGA ACGACACCCT GTTCTCCCAC 
TCTGTTCCCT CCTCAGGAAG TCCTTTTATT ACCAGGAGCT CGGACGGTGC TTTTGGTGGA 
TGCGTCTAGA TGGATAACAT GACTTCTTCT ACCCTAAAAT ATTCCTATAA TACTTTGAGC 
TGTTCTGGTT CCTCCAGGGT GCATGGTACC CATTAACCCA AAATATGATT ATTTCCCTTT 
TTTCCCATTT TCAGTCATTT TGGAATGTTC TCTGTGAACC ACAGTTGGGT TGTTTAAAGC
TCACATTTCT TTCTGTCACC ACAGAGATTG GCCTACGGTT TCTGTTTTGA GGGTGCTGTT 
CAATAAAGCT GTGTACACTA AATGTCC 



Protein Accession #:



Protein sequence



PEGHGKISVF AVKMALATLC GGKIMDKLRY IFSMISDSSG VMVYGRYDQF LREVLKLPTA 
T EQSARSCFSQ QKKVTLNGFL DTLMSDPPPQ CLVHLPLLHR LANVENVFHF 



VECSYCHSES MMGFRYRCQQ CHNYQLCQDC FWRGHAGGSH SNQHQMKEYT SWKSPAKKLT 300 
NALSKSLSCA SSREPLHPMF PDQPEKPLNL AHIVDTWPPR PVTSMNDTLF SHSVPSSGSP 360 
FITRSSDGAF GGCV 

Seq ID NO: 257 DNA sequence 

Nucleic Acid Accession #: NM_004172.1 

Coding sequence: 179..1807



GCGGATTGTT GCTCCGTTGT ACCTGCTGGG GAATTCACCT CGTTACTGC 
'TCTAATACC AAAGAGGAGI 

ft AGTAAAATAT 



\ CGCACACTTT TGGCCAAGAA GAAAGTGCAG AACATTACAA AGGAGGATGT 
:ttttgt GCTGCTCACA GTCACCGCTG TCATTGTGGG 
T GGATTTACCC TCCGACCATA CAGAATGAGC TACCGGGAAG TCAAGTACTT 
I GGGGAACTTC TGATGAGGAT GTTACAC 

A TGGCGGCGCT AGATAGTAAG G 
GCGAGCTGTA GTCTATTATA TGACTACCAC CATCATTGCT GTGGTGATTG GCATAATCAT
TGTCATCATC ATCCATCCTG GGAAGGGCAC AAAGGAAAAC A' 
TGTACGAGTG ACAGCTGCAG ATGCCTTCCT GGACTTGATC AGGAACATGT T 
A GCCTGCTTTA AACAGTTTAA AACCAACTAT G. 



GGAGACTCTT ACCCGAATCA CAGAGGAGCT GGTCCCAGTT CCAGGATCTG TGAATGGAGT 
3 GGTCTAGTTG TCTTCTCCAT GTGCTTCGGT TTTGTGATTG GAAACATGAA 
3CCCTGA GAGAGTTCTT TGATTCTCTT AACGAAGCCA TCATGAGACT 
GGTAGCAGTA ATAATGTGGT ATGCCCCCGT GGGTATI 
TGTGGAGATG GAAGACATGG GTGTGATTGG GGGGCAGCTT G 

CATTGTTGGC TTACTCATTC ACGCAGTCAT CGTCTTGCCA CTCCTCTACT TCTTGGTAAC
ACGGAAAAAC CCTTGGGTTT TTATTGGAGG GTTGCTGCAA GCACTCATCA CCGCTCTGGG
GACCTCTTCA AGTTCTGCCA CCCTACCCAT CACCTTCAAG TGCCTGGAAG A
CGTGGACAAG CGCGTCACCA GATTCGTGCT CCCCGTAGGA GCCACCATTA A
GACTGCCCTC TATGAGGCTT TGGCTGCCAT TTTCATTGCT CAAGTTAACA ACTTTGAACT 1440
GAACTTCGGA CAAATTATTA CAATCAGCAT CACAGCCACA GCTGCCAGTA TTGGGGCAGC 1500
TGGAATTCCT CAGGCGGGCC TGGTCACTAT GGTCATTGTG CTGACATCTG TCGGCCTGCC 1560
CACTGACGAC ATCACGCTCA TCATCGCGGT GGACTGGTTC CTGGATCGCC TCCGGACCAC 1620
CACCAACGTA CTGGGAGACT CCCTGGGAGC TGGGATTGTG GAGCACTTGT CACGACATGA
.C AGAGATGTTG AAATGGGTAA CTCAGTGATT GAAGAGAATG AAATGAAGAA 
C AGGACAATGA AACTGAGAAA CCCATCGACA GTGAAACCAA 
A AACACTTTCT TGAGCACCAG GTGTTAAAAA CCATTATAAA 
T CTCATTACAG CTCATTCGCT CCAGCAAGCC CGTCATCTTC CCTTTCCTCC 
i TAGTCCTCCA AAACACAAGG GAGGATTTTG GGTGGCCAAA 
; AATTGAAATT TTTAAATCAT TTCATGTTAG TCTTACCGAA 2040 
h TAGTGTTGAT CAGATCTTAC AAGTTTATGT GGCACACAAT 2100 



TTTTGAAATT TAAAAATCTT TCAGAATACA A 
TTGAATTACA ACCGGTTATC AGTTGGACAG TAAGATTTTA T 
TATACCTATT TCATTAGTAG CTAGGTGCAC ATATACATCT AGCACAGCTG 
CAGAAGGCAA AGTTTCCATG TGGCCTTGAG CAAGTCCCAT C 
TAAAATGAGG GACTTCC 

CACCTGCCCT CTGTTTCCCC T CAG AACACC 



: TGTGTGTGGT GGTCACCAGG TCCTGTGAGC 
I CGCCAGGCAG GAGGCCATTC CAGGAGTGGG ATTATTCATC AAACTCTTTG 
T CCCAATGGGG GAAGTATTCC CTTCTTTCCT ACTCTGGGAA G 
TGCCACTCCT CAACTGATGA TAGACTTCGA AAACAGATGA GAAGAC] 
GGTGCTTGTA GTCACACTGT GGAACACTAA AGAGCTAGGA AAGAGTTGAG CACAGGCAAC 
ATTACAAACA AAGGATTTGA AAACACCAAG AGTACAGGTC TTCTTTAAGG AAGAATAAAA 
AAGAAGAGGT TCATTTTTCT GGCTTTTTTT TTCACCTGAA ACACTTTTTC TCGAGTCCAA 
AATCATTCCC CCCGTGAAGT CTGCTTACCA AAACATAAGA CGACTTJ" 
AGTCAAATGA ATGAGCTCTC TAATAGAAGT CCATGAGTTG AGTGGGTATT T 
AAGTGTTTTT CTTTAATCAA AAGTCCTTAG AATGAGGGAA ACAAAATATT TATTTGTTTT 3180 
GGAATCCCAC TTATCAAATC ATTCAAAACT TTCAGCTGGA GTGGGGTTTG CTTTTGTTTT 3240
GTTTGTGTCC ATAAGAGAAA TGGTAGAAGA TGAATCAGTA TGAAGACACT GTCAATGAGG 3300 
TTATGAGAAA AAAACAGCAG GGGCATTAGT TTCAGGCAAG GCAGCTCCCA GGTTTAGAGA 3360 
TTAATTTTTA CCCCCTAAGG AATATCCAGT CAAAGACGCT GAGTGGGAGC TGTCAGGCAG 3420 
TAGCAGCTGT GTTTGAGTTT CTGGCTGAAA ATGGTGAAGA ATGGACTTAA TTATGCTAAC 3480 
AAACTGAAAA ATCTAGACAT AGATCCTCTG ATATACAATT AGAGATATTT TTATATAGAC 3540 
CCCAAGCATT CTGTGCATAA AAGTTAACAT TAGGCTGTGG TGCAGTAACC ATTTAATGTC 3600 
GAGGCTCTAT TTCGGAAATA CACTACAAAT GTTAAAGTAC GTGGCTGTCC TCTTAAGACA 3660 
CTAGTAGAGC AAAGACTTAA TCATATCAAC TTAATTCTGT TACACAATAT GTGTTTTTTA 3720 
ATATACTAAC CATTTCTTAT GGAAAGGTCC TGTGGGGAGC CCATCCTCTC GCCAAGCCAT 3780 



Protein Accession #:

1 11 21 31 

I I I I 

MTKSNGEEPK MGGRMERFQQ GVRKRTLLAK KKVQNITKED VKSYLFENAF V

GTILGFTLRP YRMSYREVKY FSFPGELLMR MLQMLVLPLI ISSLVTGMAA LDSKASGKMG 
MRAVVYYMTT TIIAVVIGII IVIIIHPGKG TKENMHREGK IVRVTAADAF LDLIRNMFPP
NLVEACFKQF KTNYEKRSFK VPIQANETLV GAVINNVSEA METLTRITEE LVPVPGSVNG
VNALGLVVFS MCFGFVIGNM KEQGQALREF FDSLNEAIMR LVAVIMWYAP VGILFLIAGK
IVEMEDMGVI GGQLAMYTVT VIVGLLIHAV IVLPLLYFLV TRKNPWVFIG GLLQALITAL

» GVDKRVTRFV LPVGATINMD GTALYEALAA IFIAQVNNFE 
3IITIS ITATAASIGA AGIPQAGLVT MVIVLTSVGL PTDDITLIIA VDWFLDRLRT 
G NSVIEENEMK KPYQLIAQDN ETEKPIDSET 



TGTGGCACTG CCTGCGTACC CAACCCCAGC CCTGGGTAGC CTGCAGCATG GCCCAGCTGT 
TCCTGCCCCT GCTGGCAGCC CTGGTCCTGG CCCAGGCTCC TGCAGCTTTA GCAGATGTTC 
TGGAAGGAGA CAGCTCAGAG GACCGCGCTT TTCGCGTGCG CATCGCGGGC GACGCGCCAC 
TGCAGGGCGT GCTCGGCGGC GCCCTCACCA TCCCTTGCCA CGTCCACTAC CTGCGGCCAC 
CGCCGAGCCG CCGGGCTGTG CTGGGCTCTC CGCGGGTCAA GTGGACTTTC CTGTCCCGGC
GTGCTG GTGGCGCGGG GAGTGCGCGT CAAGGTGAAC GAGGCCTACC 
T GGCACTGCCT GCGTACCCAG CGTCGCTCAC O 
AACGAC TCAGGT AT CT A 
ACAGCAGCGA CGCTGTGGAG GTCAAGGTCA AAGGGGTCGT CTTTCTCTAC CGAGAGGGCT

ACATCGCCAC CCCGGAGCAG CTCTATGCCG CCTACCTTGG GGGCTATGAG CAATGTGATG 
CTGGCTGGCT GTCGGATCAG ACCGTGAGGT ATCCCATCCA GACCCCACGA GAGGCCTGTT
ACGGAGACAT GGATGGCTTC CCCGGGGTCC GGAACTATGG TGTGGTGGAC CCGGATGACC 
TCTATGATGT GTACTGTTAT GCTGAAGACC TAAATGGAGA ATTGTTCCTG GGTGACCCTC 
CAGAGAAGCT GACATTGGAG GAAGCACGGG CGTACTGCCA GGAGCGGGGT GCAGAGATTG 
CCACCACGGG CCAACTGTAT GCAGCCTGGG ATGGTGGCCT GGACCACTGC AGCCCAGGGT 
GGCTAGCTGA TGGCAGTGTG CGCTACCCCA TCGTCACACC CAGCCAGCGC TGTGGTGGGG
GCTTGCCTGG TGTCAAGACT CTCTTCCTCT TCCCCAACCA GACTGGCTTC CCCAATAAGC 
ACAGCCGCTT CAACGTCTAC TGCTTCCGAG ACTCGGCCCA GCCTTCTGCC A 
CCTCCAACCC AGCCTCCAAC CCAGCCTCTG ATGGACTAGA GGCTATCGTC A 
AGACCCTGGA GGAACTGCAG CTGCCTCAGG AAGCCACAGA G 
TCTACTCCAT CCCCATCATG GAGGACGGAG GAGGTGGAAG CTCCACTCCA GAAGACCCAG 1320
C TAGGACGCTC CTAGAATTTG AAACACAATC CATGGTACCG CCCACGGGGT 1380
TCTCAGAAGA GGAAGGTAAG GCATTGGAGG AAGAAGAGAA ATATGAAGAT GAAGAAGAGA
AAGAGGAGGA AGAAGAAGAG GAGGAGGTGG AGGATGAGGC TCTGTGGGCA TGGCCCAGCG 
AGCTCAGCAG CCCGGGCCCT GAGGCCTCTC TCCCCACTGA GCCAGCAGCC CAGGAGGAGT 
CACTCTCCCA GGCGCCAGCA AGGGCAGTCC TGCAGCCTGG TGCATCACCA CTTCCTGATG 
GAGAGTCAGA AGCTTCCAGG CCTCCAAGGG TCCATGGACC ACCTACTGAG ACTCTGCCCA 
CTCCCAGGGA GAGGAACCTA GCATCCCCAT CACCTTCCAC TCTGGTTGAG GCAAGAGAGG 
TGGGGGAGGC AACTGGTGGT CCTGAGCTAT CTGGGGTCCC TCGAGGAGAG AGCGAGGAGA 
CAGGAAGCTC CGAGGGTGCC CCTTCCCTGC TTCCAGCCAC ACGGGCCCCT GAGGGTACCA 
GGGAGCTGGA GGCCCCCTCT GAAGATAATT CTGGAAGAAC TGCCCCAGCA GGGACCTCAG 



CCGCATCAGG TGACTGTGTC CCCAGCCCCT GCCACAATGG TGGGACATGC TTGGAGGAGG 2040

AGGAAGGGGT CCGCTGCCTA TGTCTGCCTG GCTATGGGGG GGACCTGTGC GATGTTGGCC 2100 

TCCGCTTCTG CAACCCCGGC TGGGACGCCT TCCAGGGCGC CTGCTACAAG CACTTTTCCA 2160 

CACGAAGGAG CTGGGAGGAG GCAGAGACCC AGTGCCGGAT GTACGGCGCG CATCTGGCCA 2220 

GCATCAGCAC ACCCGAGGAA CAGGACTTCA TCAACAACCG GTACGGGGAG TACCAGTGGA 2280

TCGGACTCAA CGACAGGACC ATCGAAGGCG ACTTCTTGTG GTCGGATGGC GTCCCCCTGC 2340

TCTATGAGAA CTGGAACCCT GGGCAGCCTG ACAGCTACTT CCTGTCTGGA GAGAACTGCG 2400 

TGGTCATGGT GTGGCATGAT CAGGGACAAT GGAGTGACGT GCCCTGCAAC TACCACCTGT 2460 

CCTACACCTG CAAGATGGGG CTGGTGTCCT GTGGGCCGCC ACCGGAGCTG CCCCTGGCTC 2520 

AAGTGTTCGG CCGCCCACGG CTGCGCTATG AGGTGGACAC TGTGCTTCGC TACCGGTGCC 2580 

GGGAAGGACT GGCCCAGCGC AATCTGCCGC TGATCCGATG CCAAGAGAAC GGTCGTTGGG 2640

AGGCCCCCCA GATCTCCTGT GTGCCCAGAA GACCTGCCCG AGCTCTGCAC CCAGAGGAGG 2700 

ACCCAGAAGG ACGTCAGGGG AGGCTACTGG GACGCTGGAA GGCGCTGTTG ATCCCCCCTT 2760

CCAGCCCCAT GCCAGGTCCC TAGGGGGCAA GGCCTTGAAC ACTGCCGGCC ACAGCACTGC 2820 
CCTGTCACCC AAATTTTCCC TCACACCCTG CGCTCACCAC AGGAAGTGAC AACATGAC 

Seq ID NO: 260 Protein sequence 
Protein Accession #: NP_068767.1 

1 11 21 31 41 51 

I I I I I I 

MAQLFLPLLA ALVLAQAPAA LADVLEGDSS EDRAFRVRIA GDAPLQGVLG GALTIPCHVH 60 

YLRPPPSRRA VLGSPRVKWT FLSRGREAEV LVARGVRVKV NEAYRFRVAL PAYEASLTDV 120 

SLALSELRPN DSGIYRCEVQ HGIDDSSDAV EVKVKGVVFL YREGSARYAF SFSGAQEACA 180

RIGAHIATPE QLYAAYLGGY EQCDAGWLSD QTVRYPIQTP REACYGDMDG FPGVRNYGVV 240

DPDDLYDVYC YAEDLNGELF LGDPPEKLTL EEARAYCQER GAEIATTGQL YAAWDGGLDH 300

CSPGWLADGS VRYPIVTPSQ RCGGGLPGVK TLFLFPNQTG FPNKHSRFNV YCFRDSAQPS 360 

AIPEASNPAS NPASDGLEAI VTVTETLEEL QLPQEATESE SRGAIYSIPI MEDGGGGSST 420 

PEDPAEAPRT LLEFETQSMV PPTGFSEEEG KALEESEKYE DEEEKEEESE EEEVEDKALW 480 

AWPSELSSPG PEASLPTEPA AQEESLSQAP ARAVLQPGAS FLPDGESEAS RPPRVHGPPT 540 

ETLPTPRERN KASPSPSTLV EAREVGEATG GPELSGVPRG ESEETGSSSG APSLLPATRA 600 

' SEDNSCRTAP ACTSVQAQPV LPTDSASRGG VAWPASGDC VPSPCHNGGT 660 

: LCLPGYGGDL CDVGLRFCNP GWDAFQGACY KHFSTRRSWE EAETQCRMYG 720 

AHLASISTPE EQDFINNRYR EYQWIGLNDR TIEGDFLWSD GVPLLYENWN PGQPDSYFLS 780

GENCVVMVWH DQGQWSDVPC NYHLSYTCKM GLVSCGPPPE LPLAQVFGRP RLRYEVDTVL 840

RYRCRECLAQ RNLPLIRCQE NGRWEAPQIS CVPRRPARAL HPEEDPEGRQ GRLLGRWKAL 900
LIPPSSPMPG P 

Seq ID NO: 261 DNA sequence 

Nucleic Acid Accession #: NM_004386.1 

Coding sequence: 2..3967

1 11 21 31 41 51 

I I I I I I 

GATGGGGGCC CCGTTTGTCT GGGCCTTGGG CCTTTTGATG CTGCAGATGC TGCTCTTTGT 60

GGCTGGGGAA CAGGGCACAC AGGATATCAC CGATGCCAGC GAAAGGGGGC TCCACATGCA 120

GAAGCTGGGG TCTGGGTCAG TGCAGGCTGC GCTGGCGGAG CTGGTGGCCC TGCCCTGTCT 180 

CTTTACCCTG CAGCCACGGC CAAGCGCAGC CCGAGATGCC CCTCGGATAA AGTGGACCAA 240 

GGTGCGGACT GCGTCGGGCC AGCGACAGGA CTTGCCCATC CTGGTGGCCA AGGACAATGT 300 

CGTGAGGGTG GCCAAAAGCT GGCAGGGACG AGTGTCACTG CCTTCCTACC CCCGGCGCCG 360 

AGCCAACGCC ACGCTACTTC TGGGGCCACT GAGGGCCAGT GACTCTGGGC TGTACCGCTG 420 

CCAGGTGGTG AGGGGCATCG AGGATGAGCA GGACCTGGTG CCCTTGGAGG TGACAGGTGT 480 

TGTGTTCCAC TACCGATCAG CCCGGGACCG CTATGCACTG ACCTTCGCTG AGGCCCAGGA 540 

GGCCTGCCGT CTCAGCTCAG CCATCATTGC AGCCCCTCGG CATCTACAGG CTGCCTTTGA 600 

GGATGGCTTT GACAACTGTG ATGCTGGCTG GCTCTCTGAC CGCACTGTTC GGTATCCTAT 660 

CACCCAGTCC CGTCCTGGTT GCTATGGCGA CCGTAGCAGC CTTCCAGGGG TTCGGAGCTA 720 

TGGGAGGCGC AACCCACAGG AACTCTACGA TGTGTATTGC TTTGCCCGGG AGCTGGGGGG 780 

CGAGGTCTTC TACGTGGGCC CGGCCCGCCG CCTGACACTG GCCGGCGCGC GTGCACAGTG 840 

CCGCCGCCAG GGTGCCGCGC TGGCCTCGGT GGGACAGCTG CACCTGGCCT GGCATGAGGG 900 

CCTGGACCAG TGCGACCCGG GCTGGCTGGC CGACGGCAGC GTGCGCTACC CGATCCAGAC 960 

GCCGCGCCGG CGCTGCGGGG GCCCAGCCCC GGGCGTGCGC ACCGTCTACC GCTTCGCTAA 1020 

CCGGACCGGC TTCCCCTCAC CCGCCGAGCG CTTCGACGCC TACTGCTTCC GAGCTCATCA 1080 

CCCCACGTCA CAACATGGAG ACCTAGAGAC CCCATCCTCT GGGGATGAGG GGGAGATTCT 1140 

GTCAGCAGAG GGGCCCCCAG TTAGAGAACT GGAGCCCACC CTGGAGGAGG AAGAGGTGGT 1200

CACCCCTGAC TTCCAGGAGC CTCTGGTGTC CAGTGGGGAA GAAGAAACCC TGATTTTGGA 1260

GGAGAAGCAG GAGTCTCAAC AGACCCTCAG CCCTACCCCT GGGGACCCCA TGCTGGCCTC 1320 

ATGGCCCACT GGGGAAGTGT GGCTAAGCAC GGTGGCCCCC AGCCCTAGCG ACATGGGGGC 1380 

AGGCACTGCA GCAAGTTCAC ACACGGAGGT GGCCCCAACT GACCCTATGC CTAGGAGAAG 1440 

GGGGCGCTTC AAAGGGTTGA ATGGGCGCTA CTTCCAGCAG CAGGAACCGG AGCCGGGGCT 1500 

GCAAGGGGGG ATGGAGGCCA GCGCCCAGCC CCCCACCTCA GAGGCTGCAG TGAACCAAAT 1560

GGAGCCTCCG TTGGCCATGG CAGTCACAGA GATGTTGGGC AGTGGCCAGA GCCGGAGCCC 1620

CTGGGCTGAT CTGACCAATG AGGTGGATAT GCCTGGAGCT GGTTCTGCTG GTGGCAAGAG 1680 

CTCCCCAGAG CCCTGGCTGT GGCCCCCTAC CATGGTCCCA CCCAGCATCT CAGGCCACAG 1740 

CAGGGCCCCT GTCCTGGAGC TAGAGAAAGC CGAGGGCCCC AGTGCCAGGC CAGCCACCCC 1800
AGACCTGTTT TGGTCCCCCT TGGAGGCCAC TGTCTCAGCT CCCAGCCCTG CCCCCTGGGA 1860

GGCATTCCCT GTGGCCACCT CCCCAGATCT CCCTATGATG GCCATGCTGC GTGGTCCCAA 1920 

AGAGTGGATG CTACCACACC CCACCCCCAT CTCCACCGAG C-CCAATAGAG TTGAGGCACA 1980 

TGGTGAGGCC ACCGCCACGG CTCCACCCTC CCCTGCTGCA GAGACCAAGG TGTATTCCCT 2040 

GCCTCTCTCT TTGACCCCAA CAGGACAGGG TGGAGAGGCC ATGCCCACAA CACCTGAGTC 2100 

CCCCAGGGCA GACTTCAGAG AAACTGGGGA GACCAGCCCT GCTCAGGTCA ACAAAGCTGA 2160 

GCACTCCAGC TCCAGCCCAT GGCCTTCTGT AAACAGGAAT GTGGCTGTAG GTTTTGTCCC 2220 

CACTGAGACT GCCACTGAGC CAACGGGCCT CAGGGGTATC CCGGGGTCIG AGTCTGGGGT 2280 

CTTCGACACA GCAGAAAGCC CCACTTCTGG CTTGCAGGCC ACTGTAGATG AGGTGCAGGA 2340 

CCCCTGGCCC TCAGTGTACA GCAAAGGGCT GGATGCAAGT TCCCCATCTG CCCCCCTGGG 2400 

GAGCCCTGGA GTCTTCTTGG TACCCAAAGT CACCCCAAAT TTGGAGCCTT GGGTTGCTAC 2460 

AGATGAAGGA CCCACTGTGA ATCCCATGGA TTCCACAGTC ACGCCGGCCC CCAGTGATGC 2520 

TAGTGGAATT TGGGAACCTG GATCCCAGGT GTTTGAAGAA GCCGAAAGCA CCACCTTGAG 2580 
CCCTCAGGTG GCCCTGG 

A TGTCTACACT GGGCTCCTCA A 
\ CCCAGGGAAC A 

A GTTTCCTCAG GGGAGCCTAC GGTACCGTGG 
GCCTGTCACC CTGGGCATAG AGGACTTCGA 
AGAGAGCTTC TGGGAGGAGG TGGCAAGTGG 
GAATGCAGGT GCGGAGGAGG TGCACTCAGA TCCCTGTGAG 
AGGGACATGT AATGCCAATG 



G GCCACTGTTA 



CAACACCGGG CTGCAATTTG AGAACTGGCG AGAGAACCAG CCGGACAATT TCTTCGCGGG 3540 

G ATGTCCCCTG 3600 

C CCCCTCCGGC 3660 

AGTGGAGAAT GCCTCACTCA TCGGTGCCCG CAAGGCCAAG AACAATGTCC ATGCCACTGT 3720 

AAGGTACCAG TGCAATGAAG GATTTGCCCA GCACCATGTG GTCACCATTC GATGCCGGAG 3780 

CAATGGCAAG TGGGACAGGC CCCAAATTGT CTGCACCAAA CCCAGAGGTT CACATCGGAT 3840 



CTGTTCTCGC AAAAAAAATA TTCCTTTGAA CAAAGGTCTT CTTTTCCTTT TTTTACATAC 4140 

ACAAGATCTT CTTGGCAGGT GGAGCCAGGT GTCTGAAAAG TTCATTCTCG TCTGGCTGAA 4200

CTCTGGGAGT GTGTCCCAGC TGAGGGAAGC ACAAGTAGCA AAGCTCATTG GTCTGGTCTC 4260 

TTGTTTGCCA GGCTGATTGA AGCAGGCCTT GATGAGGGTG CATGAGTGTA TGTTTGCATT 4320

CACATCAAGG AATTGCTTTT CACACCAGAA ATTCAGACTT AGTCAATGTT GGCTGAATTC 4380

CTAAATCCAG GAAGAAGCCT GGACGTAGGG TCATTAGCTT TGGGAATAGA AGGCTACACA 4440 

GAAGCACACT GTTTTTGAAC TTGACAACAG CTCTCCCTTT ACCCTGGACT TCAGCCCAAG 4500

TTCCCTCTTT GGTCTTGGTG CATAAACACA CAGTGTGGAG ATCCCACGTA CTGCATTTTA 4560 

GGGATGTTTT TAGGACAACC TCCCTCCATG CCTTCAGAGT TAGGAGTGAG AATGATCAAA 4620

GCAATATGTA GGTGATGGAG GGAGAGTGTA TTGCTAACCC TTCCAGGTCT AGTCCAGCGC 4680 

TGAGATTTGG TGGTTCTGCA TGTGTGATGA ATCTCTTTCA CACAAATAGA CGAGAGGATA 4740 

TTTAGGGCTA GATGAGCCCA GATTTCTTCC CCCTCCATCT CTCAGGGAGA CAAAGAACCT 4800 

CCTTCCTGGA CCAAGGAGGT GCTGCCAAGT TTTCTAGCCC AGTGCACATA CCCAGTCCTT 4860 

AAGCAGACAT TGGTAGTGCC CCTGCCCTGG GTCCCACTCC TGCCCCACCC CACCCTTGTC 4920 

CCTGGCCATT GCCTGGTGGT CTAGAAACAC TTAAAACTTG AAGTAGTGAC ACCTACCIGC 4980 

GGTCATATTG TAGAGAGATG CTCAGTGTTA AAACTGAAAC ACACAAACAC ACACACACAC 5040 

ACATTTTTCT CTTGTAGATT TTAATTTTTT AAGTGGGAAA GAACTCACCT TGCCTTCCTC 5100

CAACCTGTA AAAGGTCTCT CCACACCAGG GGCCAGGATC CAGTTCCCTC 5160 

5220 

AGAAAAATAA GCAAATGAAT GGTCCACCTA GGGGTTCAGT AAAGAAAGAA ATGTGTTAAC 5340 

TGAGCCTGAA TCCCTTCTGG GAAGTAATAA TGACCATTGA CAACTAAGAA GTAGACACCA 5400 
TGCTAAAGAC T 






GTAAACTGAA CCCTTATCCT 



, qgtcatcTAT GTCACACAAA AATTTTCCTT 5940 

CTTGGTTTAT GCTCCCTGCC 6000 

CTTGAGCCCC TCAGCCGTTT GCCCTGCCCC CACCTCGGCT CCATGGTGGG AGGGGGCTCT 6060 

GGTCTTTTCT AAAGTGGGCG GTTTGTCTTT TGATCTTTCC CTTTTGGATG TGCGTGTGTG 6120

TCTGCGTGTG CCATGTGCGT GGCACGCATA TGAGTGTGTG TGCGTGTGAA CGGCTTTGGG 6180 

TCCTGCTGGT TTTGCTGTGA GCTGCAGTGT TCTGTGGGTC TGTGGTATCT GACACTGTGG 6240 
ACATTAATGT ACTTCTTGGA CATTTTAATA AATTTTTTAA C 
AAAAAAAAAA 



LLMLQMLLFV AGEQGTQDIT DASEEGLHMQ KLGSGSVQAA
PTLQPRPSAA RDAPRIKWTK VRTASGQRQD LPILVAKDNV VRVAKSWQGR VSLPSYPRRR
ANATLLLGPL RASDSGLYRC QVVRGIEDEQ DLVPLEVTGV VFHYRSARDR YALTFAEAQE
ACRLSSAIIA APRHLQAAFE DGFDNCDAGW LSDRTVRYPI TQSRPGCYGD RSSLPGVRSY
GRRNPQELYD VYCFARELGG EVFYVGPARR LTLAGARAQC RRQGAALASV GQLHLAWHEG
LDQCDPGWLA DGSVRYPIQT PRRRCGGPAP GVRTVYRFAN RTGFPSPAER FDAYCFRAHH
PTSQHGDLET PSSGDEGEIL SAEGPPVREL EPTLEEEEVV TPDFQEPLVS SGEEETLILE
EKQESQQTLS PTPGDPMLAS WPTGEVWLST VAPSPSDMGA GTAASSHTEV APTDPMPRRR

Y FQQQEPEPGL QGGMEASAQP PTSEAAVNQM EPPLAMAVTE MLGSGQSRSP 

5 SPEPWLWPPT MVPPSISGHS RAPVLELEKA SGPSARPATP 






GEATATAPPS PAAETKVYSL PLSLTPTGQG 



Y GCSCDQGFAG EMCEIDIDDC LCSPCENGGT CIDEVNGFVC LCLPSYGGSF 1080 

CEKDTEGCDR GWHKFQGHCY RYFAHRRAWE DAEKDCRRRS GHLTSVHSPE EHSFINSFGH 1140 

ENTWIGLNDR IVERDFQWTD NTGLQFENWR ENQPDNFFAG GEDCWMVAH ESGRWNDVPC 1200 

NYMLPYVCKK GTVLCGPPPA VENASLIGAR KAKNNVHATV RYQCHEGFAQ HHVVTIRCRS 1260

NGKWDRPQIV CTKPRRSHRM RGHHHHHQHH HQHHHHKSRK ERRKHKKHPT EDWEKDEGNF 1320

Seq ID NO: 263 
Nucleic Acid A.

Coding sequence: 7..2085

1 11 21 31 41 51

| | | | | |

GCCGCGATGG CCAGCACCAG GAGTATCGAG CTGGAGCACT TTGAGGAACG GGACAAAAGG 60

CCGCGGCCGG GGTCGCGGAG AGGGGCCCCC AGCTCCTCCG GGGGCAGCAG CAGCTCGGGC 120

CCCAAGGGGA ACGGGCTCAT CCCCAGTCCG GCGCACAGTG CCCACTGCAG CTTCTACCGC 180

ACGCGGACCC TGCAGGCCCT CAGCTCGGAG AAGAAGGCCA AGAAGGCGCG CTTCTACCGG 240

AACGGGGACC GCTACTTCAA GGGCCTGGTG TTTGCCATCT CCAGCGACCG CTTCCGGTCC 300

TTCGATGCGC TCCTCATAGA GCTCACCCGC TCCCTGTCGG ACAACGTGAA CCTGCCCCAG 360

GGTGTCCGCA CTATCTACAC CATCGACGGC AGCCGGAAGG TCACCAGCCT GGACGAGCTG 420 

CTGGAAGGTG AGAGTTACGT GTGTGCATCC AATGAACCAT TTCGTAAAGT CGATTACACC 480 

AAAAATATTA ATCCAAACTG GTCTGTGAAC ATCAAGGGTG GGACATCCCG AGCGCTGGCT 540

GCTGCCTCCT CTGTGAAAAG TGAAGTAAAA GAAAGTAAAG ATTTCATCAA ACCCAAGTTA 600

GTGACTGTGA TTCGAAGTGG AGTGAAGCCT AGAAAAGCCG TGCGGATCCT TCTGAATAAA 660

AAGACTGCTC ATTCCTTTGA ACAAGTCTTA ACAGATATCA CCGAAGCCAT TAAACNAGCC 720

TCAGGAGTCG TCAAGAGGCT CTGCACCCTG GATGGAAAGC AGGTGAGAGT TACGTGTGTG 780

CATCTGCCAG ACTTTTTTGG TGATGACGAT GTTTTTATTG CATGTGGACC AGAAAAATTT 840

CGTTATGCCC AAGATGACTT TGTCCTGGAT CATAGTGAAT GTCGTGTCCT GAAGTCATCT 900

TATTCTCGAT CCTCAGCTGT TAAGTATTCT GGATCCAAAA GCCCTGGGCC CTCTCGACGC 960

AGCCAGATTT CTGCTCATGG CAGATCTTCT TCCAATGTAA ACGGTGGACC TGAGCTTGAC 1020

CGTTGCATAA GTCCTGAAGG TGTCAATGGA AACAGATGCT CTGAATCATC AACTCTTCTT 1080 

GAGAAATACA AAATTGGAAA GGTCATTGGT GATGGCAATT TTGCAGTAGT CAAAGAGTGT 1140 

ATAGACAGGT CCACTGGAAA GGAGTTTGCC CTAAAGATTA TAGACAAAGC CAAATGTTGT 1200 

GGAAAGGAAC ACCTGATTGA GAATGAAGTG TCAATACTGC GCCGAGTGAA ACATCCCAAT 1260

ATCATTATGC TGGTCGAGGA GATGGAAACA GCAACTGAGC TCTTTCTGGT GATGGAATTG 1320

GTCAAAGGTG GAGATCTCTT TGATGCAATT ACTTCGTCGA CCAAGTACAC TGAGAGAGAT 1380

GGCAGTGCCA TGGTGTACAA CTTAGCCAAT GCCCTCAGGT ATCTCCATGG CCTCAGCATC 1440

GTGCACAGAG ACATCAAACC AGAGAATCTC TTGGTGTGTG AATATCCTGA TGGAACCAAG 1500

TCTTTGAAAC TGGGAGACTT TGGGCTTGCG ACTGTGGTAG AAGGCCCTTT ATACACAGTC 1560

TGTGGCACAC CCACTTATGT GGCTCCARAA ATCATTGCTG AAACTGGCTA TGGCCTGAAG 1620

GTGGACATTT GGGCAGCTGG TGTGATCACA TACATACTTC TCTGTGGATT CCCACCATTC 1680

CGAAGTGAGA ACAATCTCCA GGAAGATCTC TTCGACCAGA TCTTGGCTGG GAAGCTGGAG 1740

TTTCCGGCCC CCTACTGGGA TAACATCACG GACTCTGCCA AGGAATTAAT CAGTCAAATG 1800

CTTCAGGTAA ATGTTGAAGC TCGGTGTACC GCGGGACAAA TCCTGAGTCA CCCCTGGGTG 1860

TCAGATGATG CCTCCCAGGA GAATAACATG CAAGCTGAGG TGACAGGTAA ACTAAAACAG 1920 
CACTTTAATA ATGCGCTCCC CAAACAGAAC ft 
GTGAGTGGAA GGCGGCAGGT CTGGCCTGAC T 

GGTAGCCGGG AGCTGCCCTC ACATGGAAGT TGGTGCCTTC CGTAGTCCTA T 
65 AGATTGGCTT GGCATGTGGA GGGCACTCAT TCGGCAACTC CCAGGCTTTG G 

AAGGAGCTC 2220

CTAGAGAGA 2280

1CCCAAGGCC 2340 

AAGCTAAACA TATTTCAGTT TTAAAAAATC AGTGTTTTAT AAAACACAGT 1 
A TATTCATTTT T 
^ GTGACAAC G A TGGCCTTATT TTTTTCAGCC T 
TCTTAAGGCA ACTCTCCTAA ATACATAAAC ACAACAAATT AAAATGAAAA G 

A AAAACATTGA ACCAGAGGTG AGGGCAGCAC ACCCGCAGCA 

3 CCTGAGCCAA TGCAACCCTG GGCGGGAAGG CCAGCTCACC GTGAGCAGGT 2760 
AGAAGCCAGC CAGCCACCCA GGCAGGGACC TTGGTTCTCC CCACACACTC C 
GGAACAGGGG TGGAGTGGCC TTTCCCAGAG CTGGAGTTGG CTGCAGCAGC T 

GACCTGCCAA GGTGATGGGC GTCTGAGTTT CACATCTGG3 CCCCCCGTGA CCCCACTGAG 2 940 
TCCTGACAGC TAAGG AT GGG C 

CTCATCCTCT GGGAAGGTCC TCCTTGTTTC CTACCCAACT AGAAGGGAAA C 

TTCTCATGGT ACATGGTTGT CTGAAAGCCT TACCTAGGAA GACC-CAG3GT CTAGATAGAA 312 0 
GCTATAAGGA AGCCACACAC ATAACCCACA TCCCCACACC CCCAACATCG C 
CCACACCCCC CACACCCCCC ACATCCCCAC CATAATTACC CCCACCTCCA A
Seq ID NO: 264 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

K GtJGLIPSEAH SAHCSFYRTR 

ALLIBLTRSL SDNVNLPQGV 
IHPNWSVNIK GGTSRALAAA 
l i , . ... AHSFEOVLTD ITE.UKXASG 

VVKRLCTLDG KQVRVTCVHL PDFFGDDDVF IACGPEKFRY AQDDFVLDHS
2 ISPEGVNGNR 



MLVEEMETAT ELFLVMELVK GGDLFDAITS STKYTERDGS AMVYNMNAL R 
RDIKPENLLV CEYPDGTKSL KLGDFGLATV VEGPLYTVCG TPTYVAPXII AETGYGLKVD 
IWAAGVITYI LLCGFPPFRS ENNLQEDLFD QILAGKLEFP APYWDNITDS AKELISQMLQ
VNVEARCTAG QILSHPWVSD DASQENNMQA EVTGKLKQHF N
GRRQVWPDCG AGLEVFELGS RELPSHGSWC LP 




I I I 

CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 60

TTGGTACCGG ATTTATACCA AAATAATGGA CTTGATTGGT ATTCAAACCA AGATATGTTG 120

GACGGTTACC AGAGGAGAAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 180

TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240 

CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 300 

CTTTTTCAAT CATGGAGAGT GTACCCGTGT AATGTGGACA CCACCTCTCC GTGAAAGCTT 360 

CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420 

ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 480 

GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540 

CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600

ACTTTGTTTT GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660

TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720

TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840

ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900

TGACTTTATG CAAAAACACA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960

TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080

ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140

GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200

CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260

AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320

AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 1380 

CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440 

TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500 

AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560 

AGAGTCATGG TGTGTAAGAA GATCGAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620

3 GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680 

T TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740 

A CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800

rCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860

T TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 1920 

CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980 

TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040

GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100

TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160

TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280 

CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340 
CCAAACTCTT GGACCTTGTG GT T AGGAAAT TCCCTTAACT T CCAGCCATA T 
GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT C 
AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT T 
ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT A 
CTTTAAACCA CTTTAAAGTT TTTTCATGTT T. 




TGGAACAGTA GACTGTAGTA CATGGTAATT 

TCTAAAAACC ATCATTTCAG TATAAGGAAT 

TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTC-TATGA 3000 

ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGC-CTGAC 3120 
AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT A 
TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCGCACT C 
GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC G 

CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360 

CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGAGG 3420

CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480

CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3540

TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600

CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660

TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AC-CAGTTACA 3720

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780

TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3840

CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 3900

GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960

TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020

TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140

AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200

TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260

AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320

GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380 



F NLYPEVILAS WYRIYTKIMD LIGIQTKICW 
ACFYVAVIFI LNGLMMALFF IYGTYLSGSR LGGLVTVLCF 

< LYRGSLIALC ISNVFFMLPW QFAQFVLLTQ 



ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFFSYKDFD TLLYTCAAEF 
DFMEKETPLR YTKTLLLPVV LVVFVAIVRK IISDMWGVLA KQQTHVRKHQ FDHGELVYHA
LQLLAYTALG ILIMRLKLFL TPHMCVMASL ICSRQLFGWL FCKVHPGAIV FAILAAMSIQ 
GSANLQTQWN IVGEFSNLPQ EELIEWIKYS TKPDAVFAGA MPTMASVKLS ALRPIVNHPH 
YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV 
EDPANAGKTP LCNLLVKDSK PHFTTVFQNS VYKVLEVVKE

Nucleic Acid Accession #: U26744.1
Coding sequence: 59..1600

1 11 21 31 41 51

CTTCAAAGAA TATAGATGGT TTTGAAAAGT TCATGCTGTC CCTTCATTGA ATTTTAGAAT
GATTGAAGAT AGTGGGAAAA GAGGAAATAC CATGGCAGAA AGAAGACAGC TGTTTGCAGA
GATGAGGGCT CAAGATCTGG ATCGCATCCG ACTCTCCACC TACAGAACAG CATGCAAGCT
TAGGTTTGTT CAGAAGAAAT GCAATTTGCA CCTGGTGGAC ATATGGAATG TCATAGAAGC
ATTGCGGGAA AATGCTCTGA ACAACCTGGA CCCAAACACT GAACTCAACG TGTCCCGCTT 
AGAGGCTGTG CTCTCCACTA TTTTTTACCA GCTCAACAAA CGGATGCCAA CCACTCACCA 
AATCCATGTG GAGCAGTCCA TCAGCCTCCT CCTTAACTTC CTGCTTGCAG CGTTTGATCC 
GGAAGGCCAT GGTAAAATTT CAGTATTTGC TGTCAAAATG GCTTTAGCCA CATTGTGTGG 
AGGGAAGATC ATGGACAAAT TAAGATATAT TTTCTCAATG ATTTCTGACT CCAGTGGGGT 



TTTGGAGGGT CCTTCATTTG GTTACACAGA ACACTCAGCC AGATCCTGTT TCTCCCAACA 660 

GAAAAAAGTC ACGTTAAATG GTTTCTTGGA CACGCTTATG TCAGATCCTC CCCCGCAGTG 720 

TCTGGTCTGG TTGCCTCTTC TGCATCGACT AGCAAATGTG GAAAATGTCT TCCATCCGGT 780 

TGAGTGTTCC TACTGCCACA GTGAGAGTAT GATGGGATTT CGCTACCGAT GCCAACAGTG 840 

TCACAATTAC CAGCTCTGTC AGGACTGCTT CTGGAGGGGA CATGCCGGTG GTTCTCATAG 900 

CAACCAGCAC CAAATGAAAG AGTACACGTC ATGGAAATCA CCTGCTAAGA AGCTGACTAA 960

TGCATTAAGC AAGTCCCTGA GCTGTGCTTC CAGCCGTGAA CCTTTGCACC CCATGTTCCC 1020 

AGATCAGCCT GAGAAGCCAC TCAACTTGGC TCACATCGTT GATACTTGGC CTCCCAGACC 1080 

TGTAACCAGC ATGAACGACA CCCTGTTCTC CCACTCTGTT CCCTCCTCAG GAAGTCCTTT 1140 

TATTACCAGG AGCATGCTTG AGAGTTCAAA CCGGCTTGAT GAAGAACACA GGCTAATTGC 1200 

CAGGTATGCG GCAAGGCTGG CAGCAGAGTC CTCTTCGTCT CAGCCACCTC AGCAGAGAAG 1260 

TGCTCCTGAC ATCTCTTTCA CCATCGATGC GAATAAGCAG CAAAGGCAGC TGATTGCTGA 1320 

GCTAGAAAAC AAGAACAGAG AAATCTTACA GGAGATCCAG AGACTTCGGC TAGAGCATGA 1380

ACAAGCTTCT CAGCCCACGC CAGAGAAGGC ACAGCAAAAC CCCACCCTGC TGGCAGAACT 1440

CCGGCTCCTC AGACAGCGCA AAGATGAGCT GGAACAGAGA ATGTCTGCTC TCCAGGAGAG 1500



T TTTTATAACT ATCACTACTA 
TTAGAAAAGG GAACGAATTG TCATTTATTG GAAACATTTT AGATCCCCAG 



AACCCACAGC ACTCAGAAG C 

ACCCACTCAC ACTGTGAGTA T 



ATAACTGACT ATTCACGTCC CATCTTTTTG GCTCTTCCCC 
TTCGGCTCCG 



Protein Accession #: AAC50426.1
MIEDSGKRGN TMAERRQLFA EMRAQDLDRI RLSTYRTACK LRFVQKKCNL HLVDIWNVIE

ALRENALNNL DPNTELNVSR LEAVLSTIFY QLNKRMPTTH QIHVEQSISL LLNFLLAAFD

PEGHGKISVF AVKMALATLC GGKIMDKLRY IFSMISDSSG VMVYGRYDQF LREVLKLPTE 

VLEGPSFGYT EQSARSCFSQ QKKVTLNGFL DTLMSDPPPQ CLVWLPLLHR LANVENVFHP

VECSYCHSES MMGFRYRCQQ CHNYQLCQDC FWRGHAGGSH SNQHQMKEYT SWKSPAKKLT 

NALSKSLSCA SSREPLHPMF PDQPEKPLNL AHIVDTWPPR PVTSMNDTLF SHSVPSSGSP 

FITRSMLESS NRLDEEHRLI ARYAARLAAE SSSSQPPQQR SAPDISFTID ANKQQRQLIA

ELENKNREIL QEIQRLRLEH EQASQPTPEK AQQNPTLLAE LRLLRQRKDE LEQRMSALQE
SRRELMVQLE GLMKLLKEEE LKQGVSYVPY CRS 

Seq ID NO: 269 DNA sequence 

Coding sequence: 127..1278

I I' I' I' 






A AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 
2 GTCTCAAACA GGCTTTGTGG TCCTGGTC-CT GCTCCAGTGC 
T ACAAACTGGT CTGCTACTAC ACCAGCTGGT CCCAGTACCG GGAAGGCGAT 
GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA CCCACATCAT CTACAGCTTT 
GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA ATGATGTGAC GCTCTACGGC 
ATGCT CAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCTJ- 
TGGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA ACACCCAGAG T 
TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT TTGATGC 

TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA CCCTAATCAA GGAAATGAAG 600 

GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC TCCTGCTCAG CGCAGCACTG 660 

TCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG CCAAGATATC CCAACACCTG 720 

GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT GGCGTGGGAC CACAGGCCAT 780

CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG ACAGATTCAG CAACACTGAC 840

TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA GTAAGCTGGT GATGGGCATC 900 

CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA CTGGTGTTGG AGCCCCAATC 960 

TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG GGACCCTTGC CTACTATGAG 1020 

ATCTGTGACT TCCTCCGCGG AGCCACAGTC CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080 

GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG AAAGCGTCAA AAGCAAGGTG 1140 

CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT GGGCCCTGGA CCTGGATGAC 1200 

TTCCAGGGCT CCTTCTGCGG CCAGGATCTG CGCTTCCCTC TCACCAATGC CATCAAGGAT 1260

GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC GGGGGCCAAG GATGCCCCGT 1320

CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380

GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC AACACACAGA TTTGAGCTCA 1440 

GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT AGTGAGGCAT CGCAATGTAA 1500 

GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA TGTTTACAGA TCCCCAAGCC 1560 

TGGCAACGGA ATTTCTTCAA CTCCCTGCCC CCTAGCCCTC CTTATCAAAG GACACCATTT 1620 

TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA CACAGTGACC ATACTAATTA 1680 

TACCCCCTGC AAAGCCAGCT TGAAACCTTC ACTTAGGAAC GTAATCGTGT CCCCTATCCT 1740 

ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA AGAGTTTAAC AGTGTGTTGG 1800 

CGCTTTGCTT TGGTCTATCT TTGAGCGCCC ACTAGACCCA CTGGACTCAC CTCCCCCATC 1860 

TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC TTGCAGAGAT GAAGGCCGCC 1920 
ATGTT 



1 11 21 31 41 51 

| | | | | |

MGVKASQTGF VVLVLLQCCS AYKLVCYYTS WSQYREGDGS CFPDALDRFL CTHIIYSFAN
ISNDHIDTWE WNDVTLYGML NTLKKRNPNL KTLLSVGGWN FGSQRFSKIA SMTQSRRTFI 
KSVPPFLRTH GFDGLDLAWL YPGRRDKQHF TTLIKEMKAE FIKEAQPGKK QLLLSAALSA 
GKVTIDSSYD IAKISQHLDF ISIMTYDFHG AWRGTTGHHS PLFRGQEDAS P 
VGYMLRLGAP ASKLVMGIPT FGRSFTLASS ETGVGAPISG PGIPGRFTKE A
DFLRGATVHR TLGQQVPYAT KGNQWVGYDD QESVKSKVQY LKDRQLAGAM V
GSFCGQDLRF PLTNAIKDAL AAT



Coding sequence: 181..669

| | | | | |
GCTGCCTAGG GTCTGGAAAG CTCGGGCACC CTCCCTCTCC GGGGCTCCTG CTCCCACCCC
TCCGGCCCCC CCACCGTCGC GCTCCTCCAG GCTGGGCCTG TGGCCGCGGT GCTTTTAATT
TTCCCCCAGC TCAGAATCTT GCTGCTCGGC CCCCAGGAGA GCAACAACTC AACGGGAACG
ATGTGGAAGG TGTCAGCTCT GCTCTTCGTT TTGGGAAGCG CGTCGCTCTG GGTCCTGGCA
GAAGGAGCCA GCACAGGCCA GCCAGAAGAT GACACTGAGA CTACAGGTTT GGAAGGCGGC
GTTGCCATGC CAGGTGCCGA AGATGATGTG GTGACTCCAG GAACCAGCGA AGACCGCTAT
AAGTCTGGCT TGACAACTCT GGTGGCAACA AGTGTCAACA GTGTAACAGG CATTCGCATC
GAGGATCTGC CAACTTCAGA AAGCACAGTC CACGCGCAAG AACAAAGTCC AAGCGCCACA
GCCTCAAACG TGGCCACCAG TCACTCCACG GAGAAAGTGG ATGGAGACAC ACAGACAACA 
GTTGAGAAAG ATGGTTTGTC AACAGTGACC CTGGTTGGAA TCATAGTTGG GGTCTTACTA 
GCCATCGGTT TCATTGGTGG AATCATCGTT GTGGTTATGC GAAAAATGTC GGGAAGGTAC 
TCGCCCTAAA GAGCTGAAGG GTTACGCCCT GCTTGCCAAC GTC-CTTTAAA AAAAGACCGT 
TTCTGACTCT GTGGCCCTGT CCCTGAGCTC GTGGGGAGAA GATGACCCTG GGAACATTTG 
CGGGCCCATT CAGATTCCAC GGTGACTTTC CGTTTGCCAA ATTAACCGAG G.
TCACCAGATT TGGTTCTTAA ACTTT
Protein Accession #: NP_00S46



LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 
EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 
AIGFIGGIIV VVMRKMSGRY SP






I I I I I I 

GCGGCCGCCA GCTTGCAAAG CCGAAGTCTG GCCGCGCTCT TCGACTCGCT GCGCCACGTC 
CCCGGGGGTG CCGAGCCGGC GGGGGGTGAG GTGGCTGCGC CGGCGGCCGG GCTAGGAGGT 
GCGGGCACTG GGGGCGCGGG AGGGGACGTG GCAGGCCCCG CGGGGGCCAC GGCGATCCCA 

GGGGCCAGGA AGGTCCCGCT GCGGGCACGC AATCTGCCTC CGTCCTTCTT CACGGAGCCG
TCCCGGGCAG GCGGCGGCGG GTGTGGCCCG TCGGGGCCGG ACGTGAGCTT GGGCGACCTG 
GAGAAGGGCG CGGAGGCCGT GGAGTTCTTT GAGCTGCTGG GGCCCGACTA CGGCGCCGGC 
ACGGAGGCGG CAGTCTTGCT TGCCGCCGAG CCTCTCGACG TGTTCCC 
GTACTGCGGG GACCCCCGGA GCTGGAGCCC GGCCTCTTTG AGCCGCCGCC G 

GGAAACCTAC TGTACCCCGA GCCCTGGAGC GTCCCGGGCT GCTCCCCGAC C



Seq ID NO: 274 DNA sequence



CAAAGAGGCC GGGCTCCAGC TCCGGGGGTC CCCGCAGTAC GGAGGCTCCG GCGGGGAACA 
_T A AGG CTCGGCGGCA AGCAAGACTG CCGCCTCCGT GCCGGCGCCG TAGTCGGGCC 
CCAGCACCTC AAAGAACTCC ACGGCCTCCG CGCCCTTCTC CAGGTCGCCC AAGCTCACGT 
CCGGCCCCGA CGGGCCACAC CCGCCGCCGC CTGCCCCCCA CGGCTCCGTC AAGAAGGACG 
GAGGCAGATT GCGTGCCCGC AGCGGGACCT TCCTGGCCCC TGGGATCGCC GTGGCCCCCG 
CGGGGCCTGC CACGTCCCCT CCCGCGCCCC CAGTGCCCGC ACCTCCTAGC CCGGCCGCCG 
GCGCAGCCAC CTCACCCCCC GCCGGCTCGG CACCCCCGGG GACGTGGCGC AGCGAGTCGA 
AGAGCGCGGC CAGACTTCGG CTTTGCAAGC TGGCGGCCGC 

Seq ID NO: 275 DNA sequence



31 

AGCCCAGAGA CACATTGGGG CTGACCTGCC GCTGCTGTCA GTGGGAGGCC AGTGGTGCTG
GCCAAGAAGT GTCATGGCTG GTGTCGTGCA CGTTTCCCTG GCTGCTCACT GCGGGGCCTG 
TCCGTGGGGC CGGGGCAGAC TCCGCAAAGG ACGCGCAGCC TGCAAGTCCG CGGCCCAGAG 
ACACATTGGG GCTGACCTGC CGCTGCTGTC AGTGGGAGGC CAGTGGTGCT GGCCAAGAAG 
TGTCATGGCT GGTGTCGTGC ACGTTTCCCT GGCTGCTCTC CTCCTGCTGC CTATGGCCCC 

TGCCATGCAT TCTGACTGCA TCTTCAAGAA GGAGCAAGCC ATGTGCCTGG AGAAGATCCA
GAGGGCCAAT GAGCTGATGG GCTTCAATGA TTCCTCTCCA GGCTGTCCTG GGATGTGGGA
CAACATCACG TGTTGGAAGC CCGCCCATGT GGGTGAGATG GTCCTGGTCA GCTGCCCTGA
GCTCTTCCGA ATCTTCAACC CAGACCAAGT CTGGGAGACC GAAACCATTG GAGAGTCTGA 



C CCTTCCCTCA TTACTTTGAT GCCTGTGGGT TTGATGAATA 
TGAATCTGAG ACTGGGGACC AGGATTATTA CTACCTGTCA GTGAAGGCCC TCTACACGGT 
C ACATCCCTCG TCACCCTCAC CACTGCCATG GTCATCCTTT GTCGCTTCCG 
C TGCACACGCA ACTTCATCCA CATGAACCTG TTTGTGTCGT TCATGCTGAG 
GGCGATCTCC GTCTTCATCA AAGACTGGAT TCTGTATGCG GAGCAGGACA GCAACCACTG 
CTTCATCTCC ACTGTGGAAT GTAAGGCCGT CATGGTTTTC TTCCACTACT GTGTTGTGTC 



CTTCCCTGAA AGGAGATACT TCTACTGGTA CACCATCATT GGCTGGGGGA CCCCAACTGT 
A GTGTGGGCTA CGCTGAGACT CTACTTTGAT GACACAGGCT GCTGGGATAT 
3CTCTGT GGTGGGTGAT CAAAGGCCCT C-TGGTTGGCT CTATCATGGT 
TAACTTTGTG CTTTTTATTG GCATTATCGT CATCCTTGTG CAGAAACTTC AGTCTCCAGA
CATGGGAGGC AATGAGTCCA GCATCTACTT GCGACTGGCC CGGTCCACCC TGCTGCTCAT
CCCACTATTC GGAATCCACT ACACAGTATT TGCCTTCTCC CCAGAGAATG TCAGCAAAAG
GGAAAGACTC GTGTTTGAGC TGGGGCTGGG CTCCTTCCAG GGCTTTGTGG TGGCTGTTCT
CTACTGTTTT CTGAATGGTG AGGTACAAGC GGAGATCAAG CGAAAATGGC GAAGCTGGAA
GGTGAACCGT TACTTCGCTG TGGACTTCAA GCACCGACAC CCGTCTCTGG CCAGCAGTGG 
GGTGAATGGG GGCACCCAGC TCTCCATCCT GAGCAAGAGC AGCTCCCAAA TCCGCATGTC 
TGGCCTCCCT GCTGACAATC TGGCCACCTG AGCCATGCTC CCCT 

Seq ID NO: 276 Protein sequence 



AHCGACPWGR GRLRKGRAAC KSAAQRHIGA DLPLLSVGGQ WCWPRSVMAG
VVHVSLAALL LLPMAPAMHS DCIFKKEQAM CLEKIQRANE LMGFNDSSPG CPGMWDNITC
WKPAHVGEMV LVSCPELFRI FNPDQVWETE TIGESDFGDS MSLDLSDMGV VSRNCTEDGW 
SEPFPHYFDA CGFDEYESET GDQDYYYLSV KALYTVGYST SLVTLTTAMV ILCRFRKLHC 
TRNFIHMNLF VSFMLRAISV FIKDWILYAE QDSNHCFIST VECKAVMVFF HYCWSNYFW 
J LFIEGLYLFT LLVETFFPER RYFYWYTIIG WGTPTVCVTV WATLRLYFDD TGCWDMMDST 
j FIGIIVILVQ KLQSPDMGGN ESSIYLRLAR STLLLIPLFG
P ENVSKRERLV FELGLGSFQG FVVAVLYCFL NGEVQAEIKR KWRSWKVNRY
FAVDFKHRHP SLASSGVNGG TQLSILSKSS SQIRMSGLPA DNLAT 

Seq ID NO: 277 DNA sequence

Nucleic Acid Accession #: NM_004000.1



| | | | | |

AGAAGAAGCT GGCCAAGGAT ATGGGAGCAA CCACCATGGA CCAGAAGTCT CTCTGGGCAG 
GTGTAGTGGT CTTGCTGCTT CTCCAGGGAG GATCTGCCTA CAAACTGGTT TGCTACTTTA 
CCAACTGGTC CCAGGACCGG CAGGAACCAG GAAAATTCAC CCCTGAGAAT ATTGACCCCT 
TCCTATGCTC TCATCTCATC TATTCATTCG CCAGCATCGA AAACAACAAG GTTATCATCA 

AGGACAAGAG TGAAGTGATG CTCTACCAGA CCATCAACAG TCTCAAAACC AAGAATCCCA
AACTGAAAAT TCTCTTGTCC ATTGGAGGGT ACCTGTTTGG TTCCAAAGGG TTCCACCCTA
TGGTGGATTC TTCTACATCA CGCTTGGAAT TCATTAACTC CATAATCCTG TTTCTGAGGA
ACCATAACTT TGATGGACTG GATGTAAGCT GGATCTACCC AGATCAGAAA GAAAACACTC
ATTTCACTGT GCTGATTCAT GAGTTAGCAG AAGCCTTTCA GAAGGACTTC ACAAAATCCA

CCAAGGAAAG GCTTCTCTTG ACTGCGGGCG TATCTGCAGG GAGGCAAATG ATTGATAACA
GCTATCAAGT TGAGAAACTG GCAAAAGATC TGGATTTCAT CAACCTCCTG TCCTTTGACT 
TCCATGGGTC TTGGGAAAAG CCCCTTATCA CTGGCCACAA CAGCCCTCTG AGCAAGGGGT 
GGCAGGACAG AGGGCCAAGC TCCTACTACA ATGTGGAATA TGCTGTGGGG TACTGGATAC 



CACTGGCCTC TGCAGAAACC ACCGTGGGGG CCCCTGCCTC TGGCCCTGGA GCTGCTGGAC
CCATCACAGA GTCTTCAGGC TTCCTGGCCT ATTATGAGAT CTGCCAGTTC CTGAAAGGAG 
CCAAGATCAC GCGCCTCCAG GATCAGCAGG TTCCCTACGC AGTCAAC 
TGGGCTATGA TGATGTGAAG AGTATGGAGA CCAAGGTTCA ' GTTCTTAAAG A 
TGGGAGGAGC CATGATCTGG TCTATTGACA TGGATGACTT CACTGGCAAA TCCTGCAACC 



CTCATGTGGC A 
CCTGAAGTAC AATAAAAAAA ATTCATTTTG CTCCAGTA 
Seq ID NO: 278 Protein sequence 



MDQKSLWAGV VVLLLLQGGS AYKLVCYFTN WSQDRQEPGK FTPENIDPFL CSHLIYSFAS
IENNKVIIKD KSEVMLYQTI NSLKTKNPKL KILLSIGGYL FGSKGFHPMV DSSTSRLEFI
NSIILFLRNH NFDGLDVSWI YPDQKENTHF TVLIHELAEA FQKDFTKSTK ERLLLTAGVS
AGRQMIDNSY QVEKLAKDLD FINLLSFDFH GSWEKPLITG HNSPLSKGWQ DRGPSSYYNV 
EYAVGYWIHK GMPSEKWMG IPTYGHSFTL ASAETTVGAP ASGPGAAGPI TESSGFLAYY 



Seq ID NO: 279 DNA sequence



TGCTGGAAGT CCCTCACCCA GAGACCAGTG CTCCCAACGG CAGAGCAGCG GGGGAGATAA 
AGAACTGGTG ACACGTGGCT GTACATTCAG CACAGCTGTG GTGTCCCCAA GTGCCATGAC 
CCAGGAGCCA TTCAGAGAGG AGCTGGCCTA TGACCGGATG CCCACGCTGG AGCGGGGCCG
GCAAGACCCC GCCAGCTATG CCCCAGACGC GAAGCCGAGC GACCTGCAGC TGTCGAAGAG 
ACTGCCCCCC TGCTTCAGCC ACAAGACGTG GGTCTTCTCT GTGCTGATGG GGAGCTGCCT 



GAGGAACGCC AATGTGATTC CCAACTTTCA GATATTGTTT GTTTCCACGT TTGCTGTGAC 480 

CACTACGTGT TTAATTTGGT TTGGATGCAA ACTAGTCCTG AACCCATCAG CAATAAACAT 540 

CAACTTCAAC CTCATCCTGC TGCTCCTGCT GGAGCTGCTC ATGGCGGCCA CGGTGATCAT 600 

CGCTGCACGG TCCAGCGAGG AGGACTGCAA GAAAAAGAAG GGCTCCATGT CTGACAGCGC 660

CAACATTCTG GACGAAGTGC CATTTCCTGC TCGGGTCCTG AAATCTTACT CAGTCGTCGA 720

GGTAATCGCA GGCATCTCTG CCGTCCTCGG GGGGATCATT GCCCTGAACG TGGATGACTC 780

AGTTTCAGGC CCACACCTCT CAGTGACGTT CTTTTGGATC CTAGTGGCCT GCTTTCCAAG 840

TGCCATTGCC AGTCATGTGG CAGCAGAGTG TCCCAGCAAG TGTCTGGTGG AGGTCCTGAT 900

TGCCATAAGC AGCCTCACGT CTCCGCTGCT GTTCACAGCC TCTGGATATC TGTCATTCAG 960

CATCATGAGA ATCGTGGAGA TGTTTAAGGA TTACCCGCCA GCCATAAAAC CATCCTACGA 1020

TGTGCTGCTG CTGCTGCTGC TGCTAGTGCT CCTGCTGCAG GCCGGCCTCA ACACGGGCAC 1080

CGCCATCCAG TGCGTGCGCT TCAAGGTCAG TGCAAGGCTG CAGGGTGCAT CCTGGGACAC 1140 

CCAGAACGGC CCGCAGGAGC GCCTGGCTGG GGAGGTGGCC AGGAGCCCCC TGAAGGAGTT 1200 

CGACAAGGAG AAAGCCTGGA GAGCCGTCGT GGTGCAAATG GCCCAGTGAC CCCCAGACGC 1260 

GGAAACCGGG TGGCAGCGCC CAGCCTGGCC CCAAGCATGC- AAACGCACAA CCCCTAATCG 1320 

CCCTGAGCTA CTGCTTCTAA CACCTCTTTT CCCTTGTGTG AGGGCAAACC AGGCTGCAGG 1380 

TGGGGTTTTC ACTTCCTAGG GTAGTTTAAT TTTAAAATAG GCCAATGTTG GCTAGTCTGT 1440 
GCCTCAGTGA GATCAGTCAG CTCCGAGTGG CTCCCGTGTC GTAACAGCAG G
GCAACTTCCC AGGCCGAGGA AGGGCCCCCG GCTCGGCCTC TTGAGAGCCC CACCCCTGAA
CTGGCCCCAG CTCCTCTTCC TGCCTCTCTC ATGGCTTGGG CTGGAGTGGG CTCTCTGGAC 
CTGACCAGAC TGTGGGTCCC TGCGTCTCCT GCCCACTCTG ACCGGGCTTC CTCCCTCCAC 
GCTTAGGGTC TGTCCCGGGT ACTCAGTCAG CCCAGTGGGA TCTTACCCAC TTCCCTGCAA 
GGTGCACCTG CCCCAGGCTC AGGCTGCCCA GCGGCTCTTC CTGGACAGTG AGAGCAGGGC 
TTGGCGCCTC TGTCCTGGCC CGGGAGCCGC AGGGGCCCCT CCTCCAGAGC CTGGGCGCAA 
GCGACACAGG CTGCCGCTGC TCTCCCAGGT GAAATCCACA CCAGTCCACG CCGGGTCGCC 
TGCCCTGTCT CCCTACTTAG ACCCAGTCAT TCTAGAGGGA TCCACCGCCA CACTGGCCGG 
CCCACGTCCT GGGTGCTGTC ATGCCCAGCT TGGAGTGCCA CGTGGCCGCT GCCCACGTCC 
CGGGCACTGT CATGCCCAGC TTGGAGTGCC ACATGGCCC-C TC-CCCACGTC CCGGGCACTG 2100 
TCATGCCCAG CTTGGAGTGC CACGTGGCCG CTGCTGTGAC AGGCAGTGTT CTTGGGGGTG 2160 
GGGCTGCATC CAAGGCTTTG TAAACCGGCT GGACCACGTC TCCCTGGCCC CAGTGACCGG 2220 
GGGAAGCTGA GCCCCTCCCT CCTGTGTTTG CTCCCATTAC TCAAAATGCA GGACAGATCA 2280 
GGTCAGAGCC CAGGAATTCT CACAGGTTCA CCCAGCGCCC TCTACCTCCT AGCAAGTACT 2340 
TTGTCTTGAT CCTCACTGAG AAGGCCCCAG GGCAGCGGTC TTCTCCATCT CCGCTGTTTT 2400 
GGGGTCTTAG GGTACAGCCC AGGCGGTCAC TGCCCACCTG CCAGGCTGCA GGGACAGTTG 2460 
GGTGTGAGAA TAACACTGGC TTTGGGTAGT GCCATGGCCA GGAGTGGGTT TCCCTGCGTC 2520 

A GGTGTTTTCC 2640 



GTGGCCTGGG GCCATCAAGA TCAAAGAACC AGGAGGCCTG GGAGATGCAG CTGGATGGGG 2820
CGGCCTGCAG ACCCTGCCAG GGGGTTTGAG G 
GAGTGACTCT GGCTGCCAAG ATACCTI 
CAACCATTGA AGGGGGATGC AGGCAAGACA CCTTCCCAGC TGCTCCTAGA GGGGACAAGC 3000 
CAGGCCCTCT CTGCAGTCCT CGGCAGCTCC GGAAGGACAC AGTCAGGGGC CGGGCAAACA 3060 
CTTTGGCCAC AGCCCCAAAC AAGCGCCACC GTGGGAGAGG A 



Protein Accession #: NP_055981.1

1 11 21 31 41 51

| | | | | |

MTQEPFREEL AYDRMPTLER GRQDPASYAP DAKPSDLQLS KRLPPCFSHK TWVFSVLMGS
CLLVTSGFSL YLGNVFPAEM DYLRCAAGSC IPSAIVSFTV SRRNANVIPN FQILFVSTFA
VTTTCLIWFG CKLVLNPSAI NINFNLILLL LLELLMAATV IIAARSSEED CKKKKGSMSD
SANILDEVPF PARVLKSYSV VEVIAGISAV LGGIIALNVD DSVSGPHLSV TFFWILVACF
PSAIASHVAA ECPSKCLVEV LIAISSLTSP LLFTASGYLS FSIMRIVEMF KDYPPAIKPS
YDVLLLLLLL VLLLQAGLNT GTAIQCVRFK VSARLQGASW DTQNGPQERL AGEVARSPLK
EFDKEKAWRA VVVQMAQ



Seq ID NO: 281 DNA sequence
Nucleic Acid Accession # 



1 11 21 31 41 51 

I I I I I I 

GCTGAGCCTG AGCCCGACCC GGGGCGCCTC CCGCCAGGCA CCATGGTGCA GAAGTCGCGC 

AACGGCGGCG TATACCCCGG CCCGAGCGGG GAGAAGAAGC TGAAGGTGGG CTTCGTGGGG 

CTGGACCCCG GCGCGCCCGA CTCCACCCGG GACGGGGCGC TGCTGATCGC CGGCTCCGAG 

GCCCCCAAGC GCGGCAGCAT CCTCAGCAAA CCTCGCGCGG GCGGCGCGGG CGCCGGGAAG 

CCCCCCAAGC GCAACGCCTT CTACCGCAAG CTGCAGAATT TCCTCTACAA CGTGCTGGAG 

CGGCCGCGCG GCTGGGCGTT CATCTACCAC GCCTACGTGT TCCTCCTGGT TTTCTCCTGC

TACATCCTGG AAATCGTGAC TATCGTGGTG TTTGGCGTGG AGTACTTCGT GCGGATCTGG 

GCCGCAGGCT GCTGCTGCCG GTACCGTGGC TGGAGGGGGC GGCTCAAGTT TGCCCGGAAA 
CCGTTCTGTG T 



GCACTCTGGT GGGGCCTGAT CACGCTGACC ACCATTGGCT ACGGGGACAA GTACCCCCAG 
ACCTGGAACG GCAGGCTCCT TGCGGCAACC TTCACCCTCA TCGGTGTCTC C 
CTGCCTGCAG GCATCTTGGG GTCTGGGTTT GCCCTGAAGG T 
AAGCACTTTG AGAAGAGGCG GAACCCGGCA G 
TACGCCACCA ACCTCTCGCG CACAGACCTG CACTCCACGT GGCAGTACTA CGAGCGAACG 
GTCACCGTGC CCATGTACAG ACTTATCCCC CCGCTGAACC AGCTGGAGCT GCTGAGGAAC
CTCAAGAGTA AATCTGGACT CGCTTTCAGG AAGGACCCCC CGCCGGAGCC GTCTCCAAGC
CAGAAGGTCA GTTTGAAAGA TCGTGTCTTC TCCAGCCCCC GAGGCGTGGC TGCCAAGGGG
AAGGGGTCCC CGCAGGCCCA GACTGTGAGG CGGTCACCCA GCGCCGACCA G 
GACAGCCCCA GCAAGGTGCC CAAGAGC 
GC1TTCCGCA TCAAGGGTGC CGCGTCACGG CAGAACTCAG AAGAAGCAAG CCTCCCCGGA 
GAGGACATTG TGGATGACAA GAGCTGCCCC TGCGAGTTTG TGACCGAGGA CCTGACCCCG 
GGCCTCAAAG TCAGCATCAG AGCCGTGTGT GTCATGCGGT TCCTGGTGTC CAAGCGGAAG 
TTCAAGGAGA GCCTGCGGCC CTACGACGTG ATGGACGTCA TCGAGCAGTA CTCAGCCGGC 
CACCTGGACA TGCTGTCCCG AATTAAGAGC CTGCAGTCCA GAGTGGACCA GATCGTGGGG
CGGGGCCCAG CGATCACGGA CAAGGACCGC ACCAAGGGCC CGGCCGAGGC GGAGCTGCCC 
GAGGACCCCA GCATGATGGG ACGGCTCGGG AAGGTGGAGA AGCAGGTCTT G
AAGAAGCTGG ACTTCCTGGT GAATATCTAC ATGCAGCGGA TGGGCATCCC CCCGACAGAG 1920
ACCGAGGCCT ACTTTGGGGC CAAAGAGCCG GAGCCGGCGC CGCCGTACCA CAGCCCGGAA 1980
GACAGCCGGG AGCATGTCGA CAGGCACGGC TGCATTGTCA AGATCGTGCG CTCCAGCAGC 2040
TCCACGGGCC AGAAGAACTT CTCGGCGCCC CCGGCCGCGC CCCCTGTCCA GTGTCCGCCC 2100
TCCACCTCCT GGCAGCCACA GAGCCACCCG CGCCAGGGCC ACGGCACCTC CCCCGTGGGG 2160
GACCACGGCT CCCTGGTGCG CATCCCGCCG CCGCCTGCCC ACGAGCGGTC GCTGTCCGCC 2220
TACGGCGGGG GCAACCGCGC CAGCATGGAG TTCCTGCGGC AGGAGGACAC CCCGGGCTGC 2280
AGGCCCCCCG AGGGGACCCT GCGGGACAGC GACACGTCCA TCTCCATCCC GTCCGTGGAC 2340
CACGAGGAGC TGGAGCGTTC CTTCAGCGGC TTCAGCATCT CCCAGTCCAA GGAGAACCTG 2400
GATGCTCTCA ACAGCTGCTA CGCGGCCGTG GCGCCTTGTG CCAAAGTCAG GCCCTACATT 2460
GCGGAGGGAG AGTCAGACAC CGACTCCGAC CTCTGTACCC CGTGCGGGCC CCCGCCACGC 2520
TCGGCCACCG GCGAGGGTCC CTTTGGTGAC GTGGGCTGGG CCGGC-CCCAG GAAGTGAGGC 2580
GGCGCTGGGC CAGTGGACCC GCCCGCGGCC CTCCTCAGCA CGGTGCCTCC GAGGTTTTGA 2640
GGCGGGAACC CTCTGGGGCC CTTTTCTTAC AGTAACTGAG TGTGGCGGGA AGGGTGGGCC 2700
CTGGAGGGGC CCATGTGGGC TGAAGGATGG GGGCTCCTGG CAGTGACCTT TTACAAAAGT 2760
TATTTTCCAA CAGGGCACTC CCAGGCCCTG TCGCCATTGA GGTGCCTCCG CTGGGCTGTC 2820
TCCTCACCCC TCCCTGTGCT GGAGCCTGTC CCAAAAAGGT GCCAACTGGG AGGCCTCGGA 2880
AGCCACTGTC CAGGCTCCCA CTGCCTGTCT GCTCTGTTCC CAAAGGCAC-C GTGTGTGGCC 2940
TCGGGCCCTG CGGTGGCATG AAGCATCCCT TCTGGTGTGG GCATCGCTAC GTGTTTTGGG 3000
GGCAGCGTTT CACGGCGGTG CCCTTGCTGT CTCCCTTGGG CTGGCTCGAG CCTGGGGTCC 3060
ATGTCCCTTT GCCGTCCCGT CATGGGGCAG GGAATCCATA GCGGGGCCCA CAGGCAGGGG 3120
TATGAGTGCG TCCCACCCAA CGCAGCACCA GCCCCGGCCA CCGCTCCCCG TGTCCCCAGT 3180
TCCGTCTCAG CTACCTGGAC TCCAGGACCC TGGAGAAGGG AGACCTGGCA GTGGAGGGAG 3240
GCTGTGCTGT GTGTCCCCCT GCAGGTGTGA CCCCGCCTGC TCTTTCCTCC CCCGCCAGGT 3300
GTGGCCCCGC CTGCTCTTTC CTCCCCCACC AGTATGGCCC CACCTGCTCT TTCCTNCCCC 3360
CCCAAGGTGT GGCCCCACCT GTTCTTTCCT CCCCTGCCGA GGTGTGACCC CACCTGCTCT 3420
TTCCTCCCTC CCAGTATGGC CCCACCTGCT CTTTCCTCCC CCGAGGTGAG GCCCCGCCTG 3480
CTCTTTCCTC CCATGGGAGC CGCTGAGGCG TGCGCACCTG GGCACAGGTT GGGGCTCTGC 3540
AGGATGAGGA AGACAGGCCA ATCCCTTCCC TCCCAGAAGC TGGCCGCCCA GCAGGAGGGA 3600
CTGAGGCCAG ACTCATGTCC AGCAAGGAAC GTGTGGTGTG TCCCCTGGGA AGTCTCTGGG 3660
CCCTGGGAAG AGGGAAGGTG CACGTCCTGG GATGGTTGCG GGGCCCTGTT TTGGGAGACA 3720
AAGGGGTAGA GGGTCTGTCT TGNGCCCCCC CAGACTCTAG CCCGAGCAGT GCAGCCACCT 3780
ACTGCCCCAC CTCAGAGAAG TGCAGCGGGA AGGAGGCTGG AGGTGGTGCG GCGCTGCCTC 3840
GGGTGTCTGC GTGAATGAGC GTGGCCAAGG ACCAGTGCCA CCTCATGGCA AAGAGCTCCC 3900
GCAGTGTTTG TTAGAGTGCA CATCCTACGT GCCCACTGGC ACACACACGT GCTCACATAC 3960

ANGTCNGNGT ACAGGNGTAC ACATGCANGC TTGCACACAT GCACACAGAC CACATAGCAC 4020 
ACATGTGCAC TGACCACACC TGTATAGACC ATGCACAGTA CACATACGTG CATACACATG 4080
CCTGCATACA GGCATACACA TGCACGCTTA CATGTACACG TGCACAGATC ACACACATGC 4140
ACACACGTGT AGCTCACACA CAGTATACAC ATACACAAGT GCACAGACCA CACACAGCAC 4200
TAACACATGC ACACACAAAG TGCATAGGCC ACACAGCACA TGCACACAGG TGCACAGACC 4260

ACACAGCACA CACAAGTGCA CAGAGCACAC TGCACACATG CACACACACA CGCGTGCATG 4320
CACACTCCTC GCACTTCCAG CCTTGGAGCC CTTCTGTCTC TGGTCTTTCT CTTTGACCCT 4380
GCTGAGTGTA AGCTGCCTGG GGAGGGGCTA CAAGGAGTAA TTGTGGCTTT AGGGGTCGTG 4440
GTGATGCTGG AATGTCAAGC GCCGTCGTGG GGTATCCGAC TGTCCGGGCT CCTGGTCCGC 4500
AGTGGCAGAG CGCCAGGCAG AGCCAATCAG GGTCTCGTGC TGCCCTTCCC CCCCACAGCC 4560

TGGCAGCCAT CCAGAGCAGC GGCTCTACCA GATGCCAAGG TGCCCCGGTG TCTGTATGGG 4620
TGTCCGGTTG GGTCCTGTGT TTGGTCTGCC CTGGAGGTGG CTGGGCCCTC CTGGGATGGG 4680
TGGCTCAGCC TCGAATCCCA GGCCCCAGCC CAGGCAGGTG CTGCTGCCTG TTGTGGTTTC 4740
CTGGCCCAGC TTCTCCTTCT CCCTCTGCAT AAAATCACAG TCCGTGAGTC TTCCAGCTGC 4800
CACCACGGCT GGGACACGCT GGGGGAGGGC TCCTCCCATG CCTCCTGCAC ACAGCCGTCT 4860
GAGCAGGGCA GGTGCCAACA CCCCCCACCG GAGACACGCT GCCCCTCAGC GATGCCCCTA 4920
CCTTTTGGGG GGCCTCGTCT CAAGCCCCCC CTTGGAGGCT GAAATCACCC CAGGCACTGT 4980
GAGGGCTTCT CCAGGGGGAC ACCCTTTGAG CTGTGGGTCT GATCACCCCA AGTCCCGCAC 5040
ACGGAGGAGA GGCACAGCCA GGGCGTGTGG TTTAATGTTT GCCCCTTCGG GGCTGGAGGT 5100
CTCAGTGTTT CTAGATTCCA GACCCTGCTG CCAGAGAGAC CTGCTGCCGG AGAGAAGGGG 5160
AGGAGGACTC CAGCTGGGCT CGGTCCCCCA CAGTCAGGGA CCCCCATAAA GGACACCCCC 5220
TTCTCTCTAG AAAGAGCTGG GCTCTCAGCT ATTTCTAGTT GCTTCCCAGA AGCCGAGGAG 5280
CAGAAGGAGC TGTGAGAGCT TTGCAGAAAC GCCCTTGTCC CCGCCCTCCT GAGCTATGAA 5340
TGCCGTACAG AGCAGAGGCT GGGGCATTGG CAAGATCACA GGTTGATGCT GCACAGCCCC 5400
ATTGACACAA ACCCTCAAAG CAGACGTGAG AGGGACGGTT CACAAAGCTT GGACCTGCCG 5460
TGGAGGGTGC CCGGCAGACG TGGCGTGAGA GGGACGGCTC ACGAGGCTTG GACCTGCTGT 5520
GGAGGGTGCC CAGCAGACGT GGTGTGAGAG GAACGGCTCA CGAGACTTGG ACCTGGTGGA 5580
GGGTGCCCAG CAGACGTGGT GTGAGAGGGA CGGCTCACAG GGCTTGGACC GGAGAGAGAT 5640
GGCTCATGAG ACTTGGACCT GCCGTGGAGG GTGCCCAGCA GACGTGGTAT GAGAGGGATG 5700
GCTCACGAGG CTTGGACCTG GTGGAGGGTG CCCGGCAGAC GTGTGAGAGG GACGGTTCAC 5760
AAGGCTTGGA CCTGCCATGG AGGGTGCCCA GCAGACGTGG TGTGAGAGGG ACAGCTCACG 5820 
AGGCTTGGAC CTGCCGTGGA GGGTGCCCAG CAGGGGGCTG AGCTCTGAGG GGTGGGTGCT 5880 
CAGTGCACGG GTGCCCCCAG TGTCCTCTGA TCCTGTCCGG TGCCTCCCCC AACCCCCACA 5940 
CCCATGCAGA ACTCCCAGGT CACATG CACG TATGTCCAGG GCATGGGGGT GGCGTGAAGA 6000 
70 GGCCTGGTCA GGGCCTTTAG GGGCTGCAGG ACGGAATGGC CACCTGGGGA GCCTGTGTGG 6060 
CTGTGCCGGG CAGCCATCCT GCATTCCCAC CCAGCGCGCA GTCTCCACCT CGGCCCCAGC 6120 
AAAGCGCTAA GCAGCCGGAG AGACAGCCAG GGCGGCTTCC TGAAGGATGT GGGATGGTGG 6180 
ACTCCGGGGT CGAGGGAATA CGCAGGTTCC TGTCCTCCGG GAGACCTAGA GAAGCTGCAC 6240 
ACCCAGGAGC TTTCCATGAC CCGGGAGCAT GAGTGAATGG GGGGTTCCAG TTTGCTGAAC 6300 
TTTGCTGTCT TGTAAGGGTG GGGGCTGACG GCCGACCCTG GGAC-GAGGTG ACACCGCAGG 6360
GGGAGGTTGT GGGCAACGGT GGAGGAGGAG AGACGGGAGG GGACCATTTG GGATGGAGGG 6420
GCCTCTTCAG AGTTTTAAAA GGCGTTTGTG GGGTGGAGTT GAGTGTGCTC TGGGCTTGGA 6480 
CACTTGCCGT GGTGCCCCTG GCTGGCCGAG GAGACTGGCT CTGGCCAGGG CCCCGTCCTG 6540 
AGAGGTCCTC AGCGTCTGAC TCTCGGCCAG GCGCCAGCAA GGAGGGGCCG GTCCCCGGGG 6600 
CTACCAGGCA GGCACGTGCA CATCGCCATC GCCACACGCC AACTCCGCCT GGGTTTTACA 6660
AAGTCGTTGC CTTAATGCAT GTGGACAGGA ACTCCCTGAG GTCGCCCCAT GCCCCCTGGC 6720 
TGTGCCAGGT ACGGACGCCC TGGACCCTGC GAACAGGTGG GGCGGGCGAG GGGCCCAAGG 6780 
GACGGGCTCC AGAGACACGC GCAGGGCAGG AGGGGTCTCA CGGAGGGGTC TCGCACTGAG 6840 
GCGCCCAGAG CTGGTGGTCC CGCTGGACGC CATCCCTCTG CCCGGGATCC ACACGGCCCA 6900 
CGTGTGCCCG CCATGCCCGC GCCCCACGCC ATTGCAGTCT TCCATCCTCT GGCCGTGACG 6960

GTGGCTGCAG CTTCCCCATT TGCGCCGTTG CCTCIGGCTG TCTGCACTTT TGTTCATGCT 7020 

CCAAAGAACA TTTCATAATG CCITCAGTAC CGACGTACAC TTCTGACCAT TTTGTATGTG 7080 

TCCTTGTGCC GTAGTGACCA GGCCTTTTTT TGGTGGATGT GTTACCCCGC ACACTTCAAT 7140

CTCAACTTTG TGCACCGTCC ATTTTCTAGG GATAGACGCC CAGGGAATGA ACTCTAGTTT 7200

TCTAACAGAT TAGCTGAGAT ATTAACTTAC TCACACX3GAC AGGTTGATGC CAGAGCCGTA 7260 

AGAATGCGCC AGTGCGGGTT TGCGGGGGAC TTCGGGTGTG GGGTCCTGCG GCCGCGATGG 7320 

CCGTGGAAGG TTCTGGGGAT CCCTGC1GCC ACGGGGACGA GTTCGGACGC CAGGTGGACC 7380 
TGTGCACTCA GTAAAACGCA GTGATTCAAA AAAAAAAAAA 



I I I I I 

MVQKSRNGGV YPGPSGEKKL KVGPVGLDPG APDSTEDGAL L T
GAGAGKPPKR NAFYRKLQNF LYNVLERPRG WAFIYHA
SSEGALYILE IVTIWFGVE YFVRIWAAGC CCRYRGWRGR L 

AVLAAGSQGN VFATSALRSL RFLQILRMIR MDRRGGTWKL LGSWYAHSK ELVTAWYIGF 
LCLILASFLV YLAEKGENDH FDTYADALWH GLITLTTIGY GDKYPQTWNG R
GVSFFALPAG ILGSGFALKV QEQHRQKHFE KRRNPAAGLI QSAWRFYATN L 
QYYERTVTVP MYRLIPPLNQ LELLRNLKSK SGLAFRKDPP PEPSPSQKVS L 
GVAAKGKGSP QAQTVRRSPS ADQSLEDSPS KVPKSWSFGD RSRARQAFRI
EASLPGEDIV DDKSCPCEFV TEDLTPGLKV SIRAVCVMRF LVSKRKFKES 
EQYSAGHLDM LSRIKSLQSR VDQIVGRGPA ITDKDRTKGP AEAELPSDPS 
QVLSMEKKLD FLVNIYMQRM GIPPTETEAY FGAKEPEPAP PYHSPEDSRE H 
IVRSSSSTGQ KNFSAPPAAP PVQCPPSTSW QPQSHPRQGH GTSPVGDHGS L 

ERSLSAYGGG NRASMEFLRQ EDTPGCRPPE GTLRDSDTSI SIPSVDHEEL ERSFSGFSIS 780 

QSKENLDALN SCYAAVAPCA KVRPYIAEGE SDTDSDLCTP CGPPPRSATG EGPFGDVGWA 840 
GPRK 

Seq ID NO: 2838 DNA sequence 
Nucleic Acid Accession #: AF1S249S.1
Coding sequence: 1..2391 

1 11 21 31 41 51 

ATGGAGGCGG GAGGAGAGCG ATTTCTTAGA CAAAGGCAAG TCTTGCTTCT CTTTGTTTTT 60

CTGGGAGGGT CTCTGGCTGG GTCCGAGTCA AGACGCTATT CTGTGGCTGA GGAAAAAGAG 120 

AAGGGCTTTT TAATAGCCAA CCTAGCAAAG GATCTGGGAC TAAGGGTAGA GGAACTGGCC 180

GCGAGGGGGG CCCAAGTTGT GTCCAAAGGG AACAAACAGC ATTTTCAGCT CAGTCATCAG 240 

ACAGGTGATT TGCTCCTGAA TGAGAAATTG GACCGGGAGG AGCTATGCGG CCCCACAGAA 300

CCATGCATAC TACATTTTCA GATATTACTG CAAAACCCTT TGCAATTCGT TACAAACGAG 360

CTCCGTATCA TAGATGTAAA TGACCATTCT CCGGTATTCT TTGAAAATGA AATGCATCTG 420 

AAAATCCTAG AAAGCACTCT GCCAGGAACA GTAATTCCTT TGGGAAATGC TGAGGACTTG 480 

GATGTGGGAA GAAACAGCCT CCAAAACTAC ACTATCACTC CGAATTCCCA CTTCCACGTA 540 

CTCACTCGCA GTCGTAGGGA CGGAAGGAAG TACCCGGAAC TAGTACTGGA TAAAGCGCTC 600 

GATCGGGAGG AGCAGCCGGA ACTCAGCTTA ACGCTCACCG CGCTGGACGG CGGCTCICCC 660 

CCTCGGTCTG GGACAGCCCA GATAAACATC CAGGTCTTAG ATATAAACGA CAATGCACCA 720 

GAATTTGCAC AGCCGCTCTA TGAGGTTGCA GTICTAGAGA ATACCCCCGT TAACTCTGTC 780 

ATTGTCACTG TCTCGGCTTC TGACTTAGAT ACAGGAAGTT TTGGGACAAT ATCATATGCA 840 

TTTTTTCATG CTTCTGAAGA AATTCGCAAA ACTTTTCAGC TAAATCCAAT TACTGGTGAT 900 

ATGCAACTGG TCAAATATTT GAATTTTGAA GCGATTAATA GTTATGAAGT CGACATCGAG 960 

GCCAAGGATG GCGGAGGCCT ATCCGGAAAG TCTACAGTCA TAGTCCAGGT GGTTGATGTC 1020 

AACGACAACC CACCGGAACT GACCTTGTCT TCAGTAAACA GCCCTATTCC TGAGAACTCG 1080 

GGAGAGACTG TACTGGCTGT TTTCAGTGTT TCTGATCTAG ACTCTGGAGA CAACGGAAGA 1140

[TGAGAA CAATCTCCCC TTCTTCCTGA AACCATCTGT AGAGAATTTT 1200 

ZAGAAGG CGCGCTGGAC AGAGAGACCA GATCCGAGTA CAACATTACC 1260 

A CTGACCTGGG GACACCCAGG CTGAAAACCA AGTACAACAT AACCGTGCTG 1320 

GTCTCCGACG TCAATGACAA CGCCCCCGCC TTCACCCAAA TCICCTACAC CCTGTTCGTC 1380 

CGCGAGAACA ACAGCCCCGC CCTGCACATC GGCAGTGTCA GCGCCACAGA CAGAGACTCA 1440 

GGCACCAACG CCCAGGTAAC CTACTCGCTG CTGCCGCCCC AGC-ACCCGCA CCTGCCCCTC 1500 

TCTTCCCTGG TCTCCATCAA CGCGGACAAC GGCCACCTGT TTGCCCTCAG GTCGCTGGAC 1560 

TACGAGGCCC TGCAGGCGTT CGAGTTCCGC GTGGGCGCCA CAGACCGTGG CTCCCCGGCT 1620 

TTGAGCAGCG AGGCGCTGGT GCGCGTGCTG GTGCTGGACG CCAACGACAA CTCGCCCTTC 1680 

GTGCTGTACC CGCTGCAGAA CGGCTCCGCG CCCTGCACCG AGCTGGTGCC CCGGGCGGCT 1740 

GAGCCGGGCT ACCTGGTGAC CAAGGTGGTG GCGGTGGACG GCGACTCGGG CCAGAACGCC 1800 

TGGCTGTCGT ACCAGCTGCT CAAGGCCACG GAGCCCGGGC TGTTCGGCGT GTGGGCGCAC 1860 

AATGGCGAAG TGCGCACCGC CAGGCTGCTG AGCGAGCGCG ACGCGC-CCAA GCACAGGCTG 1920 

GTGGTGCTGG TCAAGGACAA TGGCGAGCCT CCGCGCTCGG CCACCGCCAC GCTGCATC-TG 1980 

CTCCTGGTGG ACGGCTTCTC CCAGCCCTAC CTGCCTCTCC CGGAGGCGGC ACCGGCCCAG 2040 

GCCCAGGCCG ACTTGCTCAC CGTCTACCTG GTGGTGGCAT TGGCCTCG3T GTCTTCGCTC 2100 

TTCCTCTTTT CGGTGCTCCT GTTCGTGGCG GTGCGGCTGT GCAGGAGGAG CAGGGCGGCC 2160 

TCGGTGGGTC GCTGCTCGGT GCCCGAGGGC CCCTTTCCAG C-GCAC-ATGGT GGACGTGAGC 2220 

GGCACCGGGA CCC1GTCCCA GAGCTACCAG TACGAGGTGT GTCTGACTGG AGGCTCCGGG 2280

ACAAATGAGT TCAAGTTCCT GAAGCCAATT ATCCCCAACT TCGTTGCTCA GGGTGCAGAG 2340 
AGGGTTAGCG AGGCAAATCC CAGTTTCAGG AAGAGCTTTG AATTCAGTTA A 



Seq ID NO: 



MEAGGERFLR QRQVLLLFVF LGGSLAGSES RRYSVAEEKE KGFLIANLAK DLGLRVEELA 
ARGAQWSKG NKQHFQLSHQ TGDLLLNEKL DREELCGPTE PCILHFQILL QKPLQKVTNE
LRIIDVOTHS PVFFENEMHL KILESTLPGT VIPLGNAEDL DVGRNSLQNY TITPNSHFHV 
LTRSRRDGRK YPELVLDKAL DREEQPELSL TLTALDGGSP PRSGTAQINI QVLDINDNAP 
EFAQPLYEVA VLENTPVNSV IVTVSASDLD TGSFGTISYA FFHASESIRK TFQLNPITGD 
MQLVKYLNFE AINSYEVDIE AKDGGGLSGK STVIVQWDV NDNPPELTLS SVHSPIPENS 
GETVLAVFSV SDLDSGDNGR VMCSIENNLP FFLKPSVENF YTLVSEGALD RETRSEYNIT 
ITITDLGTPR LKTKYNITVL VSDVNDNAPA FTQISYTLFV RENNSPALHI GSVSATDRDS 
GTNAQVTYSL LPPQDPHLPL SSLVSINADN GHLFALRSLD YSALQAFEFR VGATDRGSPA 
F VLYPLQNGSA PCTELVPRAA EPGYLVTKW AVDGDSGQNA 



AACAGCATGG 



CTGCCGTGGC 430 

ATCCGGTCCG 600 

AGTGGGCGCC GACCAGCCCC 660 

CCATGGAGGT CTTCAGCATT GACTCCATGT CCGGCCQ

ACCGGGAGGA GCACGCCTCT TACCACCTCC GAGCCCACGC TGTGGACATG A 
AGGTGGAGAA CCCCATCGAC CTGTACATCT ACGTCAT
AGTTCATCAA CCAGGTCTAC AACGGCTCCG TGGACGAGGG CTCCAAGCCA G
TGATGACCGT CACGGCCAAC GATGCTGACG ACAGCACCAC GGCCAACGGG A 

ACCGGATCGT GACCCAGACC CCACAGAGCC CGTCCCAGAA TATGTTCACC ATCAACAGCG 1020 

AGACTGGAGA TATCGTCACA GTGGCGGCTG GCCTGGACCG AGAGAAAGTT CAGCAGTACA 1080 

CAGTCATCGT TCAGGCCACA GATATGGAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 1140 

CCACAGCCAT CATCACGGTG ACAGATGTGA ATGACAACCC GCCAGAATTT ACCGCCAGCA 1200

CGTTTGCAGG GGAGGTCCCC GAAAACCGCG TGGAGACCGT GGTCGCAAAC CTCACGGTGA 1260 

TGGACCGAGA TCAGCCCCAC TCTCCAAACT GGAATGCCGT TTACCGCATC ATCAGTGGGG 1320 

ATCCATCCGG GCACTTCAGC GTCCGCACAG ACCCCGTAAC CAACGAGGGC ATGGTCACCG 1380 

TCGTGAAGGC AGTCGACTAC GAGCTCAACA GAGCTTTCAT GCTGACAGTG ATGGTGTCCA 1440 

ACCAGGCGCC CCTGGCCAGC GGAATCCAGA TGTCCTTCCA GTCCACGGCA GGGGTGACCA 1500

TCTCCATCAT GGACATCAAC GAGGCTCCCT ACTTCCCCTC AAACCACAAG CTGATCCGCC 1560 

TCGAGGAGGG CGTGCCCCCC GGCACCGTGC TGACCACGTT TTCAGCTGTG GACCCTGACC 1620 

GGTTCATGCA GCAGGCTGTG AGATACTCAA AGCTGTCAGA CCCAGCGAGC TGGCTGCACA 1680

TCAATGCCAC CAACGGCCAG ATCACCACGG CGGCAGTGCT GGACCGTGAG TCCCTCTACA 1740 

CCAAAAACAA CGTCTACGAG GCCACCTTCC TGGCAGCTGA CAATGGGATA CCCCCGGCCA 1800

GCGGCACCGG GACCCTCCAG ATCTATCTCA TTGACATCAA CGACAACGCC CCTGAGCTGC 1860 

TGCCCAAGGA GGCGCAGATC TGCGAGAAGC CCAACCTGAA CGCCATCAAC ATCACGGCGG 1920 

CCGACGCTGA CGTCGACCCC AACATCGGCC CCTACGTCTT CGAGCTGCCC TTTGTCCCGG 1980 

CGGCCGTGCG GAAGAACTGG ACCATCACCC GCCTGAACGG TGACTATGCC CAACTCAGCT 2040 

TGCGCATCCT GTACCTGGAG GCCGGGATGT ATGACGTCCC CATCATCGTC ACAGACTCTG 2100

GAAACCCTCC CCTGTCCAAC ACGTCCATCA TCAA«3TCAA GGTGTGCCCA TGTGATGACA 2160 

ACGGGGACTG CACCACCATT GGCGCAGTGG CAGCGGCTGG TCTGGGCACC GGTGCCATCG 2220 

TGGCCATCCT CATCTGCATC CTCATCCTGC TGACCATGGT CCTGCTGTTT GTCATGTGGA 2280 
TGAAGCGGCG AGAGAAGGAG CGCCACACGA AGCAGCTGCT 
TCCGCGACAA CATCCTCAAG TATGACGAGG AAGGCGGTGG 
ACCTCAGCCA GCTGCAGCAG CCGGAAGCCA TGGGGCACGT GCCAAGCAAA 

G CTGAGCCCCA GTACCCGATC AGGCCCATGG 






ACCCCACGGC ACCCCCCTAT GACTCCCTGC 

CCGCAGGCTC CGTCAGCTCC CTGAACTCAT CCAGTTCCGG GGACCAAGAC 




GCTCGGACGC CGGAGGAGCA 



AAAAAAAAAG 



I I I I I I 

MTAGAGVLLL LLSLSGALRA HNEDLTTRET CKAGFSEDDY TALISQNILE GEKLLQVKFS 
SCVGTKGTQY ETNSMDFKVG ADGTVFATRE LQVPSEQVAF TVTAWDSQTA E 
VAQTSSPHSG HKPQKGKKW ALDPSPPPKD TLLPWPQHQN ANGLRRRKRD W 
NSRGPFPQQL VRIRSDKDND IPIRYSITGV GADQPPKEVF SIDSMSGRMY V 
ASYHLRAHAV DMNGNKVENP IDLYIYVIDM NDNRPEFINQ VYNGSVDSC-S K 
ANDADDSTTA NGMVRYRIVT QTPQSPSQNM FTINSETGDI V7VAAGLDRE KVQQYTVIVQ 
ATDMEGNLNY GLSNTATAII TVTDVNDNPP EFTASTFAGE VPEKRVETW ANLTVMDRDQ
PHSPNWNAVY RIISGDPSGH FSVRTDPVTN EGMVTWKAV DYELNRAFML TVMVSNQAPL 
ASGIQMSFQS TAGVTISIMD INEAPYFPSN HKLIRLEEGV PPGTVLTTFS AVDPDRFMQQ 
AVRYSKLSDP ASWLHINATN GQITTAAVLD RESLYTKNNV Y3ATFLAADN GIPPASGTC-T 
LQIYLIDIND NAPELLPKEA QICEKPNLNA INITAADADV DPNIGPYVFE LPFVPAAVRK 
NWTITRLNGD YAQLSLRILY LEAGMYDVPI IVTDSGNPPL SNTSIIKVKV CPCDDNGDCT 
TIGAVAAAGL GTGAIVAILI CILILLTMVL LFVMWMKRRE KERHTKQLLI DPSDDVRDNI 
LKYDEEGGGE EDQDYDLSQL QQPEAMGHVP SKAPGVRRVD ERPVGAEPQY PIRPMVPHPG 
DIGDFINEGL RAADNDPTAP PYDSLLVFDY EGSGSTAGSV SSLNSSSSGD QDYDYLNDWG 



I I I I I I 

ATGGAGGCCG GAGAGGGGAA GGAGCGCGTT CCGAAACAAA GGCAAGTCCT GATATTCTTT 
GTTTTGCTGG GCATAGCTCA GGCTAGTTGC CAGCCTAGGC ACTATTCAC-T GGCCGAGGAA 
ACGGAGAGTG GCTCCTTTGT GGCCAATTTG TTAAAAGACC TGGGGCTGGA GATAGGAGAA
CTTGCTGTGA GGGGGGCCAG GGTCGTTTCC AAAGGAAAAA AAATGCATTT GCAGTTCGAT 
AGGCAGACCG GGGATTTGTT GTTAAATGAG AAATTGGACC GGGAGGAGCT GTGCGGCCCC 
ACAGAGCCCT GTGTCCTACC TTTCCAGGTG TTACTAGAAA ATCCCTTGCA GTTTTTTCAG 
GCGGAGCTAC GGATTAGGGA CGTAAATGAT CATTCCCCAG TTTTCCTAGA CAAAGAAATA 
CTTTTGAAAA TTCCAGAAAG TATCACTCCT GGAACTACTT TCTTAATAGA ACGTGCCCAG
GACTTGGATG TAGGAACCAA CAGTCTCCAA AATTACACAA TCAGTCCCAA TTTCCACTTT 
CATCTTAATT TACAAGACAG TCTCGATGGC ATAATATTAC CACAGCTGGT GCTGAACAGA 
GCCCTGGATC GCGAGGAGCA GCCTGAGATC AGGTTAACCC TCACAGOGCT AGATGGCGGG 
AGTCCACCCA GGTCCGGCAC GGCCCTGGTA CGGATTGAAG TTGTGGACAT CAATGACAAC 
GTCCCAGAGT TTGCAAAGCT GCTCTATGAG GTGCAGATCC CGGAGGACAG CCCCGTTGGA
TCCCAGGTTG CCATCGTCTC TGCCAGGGAT TTAGACATTG GAACTAATGG AGAAATATCT 
TATGCATTTT CCCAAGCATC TGAAGACATT CGCAAAACGT TTCGATTAAG TGCAAAATCG 
GGAGAACTGC TTTTAAGACA GAAACTGGAT TTCGAATCCA TCCAGACATA CACAGTAAAT 
ATTCAGGCGA CAGATGGTGG GGGCCTATCT GGAACTTGTG TGGTATTTGT O 
GATTTGAATG ACAATCCTCC GGAACTAACT ATGTCGACAC TTATCAATCA G.

AACTTGCAGG ACACCCTCAT TGCTGTATTC AGCGTTTCAG ATCCTGACTC CGGAGACAAC 1140 
GGAAGGATGG TGTGCTCCAT CCAAGATGAT CTTCCTTTTT TCTTGAAACC TTCTGTTGAG 1200 
AACTTTTACA CTCTGGTGAT AAGCACGGCC CTGGACCGGG AGACCAGATC CGAATACAAC 1260 
ATCACCATCA CCGTCACCGA CTTCGGGACA CCCAGGCTGA AAACCGAGCA CAACATAACC 1320
GTGCTGGTCT CCGACGTCAA TGACAACGCC CCCGCCTTCA CCCAAACCTC CTACACCCIG 1380
TTCGTCCGCG AGAACAACAG CCCCGCCCTG CACATCGGCA GCGTCAGCGC CACAGACAGA 1440 
GACTCGGGCA CCAACGCCCA GGTCACCTAC TCGCTGCTGC CGCCCCAGGA CCCGCACCTG 1500 
CCCCTCGCCT CCCTGGTCTC CATCAACGCG GACAACGGOC ACCTGTTCGC TCTCCAGTCG 1560
CTGGACTACG AGGCCCTGCA GGCGTTCGAG TTCCGCGTGG GCGCCGCAGA CCGCGGCTCC 1620
CCGGCGTTGA GCAGCGAGGC GCTGGTGCGC GTGCTGGTGC TGGACGCCAA CGACAACTCG 1680
CCCTTCGTGC TGTACCCGCT GCAGAACGGC TCCGCGCCCT GCACCGAGCT GGTGCCCCGG 1740 
GCGGCCGAGC CGGGCTACCT GGTGACCAAG GTGGTGGCGG TGGACGGCGA CTCGGGCCAG 1800
AACGCCTGGC TGTCGTACCA GCTGCTCAAG GCCACGGAGC CCGGGCTGTT CGGCGTGTGG 1860
GCGCACAATG GCGAGGTGCG CACCGCCAGG CTGCTGAGGG AGCGCGACGC TGCCAAGCAG 1920 
AGGCTGGTGG TGCTGGTCAA GGACAATGGC GAGCCTCCGC GCTCGGCCAC CGCCACGCTG 1980
CACGTGCTCC TGGTGGACGG CTTCTCCCAG CCCTACCTGC TGCTCCCGGA GGCGGCACCG 2040 
GCCCAGGCCC AGGCCGACTT GCTCACCGTC TACCTGGTGG TGGCGTTGGC CTCGGTGTCT 2100 
TCGCTCTTCC TCTTCTCGGT GCTCCTGTTC GTGGCGGTGC GGCTGTGCAG GAGGAGCAGG 2160 
GCGGCCTCGG TGGGTCGCTG CTCGGTGCCC GAGGGCCCCT TTCCAGGGCA GATGGTGGAC 2220
GTGAGCGGCA CCGGGACCCT GTCCCAGAGC TACCAGTACG AGGTGTGTCT GACTGGAGGC 2280
TCCGGGACAA ATGAGTTCAA GTTCCTGAAG CCAATTATCC CCAACTTCGT TGCTCAGGGT 2340
GCAGAGAGGG TTAGCGAGGC AAATCCCAGT TTCAGGAAGA GCTTTGAATT CACTTAA 

Seq ID NO: 288 Protein sequence



Protein Accession #: AAD43756.1

1 11 21 31 41 51
I I I I I I
MEAGEGKERV PKQRQVLIFF VLLGIAQASC QPRHYSVAEE TSSGSFVAML L. 

LAVRGARWS KGKKMHLQFD RQTGDLLLNE KIiDREELCGP TSPCVLPFQV LLENPLQFFQ
AELRIRDVKD HSPVFLDKEI LLKIPESITP GTTFLIERAQ DLDVGTNSLQ NYTISPNFHF 
HLNLQDSLDG IILPQLVLNR ALDREEQPEI RLTLTALDGG SPPRSG1 
VPEFAKLLYE VQIPEDSPVG SQVAIVSARD LDIGTNGEIS YAFSQASEDI R 
GELLLRQKLD FESIQTYTW IQATDGGGLS GTCWFVQVM DLKDNPPELT M. 

NLQDTLIAVF SVSDPDSGDN GRMVCSIQDD LPFFLKPSVE NFYTLVISTA Li

ITITVTDFGT PRLKTEHNIT VljVSDVNDNA PAFTQTSYTL FVRENNSPAL HIGSVSATDR 
DSGTNAQVTY SLLPPQDPHL PLASLVSINA DNGHLFALQS LDYEALQAFE FRVGAADRGS 
PALSSEALVR VLVLDANDNS PFVLYPLQNG SAPCTELVPR AAEPGYLVTK WAVDGDSGQ 
NAWLSYQLLK ATEPGLFGVW AHNGEVRTAR LLRERDAAKQ RLWLVKDNG EPPRSATATL 

HVLLVBGPSQ PYLLLPEAAP AQAQADLLTV YLWALASVS SLFLFSVLLF VAVRLCRRSR
AASVGRCSVP EGPFPGQMVD VSGTGTLSQS YQYEVCLTGG SGTNEFKFLK PIIPNFVAQG 
AERVSEANPS FRKSFEFT 
CGGAGCACAT GCTGAGCGGA GCGGCTGGGG CTGCGCGGCG TGGCGGAGCA G

C TCGCTCGCTC GCAGGGACAC ACGCAGGGGC TGACAGCTGT GCTGGTGCTG 120 

G CCACAAGGAG ACGATCGAGG AGAGAGACAA GCGGCAGCAG AGGCAGCAGC 180 

GGCAGAGGCA GCACCAGGGC TGCGGAGCTG CTGGGAGTGG GAGTGACTCC CCCACCTCGG 240 

GCCCCCACCC TGTCCCTGTC CTCTTCCCGC TTGCCCTGAG TTTAGAAGAG CAGCCGCTGC 300 

CACCACTGCC ACTCGGGAGG GCACCAGGGC TGCTGGCTAG GGAGGGACAG GGCAGGGAGG 360 

CTCTGGCCAG TCCCAGCAGC CGGGGACAGA TGCCGATCGA GATTGTGTGC AAAATCAAAT 420 

TTGCTGAGGA GGATGCGAAA CCCAAGGAGA AGGAGGCAGG GGATGAGCAG AGCCTCCTCG 480

GGGCTGTTGC CCCTGGAGCA GCCCCCCGAG ACCIGGCCAC CTTTGCCAGC ACCAGCACCC 540 

TGCATGGACT GGGCCGGGCC TGTGGCCCAG GCCCCCACGG ACTGCGCAGA ACCCTGTGGG 600 

CACTGGCCCT ACTCACCTCG CTGGCTGCCT TCCTGTACCA GGCGGCTGGC CTGGCCCGGG 660 

GCTACCTGAC CCGGCCTCAC CTGGTGGCAA TGGACCCCGC TGCCCCAGCC CCAGTGGCGG 720 

GCTTCCCGGC TGTCACCCTC TGCAATATCA ACCGCTTCCG GCATTCGGCA CTCAGCGATG 780 

CCGACATCTT CCACCTGGCC AATCTGACAG GGCTGCCCCC CAAAGACCGG GATGGGCACC 840 

GTGCGGCTGG CCTGCGCTAC CCAGAGCCTG ACATGGTAGA CATCCTCAAC CGCACTGGCC 900 

ACCAGCTCGC CGACATGCTT AAGAGCTGCA ACTTCAGTGG GCATCACTGC TCCGCCAGCA 960 

ACTTCTCTGT GGTCTATACT CGCTATGGGA AGTGTTACAC CTTCAACGCG GACCCGCGGA 1020 

T GGAGATCATG CTGGACATCC 1080

3TCGTTT GAGGCAGGTA 1140 

C CAGGAGGAGC CGCCCTACAT CCACCAGCTG GGGTTCGGGG 1200 

A GCGGCTGACC TACCTGCCCC 1260 



CGGCCTACAG TGTGTCTGCC TGCCGGCTGC GCTGTGAAAA GGAGGCCGTG CTTCAGCGCT 
GCCACTGCCG GATGGTGCAC ATGCCAGGCA A 
TCGAGTGTGC AGACCACACA CTGGACTCCC T 

GCCCCACCCC CTGCAACCTG ACACGCTATG GGAAAGAGAT CTCCATGGTC A 
ACAGGGGCTC A 



A TGGCCTGTCA GCCCTGCTGG GAGACCTCGG GGGACAGATG GGCCTGTTCA 
TTGGGGCCAG CATCCTCACG TTGCTGGAGA TCCTCGACTA C 



CCAATCACCA CCACCCCCAC GGTCCCCCAG 1980



CTGGGACCCC TCCTGGGATC CCCAGCACAT T 

G GGAGGGCCAG GACTCAGTTC CTGCTCTCAT CCTCCCCTGC CCTGATGTCA 2160 



G ACCAGGCCAT GGGCCCTCAC GGAGAGGAAG GGAAGGAAGG AGAGGGAGGG 2280 

GGAGGATAGA GCCCATCCCA GCCGGGGAGG GGGAGCCCTC TGTACATTTG TAAATATTTA 2340

GGGAAAGCCG GGTGGGGGGA GGGGATACAG ATGTAGAAGG TGGGTAGGGC TACAGGGGTG 2400 

GGTGATTTAG GGACAGCCAG GGTCCCAGCC CCAATGTCAG CAGGATAGGG AGAGCCCCAG 2460

GACTCAGGAG TGCTGGGCTG GTCCTACTTC CTGCCCCTCT CCAGGCCCAG CTCCCCTCTT 2520 

GGCAGGGGGA GAGGATGGCC CAGCAGGCCT GGCCCAGCTC CCAGTTCCCC CTGCACCAGC 2580

CCCACCCCTA GAGTCCCTTC TATAGGGAGG GGGCAGGAGA CCTTCCAGAC TTCGGCTGAG 2640

CTTGGAGGGT GGGAAGGGAG CCTTCTCAGT CCTCTCTCCC TCCAGTCTGA TTTTATAAAG 2700 
TGCTGACGAG 



Protein Accession #: NP_061144.1

1 11 21 31 41 51 

MPIEIVCKIK FABEDAKPKB KEAGDEQSLL GAVAPGAAPR DLATFASTST LHGLGRACGP 60 

GPHGLRRTLW ALALLTSLAA FLYQAAGLAR GYLTRPHLVA MDPAAPAPVA GFPAVTLCNI 120 

NRFRHSALSD ADIFHLANLT GLPPKDRDGH RAAGLRYPEP DMVDILNRTG HQLADMLKSC 180 

NFSGHHCSAS NFSWYTRYG KCYTFNADPR SSLPSRAGGM GSGLEIMLDI QQEEYLPIWR 240 

ETNETSFEAG IRVQIHSQEE PPYIHQLGFG VSPGFQTFVS CQEQRLTYL? QPWGNCRAES 300 

ELREPELQGY SAYSVSACRL RCEKEAVLQR CHCRMVHMPG NETICPPNIY IECADHTLDS 360 

LGGGPEGPCF CPTPCNLTRY GKEISKVRIP HRGSARYLAR KYNRNE1 



Seq ID NO: 291 DNA sequence



Nucleic Acid Accession #: Eos sequence



I I I I I I 

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA CAACTGACGA 
TATGATCATC TTAATTTACT TATTTCTCTT GCXATGGGAA GACACTCAAG GATGGGGATT 
A ACTCCATATG GCTTGAACGA G 
\GCTCAC CTACGCAGAA G 
TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA TTGGATTTCA 
A TGGCTAAGGG CAGAGTTGGA TACCCCAT1G TGAAGCCAGG 
IT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCG1C TCAATAGGAG 
TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG GAGTGTGGTG GCGTCTTTAC 
AGATCCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT GAGTACGAAG ATAACCAAAT
CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGCGTATT CACCTGAGTT TTTTAGATTT
TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 
TGATGTCCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACATCATCAG 
TACAGGAAAT GTCATGACCT TGAAGTTTCT AAGTGATGCT TCAC-TGACAG CTGGAGGTTT 
CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 
TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGC-AAGA TTTAGCCACT TATAAAAAAA 
AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT
CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA TGTGAAAGCA ATACATAATT 1020

TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA ATGAAACCTC TCATAATCCC 1080 

ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT TTCTTTCAGT CATTTTTCTA 1140 

TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GGAATCCTGC 1200 

TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC TTTATAAACA TTTTCTGAAA 1260

TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT AAAATATTCT ATGATATGAA 1320

TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT TTCAGGTCAT TTTCATAAAT 1380 



L LWEDTQGWGF KDGIFHNSIW LERAAGVYHE EARSGKYKLT Y 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNOGFGI 
ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKMTS TTSTGNKNFL AGRFSHL 



I I I I I I

GAATTCGCAC TGCTCTGAGA ATTTGTGAGC AGCCCCTAAC AGGCTGTTAC TTC 
CTGACGATAT GATCATCTTA ATTTACTTAT TTCTCTTGCT ATGGGAAGAC ACTCAAGGAT 
GGGGATTCAA GGATGGAATT TTTCATAACT CCATATGGCT TGAACGAGCA GCCGGTGTGT 
ACCACAGAGA AGCACGGTCT GGCAAATACA AGCTCACCTA CGCAGAAGCT A 
GTGAATTTGA AGGCGGCCAT CTCGCAACTT ACAAGCAGCT AGAGGCAGCC A< 
GATTTCATGT CTGTGCTGCT GGATGGATGG CTAAGGGCAG AGTTGGATAC C 
AGCCAGGGCC CAACTGATGA TTTGGAAAAA CTGGCATTAT TGATTATGGA A' 
ATAGGAGTGA AAGATGGGAT GCCTATTGCT ACAACCCACA CGCAAAGGAG TGTGGTGGCG 480 
TCTTTACAGA TCCAAAGCGA ATTTTTAAAT CTCCAGGCTT CCCAAATGAG TACGAAGATA 540 
ACCAAATCTG CTACTGGCAC ATTAGACTCA AGTATGGTCA GCGTATTCAC CTGAGTTTTT 600
TAGATTTTGA CCTTGAAGAT GACCCAGGTT GCTTGGCTGA TTATGTTGAA ATATATGACA 660 
GTTACGATGA TGTCCATGGC TTTGTGGGAA GATACTGTGG AGATGAGCTT CCAGATGACA 720
TCATCAGTAC AGGAAATGTC ATGACCTTGA AGTTTCTAAG TGATGCTTCA GTGACAGCTG 780 
GAGGTTTCCA AATCAAATAT GTTGCAATGG ATCCTGTATC CAAATCCAGT CAAGGAAAAA 840
ATACAAGTAC TACTTCTACT GGAAATAAAA ACTTTTTAGC TGGAAGATTT ACCCACTTAT 900 
AAAAAAAAAA AAGGATGATC AAAACACACA GTGTTTATGT TGGAATCTTT TGGAACTCCT 960 
TTGATCTCAC TGTTATTATT AACATTTATT TATTATTTTT CTAAATGTGA AAGAAATACA 1020
TAATTTAGGG AAAATTGGAA AATATAGGAA ACTTTAAACG AGAAAATGAA ACCTCTCATA 1080
ATCCCACTGC ATAGAAATAA CAAGCGTTAA CATTTTCATA TTTTTTTCTT TCAGTCATTT 1140 
TTGTATTTGT GGTATATGTA TATATGTACC TATATGTATT TGCATTTGAA ATTTTGGAAT 1200
CCTGCTCTAT GTACAGTTTT GTATTATACT TTTTAAATCT TGAACTTTAT GAACATTTTC 1260 
TGAAATCATT GATTATTCTA CAAAAACATG ATTTTAAACA GCTGTAAAAT ATTCTATGAT 1320 
ATGAATGTTT TATGCATTAT TTAAGCCTGT CTCTATTGTT GGAATTTCAG GTCATTTTCA 1380
TAAATATTGT TGCAATAAAT ATCCTTCGGA ATTC 



1 11 21 31 41 51 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 



Seq ID NO: 295 DNA sequence 
Nucleic Acid Accession #: NM_001218.2
Coding sequence: 116..1180



GTACTCGCCA CGGCACCCAG GCTGCGCGCA CGCGGTCCCG GTGTGCAGCT GGAGAGCGAG 60

CGGCCACCGG GAGCCCCCGG CACAGCCCGC GCCCC-CCCCG CAGGAGCCCG CGAAGATGCC 120 

CAGCCCGGCC CCAGTGAACG GTTCCAAGTG GACTTATTTT GGTCCTC-ATG GGGAGAATAG 240 

CTGGTCCAAG AAGTACCCGT CGTGTGGGGG CCTGCTGCAG TCCCCCATAG ACCTGCACAG 300 

TGACATCCTC CAGTATGACG CCAGCCTCAC GCCCCTCGAG TTCCAAGGCT ACAATCTGTC 360

TGCCAACAAG CAGTTTCTCC TGACCAACAA TGGCCATTCA GTGAAGCTGA ACCTGCCCTC 420 

GGACATGCAC ATCCAGGGCC TCCAGTCTCG CTACAGTGCC ACGCAGCTGC ACCTGCACTG 480 

GGGGAACCCG AATGACCCGC ACGGCTCTGA GCACACCGTC AGCGGACAGC ACTTCGCCGC 540 

CGAGCTGCAC ATTGTCCATT ATAACTCAGA CCTTTATCCT GACGCCAGCA CTGCCAGCAA 600 

CAAGTCAGAA GGCCTCGCTG TCCTGGCTGT TCTCATTGAG ATGGGCTCCT TCAATCCGTC 660

CTATGACAAG ATCTTCAGTC ACCTTCAACA TGTAAAGTAC AAAGGCCAGG AAGCATTCGT 720 

CCCGGGATTC AACATTGAAG AGCTGCTTCC GGAGAGGACC GCTGAATATT ACCGCTACCG 780 

GGGGTCCCTG ACCACACCCC CTTGCAACCC CACTGTGCTC TGGACAGTTT TCCGAAACCC 840 

CGTGCAAATT TCCCAGGAGC AGCTGCTGGC TTTGGAGACA C-CCCTGTACT GCACACACAT 900 
G AAATGATCAA CAACTTCCGG CAGGTCCAGA AGTTCGATGA 960

GAGGCTGGTA TACACCTCCT TCTCCCAAGT GCAAGTCTGT ACTGCGGCAG GACTGAGTCT 1020 

GGGCATCATC CTCTCACTGG CCCTGGCTGG CATTCTTGGC ATCTGIATTG TGGTGGTGGT 1080 

A GGAAGAGTAT CAAAAAAGGT GATAACAAGG GAGTCATTTA 1140 



2 ACCAAGATGG AGACTGAGGC CCACGCTTGA GGTCCCCGGA GCTCCCGGGC 

ACATCCAGGA AGGACCTTGC TTTGGACCCT ACACACTTCG GCTCTCTGGA CACTTGCGAC 1260 

ACCTCAAGGT GTTCTCTGTA GCTCAATCTG CAAACATGCC AGGCCTCAGC- GATCCTCTGC 1320 

TGGGTGCCTC CTTGCCTTGG GACCATGGCC ACCCCAGAGC CATCCGATCC- ATGGATGGGA 1380 

TGCACTCTCA GACCAAGCAG CAGGAATTCA AAGCTGCTTG CTGTAACTGT GTGAGATTGT 1440 

GAAGTGGTCT GAATTCTGGA ATCACAAACC AAGCCATGCT GGTGGGCCAT TAATGGTTGG 1500

AAAACACTTT CATCCGGGGC TTTGCCAGAG CGTGCTTTCA AGTGTCCTGG AAAGTCTGCT 1560 

GCTTCTCCAA GCTTTCAGAC AAGAATGTGC ACTCTCTGCT TAGGTTTTC-C TTGGGAAACT 1620 

CAACTTCTTT CCTCTGGAGA CGGGGCATCT CCCTCTGATT TCCTTCTGCT ATGACAAAAC 1680 

CTTTAATCTG CACCTTACAA CTCGGGGACA AATGGGGACA GGAAGGATCA AGTTGTAGAG 1740 

AGAAAAAAGA AAACAAGAGA TATACATTGT GATATATTAG GGACACTTTC ACAGTCCTGT 1800

CCTCTGGATC ACAGACACTG CACAGACCTT AGGGAATGGC AGGTTCAAGT TCCACTTCTT 1860 

GGTGGGGATG AGAAGGGAGA GAGAGCTAGA GGGACAAAGA GAATGAGAAG ACATGGATGA 1920

TCTGGGAGAG TCTCACTTTG GAATCAGAAT TGGAATCACA TTCTGTTTAT CAAGCCATAA 1980 

TGTAAGGACA GAATAATACA ATATTAAGTC CAAATCCAAC CTCCTGTCAG TGGAGCAGTT 2040 

ATGTTTTATA CTCTACAGAT TTTACAAATA ATGAGGCTGT TCCTTGAAAA TGTGTTGTTG 2100

CTGTGTCCTG GAGGAGACAT GAGTTCCGAG ATGACCCAAT CTGCCTTTGA ATCTGGAGGA 2160 

AATAGGCAGA AACAAAATGA CTGTAGAACT TATTCTCTGT AGGCCAAATT TCATTTCAGC 2220

CACTTCTGCA GGATCCCTAC TGCCAACCTG GAATGGAGAC TTTTATCTAC TTCTCTCTCT 2280 

CTGAAGATGT CAAATCGTGG TTTAGATCAA ATATATTTCA AGCTATAAAA GCAGGAGGTT 2340 

ATCTGTGCAG GGGGCTGGCA TCATGTATTT AGGGGCAAGT AATAATGG-AA TGCTACTAAG 2400

ATACTCCATA TTCTTCCCCG AATCACACAG ACAGTTTCTG ACAGGCGCAA CTCCTCCATT 2460

TTCCTCCCGC AGGTGAGAAC CCTGTGGAGA TGAGTCAGTG CCATGACTGA GAAGGAACCG 2520

ACGCCTAGTT GAGAGCACCT TGCAGTTCCC CGAGAACTTT CTGATTCACA GTCTCATTTT 2580

GACAGCATGA AATGTCCTCT TGAAGCATAG CTTTTTAAAT ATCTTTTTCC TTCTACTCCT 2640 

CCCTCTGACT CTAAGAATTC TCTCTTCTGG AATCGCTTGA ACCCAGGAGG CGGAGGTTGC 2700

AGTAAGCCAA GGTCATGCCA CTGCACTCTA GCCTGGGTGA CAGAGCGAGA CTCCATCTCA 2760 
AAAAA AAAAA 

Seq ID NO: 296 Protein sequence
MPRESLHAAA VLLLVILKEQ PSSPAPVNGS KWTYFGPDGE NSWSKKYPSC GGLLQSPIDL
HSDILQYDAS LTPLEFQGYN LSANKQFLLT NNGIISVKLNL PSDMHIQGLQ SRYSATQLHL

HWGNPNDPHG SEHTVSGQHF AARLHIVHYN SDLYPDASTA SNKSEGLAVL AVLIEMGSFN 
PSYDKIFSHL QHVKYKGQEA FVPGFNIEEL LPERTAEYYR YRGSLTTPPC NPTVLWTVFR 
NPVQISQEQL LALETALYCT HMDDPSPREM INNFRQVQKF DERLVYTSFS QVQVCTAAGL 
[, AGILGICIW WSIWLFRRK SIKKGDNKGV IYKPATKMET EAHA 






Seq ID NO: 297 DNA sequence
Nucleic Acid A 
Coding sequenc 



I I I I I I 

ACGCGTCCGC CCACGCGTCC GCCCACGCGT CCGGTCGGGG CCAGAGCGCA GGTGTACCTG 
GCGGCCGTGC TGGAGCACCT GACCGCCGAG ATCCTGGAGC TGGCTGGCAA CCCGGCCCGC 
GACAAGAAGA CCCGCATCAT CCTGCGCCAC CTGTAGCTGG CCATTCGCAA CGGCGAGGAG 
CTTAACAAGC TGCTGGGCGA AGTCACCATC GCGCAGGGCG GTGTCCTGCC CAACATTCAG
GGCGTGCTTC TGCCCCAGAA GACCAAGAGC CACCACAAGG CCAAGGGTGA AAACCATTCA 
CTAGGAGAGG AGAAACACAA TGGCCACCAA GACAGAGTTG AGTCCCACAG CAAGGGAGAG 
CAAGAACGCA CAAGATATGC AAGTGGATGA GACACTGATC CCCAGGAAAG GTCCAAGTTT
T CGCTATGGAA TAGCCCTCGT CTTACATTTC TGCAATTTCA CAACGATAGC 
A TCACCATGGT AGCCATGGTC AACAGCACAA GCCCTCAATC 
3 AGGTGCTGCC TGTTGACTCA TTTGGTGGCC TAAGTAAAGC 
CCCAAAGAGT CTTCCTGCAA AGTCCTCAAT AC1TGGGGGT CAGTTTGCAA TTTGGGAAAA 
GTGGGGCCCT CCACAAGAAC GAAGCAGACT CTGCAGCATT GCTTTATCAG GAATGTTACT
GGGATGCTTT ACTGCCATCC TCATAGGTGG CTTCATTAGT GAAACCCTTG GGTGGCCCTT 
TGTCTTCTAT ATCTTTGGAG GTGTTGGCTG TGTCTGCTGC CTTCTCTGGT TTGTTGTGAT 
C CCCTTTTCCT ATCCATGGAT AAGCACCTCA GAAAAAGAAT ACATCATATC 



CTCCTTGAAA CAACAGGTCG GGTCTTCTAA GCAGCCTCTT CCCATCAAAG C 

ATCTCTACCC ATTTGGTCCA TATGTTTAGG CTGTTTCAGC CATCAATGC-T TAGTTAGCAC 1020 

AATGGTTGTA TACATACCAA CTTACATCAG CTCTGTGTAC CATGTTAACA TCAGAGACAA 1080 

TGGACTTCTA TCTGCCCTTC CTTTTATTGT TGCCTGGGTC ATAGGCATGG TGGGAGGCTA 1140

TCTGGCAGAT TTCCTTCTAA CCAAAAAGTT TAGACTCATC ACTGTGAGGA AAATTGCCAC 1200

AATTTTAGGA AGTCTCCCCT CTTCAGCACT CATTGTGTCT CTGCCTTACC TCAATTCCGG 1260 

CTATATCACA GCAACTGCCT TGCTGACGCT CTCTTGCGGA TTAAGCACAT TGTGTCAGTC 1320

AGGGATTTAT ATCAATGTCT TAGATATTGC TCCAAGGTAT TCCAGTTTTC TCATGGGAGC 1380 

ATCAAGAGGA TTTTCGAGCA TAGCACCTGT CATTGTACCC ACTGTCAGCG GATTTCTTCT 1440

TAGTCAGGAC CCTGAGTTTG GGTGGAGGAA TGTCTTCTTC TTGCTGTTTG CCGTTAACCT 1500

GTTAGGACTA CTCTTCTACC TCATATTTGG AGAAGCAGAT GTCCAAGAAT GGGCTAAAGA 1560 

GAGAAAACTC ACTCGTTTAT GAAGTTATCC CACCTTGGAT GGAAAAGTCA 1TAGGCACCG 1620

0/ . TATTGCATAA AATAGAAGGC TTCCGTGATG AAAATACCAG T. AAA? C-ATT f rTTTTTCCT 1680 

GTGGCTCTTT TCAATTATGA GATCAGTTCA TTATTTTATT CAGACTTTTT TTTGAGAGAA 1740
ATGTAAGATG AATAAAAATT CAAATAAAAT GATAACTAAG AAAAAAAAAA AAAAA 

Seq ID NO: 298 Protein sequence 
ILIGGFISET LGWPFVFYIF GGVGCVCCLL WFWIYDDPF SYPWISTSEK EYIISSLKQQ
VGSSKQPLPI KAMLRSLPIW SICLGCFSHQ WLVSTMWYI PTYISSVYHV NIRDNGLLSA 
LPFIVAWVIG MVGGYLADFL LTKKFRLITV RKIATILGSL PSSALIVSLP YLNSGYITAT 
ALLTLSCGLS TLCQSGIYIN VLDIAPRYSS FLMGASRGFS SIAPVIVPTV SGFLLSQDPE 
FGWRNVFFLL FAVNLLGLLF YLIFGEADVQ El 



GGCCCTGCCC TGAAGGCTGG TCACTTGCAG AGGTAAACTC CCCTCTTTGA CTTCTGGCCA 
GGGTTTGTGC TGAGCTGGCT GCAGCCGCTC TCAGCCTCGC TCCGGGCACG TCGGGCAGCC 
TCGGGCCCTC CTGCCTGCAG GATCATGCCC ACCACCGTGG ACGATGTCCT GGAGCATGGA 
GGGGAGTTTC ACTTTTTCCA G 



AGGTTTGGCC G 

CTCATGGCCA TTTCCCCAAC CTATACGTGG ATGTTAATTT TTCGCTTAAT CCAAGGACTG 
GTCAGCAAAG CAGGCTGGTT A 



I TCCTCACTGG AGGTGGTTGC AGTTCACAGT TGCTCTGCCC 

AACTTCTTCT Ti 



CCTTCATTTC 



A TTTCTTCTAC TCTGCCCTGG TTGAATTCCC AGCTGCCTTC 1320

ATGATCATCC TCACCATCGA CCGCATCGGA CGCCGTTACC CTTGGGCTGC ATCAAATATG 1380

T GGCCTCAGTT TTTATACCTG GTGATCTACA ATGGCTAAAA 1440

T CATGCTTGGG AAGAATGGGG ATCACAATGG CCTATGAGAT AGTCTGCCTG 1500

GTCAATGCTG AGCTGTACCC CACATTCATT AGGAATCTTG GCGTCCACAT CTGTTCCTCA 1560

A TTGGTGGCAT CATCACGCCA TTCCTGGTCT ACCGGCTCAC TAACATCTGG 1620 

3AGCTCC CGCTGATGGT TTTCGGCGTA CTTGGCTTGG TTGCTGGAGG TCTGGTGCTG 1680 

S GAAAGCTTTG CCTGAGACCA TCGAGGAAGC CGAAAATATG 1740

CAAAGACCAA GAAAAAATAA AGAAAAGATG ATTTACCTCC AAGTTCAGAA ACTAGACATT 1800

CCATTGAACT AAGAAGAGAG ACCGTTGCTG CTGTCATGAC CTAGCTTTGA TGGCAGCAAG 1860 

ACCAAAAGTA GAAATCCCTG CACTCATCAC AAAGCCCATA CAACTCAACC AAACTTACCC 1920 

CIGAGCCCTA TCAACCTAGG TCTACAGCCA GTGGAGTCTA TTGTACACTG TGGAAAAATA 1980 

CCCATGGGAC CAGATCCTGC CAAATTCTTC CAGCTCACTT TATTCTCAGC ATTCCTAGGA 2040 

CATTGGACAT TGGTTTTCTG GAGGGTTTTT TTTCCGATCT TTGTATTTTT TTAAATTTGA 2100 

TTCTTTTCTT TGCAATGCTA GCAACCAGAA TACATAGGGG AACTGTGGGC TAGGCAAANA 2160 

AAATAGAAAA AGTGTGAAAA ACAGTAAAGT TGGGAGAGGA GCATCTATTT TCTTAAAGAA 2220 
ATAAAACACC NAAAACAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 300 Protein sequence 
MPTTVDDVLE HGGEFHFFQK QMFFLLALLS ATFAPIYVGI VFLGFTPDHR CRSPGVAELS
LRCGWSPAEE LNYTVPGPGP AGEASPRQCR RYEVDWNQST FDCVDPLASL DTNRSRLPLG 
PCRDGWVYET PGSSIVTEFN LVCANSWMLD LFQSSVNVGF FIGSMSIGYI ADRFGRKLCL 
LTTVLINAAA GVLMAISPTY TWMLIFRLIQ GLVSKAGWLI GYILITEFVG RRYRRTVGIF 
YQVAYTVGLL VLAGVAYALP HWRWLQFTVA LPNFFFLLYY WCIPESPRWL ISQNKNAEAM 
RIIKHIAKKN GKSLPASLQR LRLEEETGKK LNPSFIBLVR TPQIRKHTMI LMYNWFTSSV 
LYQGLIMHMG LAGDNIYLDF FYSALVEFPA AFMIILTIDR IGRRYPWAAS HMVAGAACLA 
SVFIPGDLQW LKIIISCLGR MGITMAYEIV CLWAELYPT FIRNLGVHIC SSMCDIGGII 
TPFLVYRLTN IWLELPLMVF GVLGLVAGGL VLLLPETKGK ALPETIEEAE NMQRPRKNKE 
KMIYLQVQKL DIPLN 



Seq ID NO: 301 DNA sequence 



TCAATGTGCT GGAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGG CATTGTCTGG 
ACCAATGGAA CCCACGTCAC CTATCGGAAG GACACACGCT ATAAGCTATT GGGGGACCTT
TCAAGAAGGG ATGTCTCTTT GACCATAGAA AATACAGCTG TGTCTGACAG T 
TGTTGCCGTG TTGAGCACCG TGGGTGGTTC AATGACATGA AAATCACCGT ATCATTGGAG 420

ATTGTGCCAC CCAAGGTCAC GACTACTCCA ATTGTCACAA CTGTTCCAAC CGTCACGACT 480 

GTTCGAACGA GCACCACTGT TCCAACGACA ACGACTGTTC CAACGACAAC TGTTCCAACA 540 

ACAATGAGCA TTCCAACGAC AACGACTGTT CCGACGACAA TGACTGTTTC AACGACAACG 600 

AGCGTTCCAA CGACAACGAG CATTCCAACA ACAACAAGTG TTCCAGTGAC AACAACGGTC 660 

TCTACCTTTG TTCCTCCAAT GCCTTTGCCC AGGCAGAACC ATGAACCAGT AGCCACTTCA 720 

CCATCTTCAC CTCAGCCAGC AGAAACCCAC CCTACGACAC TGCAGGGAGC AATAAGGAGA 780 

GAACCCACCA GCTCACCATT GTACTCTTAC ACAACAGATG GGAATGACAC CC-TGACAGAG 840

TCTTCAGATG GCCTTTGGAA TAACAATCAA ACTCAACTGT TCCTAGAACA TAGTCTACTG 900 

ACGGCCAATA CCACTAAAGG AATCTATGCT GGAGTCTGTA TTTCTGTCTT GGTGCTTCTT 960 

GCTCTTTTGG GTGTCATCAT TGCCAAAAAG TATTTCTTCA AAAAGGAGGT TCAACAACTA 1020 

AGTGTTTCAT TTAGCAGCCT TCAAATTAAA GCTTTGCAAA ATC-CAGTTGA AAAGGAAGTC 1080

iTATCTA CATTGAGAAT AGTCTTTATG CCACGGACTA AGACCCAGTG 1140 

3 AGAGTTTACG CCCATGACTG CAGAAGACIG AACAGGTATC AGCACATCAG 1200 

["CCAAGA CAATTTTTCT GTTTCAGTTT CATCTGGCAT TCCAACATGT 1260 

T GGGTAGAGTA ACTCTCCCAC TCCAAACTGT GTATAGICAA CCTCATCATT 1320 

TTTGTTT TGCTAAAACT GGCTCAATCC TTCTGAICAT TGCAGAGTTT 1380 
TCTCTCAAAC ATGAACACTT TAGAATTGTA TGTTCTCTTT AGACCCCATA AATCCTGTAT 



I I I I I I 

MHPQWILSL ILHLADSVAG SVKVGGEAGP SVTLPCHYSG AVTSMCWNRG SCSLFTCQNG 
IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL TIENTAVSDS GVYCCRVEHR GWFNDMKITV 
SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV PTTTTVPTTT VPTTMSIPTT TTVPTTMTVS 
TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM PLPRQNHEPV ATSPSSPQPA ETHPTTLQGA 
L YSYTTDGNDT VTESSDGLWN NNQTQLFLEH SLLTANTTKG IYAGVCISVL 
V QQLSVSF3SL QIKALQNAVE KEVQAEDNIY IENSLYATD 



Seq I 



I I I I I I

i TCCGG A AGGG ( AG&CTTCGC GGAACGCTCT CGGCGCCAGG ACT 

AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 
GTGTGCCCAT GAGTAAGAGC AAATGCTCCG TGGGACTCAT GTCTTCCGTG GTGGCCCCGG 
CTAAGGAGCC CAATGCCGTG GGCCCGAAGG AGGTGGAGCT CATCCTTGTC AAGGAGCAGA 
ACGGAGTGCA GCTCACCAGC TCCACCCTCA CCAACCCGCG GCAGAGCCCC GTGGAGGCCC 
AGGATCGGGA GACCTGGGGC AAGAAGATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 
TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATGGT GGCGGTGCCT
TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 
TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 
TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 
TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 
TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 
GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 
AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCCGCGGT 
GGCACCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 
C CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC 



TGAGCGTTGA CTTCTACCGG C 

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200

TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGC-AAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 13 80 

TGGGTATCGA CAGCGCCATG GGTGGTATGG AGT CAGTGAT CACCGGGCTC AT CGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500

TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 1620 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACCGGGCAG CGC-CCCAGCC 1680 

TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTC-CCCATC TATGCGGCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACI GGCCTACGCC ATTGCACCCG 1920 

AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC 2100 

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160

G GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 
i CTTCATGCTG 

3 ATCACGATCC TTGTAGACGC ACCTGCTC-AG A 
CT CACAGTAG CTTCCTAGAC CATTTACTTT G CCCATATT A AAAAGCCAAG T 
GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA T 
GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA G 
GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 
ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 
GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG 
GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 
AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940 

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060 

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCC-TGCCA 3240 

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300

ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 3360

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAC-CTG CTTTCCATGG 3420

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480 

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540 

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGC-AGTGTA ATGCTCGGTG GGAGCCGTCA 3600 

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3660 

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720 

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780 

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840 

AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3900
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 



1 11 21 31 41 51 

| | | | | |

L MSSWAPAKE PNAVGPKBVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 

iGFAVDL ANVWRFPYLC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 
GQFNREGAAG VWKICPILKG VGFTVILISL YVGFFYNVII AWALHYLFSS FTTELPWIHC 
NMSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYFERG VLHLHQSHGI DDLGPPRWQL 
TACLVLVIVL LYFSLWKGVK TSGKWWITA TMPYWLTAL L 
DFYRLCEASV WIDAATQVCF SLGVGFGVLI f 

GFWFSFLGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAVU F 
DSAMGGMESV ITGLIDEFOL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 
GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFWWSI 

VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVPIYAAYKF CSLPGSFREK I, 

RELVDRGEVR QFTLRHWLKV 
GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720

CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200

C AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGC-TA ACTGTCCTGT CCTC-CTCATT 1500 



MAPLCPSPWL PlilPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSESEGSLKL EOLPTVEAPG 120 

DPQEPQHNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLS EGPEENSAYE QLLSRLEEIA 300

EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IHTVFNQTVM LSAKQLHTLS 360

DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 

GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 
Seq ID NO: 307 DNA sequence
Nucleic Acid Accession # 
Coding sequence: 76.- 158 



CCTTCCAGAG CAAGCATGGA GCAACAGGAT CAGAGCATGA AGGAAGGGAG GCTGACGCTT 
GTGCTTGCCC TGGCAACCCT GATAGCTGCC TTTGGGTCAT CCTTCCAGTA TGGGTACAAC
GTGGCTGCTG TCAACTCCCC AGCACTGCTC ATGCAACAAT TTTACAATGA GACTTACTAT 
GGTAGGACCG GTGAATTCAT GGAAGACTTC CCCTTGACGT TGCTGTGGTC TGTAACCGTG
3 GTTTATCGGA TCCCTCCTGG TCGGCCCCTT GGTGAATAAA 







3 CTGCTGGGCT T 

G TGTCCTGGAT G 

G TCCCATACCC 

GCGCTGCTCA TCACTGAGAT CTTCCTGCAG T 
GGCAGTGTGC ACTGGCTCTC CAACTTCACC GTGGGCTTG 

GGCCTCGGCC CGTACAGCTT CATTGTCTTC GCCGTGATCT GCCTCCTCAC CACCATCTAC 1440
ATCTTCTTGA TTGTCCCGGA GACCAAGGCC AAGACGTTCA TAGAGATCAA CCAGATTTTC 1500 
ACCAAGATGA ATAAGGTGTC TGAAGTGTAC CCGGAAAAGG AGGAACTGAA AGAGCTTCCA 1560 
CCTGTCACTT CGGAACAGTG ACTCTGGAGA GGAAGCCAGT GGAGCTGGTC TGCCAGGGGC 1620 
TTCCCACTTT GGCTTATTTT TCTGACTTCT AGCTGTCTGT GAATATCCAG AAATAAAACA 1680 

ACTCTGATGT GGAATGCAGT CCTCATCTCC AGCCTCCCCA CCCCAGTGGG AACTGTGCAA 1740
AGGGCTGCCT TGCTGTTCTT GAAGCTGGGC TGTCTCTCTC CATGTTGGCC TGTCACCAGA 1800
CCCGAGTCAA TTAAACAGCT GGTCCTCCAC TTTGCTGGTT CAGCCTTCGT GTGGCTCCTG 1860
GTAACGTGGC TCCACCTTGA TGGGTCAACC TTTGTGTGGC TCCTGGTAAC ATAACAACAA 1920
CAGTTACTAT AGTGGTGAGA TGGAAGGAAT CAAATTTTGC CACAGAAACT AACTCGGTGG 1980

CCCCAACAGG TCTTCCGGGG CCATGGGCAT TTGTTTAGAG CCAAATTCAT CCTCTTACCA 2040

GATCCTTTTC CAGAAATACC TGTCTAGGAA GGTGTGATGT CAGAAACAAT GACATCCAGA 2100 
AAGCTGAGGA ACAGGTTCCT GTGGAGACAC TGAGTCAGAA TTCTTCATCC AAATTATTTT 2160 
GTTAGTGGAA AATGGAATTG CTTCTGTGTA GTCAATAAAA TGAACCTGAT CACTTTTC 

MEQQDQSMKE GRLTLVLALA TLIAAFGSSF QYGYNVAAW SPALLMQQFY NETYYGRTGE
FMEDFPLTLL WSVTVSMFPF GGFIGSLLVG PLVNKFGRKG ALLFNNIFSI VPAILMGCSR 
VATSFELIII SRLLVGICAG VSSNWPMYL GELAPKNLRG ALGWPQLFI TVGILVAQIF 
GLRNLLANVD GWPILLGLTG VPAALQLLLL PFFPESPRYL LIQKKDEAAA KKALQTLRGW 
DSVDREVAEI RQEDEAEKAA GFISVLKLFR MRSLRWQLLS 1 1 VLMGGQQL SGVNAIYYYA 
DQIYLSAGVP EEHVQYVTAG TGAVNWMIF CAVFWELLG RRLLLLLGFS ICLIACCVLT 
AALALQDTVS WMPYISIVCV ISYVIGHALG PSPIPALLIT EIFLQSSRPS AFMVGGSVHW 
LSNFTVGLIF PFIQEGLGPY SFIVFAVICL LTTIYIFLIV PETKAKTFIE INQIFTKMNK 






Coding sequence: 138..719

| | | | | |

GGCTGGTCCC CTGACAGGTT GAAGCAAGTA GACGCCCAGG AGCCCCGGGA GGGGGCTGCA 
GTTTCCTTCC TTCCTTCTCG GCAGCGCTCC GCGCCCCCAT CGCCCCTCCT GCGCTAGCGG 
AGGTGATCGC CGCGGCGATG CCGGAGGAGG GTTCGGGCTG CTCGGTGCGG CGCAGGCCCT 
ATGGGTGCGT CCTGCGGGCT GCTTTGGTCC CATTGGTCGC GGGCTTGGTG ATCTGCCTCG 
TGGTGTGCAT CCAGCGCTTC GCACAGGCTC AGCAGCAGCT GCCGCTCGAG TCACTTGGGT 
GGGACGTAGC TGAGCTGCAG CTGAATCACA CAGGACCTCA GCAGGACCCC AGGCTATACI 
GGCAGGGGGG CCCAGCACTG GGCCGCTCCT TCCTGCATGG A 
AGCTACGTAT CCATCGTGAT GGCATCTACA TGGTACACAT CCAGGTGACG C 
GCTCCTCCAC GACGGCCTCC AGGCACCACC CCACCACCCT GGCCGTGC-GA ATCTC-CTCTC 
CCGCCTCCCG TAGCATCAGC CTGCTGCGTC TCAGCTTCCA CCAAGGTTGT ACCATTGCCT 
CCCAGCGCCT GACGCCCCTG GCCCGAGGGG ACACACTCTG CACCAACCTC ACTGGGACAC 
TTTTGCCTTC CCGAAACACT GATGAGACCT TCTTTGGAC-T GCAGTGGGTG CGCCCCTGAC 
CACTGCTGCT GATTAGGGTT TTTTAAATTT TATTTTATTT TATTTAAGTT CAAGAGAAAA 
AGTGTACACA CAGGGGCCAC CCGGGGTTGG GGTGGGAGTG T 
AGGACAAGAG AAGGCATTGA GCTTTTTCTT TCATTTTCCT ATTAAAAA
MPEEGSGCSV RRRPYGCVLR AALVPLVAGL VICLWCIQR FAQAQQQLPL ESLGWDVAEL
QLNHTGPQQD PRLYWQGGPA LGRSFLHGPE LDKGQLRIKR DGIYMVHIQV T: 
SRHHPTTLAV GICSPASRSI SLLRLSFHQG CTIASQRLTP LARGDTLCTN L 



Seq ID NO: 311 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..3978



ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG O 
TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 
TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 
ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 



GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 
ACACGCGTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 
CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 
GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 
GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 
TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 
CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 
CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG
CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 
3AGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 
TTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG G 
A CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACI 
TTTGTCCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC C 

ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTCCACAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860 

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100

TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCT GGTTGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640

ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA GCTC-CATCAC CTATACTTCA 3060

TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180 

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240 

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360 

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA C-GTGATTGAG 3900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT C-TTACTAGCA 3960 
GCAGAAGTCA GATTGTAG
1 11 21 31 41 51

| | | | | |

MVGEGPYLIS DLDQRGRRRS FAERTOPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL 
TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHWWKFQR 
TRVLMDIVAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCIALF ATEFTKVFFW 
ALAWAINYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL 
PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAFRRS AILVTDKRVQ 
TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVL 
TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 
PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK 
GATGPEEQSD SLKSVLHSIS FWRKLCRYP EAQLLAWRWP AVFVC-RIIRG YRPHGFSAKD 
KDESRRLLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGWAVUG 
TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 
NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTWLVTHQ 
V ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 
E DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLHLDKGSR MTCGPQGNRT 
3 IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS 
PMSFFDTTPT GRLMNRFSKD HDELDVRLPF HAEMFLQQFF MWFILU 
ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGIIHAYG K 
SKGLSLSYII QLSGLLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP L 
PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA 1140 
SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF 1200 
MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKIILLDEAT ASMDSKTDTL 1260 
VQNTIKDAFK GCTVLTIAHR IjNTV LHCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA 1320 



| | | | | |

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360

AAGACGCTCA TGAAGAACGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCCCGGGCGT GAACCACCGC 480

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCAC ( ( GAT 6 30 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCi AC ( T ' CCCO 1CTGGTTACC 7 80 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 



E TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 
RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 
KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMOD 
QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSM3 YSQQGTPGMA 
LGSMGSWKS EASSSPPWT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 
HYQSGPVPGT AINGTLPLSH M 



I I I I I 

TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC 
CTGGCTITCA GCTCCTGGAG TCTGTGCTCA 
AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA A 

3 AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 
A ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 
TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC A 
TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG A 
TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 
GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTC

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT T. 
ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 
ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA T. 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 
MMAGMKIQLV CMLLLAFSSK SLCSDSEEEM KALEADFLTN M
VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Nucleic Acid Accession #: NM_006536.2
Coding sequence: 109..2940



ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC 
ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT 
AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 
CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 
ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG A 
CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG T 
ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG A 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660
TACCTAGGAA A 



MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 
V FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH
GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR KGVFDEYNND 
KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 
MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSF? MNGTELPPPP 
TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 
RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGK.il YGSVMILVTS 
GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 
SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPSIIL 
FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN MTHHSLQALK VTVTSRASNS 
AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGO PVTLRLLDDG 
AGADVIKNDG IYSRYFFSFA ANGRYSLKVH WHSPSISTP AHSIPGSHAM YVPGYTANGN 
IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 
LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 
PQISTNGPEH QPNGETHESH R 



Seq ID NO: 319 DNA sequence
Nucleic Acid Accession # 
Coding sequence: B2..360i 

| | | | | |

GCTTTCAGGC GATCTGGAGA AA'"^" ™*r.^r-»n^r- n*i 
GGATCACCCC ATTGGCTGAA GA 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT A 
CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 
ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 
CCTCACAACT ACT AC AGT C A CCGAGTAGAG AATGTGGCTT CATCCTCCGG C 



TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480
GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540
ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600
CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660
ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720
ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780
CCTCCCAGCG CCTACTATCC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 
GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 
CAGGTCCAU HTGTCTGTGT I TGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 96 0 
GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGACr ( J A ATGAATGC 102 0 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 
AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 
GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260
CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320
AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 
CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 
AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 
CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 
GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680
TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740
CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800
CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860
GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920
GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980
ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040
GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 
GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 
ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220
GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280
GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340
CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400
ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460
ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520
TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580
CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640
AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700
GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760
CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820
GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880
GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940
CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 
CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 
CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 
AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180
AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 
GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GC-CATTGAGT 3300 
GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 
GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540 
GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3660
GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 
GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG T AAAA^T GAA AAGCCAAATA 3960 
MRPFFLLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRF LRASSTCGLT KPETYCTQYG 60

EWQMKCCKCD SRQPHNYYSH RVEMVASSSG PMRWWQSQND VNPVSLQLOL DRRFQLQEVM 120 

MEFQGPMPAG MLIERSSDFG KTWRVYQYLA ADCTSTFPRV RQGRPQSMQD VRCQSLPQRP 180 

NARLNGGKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN FTRLAPVPQR GYHPPSAYYA 240 

VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR 300 

PWRPAEGQDA HECQRCDCNG HSETCHFDPA VFAASQGAYG GVCDMCRDHT EGKHCERCQL 360

HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQG2RC DLCKPGFTGL 420 

TYANPQGCHR CDCNILGSRR DMPCDEESGR CLCLPNWG? KCDQCAPYHW KLASGQGCEP 480 

CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDFRG 540 

TEGPGCDKAS GRCLCRPGLT GPRCDQCQRG YCNRYPVCVA CHPCFQTYDA DLREQALRFG 600

TLQGLQLDLP LEEETLSLPR DLESLDRSFH GLLTMYQRKR EQFEKISSAD PSGAFRMLST 720 

AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDL 780 

TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTACGSRCRG VLPRAGGAFL MAGQVAEQLR 840 

GFNAQLQRTR QMIRAAEESA SQIQSSAQRli ETQVSASRSQ MESDVRRTRL LIQQVRDFLT 900 

DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQDIARAR 960 

RLQAEAEEAR SRAHAVEGQV EDWGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ 1020 



MLRSADLTGL EKRVEQIRDH INGRVLYYAT CK 
TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG CAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCACA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTI CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCIG AAAGTGGTGA 1080 
AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG T. 
\ CCAATCAGTT ATCTCTCGAT A 



AAAAAGGCAT i 

A AAAACTGCTG AAATCAAATT TGTCAAAAA' 



CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC C 
CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC A 
CTAGAACACT GAATAATAGA TACACTGGCC CCTATAC 
TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC T. 
CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA G. 
ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGi 
GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT
CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 
TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG G 
GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG G 
GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 

T CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG : 
«3GAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 
GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 
CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 



C TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 
CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 
MMGLFPRTTG ALAIFWVIL VHGELRIETK GQYDEEEMTM CjQAKRRQKRE WVKPAKPCRE 
GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF WDKNTGDIN ITAIVDREET 
PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSO.Q IFMGEIEENS ASNSLVMILN 
ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDRSQA SSYRLWSGA 
DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFCV TDLDSEYTDN 
WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQIi QSVKLSIAVK NKAEFHQSVI 
SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 
SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 
VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTIMNRY TGPYTFALED QPVKLPAVWS 
ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NKGICGTSYP 
TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 
GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 
MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 
DGAISMMFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 
LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 
LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQtJVI 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 323 DNA sequence 

Nucleic Acid Accession #: XM_058069.2 

Coding sequence: 1..1413 

1 11 21 31 41 51 

| | | | | | 
ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 
AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 
TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 
AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 
ACATCTACCC TGGAGATCAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 
AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 
TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 
TGGACTAATC TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 
GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 
CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 
GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 
GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 
AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 
CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 
CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG T 
TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC C 

T TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 
T TAAAGATGAC A 



GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTl 

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 

Seq ID NO: 324 Protein sequence 



i i 1 r i 1 f T 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FEAFDGKGGI 
LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGKSS DPKAVMFPTY 
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYNLISNLRP 
EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 325 
GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 
CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 
TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 
ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 
CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 
ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATCA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTG( 3 T T -/ T 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 
TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 
CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 
ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 
TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 
TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 
AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 
TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 
GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 
ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 
AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 
ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 
TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 
GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 
GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 
GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 
TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 
CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 
ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 
ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 
TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 
AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 
CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 
CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 
AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 
TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 
AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 
CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 
TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 
AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 
CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 
TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 
TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 
GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 
TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 
TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA A 
CTGCTTAAAA TAAGCAAAAA TTGGATGCAT A 



G TAATAAAACA ATATTAACTT GGTTTCTTGT T 
TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 



| | | | | | 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 
KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 
PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 
TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 
KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 
KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 
GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCC 
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 
A GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 
A AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 
ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 



T TAAATTTGTT TTATATAGAA AGAGACACTG G 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTG2 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

3 TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

A GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GC-GGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 

TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 

TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 

ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 

ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 

AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 
AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

TATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

A TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

T GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 

ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
TATATATAAT CCCGAAACAT G 
MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 
KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 
PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 
TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 
KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 
KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR 



Seq ID NO: 329 DNA sequence 

Nucleic Acid Accession #: NM_016583.2 

Coding sequence: 72..842 



TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGT 
CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 
ATCCAGCCCT GCCCTTGAGT CCCACACCTC TTGCAGGAAG CTTGACAAAT GCCCT A( ( A 
ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTCGACATCC 
TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 
CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 
AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 
TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 
TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 
TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 
CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 
CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 
GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 330 Protein sequence 
MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 
SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVKDKQ ERIHLVLGDC 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 
[i QFVIKV 
CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 
TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 
TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 
TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 
TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 
TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 
GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 
CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 
CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 332 Protein sequence 
Protein Accession #: NP_004354.1 

1 11 21 31 41 51 

| | | | | | 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT EGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120 
TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 
AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360 

QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420 
SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480 
NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 
LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 
PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 



Seq ID NO: 333 DNA sequence 




AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTCGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 334 Protein sequence 
Protein Accession #: NP_008883.1 



1 11 21 31 41 51 

| | | | | | 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 
3 TMFYWSRIEY 



CGCTGGGACG C 
TC-TCAAAGGC CGTGTTCCAT 
A AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCA; 
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC A 
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 






Seq ID NO: 337 DNA sequence 
Nucleic Acid Accession #: NM 
Coding sequence: 71..2560 



AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 
45 CTTCt =, C( GAi G( GG A 5CAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 
AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 
CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 
GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 
CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 
CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT
GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC
3 AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 
3 GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 
T GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 
T GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 



GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCGCCG TCCC1 I CCC1 T 1 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580
GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180
TAAAGAAACT TTTCCCAGAA AAAAA 



I I I I I 

MGLPEGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGOA L 
QEPALPSTDN DDFTVRNGET VQEERSLKER NPLKIFPSKR ILRRHKRDWV V 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 
DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 
TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 
PTSTATITOH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 
VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TUIRATVCDC HGHVETCPGP WKGGFILPVL 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDMVFY YGEEGGGEED QDYDITQLHR 
GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



Coding sequence: 



ATGAGGCTCC AAAGACCCCG ACACGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 
TA CGG=C AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 
AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 
CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT
GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 
TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 
ACAGAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 
AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 
CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 
TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 
AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 



Seq ID NO: 340 Protein sequence



MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 
LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRW CHVCERENTF SCQHPRRCKW 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLSEPMPFFY LKCCKIRYCN 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SIiS






CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 
TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 
GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 
CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 
CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC
ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC
CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC
GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC
CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GAAAGGGTGA
TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT
TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT
CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC
CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA
GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTTCGGG AACTATCACC T " 

CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT

GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 

CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 

TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 

CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 

CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA

GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC

Seq ID NO: 342 Protein sequence 
Protein Accession #: XP_035292.2

1 11 21 31 41 51 

| | | | | |

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 
GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 
LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL PPTCPVPEEA AKLVACLCVL 
LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN PSPEGTKLDV 
GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 
STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 
SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF S FFNWLCVAL AIIGMIWLRH 
iLPVFFI LACLFLIAVS FWKTPVECGI GFIIILSGLP VYFFGVWWKH 
? STTVLCQKLM QWPQET 



Seq ID NO: 343 DNA sequence



1 11 21 31 41 51 

| | | | | |

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60

TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGAOGGGTGG GTCCACCATG AACTGGAGTA 180

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420

CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660

ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 

Seq ID NO: 344 Protein sequence



MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA E 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG E 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP 180



| | | | | |

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 
CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 
CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 
CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 
GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 
TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA
CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 
GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 
ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 
TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 
ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 






Seq ID NO: 347 DNA sequence

Nucleic Acid Accession #: NM_006783.1

Coding sequence: 1..78S 

| | | | | |

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 
GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC
CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 
AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG
CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 
GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 
ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 
TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 
TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 
TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 
ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG
AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 
CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 



| | | | | |

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILVVAA QEVWGDEQED FVCNTLQPGC

KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED

IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC

FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 
QNEMHELISD SGQNAITGFP S 



AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 
ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 
CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 
AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 
TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 
AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 
AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 
GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 
CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 
TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 
TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 



MDIPQTKQDL ELPKLAGTWH SMAMATNNIS

WENNSCVEKK VLGEKTGNPK KFKINYTVAN 
D EIMQGFIRAF RPLPRHLWYL 







CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC G 
AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 
TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC
TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360
GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 
TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 
GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 
GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT C-GCCATTTTT GGCCCCCCTT 1320 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380
GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440
AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640
TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700
TACARAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820
CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 
GGGGAGCAGA CAAACATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 
TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480
CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTI 



Seq ID NO: 352 Protein sequence



GLPRLVCAPL LAACCCCPRV AGVPGEAEQP APELVEVEVG STALLKCGLS QSQGNLSHVD 
WFSVHKEKRT LIFRVRQGQG QSEPGEYEQR LSLQDRGATL ALTQVTPQDE RIFLCQGKRP
RSQEYRIQLR VYKAPEEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL
KEEKNRVHIQ SSQTVESSGL YTLQSILKAQ LVKEDKDAQF YCELNYRLPS GNHMKESREV
TVPVFYPTEK VWLEVEPVGM LKEGDRVEIR CLADGNPPPH FSISKQNPST REAEEETTND
NGVLVLEPAR KEHSGRYECQ AWNLDTMISL LSEPQELLVN YVSDVRVSPA APERQEGSSL
TLTCEAESSQ DLEFQWLREE TDQVLERGPV LQLHDLKREA GGGYRCVASV PSIPGLNRTQ 
LVKLAIFGPP WMAFKERKVW VKENMVLNLS CEASGHPRPT ISWNVNGTAS SQDQDPQRVL 
STLNVLVTPE LLETGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG LSTSTASPHT
RANSTSTERK LPEPESRGVV IVAVIVCILV LAVLGAVLYF LYKKGKLPCR RSGKQEITLP
PSRKTELVVE VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRH

Seq ID NO: 353 DNA sequence 



G CGGTAGAATC TTCCCAGTAG G 

GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG
CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC
TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCCGGAT GACCCGGGCT 
TCGGCCCCCA CCAGAG ACT C GAGAAGCTTG ATTCTTTGCT CICAGACTAC G*""" "-' 
CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATG 360
TAGAAACACT ACTAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 420 
GTACTGAACG TTTTTCACAA AATTTCAAGG TCGTGGTGGT GGATGGTAAA AACGAAAGCG 480 
AGTACACTGC AAAATGGCAG GACTTCTTCA CTGGACACGT GGTTGGTGAG CCTGACTCTA 540 
GGGTTCTAGC CCACATAAGA GATGATGATG TTATAATCAG AATCAACACA GATGGGGCCG 600
AATATAACAT AGAGCCACTT TGGAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 660 
TTTATAAATC TGAAGATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 720
ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGGGTT AGTAGACAGA GAACCACCTG 780 
AAGAGCTTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACACGTGTA 840 
AATTATTGGT GGTAGCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGGG GAAGAGAGTA 900 
CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 960
CATGGGATAA TGCAGGTTTT AAAGGCTATG GAATACAGAT AGAGCAGATT CGCATTCTCA 1020
AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AGCACTACAA CATGGCAAAA AGTTACCCAA 1080 
ATGAAGAAAA GGATGCTTGG GATGTGAAGA TGTTGCTAGA GCAATTTAGC TTTGATATAG 1140 
CTGAGGAAGC ATCTAAAGTT TGCTTGGCAC ACCTTTTCAC ATACCAAGAT TTTGATATGG 1200 
GAACTCTTGG ATTAGCTTAT GTTGGCTCTC CCAGAGCAAA CAGCCATGGA GGTGTTTGTC 1260 
CAAAGGCTTA TTATAGCCCA GTTGGGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 1320 
GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGGAAGC TGACCTGGTT ACAACTCATG 1380
AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG TCTAGCAGAA TGTGCCCCGA 1440 
ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGTGAGTGGC GATCACGAGA 1500 
ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT GAAAGTAAGG 1560 
CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG GAACTCGAGG GTGGATGAAG 1620 
GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA CACCTGCTGC AACAGCGACT 1680 
GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT AAAAACTGTC 1740 
AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA TGCTACTTGC AAAGGCGTGT 1800 
CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA AATGACACTG 1860 
TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC GAGAGGGAAC 1920
AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG TGCTGCAGGG 1980 
ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA AAAGAACTTA TTTTTGAGGA 2040 
AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAG AAACGAGTAC 2100 
AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA 2160 
AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT CTCCTTGATA TTTTGGATTC 2220 
CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA TAAACAGTAT GAATCTCTGT 2280 
CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTGCA TCGGTTCGCA 2340 
TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT GCAGCCTGCC CCTGTGATCC 2400 
CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT GGACACCATC CAGGAAGACC 2460 
CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA GGACCCCTTC CCAAATAGCA 2520 
GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC GGTCGCCAGA AGTGAAAAGG 2580 
CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG CAAAGAAACA GAGTGCTAAT 2640
TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT TTATAGATTT GACCTACAAA 2700
TCAATCACAG CTTCTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC TGGTCATGTG 2760 
TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTCCT TTTGAAAAGG 2820 
TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAATATCTT 2880
TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG GTTATGAATA TTTACGTTTT 2940
TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG GAAATGATCA GTTTTTTTTT 3000 
ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA GAAAAGTGGA 3060 
ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA GATAAATTTA 3120 
GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT TTTTATGTAG 3180 
CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC TTCATAATTC 3240 
TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA TGGTAGCCAG 3300
TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG TTTTTCTGTA 3360 
TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT AGAAAATTCA 3420 
CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG GCTGAGGTTG 3480 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 



Seq ID NO: 354 Protein sequence



| | | | | |

MRQSLLFLTS WPFVIAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 
QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKWWD GKNESEYTAK WQDFFTGHW 
GEPDSRVIiAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 
V DREPPEELVH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 
A GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 



AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLrJ I FJKGTU3L A 
HGGVCPKAYY SPVGKKHIYL 1 """" " 

AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQSCF QERSNKVCGN 
SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFSTA QKKCQEAINA 
TCKGVSYCTG NSSECPPPGN AENDTVCLDL GKCKDGKCIP FCSREQQLES CACNETDMSC 
KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVG FCDMNGK CEKRVQDVIE RFWDFIDQLS 
INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCVDKKLDK O.YESLSLFHP SNVEMLSSMD 
SASVRIIKPF PAPQTPGRLQ PAPVIPSAPA APKLDHQRMD TIQEDPSIDS HMDEDGFEKD 
PFPNSSTAAK SFEDLTDHPV ARSEKAASFK LQRQNRVNSK ETEC 

Seq ID NO: 355 DNA sequence
ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT
CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG ATATTCTCTC
TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT 
AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 
TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA * 
\ AAATGGCAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC C 



ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720

TTTAAAAGTG GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840

ATTATTGGTG GTAGCAGATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900

AACTACAAAT TACTTAATAG AGCTAATTGA CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020

GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 

TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT TTGATATGGG 1200 

AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 

AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 

CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 1380 

ATTGGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 

TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 1860 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 

GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040 

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220 

TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

3 ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

3 CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 

C TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

:AG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820

AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880

TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 

\ ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA "™ 



GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060 

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120

TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 

TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300

TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 3360 

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 



1 11 21 31 41 51 
MRQSLLFLTS WPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKKDL
QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKWWD GKNESEYTVK WQDFFTGHW 
GEPDSRVIAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKKVSRLQS
PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKKTCKLLW ADHRFYRYMG 
RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYHM 
AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLFTY QDFDKGTLGL AYVGSPRANS 
HGGVCPKAYY SPVGKKMIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 
AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 
SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFSTA QKKCQEAINA 
TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC 
KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 
INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCV 



T GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 
GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 
CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 
CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 
\ CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 
GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 
CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 

CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

3AGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 



I I I 

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY R 
RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF E 
ITYWIQNYSE DLPRAVIDDA FARAFAIiWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 
FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVWPI RFGNADGAAC HFPFIFEGRS 
YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADCKPCQF PFIFQG<JSYS 
ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP TFL I 1 
CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHAICLD HS SVPEALMY 
PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 
PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDAOIVNI FDAIAEIGNQ LYLFKDGKYW 
RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVKVY TGASVLGPRR 
LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 
THDVFQYREK AYFCQDRFYW RVSSRSEIHQ VDQVGYVTYD ILQCPED 

Seq ID NO: 359 DNA sequence






51 



CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360

\ GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

T CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380
GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA T 
TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG C

C AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560



TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCG3 C 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGGGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 
CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660

CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200 

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380 

ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

C TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 

3 TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

3 TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

T ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

A TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 



Seq I 



I I I I I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60
CNTQAELLAA GCQRESIVVM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120
EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180
PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 
QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300
TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360
EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420 
THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ K
CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 
FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR
CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRT



LKRAEEVVVR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS F

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 

SGNLKGRDVV RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960

LVEAIDVPAG TATLGRRLVN ITIIKEQARD VVSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGKPHGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200 

AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380 

FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440 

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500 

ELHRLNIPNP AQTSVVVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740 
LSTHMDQQFF QT 

Seq ID NO: 361 DNA sequence 



1 11 21 31 41 51 

III 
GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 

AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 120 

CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180 

GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240 

GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300

GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360 

CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 

ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480 

TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 540 

CTCAT 'I -\AGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 600 

CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660

CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720 

T CCCAGTTCCT T GGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 780

G TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840 
A AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900 
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960 

TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020 

TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080 

TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 1140 

CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200 

TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260

GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 



MKHVLNLYLL GVVLTLLSIF VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS



1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 

TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 
ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 
AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC 

3CTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 

T TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 

r TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 

T GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 



Protein Accession #: NP_076404

I I I I I I

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN
GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL
FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT
NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 
ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 



1 11 21 31 41 51 

I I I I I I 

ATGTCTCTCG AGCAGAGGAG TCCGCACTGC AAGCCTGATG AAGACCTTGA AGCCCAAGGA 
GAGGACTTGG GCCTGATGGG TGCACAGGAA CCCACAGGCG AGGAGGAGGA GACTACCTCC 
TCCTCTGACA GCAAGGAGGA GGAGGTGTCT GCTGCTGGGT CATCAAGTCC TCCCCAGAGT 
CCTCAGGGAG GCGCTTCCTC CTCCATTTCC GTCTACTACA CTTTATGGAG CCAATTCGAT 
GAGGGCTCCA GCAGTCAAGA AGAGGAAGAG CCAAGCTCCT CGGTCGACCC AGCTCAGCTG 
GAGTTCATGT TCCAAGAAGC ACTGAAATTG AAGGTGGCTG AGTTGGTTCA TTTCCTGCTC 
CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA TGCTGGAGAG CGTCATCAAA
AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTGATC 
TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 
CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA GCATGCCCAA GGCCGCCCTC
CTGATCATTG TCCTGGGTGT GATCCTAACC AAAGACAACT GCGCCCCTGA AGAGGTTATC 
TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG AGCACATGTT CTACGGGGAG 
CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT ACCTGGAGTA CCGGCAGGTG 
CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT CCAAGGCCCA CGCTGAAACC 
AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAGAGCC CATCTGCTAC
CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG GAGTCTGA 



I I I I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 
PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL
HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 
LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE
PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY
PSLYEEVLGE EQEGV 

Seq ID NO: 367 DNA sequence 

Coding sequence: 86..1126



GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540 

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600 

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720 

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840 

CACATCTGTC ACCACTTCTA CCTCGGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080 

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260 

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 
GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTC
TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 
AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 
GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 
ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 
CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 
TTGTATAGTG AAAAAAAA 



1 11 21 31 41 51 
MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV
CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT
SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPVVS CYNASDHVYK GCFDGNVTLT
AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 
LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 



I I I 

- AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60 

_G GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120 

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180 

CTTTTTGCCT TCCTGGAGCA CCGGCGCATC CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 300 

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480 

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600 

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

CATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTCTGTG 840

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCCCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA

Seq ID NO: 370 Protein sequence 
MPVQLTTALR VVGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60

LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKVVM 120

VVDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDVVRA 180

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESVVTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420

IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIVVNFIG 480

LIPVSIWVAV LLEGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540
KKPEQYSLAF AEV

Seq ID NO: 371 DNA sequence 
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 
GATAAGGTGC CCTTGCATGC TTCTCTGCCS ( T T TTTGTT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 
CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 
AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 
CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560
CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620
TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680
TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 
CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 
GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTCCAGTATT CACCTAAAGT 7560

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 372 Protein sequence: 
Protein Accession #: built from XP_031379



MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR 1680

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920

AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 

HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160






NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220
VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO: 373 DNA sequence

Nucleic Acid Accession #: built from NM_002851 
Coding sequence: 148-4518 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040

GAACATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGAGGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 
GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTT ~ 
TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT G 
GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT T 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440
G TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500

. ... r TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 374 Protein sequence

Protein Accession #: built from XP_031379

1 11 21 31 41 51 

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

P SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

? ICLWLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

E EFETLKEFYQ EVOSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020

GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR 1080

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV 1260

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320

S KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380



Seq ID NO: 375 DNA sequence 
Nucleic Acid Accession #: EOS sequence 
Coding sequence: 148-4494



CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

^TCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

..A AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 
ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 
CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 
AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 
CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 
GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 
TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 
AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 
GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 
GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 
GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATGGAACTTC TTCTATCATC 3720 
CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 
AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 
CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 
GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 
CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080
TATCTACTTC AACTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAACAAGAAG CTGCCAATAG GGATGGGCCT 4200
\ TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260
iAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320

T CTTTCCTCAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 
GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 
AAGGGGTGGG GGGACTCACA TCTCAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 
ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 
GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 
TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 
GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 
AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 
GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 
GTAAAT ACT G CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 
ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 
CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 
AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 
TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340



Protein Accession #: EOS sequence


1 11 21 31 41 51 

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND
jITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 



ENISQGYIFS SENPETITYD V 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020
GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK A

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320

CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440



1 11 21 31 41 51 

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG
TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 
^ GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 

2 GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 
I CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 
I GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 

. T CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 
GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 

3 AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 
T CAATAATTTG CTACCCAATA TGAGTTATGT 1 



3ATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500
ACCAAATCAC GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620
AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680
CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 
AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220
CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320

ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Protein Accession #: EOS sequence

1 11 21 31 41 51 

| | | | | |

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60
LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120
PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS
SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRVV YDTMIEKFAV LYQQLDGEDQ
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF
PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK
TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP 
NDCSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG S 
ISENISQGYI FSSENPETIT YDVLIPESAR KASEDSTSSG SEESLKDPSM Fj 
DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600
FPTEVTPHAF TPSSRQQDLV STVNVVYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660
IPLVIVSALT FICLVVLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720
IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780
IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840
NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900
KGSQKGRPSG RVVTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VVVHCSAGVG 960
RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET 1020
EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080
PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 1140
VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 1200 
YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260 
LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320 
GAALPDGNIA ESLESLV 



Seq ID NO: 379 DNA sequence
| | | | | |



CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA A 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA T""""'-™ 



AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGCA 3900
GAAGGCACAC ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140
GCTGAAGAAC ACAAATGTCT ATCTAATCAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460
AAAAAAAAAA AAAAAAAAAA A 




MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT 1080

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP 1320

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTCAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CT CAT ACACT GGAAAGGAAG AGATTCATGA AGCAGTTTCT 10 80 
ACTTCAGAAC CACAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATCATTGAGA AGTTTGCAGT TTTGTACCAG 1200
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
CAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820
TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 
CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 
GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 
GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 
ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620
GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 
CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740
CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 
CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 
ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 
TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 
G CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 
^AGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 
-w-GTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 
TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 
TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 5280 
GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 
AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 
AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 
GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520 
AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580
AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640 
GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 
CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 
AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820 
CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 
ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 
TCCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 
ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060 
AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 
ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 
CTCATTCCTC CACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC T 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA A 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC G 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT G 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA A 
MRILKHPLAC IQLLCVCRLD WANGYYRQQR KLVEEIGKSY TGALNQKNWG KKYPTCNSPK
QSPINIDEDL TQVNWLKKL KPQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV
FKASKITFHW GKCNNSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLKL LPNSTDKYYI YNGSLTSPPC 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV THERPRWYD TMIEKFAVLY QQLDGSDQTK 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTSLPPHT VEGTSASLKD 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMSG NVWPPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLSM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPS NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260

TISYASEKYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RESDGLSIHK 1440 

CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLHE 1560 

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620 

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LWLVGILIY WRKCFQTAHF YLEDSTSPRV 1680

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSSEYGN FLVTQKSVQV 1860 

LAYYTVRNFT LRNTKIKKGS QKGRPSGRW TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920 

KRHAVGPVW HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980

YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040 

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100 

TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL MAEEHKCLSM 2160

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIKEE AANRDGPMIV 2220 

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280
LVSTRQEENP STSLDSHGAA LPDGNIAESL ESLV 

Seq ID NO: 383 DNA sequence
Nucleic Acid Accession #: NM_005688.1
Coding sequence: 126..4439



CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCACC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAC GGGGAGCTCT CAATGGAAGA CGTGTGCTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 
GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 
TCCCACTCCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400
ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460
CTCACCGCAC TCGTCCCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 
CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 
GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700
ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 384 Protein sequence 



1 11 21 31 41 51 

| | | | | |

MKDIDIGKEY IIPSPGYRSV EERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDKAGLFSCM TFSWLSSLAR 120

VAHKKGELSM EDWSLSKHE SSDVHCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ IAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300

PWAILGMIY KVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480

NKPASPHIKI EMKNATLAWD SSHSSIQKSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL KNLMGDYATI 780

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWKL SYKIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLKRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPFAYAG LAISYAVQLT 1140

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQS GEVTFENAEM 1200

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGKAL FRLVELSGGC IKIEGVRISD 1260

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAKFAAAE NKVAVKG 
AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT
CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 
GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG



AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 
GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 
CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 
GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 
CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC
CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 
GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 
GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 

Seq ID NO: 386 Protein sequence



MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 



I I I I I 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC C 
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGG1GCCAC GGGCGGCAGA 
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 
TTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 
GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 
ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 
TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 
GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 



MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 
FLPVFLAQAP SGQRR 




I 1 

C GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 
GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 
ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 
CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG
CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG
ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG
ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT
T GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 



A GCGCCAT CAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 
TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 
T ACGGAT CACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 
C TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 



TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 
GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 

CAATTGTTAG ATGCC

Protein Accession #: NP_005553.1 



MPALWLGCCL CPSLLLPAAR ATSRREVCDC NGKSRQCIFD EELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180

PEGCTQCPCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQSKGSPAKL QKSQRHQDVF 240

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHKGFSC SVMPETEEW 480

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQOJ HNVDPSASGK CDRLTGRCLK 540

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKFGFGGP 600 

NCEHGAFSCP ACYKQVKIQM DQFMQQLQRM EALISKAQGG DGVVPDTELE GRMQQAEQAL 660

QDILRDAQIS EGASRSLGLQ LAKVRSQEHS YQSRLDDLKM IVERVRALGS QYQMRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESKVE SASNMEQLTR 780 
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TRSATQAEIE 840

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMO EFKRTQKNLG 900 

NWKBEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQHVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

Seq ID NO: 



Nucleic Acid Accession #: 



1 11 21 31 41 51 

| | | | | |

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 
CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 
GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 
ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 
GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 
GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 
CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 
TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 
CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 
CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 
T CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 



TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT G 

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 

GAAAGACTAC GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 900 

GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 

TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 

CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 

CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC 1380

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCCT GCCCTTTTCA TATACTTATT 1440 

TTArTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 

TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 

AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 

GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680

TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860 

TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 

GATAATCTGG TCACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 

AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATCCCT ATGTAGTGTC 2100 

TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160 

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280 

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGT3TTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 



.,„>„„ r TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 
TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360
TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT T 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 392 Protein sequence
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Coding sequences 352.. 2820 

1 11 21 31 41 51 

| | | | | |

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 
TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 
CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 
CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 
ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 
CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 
TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 
GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 
TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 
GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 
GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 
AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720
CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 
ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 
GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 
ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGCC 2340 

ATGGTCTACC TGGCCTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 



| | | | | |

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP
NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI
NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES
SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW D
NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT F
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN P
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420
DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480
LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540
KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600
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VI KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 
2 HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM
PPESIMYRKF TTESDVWSLG VVLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ
W QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG



G CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAG3 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180

CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 

CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 

TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960 

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

T TCCATTCACT GTGAAAGGCA ACCCCAARCC AGCGCTTCAG TGGTTCTATA 1440 

ITACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

IGGATGC CAGTGACAAT GCACGCAAGG 2280

3 TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

C CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 



GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240

TTCTTTTTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 



CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 



Protein Accession #: AAL67965.1






NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKDFSWF GFGKVKSRQG 480

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540 
KPDTFVQHIK RHNIVLKREL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 
FHREAELLTN LQHEHIVKFY GVCVEGDPLI M 

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLIA 



I 

T CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60

C CTTCTGTGCC TCCCGCTTCT 120 

T CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240 

GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360

GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660

CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840

AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900 

CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080

TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200 

TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260



Protein Accession # 



MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKVVAM
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 



Seq ID NO: 399 DNA sequence

Nucleic Acid Accession #: NM_001898.1

Coding sequence: 57..482

1 11 21 31 41 51 

| | | | | |

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 
GCCCCAAGGA GGAGGATAGG ATAATCCCGG GIGGCATCTA TAACGCAGAC CTCAATGATG 
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCSG CCCAACTTGG 
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA
GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACAC3V CCCCCCCACC TCCTGCAATT 
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EIYEVPWENR RSLVKSRCQE S 

Seq ID NO: 401 DNA sequence 
Nucleic Acid Accession #: NM_003976.2 
Coding sequence: 299..961

1 11 21 31 41 51 

| | | | | |

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420

GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 

CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 

CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380 
CCTGTACTCA CTCATGGGAG CTGGCCCC 



Seq I 



MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



I I I I I 

TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 



GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 
TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 
CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 
CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 
CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCOG AAAGGTGGGG



CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 
CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA
GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 
TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 
AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 
AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT
CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 
GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 
CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 
CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 

GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC
CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 
CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 
ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG
GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC T 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG G

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620
CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 
CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 
GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 404 Protein sequence 






MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG



NM_057160.1 



| | | | | |

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGGA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180

GCGCCCGGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 

TCACTCATGG GAGCTGGCCC C



Seq ID NO: 406 Protein sequence 



MPGLISARGQ PLLEVLPPQA HLGALFLPKA PLGLSAQPAL WPTLAALALL S 
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG
S RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 
V SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG



| | | | | |

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140
CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 408 Protein sequence



Protein Accession # 



MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG S 
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 
ALRPPPGSRP VSQPCCRPTR Y 
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1 11 21 31 41 51 

| | | | | |

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60
GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120
GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 
CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 
GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 
GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 
TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480
CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 
GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 
TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380
GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGOGTCCA TGTCCCTGAG 1440 
GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 
TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 
TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 
GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 
AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280
TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340
CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 
AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460
CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 
AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 
CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 
GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 
CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820
GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 
TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 
AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 
TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 
ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240
GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300
TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 
CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 
CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660 
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 
CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900 
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 
ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080
CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 
GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380
GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 




CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 

GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800 

TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860 

GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920

GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980 

TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040 

TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100 

GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160

TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220 

AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280

GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340 

AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400 

GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460

GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520 

CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580 

CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640 

GCCCCAGTGC TTGGCGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700 

AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760 
ATGTAT 



Protein Accession #: BAB84587.1

21 31 41 51 

I I I I 

h VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 

E SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL

FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL
GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL
FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL
YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV
FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL
YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE
VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C 

Seq ID NO: 411 DNA sequence 
Nucleic Acid Accession #: XM_098151



1 11 21 31 41 51 

| | | | | |

ATGATGCATT TGCTCAATTC TCAGGGCTGG AATGAGCCGG CTGGTCCCCC AGAAAGCTGG 
AGTGGGGTAC AGAGTTCAGT TTTCCTCTCT GTTTACAGCT CCTTGACAGT CCCACGCCCA 

TCTGGAGTGG GAGCTGGGAG TCAGTGTTGG AGAAGAAACA ACAAAAGCCA ATTAGAACCA

CTATTTTTAA AAAGTGCTTA CTGTGCACAG ATACTCTTCA AGCACTGGAC GTGGATTCTC 
TCTCTAGCCC TCAGCACCCC TGCGGTAGGA GTGCCGCCTC TACCCACTTG TGATGGGGTA
CAGAGGCACT TGCTCTTCTG CATGGTGTTC AATAGGCTGG GAGTTTTATT TATCTCTTCA
AACTTTGTAC AAGAGCTCAT GGCTTGTCTT GGGCTTTCGT CATTAAACCA AAGGAAATGG 

AAGCCATTCC CCTGTTGCTC TCCTTAG



MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 
LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 
L GLSSLNQRKW KPFPCCSP 



I I I I I 

„ G CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 
TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG G

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280
ATCAATAAAA TGTGATTTTT CTGA 

Seq ID NO: 414 Protein sequence 
Protein Accession NP_002649.1 

i i 1 r i 1 i 1 i 1 

MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420



Nucleic Acid Accession #: NM_024422.1
Coding sequence: 202..2907


1 11 21 31 41 51 

I I I I I I 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 
TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760 

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820 

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940 

TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300 

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360 

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT 



I I I I I 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL

KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ

EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV
KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM
KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI
KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD
EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS
VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG
TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG
QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL
CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTLAEACMK



1 11 21 31 41 51 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680



CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 
AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAG AATCCATTAG AGGACACACT CTGATTAAAA ATTAAACAAT GAAAGAAAGT 2760 

GTATCTGTGT AATCAAGATG AAAATCACAA GCATGCCCAA GACTATGTCC TGACATATAA 2820 

CTATGAAGGA AGAGGATCGG TGGCTGGGTC TGTAGGTTGT TGCAGTGAAC GACAAGAAGA 2880 

AGATGGGCTT GAATTTTTGG ATAATTTGGA GCCCAAATTT AGGACACTAG CAGAAGCATG 2940 

CATGAAGAGA TGAGTGTGTT CTAATAAGTC TCTGAAAGCC AGTGGCTTTA TGACTTTTAA 3000

AAAAAATTAC AAACCAAGAA TTTTTTAAAG CAGAAGATGC TATTTGTGGG GGTTTTTCTC 3060 

TCATTATTTG GATGGAATCT CTTTGGTCAA ATGCACATTT ACAGAGAGAC ACTATAAACA 3120 

AGTACACAAA TTTTTCAATT TTTACATATT TTTAAATTAC TTATCTTCTA TCCAAGGAGG 3180 

TCTACAGAGA AATTAAAGTC TGCCTTATTT GTTACATTTG GGTATAATGA CAACAGCCAA 3240 

TTTATAGTGC AATAAAATGT AATTAATTCA AGTCCTTATT ATAGACTATT TGAAGCACAA 3300 

CCTAATGGAA AATTGTAGAG ACCTTGCTTT AACATTATCT CCAGTTAATT AAGTGTTCAT 3360 

GTGGTGCTTG GAAACTGTTG TTTTCCTGAA CATCTAAAGT GTGTAGACTG CATTCTTGCT 3420 

ATTATTTTAT TCTTGTAATG TGACCTTTTC ACTGTGCAAA GGGAGATTTC TAGCCAGGCA 3480 
I I I I I I

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 
ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL
KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 
EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY 
PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 
TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 
VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV
KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM
KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 
KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 
EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS
VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG
TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 
QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEESIR
I I I I I I

ACTCTGGACT CCGGATGGCT GCCGCACGCC TCTGCCTCTC CCTGCTGCTC CTGTCCACCT 60 

GCGTGGCTCT GTTACTACAG CCACTGCTGG GTGCCCAGGG AGCCCCACTG GAGCCAGTGT 120 

ACCCAGGGGA CAATGCCACA CCAGAGCAGA TGGCCCAGTA TGCAGCTGAT CTCCGTAGAT 180 

ACATCAACAT GCTGACCAGG CCTAGGTATG GGAAAAGACA CAAAGAGGAC ACGCTGGCCT 240 

TCTCGGAGTG GGGGTCCCCG CATGCTGCTG TCCCCAGGGA GCTCAGCCCG CTGGACTTAT 300

AATGCCACCT TCTGTCTCCT ACGACTCCAT GAGCAGCGCC AGCCCAGCTC TCCCCTCTGC 360

ACCCTTGGCT CTGGCCAAAG CTTGCTCCCT GCTCCCACAC AGGCTCAATA AAGCAAGTCA 420



TRPRYGKRHK 

Seq ID NO: c 
Nucleic Acid 



TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA 






u»j™»i i=ftis««™_A±A ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 
GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC
CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 
CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 
CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 
CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 
CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC
CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 
ACTCCTGCAC GGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGC-TCCC TGGTCCCTTC 
CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 
TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 



it FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE
VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR
SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL
LLPCALLHGL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL



AGGGAGAGAG GGAGGGGAGG AAATTGGAGA CCCCAGCACC CCCTTGCTCA CTC

CACAGTCCAC GATGGCCCGG TCCCTGGTGT GCCTTGGTGT CATCATCTTG CTGTCTGCCT 
TCTCCGGACC TGGTGTCAGG GGTGGTCCTA TGCCCAAGCT GGCTGACCGG AAGCTGTGTG 
CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTCAGGAC TACATGGCCC 
CCGACTGCCG ATTCCTGACC ATTCACCGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 

AGGGCCGTGG GCGGCTCTTC TGGGGAGGCA GCGTTCAGGG AGATTACTAT GGAGATCTGG
CTGCTCGCCT GGGCTATTTC CCCAGTAGCA TTGTCCGAGA GGACCAGACC CTGAAACCTG
GCAAAGTCGA TGTGAAGACA GACAAATGGG ATTTCTACTG CCAGTGAGCT CAGCCTACCG 
CTGGCCCTGC CGTTTCCCCT CCTTGGGTTT ATGCAAATAC AATCAGCCCA GTGCAAAC 

Seq ID NO: 424 Protein sequence



Protein Accession # 



MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR 
FLTIHRGQVV YVFSKLKGRG RLFWGGSVQG DYYGDLAARL GYFPSSIVRE DQTLKPGKVD
VKTDKWDFYC Q 
AGATGACACA AGTCACAGAA AAGTCCACAG AACACCCAGA AAAGACCACG TCAACCACAG 60

AGAAAACCAC AAGAACCCCA GAAAAGCCTA CGCTATACTC AGAGAAGACC ATATGCACCA 120 

AAGGGAAAAA CACACCAGTC CCAGAAAAGC CTACAGAAAA CCTGGGGAAC ACCACACTGA 180 

CCACTGAGAC CATAAAAGCC CCAGTAAAGT CCACAGAAAA CCCAGAAAAA ACAGCAGCAG 240 

TCACAAAGAC TATAAAACCT TCAGTCAAGG TCACAGGAGA CAAATCTCTC ACTACTACCT 300 

CTTCTCATCT AAATAAAACT GAAGTTACTC ATCAGGTGCC CACTGGTTCT TTCACCCTCA 360 

TTACATCTAG AACGAAGCTG AGTTCTATCA CATCAGAAGC CACAGGAAAC GAGAGCCATC 420

CATACCTCAA TAAAGATGGC TCACAGAAAG GTATCCACGC TGGACAGATG GGAGAGAATG 480

ATTCATTCCC TGCATGGGCC ATAGTTATTG TGGTCCTGGT GGCTGTGATT CTCCTCCTGG 540 

TGTTCCTTGG CCTGATCTTC TTGGTCTCCT ATATGATGCG GACACGCCGC ACACTAACCC 600 

AGAACACCCA GTACAATGAT GCAGAGGATG AGGGTGGCCC CAATTCCTAC CCGGTCTACC 660 

TGATGGAGCA GCAGAATCTT GGCATGGGCC AGATCCCTTC CCCACGGTGA TCTTGGAGTA 720 

GGCGCCCAGC CCTGGCTCTT CCATGCTCTG CCCCTTTCCT GGATGAGGAA CCGGACTCAC 780 

AATTTCTATT TCCGGGACTA CAGGAAGGGC AGAGAATACT GACGGTTACC AGTATTAACC 840 

CTTCATCTGT TCTTGAAACT GGTTGGGGAA TGAGGTGATA AGCAAGGAGG GTGTAAGTTT 900 

AGGGGACAAA GAAGAAAGAA TGAATAATAC GAGCAGACAT TCTCTGTAGA AGGTAATGGT 960 

CTGAGAATGA AAAGGTGTTT GATGGACATG TTGTGGGGGC ACCAATGCAG AACACTGCAC 1020 

TGAGTCCTAA AGGAAGGACA GGAGCCTTAT AGGCAATGCC CCAGACTGAC TTGTGAGTGG 1080 

GGTTTATGGG GAAAGGGAGG GACTGAGGGC AGAGTCTCTG GGTTTCAGGA CAGCATTATG 1140 

TTATTTCCAT TCACTATTAC TTAAGAGTTT GTGTGTAAAC AGGCTCATCT CTGAGTTCTC 1200 

AGGACCCTTG CCCCCACCCC CATTTTTTTA ATGAAAAAAA AAAACAAAAA AAACGGATCC 1260 

AAGAAGAAAA GAGAATTTAT TTCCTTCTCC ACTCTCTCCA TGCCCTGGAG AAAAAAAAGT 1320 

CCAGAAGAAA TCATAAATAT CTCTCATCTA CATGGTTGCT TCCTCTTCCT CCCAAATCCC 1380 

TTAGTTTTCC TAAATGTCTA CAGTGGACGC CCTGTTGGTT TGGCTTGCTG GGTTGTGGGT 1440 

GGACACGCAA GGAGGGGATT TTTATTTGGC CAGCAGTCTC ACCCACTGAT CTCCACCCCA 1500 

GACCTTCCCT GATTGGTGTC TCAGCATTTA TTTTCCTGTC TCTTCCACCA AAAGCCAGCT 1560 

GTAGCTTTAT CTCGTAAAAG TTACCCATCT TCTCTACTGT CCCCATTCTC TCTCCTCCCA 1620 

CCTTCACCCC AGATTCAAGT TTTCCTCCTT GTAGGCATTT CATCTGTGTG TGTTTTCTGG 1680 

ATTTTCTCTC TCTCTTCTTA TGGCCATTTC ACCTTATTAC TGATTGGGTA GAGGGGGAAA 1740 

AGGAGAATGA TGATGATAGT TTCCTTCTGT CTATTGACCT TTTTTATAAT AAAGTATAAC 1800 

Seq ID NO: 426 Protein sequence 



MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 
TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI
TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV
FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR



,„,.-„™.3 TGCTGGTGCT GCTCCTGGGC CTGCAGGCCT 

CCCCAGAAGG ACTCTGTCTC AGACTGGGCC ATTGTGTTGA TCACTCTCAC TTTGGTGGCA 
GCAATTGTCA GCCTAATGTA CGGTATCAAG AAGGCCTGCC AGTTCCGGAG GGAGATGAGT
CTGGGGTGTG GCTGTGGCTC TGTGACCCCT TACAGCAGCC ACCATGAGGG GGAGGCTGCC 240 
AGCCAGCGCT ACTCTTGTCA AATGAAAGCT TCTTGGGGGG CAGGTGCTAC TACATTCCAA 300 
GAATATCAGA AAACTGGGGA ACTCTCAACA TCCGATCACA TATTTCCCCT CACTCCAGGC 360 
CTTGTTTATA GTATCCCTTT TGATCACATT GTTCTGCATT CAGGACAAAG ACCTCCAGAG 420 
CTCCCTAAAT CTACAGAAAT CCATGAGCAA AAACGCCACT GCAACACCAC ACGCCATTCT 480 
AAGCCAACTG ACAAGCCTAC AGGCAACTCC AAAACTATAG ACCACAAAAG CTCTACAGAT 540 
AATCATGAGG CTCCTCCCAC TTCTGAAGAA AACTCCAGCA ACCAAGGGAA AGACCCAATG 600 
ATCCGGAACC AGCGCTCTGT TGATCCTGCT GACTCCACTA CCACACATAA AGAATCCGCT 660
GGAAAAAAAC ATATAACGCC AGCACCCAAG AGCAAAATAA ACTGTCGTAA GTCCACAACA 720
GGCAAATCAA CGGTAACAAG AAAATCAGAT AAAACTGGAA GACCTTTGGA AAAGTCCATG 780 
AGTACTTTGG ATAAGACAAG TACCAGCTCA CATAAGACTA CAACTTCCTT CCACAACTCA 840 
GGCAATTCAC AGACCAAGCA AAAAAGCACA TCTTTTCCAG AAAAAATCAC AGCAGCCTCA 900 
AAAACAACAT ACAAGACCAC AGGAACCCCA GAAGAGTCAG AAAAAACTGA AGATTCCAGA 960 
ACAACAGTTG CCTCAGACAA GCTCCTGACA AAAACTACAA AAAACATACA AGAGACCATA 1020 
TCAGCCAATG AGCTCACACA ATCTCTAGCA GAGCCTACAG AACATGGAGG AAGGACAGCC 1080 
AATGAGAACA ACACACCATC CCCAGCAGAG CCTACAGAAA ATAGAGAAAG GACAGCCAAT 1140 
GAGAACACCA CACTATCCCC AGCAGAGCCT ACAGAAAATA GAGAAAGGAC AGCCAATGAG 1200 
AACACCGCAC CATTCCCAGC AGGGCCTACA GAAAATAGAG AAATGACAGC CAATGAGAAT 1260 
ACCACACTAT TCCCAGCAGA GCCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAACACC 1320 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CAGCCAATGA GAACACTACA 1380 
CCATCCCCAG CAGAGCCTAC AGAACATGGA GAAAGGACCC CATTTGCCAA TGACAAAACC 1440 
ACATCATCCT CAGCAGAGTC TACAGAACAT GGAGAAAGGA CCCCACTGGC CAACGAGAAC 1500 
ACCACACCAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAACACC 1560 
ACACCATCCC CAGCAGGGCC TACAGAAAAC AGAGAAACGA CAGCCAACGA GAAGACCACA 1620 
CTATCCCCAG TAGAGCCTAC AGAAAATAGA GAAACAACAG CCAATGAGAA GACCACACCA 1680
TCCCCAGCAG AGCCTACAGA AAATGGACAA AGGACCCCAT TTGCCAATGA GAAAACCACA 1740 
TCATCCTCAG CAGAGCCTAC AGAACACGGA GAAAGGACCC CACTGGCCAA TGAGAACACC 1800 
ACACCATCCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA 1860 
CCATCCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CTTTGGCCAA TGAGAAGACC 1920
ACGCCATCTC TAGCAGAGCC TACAGAAAAT GGACAAAGGA CCCCATTTGC CAATGAGAAG 1980 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGAAGAAA GGACTCCACT GGCCAATGAG 2040 
AACACCACAC CATCCCCGGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAC 2100
ACCACACCAT CCCCAGCAGG GCCTACAGAA AATAGAGAAA TGACAGCCAA CGAGAAGACC 2160 
ACACTATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA 2220 
TCATCCCCAG CAGAGCCTAC AGAAAATGGA CAAAGGACCC CATTTGCCAA TGAGAAAACC 2280 
ACATCATCCC CAGCAGAGCC TACAGAACAC GGAGAAAGGA CCCCACTGGC CAATGAGAAC 2340 
ACCACACTAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAAGACC 2400 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAACACCACA 2460 
CCATCCCCAG CACAGCCTAC AGAAAATGGA GACAGGACTC CATTGGCCAA TGAGAAGACC 2520 
ACACCATCTC TAGCAGAGCC TACAGAAAAT GGAAAAAGGA CCCCATTTGC CAATGAGAAG 2580 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGCAGAAA GGACTCCACT GGCCAATGAG 2640 
AACACCACAT CATCCCCAGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAG 2700 
ACCACACAAT TCCCAGCAGA GCCTACAGAA AATAGAGAAA GCACAGCCAA TGAGAAGACC 2760 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAATGGA CAGCCAATGA GAACACCACA 2820
CTATCCCCAG CAGAGCCTAC AGAACATGAA GAAATGACCC CATTGGCCAA TGAGAAGACC 2880 
CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCATTTAC CAATGAGAAG 2940 
CCTCAGCAGA GCCTACAGAA CATGGAGAAA GGACCCCACT GGCCAATGAG 3000 
CATCCCGAGC AGAGCCTACA GAACATGGAG AAAGGATAGC CAATGAGAAG 3060 
GCCACACCAT CCCCAGCAAA GCCTACAGAA CATGGAGAAA CGACAGTCAA TGAGGACACC 3120 
ACACCATCCT CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCACTGGC CAATGAGAAC 3180 
ACCACAACAT CCCCAACAGA GTCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAAGACC 3240 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CACCATCAGC CAATGAGAAG 3300 
ACCATACCAT CTCCAGCAAA GCCTACAGAA CACGAAGAAA TGACCCCATC GGCCAATGAG 3360 
AACACCACAC CATCCCCAGT AAAGCCTACA GAACATGGAG AAAAGACTAC ATTGGCCAAT 3420 
GAGAAGATCA CACTATCCCC AGAAGGGCCT ACAGAACATG GAGCAAAAAC TACGTCGGCC 3480 
AATGAGAAGA TCACACCATC CCTAGCAAAG CCTACAGAAC ATGGAGAAAG GACCACATCA 3540
CCCAATGACA AGATCACCTC ATCTGCAGCA GAGTCTACAG AACATAGAGA TAGGGCTACA 3600
TCAGCCAATG TGATCACACC AGCCCCAGCA GAGCCTATAA AACATGCAAA AAGGACCACA 3660 
TTGGCCCATG AGAAGATGAC ACAAGTCACA GAAAAGTCCA CAGAACACCC AGAAAAGACC 3720
ACGTCAACCA CAGAGAAAAC CACAAGAACC CCAGAAAAGC CTACGCTATA CTCAGAGAAG 3780 
ACCATATGCA CCAAAGGGAA AAACACACCA GTCCCAGAAA AGCCTACAGA AAACCTGGGG 3840 
AACACCACAC TGACCACTGA GACCATAAAA GCCCCAGTAA AGTCCACAGA AAACCCAGAA 3900 
AAAACAGCAG CAGTCACAAA GACTATAAAA CCTTCAGTCA AGGTCACAGG AGACAAATCT 3960
CTCACTACTA CCTCTTCTCA TCTAAATAAA ACTGAAGTTA CTCATCAGGT GCCCACTGGT 4020
TCTTTCACCC TCATTACATC TAGAACGAAG CTGAGTTCTA TCACATCAGA AGCCACAGGA 4080
AACGAGAGCC ATCCATACCT CAATAAAGAT GGCTCACAGA AAGGTATCCA CGCTGGACAG 4140 
ATGGGAGAGA ATGATTCATT CCCTGCATGG GCCATAGTTA TTGTGGTCCT GGTGGCTGTG 4200 
ATTCTCCTCC TGGTGTTCCT TGGCCTGATC TTCTTGGTCT CCTATATGAT GCGGACACGC 4260 
CGCACACTAA CCCAGAACAC CCAGTACAAT GATGCAGAGG ATGAGGGTGG CCCCAATTCC 4320 
TACCCGGTCT ACCTGATGGA GCAGCAGAAT CTTGGCATGG GCCAGATCCC TTCCCCACGG 4380 
TGA 
1 11 21 31 41 51

I I I I I 

„^„S LQALAGPSPK PQKDSVSDWA IVLITLTLVA AIVSLMYGIK KACQFRREMS 60 

LGCGCGSVTP YSSHHEGEAA SQRYSCQMKA SWGAGATTFQ EYQKTGELST SDHIFPLTPG 120

LVYSIPFDHI VLHSGQRPPE LPKSTEIHEQ KRHCNTTRHS KPTDKPTGNS KTIDHKSSTD 180 

NHEAPPTSEE NSSNQGKDPM IRNQRSVDPA DSTTTHKESA GKKHITPAPK SKINCRKSTT 240 

GKSTVTRKSD KTGRPLEKSM STLDKTSTSS HKTTTSFHNS GNSQTKQKST SFPEKITAAS 300

KTTYKTTGTP EESEKTEDSR TTVASDKLLT KTTKNIQETI SANELTQSLA EPTEHGGRTA 360 

NENNTPSPAE PTENRERTAN ENTTLSPAEP TENRERTANE NTAPFPAGPT ENREMTANEN 420 

TTLFPAEPTE HGERTANENT TPSPAEPTEH GERTANENTT PSPAEPTEHG ERTPFANDKT 480 

TSSSAESTEH GERTPLANEN TTPSPAEPTE NRERTANENT TPSPAGPTEN RETTANEKTT 540 

LSPVEPTENR ETTANEKTTP SPAEPTENGQ RTPFANEKTT SSSAEPTEHG ERTPLANENT 600

TPSPAEPTEN RERTANEKTT PSPAEPTENG DRTPLANEKT TPSLAEPTEN GQRTPFANEK 660

TTSSSAEPTE HEERTPLANE NTTPSPAEPT ENRERTANEN TTPSPAGPTE NREMTANEKT 720 

TLFPAEPTEN RERTANEKTT SSPAEPTENG QRTPFANEKT TSSPAEPTEH GERTPLANEN 780 

TTLSPAEPTE NRERTANEKT TPFPAEPTEN RERTANENTT PSPAQPTENG DRTPLANEKT 840 

TPSLAEPTEN GKRTPFANEK TTSSSAEPTE HAERTPLANE NTTSSPAEPT ENRERTANEK 900

TTQFPAEPTE NRESTANEKT TPFPAEPTEN REWTANENTT LSPAEPTEHE EMTPLANEKT 960

TLSPAEPTEN GERTPFTNEK TTPSSAEPTE HGERTPLANE ITTPSRAEPT EHGERIANEK 1020 

ATPSPAKPTE HGETTVNEDT TPSSAEPTEN GERTPLANEN TTTSPTESTE HGERTANEKT 1080 

TPSPAEPTEH GERTPSANEK TIPSPAKPTE HEEMTPSANE NTTPSPVKPT EHGEKTTLAN 1140 

EKITLSPEGP TEHGAKTTSA NEKITPSLAK PTEHGERTTS PNDKITSSAA ESTEHRDRAT 1200 

SANVITPAPA EPIKHAKRTT LAHEKMTQVT EKSTEHPEKT TSTTEKTTRT PEKPTLYSEK 1260 

TICTKGKNTP VPEKPTENLG NTTLTTETIK APVKSTENPE KTAAVTKTIK PSVKVTGDKS 1320 

LTTTSSHLNK TEVTHQVPTG SFTLITSRTK LSSITSEATG NESHPYLNKD GSQKGIHAGQ 1380

MGENDSFPAW AIVIVVLVAV ILLLVFLGLI FLVSYMMRTR RTLTQNTQYN DAEDEGGPNS 1440
YPVYLMEQQN LGMGQIPSPR 



ATGTGGCCTC GCCTGGCCTT TTGTTGCTGG GGTCTGGCGC TCGTTTCGGG CTGGGCGACC 60 

TTTCAGCAGA TGTCCCCGTC GCGCAATTTC AGCTTCCGCC TCTTCCCCGA GACCGCGCCC 120 

GGGGCCCCCG GGAGTATCCC CGCGCCGCCC GCTCCTGGCG ACGAAGCGGC GGGGAGCAGA 180 

GTGGAGCGGC TGGGCCAGGC GTTCCGGCGA GGCGTGCGGC TGCTGCGGGA GCTCAGCGAG 240 

rG AG TTGTCTTCCT GGTGGATGAT TCGTCCAGCG TGGGCGAAGT CAACTTCCGC 300 

AGCGAGCTCA TGTTCGTCCG CAAGCTGCTG TCCGACTTCC CCGTGGTGCC CACGGCCACG 360 

CGCGTGGCCA TCGTGACCTT CTCGTCCAAG AACTACGTGG TGCCGCGCGT CGATTACATC 420

TCCACCCGCC GCGCGCGCCA GCACAAGTGC GCGCTGCTCC TCCAAGAGAT CCCTGCCATC 480 

TCCTACCGAG GTGGCGGCAC CTACACCAAG GGCGCCTTCC AGCAAGCCGC GCAAATTCTT 540 

CTTCATGCTA GAGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 600 

GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGCGAGATT CAGGAGTGGA GATCTTCACT 660 

TTTGGCATAT GGCAAGGGAA CATTCGAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG 720 

GAGCACTGTT ACCTGCTACA CAGTTTTGAA GAATTTGAGG CTTTAGCTCG CCGGGCATTG 780 
CATGAAGATC TACCTTCTGG GAGTTTTATT CAAGATGATA TGGTCCACTG CTCATATCTT 840 
TGTGATGAAG GCAAGGACTG CTGTGACCGA ATGGGAAGCT GCAAATGTGG GACACACACA 900
GGCCATTTTG AGTGCATCTG TGAAAAGGGG TATTACGGGA AAGGTCTGCA GTATGAATGC 960 

ACAGCTTGCC CATCGGGGAC ATACAAACCT GAAGGCTCAC CAGGAGGAAT CAGCAGTTGC 1020 

ATTCCATGTC CTGATGAAAA TCACACCTCT CCACCTGGAA GCACATCCCC TGAAGACTGT 1080 

GTCTGCAGAG AGGGATACAG GGCATCTGGC CAGACCTGTG AACTTGTCCA CTGCCCTGCC 1140 

CTGAAGCCTC CCGAAAATGG TTACTTTATC CAAAACACTT GCAACAACCA CTTCAATGCA 1200 

GCCTGTGGGG TCCGATGTCA CCCTGGATTT GATCTTGTGG GAAGCAGCAT CATCTTATGT 1260

CTACCCAATG GTTTGTGGTC CGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT 1320 

CTCCGCCAGC CGAAACATGG CCACATCAGC TGTTCTACAA GGGAAATGTT ATATAAGACA 1380 

ACATGTTTGG TTGCCTGTGA TGAAGGGTAC AGACTAGAAG GCAGTGATAA GCTTACTTGT 1440 

CAAGGAAACA GCCAGTGGGA TGGGCCAGAA CCCCGGTGTG TGGAGCGCCA CTGTTCCACC 1500 

TTTCAGATGC CCAAAGATGT CATCATATCC CCCCACAACT GTGGCAAGCA GCCAGCCAAA 1560

TTTGGGACGA TCTGCTATGT AAGTTGCCGC CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 1620 

ATGCTGAGAT GTACCACTTC TGGAAAATGG AATGTCGGAG TTCAGGCAGC TGTGTGTAAA 1680 

GACGTGGAGG CTCCTCAAAT CAACTGTCCT AAGGACATAG AGGCTAAGAC TCTGGAACAG 1740 

CAAGATTCTG CCAATGTTAC CTGGCAGATT CCAACAGCTA AAGACAACTC TGGTGAAAAG 1800 

GTGTCAGTCC ACGTTCATCC AGCTTTCACC CCACCTTACC TTTTCCCAAT TGGAGATGTT 1860 

GCTATCGTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT TTTCCATATC 1920 

AAGGTTATTG ATGCAGAACC ACCTGTCATA GACTGGTGCA GATCTCCACC TCCCGTCCAG 1980

GTCTCGGAGA AGGTACATGC CGCAAGCTGG GATGAGCCTC AGTTCTCAGA CAACTCAGGG 2040 

GCTGAATTGG TCATTACCAG AAGTCATACA CAAGGAGACC TTTTCCCTCA AGGGGAGACT 2100 

ATAGTACAGT ATACAGCCAC TGACCCCTCA GGCAATAACA GGACATGTGA TATCCATATT 2160 

GTCATAAAAG GTTCTCCCTG TGAAATTCCA TTCACACCTG TAAATGGGGA TTTTATATGC 2220 

ACTCCAGATA ATACTGGAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 2280 

GAAGGGTCTA CTGACAAGTA TTATTGTGCT TATGAAGATG GCGTCTGGAA ACCAACATAT 2340 
ACCACTGAAT GGCCAGACTG TGCCAAAAAA CGTTTTGCAA ACCACGGGTT CAAGTCCTTT 2400 
GAGATGTTCT ACAAAGCAGC TCGTTGTGAT GACACAGATC TGATGAAGAA GTTTTCTGAA 2460 
GCATTTGAGA CGACCCTGGG AAAAATGGTC CCATCATTTT GTAGTGATC-C AGAGGACATT 2520 
GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 
GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 
TACGATGACT TCCTGGACAC TGTGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 
TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 
ACAGCTAGTG TGCCATTACC CGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA 2820 
CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC 2880
CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940 
ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGCG TATGTGTGTC 3000 
AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAC- CTGCCGGATC 3060 
GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120
ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180 
TACTCATACA GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 
TTTGGTTCCC GGAGCTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 
GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 3360
ATGCCCTGTC ACCCATGTCC TCGTGACTAT TACCAACCTA ATGCAGGGAA GGGCTTCTGC 3420
CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT CCAGATCCAT CACAGAATGT 3480 
TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3540 
TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 
CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 
GAAACAGACA TCGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 3720
CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 3780 
AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 3840 
GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA AACAGAAGTC 3900 
AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 
TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCCGAT GTGGAAAGAA CGTCGATGAG 4020
TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG ACGGTGCCAA TAGCTTCAGA 4080 
TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 4140 
TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 4200 
TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTACAGG CTTTAACCTG 4260 
GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320
CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 
ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 4440 
TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500
AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4560
GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620
GGTGCGTTAG TTCTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 
TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 
GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800
CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 4860
TCTGATTGCC CACGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920
AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 4980 
GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTCCTCACTG TGAACGCATT 5040 
AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCCGATGA CTTCTATGCT 5100 
GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160
TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG 5220

TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 5280 
ATATGTTCAT GH3TCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 
TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 
GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460
TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520

AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 
GTGACATATA GGTGTAATAA AGGATATACT CTGGCCGGTG ATAAAGAATC ATCCTGTCTT 5640 
GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 5700 
GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 
TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CACGGCTTCT 5820
GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 
ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 
TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC CGACGGCAAG 6000 
TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 
55 CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 
GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 
CCCCCAGAAG GTCAAGACAT GCCCCGTTGT ATAGCTCATT TCTGTGAAAA ACCTCCATCG 6240 
GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 6300 
AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 6360 
60 GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCG GTGTGGAGAG 6420 
CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 6480 
GCTTACAGCT GCAACAAGGG GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC 6540 
ACAGGGCAGT GGAGTAGTCC TATACCGACG TGCCACCCGG TATCTTGTGG TGAACCACCT 6600
AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 
65 TATCAGTGTA ACCCGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG CCAAGCCAAT 6720 
CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TCGACTGTGG AAAACCTCCC 6780 
CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 6840
TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 
AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCCGCCCCTC 6960 
70 TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 7020 
TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCCCAGCAA 7080 
TGGAATGACT CTTTCCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTCC CCTAATTTCC 7140
TTTGGTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 
GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 
75 AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCCCC AACCTGAGGA AATCCCCAAT 7320 
GGAATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 7380
GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 7440 
GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 
TTCTCTTACA CGGACCTACA CTATGGACAG ACCGTTACCT ACTCTTGCAA CCGAGGCTTT 7560
CGGCTCGAAG GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 7620
CCATCTTGCA ATGCCATCCA CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 7680 
GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 7740 
GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 7800 
ATGCCAATAG ACTGTGGCCT CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 7860 
GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 7920
CACCCTCCTT ATCATTTGGG AGCAGTGGCT AAAACCTGGG AAAATACAAA GGAGTCTCCT 7980 
GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 8040 
TATGAACTTC TGGGGAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 8100 
5 GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 8160 
CGTTTTACAG AGACTAGCAT GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 8220 
CTAGCAGGCT CTGACTTAAG GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 8280
CGCTGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 8340 
AGCAACTACA CATACCTGAG CACGTTGTAC TATGAGTGTG ACCCCGGATA TGTGCTGAAT 8400 
GGCACTGAGA GGAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 8460
ATTCCTGTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 8520 
TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 8580 
AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 8640
GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 8700 
TTCATGAAGG AAGTAACATT CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 8760
CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 8820 
TGTGGACCTC CTGAAGATCT TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 8880 
GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 8940 
TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 9000 
20 ACACCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 9060 
CGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 9120 
GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 9180 
ATGATACCAA ATGCGTTCAT CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 9240 
AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 9300 
25 GTATGGAGCC AGCCTTATCC AGTCTGTGAG CCCTTGTCCT GTGGGTCCCC ACCGTCTGTC 9360 
GCCAATGCAG TGGCAACTGG AGAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 9420 
CTGGAAGGTT ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 9480 
TGGTTCCCTG AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 9540 
CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 9600 
30 GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 9660 
CCACCATTCT CCGATGAATC TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 9720 
CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 9780 
CCTGGCTATG AACTAGAGGG GAACAGGGAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840 
GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 9900 
35 AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 9960 
TACAGTCTTG AAGGGCCATC TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 10020 
GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATGCTCTG 10080 
CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 10140 
CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCCGACG AGACGTGGAC ACAGACAAGC 10200
40 GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 10260 

GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 10320 
GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 10380
AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 10440
TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 10500 
45 CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 10560 

TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGCTGGAAA ATGTGTAAGA 10620 
CCAAACCGAT GTCACTGTCT TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 

Seq ID NO: 430 Protein sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51 

I I I I I 

MWPRLAPCCW GLALVSGWAT FQQMSPSRNF SFRLFPETAP GAPGSIPAPP APGDEAAGSR 60 

55 VERLGQAFRR RVRLLRELSE RLELVFLVDD SSSVGEVNFR SELMFVRKLL SDFPWPTAT 120 

RVAIVTFSSK NYWPRVDY I STRRARQHKC ALLLQEIPAI SYRGGGTYTK GAFQQAAQIL 180 

LHAREHSIKV VFLITDGYSN GGDPRPIAAS LRDSGVEIFT FGIWQGNIRE LNDMASTPKE 240 

EHCYLLHSFE EFEALARRAL HEDLPSGSFI QDDMVHCSYL CDEGKDCCDR MGSCKCGTHT 300 

GHFECICEKG YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 360 

60 VCREGYRASG QTCELVHCPA LKPPENGYFI QNTOTOHFNA ACGVRCKPGF DLVGSSIILC 420 

LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREHLYKT TCLVACDEGY RLEGSDKLTC 480 

QGNSQWDGPE PRCVERHCST FQMPKDVIIS PHNCGKQPAK FGTICYVSCR QGFILSGVKE 540 

MLRCTTSGKW NVGVQAAVCK DVEAPQINCP KDIEAKTLEQ QDSANVTWQI PTAKDNSGEK 600 

VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDAEPPVI DWCRSPPPVQ 660 

65 VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDLFPQGET IVQYTATDPS GNNRTCDIHI 720 

VIKGSPCEIP FTPWGDFIC TPDNTGVNCT LTCLEGYDFT EGSTOKYYCA YEDGVWKPTY 780

TTEWPDCAKK RFANHGFKSF EMFYKAARCD DTDLMKKFSE AFETTLGKMV PSFCSDAEDI 840 

DCRLEENLTK KYCLEYNYDY ENGFAIGPGG WGAANRLDYS YDDFLDTVQE TATSIGNAKS 900 

SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTLEWENQQ RLLQTLETIT NKLKRTLNKD 960 

70 PMYSFQLASE ILIADSNSLE TKKASPFCRP GSVLRGRMCV NCPLGTYYNL EHFTCESCRI 1020 

GSYQDEEGQL ECKLCPSGMY TEYIHSRNIS DCKAQCKQGT YSYSGLETCE SCPLGTYQPK 1080 

FGSRSCLSCP ENTSTVKRGA VMSACGVPC PEGKFSRSGL MPCHPCPRDY YQPNAGKAFC 1140 

LACPFYGTTP FAGSRSITEC STSVLNITIF GGFGHLELLN CPSEVFHECF FKPCHNSGTC 1200 

QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCIjNNGVCKD LVGEFICECP SGYTGQRCEE 1260 

75 NINECSSSPC 1MKGICVDGV AGYRCTCVKG FVGLHCETEV NEi 2SN IK MAVCEDQVGG 1320 

FLCKCPPGFL GTRCGKNVDE CLSQPCKNGA TCKDGANSFR CLCAAGFTGS HCELNINECQ 1380 

SNPCRNQATC VDELMSYSCK CQPGFSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSL 1440 

HALTCTFWMK SSDDMNYGTP ISYAVDNGSD KTLL TDYNG WVI IGREK ITNCPSVWDG 1500 

RWHHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KKGSGFSPAE 1560 

80 SFVGSISQLM LWDYVLSPQQ VKSLATSCPE ELSKGNVLAK PDFLSGIVGK VKIDSKSIFC 1620 

SDCPRLGGSV PHLRTASEDL KPGSKVNLFC DPGFQLVGNP VQYCLHQGQN TQPLPHCERI 1680 

SCGVPPPLEN GFHSADDFYA GSTVTYQCNN GYYLLGDSRM FCTDNGSWNG VSPSCLDVDE 1740

CAVGSDCSEH ASCLNVDGSY ICSCVPPYTG DGKNCAEPIK CKAPGNPENG HSSGEIYTVG 1800 

AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSCG KPAIPSNGCI EELAFTFGSK 1860 
VTYRCNKGYT LAGDKESSCL ANSSWSHSPP VCEPVKCSSP ENIKNGKYIL SGLTYLSTAS 1920

YSCDTGYSLQ GPSIIECTAS GIWDRAEPAC HLVFCGEPPA IKDAVITGNN FTFRNTVTYT 1980 

CKEGYTLAGL DTIECLADGK WSRSDQQCLA VSCDEPPIVD HASPETAHRL FGDIAFYYCS 2040 

DGYSLADNSQ LLCNAQGKWV PPEGQDMPRC IAHFCEKPPS VSYSILESVS KAKFAAGSVV 2100

SFKCMEGFVL NTSAKIECMR GGQWNPSPMS IQCIPVRCGE PPSIMNGYAS GSNYSFGAMV 2160 

AYSCNKGFYI KGEKKSTCEA TGQWSSPIPT CHPVSCGEPP KVENGFLEHT TGRIFESEVR 2220 

YQCNPGYKSV GSPVFVCQAN RHWHSESPLM CVPLDCGKPP PIQNGFMKGS NFEVGSKVQF 2280

FCNEGYELVG DSSWTCQKSG KKNKKENPKC MPAKCPEPPL LENQLVLKEL TTEVGVVTFS 2340

CKEGHVLQGP SVLKCLPSQQ WNDSFPVCKI VLCTPPPLIS FGVPIPSSAL HFGSTVKYSC 2400 

VGGFFLRGNS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN GIIDVQGLAY LSTALYTCKP 2460 

GFELVGNTTT LCGENGHWLG GKPTCKAIEC LKPKEILNGK FSYTDLHYGQ TVTYSCNRGF 2520 

RLEGPSALTC LETGDWDVDA PSCNAIHCDS PQPIENGFVE GADYSYGAII IYSCFPGFQV 2580 

AGHAMQTCEE SGWSSSIPTC MPIDCGLPPH IDFGDCTKLK DDQGYFEQSD DMMEVPYVTP 2640 

HPPYHLGAVA KTWENTKESP ATHSSNFLYG TMVSYTCNPG YELLGNPVLI CQEDGTWNGS 2700

APSCISIECD LPTAPENGFL RFTETSMGSA VQYSCKPGHI LAGSDLRLCL ENRKWSGASP 2760 

RCEAISCKKP NPVNNGSIKG SNYTYLSTLY YECDPGYVLN GTERRTCQDD KNWDEDEPIC 2820 

IPVDCSSPPV SANGQVRGDE YTFQKEIEYT CNEGFLLEGA RSRVCLANGS NSGATPDCVP 2880 

VRCATPPQLA NGVTEGLDYG FMKEVTFHCH EGYILHGAPK LTCQSDGNWD AEIPLCKPVN 2940

CGPPEDLAHG FPNGFSFIHG GHIQYQCFPG YKLHGNSSRR CLSNGSWSGS SPSCLPCRCS 3000 

TPVIEYGTVN GTDFDCGKAA RIQCFKGFKL LGLSEITCEA DGQWSSGFPH CEHTSCGSLP 3060 

MIPNAFISET SSWKENVITY SCRSGYVIQG SSDLICTEKG VWSQPYPVCE PLSCGSPPSV 3120 

ANAVATGEAH TYESEVKLRC LEGYTMDTDT DTFTCQKDGR WFPERISCSP KKCPLPENIT 3180 

HILVHGDDFS VNRQVSVSCA EGYTFEGVNI SVCQLDGTWE PPFSDESCSP VSCGKPESPE 3240 

HGFVVGSKYT FESTIIYQCE PGYELEGKRE RVCQENRQWS GGVAICKETR CETPLEFLNG 3300

KADIENRTTG PNVVYSCHRG YSLEGPSEAH CTENGTWSHP VPLCKPNPCP VPFVIPENAL 3360

LSEKEFYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR 3420

GVHYQYGDMI TYSCYSGYML EGFLRSVCLE NGTWTSPPIC RAVCRFPCQN GGICQRPNAC 3480 

CEEPICILPC LNGGRCVAPY QCDCPEGWTG SRCHTAVCQS
ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT

TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC

TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 
AGTAAATGTG AAAGAGGCCG GATATGTTAG 



Seq ID NO: 432 Protein sequence
Protein Accession #: FGENESH predicted



MRFSVSSMRT DYPRSVIAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60

QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FVVKLKVQGV NSQCHSSPIS 120
SKCERGRIC 

Seq ID NO: 433 DNA sequence 
55 Nucleic Acid Accession #: NM_007231.1 
Coding sequence: 89..2017

1 11 21 31 41 51 

TAGGAACAGG GGAGAGTGCA CCTGCTACCA GTCAAGCTCA GCCAGACTGC AAGAGGAGGC 60

GAGGCGGAGC CAGCCGAGGG AGTGAACCAT GGACAAGTTG AAATGCCCGA GTTTCTTCAA 120

GTGCAGGGAG AAGGAGAAAG TGTCGGCTTC ATCAGAGAAT TTCCATGTTG GTGAAAATGA 180

TGAGAATCAG GACCGTGGTA ACTGGTCCAA AAAATCGGAT TATCTTCTAT CTATGATTGG 240 

ATACGCAGTG GGATTAGGAA ATGTGTGGAG ATTTCCATAT CTGACCTACA GCAATGGTGG 300 

65 AGGCGCCTTC TTGATACCTT ATGCAATTAT GTTAGCATTG GCTGGTTTAC CTTTGTTCTT 360 

TCTGGAGTGT TCACTGGGAC AATTTGCTAG CTTAGGTCCA GTTTCAGTTT GGAGGATTCT 420

TCCATTGTTT CAAGGTGTGG GAATTACAAT GGTCCTGATC TCCATTTTTG TGACAATCTA 480 

TTACAATGTC ATAATTGCCT ATAGTCTTTA CTACATGTTT GCTTCTTTTC AAAGTGAACT 540

ACCATGGAAA AATTGTTCTT CGTGGTCAGA TAAAAACTGT AGCAGATCAC CAATAGTAAC 600 

TCACTGTAAT GTGAGTACAG TGAATAAAGG AATACAAGAG ATCATCCAAA TGAATAAAAG 660

CTGGGTAGAC ATCAACAATT TTACCTGCAT CAACGGCAGT GAAATTTATC AGCCAGGGCA 720 

GCTTCCCAGT GAACAATATT GGAATAAAGT GGCGCTCCAA CGGTCAAGTG GAATGAATGA 780 

GACTGGAGTA ATTGTTTGGT ATTTAGCACT TTGTCTTCTT CTGGCTTGGC TCATAGTTGG 840 

AGCAGCACTA TTTAAAGGAA TCAAATCGTC TGGCAAGGTG GTATATTTTA CAGCTCTTTT 900 

75 CCCCTATGTG GTCCTACTCA TCCTGTTAGT ACGAGGTGCA ACTCTGGAGG GTGCTTCAAA 960 

AGGCATTTCA TACTATATTG GAGCCCAGTC AAATTTTACA AAACTTAAGG AAGCTGAGGT 1020 

ATGGAAAGAT GCTGCCACTC AGATATTTTA CTCCCTTTCA GTGGCTTGGG GTGGCTTAGT 1080 

TGCTCTATCA TCTTACAATA AGTTCAAAAA CAACTGCTTC TCTGATGCCA TTGTGGTTTG 1140 

TTTGACAAAC TGTCTCACTA GCGTGTTTGC TGGATTTGCT ATTTTTTCTA TATTGGGACA 1200 

80 CATGGCCCAT ATATCTGGAA AGGAAGTTTC TCAAGTTGTA AAATCAGGTT TTGATTTGGC 1260 

ATTCATTGCC TATCCAGAGG CTCTAGCCCA ACTCCCAGGT GGTCCATTTT GGTCCATATT 1320 

ATTTTTTTTC ATGCTTTTAA CTTTGGGTCT CGATTCTCAG TTTGCTTCGA TTGAAACGAT 1380

CACAACAACA ATTCAAGATT TATTTCCCAA AGTGATGAAG AAAATGAGGG TTCCCATAAC 1440 

TTTGGGCTGC TGCTTGGTTT TGTTTCTCCT TGGTCTCGTC TGTGTGACTC AGGCTGGAAT 1500 
TTACTGGGTT CATCTGATTG ACCACTTCTG TGCTGGATGG GGCATTTTAA TTGCAGCTAT
ACTGGAGCTA GTTGGAATCA TCTGGATTTA TGGAGGGAAC AGATTCATTG AGGATACAGA 
AATGATGATT GGAGCAAAGA GGTGGATATT CTGGCTATGG TGGAGAGCTT GCTGGTTTGT 
AATTACGCCT ATCCTTTTGA TTGCAATATT TATCTGGTCA TTGGTGCAAT TTCATAGACC 
TAATTATGGC GCAATTCCAT ACCCTGACTG GGGAGTTGCT TTAGGCTGGT GTATGATTGT 
TTTCTGCATT ATTTGGATAC CAATTATGGC TATCATAAAA ATAATTGAGG CTAAAGGAAA 
CATCTTTCAA CGCCTTATAA GTTGCTGCAG ACCAGCTTCT AACTGGGGTC CATACCTGGA 
ACAACATCGT GGGGAAAGAT ATAAAGACAT GGTAGATCCT AAAAAAGAGG CTGACCATGA 
AATACCTACT GTTAGTGGCA GCAGAAAACC GGAATGAGAT CTCATTGAAA AAAATATATG 2040 
ATTGTATAAT GTGATTTTTT TTAGAATAGG GGGAACCTTA TTTATTTGTG TGTTAACTGA 2100 
ATAGGAAAAT GTACATACTA TGTTCATGAT AGTGTGATTT TTTTCACATT TAAGCAGGAA 2160 
TGCAATATAA AAATGTGAAT CTCTTAATTC TCAGCCATGT GCTTATTATA TTTCTTTTTA 2220 
GATTGTCTAT CTGTATAACA CACACACACA CACCTAAGAG TCTCTATTTC ACAATTATAT 2280 
TTTTGTAAAT AGTATATGCA TTTTTAATAC ATTGGAGGCT TTATTTTGAA CTAATTTCTT 2340 
AGAGAATAGT TATATTTTCT ATTACACAAG TTTAAAAATA TTATTAACTT GTATTTTCTT 2400 
AATATACAAT CTATCTTTTC CACAAATATG AGTGGGAAAT AAATCAGCAC ATTTGAAAGA 2460
AAGTGTTAAA ACTGAAGGCC TCACTTAATT AGAAACGTGA TAAATATATG GACAAATGGA 2520
CTATACATAC TATAAGAGGA CTGTAGTTTA ATACTTTTTA CCCAAATATG TTTAAAAACA 2580
TCGTGCATTT GTTACAGCTC ATGTTTTCTA TATGAACTTA GTCATTAATG TTCTTTATAA 2640
AAAGTGAAAT AAGATGGAAA AATTAGGATC CTACAGCCAG TACGTGATAA ATCTAGAAAA 2700
TTGAGTTTTG AGTACCTCTT TTCCCATATA CAATCTTCCT TCCTTAGGTA ATTTGGAAGA 2760
AAACTATGAC CCATTTAATT TCTATTGTGT TTCACCAAAT TCAGTGTTGT TCATTATACC 2820
TCTCTGAAAT ATAGGTTTAA TTTCAAATAG AATATGGACT TAAATGTTAA TGAGAAACTG 2880
GCTTTAATCA ATTCTAGCAT TTTATTACTG TAATACAGGG CTGATAGAGT GATTTTGTCT 2940
TATATGAGTC AGTTACTACT TACAGGTGAT AACTTGCATA CTATTGGAAG ATAAAGTTGT 3000 
CAAACTTGTC AAGAATGAGA AAAGCCAAAT TAGAAAATCC TATGTCCTAG TTTCCTTACC 3060 
AAGGATAATT AAATATATCA CTAAGAGCTT TATATATTGA TTATATATTG TTGACAACTG 3120 
GTTTAAGCAT CATAGCCTAT GATGATAAAC ACTGCCTATA TATGTAAATA GCTTTTCATC 3180 
AATTCTTAAA TTTCTTAACC TAGGCTTCAG GGAGCATATG AAACCAAAAT TATATGGAAC 3240 
ATTTTCTGTG TGTACATGTA CATGCATTTT TCTAGGGAGA GAGTCCGTAG GTTTATCAGA 3300 
ATATCAAGGA AAACTGTGAC CCAAAGAAGT TTAAGAATCA CATACAGTGC TGCTGGCTTT 3360
TTGTGCTTGG CAAATGAGTG ACAATAGAAG AAATAATTTT TCTTACACAT TTTAAAACGT 3420 
TTTCTCTTCC TTGTGATTGA AGATGAAAGG AGTAAGAAAT TAAGGCATTT GTTTAATTTA 3480 
TACTGGTAAC TTATTTAGGG GGGAGGGGAC ATGAAGGTAG GTAAATAGGT A 1 
TTGAACCACC TCTCTAAGTT ATGTACGTAT ATATAAGCTG AAATTGTGTT T 
AGGGTTTTCT TTTTCTTTTT CCTTTTTTTT TTTTTTTGGT GC-GGGGCTGG GGGTCAGAGT 3660 
CTTGTTCTGT TGCCTGGGCT GGAGTGCAGT GGCATGATCT CAGCTCACTG CAACCTCTGC 3720 
CTTCTGGATT CAAGTGATTC TCCTGCCTCA GCCTCTTGAG TAGCTGGGAC TACAGGTGCC 3780
CGCCACCACA CCAGCTAATT TTTGTATTTT TAGTAGAGGC GAAGTTTCCC CATGTTGGCC 3840
AGGCTGGTCT TGAACTCCCG ACCTCAAGTG ATCTGTCTAC CTCGGCCTCC TAAAGTGCTG 3900
AGATTACAGG TGTGAGCCAC CGTGCCCGGC CCATTCTAAG GGTTTTCTTT GAAGACAGGT 3960
CAAATGCTGT TAGTAAGTTT CAGGAGATTG TTAATTCCTC AGTTATACCA GATTTTATAA 4020
AATATTTGAG AATAGATGGC TAACAAGAGG TTAGAAATAC TTTTCCTTAA TTTTAATCCA 4080
CAGTATGTTA CATGCATTCT ACCACTACAT TTTGGTGCTA TTTAAGGTGT GCAATTTTCT 4140
ATAGGTGACT TTTGCAATTC AGGGAAGATT TGGGCATATT AAATGAAAGA ATATCTAATT 4200 
GGGGGAGGTG TGAAGGGAAA GAAATTCTTT TCAAAAGCTG ACCACAAAGA GTAGTTAAAA 4260 
GTTTTTGTCA CTATCTTCAC AAGTGTGTAA AGCACAGATT TCAACAGAGT GCTTGGCATA 4320 
TTGTAGGGTG CTCAATGGTG GTTTTTATTA TTATTACTCA GATTCCACAG TGGCAAGAAA 4380 
CATCATTCTA CATAATGGAA AACATTTACA TCAAATCCCA CTTACTTTAA TGCGAACTTG 4440 
GAGATAATTT ATGGTATTGT ATTGTAAACC ATTAATGAAA ACTTTTTCAC AGTTGAGTGA 4500
AATTAAAATC ACTATATCTC 

Seq ID NO: 434 Protein sequence
MDKLKCPSFF KCREKEKVSA SSENFHVGEN DENQDRGNWS KKSDYLLSMI GYAVGLGNVW
RFPYLTYSNG GGAFLIPYAI MLALAGLPLF FLECSLGCFA SLGPVSVWRI LPLFQGVGIT
MVLISIFVTI YYNVIIAYSL YYMFASFQSE LPWKNCSSWS DKNCSRSPIV THCNVSTVNK
GIQEIIQMHK SWVDINNFTC INGSEIYQPG QLPSEQYWNK VALQRSSGMN ETGVIVWYLA
LCLLLAWLIV GAALFKGIKS SGKVVYFTAL FPYVVLLILL VRGAILEGAS KGISYYIGAQ
SNFTKLKEAE VWKDAATQIF YSLSVAWGGL VALSSYNKFK NNCFSDAIVV CLTNCLTSVF
AGFAIFSILG HMAHISGKEV SQVVKSGFDL AFIAYPEALA QLPGGPFWSI LFFFMLLTLG
LDSQFASIET ITTTIQDLFP KVMKKHRVPI TLGCCLVLFL LGLVCVTQAG IYWVHLIDHF 
CAGWGILIAA ILELVGIIWI YGGNRFIEDT EMMIGAKRWI FWLWKRACWF VITPILLIAI 
FIWSLVQFHR PNYGAIPYPD WGVALGWCMI VFCIIWIPIM AIIKIIQAKG NIFQRLISCC 
EQHRGERYKD ' 



Nucleic Acid Accession #: M18728.1
GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC A
CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC
TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 
ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 
GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 
TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 
ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG
TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TA
TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 
CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

A GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

3 TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

A CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

.. T GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 



1 11 21 

I I I 

MGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE V

NRIGYSWYKC ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQMDTGFY 

TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV

NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP

TISPSKANYR PGEHLNLSCH AASNPPAQYS WFINGTFQQS TQELFIPNIT VNNSGSYMCQ 

AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI 

Seq ID NO: 437 DNA sequence 



GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860






CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460

2520 



MDSFSQDVKT 



1 11 21 31 41 51 

I I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 
CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 
TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 
ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 
GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 
TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 
ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 
TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 
TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 
CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 
GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 
GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 
CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG
TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 
GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 
CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA
ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT
GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA
AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC
TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG T 
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC 
AGATCCTTTA GTGCACCCAG T 

CAGAGTTGGA CTTCTAGACT C

CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 
TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 
CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq I 
AAATCCGAGC CTCGCGTGGG CTCCTGGCCC CCGACGGACA CCACCAGGCC C
ACCATGCCGC GCCCGGCCCC CGCGCGCCGC CTCCCGGGAC TCCTCCTGCT GCTCTGGCCG 
CTGCTGCTGC TGCCCTCCGC CGCCCCCGAC CCCGTGGCCC GCCCGGGCTT CCGGAGGCTG 
GAGACCCGAG GTCCCGSGGG CAGCCCTGGA CGCGGCCCCT CTCCTGCGGC TCCCGACGGC 
GCGCCCGCTT CCGGGACCAG CGAGCCTGGC CGCGCCCGCG GTGCAGGTGT TTGCAAGAGC 
AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GCGTACGGCC CCTGGAATTC 

ACGCGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 
TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 






GCTCGAGAGC CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GGCGGCTCGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 

GTGGGCGTGG ACCGGGCAGA CATGGGGTCC CTCAAGATGA TGGCCAGTGA GCCCCTAGAG 780 

GAGCATGTTT TCTACGTGGA GACCTATGGG GTCATTGAGA AACTTTCCTC TAGATTCCAG 840 

GAAACCTTCT GTGCGCTGGA CCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCACGTCTGC 900 

ATCAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAGCC AAGGATACAC CTTGAATGCC 960 

GACAAGAAAA CGTGTTCAGC TCTTGATAGG TGTGCTCTTA ACACCCACGG ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAGAAG TGGCTCTTAT CATTGTGAGT GCTATGAAGG TTATACCTTG 1080 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CCATGGGTGT 1140

CAGCACATTT GTGTGAATGA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATG CAGATAAAAA AACATGTTCA GTCCGTGACA AGTGTGCCCT AGGCTCTCAT 1260 

GGTTGCCAGC ACATTTGTGT GAGTGATGGG GCCGCATCCT ACCACTGTGA TTGCTATCCT 1320 

GGCTACACCT TAAATGAGGA CAAGAAAACA TGTTCAGCCA CTGAGGAAGC ACGAAGACTT 1380 

GTTTCCACTG AAGATGCTTG TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT GATGACATTT TGGAGAAGTT GAAAATAAAT 1500 

GAATATGGAC AAATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1620

TAATTTGCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCAGAATC AGCATGATTT TTCCAAGGAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

GTGTTTCTTT ATCTACTAAT TGAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 
TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTGT T 
TTGAAGATGT GATCAATAGA TTGTAATACA C
TGAACATTAC ATTGCCATTT TTAATTCATT C 
AGCACTAGTT GTGAATTTAG GGTGTTJ 

CTTTATTATT TTGCTTCAGG ATCCAAGTGA CAAAGTTATA 1 
ATCGACAAAA TCTAATGTTG TCTTTTTAAT GTTAGTGATC C 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTGAAAC TTGCAACTTT GAGAACAAAA CAGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCGT TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATG TCTATTTTTG ATGACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 
TTAAAAAAAA AAAAAAAAA 



Protein Accession #: NP_002372.1 

i I 1 i 1 i 1 r r 

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP VARPGERRLE TRGPGGSPGR RPE 
PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTFVSRII DTLDIGPADT 
RVAVVNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGLAIQTAM DEAFTVEAGA
REPSSNIPKV AIIVTDGRPQ DQVNEVAARA QASGIELYAV GVDRADMASL KMMASEPLEE 
HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQCQHVCI SDGEGKHHCE CSQGYTLNAD 
KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 
HICVHDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG CQHICVSDGA ASYHCDCYPG 
YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVSS YLQRLNTKLD DILEKLKINE 
YGQIHR 



I I I I I I 

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 
CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 
GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 
AAGTGCATGG ACTGCGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC
GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 
CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA
GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 
ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 
TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 
GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 
AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC
ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT
CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC
TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT
TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 
CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG G 
AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA
Seq ID NO: 444 Protein sequence 



I I I I I I 

GCACTCCGCA GCCTTTAAGG TTCGCGCGGG GGCCAGGCAA GAGTTAGCCA TGAAGAGCCT 60 

CAAGTCCCGC CTGAGGAGGC AGGACGTGCC GGGCCCCGCG TCGTCTGGCG CCGCCGCCGC 120 

20 CAGCGCGCAT GCAGCAGATT GGAATAAATA TGATGACCGA TTGATGAAAG CAGCAGAAAG 180 

GGGGGATGTA GAAAAAGTGA CCTCAATCCT TGCTAAAAAG GGGGTCAATC CAGGCAAACT 240 

AGATGTGGAA GGCAGATCTG TCTTCCATGT TGTGACCTCA AAGGGGAATC TTGAGTGTTT 300 

GAATGCCATC CTTATACATG GAGTTGATAT TACAACCAGT GACACTGCAG GGAGAAATGC 360 

TCTTCACCTG GCTGCTAAGT ATGGACATGC ATTGTGCCTA CAAAAACTTC TACAGTACAA 420 

25 TTGTCCCACT GAGCATGCAG ACCTGCAGGG AAGAACTGCA CTTCAAAAAA AAGCAATGGC 480 

AGATTGTCCT TCTAGCATAC AGCTGCTTTG TGACCATGGG GCCTCTGTGA ATGCCAAAGA 540 

TGTAAAAGGG CGGACACCAC TTGTTCTGGC TACTCAGATG AGTAGGCCAA CAATATGTCA 600 

CATGCTAGGT TGCGAATATG GTTGCAGAGA TGCAGTAGAA GTCTTAATTA AAAATGGTGC 720

TGATATAAGC TTGCTGGATG CGCTTGGCCA TGATAGTTCT TACTATGCAA GAATTGGTGA 780

CAATCTGGAC ATTCTAACCT TGTTGAAGAC TGCATCGGAA AATACCAACA AAGGGAGAGA 840 
ACTTTGGAAG AAAGGACCAT CTTTGCAACA GCGAAATTTG ACACACJ 
AAATGTGAAG TCACATCAGA GGGAGCATCA AAATATTCAG GATTTGGAGA T 



3 J CAATGGTTTA CAGTTACAGC TGAATGAGGA AGTTATGGTT G 

GAGAGAAAAG CTGAAGTCCC TTTTGGCAGC TAAAGAAAAG CAACATGAAG A 
GACTATTGAG GCTCTGAAAA ATAGATTTAA ATATTT1 

AAGTCATTTC AGTAACCGAA AAGAAGATAT GCTTCTTAAA CAAGGTCAGA T 

AGACTCACAG TGTACTTCCC CAGGTATACC AGCCCATATG CAAAGCAGAT CTATGTTAAG 1320

ACCTCTGGAA CTATCTTTAC CCAGTCAAAC GTCATACTCT GAAAATGAAA TTTTAAAGAA 1380

AGAGTTAGAA GCAATGCGAA CTTTCTGTGA GTCAGCAAAA CAAGACCGAC TGAAGCTCCA 1440 

AAATGAACTG CCACACAAAG TGGCAGAATG CAAAGCTTTA GCATTAGAAT GTGAAAGGGT 1500 

CAAGGAGGAT TCAGATGAAC AGATAAAGCA ATTAGAAGAT GCATTAAAAG ATGTGCAGAA 1560

GAGGATGTAT GAGTCAGAAG GTAAAGTTAA ACAAATGCAG ACCCATTTTC TTGCCCTTAA 1620 

ACAACACTTA ACAAGTGAAG CAGCCTCAGG GAATCACAGA CTAACCGAGG AACTGAAGGA 1680

TCAGTTGAAA GACTTGAAAG TAAAATATGA AGGTGCTTCA GCAGAAGTGG GGAAATTAAG 1740 

AAACCAAATC AAACAAAATG AGATGATAGT AGAAGAGTTT AAGAGGGATG AAGGCAAGCT 1800 

GATAGAAGAA AATAAGCGAT TACAGAAGGA ACTTAGTATG TGTGAAATGG AGCGAGAGAA 1860

GAAAGGAAGA AAGGTCACAG AGATGGAAGG CCAGGCAAAA GAATTGTCAG CGAAGTTGGC 1920

CCTTTCCATT CCAGCTGAAA AATTIGAAAA CATGAAGAGC TCATTATCAA ATGAAGTGAA 1980

TGAGAAAGCA AAAAAATTAG TAGAAATGGA AAGAGAACAT GAAAAATCAC TTAGTGAAAT 2040

TAGACAGTTA AAGAGAGAAC TTGAGAATGT TAAGGCCAAG CTTGCTCAGC ACGTCAAACC 2100 

AGAGGAACAT GAACAGGTTA AGAGCAGATT AGAACAGAAA TCAGGAGAAC TTGGGAAGAA 2160 

GATCACTGAG TTAACATTGA AAAATCAGAC ACTACAAAAG GAAATTGAAA AAGTTTATTT 2220

GGATAATAAG CTCCTCAAGG AGCAAGCACA TAACTTAACA ATTGAAATGA AAAATCATTA 2280

TGTTCCTTTA AAAGTAAGTG AAGACATGAA AAAGTCACAT GATGCAATTA TTGATGATCT 2340 

TAATAGAAAG CTTTTAGATG TAACACAAAA ATATACAGAA AAGAAGTTGG AAATGGAGAA 2400 

ATTGCTACTG GAAAATGACA GCTTAAGTAA GGATGTAAGC CGCCTAGAAA CTGTGTTTGT 2460

ACCTCCTGAG AAACATGAAA AAGAGATAAT AGCTCTGAAA TCCAATATTG TTGAACTTAA 2520 

GAAACAGCTG TCTGAACTTA AGAAAAAATG TGGTGAAGAC CAGGAGAAAA TACACGCTCT 2580

CACATCTGAA AACACTAACT TGAAGAAGAT GATGAGTAAT CAGTATGTGC CAGTTAAAAC 2640

CCATGAAGAG GTTAAAATGA CACTGAATGA CACGTTAGCC AAAACTAACA GAGAATTATT 2700 

AGATGTGAAG AAAAAATTTG AAGATATAAA TCAGGAATTT GTAAAAATAA AAGATAAGAA 2760 

TGAAATATTA AAAAGAAACC TGGAAAACAC TCAGAACCAA ATAAAAGCTG AGTACATCAG 2820 

CCTGGCAGAG CACGAGGCAA AGATGAGCTC GCTAAGTCAG AGCATGAGAA AGGTGCAGGA 2880

TAGTAATGCT GAAATCTTGG CCAACTACAG AAAAGGCCAA GAAGAGATTG TGACACTGCA 2940 

A AGAAGGAGCT CGACACAATA CAAGAATGCA TTAAGGTAAA 3000

T TTGAGGAGTG CGAGAGAAAA TTTAAAGCAA CAGAGAAAGA 3060

iGACACA AAAGTATAGT GTCAGTGAAG AAGAAGTCAA 3120 

3 CAAGAGAATG ACAAGTTAAA GAAGGAGATT TTTACCCTTC AGAAAGATTT 3180 

GAGAGATAAG ACAGTTCTCA TTGAGAAGTC TCATGAAATG GAAAGAGCAT TAAGCAGAAA 3240 

PAAAAGA CTTGTCACAG AAATACACGG AAGTAAAGAA 3300 

G AAGCTAGTAG AAGAAAATGC CAAACAGACT TCTGAGATAC TTGCAGTGCA 3360 

AAATCTTTTG CAAAAACAAC ATGTTCCATT GGAACAGGTT GAGGCTCTGA AAAAATCTCT 3420 

TAATGGCACA ATTGAAAATC TAAAGGAAGA ACTGAAGAGT ATGCAAAGGT GTTACGAGAA 3480 

AGAGCAGCAG ACAGTGACCA AACTGCATCA ATTGTTGGAG AATCAAAAGA ACTCTTCTGT 3540

ACCCCTGGCA GAGCATTTGC AGATTAAAGA AGCATTTGAG AAAGAAGTTG GAATCATAAA 3600 

AGCCAGCTTG AGAGAAAAGG AAGAAGAAAG CCAAAACAAA ATGGAAGAAG TCTCCAAACT 3660 

TCAGTCGGAG GTTCAGAATA CTAAACAAGC ATTAAAAAAA TTAGAGACTA GAGAGGTAGT 3720 

TGACTTGTCT AAATATAAAG CAACAAAAAG TGATTTGGAG ACACAGATTT CTAGCTTAAA 3780 

TGAAAAATTG GCCAATCTGA ATAGAAAGTA TGAGGAAGTA TGTGAGGAAG TTTTGCATGC 3840

CAAAAAGAAG GAAATATCTG CAAAAGATGA GAAGGAATTA CTGCATTTCA GCATTGAGCA 3900 

AGAAATTAAG GATCAGAAGG AACGATGTGA TAAGTCCTTA ACAACAATCA CAGAGTTACA 3960 

AAGAAGAATA CAAGAATCTG CTAAACAAAT AGAAGCAAAA GATAATAAGA TAACTGAACT 4020 
GCTTAATGAT GTGGAAAGAT TAAAACAGGC ACTCAATGGC CTTTCCCAAC TCACCTACAC 4080

AAGTGGGAAC CCCACCAAGA GGCAGAGCCA GCTGATTGAC ACTCTGCAGC ACCAAGTGAA 4140 

ATCTCTGGAG CAACAGCTGG CCGATGCTGA CAGACAGCAC CAAGAAGTAA TTGCAATTTA 4200 

TCGGACACAC CTTCTTAGTG CTGCACAGGG TCACATGGAT GAAGATGTTC AGGAGGCTCT 4260

GCTCCAGATC ATACAAATGC GGCAGGGGCT TGTGTGCTAG CCGTTAGCAC TGACTGCCAG 4320 

TATCTGTTTT ATCTTGCTGG TGCTGAACAT TCTTTGTGCA ACTCCATGGT CTTTCTGGGC 4380 
CTTACTGTGC TGGTATAATT AAAATAAAAT ATATTTTGTT CTGGGTGT 



| | | | | |









PGKLDVEGRS VFHWTSKGN LECLNAILIH GVDITTSDTA GRNALHLAAK YGHALCLQKL 
LQYNCPTEHA DLQGRTALQK KAMADCPSSI QLLCDHGASV NAKDVKGRTP LVLATQMSRP 
TICQLLIDRG ADVNSRDKQN RTALMLGCEY GCRDAVEVLI KNGADISLLD ALGHDSSYYA 
RIGDNLDILT LLKTASENTN KGRELWKKGP SLQQRNLTHM QDEVNVKSHQ REHQNIQDLE 
IENEDLKERL RKIQQEQRIL LDKVNGLQLQ LNEEVMVADD LESEREKLKS LLAAKEKQHE 
ESLRTIEALK NRFKYFESDH LGSGSHFSNR KEDMLLKQGQ MYMADSQCTS PGIPAHMQSR 
SMLRPLELSL PSQTSYSENE ILKKELEAMR TFCESAKQDR LKLQNEIiAHK VAECKALALE 
CERVKEDSDE QIKQLEDALK DVQKRMYESE GKVKQMQTHF LALKSHLTSE AASGKHRLTE 
ELKDQLKDLK VKYEGASAEV GKLRNQIKQN EMIVEEFKRD EGKLIEEHKR LQKELSMCEM 
EREKKGRKVT EMEGQAKELS AKLALSIPAE KFENMKSSLS NEVNEKAKKL VEMEREHEKS 
LSEIRQLKRE LENVKAKLAQ HVKPEEHEQV KSRLEQKSGE LGKKITELTL KNQTLQKEIE 



EMEKLLLEND SLSKDVSRLE TVFVPPEKHE KEIIALKSNI VELKKQLSSL KKKCGEDQEK 
IHALTSENTN LKKMMSNQOT PVKTHEEVKM TLNDTIAKTN RELLDViCKKP EDINQEFVKI 
KDKNEILKRN LENTQNQIKA EYISLAEHEA KMSSLSQSMR KVQDSNAEIL ANYRKGQEEI
VTLHAEIKAQ KKELDTIQEC IKVKYAPIVS FEECERKFKA TEKELKDQLS EQTQKYSVSE
EEVKKNKQEN DKLKKEIFTL QKDLRDKTVL IEKSHEMERA LSRKTDELNK QLKDLSQKYT 
EVKNVKEKLV EENAKQTSEI LAVQNLLQKQ HVPLEQVEAL KKSLNGTIEN L 
CYEKEQQTVT KLHQLLENQK NSSVPLAEHL QIKEAFEKEV GIIKASLREK E 
VSKLQSEVQN TKQALKKLET REWDLSKYK ATKSDLETQI SSLNEKLAML NRKYEEVCEE 1260
VLHAKKKEIS AKDEKELLHF SIEQEIKDQK ERCDKSLTTI TELQRRIQES AKQIEAKDNK 1320 
R LKOALNGLSQ LTYT5GNPTK RQSQLIDTLQ HQVKSLEQQL ADADRQHQEV 1380 
S AAQGHMDEDV QEALLQIIQM RQGLVC 



GGGCATTGCC AGGCCCCGAG TGGAATATCC AGCTCACCAG GCCATGAATC TTGTGGGCCC 
CCAGAGCATT GAAGGTGGAG CTCATGAAGG ACTTCAGCAT TTGGGTCCTT TTGGCAACAT 
CCCCAACATC GTGGCAGAGT TGACTGGAGA CAACATTCCT AAGGACTTTA GTGAGGATCA 
GGGGTACCCA GACCCTCCAA ATCCCTGTCC TGTTGGAAAA ACAGATGATG GATGTCTAGA
AAACACCCCT GACACTGCAG AGTTCAGTCG AGAGTTCCAG TTGCACCAGC ATCTCTTTGA 
TCCGGAACAT GACTATCCAG GCTTGGGCAA GTGGAACAAG AAACTCCTTT ACGAGAAGAT 
GAAGGGAGGA GAGAGACGAA AGCGGAGGAG TGTCAATCCA TATCTACAAG GACAGAGACT 
GGATAATGTT GTTGCAAAGA AGTCTGTCCC CCATTTTTCA GATGAGGATA AGGATCCAGA 
GTAAAGAGAA GATGCTAGAC GAAAACCCAC ATTACCTGTT AGGCCTCAGC ATGGCTTATG



TGCAGTTTAA AATGGTGTCT GAGGTTGTAC TATTTTGGCC AAGTCTGTAG AAAGCTGTCA 1020 
TTTGATTTTG ATTATGTAGT TCATCCAGCC CTTGGGCATT GTTATACACC AGTAAAGAAG 1080 
GCTGTACTCA AGAGGAGGAG CTGACACATT TCACTTGGCI GCGTCTTAAT AAACATGAAT 1140 



MVSRMVSIML SGLLFWLASG WTPAFAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 

PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDFSED QGYPDPPNPC 

PVGKTDDGCL ENTPDTAEFS REFQLHQHLF DPEHDYPGLG K 
SVNPYLQGOR L 



CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 
CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGC-GGAC CCTTCGTGTC 
CGGTGGTTGC TGTTGCTTGG CCTGGTCGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 
CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240

AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 

AAAGAGCATA TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 

TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420

CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480

GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 

AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600

TG1GGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 

GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780

GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840 

ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900 

GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960 

CGTCGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080

CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAATCACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 1200

GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AC-GACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320

CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380

GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440 

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620

GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680

GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980

GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100 

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCCGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460

CCGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520

TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700

AGTTGTGAAA TACAAGGAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760

CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820

TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880

AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCl'G TCACTCACTA 3060 

CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300

TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420

CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480

AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 

AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT ITTTCATAAC CTTTCATAAT 3720

AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 3780

TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840 
AAAAAAAAAA AAAAAAAAAA AAAAA 

Seq ID NO: 450 Protein sequence 
Protein Accession #: NP_003807.1



MGSGARPPSG TLRVRWLLLL GLVGPVLGAA RPGFQQTSHL SSYEIITPWR LTRERHEAPR 60 

PYSKQVSYVI QAEGKBHIIH LERNKDLLPE DFWYTYNKE GTLITDHPNI QNKCHYRGYV 120

DIEKETAKDE EEEPPSMTQL LRRRRAVLPQ TRYVELF I W DKERYDMMGR NQTAVREEMI 240 

LLANYLDSMY IMLNIRIVLV GLEIWTNGNL INIVGGAGDV LGNFVQWREK FLITRRRHDS 300

AQLVLKKGFG GTAGMAFVGT VCSRSHAGGI NVFGQITVST FASIVAHELG HNLGMNHDDG 360 

RDCSCGAKSC IMNSGASGSR NFSSCSAEDF EKLTLNKGGN CLLNIPKPDE AYSAPECGNK 420

LVDAGEECDC GTPKECELDP CCEGSTCKLK SFAECAYGDC CKDCRFLPGG TLCRGKTEEC 480 

DVPEYCKGSS QFCQPDVFIQ NGYPCQNNKA YCYNGMCQYY DAQCQVIFGS KAKAAPKDCF 540 

IEVNSKGDRF GNCGFSGNEY KKCATGNALC GKLQCENVQE IPVFGIVPAI IQTPSRGTKC 600 

WGVDFQLGSD VPDPGMVNEG TKCGAGKICR NFQCVDASVL NYDCDVQKKC HGHGVCNSNK 660 

NCHCENGWAP PNCETKGYGG SVDSGPTYHE MNTALRDGLL VFFFLIVPLI VCAIFIFIKR 720

DQLWRSYFRK KRSQTYESDG KNQANPSRQP GSVPRHVSPV TPPREVPIYA NRFAVPTYAA 780 
KQPQQFPSRP PPPQPKVSSQ GNLIPARPAP APPLYSSLT 
| | | | | |

GGTTTCAATA TATGCAGATG TCTCGATATA GGAATGAAAT TACGTCTTTG GAACAACTTA 
AATAAGTCAA ATATACTTGG AGCTTTAAAA ATTARAAGGA GAGAGATTCG AGCACCTTTT 
CTGCTGCCAT GACAACCATG CAAGGAATGG AACAGGCCAT GCCAGGGTTG GCCTGGTGTG 
CCCCAGCTGG GAAACATGGC TGTCATACAT TCACATCTGT GGAAAGGATT GCAAGAGAAG 
TTCTTGAAGG GAGAACCCAA AGTCCTTGGG GTTGTGCAGA TTCTGACTGC CCXGATGAGC 
CTTAGCATGG GAATAACAAT GATGTGTATG GCATCTAATA CTTATGGAAG TAACCCTATT 
TCCGTGCATA TCGGGTACAC AATTTGGGGG TCAGTAATGT TTATTATTTC AGGATCCTTG 
TCAATTGCAG CAGGAATTAG AACTACAAAA GGCCTGGTCC GAGGTAGTCT AGGAATGAAT
ATCACCAGCT CTGTACTGGC TGCATCAGGG ATCTTAATCA ACACATTTAG CTTGGCGTTT
TATTCATTCC ATCACCCTTA CTGTAACTAC TATGGCAACT CAAATAATTG TCATGGGACT 
ATGTCCATCT TAATGGGTCT GGATGGCATG GTGCTCCTCT TAAGTGTGCT GGAATTCTGC 
ATTGCTGTGT CCCTCTCTGC CTTTGGATGT AAAGTGCTCT GTTGTACCCC TGGTGGGGTT 
GTGTTAATTC TGCCATCACA TTCTCACATG GCAGAAACAG CATCTCCCAC ACCACTTAAT 
GAGGTTTGAG GCCAACAAAA GATCAACAGA CAAATGCTCC AGAAATCTAT GCTGACTGTG 
ACACAAGAGC CTCACATGAG AAATTACCAG TATCCAACTT CGATACTGAT AGACGTGTTG 
ATATTATTAT TATATGTAAT CCAATTATGA ACTGTGTGTG TATAGAGAGA TAATAAATTC 
AAAATTATGT TCTCATTTTT TTCCCTGGAA CTCAATAACT CACTTCACTG GCTCTTTATC 
GAGAGTACTA GGAGTTAAAT TAATAAATAA TGCATTTAAT GAGGCCACAG GAAAAA 

Seq ID NO: 452 Protein sequence



GLQEKPLKGE PKVLGWQIL TALMSLSMGI TMMCMASNTY GSNPISVHIG 
IRTTKGLVRG SLGMNITSSV LAASGILINT FSLAPYSPHH 
VLEPCIAVSL SAFGCKVLCC TPGGWLILP 



| | | | | |

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 
CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 
AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 
CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 
TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC ACGTGGGAAG AAGCTGCAAG 
GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 
GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 
AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 
CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGARAA ACAAAACCCC 
TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 
AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 
CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT
TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 
TAAAAGCTTA AACACAT 






MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGEESS 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPE WDSEDSSNFK 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 



G GCCGTCCTCC GTGGAACCGG TTTTCCGAGG 
CTGGCAAAAG CCGAGGCTGG ATTTGGGGGA GGAATATTAG ACTGGGAGGA GTCTGCGCGC 
TTTTCTCCTC CCCGCGCCTC CCGGTCGCCG CGGGTTCACC GCTCAGTCCC CGCGCTCGCT 
CCGCACCCCA CCCACTTCCT GTGCTCGCCC GGGGGGCGTG TGCCGTGCGG CTGCCGGAGT 
TCGGGGAAGT TGTGGCTGTC GAGAATGGGG GTCTGTGGGT ACCTGTTCCT GCCCTGGAAG 
TGCCTCGTGG TCGTGTCTCT CAGGCTGCTG TTCCTTGTAC CCACAGGAGT GCCCGTGCGC 
G CCACCTTCCC CAAAGCTATG GACAACGTGA CGGTCCG 

T TGACAACCGG GTCACCCGGG TGGCCTGGCT A 
ACCATCCTCT ATGCTGGGAA TGACAAGTGG TGCCTGGATC CTCGCGTGGT CCTTCTGAGC 
AACACCCAAA CGCAGTACAG CATCGAGATC CAGAACGTGG ATGTGTATGA CGAGGGCCCT
TACACCTGCT CGGTGCAGAC AGACAACCAC CCAAAGACCT CTAGGGTCCA CCTCATTGTG 
CAAGTATCTC CCAAAATTGT AGAGATTTCT TCAGATATCT CCATTAATGA AGGGAACAAT 
ATTAGCCTCA CCTGCATAGC AACTGGTAGA CCAGAGCCTA CGGTTACTTG GAGACACATC 
TCTCCCAAAG CGGTTGGCTT TGTGAGTGAA GACGAATACT TGGAAATTCA GGGCATCACC
CGGGAACAGT CAGGGGACTA CGAGTGCAGT GCCTCCAATG ACGTGGCCGC GCCCGTGGTA 
CGGAGAGTAA AGGTCACCGT GAACTATCCA CCATACATTT CAGAAGCCAA GGGTACAGGT
GTCCCCGTGG GACAAAAGGG GACACTGCAG TGTGAAGCCT CAGCAGTCCC CTCAGCAGAA 
TTCCAGTGGT ACAAGGATGA CAAAAGACTG ATTGAAGGAA AGAAAGGGGT GAAAGTGGAA 
AACAGACCTT TCCTCTCAAA ACTCATCTTC TTCAATGTCT CTGAACATGA CTATGGGAAC 
TACACTTGCG TGGCCTCCAA CAAGCTGGGC CACACCAATG CCAGCATCAT GCTATTTGGT 
CCAGGCGCCG TCAGCGAGGT GAGCAACGGC ACGTCGAGGA GGGCAGGCTG CGTCTGGCTG 
CTGCCTCTTC TGGTCTTGCA CCTGCTTCTC AAATTTTGAT GTGAGTGCCA CTTCCCCACC 
CGGGAAAGGC TGCCGCCACC ACCACCACCA ACACAACAGC AATGGCAACA CCGACAGCAA 
CCAATCAGAT ATATACAAAT GAAATTAGAA GAAACACAGC CTCATGGGAC AGAAATTTGA 
GGGAGGGGAA CAAAGAATAC TTTGGGGGGA AAAGAGTTTT AAAA^AGAAA TTGAAAATTG 
CCTTGCAGAT ATTTAGGTAC AATGGAGTTT TCTTTTCCCA AACGGGAAGA ACACAGCACA 
CCCGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG 
GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CAGGTGGAAC ATTCTGGAGC TGGCCATCCC 
AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 
CCGGCCCAAG CGTGGCGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA 

Protein Accession #: NP_057606.1

| | | | | |

MGVCGYLPLP WKCLVWSLR LLFLVPTGVP VRSGDATFPK AMDNVTVROG ESATLRCTID 
[LYAGND KWCLDPRWL LSNTQTQYSI EIQNVDVYDE GPYTCSVQTD 
j IVQVSPKIVE ISSDISINEG NNISLTCIAT GRPEPTVTWR HISPKAVGFV 
SEDEYLEIQG ITREQSGDYE CSASNDVAAP WRRVKVTVN YPPYISEAKG TGVPVGQKGT 
LQCEASAVPS AEFQWYKDDK RLIEGKKGVK VENRPFLSKL IFFNVSEHDY GNYTCVASNK 
LGHTNASIML FGPGAVSEVS NGTSRRAGCV WLLPLLVLHL LLKF

Seq ID NO: 457 DNA sequence 

Nucleic Acid Accession #: NM_012261.1



| | | | | |

GATTTGCTCT GCCAGCAGCT GTCGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 60 

ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 120 

CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180 

CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240

ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300 

GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360 

TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420 

GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480

TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGG1GGA1CG 540 

CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600 

GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 660 

CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 720 

CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGGTCAACAA ACCA11TCAC TGGCCTCTAG 780

TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840 

TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900 

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960 

CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020 

ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 1080

CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 1140 

CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 1200 

AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260 

ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 1320

TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380 

TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 1440 

TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500 



T GCTACCCGCA TCCAGCGACT G 
AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA G 
GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 
TTCTCTGGC 



Protein Accession # 



HDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFW RE1JGTTCLMA 
EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSEUJVFW VDRAYALKML 
FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 
ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS SEKKCPVDER EQLEETLPLI 
LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG 



Coding sequenc 
| | | | | |

TAGGAGATAA GAGTATCTTG CACAGCAGGT GCAGGTTTCC CAGCAGCTCA GGCAAGAGTC 
CGATGTTTGT GCCATCTGAT CCTGATGTCT GGAGAGATAG CCATGTG1GA GCCTGAATTT 
GGCAATGACA AGGCCAGGGA GCCGAGCGTG GGTGGCAGGT GGCGAGTGTC CTGGTACGAA 
CGGTTTGTGC AGCCATGTCT GGTCGAACTG CTGGGCTCTG CTCTCTTCAT CTTCATCGGG 
TGCCTGTCGG TCATTGAGAA TGGGACGGAC ACTGGGCTGC TGCAGCCGGC CCTGGCCCAC 
T TGGGGCTCGT GATTGCCACG CTGGGGAATA TCAGTGGTGG ACACTTCAAC 
T CCCTGGCAGC CATGCTGATC GGAGGCCTCA ACCTGGTGAT GCTCCTCCCG
r CACAGCTGCT CGGGGGGATG CTCGGGGCTG CCTTGGCCAA G 
FC GGTTCTGGAA TGCATCTGGG GCGGCCTTTG TGACAGTCCA G 
CAGGTGGCAG GGGCGTTGGT GGCAGAGATC ATCCTGACGA CGCTGCTGGC C 

GCCGTCACCG TGGATATCCT GGCTGGGGGC CCTGTGTCTG GAGGCTGCAT GAATCCCGCC 
CGTGCTTTTG GACCTGCGGT GGTGGCCAAC CACTGGAACT TCCACTGGAT CTACTGGCTG 
GGCCCACTCC TGGCTGGCCT GCTTGTTGGA CTGCTCATTA GGTGCTTCAT TGGAGATGGG 
AAGACCCGCC TCATCCTGAA GGCTCGGTGA GCAGAGCTCG TGGGATTCCT GCTGCTCCAG 
GTGTCCTCAG CTCACCTGTC CCAGACTGAG GACAGGGGAG TTCCTGCATT TCCTGCCAGG 
GCAGAGGCCC AGAGGAGCGA CCCCCTGCTT CCACTGCTTG GGCCTGCTTT CTCAGATAGA 

CTGGGGAACA CGCTGCCCGC ACTGCCCAGA GAGCAGTGCA AACACCACAA CACGAGCGTG 
TTTCTTGAGA GGAATGTCCC CGAGTTGGAC AAGGAGGCTG TTTCTGCACA TCAGCTCATT 
TCCCGCACCC CATTTCTTGC TTGATTGCTT TGTTGGGGGC CTGGCCACTT CCTTGCTTCT 
CAAGCTGACA ATTCTCACTT TGCAATAAAT AGTCCAGTGT TTCCTTCAT 



MSGEIAMCEP EFGNDKAREP SVGGRWRVSW YERPVQPCLV ELLGSALPIP IGCLSVIENG 
TDTGLLQPAL AHGLALGLVI ATLGNISGGH FNPAVSLAAM LIGGLMLVML LPYWVSQLLG 
GMLGAALAKV VSPEERPWNA SGAAFVTVQE QGQVAGALVA EIILTTLLAL AVCMGAINEK 
TKGPLAPPSI GFAVTVDILA GGPVSGGCMN PARAPGPAW ANHWNFHWIY WLGPLLAGLL 
VGLLIRCFIG DGKTRLILKA R 



Seq ID NO: 461 DNA sequence



1 11 21 31 41 51 

| | | | | |

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 
Gl rGC, AC -c li;ic;C^G T^CCC4C4CJCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 
CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 
TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 
CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CAC1GTTCAT 
CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 
CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCG 



Seq ID NO: 462 Protein sequence



MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 



| | | | | |

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC
ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTC 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCC1CGCCAG C 
GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTAC 

CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC C 

T CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 



ACCATGCATC ATAAARTTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 
CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 
GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 
CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG A 
CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG " 
TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAftG TCTTACCGAA AAGGCTGTGG 780 
ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840
ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900
TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960
TTATCTGTGC AGAATATATT- TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020
CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080
CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140



WO 03/042661 



ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320 

CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 1380

TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 1500 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 



| | | | | |

GAACCGTTTA CTCGCTGCTG TGCCCATCTA TCAGCAGGCT CCGGGCTGAA GAT
CTTCTCTCCT CCAAGGTCTA GTGACGGAGC CCGCGCGCGG CGCCACCATG CGGCAGAAGG 
CGGTATCGCT TTTCTTGTGC TACCTGCTGC TCTTCACTTG CAGTGGGGTG GAGGCAGGTA 
AGAAAAAGTG CTCGGAGAGC TCGGACAGCG GCTCCGGGTT CTGGAAGGCC CTGACCTTCA 
TGGCCGTCGG AGGAGGACTC GCAGTCGCCG GGCTGCCCGC GCTGGGCTTC ACCGGCGCCG 
GCATCGCGGC CAACTCGGTG GCTGCCTCGC TGATGAGCTG GTCTGCGATC CTGAATGGGG 
GCGGCGTGCC CGCCGGGGGG CTAGTGGCCA CGCTGCAGAG CCTCGGGGCT GGTGGCAGCA
GCGTCGTCAT AGGTAATATT GGTGCCCTGA TGGGCTACGC CACCCACAAG TATCTCGATA 
GTGAGGAGGA TGAGGAGTAG CCAGCAGCTC CCAGAACCTC TTCTTCCTTC TTGGCCTAAC 
TCTTCCAGTT AGGATCTAGA ACTTTGCCTT TTTTTTTTTT TTTTTTTTTT TTTGAGATGG 
GTTCTCACTA TATTGTCCAG GCTAGAGTGC AGTGGCTATT CACAGATGCG AACATAGTAC 
ACTGCAGCCT CCAACTCCTA GCCTCAAGTG ATCCTCCTGT CTCAACCTCC CAAGTAGGAT 
TACAAGCATG CGCCGACGAT GCCCAGAATC CAGAACTTTG TCTATCACTC TCCCCAACAA 
CCTAGATGTG AAAACAGAAT AAACTTCACC CAGAAAA 



Seq ID NO: 467 DNA sequence



1 11 21 31 41 51 

| | | | | |

GAAACGGCCC GAGAAGCTCG CCCGGAGAAC GGGGAGGAAT ATGCTGTGGA GCTCCTCTGC 60

CATATAAACA AAAAGAGGAA ATCTTTCAAA CATGGCTGAA GCAAAGACCC ACTGGCTTGG 120 

AGCAGCCCTG TCTCTTATCC CTTTAATTTT CCTCATCTCT GGGGCTGAAG CAGCTTCATT 180 

TCAGAGAAAC CAGCTGCTTC AGAAAGAACC AGACCTCAGG TTGGAAAATG TCCAAAAGTT 240 

TCCCAGTCCT GAAATGATCA GGGCTTTGGA GTACATAGAA AACCTCCGAC AACAAGCTCA 300 

TAAGGAAGAA AGCAGCCCAG ATTATAATCC CTACCAAGGT GTCTCTGTCC CCC1TCAGCA 360 

ATGAAA GCCACTTGCC CGAGAGGGAT TCACTGAGTG AAGAAGACTG 420 

A ATACTCGAAG CTTTGAGACA GGCTGAAAAT GAGCCTCAGT CTGCACCAAA 480

3 CCCTATGCCT TGAATTCAGA AAAGAACTTT CCAATGGACA TGAGTGATGA 540

AGTGGC CAGAAAGAAA GCTTAAGCAC ATGCAATTCC CTCCTATGTA 600

TGAAGAGAAT TCCAGGGATA ACCCCTTTAA ACGCACAAAT GAAATAGTGG AGGAACAATA 660

TACTCCTCAA AGCCTTGCTA CATTGGAATC TGTCTTCCAA GAGCTGGGGA AACTGACAGG 720

ACCAAACAAC CAGAAACGTG AGAGGATGGA TGAGGAGCAA AAACTTTATA CGGATGATGA 780

AGATGATATC TACAAGGCTA ATAACATTGC CTATGAAGAT GTGGTCGGGG GAGAAGACTG 840

GAACCCAGTA GAGGAGAAAA TAGAGAGTCA AACCCAGGAA GAGGTGAGAG ACAGCAAAGA 900

GAATATAGGA AAAAATGAAC AAATCAACGA TGAGATGAAA CGCTCAGGGC AGCTTGGCAT 960

CCAGGAAGAA GATCTTCGGA AAGAGAGTAA AGACCAACTC TCAGATGATG TCTCCAAAGT 1020

T TTGAAAAGGT TAGTAAATGC TGCAGGAAGT GGGAGGTTAC AGAATGGGCA 1080

SCCACCA GGCTTTTTGA GAAACCTCTT GATTCTCAGT CTATTTATCA 1140

GCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCCCCAGAA GACTTAATTG AGATGCTCAA 1200

AACTGGGGAG AAGCCGAATG GATCAGTGGA ACCGGAGCGG GAGCTTGACC TTCCTGTTGA 1260

CCTAGATGAC ATCTCAGAGG CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 1320

CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 1380

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGGAG AGTGCAGCAA ATCAGAAAAC 1440

GTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500

TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1560

ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800

TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 1860

AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 1920



TAAGAGAGCA ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980 

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTG 2040 

TAATATCTTT AAATGATGTA CAGGCAGATG AAACCAGGTC ACTGGGGAGT CTGCTTCATT 2100 

TCCTCTGAGC TGTTATCTTG TGTATGGATA TGTGTAAATG TTATGACTCC TTGATAAAAA 2160 

ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAG TATATCTAAT 2220 

GGCTGTGGCA TTGTTGATGC TCACATATGA TAAAAAAGTG TCCTATAATT CTATTGAAAG 2280 

TTTTTAATAT TTATTGAATT ATTTTGTTAC TGTCTGTAGC GT1TTG1GGA GTACTGGACC 2340 



1 11 21 31 41 51 

| | | | | |



MAEAKTHWLG AALSLIPLIP LISGAEAASP QHNQLLQKEP DLRLENVQKF PSPEMIRALE
YIENLRQQAH KEESSPDYNP YQGVSVPLQQ KENGDESHLP ERDSLSEEDW MRIILEALRQ 
AENEPQSAPK ENKPYALNSE KNFPMDMSDD YETQQWPERK LKHMQFPPMY EENSRDNPFK 
RTNEIVEEQY TPQSLATLES VFQELGKLTG PNNQKRERMD EEQKLYTDDE DDIYKANNIA 
YEDWGGEDW NPVEEKIESQ TQEEVRDSKE NIGKNEQIND 3MKRSGQLGI QEEDLRKESK 
DQLSDDVSKV IAYLKRLVNA AGSGRLQNGQ NGERATRLFE KPLDSQSIYQ LIEISRNLQI

K TGEKPNGSVE PERELDLPVD LDDISEADLD HPDLFQNRML SKSGYPKTPG 
3DILNLL GMESAANQKT SYFPNPYNQE KVLPRLPYGA GRSRSNQLPK 
R QMAYENLNDK DQELGEYLAR MLVKYPEIIN SNQVKRVPGQ GSSEDDLQEE 
EQIEQAIKEH LHQGSSQETD KLAPVSKRFP VGPPKNDDTP NRQYWDEDLL MKVLEYLNQE 



GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCA TGTCCGTTCC 
GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGCGTGAA AAAAATCAAA 
GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 
AAGATCTTAA AGCCACGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 
CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 
TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGA^ GGTCCAGCTC AGTGGCACAA
GTGAAAGCAA TGATCGAGAC TAAGACGGGT ATAATCCCTG AGACCCAGAT TGTGACTTGC
AATGGAAAGA GACTGGAAGA TGGGAAGATG ATGGCAGATT ACGC-CATCAG AAAGGGCAAC
TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATG GGGTGTTGGC 
AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GA^CACATCT TCTGATGATT 
TCCCAAAATT AATGAGAATG AGATGAGTAG AGTA^GATTT GGGTGGGATG GGTAGGATGA
AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 
TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 



Seq ID NO: 470 Protein sequence 




AACTCGGCGC TGGTGCTGCT GGCGGTGCGC G 
TTCCTCTGCC ACCTGGCCTT GGTAGACGCG G 
C AGCGCTCTGG C 
G TTCGGCCGAA T 
CGCGCGGCCG CAGTGTGCCG CCCGCTGCGC TATGCGGGGC TCGTCTCCCC G 
CGCACGCTGG CCAGCGCCTC CTGGCTAAGC GGCCTCACCA ACTCGGTTGC GCAAACCGCG 

CCGGCGTTGC TCAAGCTGGC CTGCGGAGGC GACGGAGACA CTACCGAGAA CCAGATGTTC 
T GCTGCTGCCG TTTGCCGTCA TCCTGGCCTC CTACGGTGCC 
: AGCGGAGGCC GGAGGAGGGC GGTGGGCACG 
TGTGGGTCCC ACCTGACAGC CGTCTGCCTG TTCTACGGCT CGGCCATCTA CACCTACCTG 
CAGCCCGCGC AGCGCTACAA CCAGGCACGG GGCAAGTTCG TATCGCTCTT CTACACCGTG 
GTCACACCTG CTCTCAACCC GCTCATCTAC ACCCTCAGGA ATAAGAAAGT GAAGGGGGCA 
GCGAGGAGGC TGCTGCGGAG TCTGGGGAGA GGCCAGGCTG GGCAGTGA 

Seq I 



MKANYSAEER FLLLGFSDWP SLQPVLFALV LLCYLLTLTG NSALVLLAVR 

A GFTTSWPPL LANLRGPALW LPRSHCTAQL CASLHiGSAE CVIAAVMALD
jVSPRLC RTLASASWLS GLTNSVAQTA LLAERPLCAP RLLDHFICEL 
PALLKLACGG DGDTTENQMF AARWILLLP FAVILASYGA VARAVCCMRF SGGRRRAVGT 
CGSHLTAVCL FYGSAIYTYL QPAQRYNQAR GKFVSLFYTV VTPALNPLIY TLRNKKVKGA 
ARRLLRSLGR GQAGQ 

Seq ID NO: 473 DNA sequence



1 11 21 31 41 51 

| | | | | |

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAGCCAG CCAATTCATC AACATTCTGG
TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 
TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 
CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 
AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 
ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 
GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 
TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 
A CTCCCCTGAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTGT 
ICTACTC AACCGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 
5TAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 
\ AGAAGAGTCT AATAAATGGG CAGATCA 
WACAAA GTCACTGGTA GAAAAGf 
GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC A 
GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 
3 GAGCATTCAG TAATCCAAAC GCTGCAGCCC A 
T TCTTGGATAT TAACAAAGAC TCTTCTTGCG T 
3 CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC A 
GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATGTCACTGT GCTAAATGGT 



ACAATGGAGG AGCGCTCATG GGGGCCCTAT ATCACCTGTA TTCAGGGCCT ATGTGCCAAC 
G GGAACTTCTG A 



GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 1440 
TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TCCCCTTCTC 1500 
TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 

Seq ID NO: 474 Protein sequence 



1 11 21 31 41 

| | | | |

MRQSHQLPLV GLLLFSFIPS QLCEICEVSE ENYIRLKPLL NTMIQSNYNR G 
LKLVGIQIQT LMQKMIQQIK YNVKSRLSDV SSGELALIIL ALGVCRNASE NLIYDYHLTD 
KLENKFQAEI ENMEAHNGTP LTNYYQLSLD VIALCLFNGN YSTAEWNHF TPENKNYYFG 
SQFSVOTGAM AVLALTCVKK SLINGQIKAD EGSLKNISIY TKSLVEKILS EKKENGLIGN
TFSTGEAMQA LFVSSDYYNE NDWNCQQTLN TVLTEISQGA FSNPNAAAQV LPALMGKTFL
DINKDSSCVS ASGNFNISAD EPITVTPPDS QSYISVNYSV RINETYFTNV TVLNGSVFLS 
R SWGPYITCIQ GLCANNNDRT YWELLSGGEP LSQGAGSYW 




GCCCCCGCCC GCCCCGGGCC CTGATGGACT GAATGAAGGC TGCCTACACC GCCTATCGAT 60

GCCTCACCAA AGACCTAGAA CGTGCGCCAT GAACCCGGAG CTGACAATGG AAAGTCTGGG 120

CACTTTGCAC GGCGCGCGCG GCGGCGGCAG TGGCGGGGGC GGCGGCGGGG GCGGCGGGGG 180 

CGGCGGCGGG GGCCCGGGCC ATGAGCAGGA GCTGCTGGCC AGCCCCAGCC CCCACCACGC 240 

GCGCCGCGGC CCGCGTGGCT CGCTGCGGGG CCCTCCGCCG CCTCCAACCG CGCACCAGGA 300 

GCTGGGCACG GCGGCAGCGG CGGCAGCGGC GGCGTCGCGC TCGGCCATGG TCACCAGCAT 360

GGCCTCGATC CTGGACGGCG GCGACTACCG GCCCGAGCTC TCCATCCCGC TGCACCACGC 420

CATGAGCATG TCCTGCGACT CGTCTCCGCC TGGCATGGGC ATGAG CAACA CCTACACCAC 480 

GCTGACACCG CTCCAGCCGC TGCCACCCAT CTCCACCGTG TCK3ACAAGT TCCACCACCC 540 

TCACCCGCAC CACCATCCGC ACCACCACCA CCACCACCAC CACCAGCGCC TGTCCGGCAA 600 

CGTCAGCGGC AGCTTCACCC TCATGCGCGA CGAGCGCGGG CTCCCGGCCA TGAACAACCT 660

CTACAGTCCC TACAAGGAGA TGCCCGGCAT GAGCCAGAGC CTGTCCCCGC TGGCCGCCAC 720 

GCCGCTGGGC AACGGGCTAG GCGGCCTCCA CAACGCGCAG CAGAGTCTGC CCAACTACGG 780 

TCCGCCGGGC CACGACAAAA TGCTCAGCCC CAACTTCGAC GCGCACCACA CTGCCATGCT 840 

GACCCGCGGT GAGCAACACC TGTCCCGCGG CCTGGGCACC CCACCTGCGG CCATGATGTC 900 

GCACCTGAAC GGCCTGCACC ACCCGGGCCA CACTCAGTCT CACGGGCCGG TGCTGGCACC 960

CAGTCGCGAG CGGCCACCCT CGTCCTCATC GGGCTCGCAG GTGGCCACGT CGGGCCAGCT 1020 

GGAAGAAATC AACACCAAAG AGGTGGCCCA GCGCATCACA GCGGAGCTGA AGCGCTACAG 1080 

TATCCCCCAG GCGATCTTTG CGCAGAGGGT GCTGTGCCGG TCTCAGGGGA CTCTCTCCGA 1140 

CCTGCTCCGG AATCCAAAAC CGTGGAGTAA ACTCAAATCT GGCAGGGAGA CCTTCCGCAG 1200 

GATGTGGAAG TGGCTTCAGG AGCCCGAGTT CCAGCGCATG TCCGCCTTAC GCCTGGCAGC 1260

GTGCAAACGC AAAGAGCAAG AACCAAACAA AGACAGGAAC AATTCCCAGA AGAAGTCCCG 1320 

CCTGGTGTTC ACTGACCTCC AACGCCGAAC ACTCTTCGCC ATCTTCAAGG AGAACAAACG 1380 

CCCGTCAAAG GAGATGCAGA TCACCATTTC CCAGCAGCTG GGCCTGGAGC TCACAACCGT 1440 

CAGCAACTTC TTCATGAACG CCCGGCGCCG CAGC CTGGAG AAGTGGCAAG ACGATCTGAG 1500 



CACAGGGGGC TCCTCGTCCA CCTCCAGCAC GTGTACCAAA GCATGATGGA AGGACTCTCA 1560
CTTGGGCACA AGTCACCTCC AAATGAGGAC AACAGATACC AAAAGAAAAC AAAGGAAAAA 1620 
GACAC CGGAT TCCTAGCTGG GGCCCTTCAC TGGTG 



MNPELTMESL GTLHGARGGG SGGGGGGGGG GGGGGPGHEQ SLLASPSPHH ARRGPRGSLR 

GPPPPPTAHQ ELGTAAAAAA AASRSAMVTS MASILDGGDY RPELSIPLHH AMSKSCDSSP 

PGMGMSNTYT TLTPLQPLPP ISTVSDKFHH PHPHHHPHHH HHHHHQRLSG NVSGSFTLMR 

DERGLPAMNN LYSPYKEMPG MSQSLSPLAA TPLGNGLGGL HNAQQSLPNY GPPGHDKMLB 

PNFDAHHTAM LTRGEQHLSR GLGTPPAAMM SHLNGLHHPG HTQSHGPVLA PSRERPPSSS 

SGSQVATSGQ LEEINTKEVA QRITAELKRY SIPQAIFAQR VLCRSQGTLS DLLRNPKPWS 

KLKSGRETFR RMWKWLQEPE FQRMSALRLA ACKRKEQEPN KDRNNSQKKS RLVFTDLQRR 
TLFAIFKENK RPSKEMQITI SQQLGLELTT VSNFFMNARR RSLEKWQD 



TCTGCGCGCG GCCGGTAAAG GAACCCCGCG G 
AGACTGGCGC TCCTCGCCGC TTCCGGCGGT C 
TGCAGGAGCT GGCGCGGGCG CTGGCGCATC T 
GGGCCGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 
TCTGGGGCGC CCCCCGCAAC TCTGATCCGG C 
CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC G 
CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG ACCCCGGCCC CCGGTCTACG 



CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 
TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 
CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 
TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 
CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCACGT CCAGAGCAAC 
TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 
GATCTGAGC 

Seq ID NO: 478 Protein sequence 






B RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
3 AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE G 
RVKRLETPAP QVPARRLLPP 



Nucleic Acid Accession #:
Coding sequence: 



GC AGACGGGGCT GCAAAGCTGC AACTAATGGT 
GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTGAT TGATGCGCCA CAGACTTTTT 
TCCCCTCGAC CTCGCCGGCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 
TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGJ 



TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420

TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCGTGCC GAGCCGGGAG 480 

GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCTGC ACCGCTTGCT 540 

CCGAGCCGCG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CTCGCCCGCG 600 

GGCCCCGAGG TCGCCCGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 

TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 
CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 
CCTGTGCCAG GTGCCTTGCG C 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC C 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 
TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCflGCT GCGTCCAGGA GCCGAAGCTA 
ATITTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT T 
ATGTCTCAGC ATCAATGCAC A 
CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 
AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 
ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 
TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GAIACACCAG 1440 
GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

h GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

3 TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

G CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 
A GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTG1 

T GTGAACACTG CCCCACCTGT T 



A TGGAACAACA 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CAT CATTTTC 
h GTCCTGATCA 






TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA G 

G CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT T 
GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT T 
AAAATGTGTC TTACTACTGT TTGAGACTAG T 
ACTTATTTAG ATCAGCATAG AATGTAGATC C 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT A 
CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG G 
TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA G 
ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540
AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600
TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660
TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720
TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780
GAATGTTAA 

Seq ID NO: 480 Protein sequence



JLVLGLG Q( 

LGPBCGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEIMTQVTP 
GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 
SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 
VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMIDQTSHL ALDSKLAGIV 
VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNIHVI FAVQGKQFHW YKDLLPLLPG 
TIAGEIESKA ANLNNLWEA YQKLISEVKV OVECTOVQGIY FNITAICPDG SRKPGMEGCR 
HVTSNDEVLF NVTVTMKKCD VTGGKKYAII KPIGFNETAK IHIHRHCSCQ CEDNRGPKGK 
CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 
VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 
CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 
CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSES 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLK A 



AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACTG ATGAACTAAG CTTGAATAAA 
A TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 



GATGCTCTTT 



AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 480 
\ AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 
:CTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 
I CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTC 
CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA A 
TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 
CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA C 
AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT G 
ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 
TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC ATTTCAAGGA ACCTCTGGTG 1020 
T CAGATGAAA AGAGTTCTGA ACTTATTATT ACTGATTCAA TAACCCTGAA GAATAAAACG 1080 
GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320

ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380

GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAAT C TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC 1CATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 

AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA

1 11 21 31 41 51

I I I I I I 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYSQNESFA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV SRGAVPLEML 180 

EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNRNNSCD SRGQTTKARF 240

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SWPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP EIARKVHTEQ 420

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 480

LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540

QVLNEKKQIY AIKYVMLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSWK DSQVGTVNYM PPEAIKDMSS SRENGKSKSK ISPXSDVNSL 720 

GCILYYMTYG KIPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CLKRDPKQRI 780 

SIPELIAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840
SHNSSSSKTF EKKRGKK 

Seq ID NO: 483 DNA sequence 
Nucleic Acid Accession #: NM_003667.1
Coding sequence: 1..2651

1 11 21 31 41 51 

I I I I I I 

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120

GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180 

CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240

CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300

GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360

CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660

AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1260 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1320

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT C=iGTGC=iGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 
ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCC^G CACCATGGGC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 
CCATGTCTTA A 



I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMI 
PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIP1 
LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLKLD DNALTEIPVQ 
AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 
ITLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDL 
T LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 
F QQLLSLRSLN LAWNKIAIIH PNAFSTLPSL IKLDLSSNLL SSFPITGLHG 
H ALQSLISSEM FPELKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 
DLHKKDAGMF QAQDERDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 
7TSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 
3 VGCHVIGFLS IFASESSVFL LTLAALERGF SVKYSAKFET KAPFSSLKVI 
T MAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 
dlWDCSM VKHIALLLFT NCILNCPVAF LSFSSLINLT FISPEVIKFI 
L.LYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS 
TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCHLSSVAPV PCL 

Seq ID NO: 485 DNA sequence



AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60 
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180



AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 

GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420

GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480

CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540 

GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 600 

GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 

TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720

TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780

CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGIGGCC CACCATTTTC TTCCAGCCAA 840 

TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900 

GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960

CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020

CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080

CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 

AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200

1260 



GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT 
GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG

\ TGCCAGTAGT 



GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740 

AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 

ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 

GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 

TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 

TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 

CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 

TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

TACCTGGCCC TTGTCAAAGT ATTTAATACT TACAT OCGAA AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 

AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 

ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 

CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 
CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GSAGTATCGC TGGCCTTACA 2580

TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 

TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 

TGTCTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760

CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820 

GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 

TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940

TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000

TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCCGT 3060 

ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 

CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 

CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 

AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 

TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 

ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 

TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600

5TTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660 

-CTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

A ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780

TTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 
G TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC A 



2 AAGATGTTTA GCTTTATACC TTGGCCACAG 
3GAAAAT TGTGAACCTA GATGAGGTAC 
A TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 
A AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCG1 
ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC A 
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT A 
GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 
TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 
TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 
ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 
CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



1 11 21 31 41 51 

I I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PAKLSWSFA 
PSSNEVETTS LNDVTLSLLP SNETGVKPQR NICNLSSICH DSAFFRGEIM FQYDKESTVP 
QNQHITNGTL TGVLSLSELK RSELNKTLQT LSETYFIMCA TAEAQSTLNC TFTIKLNNTM 
NACAVIAALE RVKIRPMEHC CCSVRIPCPS SPEELEKLQC DLQDPIVCLA DHPRGPPFSS 
SQSIPWPRA TVLSQVPKAT SFAEPPDYSP VTHNVPSPIG EIQPLSPQPS APIASSPAID 
MPPQSETISS PMPQTHVSGT PPPVKASFSS PTVSAPANVN TTSAPPVQTD ITOTSSISDL 
ENQVLQMEKA LSLGSLEPNL AGEMINQVSR LLHSPPDMLA PLAQRLLiWV DDIGLQLNFS 
NTTISLTSPS LALAVIRVNA SSFNTTTFVA QDPANLQVSL ETQAPENSIG TITLPSSLMN 
NLPAHDMELA SRVQFNFFET PALFQDPSLE NLSLISYVIS SSVANLTVRN LTRNVTVTLK 
HINPSQDELT VRCVFWDLGR NGGRGGWSDN GCSVKDRRLN ETICTCSHLT SFGVLLDLSR 
TSVLPAQMMA LTFITYIGCG LSSIFLSVTL VTYIAFEKIR RDYPSKILIQ LCAALLLLNL 
VFLLDSWIAL YKMQGLCISV AVFLHYFLLV SFTWMGLEAF HMYLALVKVF KTYIRKYILK 
FCIVGWGVPA VWTIILTIS PDNYGLGSYG KFPNGSPDDF CWINNNAVFY ITWGYFCVI 
FLLNVSMFIV VLVQLCRIKK KKQLGAQRKT SIQDLRSIAG LTFLLGITViG FAFFAWGPVN 
VTFMYLFAIF NTLQGFFIFI FYCVAKENVE KQWRRYLCCG KLRLAENSDW SKTATNGLKK 
QTVNQGVSSS SNSLQSSSNS TNSTTLLVNN DCSVHASGNG NASTERNGVS FSVQNGDVCL 
HDFTGKQHMF NEKEDSCNGK GRMALRRTSK RGSLHFIEQM 



A GTGTGGCCAT G 

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 
TCCAGTTT GTCACCACCA CCTGATGTTA CTTTAA 



A AGCTTCAGTG TGACCTGCAG GATCCCATTG TCTGTCTTGC TGACCATCCA 
CGTGGCCCAC CATTTTCTTC CAGCCAATCC ATCCCAGTGG TGCCTCGGGC CACTGTGCTT 
TCCCAGGTCC CCAAAGCTAC CTCTTTTGCT GAGCCTCCAG ATTATTCACC TGTGACCCAC 
G GGAGATTCAA CCCCTTTCAC CCCAGCCTTC AGCTCCCATA 



CAAACCCATG TCTCCGGCAC CCCACCTCCT GTGAAAGCCT CATTTTCCTC TCCCACCGTG 

TCTGCCCCTG CGAATGTCAA CACTACCAGC GCACCTCCTG TCCAGACAGA CATCGTCAAC 

ACCAGCAGTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA TGGAGAAGGC TCTGTCCTTG 

GGCAGCCTGG AGCCTAACCT CGCAGGAGAA ATGATCAACC AAGTCAGCAG ACTCCTTCAT 
TCCCCGCCTG ACATGCTGGC CCCTCTGGCT CAAAGATTGC TGAAAGTAGT GGATGACATT 1140

GGCCTACAGC TGAACTTTTC AAACACGACT ATAAGTCTAA CCTCCCCTTC TTTGGCTCTG 1200 

GCTGTGATCA GAGTGAATGC CAGTAGTTTC AACACAACTA CCTTTGTGGC CCAAGACCCT 1260

GCAAATCTTC AGGTTTCTCT GGAAACCCAA GCTCCTGAGA ACAGTATTGG CACAATTACT 1320 

CTTCCTTCAT CGCTGATGAA TAATTTACCA GCTCATGACA TGGAGCTAGC TTCCAGGGTT 1380 

CAGTTCAATT TTTTTGAAAC ACCTGCTTTG TTTOAGGATC CTTCCCTGGA GAACCTCTCT 1440 

CTGATCAGCT ACGTCATATC ATCGAGTGTT GCAAACCTGA CCGTCAGGAA CTTGACAAGA 1500 

AACGTGACAG TCACATTAAA GCACATCAAC CCGAGCCAGG ATGAGTTAAC AGTGAGATGT 1560 

GTATTTTGGG ACTTGGGCAG AAATGGTGGC AGAGGAGGCT GGTCAGACAA TGGCTGCTCT 1620 

GTCAAAGACA GGAGATTGAA TGAAACCATC TGTACCTGTA GCCATCTAAC AAGCTTCGGC 1680 

GTTCTGCTGG ACCTATCTAG GACATCTGTG CTGCCTGCTC AAATGATGGC TCTGACGTTC 1740 

ATTACATATA TTGGTTGTGG GCTTTCATCA ATTTTTCTGT CAGTGACTCT TGTAACCTAC 1800 

G GAGGGATTAC CCTTCCAAAA TCCTCATCCA GCTGTGTGCT 1860 

T GTATAAGATG 1920 

CAAGGCCTCT GCATCTCAGT GGCTGTATTT CTTCATTATT TTCTCTTGGT CTCATTCACA 1980 

TGGATGGGCC TAGAAGCATT CCATATGTAC CTGGCCCTTG TCAAAGTATT TAATACTTAC 2040 

ATCCGAAAAT ACATCCTTAA ATTCTGCATT GTCGGTTGGG GGGTACCAGC TGTGGTTGTG 2100 
ACCATCATCC TGACTATATC CCCAGATAAC TATGGGCTTG GATCCTATGG G 
3 CAGTATTCTA C 

CAGCTCTGTC GAATTAAAAA GAAGAAGCAA CTGGGAGCCC AGCGAAAAAC CAGTATTCAA 2340

TTTGCCTGGG GACCAGTTAA CGTGACCTTC ATGTATCTGT TTGCCATCTT TAATACCTTA 2460 
A ATTCTGACTG G 

GCTACTAATG GTTTAAAGAA GCAGACTGTA AACCAAGGAG TGTCCAGCTC TTCAAATTCC 

TTACAGTCAA GCAGTAACTC CACTAACTCC ACCACACTGC TAGTGAATAA TGATTGCTCA 2700

GTACACGCAA GCGGGAATGG AAATGCTTCT ACAGAGAGGA ATGGGGTCTC TTTTAGTGTT 2760 

CAGAATGGAG ATGTGTGCCT TCACGATTTC ACTGGAAAAC AGCACATGTT TAACGAGAAG 2820 

GAAGATTCCT GCAATGGGAA AGGCCGTATG GCTCTCAGAA GGACTTCAAA GCGGGGAAGC 2880
TTACACTTTA TTGAGCAAAT GTGA 



MVFSVEQCGH VGETEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PDVTLSLLPS 
IQIDSAFFRG EIMFQYDKES TVPQNQIIITN 
'K IRPKEHCCCS VRIPCPSSPE 
IPWPRATVL SQVPKATSFA EPPDYSPVTH 



NETEKTKITI VKTFNASGVK PQRNICNLSS 
GTLTGVLSLS ELNTLNCTFT IKLNNTMNAC 
ELEKLQCDLQ DPIVCLADHP RGPPFSSSQS 



SPPDMLAPLA QELLKWDDI GLQLNFSKTT 



ANLQVSLETQ APENSIGTIT LPSSLMNNLP AIIDMELASRV QFNFFSTPAL FQDPSLENLS 



LISYVISSSV ANLTVRNLTR NVTVTLKHIN 



'. SLTSPSLAL AVIRVNASSF NTTTFVAQDP 



PSQDELTVRC VFWDLGRNGG RGGWSDNGCS 
I CTCSHLTSFG VLLDLSRTSV LPAQMMALTF ITYIGCGLSS IFLSVTLVTY 
IAFEKIRRDY PSKILIQLCA ALLLLNLVFL LDSWIALYKM QGLCISVAVF LHYFLLVSFT 
HMGLEAFHMY LALVKVFNTY IRKYILKFCI VGWGVPAVW TIILTISPDN YGLGSYGKFP 
NGSPDDFCWI NNNAVFYITV VGYFCVIFLL, NVSMFIWLV QLCRIKKKKQ LGAQRKTSIQ 
DLRSIAGLTF LLGITWGFAF FAWGPVNVTF MYLFAIFNTL QGFFIFIFYC VAKENVRKQW 
RRYLCCGKLR LAENSDWSKT AINGLKKQTV NQGVSSSSNS LQSSSNSTNS TTLLVHNDCS 
VHASGNGNAS TERNGVSFSV QNGDVCLHDF TGKQHMFNEK EDSCNGKGRM ALRRTSKRGS 
LHFIEQM 



Coding sequence 



CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 



ATTCGACCAA TGGAACACTG CTGCTGTTCT GTCAGGATAC O 

GAGTTGGAAA AGCTTCAGTG TGACCTGCAG GATCCCATTG TCTGTCTTGC TGACCATCCA 
CGTGGCCCAC CATTTTCTTC CAGCCAATCC A' 
TCCCAGGTCC CCAAAGCTAC CTCTTTTGCT G 

AATGTTCCCT CTCCAATAGG GGAGATTCAA CCCCTTTCAC CCCAGCCTTC AGCTCCCATA 
GCTTCCAGCC CTGCCATTGA CATGCCCCCA CAGTCTGAAA CGATCTCTTC C 
CAAACCCATG TCTCCGGCAC CCCACCTCCT G 
TCTGCCCCTG CGAATGTCAA CACTACCAGC GCACCTCCTG TCCAGACAGA CATCGTCAAC 1020
ACCAGCAGTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA TGGAGAAGGC TCTGTCCTTG 1080 
GGCAGCCTGG AGCCTAACCT CGCAGGAGAA ATGATCAACC AAGTCAGCAG ACTCCTTCAT 1140 
TCCCCGCCTG ACATGCTGGC CCCTCTGGCT CAAAGATTGC TGAAAGTAGT GGATGACATT 1200 
GGCCTACAGC TGAACTTTTC AAACACGACT ATAAGTCTAA CCTCCCCTTC TTTGGCTCTG 1260 
GCTGTGATCA GAGTGAATGC CAGTAGTTTC AACACAACTA CCTTTGTGGC CCAAGACCCT 1320 
GCAAATCTTC AGGTTTCTCT GGAAACCCAA GCTCCTGAGA ACAGTATTGG CACAATTACT 1380 
CTTCCTTCAT CGCTGATGAA TAATTTACCA GCTCATGACA TGGAGCTAGC TTCCAGGGTT 1440

CTGATCAGCT ACGTCATATC ATCGAGTGTT GCAAACCTGA CCGTCAGGAA CTTGACAAGA 1560 

AACGTGACAG TCACATTAAA GCACATCAAC CCGAGCCAGG ATGAGTTAAC AGTGAGATGT 1620 

GTATTTTGGG ACTTGGGCAG AAATGGTGGC AGAGGAGGCT GGTCAGACAA TGGCTGCTCT 1680

GTCAAAGACA GGAGATTGAA TGAAACCATC TGTACCTGTA GCCATCTAAC AAGCTTCGGC 1740 

GTTCTGCTGG ACCTATCTAG GACATCTGTG CTGCCTGCTC AAATGATGGC TCTGACGTTC 1800 

ATTACATATA TTGGTTGTGG GCTTTCATCA ATTTTTCTGT CAGTGACTCT TGTAACCTAC 1860

ATAGCTTTTG AAAAGATCCG GAGGGATTAC CCTTCCAAAA TCCTCATCCA GCTGTGTGCT 1920 

GCTCTGCTTC TGCTGAACCT GGTCTTCCTC CTGGACTCGT GGATTGCTCT GTATAAGATG 1980

CAAGGCCTCT GCATCTCAGT GGCTGTATTT CTTCATTATT TTCTCTTGGT CTCATTCACA 2040 

TGGATGGGCC TAGAAGCATT CCATATGTAC CTGGCCCTTG TCAAAGTATT TAATACTTAC 2100 

ATCCGAAAAT ACATCCTTAA ATTCTGCATT GTCGGTTGGG GGGTACCAGC TGTGGTTGTG 2160

ACCATCATCC TGACTATATC CCCAGATAAC TATGGGCTTG GATCCTATGG GAAATTCCCC 2220 

AATGGTTCAC CGGATGACTT CTGCTGGATC AACAACAATG CAGTATTCTA CATTACGGTG 2280

GTGGGATATT TCTGTGTGAT ATTTTTGCTG AACGTCAGCA TGTTCATTGT GGTCCTGGTT 2340 

CAGCTCTGTC GAATTAAAAA GAAGAAGCAA CTGGGAGCCC AGCGAAAAAC CAGTATTCAA 2400 

GACCTCAGGA GTATCGCTGG CCTTACATTT TTACTGGGAA TAACTTGGGG CTTTGCCTTC 2460

TTTGCCTGGG GACCAGTTAA CGTGACCTTC ATGTATCTGT TTGCCATCTT TAATACCTTA 2520 

CAAGGATTTT TCATATTCAT CTTTTACTGT GTGGCCAAAG AAAATGTCAG GAAGCAATGG 2580

AGGCGGTATC TTTGTTGTGG AAAGTTACGG CTGGCTGAAA ATTCTGGAAA TGCTTCTACA 2640 

GAGAGGAATG GGGTCTCTTT TAGTGTTCAG AATGGAGATG TGTGCCTTCA CGATTTCACT 2700

GGAAAACAGC ACATGTTTAA CGAGAAGGAA GATTCCTGCA ATGGGAAAGG CCGTATGGCT 2760 
CTCAGAAGGA CTTCAAAGCG GGGAAGCTTA CACTTTATTG AGCAAATGTG A 



I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL H 

VTLSLLPSNE TGVKPQRNIC NLSSICNDSA FFRGEIMFQY DKESTVPQNQ H 
LSLSELKRSE LNKTLQTLSE TYF IMCATAE AQSTLNCTFT I 
IRPMEHCCCS VRIPCPSSPE ELEKLQCDLQ DPIVCLADHP R 
SOVPKATSFA EPPDYSPVTH NVPSPIGEIQ PLSPQPSAPI ASSPAIDMPP QSETISSPMP 
QTIIVSGTPPP VKASFSSPTV SAPANWTTS APPVQTDIVN TSSISDLENQ VLQMEKALSL 
GSLEPNLAGE MINQVSRLLH SPPDMLAPLA QRLLKWDDI GLOLNFSNTT ISLTSPSLAL 
AVIRVNASSF NTTTFVAQDP ANLQVSLETQ APENSIGTIT LPSSLMNNLP AHDMEIiASRV 
QFNFFETPAL FQDPSLENLS LISYVISSSV ANLTVRNLTR NVTVTLKHIN PSQDELTVRC 
VFWDLGRNGG RGGWSDNGCS VKDRRLNETI CTCSHLTSFG VLLDLSRTSV LPAQMMALTF 
ITYIGCGLSS IFLSVTLVTY IAFEKIRRDY PSKILIQLCA ALLLLNLVFL LDSWIALYKM 
QCLCISVAVF LHYFLLVSFT WMGLEAFHMY LALVKVFNTY IRKYILKFCI VGWGVPAVVV 
TIILTISPDN YGLGSYGKFP NGSPDDFCW I NNNAVFYITV VGYFCVIFLL KVSMFIWLV 



GKQHMFNEKE DSCNGKGRMA LRRTSKRGSL KFIEQM 



1 11 21 31 41 51 

I I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 60

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 120 

GATACTGATA ATTCCAGTTT GTCACCACCA CCTGCTAAAT TATCTGTTGT CAGTTTTGCC 180 

CCCTCCTCCA ATGAGGTTGA AACAACAAGC CTCAATGATG TTACTTTAAG CTTACTCCCT 240

TCAAACGAAA CAGAAAAAAC TAAAATCACT ATAGTAAAAA CCTTCAATGC TTCAGGCGTC 300 

AAACCCCAGA GAAATATCTG CAATTTGTCA TCTATTTGCA ATGACTCAGC ATTTTTTAGA 360 

GGTGAGATCA TGTTTCAATA TGATAAAGAA AGCACTGTTC CCCAGAATCA ACATATAACG 420

AATGGCACCT TAACTGGAGT CCTGTCTCTA AGTGAATTAA AACGCTCAGA GCTCAACAAA 480 

ACCCTGCAAA CCCTAAGTGA GACTTACTTT ATAATGTGTG CTACAGCAGA GGCCCAAAGC 540 

ACATTAAATT GTACATTCAC AATAAAACTG AATAATACAA TGAATGCATG TGCTGCAATA 600 

GCCGCTTTGG AAAGAGTAAA GATTCGACCA ATGGAACACT GCTGCTGTTC TGTCAGGATA 660

CCCTGCCCTT CCTCCCCAGA AGAGTTGGGA AAGCTTCAGT GTGACCTGCA GGATCCCATT 720

GTCTGTCTTG CTGACCATCC ACGTGGCCCA CCATTTTCTT CCAGCCAATC CATCCCAGTG 780 

GTGCCTCGGG CCACTGTGCT TTCCCAGGTC CCCAAAGCTA CCTCTTTTGC TGAGCCTCCA 840 

GATTATTCAC CTGTGACCCA CAATGTTCCC TCTCCAATAG GGGAGATTCA ACCCCTTTCA 900 

CCCCAGCCTT CAGCTCCCAT AGCTTCCAGC CCTGCCATTG ACATGCCCCC ACAGTCTGAA 960 

ACGATCTCTT CCCCTATGCC CCAAACCCAT GTCTCCGGCA CCCCACCTCC TGTGAAAGCC 1020 

TCATTTTCCT CTCCCACCGT GTCTGCCCCT GCGAATGTCA ACACTACCAG CGCACCTCCT 1080 

GTCCAGACAG ACATCGTCAA CACCAGCAGT ATTTCTGATC TTGAGAACCA AGTGTTGCAG 1140 

G CTCTGTCCTT GGGCAGCCTG GAGCCTAACC TCGCAGGAGA AATGATCAAC 1200 

ft TTCCCCGCCT GACATGCTGG CCCCTCTGGC TCAAAGATTG 1260 

I TGGCCTACAG CTGAACTTTT CAAACACGAC TATAAGTCTA 1320 

ACCTCCCCTT CTTTGGCTCT GGCTGTGATC AGAGTGAATG CCAGTAGTTT CAACACAACT 1380 

ACCTTTGTGG CCCAAGACCC TGCAAATCTT CAGGTTTCTC TGGAAACCCA AGCTCCTGAG 1440 

AACAGTATTG GCACAATTAC TCTTCCTTCA TCGCTGATGA ATAATTTACC AGCTCATGAC 1500

ATGGAGCTAG CTTCCAGGGT TCAGTTCAAT TTTTTTGAAA CACCTGCTTT GTTTCAGGAT 1560 

CCTTCCCTGG AGAACCTCTC TCTGATCAGC TACGTCATAT CATCGAGTGT TGCAAACCTG 1620 

ACCGTCAGGA ACTTGACAAG AAACGTGACA GTCACATTAA AGCACATCAA CCCGAGCCAG 1680 

GATGAGTTAA CAGTGAGATG TGTATTTTGG GACTTGGGCA GAAATGGTGG CAGAGGAGGC 1740 

TGGTCAGACA ATGGCTGCTC TGTCAAAGAC AGGAGATTGA ATGAAACCAT CTGTACCTGT 1800 

AGCCATCTAA CAAGCTTCGG CGTTCTGCTG GACCTATCTA GGACATCTGT GCTGCCTGCT 1860



CAAATGATGG CTCTGACGTT CATTACATAT ATTGGTTGTG GGCTTTCATC AATTTTTCTG 
TCAGTGACTC TTGTAACCTA CATAGCTTTT GAAAAGATCC GGAGGGATTA CCCTTCCAAA
C TGCTCTGCTT CTGCTGAACC TGGTCTTCCT CCTGGACTCG 
T GCAAGGCCTC TGCATCTCAG TGGCTGTATT T 
TTTCTCTTGG TCTCATTCAC ATGGATGGGC CTAGAAGCAT TCCATATGTA 0 

GTCAAAGTAT TTAATACTTA CATCCGAAAA TACATCCTTA AATTCTGCAT TGTCGGTTGG 2220 

GGGGTACCAG CTGTGGTTGT GACCATCATC CTGACTATAT CCCCAGATAA CTATGGGCTT 2280 

GGATCCTATG GGAAATTCCC CAATGGTTCA CCGGATGACT TCTGCTGGAT CAACAACAAT 2340 

GCAGTATTCT ACATTACGGT GGTGGGATAT TTCTGTGTGA TATTTTTGCT GAACGTCAGC 2400 

ATGTTCATTG TGGTCCTGGT TCAGCTCTGT CGAATTAAAA AGAAGAAGCA ACTGGGAGCC 2460 

CAGCGAAAAA CCAGTATTCA AGACCTCAGG AGTATCGCTG GCCTTACATT TTTACTGGGA 2520 

ATAACTTGGG GCTTTGCCTT CTTTGCCTGG GGACCAGTTA ACGTGACCTT CATGTATCTG 2580 

TTTGCCATCT TTAATACCTT ACAAGGATTT TTCATATCCA TCTTTTACTG TGTGGCCAAA 2640

GAAAATGTCA GGAAGCAATG GAGGCGGTAT CTTTGTTGTG GAAAGTTACG GCTGGCTGAA 2700 

AATTCTGACT GGAGTAAAAC TGCTACTAAT GGTTTAAAGA AGCAGACTGT AAACCAAGGA 2760 

GTGTCCAGCT CTTCAAATTC CTTACAGTCA AGCAGTAACT CCACTAACTC CACCACACTG 2820 

CTAGTGAATA ATGATTGCTC AGTACACGCA AGCGGGAATG GAAATGCTTC TACAGAGAGG 2880

AATGGGGTCT CTTTTAGTGT TCAGAATGGA GATGTGTGCC TTCACGATTT CACTGGAAAA 2940 

CAGCACATGT TTAACGAGAA GGAAGATTCC TGCAATGGGA AAGGCCGTAT GGCTCTCAGA 3000 
AGGACTTCAA A 



Protein Accession #: Eos sequence
1 11 21 31 41 51 

I I I I I I 

MVPSVEQCGH VGRTEEVLLT FKIPLVIICL HWLVTSLEE DTDNSSLSPP PAKIjSWSFA 
PSSNEVETTS LNDVTLSLLP SNETEKTKIT IVKTFNASGV KPQRNICNLS SICNDSAFFR 
GEIMFQYDKE STVPQNQHIT NGTLTGVLSL SELKRSELNK TLQTLSETYF IMCATAEAQS 
TLNCTFTIKL NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEELG KLQCDLQDPI 
VCLADHPRGP PFSSSQSIPV VPRATVLSQV PKATSPAEPP DYSPVTHNVP SPIGEIQPLS 
PQPSAPIASS PAIDMPPQSE TISSPMPQTH VSGTPPPVKA SFSSPTVSAP AHVNTTSAPP 
VQTDIVNTSS ISDLENQVLQ MEKALSLGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL 
LKWDDIGLQ LNPSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 
NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVAHL 
TVRNLTRNVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WEDNGCEVKD RRLNETICTC 
SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIFL SVTLVTYIAF EKIRRDYPSK 
S WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 
,1 GVPAWVTII LTISPDNYGL GEYGKFPNGS PDDFCNINNN 
S MFIWLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 
W GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKOWRRY LCCGKLRLAE 
N GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGNASTER
NGVSFSVQNG DVCLHDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF ISQM 

Seq ID NO: 493 DNA sequence 



so i r i i i i 



60 



C CCAGACTGCA G 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG G 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC O 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300

GGGAACGCGG CCAGTGCAAG GCATCAOGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200

A GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

3CCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

A GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

G GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA A 1 



TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CICTGGCAT1 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT ITCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTICA TAGCCAAACT 2340
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



Protein Accessi 



MPLPWSLALP LLLSWVAGGF GNAASAHHHG LLASAROPGV CHYGTKLACC YGWRRNSKGV 
CEATCEPGCK FGECTGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQKRCV NTHGSYKCFC 
LSGKMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDSCAS 
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 
SLRGDVFFPK VNEAGEFGLI LVORKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 
SGLCPDSLLS VBD 



Seq ID NO : 49B DNA sequen 



1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGT3AA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040

ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

A TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

. ___.'GCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 252 0 

A TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

IATAATA TTTTTTIAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 

TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGC1CACTG ATCCTTCTGC 3120 
ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA T " ~ 



GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 
I I I I I I

MEMFTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 
FLPLANLECS PNIETFLCKA FVPTCIEQIH VVPPCRKLCE KVYSDCKKLI DTFGIRWPEE
LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ
CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTVVLGSQNK ACTVLFMLLY FFTMAGTVWW
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 



I I I I I I 

GGATTTCCGG GCTCCATGGC AAGATCCCTT CTCCTGCCCC TGCAGATCCT ACTGCTATCC
TGCAGG AGAAGAAGCC CAGGGTGACA AGATTATTGA TGGCGCCCCA 
CCACCC ATGGCAGGTG GCCCTGCTCA GTGGCAATCA GCTCCACTGC 
C TGGTCAATGA GCGCTGGGTG CTCACTGCCG CCCACTGCAA GATGAATGAG 
TACACCGTGC ACCTGGGCAG TGATACGCTG GGCGACAGGA GAGCTCAGAG GATCAAGGCC 
TCGAAGTCAT TCCGCCACCC CGGCTACTCC ACACAGACCC ATGTTAATGA CCTCATGCTC 
GTGAAGCTCA ATAGCCAGGC CAGGCTGTCA TCCATGGTGA AGAAAGTCAG GCTGCCCTCC 
CGCTGCGAAC CCCCTGGAAC CACCTGTACT GTCTCCGGCT GGGGCACTAC CACGAGCCCA 
GATGTGACCT TTCCCTCTGA CCTCATGTGC GTGGATGTCA AGCTCATCTC CCCCCAGGAC 
TGCACGAAGG TTTACAAGGA CTTACTGGAA AATTCCATGC TGTGCGC 
TCCAAGAAAA ACGCCTGCAA TGGTGACTCA GGGGGACCGT TGGTGTGCAG A 
CAAGGTCTGG TGTCCTGGGG AACTTTCCCT TGCGGCCAAC CCAATG/ 
ACTCAAGTGT GCAAGTT CAC CAAGTGGATA AATGACACCA TGAAAAf 
CACACTGAGT TAATTAACTG TGTGCTTCCA ACAGAAAATG CACAGGAGTG AGGACGCCGA 
A AGTCAAATTT GACTTTACCT TTCCTCAAAG ATATATTTAA ACCTCATGCC 
h ACCAATCAAA TTGGTAAAGA CCTAAAACCA AAACAAATAR AGAAACACAA 



MARSLLLPLQ ILLLSLALET AGEEAQGDKI IDGAPCARGS HPWQVALLSG NQLHCGGVLV 
NERWVLTAAH CKMNEYTVHL GSDTLGDRRA QRIKASKSFR HPGYSTQTHV NDLMLVKLNS
QARLSSMVKK VRLPSRCEPP GTTCTVSGWG TTTSPDVTFP SDLMCVDVKL ISPQDCTKVY
KDLLENSMLC AGIPDSKKNA CNGDSGGPLV CRGTLQGLVS WGTFPCGQPN DPGVYTQVCK 



Seq ID NO: 49 
Nucleic Acid A 
Coding sequenc 



GTTCCCAGAA GCTCCCCAGG CTCTAGTGCA GGAGGAGAAG GAGGAGGAGC AGGAGGTGGA 
GATTCCCAGT TAAAAGGCTC CAGAATCGTG TACCAGGCAG AGAACTGAAG TACTGGGGCC 
TCCTCCACTG GGTCCGAATC AGTAGGTGAC CCCGCCCCTG GATTCTGGAA GACCTCACCA 



CCTGGGCAGG ACACTCCAGG GCACAGGAGG ACAAGGTGCT GGGGGGTCAT GAGTGCCAAC 
CCCATTCGCA GCCTTGGCAG GCGGCCTTGT TCCAGGGCCA GCAACTACTC TGTGGCGGTG 
TCCTTGTAGG TGGCAACTGG GTCCTTACAG CTGCCCACTG TAAAAAACCG AAATACACAG 
TACGCCTGGG AGACCACAGC CTACAGAATA AAGATGGCCC AGAGCAAGAA ATACCTGTGG 
TTCAGTCCAT CCCACACCCC TGCTACAACA GCAGCGATGT GGAGGACCAC AACCATGATC 
TGATGCTTCT TCAACTGCGT GACCAGGCAT CCCTGGGGTC CAAAGTGAAG CCCATCAGCC 
TGGCAGATCA TTGCACCCAG CCTGGCCAGA AGTGCACCGT CTCAGGCTGG GGCACTGTCA
CCAGTCCCCG AGAGAATTTT CCTGACACTC TCAACTGTGC AGAAGTAAAA ATCTTTCCCC
AGAAGAAGTG TGAGGATGCT TACCCGGGGC AGATCACAGA TGGCATGGTC TGTGCAGGCA
GCAGCAAAGG GGCTGACACG TGCCAGGGCG ATTCTGGAGG CCCCCTGGTG TGTGATGGTG
CACTCCAGGG CATCACATCC TGGGGCTCAG ACCCCTGTGG GAGGTCCGAC AAACCTGGCG 
TCTATACCAA CATCTGCCGC TACCTGGACT GGATCAAGAA GATCATAGGC AGCAAGGGCT 
GATTCTAGGA TAAGCACTAG ATCTCCCTTA ATAAACTCAC AACTCTC 
K TWMFLLLLGG AWAGHSRAQE DKVLGGHECQ PHSQPWQAAL FQGQQLLCGG
VLVGGNWVLT AAHCKKPKYT VRLGDHSLQN KDGPEQEIPV VQSIPHPCYN SSDVEDHNHD
LMLLQLRDQA SLGSKVKPIS LADHCTQPGQ KCTVSGWGTV TSPRENFPDT LNCAEVKIFP 
QKKCEDAYPG QITDGMVCAG SSKGADTCQG DSGGPLVCDG ALQGITSWGS DPCGRSDKPG 



CACCTGCACC CCGCCCGGGC ATAGCACCAT GCCTGCTTGT CGCCTAGGCC C 
CGCCCTCCTC CTCAGCCTGC TGCTGTTCGG CTTCACCCTA GTCTCAGGCA CAGGAGCAGA 
GAAGACTGGC GTGTGCCCCG AGCTCCAGGC TGACCAGAAC TGCACGCAAG AGTGCGTCTC 
GGACAGCGAA TGCGCCGACA ACCTCAAGTG CTGCAGCGCG GGCTGTGCCA CCTTCTGCCT 
TCTCTGCCCA AATGATAAGG AGGGTTCCTG CCCCCAGGTG AACATTAACT TTCCCCAGCT 
CGGCCTCTGT CGGGACCAGT GCCAGGTGGA CAGCCAGTGT CCTGGCCAGA TGAAATGCTG 
CCGCAATGGC TGTGGGAAGG TGTCCTGTGT CACTCCCAAT TTCTGAGGTC CAGCCACCAC
CAGGCTGAGC AGTGAGGAGA GAAAGTTTCT GCCTGGCCCT GCATCTGGT1 CCAGCCCACC 
TGCCCTCCCC TTTTTCGGGA CTCTGTATTC CCTCTTGGGC TGACCACAGC TTCTCCCTTT 
CCCAACCAAT AAAGTAACCA CTTTCAGCAA AAAAAAAAAA AAAA 



Seq ID NO-. 

Coding sequence: 65..352

1 11 21 31 41 51 

CCTCCACAGC AACTTCCTTG ATCCCTUt CA CGCA -' TG AACACAGACA GCAGCCGCCT 
CGCCATGAAG CTGCTGATGG TCCTCATCCT CGCGGCCCTC C 
TTCTGGCTGC AAACTCCTGG AGGACATGGT TGAAAAGACC A 

\GAGTT CATAGACAGT GATGCCGCTG CAGAGGCTAT 
C AAGCAGTGTT TCCTCAACCA GTCACATAGA A 
GATGATGCAT ACAGTGTACG ACAGCATTTG GTGTAATATG A 
AAGGCGTTTG GCTCAGAGGG CTACAGACTA TGGCCAGAAC TCATCTGTTG A 
ACCACTTTTC TTTCTTGTGT TGTCTTTTTA TGTGGAAACT GCTAGACAAC T 
TCAAATTCAT TTCCATTTCA ATAACTAACT GCAAATC 

Seq ID NO: 504 Protein sequence 
Protein Accession #: NP_002398

1 11 21 31 41 51 

I I I I I I 

MKLLMVLMLA ALLLHCYADS GCKLLEDMVE KTINSDISIP EYKELLQEFI DSDAAAEAMG
KFKQCFLNQS HRTLKNFGLM MHTVYDSIWC NMKSN 

Seq ID NO: 505 DNA sequence 
Nucleic Acid Accession #: NM_014791.1
Coding sequence: 171..212S 

1 11 21 31 41 51 

I I I I I I 

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTCTTAGGAA CGCCGTACCA
GCCGCGTCTC TCAGGACAGC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 
TCCGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGAGGACT ATGAAAGATT 
ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 
AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA
AAAACACACT AGGGAGTGAT TTGCCCCGGA TCAAAACGGA GATTGAGGCC TTGAAGAACC 
TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAGCCAAC AAAATATTCA 
TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAATTTCC CAGGATCGCC 
TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 
ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT GATGAATATC 
ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC AAGGATTACC 
ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT
3TTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 
T GATGATAATG TAATGGCTTT ATACAAGAAG A 
G TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT C 
A CGGATTTCTA TGAAAAATCT ATTGAACCAT C 
TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTATT C 
ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080 
TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT C 
AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT C 
CTACCCCATT CACAGACATC AAGTCAAATA ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260
ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG T

GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGJ 

CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACC- C 

C CTCAAAAGCT AGAAACCAGT GCCTGAAAGA A 



GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA S 
CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG C 
GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC T 

CTACAACTAG ATTAGTGAAT CCAGATCAAC TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920 

AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980 

TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCCGATGTGG 2040

TGGGTATCAG GAGGCAGCGG CTTAAGGGCG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100 

ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 

GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220 

CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAAAA 2280 

GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 

TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG 2400 

AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 



I I I I 

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK I
LKNLRHQHIC QLYHVLETAN KIFMVLEYCP GGELFDYIIS QDRLSEEETR VVFRQIVSAV
AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SLAYAAPELI
QGKSYLGSEA DVWSMGILLY VLMCGFLPFD DDNVMALYKK IMRGKYDVPK WLSPSSILLL
QQMLQVDPKK RISMKNLLNH PWIMQDYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT
MEDLISLWQY DHLTATYLLL LAKKARGKPV RLRLSSFSCG QASATPFTDI KSNNWSLEDV
TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTES NGVESKSLTP ALCRTPANKL
KNKENVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI
KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV
LTRSKRKCSA RDGPRRLKLH YNVTTTRLVN PDQLLNEIMS ILPKKHVDFV Q
QSDFGKVTMQ FELEVCQLQK PDVVGIRRQR LKGDAWVYKR LVEDILSSCK V



GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 

GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 

TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 

GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 



MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 

PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120

DFPTDLPATE VFTPVVPTVD TYDGRGDSVV YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180

SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LVVDPKSKEE DKHLKFRISH ELDSASSEVN
Seq ID NO: 509 DNA sequence



I I I I I I 

AGCGGCCGCG GCACAAAGTT GGGGGCCGCG AAGATGAGGC TGTCCCCGGC GCCCCTGAAG
CTGAGCCGGA CTCCGGCACT GCTGGCCCTG GCGCTGCCCC TGGCCGCGGC GCTGGCCTTC 
TCCGACGAGA CCCTGGACAA AGTGCCCAAG TCAGAGGGCT ACTGCAGCCG TATCCTGCGC
GCCCAGGGCA CGCGGCGCGA GGGCTACACC GAGTTCAGCC TCCGCGTGGA GGGCGACCCC
GACTTCTACA AGCCGGGAAC CAGCTACCGC GTAACACTTT CAGCTGCTCC TCCCTCCTAC 
TTCAGAGGAT TCACATTAAT TGCCCTCAGA GAGAACAGAG AGGGTGATAA GGAAGAAGAC 
CATGCTGGGA CCTTCCAGAT CATAGACGAA GAAGAAACTC AGTTTATGAG CAATTGCCCT 

CCACCAGCGG GAACAGGCTG CGTGATTCTG AAGGCCAGCA TCGTACAAAA ACGCATTATT 

GGGGTGACTG ACAAACCCAT CTTAGACTGC TGTGCCTGCG GAACTGCCAA GTACAGACTC 
ACATTTTATG GGAATTGGTC CGAGAAGACA CACCCAAAGG ATTACCCTCG TCGGGCCAAC 
CACTGGTCTG CGATCATCGG AGGATCCCAC TCCAAGAATT ATGTACTGTG G 



C CTCACCGTCA T 

CCAGCCTGGC AGCCTCTCAA CGTGAGAGCA GCACCTTCAG CTGAATTTTC CGTGGACAGA 
ACGCGCCATT TAATGTCCTT CCTGACCATG ATGGGCCCTA G 
TTATCTGCAG AAGATCTGTG CACCAAGGAA TGTGGCTGGG T 
CTGATTCCCT GGGACGCTGG CACCGACAGC GGGGTG/ 
ACCATTCCCC AGGAGAAAAT CCGGCCCCTG ACCAGCCTGG ACCATCCTCA GAGTCCTTTC 



CTGAAAGCAC AGCTGGACCT CAGCGTCCCC TGCCCTGACA CCCAGGACTT C 
ATGGGCCCTG GCTGCAGTGA CGAAGACGGC TCCACCTGCA CCATGTCCGA G 

CAGTTCCCGG AGGACGGCTC CGTGTGCACG CTGCCCACTG AGGAAACGGA GAAGTGCACG 1680

GTCAACGAGG AGTGCTCTCC CAGCAGCTGC CTGATGACCG AGTGGGGCGA GTGGGACGAG 1740 

TGCAGCGCCA CCTGCGGCAT GGGCATGAAG AAGCGGCACC GCATGATCAA GATGAACCCC 1800 

GCAGATGGCT CCATGTGCAA AGCCGAGACA TCACAGGCAG AGAAGTGCAT GATGCCAGAG 1860 

TGCCACACCA TCCCATGCTT GCTGTCCCCA TGGTCCGAGT GGAGTGACTG CACCGTGACC 1920 

TGCGGGAAGG GCATGCGAAC CCGACAGCGG ATGCTCAAGT CTCTGGCAGA ACTTGGAGAC 1980 

TGCAATGAGG ATCTGGAGCA GGTGGAGAAG TGCATGCTCC CTGAATGCCC CATTGACTGT 2040 

GAGCTCACCG ACTCGTCCCA GTGGTCGGAA TGTAACAAGT CATGTGGGAA AGGCCACGTG 2100 

ATTCGAACCC GGATGATCCA AATGGAGCCT CAGTTTGGAG GTGCACCCTG CCCAGAGACT 2160 

GTGCAGCGAA AAAAGTGCCG CATCCGAAAA TGCCTTCGAA ATCCATCCAT CCAAAAGCTA 2220 

CGCTGCAGGC AGGCCCGAGA GACCCGCCGG AGTGAGCAGC TGAAGGAAGA GTCTGAAGGG 2280 

GAGCAGTTCC CAGGTTGTAG GATGCGCCCA TGGACGGCCT GGTCAGAATG CACCAAACTG 2340 

TGCGGAGGTG GAATTCAGGA ACGTTACATG ACTGTAAAGA AGAGATTCAA AAGCTCCCAG 2400 

TTTACCAGCT GCAAAGACAA GAAGGAGATC AGAGCATGCA ATGTTCATCC TTGTTAGCAA 2460

GGGTACGAGT TCCCCAGGGC TGCACTCTAG ATTCCAGAGT CACCAATGGC TGGATTATTT 2520

GCTTGTTTAA GACAATTTAA ATTGTGTACG CTAGTTTTCA TTTTTGCAGT GTGGTTCGCC 2580 

CAGTAGTCTT GTGGATGCCA GAGACATCCT TTCTGAATAC TTCTTGATGG GTACAGGCTG 2640

AGTGGGGCGC CCTCACCTCC AGCCAGCCTC TTCCTGCAGA GGAGTAGTGT CAGCCACCTT 2700 

GTACTAAGCT GAAACATGTC CCTCTGGAGC TTCCACCTGG CCAGGGAGGA CGGAGACTTT 2760

GACCTACTCC ACATGGAGAG GCAACCATGT CTGGAAGTGA CTATGCCTGA GTCCCAGGGT 2820

GCGGCAGGTA GGAAACATTC ACAGATGAAG ACAGCAGATT CCCCACATTC TCATCTTTGG 2880 

CCTGTTCAAT GAAACCATTG TTTGCCCATC TCTTCTTAGT GGAACTTTAG GTCTCTTTTC 2940 

AAGTCTCCTC AGTCATCAAT AGTTCCTGGG GAAAAACAGA GCTGGTAGAC TTGAAGAGGA 3000

GCATTGATGT TGGGTGGCTT TTGTTCTTTC ACTGAGAAAT TCGGAATACA TTTGTCTCAC 3060 

CCCTGATATT GGTTCCTGAT GCCCCCCCAA CAAAAATAAA TAAATAAATT ATGGCTGCTT 3120

A TAAGGTAGCT AGTTTTTACA CCTGAGATAA ATAATAAGCT TAGAGTGTAT 3180 

3 CTTTTGGGGG TTCAGAGGAG TATGTACAAT TCTTCTGGGA AGCCAGCCTT 3240 

CTGAACTTTT TGGTACTAAA TCCTTATTGG AACCAAGACA AAGGAAGCAA AATTGGTCTC 3300

TTTAGAGACC AATTTGCCTA AATTTTAAAA TCTTCCTACA CACATCTAGA CGTTCAAGTT 3360 

TGCAAATCAG TTTTTAGCAA GAAAACATTT TTGCTATACA AACATTTTGC TAAGTCTGCC 3420

CAAAGCCCCC CCAATGCATT CCTTCAACAA AATACAATCT CTGTACTTTA AAGTTATTTT 3480 
AGTCATGAAA TTTTATATGC AGAGAGAAAA AGTTACCGAG ACAGAAAACA A 




A GTAAGTGTTC AGAAGGTTCT 
TTTTTATATT GTCCTCCACC TCCATCATTT TCAATAAAAG ATAGGGCTTT TGCTCCCTTG 
G TATCCAACAT GTTTTAAATC 

T TGGGGGGAAA AAAGTCCTCA TGTAGAAGCA CCCACTTTTG CAATGTTGTT 4260

CTAAGCTATC TATCTAACTC TCAGCCCATG ATAAAGTTCC TTAAGCTGGT GATTCCTAAT 4320 

T GTCTCATGTT TGTATTTGGT CCCAGTTGGG TACATTTTAA 4380 

I TTGGAGACTT AAAACCAGGT TAATGGCTAA GAATGGGTAA CATGACTCTT 4440 

T TATTTTTTGT TTGCAATGGG GAATTTATAA GAAC-CATCAA GTCTCTTTCT 4500 

TACCAAAGTC TTGTTAGGTG GTTTATAGTT CTTTTGGCTA ACAAATCATT TTGGAAATAA 4560 
AGATTTTTTA CTACAAAAAT G 



I I I I I 

MRLSPAPLKL SRTPALLALA LPLAAALAFS DETLDKVPKS E~ 
FSLRVEGDPD FYKPGTSYRV TLSAAPPSYF RGFTLIALRE NREGDKEEDH AGTFQIIDEE
ETQFMSNCPV AVTESTPRRR TRIQVFWIAP PAGTGCVILK ASIVQKRIIY FQDEGSLTKK 
LCEQDSTFDG VTDKPILDCC ACGTAKYRLT FYGNWSEKTH PKDYPRRANH WSAIIGGSHS 
KNYVLWEYGG YASEGVKQVA ELGSPVKMEE EIRQQSDEVL TVIKAKAQWP AWQPLNVRAA
PSAEFSVDRT RHLMSFLTMM GPSPDWNVGL SAEDLCTKEC GNVQKWQDL IPWDAGTDSG 
VTYESPNKPT IPQEKIRPLT SLDHPQSPFY DPEGGSITQV ARVVIERIAR KGEQCNIVPD 
NVDDIVADLA PEEKDEDDTP ETCIYSNWSP WSACSSSTCD KGKRMRQRNL KAQLDLSVPC
PDTQDFQPCM GPGCSDEDGS TCTMSEWITK SPCSISCGMG MRSRERYVKQ FPEDGSVCTL 
A DGSMCKAETS 



Nucleic Acid Acc 
Coding sequence: 



ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 
CGGGAGGCGC TGGACACGGA G 



TCGGGCCACA TCAAGCGGCC GATGAACGCG 



A TCTCCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA C 

ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 420 

TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 480

CAGAGCCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 540 

GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGCCA AGCTCAAGGC CCCCGCGGCC 600 

GCGGGCGCCA AGGCGGGCGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGCGGGC 660

GACGACTACG TGCTGGGCAG CCTGCGCG1 720 

ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 780 

CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 840 

CTGCAGCCGC CGGGGCAGCA GCCGTCGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 900 

CCCGCCAGCC CTACGCTCAG CAGCTCGGCG GACTCCCCCG AGGGAGCGAG CCTCTACGAC 960 

GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 1020 

AAGAACATCA CCAAGCAGCA CCCGCCGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 1080

TCGCCCTCGG TCTCCACCTC CTCCTCCACC ACCAGCGGCA GCAGCAGCGG CAGCAGCGGC 1140 

GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGCGCACAGC 1200 

GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 1260 

GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 1320

TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 1380 



AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 1500 

ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 1560 

ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 1620

TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 1680 

TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 1740 

AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 1800 

GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 1860 

GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 1920 

TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 1980 



MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPMNAFMVWS 60

KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 120 

RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA 180

GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 240

QEPDEEDEEP PHQQLLQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 300 

ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGEDADD 360

LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEFPDYCTPE 420 
LSEMIAGDWL EANFSDLVFT Y 



GGTCGACCTA AATCTGATAA CTGGCTTATT ATGTAATTTA TTGGTGTTAT T 
ATTGGTAATC TACAGTAAGA TTTTCAGTTA GGATTTGAGA TTATGA1 
ATATTTCTAA ATTGGAATTA GAAGATTGTT GTATGACAGA GAGTCAGGAC TTGCCATTTG 
A AAGTCATTGT TTGGTGTGTA ATAGTACAAA ATCATCTTGC T
AAGGATATCT GTTGCTCCCG AATGAAACAA TTTTTCTGAA ATAGAGGGCC CAGAATTGGT 
CTCTGACAAT TAATAAAGAC ATCAAAGATA GCAAAATGAT TTTTATATCT TAGGGCCAAT 
ACTACCAATT TAATAATTAA AACAAGTTCT GGTGAGCTCT GAACTTGGCA GAATTGGTGG 
CAACATAGAC TTTGGATTTT CCAAATTCCC CACATAAAAC AAAGGGGATC AACTAGATAG
AAAAACCAGA AACCTTTGGA AATATCTGTT TAAAAAAAAA AAAAAGTCGA CC-CGGGCC 



Nucleic Acid A 



I I I I I I 

GGAGCCACAG TGAAAGTCAA GAATGTCAGT GATTCCACAT TTAATATCTA CATTTTTGCA 
GGGCAGTTAC TCTTTTGTAG TATAACATTG AGCTGATAGC ACATAGTC-TA GACAAGTGAA 
TACAGGATTC TCTGGGTTGT ATTCCCAGAA GTCTGGAGGT CATTTGGATA TTTGTGGGCC
CTTGGCTTCA CTCTGACTTG TGTGACACAT AAAAATTGTG ATGAAATGTC CTATAGATGT 
CCTGCAGGTC TTAAAAGAAC CTTTCCAAAC TATGAAACAG CCCAGCAGCA CTGAGTTAGA 
GGTAAATTCT GAACCCTTGA ACACTAAAAC TATTCTAACT GCACATAGAA TTGGCAAGTA 
GCATTCTATG TCTATGAACA GTATGTCTTT TCTATATAAC AGAGAAAATC TTTTTAAGCA 
AACTACTCAG TTTAAAACCT AATTCTTCTC ATAATCTCAG TACTTTTGAA TGAAGACATA
TCAATGCAAC AGTACACTCT TATTCAGGCA TTTGAAAGAA AGAATTCGAG ATCTAGTTTG 
TATCAGATAT TATAAATTAG TATGGTTTAG TCTTTGTCAT GAAATTCTAC TTAATTTTTG
GACTATAGGT TTAAGAATGT AAGCAGAAGT TCTGCACCAA TCAGAATAAG CTACATTATG 
CTTGAGTGAC AACTACTGTA ATGACAAAAT ATCAGTGGCT T 
CATACTTGTT CATAAAGAGT CAGCAAGGAC CCTGCTCATT A 



T GGACACCGGC CACTCCCACT CACATCCCTT 
GGGCCAGAAC TAGGTCACTG GGCCCGGACC TAACTTCGGA G 
CTCCATGTAC CCAAAGTGGA GAGAAGCCAG ATACTGAGAA ACATCAATAA T 
AAATCCATTC TACCATTCCC TTTGCCTAAA GTGAAAAGAT GAGTACTTTC ATCAATTTGT 
AAACTGTACT TTTGAAGTAA ATCCTGGTAG CTTGCATGGG GGCTGGATTT CCAGAAAGCC 
ATATGTAATT TGGGAATGAC ATTCACTTAA GCTCATAGAA TATCATTATT TGATGTAAAA
TGCCCTCATT TGCAATACAG GACCAAAATG CACTAACCAC AAAACCCCCC T 
GCCCGGGGTC CCTATTCCCC TCCATCCCTT TAAATGAGGC ATTCTATGAT T 
AGCCCAGTTG TAGTCGTAAG AATTTTACTT AATTCAAGAA TTATTCTCAC T 
CCAGTTCTGA AAGGAATGCA AAGTCAAATT TTGCATCTTC TTTGCTCAAG GGCCTTTAGA 
TGTAACAACA CAGACATGAT ACAAGGCTGA CAATGAC 

A ATTGTGAATC AACAAAAAAT TATGTTCTTT A 
C TGCCTACATA CCCCTCTTGT TCCTCAGTTC TTATCCCTGA T 
P. GACAGCTGTA GATGTTTTTA TTTAGCAAAA AAAAAAAAAA A 



Coding sequence: 43..924

1 11 21 31 41 51

CTTGTGGTTC CTCTCTACTT GGGGAAATCA GGTGCAGCGG CCATGGCTAC AGCAAGACCC 60 

CCCTGGATGT GGGTGCTCTG TGCTCTGATC ACAGCCTTGC TTCTGGGGGT CACAGAGCAT 120

GTTCTCGCCA ACAATGATGT TTCCTGTGAC CACCCCTCTA ACACCGTGCC CTCTGGGAGC 180 

AACCAGGACC TGGGAGCTGG GGCCGGGGAA GACGCCCGGT CGGATGACAG CAGCAGCCGC 240 

ATCATCAATG GATCCGACTG CGATATGCAC ACCCAGCCGT GGCAGGCCGC GCTGTTGCTA 300

AGGCCCAACC AGCTCTACTG CGGGGCGGTG TTGGTGCATC CACAGTGGCT GCTCACGGCC 360

GCCCACTGCA GGAAGAAAGT TTTCAGAGTC CGTCTCGGCC ACTACTCCCT GTCACCAGTT 420 

TATGAATCTG GGCAGCAGAT GTTCCAGGGG GTCAAATCCA TCCCCCACCC TGGCTACTCC 480 

CACCCTGGCC ACTCTAACGA CCTCATGCTC ATCAAACTGA ACAGAAGAAT TCGTCCCACT 540 

AAAGATGTCA GACCCATCAA CGTCTCCTCT CATTGTCCCT CTGCTGGGAC AAAGTGCTTG 600

GTGTCTGGCT GGGGGACAAC CAAGAGCCCC CAAGTGCACT TCCCTAAGGT CCTCCAGTGC 660 

TTGAATATCA GCGTGCTAAG TCAGAAAAGG TGCGAGGATG CTTACCCGAG ACAGATAGAT 720 

GACACCATGT TCTGCGCCGG TGACAAAGCA GGTAGAGACT CCTGCCAGGG TGATTCTGGG 780 

GGGCCTGTGG TCTGCAATGG CTCCCTGCAG GGACTCGTGT CCTGGGGAGA TTACCCTTGT 840 

GCCCGGCCCA ACAGACCGGG TGTCTACACG AACCTCTGCA AGTTCACCAA GTGGATCCAG 900

GAAACCATCC AGGCCAACTC CTGAGTCATC CCAGGACTCA GCACACCGGC ATCCCCACCT 960 

GCTGCAGGGA CAGCCCTGAC ACTCCTTTCA GACCCTCATT CCTTCCCAGA GATGTTGAGA 1020 

ATGTTCATCT CTCCAGCCCC TGACCCCATG TCTCCTGGAC TCAGGGTCTG CTTCCCCCAC 1080 

ATTGGGCTGA CCGTGTCTCT CTAGTTGAAC CCTGGGAACA ATTTCCAAAA CTGTCCAGGG 1140 

CGGGGGTTGC GTCTCAATCT CCCTGGGGCA CTTTCATCCT CAAGCTCAGG GCCCATCCCT 1200
C TGAGAAGTGG AAAAAAAAAA 







ITPSGSNQDL GAGAGEDARS 
DDSSSRIING SDCDMHTQPW QAALLLRPNQ LYCGAVLVHP QWLLTAAHCR KKVFRVRLGH
YSLSPVYESG QQMFQGVKSI PHPGYSHPGH SNDLMLIKLN RRIRPTKDVR PINVSSHCPS
AGTKCLVSGW GTTKSPQVHF PKVLQCLNIS VLSQKRCEDA YPRQIDDTMF CAGDKAGRDS
CQGDSGGPVV CNGSLQGLVS WGDYPCARPN RPGVYTNLCK FTKWIQETIQ AHS
GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660 

ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780 

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT C-CGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140
GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG C 

C CACCAACCAC G 

T GCGCCCACGC 

AGCTCAATGC CATCTCCGTC CTCTACTTCG A 
ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA T

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 




Seq ID NO: 518 Protein sequence



1 11 21 31 41 51

I I I I I I

A PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60 

n AVEEGGGPCG QGFSYPYKAV FSTQGPPLAS 120 

LQDSHFLTDA DMVMSFVNLV EHDKBFFHPR YHHREFRFDL SKIPEGEAVT AAEFRI YKDY 180 

IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGKLVFDITA TSNHWWNPR 240 

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 300 

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360 

FD DSSNVILKKY 420 

Seq ID NO: 519 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 264.. 782 

1 11 21 31 41 51 

CCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCCACGCA CAGCTGAAGC ATGAGGAAAC 60

TCATCGCGGG ACTAATTTTC CTTAAAATTT AGACTTGCAC AGTAAGGACT TCAACTGACC 120 

TTCCTCAGAC TGAGAACTGT TTCCAGTATA TACATCAAGT CACTGAGATC TCCAGCACCC 180

TGCCGGTGGC ACTACTGAGA GACGAGGTGC CAGGGTGGTT CCTGAAAGTG CCTGAGCCCC 240 

AACTTATCAG CAAGGAGCTC ATCATGCTGA CAGAAGTCAT GGAGGTCTGG CATGGCTTAG 300 

TGATCGCGGT GGTGTCCCTC TTCCTGCAGG CCTGCTTCCT CACCGCCATC AACTACCTGC 360 

TCAGCAGGCA CATGGCCCAC AAGAGTGAAC AGATACTGAA AGCGGCCAGT CTCCAGGTTC 420

CCAGGCCCAG CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG GAGACTCAGA 480 



A GTGGCCATGT 
A TTCTTACATT TAATTTGTAG 
T GGAGACAGGC 

T GTGGAGAAGA AAACTGATAA A' 
CCTGGTCTGT ACCCAAAAAA GCTGTTCGTT CCTCAAAAAC AAAAACAAGG C 
AAAACAGGCC AATGCCCCGG CAAGAAAGGT TGAGATCAGA TGTTAGGAAG AACTTTCAGG 

A GTCCATCAGC AGAGATAGTA GTGAAGTCTC TCCCCAGGGA 1140 
A AAGGTTGAAT CAGCTGTTGT AGAGTTCTAT TTGGCAATCT CATGGTTAAA 1200 
G CAATAAACAA CTTCTTTAAA AGTTTTAAAT 1260 



Seq ID NO: 520 Protein sequence
H GLVIAWSLF LQACPLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH
HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQWPSD 
D YVNVNPERHK PSFWYFVNPA LSEPAEYDQV AM 



Protein Accession #: Eos sequence 



Seq ID NO: 523 DNA sequence
Nucleic Acid Accession #:
Coding sequence:



r L i I _AGT CACTTCCTCC A 

GGGGCTTGGG AGGCAGCCTG CTCTCCAGTC CCTATCCACC CACAGGTTTT TTGGGTCGGA 
GAGGAATTAT CTGATAAAAT TCCTGGGTTA ATATTTTTAA AAACGGAGAG TTTTTAAAAA
;CTTTTT ATGCTTCGAA i CA ITTGTC AACCAGCATA 
C TTTTTCTACG ATAAATGAAA GCATTTCTTC AAGAAAAAGG 
CACAGGTTCC TTGAACAGCT GGATTCTGAT GGCACCATTA CTATAGAGGA GCAGATTGTC 



GTTTCCCTGA ATGGGATGGA CTCATTTGTT GGCCCAGAGG 
CTGTTCCATG CCCICCTTAT ATTTATGACT TCAACCATAA 
GTAACCCCAA TGGAACATGG GATTTTATGC ACAGCTTAAA 
iACTGCCT TCGCTTTCT 



TTCTTTGAAC GCCTCTATGT AATGTATACC GTTGGCTACT CCATCTCTTT T 
A TTCTCATCAT TGGTTACTTC AGACGATTGC A 

T TATTTGTGTC TTTCATGCTG AGAGCTACAA GCATCTTTGT CAAAGACAGA 

GTAGTCCATG CTCACATAGG AGTAAAGGAG CTGGAGTCCC TAATAATGCA GGATGACCCA 

CAAAATTCCA TTGAGGCAAC TTCTGTGGAC AAATCACAAT ATATCGGGTG CAAGATTGCT 



GT TTCTGAATAC GGTTAGAGTT 1260 
CTAGCTACCA AAATCTGGGA GACCAATGCA GTTGGGCATG ACACAAGGAA GCAATACAGG 1320 
AAACTGGCCA AATCGACACT GGTCCTGGTC CTAGTCTTTG GAGTGCATTA CATCGTGTTC 1380 



AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680 

AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1800 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT 1860 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 



I I I I I I 

MLRSSLSTSI VLFLFSSFST INESISSRKE HRFLEQLDSD GTITIEEQIV LVLKAKVQCE 
LNITAQLQEG EGNCFPEWDG LICWPRGTVG KISAVPCPPY IYDFNHKGVA FRHCNPNGTW 
DFMHSLNKTW ANYSDCLRFL QPDISIGKQE FFERLYVMYT VGYSISFGSL AVAILIIGYF 
RRLHCTRNYI HMHLFVSFML RATSIFVKDR WHAHIGVKE LESLIMQDDP QNSIEATSVD 
KSQYIGCKIA WMFIYFIiAT NYYWILVEGL YLHNLIFVAF FSDTKYLKGF IL1GKGFPAA 
FVAAWAVARA TLADARCWEL SAGDIKWIYQ APILAAIGLN FILFLNTVRV LATKIWETNA 
VGHDTRKQYR KLAKSTLVLV LVFGVHYIVF VCLPHSFTGL GWEIRMHCEL FFNSFQGFFV 
SIIYCYCNGE VQAEVKKMWS RHNLSVDHKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 
Nucleic Acid Accession #:
Coding sequence:



GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCTC 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCC-CTCCACG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240 

TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 600 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720

CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCITT GTGGCTTTCT TTTCGGACAC 960 

G TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020

G GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 1080 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200

CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 1260

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500

S CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560 

3CTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680 

r AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740 



GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG T 
GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAC 
GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC
ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTJ 
ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG G 
GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT
TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC
ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC
TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 



MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 
GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 
NYSDCLRFLQ PDISIGKQEF FERLYVWYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 



LADARCWELS AGDIKWIYQA PILAAIGLNF ILFUITVRVL ATKIWETNAV GHDTRKQYRK 
LAKSTLVLVL VFGVHYIVFV CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 
QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 
IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILHEKPS 
GCQGETEDVL 



I I 



3 AGGTTAGGAC TTCAGCTGAT GGACAAGCTG GTAATGAAGA 
AATGGTGCAA ATAGATTTAC CAATAAAGAG ATATAGAGAG TATGAGCTGG TGACTCCAGT 
CAGCACAAAT CTAGAAGGAC GCTATCTCTC CCATACTCTT TCTGCGAGTC ACAAAAAGAG 
GTCAGCGAGG GACGTGTCTT CCAACCCTGA GCAGTTGTTC TTTAACATCA CGG( I J 
AAAAGATTTT CATCTGCGAC TAAAGCCCAA CACTCAACTA GTAGCTCCTG GGGCTGTTGT 
GGAGTGGCAT GAGACATCTC TGGTGCCTGG GAATATAACC GATCCCATTA ACAACCATCA 
ACCAGGAAGT GCTACGTATA GAATCCGGAA AACAGAGCCT TTGCAGACTA ACTGTGCTTA 480

TGTTGGTGAC ATCGTGGACA TTCCAGGAAC CTCTGTTGCC ATCAGCAACT GTGATGGTCT 540 

GGCTGGAATG ATAAAAAGTG ATAATGAAGA GTATTTCATT GAACCCTTGG AAAGAGGTAA 600 

ACAGATGGAG GAAGAAAAAG GAAGGATTCA TGTTGTCTAC AAGAGATCAG CTGTAGAACA 660 

GGCTCCCATA GACATGTCCA AAGACTTCCA CTACAGAGAG TCGGACCTGG AAGGCCTTGA 720

CAGACACGCG GGAGAAAACG ATTACAATAT CGAGGTACTG CTGGGAGTGG ATGACTCTGT 840 
GGTCCGTTTC CATGGCAAAG AGCACGTCCA AAACTACCTC CTGACCCTAA TGAACATTGT 900 
GAATGAAATT TACCATGATG AGTCCCTCGG AGTGCATATA AATGTGGTCC TGGTGCGCAT 960 

GATAATGCTG GGATATGCAA AGTCCATCAG CCTCATAGAA AGGGGAAACC CATCCAGAAG 1020

CTTGGAGAAT GTGTGTCGCT GGGCGTCCCA ACAGCAAAC-A TCTGATCTCA ACCACTCTGA 1080

ACACCATGAC CATGCAATTT TTTTAACCAG GCAAGACTTT GGACCTGCTG GAATGCAAGG 1140 

ATATGCTCCA GTCACCGGCA TGTGTCATCC AGTGAGAAGT TGTACCCTGA ATCATGAGGA 1200

TGGTTTTTCA TCTGCTTTTG TAGTAGCCCA TGAAACGGGC CATGTGTTGG GAATGGAGCA 1260

TGATGGACAA GGCAACAGGT GTGGTGATGA GACTGCTATG GGAAGTGTCA TGGCTCCCTT 1320

GGTACAAGCA GCATTCCATC GTTAGCACTG GTCCCGATGC AGTGGTCAAG AACTGAAAAG 1380 

ATATATCCAT TCCTATGACT GTCTCCTTGA TGACCCTTTT GATCATGATT GGCCTAAACT 1440 

CCCAGAACTT CCTGGAATCA ATTATTCTAT GGATGAGCAA TGTCGTTTTG ATTTTGGTGT 1500 

TGGCTATAAA ATGTGCACCG CGTTCCGAAC CTTTGACCCA TGTAAACAGC TGTGGTGTAG 1560 

CCATCCTGAT AATCCCTACT TTTGTAAGAC TAAAAAGGGA CCTCCACTTG ATGGGACTGA 1620

ATGTGCTGCT GGAAAATGGT GCTATAAGGG TCATTGCATG TGGAAGAATG CTAATCAGCA 1680 

AAAACAAGAT GGCAATTGGG GGTCATGGAC TAAATTTGGC TCCTC-TTCTC GGACATGTGG 1740 

AACTGGTGTT CGTTTCAGAA CACGCCAGTG CAATAATCCC ATGCCCATCA ATGGTGGTCA 1800 

GGATTGTCCT GGTGTTAATT TTGAGTACCA GCTTTGTAAC ACAGAAGAAT GCCAAAAACA 1860

CTTTGAGGAC TTCAGAGCAC AGCAGTGTCA GCAGCGAAAC TCCCACTTTG AATACCAGAA 1920

TACCAAACAC CACTGGTTGC CATATGAACA TCCTGACCCC AAGAAAAGAT GCCACCTTTA 1980 

CTGTCAGTCC AAGGAGACTG GAGATGTTGC TTACATGAAA CAACTGGTGC ATGATGGAAC 2040

GCACTGTTCT TACAAAGATC CATATAGCAT ATGTGTGCGA GGAGAGTGTG TGAAAGTGGG 2100

TAATTCCCAC TGCCGAACCG TGAAGGGGAC ATTTACCAGA ACTCCCAGGA AGCTTGGGTA 2220
CCTTAAGATG TTTGATATAC CCCCTGGGGC TAGACATGTG TTAATCCAAG AAGACGAGGC 2280 

AGGGGAGGAA GCCAAGTCGC GGACCTTCAT AGATCTTGGT GTGGAGTGGG ATTATAACAT 2400 

TGAAGATGAC ATTGAAAGTC TTCACACCGA TGGACCTTTA CATGATCCTG TTATTGTTTT 2460 

GATTATACCT CAAGAAAATG ATACCCGCTC TAGGCTGACA TATAAGTACA TCATCCATGA 2520

AGACTCTGTA CCTACAATCA ACAGCAACAA TGTCATCCAG GAAGAATTAG ATACTTTTGA 2580 

GTGGGCTTTG AAGAGCTGGT CTCAGTGTTC CAAACCCTGT GGTGGAGGTT TCCACTACAC 2640 

TAAATATGGA TGCCGTAGGA AAAGTGATAA TAAAATGGTC CATCGCAGCT TCTGTGAGGC 2700 

CAACAAAAAG CCGAAACCTA TTAGACGAAT GTGCAATATT CAAGAGTGTA CACATCCACT 2760 

CTGGGTAGCA GAAGAATGGG AACACTGCAC CAAAACCTGT GGAAGTTCTG GCTATCAGCT 2820

TCGCACTGTA CGCTGCCTTC AGCCACTCCT TGATGGCACC AACCGCTCTG TGCACAGCAA 2880 

ATACTGCATG GGTGACCGTC CCGAGAGCCG CCGGCCCTGT AACAGAGTGC CCTGCCCTGC 2940 

ACAGTGGAAA ACAGGACCCT GGAGTGAGTG TTCAGTGACC TGCGGTGAAG GAACGGAGGT 3000

GAGGCAGGTC CTCTGCAGGG CTGGGGACCA CTGTGATGGT GAAAAGCCTG AGTCGGTCAG 3060 

AGCCTGTCAA CTGCCTCCTT GTAATGATGA ACCATGTTTG GGAGACAAGT CCATATTCTG 3120

TCAAATGGAA GTGTTGGCAC GATACTGCTC CATACCAGGT TATAACAAGT TATGTTGTGA 3180 

GTCCTGCAGC AAGCGCAGTA GCACCCTGCC ACCACCATAC CTTCTAGAAG CTGCTGAAAC 3240 

TCATGATGAT GTCATCTCTA ACCCTAGTGA CCTCCCTAGA TCTCTAGTGA TGCCTACATC 3300 

TTTGGTTCCT TATCATTCAG AGACCCCTGC AAAGAAGATG TCTTTGAGTA GCATCTCTTC 3360 

AGTGGGAGGT CCAAATGCAT ATGCTGCTTT CAGGCCAAAC AGTAAACCTG ATGGTGCTAA 3420

TTTACGCCAG AGGAGTGCTC AGCAAGCAGG AAGTAAGACT GTGAGACTGG TCACCGTACC 3480 

ATCCTCCCCA CCCACCAAGA GGGTCCACCT CAGTTCAGCT TCACAAATGG CTGCTGCTTC 3540 

CTTCTTTGCA GCCAGTGATT CAATAGGTGC TTCTTCTCAG GCAAGAACCT CAAAGAAAGA 3600 

TGGAAAGATC ATTGACAACA GACGTCCGAC AAGATCATCC ACCTTAGAAA GATGAGAAAG 3660 

TGAACCAAAA AGGCTAGAAA CCAGAGGAAA ACCTGGACAA CCTCTCTCTT CCCATGGTGC 3720

ATATGCTTGT TTAAAGTGGA AATCTCTATA GATCGTCAGC TCATTTTATC TGTAATTGGA 3780 

AGAACAGAAA GTGCTGGCTC ACTTTCTAGT TGCTTTCATC CTCCTTTTGT TCTGCATTGA 3840 

CTCATTTACC AGAATTCATT GGAAGAAATC ACCAAAGATT ATTACAAAAG AAAAATATGT 3900 

TGCTAAGATT GTGTTGGTCG CTCTCTGAAG CAGAAAAGGG ACTGGAACCA ATTGTGCATA 3960

TCAGCTGACT TTTTGTTTGT TTTAGAAAAG TTACAGTAAA AATTAAAAAG AGATACCAAT 4020

GGTTTACACT TTAACAAGAA ATTTTGGATA TGGAACAAAG AATTCTTAGA CTTGTATTCC 4080 

TATTTATCTA TATTAGAAAT ATTGTATGAG CAAATTTGCA GCTGTTGTGT AAATACTGTA 4140 

TATTGCAAAA ATCAGTATTA TTTTAAGAGA TGTGTTCTCA AATGATTGTT TACTATATTA 4200 

CATTTCTGGA TGTTCTAGGT GCCTGTCGTT GAGTATTGCC TTGTTTGACA TTCTATAGGT 4260 

TAATTTTCAA AGCAGAGTAT TACAAAAGAG AAGTTAGAAT TACAGCTACT GACAATATAA 4320

AGGGTTTTGT TGAATCAACA ATGTGATACG TAAATTATAG AAAAAGAAAA GAAACACAAA 4380

AGCTATAGAT ATACAGATAT CAGCTTACCT ATTGCCTTCT ATACTTATAA TTTAAAGGAT 4440 

TGGTGTCTTA GTACACTTGT GGTCACAGGG ATCAACGAAT AGTAAATAAT GAACTCGTGC 4500 

AAGACAAAAC TGAAACCCTC TTTCCAGGAC CTCAGTAGGC ACCGTTGAGG TGTCCTTTGT 4560 

TTTTGTGTGT GTGTGTTCTT TTTTAATTTT CGCATTGTTG ACAGATACAA ACAGTTATAC 4620

TCAATGTACT GTAATAATCG CAAAGGAAAA AGTTTTGGGA TAACTTATTT GTATC-TTGGT 4680 

AGCTGAGAAA AATATCATCA GTCTAGAATT GATATTTGAG TATAGTAGAG CTTTGGGGCT 4740 

TTGAAGGCAG GTTCAAGAAA GCATATGTCG ATGGTTGAGA TATTTATTTT CCATATGGTT 4800
CATGTTCAAA TGTTCACAAC CACAATGCAT CTGACTGCAA TAATGTGCTA ATAATTTATG 4860
TCAGTAGTCA CCTTGCTCAC AGCAAAGCCA GAAATGCTCT CTCCAGGGAG TAGATGTAAA 4920
GTACTTGTAC ATAGAATTCA GAACTGAAGA TATTTATTAA AAC-TTGATTT TTTTTTCTTG 4980 
ATAGTATTTT TATGTACTAA ATATTTACAC TAATATCAAT TACATATTTT GGTAAACTAG 5040 
AGAGACATAA TTAGAGATGC ATGCTTTGTT CTGTGCATAG AGACCTTTAA GCAAACTACT 5100 
ACAGCCAACT CAAAAGCTAA AACTGAACAA ATTTGATGTT ATGCAAACAT CTTGCATTTT 5160
TAGTAGTTGA TATTAAGTTG ATGACTTGTT TCCCTTCAAG GAAACATTAA ATTGTATGGA 5220
CTCAGCTAGC TGTTCAATGA AATTGTGAAT TAGAAACATT TTTAAAAGTT TTTGAAAGAG 5280 
ATAAGTGCAT CATGAATTAC ATGTACATGA GAGGAGATAG TGATATCAGC ATAATGATTT 5340 
TGAGGTCAGT ACCTGAGCTG TCTAAAAATA TATTATACAA ACTAAAATGT AGATGAATTA 5400 
ACCTCTCAAA GCACAGAATG TGCAAGAACT TTTGCATTTT AATCGTTGTA AACTAACAGC 5460 
T GACTCTATAC CTCTAAAGAA TTGCTGCTAC TTTGTGCAAG AACTTTGAAG 5520

GTCAAATTAG GCAAATTCCA GATAGTAAAA CAATCCCTAA GCCTTAAGTC TTTTTTTTTT 5580

CCTAAAAATT CCCATAGAAT AAAATTCTCT CTAGTTTACT TGTGTGTGCA TACATCTCAT 5640 

CCACAGGGGA AGATAAAGAT GGTCACACAA ACAGTTTCCA TAAAGATGTA CATATTCATT 5700 

ATACTTCTGA CCTTTGGGCT TTCTTTTCTA CTAAGCTAAA AATTCCTTTT TATCAAAGTG 5760 

TACACTACTG ATGCTGTTTG TTGTACTGAG AGCACGTACC AATAAAAATG TTAACAAAAT 5820
AT 






I I I I I I 

MVLLSLWLIA AALVEVRTSA DGQAGNEEMV QIDLPIKRYR EYELVTPVST NLEGRYLSHT 60

LSASHKKRSA RDVSSNPEQL FFNITAPGKD PHLRLKPNTQ LVAPGAWEK H3TSLVPGNI 120

TDPINNHQPG SATYRIRKTE PLQTNCAYVG DIVDIPGTSV AISNCDGLAG MIKSDNEEYF 180 

IEPLERGKQM EEEKGRIHW YKRSAVEQAP IDMSKDFHYR ESDLEGLDDL GTVYGNIHQQ 240 

LNETMRRRRH AGENDYNIEV LLGVDDSWR PHGKEHVQNY LLTLMNIVNE IYHDESLGVH 300 

INWLVRMIM LGYAKSISLI ERGNPSRSLE NVCRWASQQQ RSDLNHSEHH DHAIFLTRQD 360 

FGPAGMQGYA PVTGMCHPVR SCTLNHEDGF SSAFWAHET GHVLGMEHDG QGNRCGDETA 420

MGSVMAPLVQ AAFHRYHWSR CSGQELKRYI HSYDCLLDD? FDHDWPKLPE LPGINYSMDE 480 

QCRFDFGVGY KMCTAFRTFD PCKQLWCSHP DNPYFCKTKK GPPLDGTECA AGKWCYKGHC 540 

MWKNANQQKQ DGNWGSWTKF GSCSRTCGTG VRFRTRQCNN PMPINGGQDC PGVNFEYQLC 600 

NTEECQKHFE DFRAQQCQQR NSHFEYQNTK HHWLPYEHPD PKKRCHLYCQ SKETGDVAYM 660 

KQLVHDGTHC SYKDPYSICV RGECVKVGCD KEIGSNKVED KCGVCGGDNS HCRTVKGTFT 720

RTPRKLGYLK HFDIPPGARH VLIQEDEASP HILAIKNOAT GHYILNGKGE EAKSRTFIDL 780 

GVEHDYHIED DIESLHTDGP LHDPVIVLII PQENDTRSSL TYKYIIHEDS VPTINSNHVI 840 

OEELDTFEWA LKSWSQCSKP CGGGFQYTKY GCRRKSDNKM VHRSFCEANK KPKPIRRMCN 900 

IQECTHPLWV AEEWEHCTKT CGSSGYQLRT VRCLQPLLDG TNRSVHSKYC MGDRPESRRP 960 

CNRVPCPAQW KTGPWSECSV TCGEGTEVRQ VLCRAGDHCD GEKPESVRAC QLPPCNDEPC 1020

LGDKSIFCQM EVLARYCSIP GYNKLCCESC SKRSSTLPPP YLLEAAETHD DVISNPSDLP 1080 

RSLVMPTSLV PYHSETPAKK MSLSSISSVG GPNAYAAFRP NSKPDGANLR QRSAQQAGSK 1140 

TVRLVTVPSS PPTKRVHLSS ASQMAAASFF AASDSIGASS QARTSKKDGK IIDNRRPTRS 1200 



Seq ID NO: 529 DNA sequence
Coding sequence: 246. 



AGGCGGACAA AGCCCGATTG TTCCTGGGCC CTTTCCCCAT CGCGCCTGGG CCTGCTCCCC 
AGCCCGGGGC AGGGGCGGGG GCCAGTGTGG TGACACACGC TGTAGCIGTC TCCCCGGCTG 
GCTGGCTCGC TCTCTCCTGG GGACACAGAG GTCGGCAGGC AGCACACAGA GGGACCTACG 
GGCAGCTGTT CCTTCCCCCG ACTCAAGAAT CCCCGGAGGC CCGGAGGCCT GCAGCAGGAG
CGGCCATGAA GAAGCTGATG GTGGTGCTGA GTCTGATTGC T 



TCIACACCTC GGGCCACTTG CTCTGTGGTG GGGTCCTTAT CCATCCACTG TGGGTCCTCA 
CAGCTGCCCA CTGCAAAAAA CCGAATCTTC AGGTCTTCCT GGGGAAGCAT AACCTTCGGC 
AAAGGGAGAG TTCCCAGGAG CAGAGTTCTG TTGTCCGGGC TGTGATCCAC CCTGACTATG 
ATGCCGCCAG CCATGACCAG GACATCATGC TGTTGCGCCT GGCACGCCCA GCCAAACTCT 
CTGAACTCAT CCAGCCCCTT CCCCTGGAGA GGGACTGCTC AGCCAACACC ACCAGCTGCC 
ACATCCTGGG CTGGGGCAAG ACAGCAGATG GTGATTTCCC TGACACCATC C



AAACCATTCA GGCCAAGTGA CCCTGACATG TGACATCTAC CTCCCGACCT ACCACCCCAC 
TGGCTGGTTC CAGAACGTCT CTCACCTAGA CCTTGCCTCC CCTCCTCTCC TGCCCAGCTC 
T GATTCTCCCT GGTTTTACCC 

TACCCCCACC ACTAAGAGAA TACAGGAAAA TCCCTTCTAG GCATCTCCTC TCCCCAACCC 1260 

TTCCACACGT TTGATTTCTT CCTGCAGAGG CCCAGCCACG TGTCTGGAAT CCCAGCTCCG 1320 

CTGCTTACTG TCGGTGTCCC CTTGGGATGT ACCTTTCTTC ACTGCAGATT TCTCACCTGT 1380 

AAGATGAAGA TAAGGATGAT ACAGTCTCCA TCAGGCAGTG GCTGTTGGAA AGATTTAAGA 1440 

TTTCACACCT ATGACATACA TGGGATAGCA CCTGGGCCGC CATGCACTCA ATAAAGAATG 1500 



LIQPLPLERD CSANTTSCHI LGWGKTADGD FPDTIQCAYI HLVSREECEH AYPGQITQNM 
LCAGDEKYGK DSCQGDSGGP LVCGDHLRGL VSWGNIPCGS KEKPGVYTNV CRYTNWIQKT 
IQAK 
CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120

GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 

Seq ID NO: 532 Protein sequence 



I I I I I I 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60 

FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNb 120 

LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSLA 180 

PIYSRSYLVF WTVSNLMAFL IMWVYLRIY VYVKRKTNVL 3PHTSGSISR RRTPMKLMKT 240 

VMTVLGAFW CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 300 
YGTHKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 



TCGGGACGCC TCGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 



C GCGGGGATCC CCGGCCAGAC 180 

r GCTGGGCGGT ACCCAGACAG 240 

r GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420

ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 

r TGAGGCAGGT CCTGTGGTCC 540

A GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 

ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGGGGAA CCTGAGGCTC CGGCCAGCTG 720

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCC-CTA TTTGAGCCAC 1140

GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200

AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320

GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 1620

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 1740 

AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTC-ACAC-ACA 1800 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040

2100 



CCGCTGTGGC CTACATCATT G 
AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG G
GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA A 
GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 
GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGC-CCCTAT 2880

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020



Seq ID NO: 534 Protein sequence



1 11 21 31 41 51 

I I I I I I 

MGAARGSPAR PHRLPLLSVL LLPLLGGTQT AIVPIKQPSS QDALQGRRAL LRCEVEAPGP 60

VHVYWLLDGA PVQDTERRFA QGSSLSPAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 120

IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180 

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARWLA PQDVWARYE 240 

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP I ILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWKEHAG 360 

VRLP1HGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSNLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTWWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTHYR 480 
CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKSA 1 
GSSLPEWVTD NAGTI.HFARV TRDDAGNYTC IASNGPQGQI £ 

TTVYQGHTAL LOCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660

RYTCIAGNSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720 

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 780 

KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840 

LDFRRELEMF GKLNHANWR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900 

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960 

YHFRQAWVPL RWMSPEAILE GDFSTKSDW AFGVLMWEVF THGEHPHGGQ ADDEVLADLQ 1020 
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSK? 

Seq ID NO: 535 DNA sequence 



I I I I I I 

TTCAGAAGGA GGAGAGACAC CGGGCCCAGG GCACCCTCGC GGGCGGGCGG ACCCAAGCAG 
TGAGGGCCTG CAGCCGGCCG GCCAGGGCAG CGGCAGGCGC GGCCCGGACC TACGGGAGGA 
AGCCCCGAGC CCTCGGCGGG CTGCGAGCGA CTCCCCGGCG ATGCCTCACA ACTCCATCAG 
ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG AGGGGCCTTT GTGAATGGCA GACCTCTGCC 
GGAAGTGGTC CGCCAGCGCA TCGTAGACCT GGCCCACCAG GGTGTAAGGC CCTGCGACAT
CTCTCGCCAG CTCCGCGTCA GCCATGGCTG CGTCAGCAAG ATCCTIGGCA GGTACTACGA 
GACTGGCAGC ATCCGGCCTG GAGTGATAGG GGGCTCCAAG CCCAAGGTGG CCACCCCCAA 
GGTGGTGGAG AAGATTGGGG ACTACAAACG CCAGAACCCT ACCATGTTTG CCTGGGAGAT 
CCGAGACCGG CTCCTGGCTG AGGGCGTCTG TGACAATGAC ACTGTGCCCA GTGTCAGCTC 
CATTAATAGA ATCATCCGGA CCAAAGTGCA GCAACCATTC AACCTCCCTA TGGACAGCTG 
CGTGGCCACC AAGTCCCTGA GTCCCGGACA CACGCTGATC CCCAGCTCAG CTGTAACTCC 
CCCGGAGTCA CCCCAGTCGG ATTCCCTGGG CTCCACCTAC TCCATCAATG GGCTCCTGGG 
CATCGCTCAG CCTGGCAGCG ACAAGAGGAA AATGGATGAC AGTGATCAGG ATAGCTGCCG 



G CACCACCTCG AGCCGCTCGA GTGCCCATTT GAGCGGCAGC 
GGCCTATGCC TCCCCCAGCC ACACCAAAGG CGAGCAGGGC 
G GGAAGGCCAC CCTGACCCCT 



CAAGTCGGCT CCGGGGTCCC GCCCTTCART G 

TACCCACCCC ACATCCCCAC CAGCGGACAG GGCAGCTATG CCTCCTCTGC CATCGCAGGC 1260 
ATGGTGGCAG GAAGTGAATA CTCTGGCAAT GCCTATGGCC ACACCCCCTA CTCCTCCTAC 1320 
AGCGAGGCCT GGGGCTTCCC C 



TGGGGACAGT G 
MPHNSIRSGH GGLNQLGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK
ILGRYYETGS IRPGVIGGSK PKVATPKWE KIGDYKRQNP TMFANEIRDR LLAEGVCDND 
TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSLGSTY 
SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 
ERQHYPEAYA SPSHTKGEQG LYPLPLLNST LDDGKATLTP SNTPLC-RNLS THQTYPWAA 
PPFWICSKSA PGSRPSHPFP HLPPCTGSSR ARPSSQGERW WGPRCPDTHP TSPPADRAAM 
PPLPSQAWWQ EVNTLAMPMA TPPTPPTARP GASPTPAC 



GAATTCGGCG ATGCCTCACA ACTCCATCAG ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG 
AGGGGCCTTT GTGAATGGCA GACCTCTGCC GGAAGTGGTC CGCCAGCGCA TCGTAGACCT 
3TAAGGC CCTGCGACAT CTCTCGCCAG CTCCGCGTCA GCCATGGTTG 






GGGCTCCAAG CCCAAGGTGG CCACCCCCAA GGTGGTGGAG AAGATTGGGC- 



GCAACCATTC AACCTCCCTA TGGACAGCTG CGTGGCCACC AAGTCCCTGA GTCCCGGACA 
C CCCAGCTCAG CTGTAACTOC CCCGGAGTCA CCCCAGTOGG ATTCCCTGGG 
ITCAATG GGCTCCTGGG CATCGCTCAG CCTGGCAGCG ACAAGAGGAA 

CGGACCCCGA AAGCACCTTC GCACGGATGC CTTCAGCCAG CACCACCTCG AGCCGCTCGA 



CCTACCCCGT 900 

AAGCAGGAA ACCCCCGAGG TGTCCAGTTC 960 

C CCTTCCTCTT TATCTAGCTC CGCCTTTTTG GATCTGCAGC AAGTCGGCTC 1020 

CGGGGTCCCG CCCTTCAATG CCTTTCCCCA TGCTGCCTCC GTGTACGGGC AGTTCACGGG 1080 

CCAGGCCCTC CTCTCAGGGC GAGAGATGGT GGGGCCCACG CTGCCCGGAT ACCCACCCCA 1140 

CATCCCCACC AGCGGACAGG GCAGCTATGC CTCCTCTGCC ATCGCAGGCA TGGTGGCAGG 1200

AAGTGAATAC TCTGGCAATG CCTATGGCCA CACCCCCTAC TCCTCCTACA GCGAGGCCTG 1260 

GCGCTTCCCC AACTCCAGCT TGCTGAGTTC CCCATATTAT TACAGTTCCA CATCAAGGCC 1320
GAGTGCACCG CCCACCACTG CCACGGCCTT TGACCATCTG T 

Seq ID NO: 538 Protein sequence



MPHNSIRSGH GGLNQLGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK 
ILGRYYETGS IRPGVIGGSK PKVATPKWE KIGDYKRQNP TMFAWEIRDR LLAEGVCDND 
TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSLGSTY 
SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 
ERQHYPEAYA SPSHTKGEQG LYPLPLLNST LDDGKATLTP SNTPLGRNLS THQTYPWAD 
PHSPFAIKQE TPEVSSSSST PSSLSSSAFL DLQQVGSGVP PFNAFPHAAS VYGQFTGQAL 
LSGREMVGPT LPGYPPHIPT SGQGSYASSA IAGMVAGSEY SGNAYGHTPY SSYSEAWRFP 
NSSLLSSPYY YSSTSRPSAP PTTATAFDHL 



I I I I I I 

GCCGCGGGAG AGGAGGCCAT GGGCGCGCGC GGGGCGCTGC TGCTGGCGCT GCTGCTGGCT 
CGGGCTGGAC TCAGGAAGCC GGAGTCGCAG GAGGCGGCGC CGTTATCAGG ACCATGCGGC 
CGACGGGTCA TCACGTCGCG CATCGTGGGT GGAGAGGACG CCGAACTCGG GCGTTGGCCG 
TGGCAGGGGA GCCTGCGCCT GTGGGATTCC CACGTATGCG GAGTGAGCCT GCTCAGCCAC 
CGCTGGGCAC TCACGGCGGC GCACTGCTTT GAAACCTATA GTGACCTTAG TGATCCCTCC 
GGGTGGATGG TCCAGTTTGG CCAGCTGACT TCCATGCCAT CCTTCTGGAG CCTGCAGGCC 
TACTACACCC GTTACTTCGT ATCGAATATC TATCTGAGCC CTCGCTACCT G 



CCCATCTGTC TCCAGGCCTC CACATTTGAG TTTGAGAACC GGACAGACTG CTGGGTGACT 
CAGGTCGCCA TCATAAACAA CTCTATGTGC A 
C CTTGGCCTGT S 



CCCAATCGGC CCGGTGTCTA C 

CCAGCCAGAC 900 
LWDSHVCGVS L
VSNIYLSPRY L 

EDEALPSPHT LQEVQVAI IN NSMCNHLFLK YSFRKDIFGD MVCAGNAQGG KDACFGDSGG 
PLACNKNGLW YQIGWSWGV GCGRPNRPGV YTNISHHPEW IQKLMAQSGM SQPDPSWPLL 
h LGPV 




TCGCCGCCGG GACCCGGGTG CCTGGGCTCG GCTTGAAGCG GCGGCGGCGC ACCGGCACAG 120 

CCGCGGGAGC ATGGGCAGGA GGATGCGGGG CGCCGCCGCC ACCGCGGGGC TCTGGCTGCT 180 

GGCGCTGGGC TCGCTGCTGG CGCTGTGGGG AGGGCTCCTG CCGCCGCGGA CCGAGCTGCC 240 

CGCCTCCCGG CCGCCCGAAG ACCGACTCCC ACGGCGCCCG GCCCGGAGCG GCGGCCCCGC 300 

GCCCGCGCCT CGCTTCCCTC TGCCCCCGCC CCTGGCGTGG GACGCCCGCG GCGGCTCCCT 360 

GAAAACTTTC CGGGCGCTGC TCACCCTGGC GGCCGGCGCG GACGGCCCGC CCCGGCAGTC 420 

CCGGAGCGAG CCCAGGTGGC ACGTGTCAGC CAGGCAGCCC CGGCCGGAGG AGAGCGCCGC 480 

GGTGCACGGG GGGGTCTTCT GGAGCCGCGG CCTGGAGGAG CAGGTGCCCC CGGGCTTTTC 540 

GGAGGCCCAG GCGGCGGCGT GGCTGGAGGC GGCTCGCGGC GCCCGGATGG TGGCCCTGGA 600

GCGCGGGGGT TGCGGGCGCA GCTCCAACCG ACTGGCCCGT TTTGCCGACG GCACCCGCGC 660

CTGCGTGCGC TACGGCATCA ACCCGGAGCA GATTCAGGGC GAGGCCCTGT CTTACTATCT 720 

GGCGCGCCTG CTGGGCCTCC AGCGCCACGT GCCGCCGCTG GCACTGGCTC GGGTGGAGGC 780 



CCTCAGCCAG GCGGAGCTGG TGGACCTAGT ACAATGGACC Gj 

CCTGACGGCC AACTTCGACC GGCTCGTAAG CAACCTCTTC AGCCTGCAGT GGGACCCGCG 1080 

3 CGTGCCACCA GCAACCTGCA CCGCGGTCCG GGCGGGGCGC TGGTCTTTCT 1140 

3 GCGGGCTTGG TGCACGGCTA CCGGGTAGCA GGCATGTGGG ACAAGTATAA 1200 

GG AGCCGCTG TTGCAGTCAG TGTGCGTGTT CCGCGAGCGG ACCGCGCGGC GCGTCCTGGA 1260 

TCGCTTCCCC GAGCTGGCCG CCCTTGCAGA CCCCCACGCT CAGCTGCTAC Ai 



ACTTGCGTTT GAGCCGTTGA GCTAATTCTG C 

CCAAACAGAG CGCTGGTGGC CCCGGAGCAG GGCTGTGACA TTGGCTGGTG GAGCCCCTTC 1920 

CTGTGTTCTC CCTTTGTTCC AGCGCCGCGA TGGTGAGATC ACTGTTCCAA GCAGGGGGAC 1980 

GGCTCGCGAT AGGACAAAGA GAGCAGGACC TCCAGACTCT GGGGAGCCCT GCAGACCTTG 2040

ACAATTTGCC TGACTCATTC CTGACCTCTT GTCATTTTGG CCTGAAGGCT ACAAATTCAG 2100 

GGTCAGCTGT ATGCACTAAG TCAAATAATG AATTTCTTCC TCCCTCTCGC AACCGACCAA 2160 

AATTTTGACA ACGATGATGT TCACCAGAAG GAAAAAAAAA TCAGTTTTAT GCACTTTATT 2220 

TTGTTTTGAT TTTCATTTTT TATTAAGAAA AAATTTTATT TTACAGAATT TACCTTCTCT 2280 

GTATATATGT GCATAAAGTG TGGTGTAAAT ATACTAAACA AACTTATATT TCAATAAAAG 2340 



1 11 21 31 41 51 

I I I I I I 

MGRRMRGAAA TAGLWLLALG SLLALWGGLL PPRTELPASR PPEDRLPRRP ARSGGPAPAP 60 

RFPLPPPLAW DARGGSLKTF RALLTLAAGA DGPPRQSRSE PRWHVSARQP RPEESAAVHG 120 

GVFWSRGLEE QVPPGFSEAQ AAAWLEAARG ARMVALERGG CGRSSNRLAR FADGTRACVR 180 

YGINPEQIQG EALSYYLARL LGLQRHVPPL ALARVEARGA QWAQVQEELR AAHWTEGSW 240 

SLTRWLPNLT DVWPAPWRS EDGRLRPLRD AGGELANLSQ AELVDLVQWT DLILFDYLTA 300 

NFDRLVSNLF SLQWDPRVMQ RATSNLHRGP GGALVFLDNE AGLVHGYRVA GMWDKYNEPL 360 

LQSVCVFRER TARRVLELHR GQDAAARLLR LYRRHEPRFP ELAALADPHA QLLQRRLDFL 420 



ATGGCCGGCT CTGGCGCGTG GAAGCGCCTC AAATCTATGC



\ ACAAACT TCG AGTTGTGGTG GCCGATGACG GTTCCGAAGC CCCGGAAAGG 
CCTGTTAACG GGGCGCACCC GACCCTCCAG GCCGACGATG ATTCCTTACT GGACCAAGAC 
TTACCTTTGA CCAACAGTCA GCTGAGTTTG AAGGTGGACT CCTGTGACAA CTGCAGCAAA 
CAGAGAGAGA TACTGAAGCA GAGAAAGGTG AAAGCCAGGT TGACCATTGC TGCCGTTCTG 
TACTTGCTTT TCATGATTGG AGAACTTGTA GGTGGATACA TTGCAAATAG CCTAGCAATC 
ATGACAGATG CACTTCATAT GTTAACTGAC CTAAGCGCCA TCATACTCAC CCTGCTTGCT 480

TTGTGGCTAT CATCAAAATC ACCAACCAAA AGATTCACCT TTGGATTTCA TCGCTTAGAG 540

GTTTTGTCAG CTATGATTAG TGTGCTGTTG GTGTATATAC TTATGGGATT CCTCTTATAT 600

GAAGCTGTGC AAAGAACTAT CCATATGAAC TATGAAATAA ATGGAGATAT AATGCTCATC 660

ACCGCAGCTG TTGGAGTTGC AGTTAATGTA ATAATGGGGT TTCTGTTGAA CCAGTCTGGT 720 

CACCGTCACT CCCATTCCCA CTCCCTGCCT TCAAATTCCC CTACCAGAGG TTCTGGGTGT 780 

GAACGTAACC ATGGGCAGGA TAGCCTGGCA GTGAGAGCTG CATTTGTACA TGCTTTGGGA 840

GATTTGGTAC AGAGTGTTGG TGTGCTAATA GCTGCATACA TCATACGATT CAAGCCAGAA 900

TACAAGATTG CTGATCCCAT CTGTACATAC GTATTTTCAT TACTTGTGGC TTTTACAACA 960 

TTTCGAATCA TATGGGATAC AGTAGTTATA ATACTAGAAG GTGTGCCAAG CCATTTGAAT 1020 

GTAGACTATA TCAAAGAAGC CTTGATGAAA ATAGAAGATG TATATTCAGT CGAAGATTTA 1080 

AATATCTGGT CTCTCACTTC AGGAAAATCT ACTGCCATAG TTCACATACA GCTAATTCCT 1140 

GGAAGTTCAT CTAAATGGGA GGAAGTACAG TCCAAAGCAA ACCATTTATT ATTGAACACA 1200

TTTGGCATGT ATAGATGTAC TATTCAGCTT CAGAGTTACA GGCAAGAAGT GGACAGAACT 1260 
TGTGCAAATT GTCAGAGTTC TAGTCCCTGA 

Seq ID NO: 544 Protein sequence
Protein Accession #= XP_007652.1 



QREILKQRKV KARLTIAAVL 

YLLFMIGELV GGYIANSLAI MTDALHMLTD LSAIILTLLA LWLSSKSPTK RFTFGFHRLE
VLSAMISVLL VYILMGPLLY EAVQRTIHMN YEINGDIMLI TAAVGVAVNV IMGFLLNQSG

HRHSHSHSLP SNSPTRGSGC ERNHGQDSLA VRAAFVHALG DLVQSVGVLI AAYIIRFKPE 
JIIWDTWI ILEGVPSHLN VD Y I KEALMK IEDVYSVEDL 
S TAIVHIQLIP GSSSKWEEVQ SKANHLLLNT FGMYRCTIQL QSYRQEVDRT 




| | | | | |

ATGTTTTCCG GCTTCAATGT CTTTAGAGTT GGGATCTCTT TTGTCATAAT GTC 
TACATGCCAA CAGTAAACTC TTTACCAGAA CTGAGTCCTC AGAAAT2 
CAACCAGGAA AAGCCTCTTT AGCTTATTTT TGTCAAGCTG ATTCCCC 
TTTCTTGAAG AACTGAATGA GGCTGTTAGA CCTCTGCAGG ACTATGGAAT T 
AAGGTTAATT GTGTCAAAGA AGAAATATCA AGATACTGTG GAAAAGJ 

AAAGCATATT TATTCAAGGG CAACATATTG CTCAGAGAAT TCCCTACTGA CACCTTGTTT 360

GATGTGAATG CCATTGTCGC CCATGTTCTC TTTGCTCTTC TTTTTAGTGA AGTGAAATAT 420

ATTACCAACC TGGAAGACCT TCAGAACATA GAAAATGCTC TGAAAGGAAA AGCAAATATT 480 

ATATTCTCAT ATGTAAGAGC CATTGGAATA CCAGAGCACA GAGCAGTCAT GGAAGCCGCT 540 

TTTGTGTATG GGACTACATA CCAATTTGTC TTAACCACAG AAATTGCCCT TTTGGAAAGT 600 

ATTGGCTCTG AGGATGTGGA ATATGCACAT CTCTACTTTT TTCATTGTAA ACTAGTCTTG 660

GACTTGACCC AGCAATGTAG AAGAACACTA ATGGAACAGC CATTGACTAC ACTGAACATT 720 

CACCTGTTTA TTAAGACAAT GAAAGCACCT CTGTTGACTG AAGTTGCTGA AGATCCTCAA 780 

CAAGTTTCAA CTGTCCATCT CCAACTGGGC TTACCACTGG TTTTTATTGT TAGCCAACAG 840 

GCTACTTATG AAGCTGATAG AAGAACTGCA GAATGGGTTG CTTGGCGTCT TCTGGGAAAA 900

GCAGGAGTTC TACTCTTGTT AAGGGACTCT TTGGAAGTGA ACATTCCTCA AGATGCTAAT 960 

GTGGTCTTCA AAAGAGCAGA AGAGGGAGTT CCAGTGGAAT TTTTGGTATT ACATGATGTT 1020

GATTTAATAA TATCTCATGT GGAAAATAAT ATGCACATTG AGGAAATACA AGAAGATGAA 1080 

GACAATGACA TGGAAGGTCC AGATATAGAT GTTCAGGATG ATGAAGTGGC AGAAACTGTT 1140 

TTCAGAGATA GGAAGAGAAA ATTACCTTTG GAACTTACAG TGGAACTAAC AGAAGAAACA 1200

TTTAATGCAA CAGTGATGGC TTCTGACAGC ATAGTACTCT TCTATGCTGG TTGGCAAGCA 1260

GTATCCATGG CATTTTTGCA ATCCTATATT GATGTGGCAG TTAAACTGAA AGGCACATCT 1320

ACTATGCTTC TTACTAGAAT AAACTGTGCA GATTGGTCTG ATGTATGTAC TAAGCAAAAT 1380 

GTTACTGAAT TTCCTATCAT AAAGATGTAC AAGAAAGGCG AGAACCCAGT ATCTTATGCT 1440 

GGAATGTTAG GAACCGAAGA TCTCCTAAAA TTTATCCAGC TCAACAGGAT TTCATATCCA 1500

GTGAATATAA CATCGATCCA AGAAGCAGAA GAATATTTAA GTGGGGAATT ATATAAAGAC 1560 

CTCATCTTGT ATTCTAGTGT GTCAGTATTG GGACTATTTA GTCCAACCAT GAAAACAGCA 1620

AAAGAAGATT TTAGTGAAGC AGGAAACTAC CTAAAAGGAT ATGTTATCAC TGGAATTTAT 1680 

TCTGAAGAAG ATGTTTTGCT ACTGTCAACC AAATATGCTG CAAGTCTTCC AGCCCTGCTG 1740

CTTGCCAGAC ACACAGAAGG CAAAATAGAG AGCATCCCAC TAGCTAGCAC ACATGCACAA 1800

GACATAGTTC AAATAATAAC AGATGCACTA CTGGAAATGT TTCCGGAAAT CACTGTGGAA 1860 

AATCTTCCCA GTTATTTCAG ACTTCAGAAA CCATTATTGA TTTTGTTCAG TGATGGCACT 1920 

GTAAATCCTC AGTATAAAAA AGCAATATTG ACACTGGTAA AGCAGAAATA CTTGGATTCA 1980 

TTTACTCCAT GCTGGTTAAA TCTAAAGAAT ACTCCAGTGG GGAGAGGAAT CTTGAGGGCA 2040

TATTTTGATC CTCTGCCTCC CCTTCCTCTT CTTGTTTTGG TGAATCTGCA TTCAGGTGGC 2100 

CAAGTATTTG CATTTCCTTC AGACCAGGCT ATAATTGAAG AAAACCTTGT ATTGTGGCTG 2160

AAGAAATTAG AAGCAGGACT AGAAAATCAT ATCACAATTT TACC1GCTCA AGAATGGAAA 2220
G CCGCAACAIC T 



CATGAAGATA AATCGGCAGT 



MFSGFNVFRV GISFVIMCIF YMPTVNSLPE LSPQKYFSTL QPGXASLAYF CQADSPRTSV 
FLEELNEAVR PLQDYGISVA KVNCVKEEIS RYCGKEKDLM KAYLFKGNIL LREFPTDTLF 
DVNAIVAHVL FALLFSEVKY ITNLEDLQMI ENALKGKANI IFSYVRAIGI PEHRAVMEAA 
FVYGTTYQFV LTTEIALLES IGSEDVEYAH LYFFHCKLVL DLTQQCRRTL MEQPLTTLNI 
HLFIKTMKAP LLTEVAEDPQ QVSTVHLQLG LPLVFIVSQQ ATYEADRRTA EWVAWRLLGK
AGVLLLLRDS LEVNIPQDAN VVFKRAEEGV PVEFLVLHDV DLIISHVENN MHIEEIQEDE
DNDMEGPDID VQDDEVAETV FRDRKRKLPL ELTVELTEET FNATVMASDS IVLPYAGWQA 
VSMAFLQSYI DVAVKLKGTS TMLLTRINCA DWSDVCTKQN VTEFPIIKMY KKGENPVSYA
GMLGTEDLLK FIQLNRISYP VNITSIQEAE EYLSGELYKD LILYSSVSVL GLFSPTMKTA
KEDFSEAGNY LKGYVITGIY SEEDVLLLST KYAASLPALL LARHTEGXIE SIPLASTHAQ 
DIVQIITDAL LEMFPEITVE NLPSYFRLQK PLLILFSDGT VNPQYKKAIL TLVKQKYLDS
FTPCWLNLKN TPVGRGILRA YFDPLPPLPL LVLVWLHSGG QVFAFPSDQA IIEEMLVLWL 
KKLEAGLENH ITILPAQEWK PPLPAYDFLS MIDAATSQRG TRKVPKCMKE TDVQEHDKEQ 
HEDKSAVRKE PIETLRIKHW NRSNWFKEAE KSFRRDKELG CSKVN 

Seq ID NO: 547 DNA sequence
Nucleic Acid Accession #: NM_033102.1
Coding sequence: 1..1662



ATGGTCCAGA G 
GTCAACCTGC T 

C TGGAAGTGGG GGTAGAGGAG AAGTTCATGA CCATGGTGCT G 
G GCCTGGTCTG TGTCCCGCTC CTAGGCTCAG CCAGTGACCA CTGGCGTGGA 



GAGCTGGCAC TGCTCATCCT GGGCGTGGGG CTGCTGGACT TCTCTGGCCA GGTGTGCTTC 



TACTCTGTCT A 

ATTGACTGGG ACACCAGTGC CCTGGCCCCC TACCTGGGCA CCCAGGAGGA GTGCCTCTTT 600
GGCCTGCTCA C 
GCAGCGCTGG G 

TGCTGTCCAT GCCGGGCCCG CTTGGCTTTC CGGAACCTGG GCGCCCTGCT TCCCCGGCTG 
AGCTGGATGG CACTCATGAC C 

TACCAGGGCG TGCCCAGAGC TGAGCCGGGC ACCGAGGCCC GGAGACACTA TGATGAAGGC 
i GCAGCCTGGG GCTGTTCCTG CAGTGCGCCA TCTCCCIGGT C 
fGGTGCA GCGATTCGGC A 
GCTTTCCCTG TGGCTGCCGG TGCCACATGC C 
TCAGCCGCCC TCACCGGGTT CACCTTCTCA GCCCTGCAGA T 
TCCCTCTACC ACCGGGAGAA G' 

GCTAGCAGTG AGGACAGCCT GATGACCAGC TTCCTGCCAG GCCCTAAGCC T 
TTCCCTAATG GACACGTGGG TGCTGGAGGC AGTGGCCTGC TCCCACCTCC ACCCGCGCTC 1380
TGCGGGGCCT CTGCCTGTGA TGTCTCCGTA CGTGTGGTGG TGGGTGAGCC CACCGAGGCC 1440 
AGGGTGGTTC CGGGCCGGGG CATCTGCCTG GACCTCGCCA TCCTGGATAG TGCCTTCCTG 1500
CTGTCCCAGG TGGCCCCATC CCTGTTTATG GGCTCCATTG TCCAGCTCAG CCAGTCTGTC 1560 
ACTGCCTATA TGGTGTCTGC CGCAGGCCTG GGTCTGGTCG CCATTTACTT TGCTACACAG 1620
GTAGTATTTG ACAAGAGCGA CTTGGCCAAA TACTCAGCGT GA 



Seq ID NO: 548 Protein sequence
Protein Accession # : NP_1 



MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 

PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL FLIPRAGWLA GLLCPDPRPL 

ELALLILGVG LLDFCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 

IDWDTSALAP YLGTQEECLF GLLTLIFLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 

CCPCRARLAF RNLGALLPRL HQLCCRMPRT LRRLFVAELC SWMALHTFTL FYIDFVGEGL 

YQGVPRAEPG TEARRHYDEG VRMGSLGLFL QCAISLVFSL VMDRLVQRFG TRAVYLASVA 

AFPVAAGATC LSHSVAWTA SAALTGFTFS ALQILPYTLA SLYHREKQVF LPKYRGDTGG 

ASSEDSLMTS FLPGPKPGAP FPNGHVGAGG SGLLPPPPAL CGASACDVSV RVWGEPTEA 
RWPGRGICL DLAILDSAFL LSQVAFSLFM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 
WFDKSDLAK YSA 

Seq ID NO: 549 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1389 




ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 
GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC A
TTCACCCAAG GGGACTTATT TGAAAATTAC T 
AGATTTTGTT ATGGTGTCAC TGTCATTTTG A 

GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCC 

ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080

GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140 

TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200

ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260

CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 1320 

AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380

Seq ID NO: 550 Protein sequence



| | | | | |

MGYQRQEPVI PPQEDLDDRE TLVSEHEYKE KTCQSAALFN WNSIIGSGI IGLPYSMKQA 
GFPLGILLLF WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFPGYLL LSVLQFLYPF 
IAMISYHIIA GDTLSKVFQR IPGVDPENVF IGRHFIIGLS TVTFTLPLSL YRHIAKLGKV 
SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA FICHHNSFLV 
YSSLEEPTVA KWSRLIHMSI VISVFICIFF ATCGYLTFTG FTQGDLFENY CRNDDLVTFG 
RFCYGVTVIL TYPMECFVTR EVIANVFFGG NLSSVFHIW TVKVITVATL VSLLIDCLGI 




Eos sequence 



I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC C 
AAGCAAGCTG GGTTTCCTTT GGGAAT2 
TTTTCCCTTG TTTTATTGAT AAAAGGAGGG G 
TTGGTCAATA AAACTTTCGG CTTTCCJ 

TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACT 
TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AAGGTGTTTA TTGGTCGCCA CTTCATTATT 
GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT
GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 
AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 
CCCAATGCCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT 

TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTAT 
ATGTCCATCG TGATTTCTGT ATTTATCTGT A 
TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780 
ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 840 
GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 
ATTGTTGTAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 960
CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 
CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATG 1080 
A TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 1140 
C CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200
T CAGCAGACAA CACAACTTTC TACTTTAAAT 1260
ATTAGTATCT TTCAACTCGA GTAA 

Seq ID NO: 552 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51 

| | | | | |

MGYQRQEPVI PPQRGLPYSM KQAGFPLGIL LLFWVSYVTD FSLVLLIKGG ALSGTDTYQS 60 
LVNKTFGFPG YLLLSVLQFL YPFIAMISYN IIAGDTLSKV FQRIPGVDPE NVFIGRHFII 120
GLSTVTFTLP LSLYRNIAKL GKVSLISTGL TTLILGIVMA RAISLGPHIP KTEDAWVFAK 180 
PNAIQAVGVM SFAFICHHNS FLVYSSLEEP TVAKWSRLIH MSIVISVFIC IFFATCGYLT 240 
FTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH 300 
IWTVMVITV ATLVSLLIDC LGIVLELNGV LCATPLIFII PSACYLKLSE EPRTHSDKIM 360 
V QQTTQLSTLN 420 




T ATCTGCTCCT CTCTGTTCTT C 
AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 
GATCCTGAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 
ACTCTGCCTT TATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 
ACAGGTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 
CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 
GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540
GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600 
A TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACCCAAGGG 660 






GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA G 

AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900

AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

3 TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080 

3 AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACGTCAGAG 1140 

TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200
TAA 



Protein Accession #: Eos sequence 

1 11 21 31 41 51

| | | | | |

MGYQRQEPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLLSVL QFLYPFIAMI 
SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 
TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAFICH HNSFLVYSSL
EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 
GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIWTVMV ITVATLVSLL IDCLGIVLEL 
NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAWMVFGFV MAITNTQDCT 
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE 

Seq ID NO: 



Eos sequence 




GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTAC 

T TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

T GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

3 ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

3 GCTATTACAA ATACTCAAGA CTGCACCCAT 1020

3 CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA A 



1 11 21 31 41 51 

| | | | | |

MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD 
PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH 
IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKKSRL IHMSIVISVF 
ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 
VFFGGNLSSV FHIWTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKL 
SEEPRTHSDK IMSCVMLPIG AWMVFGFVM AITNTQDCTH GQEMFYCFPD N 
HVQQTTQLST LNISIFQLE 






Nucleic Acid Accession #: XM_057188.1



1 11 21 31 41 51 

| | | | | |

ATGGGATTGC CTCTCCCTCT CACTCTGGGC TTCTGTCCCA CTCTIATCTI AGTGTCAGTC 

CTCCCCCAAG TCTGTGTCCC TCTCTCTCCC CTAAATCTCT GGCCCCTCCT TTCTGAGTTC 

CTGCCCTTCC CCCAATTCTT TGGTTTTTGC ATCCCCCTCT GCCCCTTGCC TCAGTCAAGT 

CTCCCCTGTG TGTCTCTCTC CCCCCGGCCC GGACCTCTGC ACCCCCCAGC TCGCTGTCCC

GCCCCCCAGA CCTCGCCCCG CCCCAGGCTA GGCTGGAAAG TGGAGGATCC 
CCGGCGGAGG GAGCGCCGGG GCCCTGGCCT
;tgccgg AGAAGGAGCA GAGCTGGATC 
CCCAAGATCT TCAAGAAGAA GACCTGCACG ACGTTCATAG TTGACTCCAC AGATCCGGGG 
GATGGGGTCT CGCTCTATTG CCCAGGCTGT TCTCAAACTC CTGGGCTCAA GCAGTCCTTC 
TGCCTTGACC TCCCAAAGTG CTTGGATTGT GCCCAGCCGT CCTTGAAGTT TTGCTCAGAA 
GAGCAAACTT TCTGGGAAGT AGCTGCAGGT GTTGGAAGTA GCTGCAGGGG AACTAGGGGA 
TTCAAGGATG GAGCTGAAAT GGGTGAACGG ACAAAGTCGG TAAACTGAAT GGAGGATGCC
TTCGGGGCAG CCGTGGTGAC CGTGTGGGAC AGCGATGCAC ACACCACGGA GAAGCCCACC 
3 GAGAGCTGGA CTTCACGGGG GCCGGCCGCA AGCACAGCAA TTTCCTCCGG 



G CCAGCACTGG GGGCACCAAG GTGGTGGCCA TGGGTGTGGC CCCCTGGGGT 1200 

GTGGTCCGGA ATAGAGACAC CCTCATCAAC CCCAAGGGCT CGTTCCCTGC GAGGTACCGG 1260

TGGCGCGGTG ACCCGGAGGA CGGGGTCCAG TTTCCCCTGG ACTACAACTA CTCGGCCTTC 1320

TTCCTGGTGG ACGACGGCAC ACACGGCTGC CTGGGGGGCG AGAACCGCTT CCGCTTGCGC 1380

CTGGAGTCCT ACATCTCACA GCAGAAGACG GGCGTGGGAG GGACTGGAAT TGACATCCCT 1440 

GTCCTGCTCC TCCTGATTGA TGGTGATGAG AAGATGTTGA CGCGAATAGA GAACGCCACC 1500 

CAGGCTCAGC TCCCATGTCT CCTCGTGGCT GGCTCAGGGG GAGCTGCGGA CTGCCTGGCG 1560 

GAGACCCTGG AAGACACTCT GGCCCCAGGG AGTGGGGGAG CCAGGCAAGG CGAAGCCCGA 1620

GATCGAATCA GGCGTTTCTT TCCCAAAGGG GACCTTGAGG TCCTGCAGGC CCAGGTGGAG 1680

AGGATTATGA CCCGGAAGGA GCTCCTGACA GTCTATTCTT CTGAGGATGG GTCTGAGGAA 1740 

TTCGAGACCA TAGTTTTGAA GGCCCTTGTG AAGGCCTGTG GGAGCTCGGA GGCCTCAGCC 1800

TACCTGGATG AGCTGCGTTT GGCTGTGGCT TGGAACCGCG TGGACATTGC CCAGAGTGAA 1860 

CTCTTTCGGG GGGACATCCA ATGGCGGTCC TTCCATCTCG AAGCTTCCCT CATGGACGCC 1920

CTGCTGAATG ACCGGCCTGA GTTCGTGCGC TTGCTCATTT CCCACGGCCT CAGCCTGGGC 1980 

CACTTCCTGA CCCCGATGCG CCTGGCCCAA CTCTACAGCG CGGCGCCCTC CAACTCGCTC 2040

ATCCGCAACC TTTTGGACCA GGCGTCCCAC AGCGCAGGCA CCAAAGCCCC AGCCCTAAAA 2100

GGGGGAGCTG CGGAGCTCCG GCCCCCTGAC GTGGGGCATG TGCTGAGGAT GCTGCTGGGG 2160 

AAGATGTGCG CGCCGAGGTA CCCCTCCGGG GGCGCCTGGG ACCCTCACCC AGGCCAGGGC 2220

TTCGGGGAGA GCATGTATCT GCTCTCGGAC AAGGCCACCT CGCCGCTCTC GCTGGATGCT 2280 

GGCCTCGGGC AGGCCCCCTG GAGCGACCTG CTTCTTTGGG CACTGTTGCT GAACAGGGCA 2340

CAGATGGCCA TGTACTTCTG GGAGATGGGT TCCAATGCAG TTTCCTCAGC TCTTGGGGCC 2400

TGTTTGCTGC TCCGGGTGAT GGCACGCCTG GAGCCTGACG CTGAGGAGGC AGCACGGAGG 2460 

AAAGACCTGG CGTTCAAGTT TGAGGGGATG GGCGTTGACC TCTTTGGCGA GTGCTATCGC 2520

AGCAGTGAGG TGAGGGCTGC CCGCCTCCTC CTCCGTCGCT GCCCGCTCTG GGGGGATGCC 2580 

ACTTGCCTCC AGCTGGCCAT GCAAGCTGAC GCCCGTGCCT TCTTTGCCCA GGATGGGGTA 2640 

CAGTCTCTGC TGACACAGAA GTGGTGGGGA GATATGGCCA GCACTACACC CATCTGGGCC 2700 

CTGGTTCTCG CCTTCTTTTG CCCTCCACTC ATCTACACCC GCCTCATCAC CTTCAGGAAA 2760 

TCAGAAGAGG AGCCCACACG GGAGGAGCTA GAGTTTGACA TGGATAGTGT CATTAATGGG 2820

GAAGGGCCTG TCGGGACGGC GGACCCAGCC GAGAAGACGC CGCTGGGGGT CCCGCGCCAG 2880 




ACTGTCCTCT GCATCGACTT CATGGTTTTC ACGGTGCGGC TGCTTCACAT CTTCACGGTC 3360 

AACAAACAGC TGGGGCCCAA GATCGTCATC GTGAGCAAGA TGATGAAGGA CGTGTTCTTC 3420

TTCCTCTTCT TCCTCGGCGT GTGGCTGGTA GCCTATGGCG TGGCCACGGA GGGGCTCCTG 3480

AGGCCACGGG ACAGTGACTT CCCAAGTATC CTGCGCCGCG TCTTCTACCG TCCCTACCTG 3540 

CAGATCTTCG GGCAGATTCC CCAGGAGGAC ATGGACGTGG CCCTCATGGA GCACAGCAAC 3600 

TGCTCGTCGG AGCCCGGCTT CTGGGCACAC CCTCCTGGGG CCCAGGCGGG CACCTGCGTC 3660 
TCCCAGTATG CCAACTGGCT G 
ATCCTGCTGG TCAACTTGCT CATTGCCATG T 

AACAGCGATC TCTACTGGAA GGCGCAGCGT TACCGCCTCA TCCGGGAATT CCACTCTCGG 3840



CTTTCTAAGG AAGCCGAGCG GAAGCTGCTA ACGTGGGAAT C 
CTGCTGGCAC GCGCTAGGGA CAAGCGGGAG AGCGACTCCG A 

CAGAAGGTGG ACTTGGCACT GAAACAGCTG GGACACATCC GCGAGTACGA ACAGCGCCTG 4140 

AAAGTGCTGG AGCGGGAGGT CCAGCAGTGT AGCCGCGTCC TGGGGTGGGT GGCCGAGGCC 4200

CTGAGCCGCT CTGCCTTGCT GCCCCCAGGT GGGCCGCCAC CCCCTGACCT GCCTGGGTCC 4260

AAAGACTGAG CCCTGCTGGC GGACTTCAAG GAGAAGCCCC CACAGGGGAT TTTGCTCCTA 4320 

GAGTAAGGCT CATCTGGGCC TCGGCCCCCG CACCTGGTGG CCTTGTCCTT GAGGTGAGCC 4380

CCATGTCCAT CTGGGCCACT GTCAGGACCA CCTTTGGGAG TGTCATCCTT ACAAACCACA 4440 

GCATGCCCGG CTCCTCCCAG AACCAGTCCC AGCCTGGGAG GATCAAGGCC TGGATCCCGG 4500

GCCGTTATCC ATCTGGAGGC TGCAGGGTCC TTGGGGTAAC AGGGACCACA GACCCCTCAC 4560
CACTCACAGA TTCCTCACAC TGGGGAAATA AAGCCATTTC AGAGGAAAAA A 



Protein Accession #:



I I I I 



TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL HTGIGRHVGV 
AVRDHQMAST GGTKWAMGV APWGWRNRD TLINPKGSFP ARYRWRGDPE DGVQFPLDYN 
YSAFFLVDDG THGCLGGENR FRLRLESYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 
ENATQAQLPC LLVAGSGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 
AQVERIMTRK ELLTVYSSED GSEEFETIVL KALVKACGSS EASAYLDELR LAVAWNRVDI 
AQSELFRGDI QWRSFHLEAS LMDALMDRP EFVRLLISHG LSLGHFLTPM RLAQLYSAAP 
SNSLIRNLLD QASHSAGTKA PALKGGAAEL RPPDVGHVLR MLLGKHCAPR YPSGGAWDPH 
PGQGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLWALL LNRAQMAMYF WEMGSNAVSS 
ALGACLLLRV MARLEPDAEE AARRKDLAFK FEGMGVDLFG ECYRSSEVRA A 
WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF C 
TFRKSEEEPT REELEFDMDS VINGEGPVGT K 
LERWFHFWGA PVTIFMGMW SYLLFLLLFS RVLLVDFQFA PPGSLELLLY FWAFTLLCEE 780

LRQGLSGGGG SLASGGPGPG HASLSQRLRL YLADSWNQCD LVALTCFLLG VGCRLTPGLY 840 

HLGRTVLCID FMVFTVRLLH IFTVNKQLGP KIVIVSKMMK DVFFFLFFLG WLVAYGVAT 900 

EGLLRPRDSD FPSILRRVFY RPYLQIFGQI PQEDMDVALM EHSNCSSEPG FWAHPPGAQA 960 

GTCVSQYANW LWLLLVIFL LVANILLWL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020 

[SHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTW3SVH 1080 

J DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSRVLGW 1140 



Seq ID NO: 559 DNA sequence



I I I I I 

AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTC 

ATCGGGCAGA GGTCTCACAG CAGCCAAGGA ACCTGGGGCC CGCTCCTCCC C 

CATGAGGATT CTGCAGTTAA TCCTGCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180 

CAGGATCATC AAGGGGTTCG AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240 

CGAGAAGACG CGGCTACTCT GTGGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300 

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCGCCAGT 480 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540 

CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600 

CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660

CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720 

GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

CCAGGATCCG TGTGCGATCA CCCGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840 

GGACTGGATC CAGGAGACGA TGAAGAACAA TTAGACTGGA CCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960 

CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140 
TCCTGGCCAT ATATCAAGGT TTCAATAAAT ATTTGCTAAA TGAGTG 



MRILQLILLA LATCLVCCET RIIKGFECKP IISQPWQAALF EKTRLLCGAT LIAPRWLLTA 
V HLGQHNLGKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 
r LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 
NITDTMVCAS VQEGGKDSCQ CDSGGPLVCN QSLQCIISWG QDPCAITRKP GVYTKVCKYV 
DWIQETMKNN 

Seq ID NO: 561 DNA sequence 
Nucleic Acid Accession #: AY046419.1 
Coding sequence: 1..1743 

| | | | | |

ATGTTTACCT TCCTGTCATC TGTCACTGCT G' " 

CTTGGGATCA TCTCTGGGGC TCTTCTTCAG A' 
GAGCAGGAAA TGGTTGTGAG CTCCCTCGTC ATTGGAGCCC T 
GGGGTCCTGA TAGACAGATA T 
GGACTCGGAA GCTTAGTCTT G 

rTTCCAT CTCCCTCTCT TCCATTGCCA CTTGTGTTTA CATCGCAGAG 



GGCATTCTTT CTGCCTATAT TTCAAATTAC GCATTTGCCA ATGTTTTCCA TGGCTGGAAG 480 

TACATGTTTG GTCTTGTGAT TCCCTTGGGA GTTTTGCAAG CAATTGCAAT GTATTTTCTT 540 

CCTCCAAGCC CTCGGTTTCT GGTGATGAAA GGACAAGAGG GAGCTGCTAG CAAGGTTCTT 600 

GGAAGGTTAA GAGCACTCTC AGATACAACT GAGGAACTCA CTGTGATCAA ATCCTCCCTG 660 

AAAGATGAAT ATCAGTACAG TTTTTGGGAT CTGTTTCGTT CAAAAGACAA CATGCGGACC 720 

CGAATAATGA TAGGACTAAC ACTAGTATTT TTTGTACAAA TCACTGGCCA ACCAAACATA 780 

TTGTTCTATG CATCAACTGT TTTGAAGTCA GTTGGATTTC AAAGCAATGA GGCAGCTAGC 840

CTCGCCTCCA CTGGGGTTGG AGTCGTCAAG GTCATTAGCA CCATCCCTGC CACTCTTCTT 900 

GTAGACCATG TCGGCAGCAA AACATTCCTC TGCATTGGCT CCTCTGTGAT GGCAGCTTCG 960

TTGGTGACCA TGGGCATCGT AAATCTCAAC ATCCACATGA ACTTCACCCA TATCTGCAGA 1020

AGCCACAATT CTATCAACCA GTCCTTGGAT GAGTCTGTGA TTTATGGACC AGGAAACCTG 1080

TCAACCAACA ACAATACTCT CAGAGACCAC TTCAAAGGGA TTTCTTCCCA TAGCAGAAGC 1140 

TCACTCATGC CCCTGAGAAA TGATGTGGAT AAGAGAGGGG AGACGACCTC AGCATCCTTG 1200

CTAAATGCTG GATTAAGCCA CACTGAATAC CAGATAGTCA CAGACCCTGG GGACGTCCCA 1260 

GCTTTTTTGA AATGGCTGTC CTTAGCCAGC TTGCTTGTTT ATGTTGCTGC TTTTTCAATT 1320 

GGTCTAGGAC CAATGCCCTG GCTGGTGCTC AGCGAGATCT TTCCTGGTGG GATCAGAGGA 1380 

CGAGCCATGG CTTTAACTTC TAGCATGAAC TGGGGCATCA ATCTCCTCAT CTCGCTGACA 1440 

TTTTTGACTG TAACTGATCT TATTGGCCTG CCATGGGTGT GCTTTATATA TACAATCATG 1500 

AGTCTAGCAT CCCTGCTTTT TGTTGTTATG TTTATACCTG AGACAAAGGG ATGCTCTTTG 1560 

GAACAAATAT CAATGGAGCT AGCAAAAGTG AACTATGTGA AAAACAACAT TTGTTTTATG 1620 

AGTCATCACC AAGAAGAATT AGTGCCAAAA CAGCCTCAAA AAAGAAAACC CCAGGAGCAG 1680 

CTCTTGGAGT GTAACAAGCT GTGTGGTAGG GGCCAATCCA GGCAGCTTTC TCCAGAGACC 1740
TAA 
Protein Accession #: AAL02327.1

1 11 21 31 41 51 

| | | | | |

MFTFLSSVTA AVSGLLVGYE LGIISGALLQ IKTLLALSCH EQEMWSSLV IGALLASLTG 
GVLIDRYGRR TAIILSSCLL GLGSLVLILS LSYTVLIVGR IAIGVSISLS SIATCVYIAE
IAPQHRRGLL VSLNELMIVI GILSAYISNY AFANVFHGWK YMFGLVIPLG VLQAIAMYFL 
PPSPRFLVMK GQEGAASKVL GRLRALSDTT EELTVIKSSL KDEYQYSFWD LFRSKDNMRT 
RIMIGLTLVF FVQITGQPNI LFYASTVLKS VGFQSNEAAS LASTGVGWK VISTIPATLL 
VDHVGSKTFL CIGSSVMAAS LVTHGIVNLN IHMNFTHICR SHKSINQSLD ESVIYGPGNL 
STNNNTLRDH FKGISSHSRS SLMPLRNDVD KRGETTSASL LNAGLSHTEY QIVTDPGDVP 
AFLKWLSLAS LLVYVAAFSI GLGPMPWLVL SEIFPGGIRG RAMALTSSMN WGINLLISLT 
FLTVTDLIGL PWVCFIYTIM SLASLLFWM FIPETKGCSL EQISMELAKV N 
SHHQEELVPK QPQKRKPQEQ LLECNKLCGR GQSRQLSPET 

Seq ID NO: 563 DNA sequence 



r CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 
CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GCGCCACAAG 
GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 
CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCCCGCTGG GGCGCCGGCT GCTCCCGGGC 
GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTCGCTCC TGGGCCTCGG CGGGCTGGAC
GCCGAGTGCG GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 
CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 
GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 
C AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 
T CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC 
TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT 
TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 
T ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC 
TGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT 
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 

Seq ID NO: 564 Protein sequence 



MEPRALVTAL SLCLSLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM 
PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF
LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDEWHL Ii
LGMAVAVLLC GCIVATVSFF WEESLTQHVA GLLFLMTGIF CTISLCTYAA S 
KLIYSLPADV EHGYSWSIFC AWCSLGFIVA AGGLCIAYPF ISRTKIAQLK SGRDSTV 



I I 



ACCCGGACCC TGTACTCCAG CGCGTCTCGG AGCACAGACT TGTCTTACAG TGAAAGCGAC 
TTGGTGAATT TTATTCAAGC AAATTTTAAG AAACGAGAAT GTGTCTTCTT TACCAAAGAT 
TCCAAGGCCA CGGAGAATGT GTGCAAGTGT GGCTATGCCC AGAGCCAGCA CATGGAAGGC 
ACCCAGATCA ACCAAAGTGA GAAATGGAAC TACAAGAAAC ACACCAAGGA ATTCCCTACC 
GACGCCTTTG GGGATATTCA GTTTGAGACA CTGGGGAAGA AAGGGAAGTA TATACGTCTG 
TCCTGCGACA CGGACGCGGA AATCCTTTAC GAGCTGCTGA CCCAGCACTG GCACCTGAAA 
ACACCCAACC TGGTCATTTC TGTGACCGGG GGCGCCAAGA ACTTCGCCCT GAAGCCGCGC 
ATGCGCAAGA TCTTCAGCCG GCTCATCTAC ATCGCGCAGT CCAAAGGTGC TTGGATTCTC 
ACGGGAGGCA CCCATTATGG CCTGATGAAG TACATCGGGG AGGTGGTGAG AGATAACACC 
ATCAGCAGGA GTTCAGAGGA GAATATTGTG GCCATTGGCA TAGCAGCTTG GGGCATGGTC 
TCCAACCGGG ACACCCTCAT CAGGAATTGC GATGCTGAGG GCTATTTTTT AGCCCAGTAC 
CTTATGGATG ACTTCACAAG AGATCCACTG TATATCCTGG ACAACAACCA CACACATTTG 
CTGCTCGTGG ACAATGGCTG TCATGGACAT CCCACTGTCG AAGCAAAGCT CCGGAATCAG
CTAGAGAAGT ATATCTCTGA GCGCACTATT CAAGATTCCA ACTATGGTGG CAAGATCCCC 



AGGTGGAGGA TGCCCTGACA TCTTCTGCCG TCAAGGAGAA G 



TTTTTACCCC GCACGGTGTC CCGGCTGCCT GAGGAGGAGA CTGAGAGTTG GATCAAATGG 
CTCAAAGAAA TTCTCGAATG TTCTCACCTA TTAACAGTTA TTAAAATGGA AGAAGCTGGG 
GATGAAATTG TGAGCAATGC CATCTCCTAC GCTCTATACA AAGCCTTCAG CACCAGTGAG 
CAAGACAAGG ATAACTGGAA TGGGCAGCTG AAGCTTCTGC TGGAGTGGAA CCAGCTGGAC 
iGATTTT CACCAATGAC CGCCGATGGG AGTCTGCTGA CCTTCAAGAA 
A CGGCTCTCAT AAAGGACAGA CCCAAGTTTG TCCGCCTCTT TCTGGAGAAT 
C TACGGAAGTT TCTCACCCAT GATGTCCTCA CTGAACTCTT CTCCAACCAC 
TTCAGCACGC TTGTGTACCG GAATCTGCAG ATCGCCAAGA ATTCCTATAA TGATGCCCTC 
CTCACGTTTG TCTGGAAACT GGTTGCGAAC TTCCGAAGAG GCTTCCGGAA GGAAGACAGA 
AATGGCCGGG ACGAGATGGA CATAGAACTC CACGACGTGT CTCCTATTAC TCGGCACCCC 
CTGCAAGCTC TCTTCATCTG GGCCATTCTT CAGAATAAGA AGGAACTCTC CAAAGTCATT 
TGGGAGCAGA CCAGGGGCTG CACTCTGGCA GCCCTGGGAG C
CTGGCCAAAG TGAAGAACGA CATCAATGCT GCTGGGGAGT CCGAGGAGCT G 

TACGAGACCC GGGCTGTTGA GCTGTTCACT GAGTGTTACA GCAGCGATGA AGACTTGGCA 1920 

GAACAGCTGC TGGTCTATTC CTGTGAAGCT TGGGGTGGAA GCAACTGTCT GGAGCTGGCG 1980 

GTGGAGGCCA CAGACCAGCA TTTCATCGCC CAGCCTGGGG TCCAGAATTT TCTTTCTAAG 2040 

G GAGAGATTTC CCGAGACACC AAGAACTGGA AGATTATCCT GTGTCTGTTT 2100 

T TGGTGGGCTG TGGCTTTGTA TCATTTAGGA AGAAACCTGT CGACAAGCAC 2160 

C TTTGGTACTA TGTGGCGTTC TTCACCTCCC CCTTCGTGGT CTTCTCCTGG 2220

AATGTGGTCT TCTACATCGC CTTCCTCCTG CTGTTTGCCT ACGTGCTGCT CATGGATTTC 2280 

CATTCGGTGC CACACCCCCC CGAGCTGGTC CTGTACTCGC TGGTCTTTGT CCTCTTCTGT 2340 

GATGAAGTGA GACAGTGGTA CGTAAATGGG GTGAATTATT TTACTGACCT GTGGAATGTG 2400 

ATGGACACGC TGGGGCTTTT TTACTTCATA GCAGGAATTG TATTTCGGCT CCACTCTTCT 2460 

AATAAAAGCT CTTTGTATTC TGGACGAGTC ATTTTCTGTC TGGACTACAT TATTTTCACT 2520 

CTAAGATTGA TCCACATTTT TACTGTAAGC AGAAACTTAG GACCCAAGAT TATAATGCTG 2580

CAGAGGATGC TGATCGATGT GTTCTTCTTC CTGTTCCTCT TTGCGGTGTG GATGGTGGCC 2640

TTTGGCGTGG CCAGGCAAGG GATCCTTAGG CAGAATGAGC AGCGCTGGAG GTGGATATTC 2700

CGTTCGGTCA TCTACGAGCC CTACCTGGCC ATGTTCGGCC AGGTGCCCAG TGACGTGGAT 2760 

GGTACCACGT ATGACTTTGC CCACTGCACC TTCACTGGGA ATGAGTCCAA GCCACTGTGT 2820 

GTGGAGCTGG ATGAGCACAA CCTGCCCCGG TTCCCCGAGT GGATCACCAT CCCCCTGGTG 2880 

TGCATCTACA TGTTATCCAC CAACATCCTG CTGGTCAACC TGCTGGTCGC CATGTTTGGC 2940

TACACGGTGG GCACCGTCCA GGAGAACAAT GACCAGGTCT GGAAGTTCCA GAGGTACTTC 3000

CTGGTGCAGG AGTACTGCAG CCGCCTCAAT ATCCCCTTCC CCTTCATCGT CTTCGCTTAC 3060 

TTCTACATGG TGGTGAAGAA GTGCTTCAAG TGTTGCTGCA AGGAGAAAAA CATGGAGTCT 3120 

TCTGTCTGCT GTTTCAAAAA TGAAGACAAT GAGACTCTGG CATGGGAGGG TGTCATGAAG 3180 

GAAAACTACC TTGTCAAGAT CAACACAAAA GCCAACGACA CCTCAGAGGA AATGAGGCAT 3240

CGATTTAGAC AACTGGATAC AAAGCTTAAT GATCTCAAGG GTCTTCTGAA AGAGATTGCT 3300 
AATAAAATCA AATGA 



| | | | | |

MSFRAARLSM RNRRNDTLDS TRTLYSSASR STDLSYSESD LVNFIQAKFK KRI 
SKATENVCKC GYAQSQHMEG TQINQSEKWN YKKHTKEFPT DAFGDIQFET LGKKGKYIRL 
SCDTDAKILY ELLTQHWHLK TPNLVISVTG GAKNFALKPR MRKIFSRLIY IAQSKGAWIL 
TGCTHYCLMK YICEWRDNT ISRSSEENIV AIGIAAWGMV SMRDTLIRNC DAEGYFLAQY 
LMDDFTRDPL YILDNNHTHL LLVDNGCHGH PTVEAKLRNQ LEKYISERTI QDSNYGGKIP 
EVCFAQGGGK ETLKMNTSI KNKIPCVWE GSGQIADVIA SLVEVSDALT SSAVKEKLVR 
FLPRTVSRLP EEETESWIKW LKEILECSHL LTVIKMEEAG DSIVSNAISY ALYKAFSTSE 
QDKDNWNGQL KLLLEWNQLD LANDEIFTND RRWESADLQE VMFTALIKDR PKFVRLFLEN 
GLNLRKFLTH DVLTELFSNH FSTLVYRNLQ IAKNSYNDAL LTFVWKLVAN FRRGFRKEDR 

NGRDEMCIIEL HDVSPITRHP LQALFIWAIL QNKKELSKVI WEQTRGCTLA A 

LAKVKNDINA AGESEELANE YETRAVELFT ECYSEDEDLA EQLLVYECEA » 
VEATDQHF1A QPGVQNFLSK QWYGEISRDT KNWKIILCLF IIPLVGC 
KKLLWYYVAF FTSPFWFSW NWFYIAFLL LFAYVLLMDF HSVPHP] 
DEVRQWYVNG VNYFTDLWNV MDTLGLFYFI AGIVFRLHSS NKSSLYSGRV IFCLDYIIFT 840 
LRLIHIFTVS RNLGPKIIML QRMLIDVFFF LFLFAVWMVA FGVARQGILR QNEQRWRWIF 900 
RSVIYEPYLA MFGQVPSDVD GTTYDFAHCT FTGNESKPLC VELDBHNLPR FPEW1TIPLV 960 
CIYMLSTNIL LVNLLVAMFG YTVGTVQENN DQVWKFQRYF LVQEYCSRLN IPFPFIVFAY 1020 
FYMWKKCFK CCCKEKNMES SVCCFKNEDN ETLAWEGVMK ENYLVKINTK ANDTSEEMRH 1080 
RFRQLDTKLN DLKGLLKEIA NKIK 

Seq ID NO: 567 DNA sequence 

Nucleic Acid Accession #: NM_006911.1 

Coding sequence: 1..558 



ATGCCTCGCC TGTTCTTGTT CCACCTGCTA GAATTCTGTT TACTACTGAA CCAATTTTCC 60 

AGAGCAGTCG CGGCCAAATG GAAGGACGAT GTTATTAAAT TATGCGGCCG CGAATTAGTT 120 

CGCGCGCAGA TTGCCATTTG CGGCATGAGC ACCTGGAGCA AAAGGTCTCT GAGCCAGGAA 180 

GATGCTCCTC AGACACCTAG ACCAGTGGCA GAAATTGTAC CATCCTTCAT CAACAAAGAT 240 

ACAGAAACTA TAATTATCAT GTTGGAATTC ATTGCTAATT TGCCACCGGA GCTGAAGGCA 300

GCCCTATCTG AGAGGCAACC ATCATTACCA GAGCTACAGC AGTATGTACC TGCATTAAAG 360 

GATTCCAATC TTAGCTTTGA AGAATTTAAG AAACTTATTC GCAATAGGCA AAGTGAAGCC 420 

GCAGACAGCA ATCCTTCAGA ATTAAAATAC TTAGGCTTGG ATACTCATTC TCAAAAAAAG 480 

AGACGACCCT ACGTGGCACT GTTTGAGAAA TGTTGCCTAA TTGGTTGTAC CAAAAGGTCT 540 

CTTGCTAAAT ATTGCTGA



MPRLFLFHLL EFCLLLNQFS RAVAAKWKDD VIKLCGRELV RAQIAICGMS TWSKRSLSQE 
DAPQTPRPVA EIVPSFINKD TETIIIMLEF IANLPPELKA ALSERQPSLP ELQQYVPALK
DSNLSFEEFK KLIRNRQSEA ADSNPSELKY LGLDTHSQKK RRPYVALFEK CCLIGCTKRS 
LAKYC 
| | | | | |

ATGCTGCCCG TGTACCAGGA GGTGAAGCCC AACCCGCTGC AGGACGCGAA CCTCTGCTCA 60

CGCGTGTTCT TCTGGTGGCT CAATCCCTTG TTTAAAATTG GCCATAAACG GAGATTAGAG 120

GAAGATGATA TGTATTCAGT GCTGCCAGAA GACCGCTCAC AGCACCTTGG AGAGGAGTTG 180 

CAAGGGTTCT GGGATAAAGA AGTTTTAAGA GCTGAGAATG ACGCACAGAA GCCTTCTTTA 240 

ACAAGAGCAA TCATAAAGTG TTACTGGAAA TCTTATTTAG TTTTGGGAAT TTTTACGTTA 300 

ATTGAGGAAA GTGCCAAAGT AATCCAGCCC ATATTTTTGG GAAAAATTAT TAATTATTTT 360

GAAAATTATG ATCCCATGGA TTCTGTGGCT TTGAACACAG CGTACGCCTA TGCCACGGTG 420

CTGACTTTTT GCACGCTCAT TTTGGCTATA CTGCATCACT TATATTTTTA TCACGTTCAG 480 

TGTGCTGGGA TGAGGTTACG AGTAGCCATG TGCCATATGA TTTATCGGAA GGCACTTCGT 540 

CTTAGTAACA TGGCCATGGG GAAGACAACC ACAGGCCAGA TAGTCAATCT GCTGTCCAAT 600

GATGTGAACA AGTTTGATCA GGTGACAGTG TTCTTACACT TCCTGTGGGC AGGACCACTG 660 

CAGGCGATCG CAGTGACTGC CCTACTCTGG ATGGAGATAG GAATATCGTG CCTTGCTGGG 720 

ATGGCAGTTC TAATCATTCT CCTGCCCTTG CAAAGCTGTT TTGGGAAGTT GTTCTCATCA 780 

CTGAGGAGTA AAACTGCAAC TTTCACGGAT GCCAGGATCA GGACCATGAA TGAAGTTATA 840 

ACTGGTATAA GGATAATAAA AATGTACGCC TGGGAAAAGT CATTTTCAAA TCTTATTACC 900

AATTTGAGAA AGAAGGAGAT TTCCAAGATT CTGAGAAGTT CCTGCCTCAG GGGGATGAAT 960

TTGGCTTCGT TTTTCAGTGC AAGCAAAATC ATCGTGTTTG TGACCTTCAC CACCTACGTG 1020

CTCCTCGGCA GTGTGATCAC AGCCAGCCGC GTGTTCGTGG CAGTGACGCT GTATGGGGCT 1080

GTGCGGCTGA CGGTTACCCT CTTCTTCCCC TCAGCCATTG AGAGGGTGTC AGAGGCAATC 1140

GTCAGCATCC GAAGAATCCA GACCTTTTTG CTACTTGATG AGATATCACA GCGCAACCGT 1200
CAGCTGCCGT CAGATGGTAA AA 



TTAGCTGTGG TCGGCCCCGT GGGAGCAGGG AAGTCATCAC TGTTAAGTGC CGTGCTCGGG 1380 

GAATTGGCCC CAAGTCACGG GCTGGTCAGC GTGCATGGAA GAATTGCCTA TGTGTCTCAG 1440 

CAGCCCTGGG TGTTCTCGGG AACTCTGAGG AGTAATATTT TATTTGGGAA GAAATACGAA 1500 

AAGGAACGAT ATGAAAAAGT CATAAAGGCT TGTGCTCTGA AAAAGGATTT ACAGCTGTTG 1560 

GAGGATGGTG ATCTGACTGT GATAGGAGAT CGGGGAACCA CGCTGAGTGG AGGGCAGAAA 1620 

GCACGGGTAA ACCTTGCAAG AGCAGTGTAT CAAGATGCTG ACATCTATCT CCTGGACGAT 1680 

CCTCTCAGTG CAGTAGATGC GGAAGTTAGC AGACACTTGT TCGAACTGTG TATTTGTCAA 1740 

ATTTTGCATG AGAAGATCAC AATTTTAGTG ACTCATCAGT TGCAGTACCT CAAAGCTGCA 1800 

AGTCAGATTC TGATATTGAA AGATGGTAAA ATGGTGCAGA AGGGGACTTA CACTGAGTTC 1860

CTAAAATCTG GTATAGATTT TGGCTCCCTT TTAAAGAAGG ATAATGAGGA AAGTGAACAA 1920 

CCTCCAGTTC CAGGAACTCC CACACTAAGG AATCGTACCT TCTCAGAGTC TTCGGTTTGG 1980 

T CTTCTACACC CTCCTTCAAA GATGGTGCTC TGGAGAGCCA AGATACAGAG 2040

G TTACACTATC AGAGGAGAAC CGTTCTGAAG GAAAAGTTGG TTTTCAGGCC 2100 

T ACTTCAGAGC TGGTGCTCAC TGGATTGTCT TCATTTTCCT TATTCTCCTA 2160

AACACTGCAG CTCAGGTTGC CTATCTGCTT CAAGATTGGT GGCTTTCATA CTGGGCAAAC 2220

AAACAAAGTA TGCTAAATGT CACTGTAAAT GGAGGAGGAA ATGTAACCGA GAAGCTAGAT 2280
CTTAACTGGT ACTTAGGAAT TTATTCAGGT TTAACTGTAG C 
GCAAGATCTC TATTGGTATT CTACGTCCTT GTTAACTCTT C 
ATGTTTGAGT CAATTCTGAA AGCTCCGGTA TTATTCTTTG ATAGAAATCC A 
ATTTTAAATC GTTTCTCCAA AGACAT1 
TTAGATTTCA TCCAGACATT GCTACAAGTG <3 
ATTCCTTGGA TCGCAATACC CTTGGTTCCC CTTGGAATCA T 
TATTTTTTGG AAACGTCAAG AGATGTGAAG CGCCTGGAAT C 

TTTTCCCACT TGTCATCTTC TCTCCAGGGG CTCTGGACCA TCCGGGCATA CAAAGCAGAA 2760 

GAGAGGTGTC AGGAACTGTT TGATGCACAC CAGGATTTAC ATTCAGAGGC TTGGTTCTTG 2820

TTTTTGACAA CGTCCCGCTG GTTCGCCGTC CGTCTGGATG CCATCTGTGC CATGTTTGTC 2880 

ATCATCGTTG CCTTTGGGTC CCTGATTCTG GCAAAAACTC TGGATGCCGG GCAGGTTGGT 2940

TTGGCACTGT CCTATGCCCT CACGCTCATG GGGATGTTTC AGTGGTGTGT TCGACAAAGT 3000 

GCTGAAGTTG AGAATATGAT GATCTCAGTA GAAAGGGTCA TTGAATACAC AGACCTTGAA 3060 

AAAGAAGCAC CTTGGGAATA TCAGAAACGC CCACCACCAG CCTGGCCCCA TGAAGGAGTG 3120 

ATAATCTTTG ACAATGTGAA CTTCATGTAC AGTCCAGGTG GGCCTCTGGT ACTGAAGCAT 3180 

CTGACAGCAC TCATTAAATC ACAAGAAAAG GTTGGCATTG TGGGAAGAAC CGGAGCTGGA 3240

AAAAGTTCCC TCATCTCAGC CCTTTTTAGA TTGTCAGAAC CCGAAGGTAA AATTTGGATT 3300

GATAAGATCT TGACAACTGA AATTGGACTT CACGATTTAA GGAAGAAAAT GTCAATCATA 3360 

CCTCAGGAAC CTGTTTTGTT CACTGGAACA ATGAGGAAAA ACCTGGATCC CTTTAATGAG 3420

CACACGGATG AGGAACTGTG GAATGCCTTA CAAGAGGTAC AACTTAAAGA AACCATTGAA 3480 

GATCTTCCTG GTAAAATGGA TACTGAATTA GCAGAATCAG GATCCAATTT TAGTGTTGGA 3540

CAAAGACAAC TGGTGTGCCT TGCCAGGGCA ATTCTCAGGA AAAATCAGAT ATTGATTATT 3600

GATGAAGCGA CGGCAAATGT GGATCCAAGA ACTGATGAGT TAATACAAAA AAAAATCCGG 3660 

GAGAAATTTG CCCACTGCAC CGTGCTAACC ATTGCACACA GATTGAACAC CATTATTGAC 3720
AGCGACAAGA TAATGGTTTT AGATTC 



GCCGCTGCCC TCACTGAAAC AGCAAAACAG GTATACTTCA AAAGAAATTA TCCACATATT 3900
GGTCACACTG ACCACATGGT TACAAACACT TCCAATGGAC AGCCCTCGAC CTTAACTATT 3960
TTCGAGACAG CACTGTGA 



1 11 21 31 41 51 

I I I I I I 

MLPVYQEVKP NPLQDANLCS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL
QGFWDKEVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF 
ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGMRLRVAM CHMIYRKALR 
LSNMAMGKTT TGQIVNLLSN DVNKFDQVTV FLHFLWAGPL Q 
MAVLIILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNEVI T 
NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV L 
VRLTVTLFFP SAIERVSEAI VSIRRIQTFL LLDEISQRNR QLPSDGKKMV HVQDFTAFWD 420

KASETPTLQG LSFTVRPGEL LAVVGPVGAG KSSLLSAVLG ELAPSHGLVS VHGRIAYVSQ 480

QPWVFSGTLR SNILFGKKYE KERYEKVIKA CALKKDLQLL EDGDLTVIGD RGTTLSGGQK 540

ARVNLARAVY QDADIYLLDD PLSAVDAEVS RHLFELCICQ ILHEKITILV THQLQYLKAA 600

SQILILKDGK MVQKGTYTEF LKSGIDFGSL LKKDNEESEQ PPVPGTPTLR NRTFSESSVW 660

SQQSSRPSLK DGALESQDTE NVPVTLSEEN RSEGKVGFQA YKNYFRAGAH WIVFIFLILL 720

NTAAQVAYVL QDWWLSYWAN KQSMLNVTVN GGGNVTEKLD LNWYLGIYSG LTVATVLFGI 780

ARSLLVFYVL VNSSQTLHNK MFESILKAPV LFFDRNPIGR ILNRFSKDIG HLDDLLPLTF 840

LDFIQTLLQV VGVVSVAVAV IPWIAIPLVP LGIIFIFLRR YFLETSRDVK RLESTTRSPV 900

FSHLSSSLQG LWTIRAYKAE ERCQELFDAH QDLHSEAWFL FLTTSRWFAV RLDAICAMFV 960

IIVAFGSLIL AKTLDAGQVG LALSYALTLM GMFQWCVRQS AEVENMMISV ERVIEYTDLE 1020

KEAPWEYQKR PPPAWPHEGV IIFDNVNFMY SPGGPLVLKH LTALIKSQEK VGIVGRTGAG 1080

KSSLISALFR LSEPEGKIWI DKILTTEIGL HDLRKKMSII PQEPVLFTGT MRKNLDPFNE 1140

HTDEELWNAL QEVQLKETIE DLPGKMDTEL AESGSNFSVG QRQLVCLARA ILRKNQILII 1200

DEATANVDPR TDELIQKKIR EKFAHCTVLT IAHRLNTIID SDKIMVLDSG RLKEYDEPYV 1260 

LLQNKESLFY KMVQQLGKAE AAALTETAKQ VYFKRNYPHI GHTDHMVTNT SNGQPSTLTI 1320 

Seq ID NO: 571 DNA sequence



Coding sequence: 116..4093

1 11 21 31 41 51

GGACAGGCGT GGCGGCCGGA GCCCCAGCAT CCCTGCTTGA GGTCCAGGAG CGGAGCCCGC 60

GGCCACCGCC GCCTGATCAG CGCGACCCCG GCCCGCGCCC GCCCCGCCCG GCAAGATGCT 120 

GCCCGTGTAC CAGGAGGTGA AGCCCAACCC GCTGCAGGAC GCGAACATCT GCTCACGCGT 180 

GTTCTTCTGG TGGCTCAATC CCTTGTTTAA AATTGGCCAT AAACGGAGAT TAGAGGAAGA 240

TGATATGTAT TCAGTGCTGC CAGAAGACCG CTCACAGCAC CTTGGAGAGG AGTTGCAAGG 300 

GTTCTGGGAT AAAGAAGTTT TAAGAGCTGA GAATGACGCA CAGAAGCCTT CTTTAACAAG 360

AGCAATCATA AAGTGTTACT GGAAATCTTA TTTAGTTTTG GGAATTTTTA CGTTAATTGA 420 

GGAAAGTGCC AAAGTAATCC AGCCCATATT TTTGGGAAAA ATTATTAATT ATTTTGAAAA 480 

TTATGATCCC ATGGATTCTG TGGCTTTGAA CACAGCGTAC GCCTATGCCA CGGTGCTGAC 540 

TTTTTGCACG CTCATTTTGG CTATACTGCA TCACTTATAT TTTTATCACG TTCAGTGTGC 600

TGGGATGAGG TTACGAGTAG CCATGTGCCA TATGATTTAT CGGAAGGCAC TTCGTCTTAG 660

TAACATGGCC ATGGGGAAGA CAACCACAGG CCAGATAGTC AATCTGCTGT CCAATGATGT 720 

GAACAAGTTT GATCAGGTGA CAGTGTTCTT ACACTTCCTG TGGGCAGGAC CACTGCAGGC 780

CATCCCACTC ACTGCCCTAC TCTGGATGGA GATAGGAATA TCGTGCCTTG CTGGGATGGC 840

AGTTCTAATC ATTCTCCTGC CCTTGCAAAG CTGTTTTGGG AAGTTGTTCT CATCACTGAG 900

GAGTAAAACT GCAACTTTCA CGGATGCCAG GATCAGGACC ATGAATGAAG TTATAACTGG 960

TATAAGGATA ATAAAAATGT ACGCCTGGGA AAAGTCATTT TCAAATCTTA TTACCAATTT 1020 

GAGAAAGAAG GAGATTTCCA AGATTCTGAG AAGTTCCTGC CTCAGGGGGA TGAATTTCGC 1080

TTCGTTTTTC AGTGCAAGCA AAATCATCGT GTTTGTGACC TTCACCACCT ACGTGCTCCT 1140 

CGGCAGTGTG ATCACAGCCA GCCGCGTGTT CGTGGCAGTG ACGCTGTATG GGGCTGTGCG 1200

GCTGACGGTT ACCCTCTTCT TCCCCTCAGC CATTCAGAGG GTGTCAGAGG CAATCGTCAG 1260

CATCCGAAGA ATCCAGACCT TTTTGCTACT TGATGAGATA TCACAGCGCA ACCGTCAGCT 1320

GCCGTCAGAT GGTAAAAAGA TGGTGCATGT GCAGGATTTT ACTGCTTTTT GGGATAAGGC 1380

ATCAGAGACC CCAACTCTAC AAGGCCTTTC CTTTACTGTC AGACCTGGCG AATTGTTAGC 1440 

TGTGGTCGGC CCCGTGGGAG CAGGGAAGTC ATCACTGTTA AGTGCCGTGC TCGGGGAATT 1500 

GGCCCCAAGT CACGGGCTGG TCAGCGTGCA TGGAAGAATT GCCTATGTGT CTCAGCAGCC 1560

CTGGGTGTTC TCGGGAACTC TGAGGAGTAA TATTTTATTT GGGAAGAAAT ATGAAAAGGA 1620 

ACGATATGAA AAAGTCATAA AGGCTTGTGC TCTGAAAAAG GATTTACAGC TGTTGGAGGA 1680 

TGGTGATCTG ACTGTGATAG GAGATCGGGG AACCACGCTG AGTGGAGGGC AGAAAGCACG 1740 

GGTAAACCTT GCAAGAGCAG TGTATCAAGA TGCTGACATC TATCTCCTGG ACGATCCTCT 1800

CAGTGCAGTA GATGCGGAAG TTAGCAGACA CTTGTTCGAA CTGTGTATTT GTCAAATTTT 1860

GCATGAGAAG ATCACAATTT TAGTGACTCA TCAGTTGCAG TACCTCAAAG CTGCAAGTCA 1920 

GATTCTGATA TTGAAAGATG GTAAAATGGT GCAGAAGGGG ACTTACACTG AGTTCCTAAA 1980 

ATCTGGTATA GATTTTGGCT CCCTTTTAAA GAAGGATAAT GAGGAAAGTG AACAACCTCC 2040

AGTTCCAGGA ACTCCCACAC TAAGGAATCG TACCTTCTCA GAGTCTTCGG TTTGGTCTCA 2100

ACAATCTTCT AGACCCTCCT TGAAAGATGG TGCTCTGGAG AGCCAAGATA CAGAGAATGT 2160

CCCAGTTACA CTATCAGAGG AGAACCGTTC TGAAGGAAAA GTTGGTTTTC AGGCCTATAA 2220

GAATTACTTC AGAGCTGGTG CTCACTGGAT TGTCTTCATT TTCCTTATTC TCCTAAACAC 2280 

TGCAGCTCAG GTTGCCTATG TGCTTCAAGA TTGGTGGCTT TCATACTGGG CAAACAAACA 2340 

AAGTATGCTA AATGTCACTG TAAATGGAGG AGGAAATGTA ACCGAGAAGC TAGATCTTAA 2400 
CTGGTACTTA GGAATTTATT CAGGTTTAAC TGTAGCTACC G
ATCTCTATTG GTATTCTACG TCCTTGTTAA C 



\ TTGGACACTT GGATGATTTG CTGCCGCTGA CGTTTTTAGA 2640 

TTTCATCCAG ACATTGCTAC AAGTGGTTGG TGTGGTCTCT GTGGCTGTGG CCGTGATTCC 2700 

TTGGATCGCA ATACCCTTGG TTCCCCTTGG AATCATTTTC ATTTTTCTTC GGCGATATTT 2760

TTTGGAAACG TCAAGAGATG TGAAGCGCCT GGAATCTACA ACTCGGAGTC CAGTGTTTTC 2820 

CCACTTGTCA TCTTCTCTCC AGGGGCTCTG GACCATCCGG GCATACAAAG CAGAAGAGAG 2880

GTGTCAGGAA CTGTTTGATG CACACCAGGA TTTACATTCA GAGGCTTGGT TCTTGTTTTT 2940
_ GACAACGTCC CGCTGGTTCG CCGTCCGTCT GGATGCCATC 

CGTTGCCTTT GGGTCCCTGA TTCTGGCAAA AACTCTGGAT GCCGGGCAGG TTGGTTTGGC 3060

ACTGTCCTAT GCCCTCACGC TCATGGGGAT GTTTCAGTGG TGTGTTCGAC AAAGTGCTGA 3120
AGTTGAGAAT ATGATGATCT CAGTAGAAAG GGTCATTGAA T. 
AGCACCTTGG GAATATCAGA AACGCCCACC ACCAGCCTGG C 

CTTTGACAAT GTGAACTTCA TGTACAGTCC AGGTGGGCCT CTGGTACTGA AGCATCTGAC : 

TTCCCTCATC TCAGCCCTTT TTAGATTGTC AGAACCCGAA G 
GATCTTGACA ACTGAAATTG GACTTCACGA TTTAAGGAAG AAAATGTCAA T

GGAACCTGTT TTGTTCACTG GAACAATGAG GAAAAACCTG GATCCCTTTA AGGAGCACAC 3540

GGATGAGGAA CTGTGGAATG CCTTACAAGA GGTACAACTT AAAGAAACCA TTGAAGATCT 3600 
ATTTGCCCAC TGCACCGTGC TAACCATTGC ACACAGATTG AACACCATTA TTGACAGCGA 3840

CAAGATAATG GTTTTAGATT CAGGAAGACT GAAAGAATAT GATGAGCCGT ATGTTTTGCT 3900 

GCAAAATAAA GAGAGCCTAT TTTACAAGAT GGTGCAACAA CTGGGCAAGG CAGAAGCCGC 3960

TGCCCTCACT GAAACAGCAA AACAGGTATA CTTCAAAAGA AATTATCCAC ATATTGGTCA 4020

CACTGACCAC ATGGTTACAA ACACTTCCAA TGGACAGCCC TCGACCTTAA CTATTTTCGA 4080 

GACAGCACTG TGAATCCAAC CAAAATGTCA AGTCCGTTCC GAAGGCATTT TCCACTAGTT 4140 

TT TTTTACTTTG GCAACAAATA TTTATACATA 4200 



Protein Accession #: AAC27076.1

I I I I I I

MLPVYQEVKP NPLQDANICS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL
QGFWDKEVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF
ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGMRLRVAM CHMIYRKALR
LSNMAMGKTT TGQIVNLLSN DVNKFDQVTV FLHFLWAGPL QAIAVTALLW MEIGISCLAG
MAVLIILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNEVI TGIRIIKMYA WEKSFSNLIT
NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV LLGSVITASR VFVAVTLYGA
VRLTVTLFFP SAIERVSEAI VSIRRIQTFL LLDEISQRNR QLPSDGKKMV HVQDFTAFWD
KASETPTLQG LSFTVRPGEL LAVVGPVGAG KSSLLSAVLG ELAPSHGLVS VHGRIAYVSQ
QPWVFSGTLR SNILFGKKYE KERYEKVIKA CALKKDLQLL EDGDLTVIGD RGTTLSGGQK 
ARVNLARAVY QDADIYLLDD PLSAVDAEVS RHLFELCICQ ILHEKITILV THQLQYLKAA 
SQILILKDGK MVQKGTYTEF LKSGIDFGSL LKKDNEESEQ PPVPGTPTLR NRTFSESSVW
SQQSSRPSLK DGALESQDTE NVPVTLSEEN RSEGKVGFQA YKNYFRAGAH WIVFIFLILL 
NTAAQVAYVL QDWWLSYWAN KQSMLNVTVN GGGNVTEKLD LNWYLGIYSG L 
ARSLLVFYVL VNSSQTLHNK MFESILKAPV LFFDRNPIGR ILNRFSKDIG H 
LDFIQTLLQV VGVVSVAVAV IPWIAIPLVP LGIIFIFLRR YFLETSRDVK R
FSHLSSSLQG LWTIRAYKAE ERCQELFDAH QDLHSEAWFL FLTTSRWFAV R 
IIVAFGSLIL AKTLDAGQVG LALSYALTLM GMFQWCVRQS AEVENMMISV E 
KEAPWEYQKR PPPAWPHEGV IIFDNVNFMY SPGGPLVLKH LTALIKSQEK VGIVGRTGAG 1080
KSSLISALFR LSEPEGKIWI DKILTTEIGL HDLRKKMSII PQEPVLFTGT MRKNLDPFKE 1140 
HTDEELWNAL QEVQLKETIE DLPGKMDTEL AESGSNFSVG QRQLVCLARA ILRKHQILII 1200 
DEATANVDPR TDELIQKKIR EKFAHCTVLT IAHRLNTIID SDKIMVLDSG RLKEYDEPYV 1260
LLQNKESLFY KMVQQLGKAE AAALTETAKQ VYFKRNYPHI GHTDHMVTNT SNGQPSTLTI 1320



I I I I I 

A TCTCTATGAT GGGAAGCCCT AAGAGCCTTA GTGAAACTTG TTTACCTAAT 60 

GGCATAAATG GTATCAAAGA TGCAAGGAAG GTCACTGTAG GTGTGATTGG AAGTGGAGAT 120

TTTGCCAAAT CCTTGACCAT TCGACTTATT AGATGCGGCT ATCATGTGGT CATAGGAAGT 180 

AGAAATCCTA AGTTTGCTTC TGAATTTTTT CCTCATGTGG TAGATGTCAC TCATCATGAA 240 

GATGCTCTCA CAAAAACAAA TATAATATTT GTTGCTATAC ACAGAGAACA TTATACCTCC 300 

CTGTGGGACC TGAGACATCT GCTTGTGGGT AAAATCCTGA TTGATGTGAG CAATAACATG 360 

AGGATAAACC AGTACCCAGA ATCCAATGCT GAATATTTGG CTTCATTATT CCCAGATTCT 420 

TTGATTGTCA AAGGATTTAA TGTTGTCTCA GCTTGGGCAC TTCAGTTAGG ACCTAAGGAT 480 

GCCAGCCGGC AGGTTTATAT ATGCAGCAAC AATATTCAAG CGCGACAACA GGTTATTGAA 540 

CTTGCCCGCC AGTTGAATTT CATTCCCATT GACTTGGGAT CCTTATCATC AGCCAGAGAG 600 

ATTGAAAATT TACCCCTACG ACTCTTTACT CTCTGGAGAG GGCCAGTGGT GGTAGCTATA 660 

AGCTTGGCCA CATTTTTTTT CCTTTATTCC TTTGTCAGAG ATGTGATTCA TCCATATGCT 720 

AGAAACCAAC AGAGTGACTT TTACAAAATT CCTATAGAGA TTGTGAATAA AACCTTACCT 780

ATAGTTGCCA TTACTTTGCT CTCCCTAGTA TACCTCGCAG GTCTTCTGGC AGCTGCTTAT 840 

CAACTTTATT ACGGCACCAA GTATAGGAGA TTTCCACCTT GGTTGGAAAC CTGGTTACAG 900 

TGTAGAAAAC AGCTTGGATT ACTAAGTTTT TTCTTCGCTA TGGTCCATGT TGCCTACAGC 960 

CTCTGCTTAC CGATGAGAAG GTCAGAGAGA TATTTGTTTC TCAACATGGC TTATCAGCAG 1020 

GTTCATGCAA ATATTGAAAA CTCTTGGAAT GAGGAAGAAG TTTGGAGAAT TGAAATGTAT 1080 

ATCTCCTTTG GCATAATGAG CCTTGGCTTA CTTTCCCTCC TGGCAGTCAC TTCTATCCCT 1140 

TCAGTGAGCA ATGCTTTAAA CTGGAGAGAA TTCAGTTTTA TTCAGTCTAC ACTTGGATAT 1200 

GTCGCTCTGC TCATAAGTAC TTTCCATGTT TTAATTTATG GATGGAAACG AGCTTTTGAG 1260 

GAAGAGTACT ACAGATTTTA TACACCACCA AACTTTGTTC TTGCTCTTGT TTTGCCCTCA 1320 
ATTGTAATTC TGGATCTTTT GCAGCTTTGC AGATACCCAG ACTGA 



MESISMMGSP KSLSETCLPN GINGIKDARK VTVGVIGSGD FAKSLTIRLI RCGYHVVIGS



RINQYPESNA EYLASLFPDS LIVKGFNVVS AWALQLGPKD ASRQVYICSN NIQARQQVIE
LARQLNFIPI DLGSLSSARE IENLPLRLFT LWRGPVVVAI SLATFFFLYS FVRDVIHPYA
RNQQSDFYKI PIEIVNKTLP IVAITLLSLV YLAGLLAAAY QLYYGTKYRR FPPWLETWLQ
CRKQLGLLSF FFAMVHVAYS LCLPMRRSER YLFLNMAYQQ VHANIENSWN EEEVWRIEMY
ISFGIMSLGL LSLLAVTSIP SVSNALNWRE FSFIQSTLGY VALLIST
EEYYRFYTPP NFVLALVLPS IVILDLLQLC RYPD
Seq ID NO: 575 DNA sequence

Nucleic Acid Accession # : NM_001873.1 

Coding sequence: 3.. 1721 



        G CCCGTCTCTC CGCCGGCCCC CTGCCTCGCA GTGGTTTCTC CTGCAGCTCC
CCTGGGCTCC GCGGCCAGTA GTGCAGCCCG TGGAGCCGCG GCTTTGCCCG TCTCCTCTGG
GTGGCCCCAG TGCGCGGGCT GACACTCATT CAGCCGGGGA AGGTGAGGCG AGTAGAGGCT
GGTGCGGAAC TTGCCGCCCC CAGCAGCGCC GGCGGGCTAA GCCCAGGGCC GGGCAGACAA
AAGAGGCCGC CCGCGTAGGA AGGCACGGCC GGCGGCGGCG GAGCGCAGCG ATGGCCGGGC
GAGGGGGCAG CGCGCTGCTG GCTCTGTGCG GGGCACTGGC TGCCTGCGGG TGGCTCCTGG 
GCGCCGAAGC CCAGGAGCCC GGGGCGCCCG CGGCGGGCAT GAGGCGGCGC CGGCGGCTGC 
AGCAAGAGGA CGGCATCTCC TTCGAGTACC ACCGCTACCC CGAGCTGCGC GAGGCGCTCG 
TGTCCGTGTG GCTGCAGTGC ACCGCCATCA GCAGGATTTA CACGGTGGGG CGCAGCTTCG 
AGGGCCGGGA GCTCCTGGTC ATCGAGCTGT CCGACAACCC TGGCGTCCAT GAGCCTGGTG 
AGCCTGAATT TAAATACATT GGGAATATGC ATGGGAATGA GGCTGTTGGA CGAGAACTGC 
TCATTTTCTT GGCCCAGTAC CTATGCAACG AATACCAGAA GGGGAACGAG ACAATTGTCA 
ACCTGATCCA CAGTACCCGC ATTCACATCA TGCCTTCCCT GAACCCAGAT GGCTTTGAGA
AGGCAGCGTC TCAGCCTGGT GAACTCAAGG ACTGGTTTGT GGGTCGAAGC AATGCCCAGG 
T GAACCGGAAC TTTCCAGACC TGGATAGGAT AGTGTACGTG AATGAGAAAG 
A ATATGAAGAA A 



CTGCCAATCT CCATGGAGGA GACCTTGTGG CCAATTATCC ATATGATGAG ACGCGGAGTG 
GTAGTGCTCA CGAATACAGC TCCTCCCCAG ATGACGCCAT TTTCCAAAGC TTGGCCCGGG 
CATACTCTTC TTTCAACCCG GCCATGTCTG ACCCCAATCG GCCACCATGT C 



^ACAAAAACT 1380 



AAGGTAACCC AATTGCGAAT G 

CCGCAAAGGA TGGTGATTAC TGGAGATTGC TTATACCTGG AAACTATAAA CTTACAGCCT 
CAGCTCCAGG CTATCTGGCA A 
GGGTTGATTT TGAACTGGAG T 

TGGAATGGTG GAAAATGATG TCAGAAACTT TAAATTTTTA AAAAGGCTTC TAGTTAGCTG 
CTTTAAATCT ATCTATATAA T 
CAGTTAATAC TTAACATTGA TTTATTTTTT A 

AAATAAATAG CCTCTTAGGT AAAAATATAA GAACTTGATA TATTTCATTC T 

TATTCATTTT CCTACCTATA TTACACAAAA AAGTATAGAA AAGATTTAAG TAATTTTGCC 1980

ATCCTAGGCT TAAATGCAAT ATTCCTGGTA TTATTTACAA TGCAGAATTT TTTGAGTAAT 2040 

TCTAGCTTTC AAAAATTAGT GAAGTTCTTT TACTGTAATT GGTGACAATG TCACATAATG 2100

AATGCTATTG AAAAGGTTAA CAGATACAGC TCGGAGTTGT GAGCACTCTA CTGCAAGACT 2160

TAAATAGTTC AGTATAAATT GTCGTTTTTT TCTTGTGCTG ACTAACTATA AGCATGATCT 2220

TCTTAATGCA TTTTTGATGG CAAGAAAAGG TACATGTTTA CAAAGAGGTT TTATGAAAAG 2280

AATAAAAATT GACTTCTTGC TTGTACATAT AGGAGCAATA CTATTATATT ATGTAGTCCG 2340 

TTAACACTAC TTAAAAGTTT AGGGTTTTCT CTTGGTTGTA GAGTGGCCCA GAATTGCATT 2400 
CTGAATGAAT AAAGGTTAAA AAAAAATCCC CAGTGAAAAA AAA 



MAGRGGSALL ALCGALAACG WLLGAEAQEP GAPAAGMRRR RRLQQEDGIS FEYHRYPELR
EALVSVWLQC TAISRIYTVG RSFEGRELLV IELSDNPGVH EPGEPEFKYI GNMHGNEAVG 
RELLIFLAQY LCNEYQKGNE TIVNLIHSTR IHIMPSLNPD GFEKAASQPG ELKDWFVGRS 
NAQGIDLNRN FPDLDRIVYV NEKEGGPNNH LLKNMKKIVD QKTKLAPETK AVIHWIMDIP 
FVLSANLHGG DLVANYPYDE TRSGSAHEYS SSPDDAIFQS LARAYSSFNP AMSDPNRPPC 
RKNDDDSSFV DGTTNGGAWY SVPGGMQDFN YLSSNCFEIT VELSCEKFPP EETLKTYWED 
NKNSLISYLE QIHRGVKGFV RDLQGNPIAN ATISVEGIDH DVTSAKDGDY WRLLIPGNYK 
LTASAPGYLA ITKKVAVPYS PAAGVDFELE S 



I I I I I I 



TTCGACAAGA GTGATGAGAA GGAGTGCCCC AAGGCTAAGT CGAAATGTGG CCCGACCTTC 
TTCCCCTGTG CCAGCGGCAT CCATTGCATC ATTGGTCGCT TCCGGTGCAA TGGGTTTGAG 
GACTGTCCCG ATGGCAGCGA TGAAGAGAAC TGCACAGCAA ACCCTCTGCT TTGCTCCACC 
GCCCGCTACC ACTGCAAGAA CGGCCTCTGT ATTGACAAGA GCTTCATCTG CGATGGACAG 
AATAACTGTC AAGACAACAG TGATGAGGAA AGCTGTGAAA GTTCTCAAGA ACCCGGCAGT
C AGAGAACCAA CTTGTGTATT ACCCCAGCAT CACCTATGCC 
G GTGC-TGGCCC TGCTGGCACT GGTCTTGCAC 
3 CTGCCCGTGC ACCGGCTGCA GCACCCTGTG 
CTGCTGTCCC GCCTGGTGGT CCTGGACCAC CCCCACCACT GCAACGTCAC C 
AATAATGGCA TCCAGTATGT GGCCAGCCAG GCGGAGCAGA ATGCGTCGGA A 
CCACCCTCCT ACTCCGAGGC CTTGCTGGAC CAGAGGCCTG CGTGGTATGA CCTTCCTCCA 
CCGCCCTACT CTTCTGACAC GGAATCTCTG AACCAAGGCG ACCTGCCCCC CTACCGCTCC 
CGGTCCGGGA GTGCCAACAG TGCCAGCTCC CAGGCAGCCA GCAGCCTCCT G 
GACACCAGCC ACAGCCCGGG GCAGCCTGGC CCCCAGGAGG GCACTGCTGA G 
H CTANPLLCST ARYHCKNGLC IDKSFICDGQ NNCQDNSDEE SCESSQEPGS 120

GQVFVTSENQ LVYYPSITYA IIGSSVIFVL VVALLALVLH HQRKRNNLMT LPVHRLQHPV 180

LLSRLVVLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP 240

PPYSSDTESL NQGDLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD 300






ft GCTCGCTGCT 

TTAAGAATTG GAGACACTGT GACTTGCGTC TGTCAGTTCA AGTGCAACAA T






GTCATGTCTT TGGGTCGATG 



A TGCAGGAGCC ATCTTGCAGG TGTGATGCTG GTTATACTGG ACAACACTGT 900

T TCTATACGTT GTTCCCGGTC CTGTACGATT TCAGTATGTC 960 

^ATTCAG ATTGCTGTCA TCTGTGTGGT GGTCCTCTGC 1020 

ATCACAAGGA AATGCCCCAG AAGCAACAGA ATTCACAGAC AGAAGCAAAA TACAGGGCAC 1080 
TACAGTTCAG ACAATACAAC AAGAGCGTCC ACGAGGTTAA TCTGA 



I I I I I I

MVLWESPRQC SSWTLCEGFC WLLLLPVMLL IVARPVKLAA FPTSLSDCQT PTGWNCSGYD 60

DRENDLFLCD TNTCKFDGEC LRIGDTVTCV CQFKCNNDYV PVCGSNGESY QNECYLRQAA 120 

CKQQSEILW SEGSCATDAG SGSGDGVHEG SGETSQKETS TCDICQFGAE CDEDAEDVWC 180 

VCNIDCSQTN FNPLCASDGK SYDNACQIKE ASCQKQEKIE VMSLGRCQDN TTTTTKSEDG 240

HYARTDYAEN ANKLEESARE HHIPCPEHYN GFCMHGKCEH SINMQEPSCR CDAGYTGQHC 300

EKKDYSVLYV VPGPVRFQYV LIAAVIGTIQ IAVICVVVLC ITRKCPRSNR IHRQKQNTGH 360






Seq ID NO: 581 DNA sequence 
Nucleic Acid Accession #: S782I
Coding sequence: 1..2190



ATGAATCCTT TCCAGAAAAA TGAGTCCAAG GAAACTCTTT TTTCACCTGT CTCCATTGAA 60

GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120 

AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180

TATGGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240 



3TTTGAAG AAAAACATGC AGAGGAACGG 
ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT
h TGTTTTGGAG AAGACTGCTA TGCATTGGCT 
C CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT G 
ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC A 
TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 
;tcatta TGGATGTAAA GGCACTGACC 
T TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 
TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 
CCGGACCAGA TGCAGGTTCT A 
TTTGTCATTT ATCGTCTGGT C 

GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG CGGCAGCTGT
ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 
CTGGCAGATG ATGAGG 1 ] 

GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT G. 
AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 
GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC
ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560

AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 1620 

GAAGACTATG GTGTGTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 1680 

TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740

TATCTGTTTG TTATTACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 1800 

ATTCCAGCCA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860

GGGGAGGTCA TGTTCTCTGT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920 

ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 1980 

CTTGTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040 

CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100

ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160 
AAACTAGAGA CCAAGAAGAC AAAACTCTGA 

Seq ID NO: 582 Protein sequence 



MNPFQKNESK ETLFSPVSIE EVPPRPPSPP KKPSPTICGS NYPLSIAFIV VNEFCERFSY
YGMKAVLILY FLYFLHWNED TSTSIYHAFS SLCYFTPILG AAIADSWLGK FKTIIYLSLV
YVLGHVIKSL GALPILGGQV VHTVLSLIGL SLIALGTGGI KPCVAAFGGD QFEEKHAEER
TRYFSVFYLS INAGSLISTF ITPMLRGDVQ CFGEDCYALA FGVPGLLMVI ALVVFAMGSK
IYNKPPPEGN IVAQVFKCIW FAISNRFKNR SGDIPKRQHW LDWAAEKYPK QLIMDVKALT
RVLFLYIPLP MFWALLDQQG SRWTLQAIRM NRNLGFFVLQ PDQMQVLNPF LVLIFIPLFD
FVIYRLVSKC GINFSSLRKM AVGMILACLA FAVAAAVEIK INEMAPAQSG PQEVFLQVLN
LADDEVKVTV VGNENNSLLI ESIKSFQKTP HYSKLHLKTK SQDFHFHLKY HNLSLYTEHS
VQEKNWYSLV IREDGNSISS MMVKDTESKT TNGMTTVRFV NTLHKDVNIS LSTDTSLNVG
EDYGVSAYRT VQRGEYPAVH CRTEDKNFSL NLGLLDFGAA YLFVITNNTN QGLQAWKIED
IPANKMSIAW QLPQYALVTA GEVMFSVTGL EFSYSQAPSS MKSVLQAAWL LTIAVGNIIV
LVVAQFSGLV QWAEFILFSC LLLVICLIFS IMGYYYVPVK TEDMRGPADK HIPHIQGNMI
KLETKKTKL 

Seq ID NO: 583 DNA sequence 

Nucleic Acid Accession #: NM_032642.1 

Coding sequence: 184.. 1263 

1 11 21 31 41 51 

I I I I I I 

GACCATTAGC AGGCACCCAG GCCTGTCTTT GGCTCGGAAA CGGTGGCCCC CAATGTAGCC 
TAGTTTGAAC CTAGGAACTG CAGGACCAGA GAGATTCCAC TGGAGCCTGA TGGACGGGTG 
ACAGAGGGAA CCCTACTCTG GAAACTGTCA GTCCCAGGGC ACTGGGGAGG GCTGAGGCCG 
ACCATGCCCA GCCTGCTGCT GCTGTTCACG GCTGCTCTGC TGTCCAGCTG GGCTCAGCTT 
CTGACAGACG CCAACTCCTG GTGGTCATTA GCTTTGAACC CGGTGCAGAG ACCCGAGATG 
TTTATCATCG GTGCCCAGCC CGTGTGCAGT CAGCTTCCCG GGCTCTCCCC TGGCCAGAGG 
AAGCTGTGCC AATTGTACCA GGAGCACATG GCCTACATAG GGGAGGGAGC CAAGACTGGC 
ATCAAGGAAT GCCAGCACCA GTTCCGGCAG CGGCGGTGGA ATTGCAGCAC AGCGGACAAC 
GCATCTGTCT TTGGGAGAGT CATGCAGATA GGCAGCCGAG AGACCGCCTT CACCCACGCG
GTGAGCG CG CGGG CGTGGT CAACGCCATC AGCCGGGCCT GCCGCGAGGG CGAGCTCTCC 
ACCTGCGGCT GCAGCCGGAC GGCGCGGCCC AAGGACCTGC CCCGGGACTG GCTGTGGGGC 
GGCTGTGGGG ACAACGTGGA GTACGGCTAC CGCTTCGCCA AGGAGTTTGT GGATGCCCGG 
GAGCGAGAGA AGAACTTTGC CAAAGGATCA GAGGAGCAGG GCCGGGTGCT CATGAACCTG 
CAAAACAACG AGGCCGGTCG CAGGGCTGTG TATAAGATGG CAGACGTAGC CTGCAAATGC
CACGGCGTCT CGGGGTCCTG CAGCCTCAAG ACCTGCTGGC TGCAGCTGGC CGAGTTCCGC 
AAGGTCGGGG ACCGGCTGAA GGAGAAGTAC GACAGCGCGG CCGCCATGCG CGTCACCCGC 
AAGGGCCGGC TGGAGCTGGT CAACAGCCGC TTCACCCAGC CCACCCCGGA GGACCTGGTC 
TATGTGGACC CCAGCCCCGA CTACTGCCTG CGCAACGAGA GCACGGGCTC CCTGGGCACG 



GGGCGTGGCT ACAACCAGTT CAAGAGCGTG CAGGTGGAGC GCTGCCACTG C 

TGGTGCTGCT TCGTCAGGTG TAAGAAGTGC ACGGAGATCG TGGACCAGTA CATCTGTAAA 1260

TAGCCCGGAG GGCCTGCTCC CGGCCCCCCC TGCACTCTGC CTCACAAAGG TCTATATTAT 1320 

ATAAATCTAT ATAAATCTAT TTTATATTTG TATAAGTAAA TGGGTGGGTG CTATACAATG 1380 

GAAAGATGAA AATGGAAAGG AAGAGCTTAT TTAAGAGACG CTGGAGATCT CTGAGGAGTG 1440 

GACTTTGCTG GTTCTCTCCT CTTGGTGGGT GGGAGACAGG GCTTTTTCTC TCCCTCTGGC 1500

GAGGACTCTC AGGATGTAGG GACTTGGAAA TATTTACTGT CTGTCCACCA CGGCCTGGAG 1560 

GAGGGAGGTT GTGGTTGGAT GGAGGAGATG ATCTTGTCTG GAAGTCTAGA GTCTTTGTTG 1620

GTTAGAGGAC TGCCTGTGAT CCTGGCCACT AGGCCAAGAG GCCCTATGAA GGTGGCGGGA 1680 

ACTCAGCTTC AACCTCGATG TCTTCAGGGT CTTGTCCAGA ATGTAGATGG GTTCCGTAAG 1740

AGGCCTGGTG CTCTCTTACT CTTTCATCCA CGTGCACTTG TGCGGCATCT GCAGTT7ACA 1800 

GGAACGGCTC CTTCCCTAAA ATGAGAAGTC CAAGGTCATC TCTGGCCCAG TGACCACAGA 1860 

GAGATCTGCA CCTCCCGGAC TTCAGGCCTG CCTTTCCAGC GAGAATTCTT CATCCTCCAC 1920 

GGTTCACTAG CTCCTACCTG AAGAGGAAAG GGGGCCATTT GACCTGACAT GTCAGGAAAG 1980 

CCCTAAACTG AATGTTTGCG CCTGGGCTGC AGAAGCCAGG GTGCATGACC AGGCTGCGTG 2040 

GACGTTATAC TGTCTTCCCC CACCCCCGGG GAGGGGAAGC TTGAGCTGCT GCTGTCACTC 2100

CTCCACCGAG GGAGGCCTCA CAAACCACAG GACGCTGCAA CGGGTCAGGC TGGCGGGCCC 2160 

GGCGTGCTCA TCATCTCTGC CCCAGGTGTA CGGTTTCTCT CTGACATTAA ATGCCCTTCA 2220
TGGAAAAAAA AAAAAGAAAA AAAAAAAAAA AA 
SAAGVVNAIS RACREGELST CGCSRTARPK D
REKNFAKGSE EQGRVLMNLQ NNEAGRRAVY KMADVACKCH GVSGSCSLKT CWLQLAEFRK
VGDRLKEKYD SAAAMRVTRK GRLELVNSRF TQPTPEDLVY VDPSPDYCLR NESTGSLGTQ
GRLCNKTSEG MDGCELMCCG RGYNQFKSVQ VERCHCKFHW CCFVRCKKCT EIVDQYICK



ATGGCTTTGA ACTCAGGGTC ACCACCAGCT ATTGGACCTT ACTATGAAAA CCATGGATAC 60 
CAACCGGAAA ACCCCTATCC CGCACAGCCC ACTGTGGTCC CCACTGTCTA CGAGGTGCAT 120 



TCCAACCCCG TCGTCTGCAC GCAGCCCAAA TCCCCATCCG GGACAGTGTG CACCTCAAAG 240 

ACTAAGAAAG CACTGTGCAT CACCTTGACC CTGGGGACCT TCCTCGTGGG AGCTGCGCTG 300

GCCGCTGGCC TACTCTGGAA GTTCATGGGC AGCAAGTGCT CCAACTCTGG GATAGAGTGC 360 

GACTCCTCAG GTACCTGCAT CAACCCCTCT AACTGGTGTG ATGGCGTGTC ACACTGCCCC 420 

GGCGGGGAGG ACGAGAATCG GTGTGTTCGC CTCTACGGAC CAAACTTCAT CCTTCAGGTG 480 

TACTCATCTC AGAGGAAGTC CTGGCACCCT GTGTGCCAAG ACGACTGGAA CGAGAACTAC 540 

GGGCGGGCGG CCTGCAGGGA CATGGGCTAT AAGAATAATT TTTACTCTAG CCAAGGAATA 600

GTGGATGACA GCGGATCCAC CAGCTTTATG AAACTGAACA CAAGTGCCGG CAATGTCGAT 660 

ATCTATAAAA AACTGTACCA CAGTGATGCC TGTTCTTCAA AAGCAGTGGT TTCTTTACGC 720 

TGTATAGCCT GCGGGGTCAA CTTGAACTCA AGCCGCCAGA GCAGGATCGT GGGCGGCGAG 780 

AGCGCGCTCC CGGGGGCCTG GCCCTGGCAG GTCAGCCTGC ACGTCCAGAA CGTCCACGTG 840 

TGCGGAGGCT CCATCATCAC CCCCGAGTGG ATCGTGACAG CCGCCCACTG CGTGGAAAAA 900 

CCTCTTAACA ATCCATGGCA TTGGACGGCA TTTGCGGGGA TTTTGAGACA ATCTTTCATG 960 

TTCTATGGAG CCGGATACCA AGTAGAAAAA GTGATTTCTC ATCCAAATTA TGACTCCAAG 1020 

ACCAAGAACA ATGACATTGC GCTGATGAAG CTGCAGAAGC CTCTGACTTT CAACGACCTA 1080 

GTGAAACCAG TGTGTCTGCC CAACCCAGGC ATGATGCTGC AGCCAGAACA GCTCTGCTGG 1140

ATTTCCGGGT GGGGGGCCAC CGAGGAGAAA GGGAAGACCT CAGAAGTGCT GAACGCTGCC 1200

AAGGTGCTTC TCATTGAGAC ACAGAGATGC AACAGCAGAT ATGTCTATGA CAACCTGATC 1260 

ACACCAGCCA TGATCTGTGC CGGCTTCCTG CAGGGGAACG TCGATTCTTG CCAGGGTGAC 1320

AGTGGAGGGC CTCTGGTCAC TTCGAAGAAC AATATCTGGT GGCTGATAGG GGATACAAGC 1380 

TGGGGTTCTG GCTGTGCCAA AGCTTACAGA CCAGGAGTGT ACGGGAATGT GATGGTATTC 1440 
ACGGACTGGA TTTATCGACA AATGAGGGCA GACGGCTAA 



MALNSGSPPA IGPYYENHGY QPENPYPAQP TVVPTVYEVH PAQYYPEPVP QYAPRVLTQA
SNPVVCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC
DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQV YSSQRKSWHP VCQDDWNENY
GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAVVSLR
CIACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK
PLNNPWHWTA FAGILRQSFM FYGAGYQVEK VISHPNYDSK TKNNDIALMK LQKPLTFNDL
VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 
TPAMICAGFL QGNVDSCQGD SGGPLVTSKN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 
TDWIYRQMRA DG 

Seq ID NO: 587 DNA sequence 



I I I I I I 

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG
CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC
CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 
CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 
ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 



CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560



CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 
ATGATTCAGA GGTCACTTCA TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 
TGCCATACTG TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 
CCGCAAGGGG TGA1GGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 
GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 
TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 
GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 
TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 
GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTG3T AGTCACTTGT 2100 
AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160
GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220
CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 
TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 
O GAAATCAAGG 2400 



CTGAGTTCAA AGCCATCTT 



I I I I I I 

MALNSGSPPA IGPYYENHGY QPENPYPAQP TVVPTVYEVH PAQYYPSPVP QYAPRVLTQA
SNPVVCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC
DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY
GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAVVSLR
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK
PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL
VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI
TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 
TDWIYRQMKA NG 



Coding sequence: 1..2301 

1 11 21 31 41 51 

I I I I I I

ATGAAGACAC CGTGGAAGAT TCTTCTGGGA CTGCTGGGTG CTGCTGCGCT TGTCACCATC 
ATCACCGTGC CCGTGGTTCT GCTGAACAAA GGCACAGATG ATGCTACAGC TGACAGTCGC 
AAAACTTACA CTCTAACTGA TTACTTAAAA AATACTTATA GACTGAAGTT ATACTCCTTA 
AGATGGATTT CAGATCATGA ATATCTCTAC AAACAAGAAA ATAATATCTT GGTATTCAAT 
GCTGAATATG GAAACAGCTC AGTTTTCTTG GAGAACAGTA CATTTGATGA GTTTGGACAT 
G ATTATTCAAT ATCTCCTGAT GGGCAGTTTA TTCTCTTAGA ATACAACTAC 
T GGAGGCATTC CTACACAGCT TCATATGACA TTTATGATTT A 
CAGCTGATTA CAGAAGAGAG GATTCCAAAC AACACACAGT GGGTCACATG G' 

GGTCATAAAT TGGCATATGT TTGGAACAAT GACATTTATG TTAAAATTGA ACCAAATTTA 540 

CCAAGTTACA GAATCACATG GACGGGGAAA GAAGATATAA TATATAATGG AATAACTGAC 600 

TGGGTTTATG AAGAGGAAGT CTTCAGTGCC TACTCTGCTC TGTGGTGGTC TCCAAACGGC 660

ACTTTTTTAG CATATGCCCA ATTTAACGAC ACAGAAGTCC CACTTATTGA ATACTCCTTC 720 

TACTCTGATG AGTCACTCCA GTACCCAAAG ACTGTACGGG TTCCATATCC AAAGGCAGGA 780 

GCTGTGAATC CAACTGTAAA GTTCTTTGTT GTAAATACAG ACTCTCTCAG CTCAGTCACC 840

AATGCAACTT CCATACAAAT CACTGCTCCT GCTTCTATGT TGATAGGGGA TCACTACTTG 900 

TGTGATGTGA CATGGGCAAC ACABGAAAGA ATTTCTTTGC AGTGGCTCAG GAGGATTCAG 960 

AACTATTCGG TCATGGATAT TTGTGACTAT GATGAATCCA GTGGAAGATG GAACTGCTTA 1020 

GTGGCACGGC AACACATTGA AATGAGTACT ACTGGCTGGG TTGGAAGATT TAGGCCTTCA 1080 

GAACCTCATT TTACCCTTGA TGGTAATAGC TTCTACAAGA TCATCAGCAA TGAAGAAGGT 1140

TACAGACACA TTTGCTATTT CCAAATAGAT AAAAAAGACT GCACATTTAT TACAAAAGGC 1200 

ACCTGGGAAG TCATCGGGAT AGAAGCTCTA ACCAGTGATT ATCTATACTA CATTAGTAAT 1260 

GAATATAAAG GAATGCCAGG AGGAAGGAAT CTTTATAAAA TCCAACTTAG TGACTATACA 1320 

AAAGTGACAT GCCTCAGTTG TGAGCTGAAT CCGGAAAGGT GTCAGTACTA TTCTGTGTCA 1380 

TTCAGTAAAG AGGCGAAGTA TTATCAGCTG AGATGTTCCG GTCCTGGTCT GCCCCTCTAT 1440 

ACTCTACACA GCAGCGTGAA TGATAAAGGG CTGAGAGTCC TGGAAGACAA TTCAGCTTTG 1500

GATAAAATGC TGCAGAATGT GCAGATGCCC TCCAAAAAAC TGGACTTCAT TATTTTGAAT 1560 

GAAACAAAAT TTTGGTATCA GATGATCTTG CCTCCTCATT TTGATAAATC CAAGAAATAT 1620 

CCTCTACTAT TAGATGTGTA TGCAGGCCCA TGTAGTCAAA AAGCAGACAC TGTCTTCAGA 1680 

CTGAACTGGG CCACTTACCT TGCAAGCACA GAAAACATTA TAGTAGCTAG CTTTGATGGC 1740 

AGAGGAAGTG GTTACCAAGG AGATAAGATC ATGCATGCAA TCAACAGAAG ACTGGGAACA 1800 

TTTGAAGTTG AAGATCAAAT TGAAGCAGCC AGACAATTTT CAAAAATGGG ATTTGTGGAC 1860

3 GGGCTGGTCA TATGGAGGGT ACGTAACCTC AATGGTCCTG 1920 

A ATAGCCGTGG CGCCTGTATC CCGGTGGGAG 1980 

TACTATGACT CAGTGTACAC AGAACGTTAC ATGGGTCTCC CAACTCCAGA AGACAACCTT 2040 

GACCATTACA GAAATTCAAC AGTCATGAGC AGAGCTGAAA ATTTTAAACA AGTTGAGTAC 2100 

CTCCTTATTC ATGGAACAGC AGATGATAAC GTTCACTTTC AGCAGTCAGC TCAGATCTCC 2160

AAAGCCCTGG TCGATGTTGG AGTGGATTTC CAGGCAATGT GGTATACTGA TGAAGACCAT 2220 

C ACACCAACAT ATATATACCC ACATGAGCCA CTTCATAAAA 2280 



MKTPWKILLG LLGAAALVTI ITVPVVLLNK GTDDATADSR



RWISDHEYLY KQENNILVFN AEYGNSSVFL ENSTFDEFGH SINDYSISPD GQFILLEYNY
VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAYVWNN DIYVKIEPNL
PSYRITWTGK EDIIYNGITD WVYEEEVFSA YSALWWSPNG TFLAYAQFND TEVPLIEYSF
YSDESLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT NATSIQITAP ASMLIGDHYL
CDVTWATQER ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGRFRPS
EPHFTLDGNS FYKIISNEEG YRHICYFQID KKDCTFITKG TWEVIGIEAL TSDYLYYISN
ST LYKIQLSDYT KVTCLSCELN PERCQYYSVS FSKEAKYYQL RCSGPGLPLY
jEDNSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY
P CSQKADTVFR LNWATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT
FEVEDQIEAA RQFSKMGFVD NKRIAIWGWS YGGYVTSMVL GSGSGVFKCG IAVAPVSRWE 
YYDSVYTERY MGLPTPEDNL DHYRNSTVMS RAENFKQVEY LLIHGTADDN VHFQQSAQIS 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP 

Seq ID NO: 591 DNA sequence

Nucleic Acid Accession #: NM_01SQ77.1

Coding sequence: 128.. 667 



3 TCACCCAGGA ACCCCGGAAG A 
CGCGATAGAA ACGTGTTCGC TTGCCCAGAA GAAGGGAAGG CGCGAGTGAG GAAAGGAGGT 
T CCTTGGTTAT GGAATATTTG GCTCATCCCA GTACACTCGG 

G AGCCTTCGAG T 

\ CGAGCAAGAC ACACACAGAT A 




CAAAGCTCCT GATGAAGAAA CCCTGATTGC ATTATTGGCC CATGCAAAAA T 
GACTGTAAGT TTAATTCAAG ATGCTGGACG TACTCAGATT GCACCAGGCT CTCAAACTGT 

TTACTAGGTG GACTTTGATA TGACAACAAC CCCTCCATCA CAAGTGTTTG AAGCCTGTCA 
GATTCTAACA ACAAAAGCTG AATTTCTTCA CCCAACTTAA ATGTTCTTGA GATGAAAATA 
AAACCTATTC CCATGTTCTA AAAAAA 



n Accession #: ( 



MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTSSEASILG
DSGEYKMILV VENDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVWKA 
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGIILKLY 



FGENESH predicted



dU atgcgcgccg tgccgctgcc cgccccgctc ctgccgctgc tgctgctcgc g 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG OGCCGTGGCC CGAACCCGAG 120 

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 

ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 
CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC A 
AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT T 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 

GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260

ACCCATCCTG TCAGGTTGGC TCGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GCTCAGCGGG TACCGCAACT 1380

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500

GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGC GATCACACAT 1620

GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1680

G GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740

\GAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 

C TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT 1860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCCTGA 



n sequence 
S NSSGDALVTR

KKTRKYDi: 



C SDCHWQARFH V 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PKWHFSATGS PIKTLYTQTM
STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPLS 
D ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 
I VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 
ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKMVLK SSEGPPSLTG 
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P 



Coding sequence: 1..1740 

i I 1 r i 1 i 1 i 1 

ATGAGCAGCT GCAGGTACAA CGGGGGCGTC ATGCGGCCGC TCAGCAACTT GAGCGCGTCC 
CGCCGGAACC TGCACGAGAT GGACTCAGAG GCGCAGCCCC TGCAGCCCCC CGCGTCTGTC 
GGAGGAGGTG GCGGCGCGTC CTCCCCGTCT GCAGCCGCTG CCGCCGCCGC CGCTGTTTCG 
TCCTCAGCCC CCGAGATCGT GGTGTCTAAG CCCGAGCACA ACAACTCCAA CAACCTGGCG 
CTCTATGGAA CCGGCGGCGG AGGCAGCACT GGAGGAGGCG GCGGCGGTGG CGGGAGCGGG 
CACGGCAGCA GCAGTGGCAC CAAGTCCAGC AAAAAGAAAA ACCAGAACAT CGGCTACAAG 
CTGGGCCACC GGCGCGCCCT GTTCGAAAAG CGCAAGCGGC TCAGCGACTA CGCGCTCATC
TTCGGCATGT TCGGCATCGT GGTCATGGTC ATCGAGACCG AGCTGTCGTG GGGCGCCTAC 
GACAAGGCGT CGCTGTATTC CTTAGCTCTG AAATGCCTTA TCAGTCTCTC CACGATCATC 
CTGCTCGGTC TGATCATCGT GTACCACGCC AGGGAAATAC AGTTGTTCAT GGTGGACAAT 
GGAGCAGATG ACTGGAGAAT AGCCATGACT TATGAGCGTA TTTTCTTCAT CTGCTTGGAA 
ATACTGGTGT GTGCTATTCA TCCCATACCT GGGAATTATA CATTCACATG GACGGCCCGG 
CTTGCCTTCT CCTATGCCCC ATCCACAACC ACCGCTGATG TGGATATTAT TTTATCTATA 
CCAATGTTCT TAAGACTCTA TCTGATTGCC AGAGTCATGC TTTTACATAG CAAACTTTTC 
ACTGATGCCT CCTCTAGAAG CATTGGAGCA CTTAATAAGA TAAACTTCAA TACACGTTTT 



TTATGGATAA TTGCCGCATG GACTGTCCGA GCTTGTGAAA GGTACCATGA TCAACAGGAT 
ACTTCCTTGG AGCGATGTGG TTGATATCAA TAACTTTTCT CTCCATTGGT 



ATGGGTGCTG GTTGCACAGC CCTGGTGGTA GCTGTAGTGG CAAGGAAGCT AGAACTTACC 1200
AAAGCAGAAA AACACGTGCA CAATTTCATG ATGGATACTC AGCTGACTAA A 
AATGCAGCTG C 

AAAAAGATAG ATCATGCAAA AGTAAGAAAA CATCAACGAA AATTCCTGCA A' 
CAATTAAGAA GTGTAAAAAT G 

GACTTGGCAA AGACCCAGAA CATCATGTAT GATATGATTT CTGACTTAAA C 

3 AGAAGAGGAT TGTTACCCTG GAAACAAAAC TAGAGACTTT GATTGGTAGC 1560 
;TGGGCT CATAAGCCAG ACCATCAGGC AGCAGCAGAG AGATTTCATT 1620 
GAGGCTCAGA TGGAGAGCTA CGACAAGCAC GTCACTTACA ATGCTGAGCG GTCCCGGTCC 1680
TCGTCCAGGA GGCGGCGGTC CTCTTCCACA GCACCACCAA CTTCATCAGA GAGTAGCTAG 

Seq ID NO: 596 Protein sequence 
Protein Accession #: NP_067627.1 

1 11 21 31 41 51 

MSSCRYNGGV MRPLSNLSAS RRNLHEMDSE AQPLQPPASV GGGGGASSPS AAAAAAAAVS 60

SSAPEIVVSK PEHNNSNNLA LYGTGGGGST GGGGGGGGSG HGSSSGTKSS KKKNQNIGYK 120

LGHRRALFEK RKRLSDYALI FGMFGIVVMV IETELSWGAY DKASLYSLAL KCLISLSTII 180

LLGLIIVYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCAIHPIP GNYTFTWTAR 240 

LAFSYAPSTT TADVDIILSI PMFLRLYLIA RVMLLHSKLF TDASSRSIGA LNKINFNTRF 300 

VMKTLMTICP GTVLLVFSIS LWIIAAWTVR ACERYHDQQD VTSNFLGAMW LISITFLSIG 360

YGDMVPNTYC GKGVCLLTGI MGAGCTALVV AVVARKLELT KAEKHVHNFM MDTQLTKRVK 420

NAAANVLRET WLIYKHTKLV KKIDHAKVRK HQRKFLQAIH QLRSVKMEQR KLNDQANTLV 480 

DLAKTQNIMY DMISDLNERS EDFEKRIVTL ETKLETLIGS IHALPGLISQ TIRQQQRDFI 540 
EAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSESS 

Seq ID NO: 597 DNA sequence 

Nucleic Acid Accession #: NM_016029.1

Coding sequence: 228.. 1097 

1 11 21 31 41 51 

I I I I I I 

CTGCGATCCC GCAGGGCAGC GACGCGACTC TGGTGCGGGC CGTCTTCTTC CCCCCGAGCT 60 

GGGCGTGCGC GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTCTGCGCGC 120

TGCTCCTGCT CTTGGTGCAG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 180 

TATGGGCCGA GTGGCAGGGA CGACGCCCAG AATGGGAGCT CACTGATATG GTGGTGTGGG 240

TGACTGGAGC CTCGAGTGGA ATTGGTGAGG AGCTGGCTTA CCAGTTGTCT A
TTTCTCTTGT G 



CTGGTTCCCA TGAAGCGGCT ACCAAAGCTG TTCTCCAGGA GTTTGGTAGA ATCGACATTC 
TGGTCAACAA TGGTGGAATG TCCCAGCGTT CTCTGTGCAT GGATACCAGC TTGGATGTCT 
ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 
CTCACATGAT CGAGAGGAAG CAAGGAAAGA TTGTTACTGT GAATAGCATC CTGGGTATCA 
TATCTGTACC TCTTTCCATT GGATACTGTG CTAGCAAGCA TGCTCTCCGG GGTTTTTTTA
ATGGCCTTCG AACAGAACTT GCCACATACC CAGGTATAAT AGTTTCTAAC A' 
GACCTGTGCA ATCAAATATT GTGGAGAATT CCCTAGC
GCAATAATGG AGACCAGTCC CACAAGATGA CAACCAGTCG T 
TCAGCATGGC CAATGATTTG AAAGAAGTTT GGATCTCAGA ACAACCTTTC TTGTTAGTAA 960 
CATATTTGTG GCAATACATG CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020
AAAGGATTGA GAACTTTAAG AGTGGTGTGG ATGCAGACTC TTCTTATTTT AAAATCTTTA 1080
AGACAAAACA TGACTGAAAA GAGCACCTGT ACTTTTCAAG CCACTSGAGG GAGAAATGGA 1140 
AAACATGAAA ACAGCAATCT TCTTATGCTT CTGAATAATC AAAGACTAAT TTGTGATTTT 1200 
ACTTTTTAAT AGATATGACT TTGCTTCCAA CATGGAATGA AATAAAAAAT AAATAATAAA 1260 
AGATTGCCAT GAATCTTGCA AA 



Seq ID NO: 598 Protein sequence 



I I I I I I 

MNWELLLWLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWELTD MVVWVTGASS

GIGEELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGNL KEKDILVLPL DLTDTGSHEA 

ATKAVLQEFG RIDILVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMIER

KQGKIVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN

IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 

MPTWAWWITH KMGKKRIENF KSGVDADSSY FKIFKTKHD 

Seq ID NO: 599 DNA sequence



G ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60

GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120

AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 180 

GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300

TTGGAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 360 

ACCAGAGCGC ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

CTATGACTCG GTCATTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCCGCT CCAAGTOCAC 540 

TCGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGCGTCT GGAAGAGCTT 600

CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 660

GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGGAGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGCGCCCAC TAGTGGTCAA 780 

CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 
GGAAGAGTTC TCCTCAGTGG CTGACTTCCT G 
AGATGGCTGG GCGATACCGG GGGACTC 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA 1020 

GTGCCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 1080 

TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 1140

CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAAICTC TAACAGCTCC ATACTTCTTT 1380 

CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560 

ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680

TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCICTCAGTC TTCATACCCC AGAAATTATT 1800 

GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGITAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 

AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TAGAGAGGAG AAAGGGGTGA TIGAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCBGT GGGTCTGGTG 2280 

GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 
TGGTCTCTAT AGCAGTGTT C TCTCTGAATG CTGAGCTGAA GAAATTATAC G' 
ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG G 
ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT A 
TAGGTGCAAA CAAACAAGCT TACTTCCATT G 

ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT 2640

TCCATTGACA TTTGAATTTG GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 2700 

AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 

ATTTTTTTCT TTTTAAAATA GACACTATAG TITTACCCAT AAGTAATATT TAAAGATTAT 2820

AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 2880

ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940

CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 

AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060 

CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG 3120 

GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 

CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 
ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC



ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 

CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 3480 

TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG G3GGAAAAGG ATGATGGAAA 3540 

AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTCGT TTATGTGTGC AGACAATTCT 3600

CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 

TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 3720

GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 3780 

TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATCGCAA CATGAATTTT 3840 

TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 3900

AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 3960

CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 

GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 4080

AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 4140 

GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 4200 

AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 4260 

r TATTTATTGC TTAGTGAAAA TACTAGTGGG 4320 

A ATGGAGATGC CCGCACAGTA ATGTAGAAAT 4380

A TGTAACTGAC CTCCTTAGAG GCAGATTAGT AACTGTTCCT 4440 

VCATGAC TTTCTTTTTT CACATTGGGT 4500 
CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCACGTCTCC T 
CTTCCCAGTG ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA O 

T TTCATTTTGC CTGAGTGTAT T 

3GTCACA TGTTGGTGCT GCCTTGCTGC 
ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT G 
2 ATGCCCCTAT G 



\ AATAACTTTA 
A AAATGTAGAC TCTGACTTCC GTCCCACTGA 
A GAACATCCCC ACTGTCATGT 
TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT 



TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT G 
TACCAATCTC TGCTTTGCAA TGGGCAC 
AGAGAGGATC TAGGATGGGA GAGCTAGAAA G 

GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT 5460 

GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 5520 

ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 5580 

CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 5640 

TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT 5700 

CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 5760 

AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 5820 

AACCCACAGA AGGGGATGGG AAATAAAGAA AATTCTCTCA AGACTTCTCC AGGCCCATGT 5880

CACTGGTCAG CGTGGTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 5940

AGTACTTTAT AACCAAAGCA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT 6000

TTAGTTTTGC CATCACATTG TCACCCAGAC CTCACCTAGC CCCAAGTAAT CGGGCGCCCC 6060 

GAAGAGGGAG ACAGAGATGT GCCAGAGTTG ACCCAGTGTG CGGATGATAA CTACTGACGA 6120 

AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 6180 

TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG 6240 

GTGAGAATGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT 6300 

GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC 6360 

TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA 6420 



Seq ID NO: 600 Protein sequence



MGILSVDLLI TLQILPVFFS NCLFLALYDS V 
RCVWKSFLLD AYKQVKLGED APNSSVVHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP
ERPLVVNFGS ATUPPFTSQL PAFRKLVEEF SSVADFLLVY IDEAHPSDGW AIPGDSSLSF
EVKKHQNQED RCAAAQQLLE RFSLPPQCRV VADRMDNNAN IAYGVAFERV CIVQRQKIAY




AGCTGATGAA AGTTTCACTC AAATGGATCT TGGGGACCGT ATTCTGAAGC TCAACACTGA 600 
GATTAGAGAA GTAGGTCCTG TCAACAAGAA GGGATTTTAT TTGGCATTTC AAGATGTTGG 660 
TGCTTGTGTT GCCTTGGTGT CTGTGAGAGT ATACTTCAAA AAGTGCCCAT TTACAGTGAA 720 
I I I

3 ELIPQPSNEV NLLDSKTIQG ELGWISYPSH G 
HYTPIRTYQV CNVMDHSQNN WLRTNWVPRN SAQKIYVELK FTLRDCHSIP L 
NDYYMESDDD HGVKPREHQF TKIDTIAADE SFTQMDLGDR ILKLNTEIRE VGFVNKKGFY 
LAFQDVGACV ALVSVRVYFK KCPFTVKNLA MFPDTVPMDS QSLVEVRGSC VNNSKEEDPP 
RMYCSTEGEW LVPIGKCSCN AGYEERGFMC QACRPGFYKA LDGNMKCAKC PPHSSTQEDG 
SMNCRCENNY FRADKDPPSM ACTRPPSSPR NVISNINETS VILDKSWPLD TGGRKDVTFN 
IICKKCGWNI KQCEPCSPNV RFLPRQFGLT NTTVTVTDLL AHTNYTFEID AVNGVSELSS 
PPRQFAAVSI TTNQAAPSPV LTIKKDRTSR NSISLSWQEP EHPNGIILDY EVKYYEKQEQ 
ETSYTILRAR GTNVTISSLK PDTIYVFQIR ARTAAGYGTN SRKFEFETSP DSFSISGESS 
QWMIAISAA VAIILLTWI YVLIGRFCGY KSKHGADEKR LHFGNGHLKL PGLRTYVDPH 
TYEDPTQAVH EFAKEU3ATN ISIDKWGAG EFGEVCSGRL KLPSKKEISV AIKTLKVGYT 
EKQRRDFLGE ASIMGQFDHP NIIRLEGWT KSKPVMIVTE YMENGSLDEF LRKHDAQFTV 
IQLVGMLRGI ASGMKYLSDM GYVHRDLAAR NILINSNLVC KVSDFGLSRV LEDDPEAAYT 
TRGGKIPIRW TSPEAIAYRK FTSASDWSY GIVLWEVMSY GERFYWEMSN QDVIKAVDEG 
YRLPPPMDCP AALYQLMLDC KQKDRNNRPK FEQIVSILDK LIRNPGSLKI ITSAAARPSN 
LLLDQSNVDI STFRTTGDWL NGVRTAHCKE IFTGVEYSSC DTIAKISTDD MKKVGVTWG 
PQKKIISSIK ALETQSKNGP VPV 



GTGTGGTGCA GCCCTGTTGG CAGTGGGCAT CTGGGTGTCA ATCGATGGGG CATCCTTTCT 
CATCGCAGCC GGCGTTGTGG TCTTTGCTCT TGGTTTCCTG GGCTGCTATG GTGCTAAGAC 



3 GCTGAGCACT TCCTGACGTT 
GCTGGTAGTG CCTGCCATCA AGAAAGATTA TGGTTCCCAG GAAGACTTCA CTCAAGTGTG 
GAACACCACC ATGAAAGGGC TCAAGTGCTG TGGCTTCACC AACTATACGG ATTTTGAGGA 
CTCACCCTAC TTCAAAGAGA ACAGTGCCTT TCCCCCATTC TGTTGCAATG ACAACGTCAC 
CAACACAGCC AATGAAACCT GCACCAAGCA AAAGGCTCAC GACCAAAAAG TAGAGGGTTG
CTTCAATCAG CTTTTGTATG ACATCCGAAC TAATGCAGTC ACCGTGGGTG GTGTGGCAGC
T GATTGTGTCC ATGTATCTGT ACTGCAATCT
2 CACTTCTGCC TCTGCCACTA CTGCTGCCAC ATGGGAACTG TGAAGAGGCA 
CCCTGGCAAG CAGCAGTGAT TGGGGGAGGG GACAGGATCT AACAATGTCA CTTGGGCCAG 
AATGGACCTG CCCTTTCTGC TCCAGACTTG GGGCTAGATA GGGACCACTC CTTTTAGGCG 
ATGCCTGACT TTCCTTCCAT TGGTGGGTGG ATGGGTGGGG GGCATTCCAG AGCCTCTAAG 
GTAGCCAGTT CTGTTGCCCA TTCCCCCAGT CTATTAAACC CTTGATATGC CCCCTAGGCC 



CTGGGTGGAT GTGTAGAAGG CACTTCAAAA TGCATAAACC 



Protein Accession # 



MQCFSFIKTM MILFNLLIFL CGAALLAVGI WVSIDGASFL KIFGPLSSSA MQFVNVGYFL 

h GFLGCYGAKT ESKCALVTFF FILLLIFIAE VAAAWALVY TTMAEHFLTL 

Y GSQEDFTQVW NTTMKGLKCC GFTNYTDFED SPYFKENSAF PPFCCNDNVT 

NTANETCTEQ KAHDQKVEGC FNQLLYDIRT NAVTVGGVAA GIGGLELAAM IVSMYLYCNL 




CTGGATGGAT TTTGGCCGTC GCAGTGCCGA GGAGTATGAG TACCCCTCCT A 
GCCGCCATCA GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA 
rCATTGT CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC 
\ AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCT CACCAGAAGT 
TGTGCAAACT GAAGACAAAA CTGTTTTCTT CATCTGTGAC TCCTGTTCTG AAAATGTTGT 



Seq ID NO: 607 DNA sequence

Nucleic Acid Accession #: NM_001423.1 

Coding sequence: 219.. 692 



AGCACTCTCC AGCCTCTCAC CGCAAAATTA CACACCCCAG TACACCAGCA GAGGAAACTT 
ATAACCTCGG GAGGCGGGTC CTTCCCCTCA GTGCGGTCAC ATACTTCCAG AAGAGCGGAC 
CAGGGCTGCT GCCAGCACCT GCCACTCAGA GCGCCTCTGT CGCTGGGACC CTTCAGAACT 
CTCTTTGCTC ACAAGTTACC AAAAAAAAAA GAGCCAACAT GTTGGTATTG CTGGCTGGTA 
GT TAGCACCATT GCCAATGTCT 
3 TAGGTCTTTG GAAAAACTGT ACCAACATTA 
GCTGCAGTGA CAGCCTGTCA TATGCCAGTG AAGATGCCCT CAAGACAGTG CAGGCCTTCA 
TGATTCTCTC TATCATCTTC TGTGTCATTG CCCTCCTGGT CTTCGTGTTC CAGCTCTTCA 
CCATGGAGAA GGGAAACCGG TTCTTCCTCT CAGGGGCCAC CACACTGGTG TGCTGGCTGT 
GCATTCTTGT GGGGGTGTCC ATCTACACTA GTCATTATGC GAATCGTGAT GGAACGCAGT
ATCACCACGG CTATTCCTAC ATCCTGGGCT GGATCTGCTT CTGCTTCAGC TTCATCATCG
GCGTTCTCTA TCTGGTCCTG AGAAAGAAAT AAGGCCGGAC GAGTTCATGG GGATCTGGGG 
GGTGGGGAGG AGGAAGCOGT TGAATCTGGG AGGGAAGTGG AGGTTGCTGT ACAGGAAAAA
CCGAGATAGG GGAGGGGGGA GGGGGAAGCA AAGGGGGGAG GTCAAATCCC AAACCATTAC 
TGAGGGGATT CTCTACTGCC AAGCCCCTGC CCTGGGGAGA AAGTAGTTGG CTAGTACTTT 
GATGCTCCCT TGATGGGGTC CAGAGAGCCT CCCTGCAGCC ACCAGACTTG GCCTCCAGCT 
GTTCTTAGTG ACACACACTG TCTGGGGCCC CATCAGCTGC CACAACACCA GCCCCACTTC 
TGGGTCATGC ACTGAGGTCC ACAGACCTAC TGCACTGAGT TAAAATAGCG GTACAAGTTC 
TGGCAAGAGC AGATACTGTC TTTGTGCTGA ATACGCTAAG CCTGGAAGCC A' 
TCTGACCCAA AGCAAAACAT CACATTCCAG TCTGAAGTGC CTACTGGGGG G 1 
GTGAGCCATT GTCCCTCTTT G 
GGAGGAGGAA AGAGAGTTTG T. 
CTTTTCACAA TGGAAAACCT GGGGGATGGT CAGAGCCCAG TCGAGACCTC ACACACGGCT 
GTCCCTCATG GAGACCTCAT GCCATGGTCT TTGCTAGGCC TCTTGCTGAA AGCCAAGGCA



AACTATGGGA TCCAAGGGGC AGTCTTGCAA CAGTGCCATG TTAGGGTTAT GTTTTTAGGA 
TTCCCCTCAA TGCAGTCAGT GTTTCTTTTA AGTATACAAC AGGAGAGAGA TGGACATGGC 
TCATTGTAGC ACAATCCTAT TACTCTTCCT CTAACATTTT TGAGGAAGTT TTGTCTAATT 
ATCAATATTG AGGATCAGGG CTCCTAGGCT CAGTGGTAGC TCTGGCTTAG ACACCACCTG 
GAGTGATCAC CTCTTGGGGA CCCTGCCTAT CCCACTTCAC A 
GGAAGCTGAT TAAAACACAC ATAAACCAAA ACCAAACAAC AGGCCCTTGG GTGAAAGGTG 1860

T AAGCCTACCG TATTTCAGCC ATGATAAGAA CAGAGTGCCT 1920 

A AAATCCCATG AGATAAATAA AAATATAGGT GATGGGCAGA 1980 

TCTTTTCTTT AAAATAAAAA AGCAAAAACT CTTGTGGTAC CTAGTCAGAT GGTAGACGAG 2040 

CTGTCTGCTG CCGCAGGAGC ACCTCTATAC AGGACTTAGA AGTAGTATGT TATTCCTGGT 2100 

TAAGCAGGCA TTGCTTTGCC CTGGAGCAGC TATTTTAAGC CATCTCAGAT TCTGTCTAAA 2160 

GGGGTTTTTT GGGAAGACGT TTTCTTTATC GCCCTGAGAA GATCTACCCC AGGGAGAATC 2220 

TGAGACATCT TGCCTACTTT TCTTTATTAG CTTTCTCCTC ATCCATTICT TTTATACCTT 2280 

TCCTTTTTGG GGAGTTGTTA TGCCATGATT TTTGGTATTT ATGTAAAAGG ATTATTACTA 2340 

ATTCTATTTC TCTATGTTTA TTCTAGTTAA GGAAATGTTG AGGGCAAGCC ACCAAATTAC 2400 

CTAGGCTGAG GTTAGAGAGA TTGGCCAGCA AAAACTGTGG GAAGATGAAC TTTGTCATTA 2460 

TGATTTCATT ATCACATGAT TATAGAAGGC TGTCTTAGTG CAAAAAACAT ACTTACATTT 2520

CAGACATATC CAAAGGGAAT ACTCACATTT TGTTAAGAAG TTGAACTATG ACTGGAGTAA 2580

ACCATGTATT CCCTTATCTT TTACTTTTTT TCTGTGACAT TTATGTCTCA TGTAATTTGC 2640

ATTACTCTGG TGGATTGTTC TAGTACTGTA TTGGGCTTCT TCGTTAATAG ATTATTTCAT 2700

ATACTATAAT TGTAAATATT TTGATACAAA TGTTTATAAC TCTAGGGATA TAAAAACAGA 2760 
TTCTGATTCC CTTCAAAAAA AAAAAA 

Seq ID NO: 608 Protein sequence
Protein Accession #: NP_001414.1



MLVLLAGIFV VHIATVIMLF VSTIANVWLV SNTVDASVGL WKNCTNISCS DSLSYASEDA 
LKTVQAFMIL SIIFCVIALL VFVFQLFTME KGNRFFLSGA TTLVCWLCIL VGVSIYTSHY
ANRDGTQYHH GYSYILGWIC FCFSFIIGVL YLVLRKK




TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 
AATGCCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT T 

ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 600

GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 660

CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720

ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 780

TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 840 

GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 900 

CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960 

ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 1020

TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 1080 

TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 1140 

TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 1200 

AATAGCCGTG CCCATGCCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCCG CCAACATCAG 1260 

GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 1320

TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 1380 

AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT 1440

TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 1500 

TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 1560

TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 1620 

GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCGACTAG 1680 

AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 1740 

CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 1800 

T CCCTCTTCTT CAAGGAGCAT CCGTGATGCT CAGTGTTCAA AACCACAGCC 1860 

T CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 1920 

T TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 1980 

CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 2040

GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 2100 

TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAGATAGA 2160 

CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 2220 

XKTTGC TGGCTCCATC TTTCGTCTGC 2280 

3 GTATCTATTT TTGTGTGTGA TTATAGTAAC 2340 

TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCTTTCTGTA 2400 

3GCACTA GGCCTTGGTG ACTTCCTGGG 2460 

TGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 2520 

ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 2580

A CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 

T GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 2700

GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 2760

GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 2820 

ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 2880

CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 2940 

GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 3000 

GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 3060 
I



I 



I 



MLSKVLPVLL GILLILQSRV EGPQTESKNE 
TGSRVGKLPE ASRILNTILS NYDHKLRPGI 
IFSQTWYDER LCYNDTFESL VLNGNWSQL 
GKVLYTIRMT IDAGCSLHML RFPMDSHSCP 
KLFQFDFTGV SNKTEIITTP VGDFMVMTIF 
IKTESAPART SLGITSVLTM TTLGTFSRKN 
IOTLIYNQTK AHASPKLRHP RINSRAHART 



DNYSRWFPV TFFFFNVLYW LVCLNL 



Nucleic Acid

Coding sequence:



ASSRDWYG? QPQFLENQLL SEETKSTETE 

GEKPTWTVE IAVNSLGPLS ILDMEYTIDI 

?RHS KRTHEHEITM PNCMVRIYKD 

LSFSSFSYPE NEMIYKWENF KLEINEKMSW 

FNVSRRFGYV AFQKYVPSSV TTMLSKVSFW 

FPRVSYITAL DFYIAICFVF CFCALLEFAV 

iRSRACARQ HQEAFVCQIV TTEGSDGESR 



GTGTAAAGAA AGCCAAATCA AGGACCCGAA T 
GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 



CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 
CCAGACCTGG TACGACGAAC GCCTCTGTTA CAACGACACC TTTGAG1 
TGGCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGGAATT C 

CCACGAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG ATGGCAAGGT 780 

GTTGTACACA ATTAGGATGA CCATTGATGC CGGATGCTCA CTCCACATGC TCAGATTTCC 840 

AATGGATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGAGAT 900 

GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT GGAAGCTCTT 960 

CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTGA 1020 

CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGCGG TTTGGCTATG TTGCCTTTCA 1080

AAACTATGTC CCTTCTTCCG TGACCACGAT GCTCTCCTGG GTTTCCTTTT GGATCAAGAC 1140 

AGAGTCTGCT CCAGCCCGGA CCTCTCTAGG GATCACCTCT GTTCTGACCA TGACCACGTT 1200 

GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT ATCACAGCCT TGGATTTCTA 1260

TATCGCCATC TGCTTCGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 1320

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTCGTATCAA 1380 

TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCCGCC AACATCAGGA 1440 

AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTG 1500 

CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 1560

GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 



CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 1740 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG 1860 

TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 2100 

TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 2160 

TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT CCCTTTAGGA AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400

CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTCCTG GCTCCATCTT TCGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CCTGTGACTC TTTCTGTAAC 2580 

TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 2640

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT CGCCATTGAT TGGTGCCCAC 2700 
CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC T 



GTCACAGATT 



ACATCTTTAG GGCAGGATTC 



T GTCTGATTGG 



ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 



C ACCAGTGGCA AGGCCCAGAA 
CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 
ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTGT CTGTAATTTG 
TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 
AAATATGTAA ATAAATATAT CAGCAAAGC



Protein Accession #: NP_068819.1
MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNVVSQLWIP DTFFRNSKRT HEHEITMPNQ
MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 
INEKNSWKLF QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM
LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC
ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE
GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL
Y SRVVFPVTFF FFNVLYWLVC LNL



I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG G 



TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 660

CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCCGGATG 720 

CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TCTCCCTTCT TCCGTGACCA CGATGCTCTC 1020 

CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 1200 

GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 1320 

AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 1380

TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG GTAGCCCTGA 1440 

GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC GTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560

CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 

CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 

CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 1740 

CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 

AGGGTTTGTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ATCCGTGATG 1920 

CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG CCTAAGTACA 1980 

GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 

GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG TATAACAGGA 2160 

GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTGCTACC 2220 

CCTTTCTCTG CAGATAGATA GACACTGGCA TTATCCCTTT AGGAAGAGGG GGGGGCAGCA 2280 

AGAGAGCCTA TTTGGGACAG CATTCCTCTC TCTCTGCTGC TGTGACATCT CCCTCTCCTT 2340

GCTGGCTCCA TCTTTCGTCT GCACTACCAA TTCAATGCCC TTCATCCAAT GGGTATCTAT 2400

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATG GCACCCTCTT CCTTCTCTTT 2460 

GACCCCTGTG ACTCTTTCTG TAACTTTCCC AGTGACTTCC CCTAGCCCTG ACCAGGCACT 2520 

AGGCCTTGGT GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TCGGCTTTGC AACAGGCATT 2580 

ACTCGCCATT GATTGGTGCC CACCCAGGGC ACACTGTCGG AGTTCTATCA CTTGCTTGAC 2640 

CCCTGGACCC ATAAACCAGT CCACTGTTAT ACCCGGGGCA CTCTAACCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGGAACTTTG GCAAAGCACT TTTGACAAGT 2760 

TGTGTCTGAT TGGAGCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820

CATTTTGCAG ATGAAAACCC TGAGTCACAG ATTTCTGTGG GACTGTGGAT CTCACTGGAA 2880 

GCTATCCAAG AGCCCACTGT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAGTTC 2940 

ACCATGATTC TCTTCTGTCA CCTCTGCTGG CACACCAGTG GCAAGGCCCA GAATGGCGAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCCCCAAGA TCAAATCTCT 3060 

CCTGGCTGTA GTAACCCAGT GGAATGAATT TGGACATGCC CCAATGCTTC TATATGCTAA 3120 

GTGAAATCTG TGTCTGTAAT TTGTTGGGGG GTGGATAGGG TGGGGTCTCC ATCTACTTTT 3180 
TGTCACCATC ATCTGAAATG GGGAAATATG TAAATAAATA TATCAGCAAA GC 



MEYTIDIIFS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 

SHSCPLSFSS FSYPENEMIY KWENFKLEIN EKNSWKLFQF DFTGVSNKTE IITTPVGDFM 

VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 

FSRKNFPRVS YITALDFYIA ICFVFCFCAL LEFAVLNFLI YNQTKAHASP KLRHPRINSR 

AHARTRARSR ACARQHQEAF VCQIVTTEGS DGEERPSCSA QQPPSPGSPE GPRSLCSKLA 
CCEWCKRFKK YFCMVPDCEG STWQQGRLCI HVYRLDNYSR WFPVTI 
Coding sequence: 1309..2490 

1 11 21 31 41 51 

| | | | | | 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 
CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 
GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 
CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 
CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 
TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 
TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 
GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 
CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 
CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 
TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 
AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 
TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 
CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA T 
CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT T 
GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAO 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG T 



G AGTTGGCAAA CTGCCAGAAG C 
h ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 
ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 
C GACGAACGCC TCTGTTACAA CGACACCTTT 
3 AGCCAGCTAT GGATCCCGGA CACCTTTTTT 
AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCCGCATC 



G TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 
GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCCGTGA CCACGATGCT C 
TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCCCGGACCT CTCTAGC 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 
ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC T 
TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC T 

CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 

GCCCGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 

CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 

TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 

TACCGCCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 

TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 

ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 

TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000 

ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 

CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 

CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 

TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 

GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 

GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 

ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 

CCCATAAACC AGTCCACTGT TATACCCGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 

ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 

GATTGGAGCT TCATGATAGC CTTGTGACAT CTTTAGGGCA GGATTCTTAT CCCCATTTTG 3660 

CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC 3720 

AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TTCTCTTCTG TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 

TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTGGCT 3900 

GTAGTAACCC AGTGGAATGA ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTGAAAT 3960 



S QTWYDERLCY NDTFESLVLW GHWSQLWIP DTFFRNSKRT HEHEITMPNQ 
MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM I 
INEKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ N 
LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR V 
ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 
GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQGRL 
CIHVYRLDNY SRVVFPVTFF FFNVLYWLVC LNL 






1 11 21 31 41 51 

| | | | | | 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGC CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 



| | | | | 

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF PGPSELH 

EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 

HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 

ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 

IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 



| | | | | | 

ATAACAGCAT GAAGTGCCGT GGAACTGGAA TAGGCGTGTC CTCTCCCTCX3 ACCCTCCCCC 60 

TCCTTGTCCC TCTGCTCACC CCTCGCTCGT TCCCTCCCTC CGGCGAGGGC CGCCTTTATA 120 

ACAACTGCTC AGAGTGCGAG GGCGGGATAG CTGTCCAAGG TCTCCCCCAG CACTGAGGAG 180 

CTCGCCTGCT GCCCTCTTGC GCGCGGGAAG CAGCACCAAG TTCACGGCCA ACGCCTTGGC 240 

ACTAGGGTCC AGAATGGCTA CAACAGTCCC TGATGGTTGC CGCAATGGCC TGAAATCCAA 300 

GTACTACAGA CTTTGTGATA AGGCTGAAGC TTGGGGCATC GTCCTAGAAA CGGTGGCCAC 360 

AGCCGGGGTT GTGACCTCGG TGGCCTTCAT GCTCACTCTC CCGATCCTCG TCTGCAAGGT 420 

GCAGGACTCC AACAGGCGAA AAATGCTGCC TACTCAGTTT CTCTTCCTCC TGGGTGTGTT 480 

GGGCATCTTT GGCCTCACCT TCGCCTTCAT CATCGGACTG GACGGGAGCA CAGGGCCCAC 540 

ACGCTTCTTC CTCTTTGGGA TCCTCTTTTC CATCTGCTTC TCCTGCCTGC TGGCTCATGC 600 

TGTCAGTCTG ACCAAGCTCG TCCGGGGGAG GAAGCCCCTT TCCCTGTTGG TGATTCTGGG 660 

TCTGGCCGTG GGCTTCAGCC TAGTCCAGGA TGTTATCGCT ATTGAATATA TTGTCCTGAC 720 

CATGAATAGG ACCAACGTCA ATGTCTTTTC TGAGCTTTCC GCTCCTCGTC GCAATGAAGA 780 

CTTTGTCCTC CTGCTCACCT ACGTCCTCTT CTTGATGGCG CTGACCTTCC TCATGTCCTC 840 

CTTCACCTTC TGTGGTTCCT TCACGGGCTG GAAGAGACAT GGGGCCCACA TCTACCTCAC 900 

GATGCTCCTC TCCATTGCCA TCTGGGTGGC CTGGATCACC CTGCTCATGC TTCCTGACTT 960 

TGACCGCAGG TGGGATGACA CCATCCTCAG CTCCGCCTTG GCTGCCAATG GCTGGGTGTT 1020 

CCTGTTGGCT TATGTTAGTC CCGAGTTTTG GCTGCTCACA AAGCAACGAA ACCCCATGGA 1080 

TTATCCTGTT GAGGATGCTT TCTGTAAACC TCAACTCGTG AAGAAGAGCT ATGGTGTGGA 1140 

GAACAGAGCC TACTCTCAAG AGGAAATCAC TCAAGGTTTT GAAGAGACAG GGGACACGCT 1200 

CTATGCCCCC TATTCCACAC ATTTTCAGCT GCAGAACCAG CCTCCCCAAA AGGAATTCTC 1260 

CATCCCACGG GCCCACGCTT GGCCGAGCCC TTACAAAGAC TATGAAGTAA AGAAAGAGGG 1320 

CAGCTAACTC TGTCCTGAAG AGTGGGACAA ATGCAGCCGG GCGGCAGATC TAGCGGGAGC 1380 

TCAAAGGGAT GTGGGCGAAA TCTTGAGTCT TCTGAGAAAA CTGTACAAGA CACTACGGGA 1440 

ACAGTTTGCC TCCCTCCCAG CCTCAACCAC AATTCTTCCA TGCTGGGGCT GATGTGGGCT 1500 

AGTAAGACTC CAGTTCTTAG AGGCGCTGTA GTATTTTTTT TTTTTTGTCT CATCCTTTGG 1560 

ATACTTCTTT TAAGTGGGAG TCTCAGGCAA CTCAAGTTTA GACCCTTACT CTTTTTGTTT 1620 

GTTTTTTGAA ACAGGATCTT GCTCTGTCAC CCAGGCTTGA GTGCAGTGGT GCGATCACAG 1680 

CCCAGTGCAG CCTCGACCAC CTGTGCTCAA GCAATCCTCC CATCTCCATC TCCCAAAGTG 1740 

CTGGGATGAC AGGCGTGAGC CACAGCTCCC AGCCTAGGCC CTTAATCTTG CTGTTATTTT 1800 

CCATGGACTA AAGGTCTGGT CATCTGAGCT CACGCTGGCT CACACAGCTC TAGGGGCCTG 1860 

CTCCTCTAAC TCACAGTGGG TTTTGTGAGG CTCTGTGGCC CAGAGCAGAC CTGCATATCT 1920 

GAGCAAAAAT AGCAAAAGCC TCTCTCAGCC CACTGGCCTG AATCTACACT GGAAGCCAAC 1980 
TTGCTGGCAC CCCCGCTCCC CAACCCTTCT TGCCTGGGTA GGAGAGGCTA A 



TAAATTTACT CATCTCTCTA GTGCTGCCTC ACATTGGGCC TCAGOAGCTC CCCAGCACCA 2100 

ATTCACAGGT CACCCCTCTC TTCTTGCACT GTCCCCAAAC TTGCTGTCAA TTCCGAGATC 2160 

TAATCTCCCC CTACGCTCTG CCAGGAATTC TTTCAGACCT CACTAGCACA AGCCCGGTTG 2220 

CTCCTTGTCA GGAGAATTTG TAGATCATTC TCACTTCAAA TTCCTGGGGC TGATACTTCT 2280 

CTCATCTTGC ACCCCAACCT CTGTAAATAG ATTTACCGCA TTTACGGCTG CATTCTGTAA 2340 

GTGGGCATGG TCTCCTAATG GAGGAGTGTT CATTGTATAA TAAGTTATTC ACCTGAGTAT 2400 
GCAATAAAGA TGTGGTGGCC ACTCTTTCAT GGTGGTGGCA GCAAAAAAAA AAAAAA 

Seq ID NO: 620 Protein sequence 



| | | | | | 

MATTVPDGCR NGLKSKYYRL CDKAEAWGIV LETVATAGVV TSVAFMLTLP ILVCKVQDSN 
RRKMLPTQFL FLLGVLGIFG LTFAFIIGLD GSTGPTRFFL FGILFSICFS CLLAHAVSLT 
KLVRGRKPLS LLVILGLAVG FSLVQDVIAI EYIVLTMNRT NVNVFSELSA PRRNEDFVLL 
LTYVLFLMAL TFLMSSFTFC GSFTGWKRHG AHIYLTMLLS IAIWVAWITL LMLPDFDRRW 
DDTILSSALA ANGWVFLLAY VSPEFWLLTK QRNPMDYPVE DAFCKPQLVK KSYGVENRAY 
SQEEITQGFE ETGDTLYAPY STHFQLQNQP PQKEFSIPRA H 

Seq ID NO: 621 DNA sequence 
Nucleic Acid Accession #: NM_002423.2 
Coding sequence: 48..851 



ACCAAATCAA CCATAGGTCC A 
TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 
GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA T 



TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 



C AGGATCGTAT C 

TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 
GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 
ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 
GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 
GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 
CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 
AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA A 
GAAAGAAATA GAAACTTCAG GCAGAACATC CATTCATTCA T 
TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 
CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 
ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 

Seq ID NO: 622 Protein sequence 



Protein Accession #: NP_002414.1 



MRLTVLCAVC LLPGSLALPL PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 
EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKVVT YRIVSYTRDL 
PHITVDRLVS KALNMWGKEI PLHFRKVVWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 
APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 
PQNFKLSQDD IKGIQKLYGK RSNSRKK 



Coding sequence: 204..956 



AAACAGGAAA TAAATACGAA TGAAACTGAG CTCTAAGCAG CATGTAACCT GGCCTGCATC 60 

CAGGAAATAG AGGACTTCGG ATCCTTCTAA CCCTACCACC CAACTGGCCC CAGTACATTC 120 

ATTCTCTCAG GAAAAAAAAC AAGGTCCCCA CAGCAAAGAA AAGGAATAGG ATCAAGAGAT 180 

ACGTGGCTGC TGGCAGAGCA AGCATGAATT CGATGACTTC AGCAGTTCCG GTGGCCAATT 240 

CTGTGTTGGT GGTGGCACCC CACAATGGTT ATCCTGTGAC CCCAGGAATT ATGTCTCACG 300 

TGCCCCTGTA TCCAAACAGC CAGCCGCAAG TCCACCTAGT TCCTGGGAAC CCACCTAGTT 360 



I GGCCTGGCTC ACATCGGCCT CGGCTCCATC ATGGCGACGG 480 

TTCTCGTAGG GGAATACCTG TCTATTTCAT TCTACGGAGG CTTTCCCTTC TGGGGAGGCT 540 

3 GGCTTGAACA TCGTCAGTGC AATCTGCTCT GCAGTTGGAG 

TCATACTCTT CATCACAGAT CTAAGTATTC CCCACCCATA TGCCTACCCC G 
CTTACGCCTG GGGTGTGAAC CCTGGAATGG CGATTTCTGG C 

TCCTGGAGTT TGGCATCGCA TGCGCATCTT CCCACTTTGG CTGCCAGTTG GTCTGCTGTC 
AATCAAGCAA TGTGAGTGTC ATCTATCCAA ACATCTATGC AGCAAACCCA GTGATCACCC 



TCTCACCTTC ATTCTTCAAT TCAGTCTAGG AAACCATGCT GTTTCTCTAT CAAGAAGAAG 
ACAGAGATTT TAAACAGATG TTAACCAAGA GGGACTCCCT AGGGCACATG CATCAGCACA 
TATCTGGGCA TCCAGCCTCT GGGGCCTTGG CACACACACA TTCGTGTGCT C 
MNSMTSAVPV ANSVLVVAPH NGYPVTPGIM SHVPLYPNSQ PQVHLVPGNP PSLVSMVNGQ 

PVQKALKEGK TLGAIQIIIG LAHIGLGSIM ATVLVGEYLS ISFYGGFPFW GGLWFIISGS 

LSVAAENQPY SYCLLSGSLG LNIVSAICSA VGVILFITDL SIPHPYAYPD YYPYAWGVNP 

GMAISGVLLV FCLLEFGIAC ASSHFGCQLV CCQSSNVSVI YPNIYAANPV ITPEPVTSPP 
SYSSEIQANK 



ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 
TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 
TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACCG CTACTGCTCT 
CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 
AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG A 
TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG C 
GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC C 
AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTC 
TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGA 
GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 
ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCGAATGG CGTGTAACTC GCCGCAGTCT 
CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 
CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 
ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGCCTCCTT ACAGCACCCG 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 



Protein Accession #: NP_005212.1 

1 11 21 31 41 51 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPEGYCS 

PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 

EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTC 

VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 



Seq ID NO: 627 DNA sequence 



GCACGAGAGA CGACGTGCTG AGCTGCCAGC TTAGTGGAAG CTCTGCTCTG GGTGGAGAGC 
AGCCTCGCTT TGGTGACGCA CAGTGCTGGG ACCCTCCAGG AGCCCCGGGA TTGAAGGATG 
GTGGCGGCCG TCCTGCTGGG GCTGAGCTGG CTCTGCTCTC CCCTGGGAGC TCTGGTCCTG 
GACTTCAACA ACATCAGGAG CTCTGCTGAC CTGCATGGGG CCCGGAAGGG CTCACAGTGC 
CTGTCTGACA CGGACTGCAA TACCAGAAAG TTCTGCCTCC AGCCCCGCGA TGAGAAGCCG 
TTCTGTGCTA CATGTCGTGG GTTGCGGAGG AGGTGCCAGC GAGATGCCAT GTGCTGCCCT 
GGGACACTCT GTGTGAACGA TGTTTGTACT ACGATGGAAG ATGCAACCCC AATATTAGAA 
AGGCAGCTTG ATGAGCAAGA TGGCACACAT GCAGAAGGAA CAACTGGGCA CCCAGTCCAG 
GAAAACCAAC CCAAAAGGAA GCCAAGTATT AAGAAATCAC AAGGCAGGAA GGGACAAGAG 
GGAGAAAGTT GTCTGAGAAC TTTTGACTGT GGCCCTGGAC TTTGCTGTGC TCGTCATTTT 
TGGACGAAAA TTTGTAAGCC AGTCCTTTTG GAGGGACAGG TCTGCTCCAG AAGAGGGCAT 
AAAGACACTG CTCAAGCTCC AGAAATCTTC CAGCGTTGCG ACTGTGGCCC TGGACTACTG 
TGTCGAAGCC AATTGACCAG CAATCGGCAG CATGCTCGAT TAAGAGTATG C 
GAAAAGCTAT AAATATTTCA AAATAAAGAA GAATCCACAT TGCAAAAAAA A 



Seq ID NO: 628 Protein sequence 



MVAAVLLGLS WLCSPLGALV LDFNNIRSSA DLHGARKGSQ CLSDTDCNTR KFCLQPRDEK 

PFCATCRGLR RRCQRDAMCC PGTLCVNDVC TTMEDATPIL ERQLDEQDGT HAEGTTGHPV 

QENQPKRKPS IKKSQGRKGQ EGESCLRTFD CGPGLCCARH FWTKICKPVL LEGQVCSRRG 
HKDTAQAPEI FQRCDCGPGL LCRSQLTSNR QHARLRVCQK IEKL 
GCGCGAGTGC TCCCGGGAAC TCTGCCTGCG CGGCGGCAGC GACCGGAGGC CAGGCCCAGC 

G GGAGGCGCGC GCGGGAGGGT CCGCCCGGCC 

C GCTCCCAGCC CGCCCGGAGC CCATGCCCGG 



GAGGAGGGGG CCAAGCCCAA AGTGTCCCCT TCGCTCCTGC CCTTCAGCGT GGAGGCGCTC 420 

ATGGCCGACC ACAGGAAGCC GGGGGCCAAG GAGAGCGCCC TGGCGCCCTC CGAGGGCGTG 480 

CAGGCGGCGG GTGGCTCGGC GCAGCCACTG GGCGTCCCGC CGGGGTCGCT GGGAGCCCCG 540 

GACGCGCCCT CTTCGCCGCG GCCGCTCGGC CATTTCTCGG TGGGGGGACT CCTCAAGCTG 600 

G CGCTCGTCAA AGCCGAGAGC CCCGAGAAGC CCGAGAGGAC CCCGTGGATG 660 



CGCAAACACA AGACGAACCG TAAGCCGCGG ACGCCCTTCA CCACCGCGCA GCTGCTGGCG 
CTGGAGCGCA AGTTCCGCCA GAAGCAGTAC CTGTCCATCG CCGAGCGCGC GGAGTTCTCC 
AGCTCGCTCA GCCTCACTGA GACGCAGGTG AAGATATGGT TCCAGAACCG CCGCGCCAAG 
GCAAAGAGAC TACAAGAGGC AGAGCTGGAG AAGCTGAAGA TGGCCGCCAA GCCCATGCTG 
CCACCGGCTG CCTTCGGCCT CTCCTTCCCT CTCGGCGGCC CCGCAGCTGT AGCGGCCGCG 
GCGGGTGCCT CGCTCTACGG TGCCTCTGGC CCCTTCCAGC GCGCCGCGCT G 
CCCGTGGGAC TCTACACGGC CCATGTGGGC TACAGCATGT A 
CCAGGTCCCC ACCTGTGGGC CAGCCGATTC CTCCAGCCCT G 
CTCCCCTGCT CGGCACCGCC AGCCGCCTTC CCTTTAACCC TCACACTGCT CCAGTTTCAC 1260 
CTCTTTGCTC CCTGAGTTCA CTCTCCGAAG TCTGATCCCT GCCAAAAAGT GGCTGGAAGA 1320 
GTCCCTTAGT ACTCTTCTAG CATTTAGATC TACACTCTGG A 
AGGGCAGAGA GGTTAACAGA TTTATCTAGG GTCCCCAGCA GAATTGACAG T 
CTAGAGGCCA TGTCTCCTGC ATAGCTTTTC CCTGTCCTGA CACCAGGCAA GAAAAGCGCA 1500 
GAGAAATCGG TGTCTGACGA TTTTGGAAAT GAGAACAATC TCAAAAAAAA AAAAAAAAAA 1560 
AAAAAAAAAA GAAAAGAGAA AAAAAAGACT AGCCAGCCAG GAAGATGAAT CCTAGCTTCT 1620 
TCCATTGGAA AATTTAAGAC AAGTTCAACA ACAAAACATT TGCTCTGGGG GGCAGGGAAA 1680 
ACACAGATGT GTTGCAAAGG TAG 

Seq ID NO: 630 Protein sequence 



Protein Accession #: 
1 11 21 31 

| | | | | | 

MTSLPLGVKV EDSAFGKPAG GGAGQAPSAA AATAAAMGAD EEGAKPKVSP SLLPFSVEAL 

MADHRKPGAK ESALAPSEGV QAAGGSAQPL GVPPGSLGAP DAPSSPRPLG HFSVGGLLKL 

PEDALVKAES PEKPERTPWM QSPRFSPPPA RRLSPPACTL RKHKTNRKPR TPFTTAQLLA 

LERKFRQKQY LSIAERAEFS SSLSLTETQV KIWFQNRRAK AKRLQEAELE KLKMAAKPML 

PPAAFGLSFP LGGPAAVAAA AGASLYGASG PFQRAALPVA PVGLYTAHVG YSMYKLT 

Seq ID NO: 631 DNA sequence 
Nucleic Acid Accession #: NM_002557.1 
Coding sequence: 13..2049 

1 11 21 31 41 51 

| | | | | | 

CAGACCATTG AGATGTGGAA GCTGTTGCTG TGGGTTGGGC TGGTTCTTGT GCTGAAACAC 
CACGATGGTG CTGCCCATAA ACTCGTGTGT TATTTCACCA ACTGGGCACA CAGTCGGCCA 
GGCCCTGCCT CGATCTTGCC CCATGACCTG GACCCCTTTC TCTGCACCCA CCTGATATTT 
GCCTTTGCCT CAATGAACAA CAATCAGATT GTTGCTAAGG ATCTCCAGGA TGAGAAAATT 
CTCTACCCAG AGTTCAACAA ACTAAAGGAG AGGAACAGAG AGCTGAAAAC ACTACTGTCC 
ATCGGCGGGT GGAACTTTGG CACCTCAAGA TTCACCACTA TGTTGTCCAC ATTTGCCAAC 
CGTGAAAAGT TTATTGCTTC AGTTATATCC CTTCTGAGGA CACATGACTT TGATGGTCTT 
GACCTTTTCT TCTTATATCC TGGACTAAGA GGCAGCCCCA TGCATGACCG GTGGACTTTT 
CTCTTCTTAA TTGAAGAGCT CCTGTTTGCC TTCCGGAAGG AGGCACTGCT CACCATGCGC 
CCGAGGCTGC TGCTGTCTGC TGCTGTTTCT GGGGTCCCAC ACATCGTCCA AACATCCTAT 
GATGTGCGCT TTCTAGGAAG ACTCCTGGAT TTCATCAATG TCTTGTCTTA TGACTTACAT 
GGAAGTTGGG AAAGGTTCAC AGGACATAAT AGCCCCCTCT TCTCTCTGCC TGAAGACCCC 
AAATCTTCGG CATATGCTAT GAATTATTGG AGAAAGCTTG GGGCACCCTC AGAGAAGCTC 
ATCATGGGGA TCCCCACCTA TGGACGTACC TTTCGCCTCC TCAAAGCCTC TAAGAATGGG 
TTGCAGGCCA GAGCGATCGG ACCAGCATCT CCAGGGAAGT ACACCAAGCA AGAAGGCTTC 
TTGGCTTATT TTGAGATTTG TTCCTTTGTC TGGGGAGCGA AGAAGCACTG GATTGATTAC 
CAGTATGTCC CGTATGCCAA CAAGGGGAAA GAGTGGGTTG GCTATGACAA TGCCATCAGC 
TTCAGTTACA A 1 

G ATGACGTCAG GGGCACGTTC TGTGGCACTG G 

T GCGGGCTGAG TTCAGTTCAA CTTCTTTACC A 
CTGTCATCTG CTGTGAATTC TTCAAGCACT GACCCTGAAA GGCTGGCTGT G 

G ATAGTAAGAT TTTGCCCCCA GGAGGAGAGG CTGGGGTCAC TGAGATCCAC 1320 
Z TATAACCCCT AGAGGTACAA CTGTGACCCC TACAAAGGAA 1380 
ACTGTATCCC TTGGAAAGCA CACTGTAGCT CTAGGAGAGA AGACTGAGAT CACTGGGGCA 1440 
ATGACCATGA CTTCTGTGGG TCATCAGTCC ATGACCCCTG GAGAGAAGGC CCTGACCCCT 1500 
GTGGGTCATC AATCTGTGAC CACTGGACAG AAGACCCTGA CCTCTGTGGG TTATCAGTCT 1560 
GTGACCCCTG GGGAAAAGAC CCTGACCCCT GTGGGTCATC AGTCTGTGAC CCCTGTGAGT 1620 
CATCAGTCTG TGAGCCCTGG AGGAACGACT ATGACCCCTG TCCATTTTCA GACTGAGACC 1680 
CTTAGACAGA ATACAGTGGC CCCTAGAAGG AAGGCTGTGG CCCGTGAAAA GGTGACTGTC 1740 
CCCTCCAGAA ACATATCAGT CACCCCTGAA GGGCAGACTA TGCCTTTAAG AGGGGAGAAT 1800 
TTGACTTCTG AGGTGGGCAC TCACCCCAGG ATGGGTAACT TGGGTCTTCA GATGGAAGCT 1860 
GAAAACAGGA TGATGCTGTC CTCCAGCCCC GTCATCCAGC TCCCGGAACA AACTCCTCTA 1920 
GCTTTTGACA ACCGCTTTGT TCCCATCTAT GGAAACCATT CCTCTGTCAA CTCAGTAACC 1980 
CCTCAAACAA GTCCTCTTTC TCTAAAAAAA GAAATCCCAG AAAACTCTGC TGTGGATGAA 2040 
GAAGCCTAAG CCCCTCTGGT GTCAGAAACC AGGGAAAACC CTTGTCTTTT CTTCTAAGTG 2100 
ACATGTTGGA AGCCTTCTCA TCCCGGGGCA AAGCAGGCAT CAAAACCAGA ATAGGCCAAT 2160 
A TTAAATAAAC 



1 11 21 31 41 51 

| | | | | | 

MWKLLLWVGL VLVLKHHDGA AHKLVCYFTN WAHSRPGPAS ILPHDLDPFL CTHLIFAFAS 
MNNNQIVAKD LQDEKILYPE FNKLKERNRE LKTLLSIGGW NFGTSRFTTM LSTFANREKF 
IASVISLLRT HDFDGLDLFF LYPGLRGSPM HDRWTFLFLI EELLFAFRKE ALLTMRPRLL 
LSAAVSGVPH IVQTSYDVRF LGRLLDFINV LSYDLHGSWE RFTGHNSPLF SLPEDPKSSA 
YAMNYWRKLG APSEKLIMGI PTYGRTFRLL KASKNGLQAR AIGPASPGKY TKQEGFLAYF 
EICSFVWGAK KHWIDYQYVP YANKGKEWVG YDNAISFSYK AWFIRREHFG GAMVTOLDMD 
DVRGTFCGTG PFPLVYVLND ILVRAEFSST SLPQFWLSSA VNSSSTDPER LAVTTAWTTD 
SKILPPGGEA GVTEIHGKCE NMTITPRGTT VTPTKETVSL GKHTVALGEK TEITGAMTMT 
SVGHQSMTPG EKALTPVGHQ SVTTGQKTLT SVGYQSVTPG EKTLTPVGHQ SVTPVSHQSV 
SPGGTTMTPV HFQTETLRQN TVAPRRKAVA REKVTVPSRN ISVTPEGQTM PLRGENLTSE 
VGTHPRMGNL GLQMEAENRM MLSSSPVIQL PEQTPLAFDN RFVPIYGNHS SVNSVTPQTS 
PLSLKKEIPE NSAVDEEA 

Seq ID NO: 633 DNA sequence 



TTTTCGCGGG CTCGGTGAGC GGTTTTATCC 



TCCCAGCTAC CGGAAGGCCA CGCTGTTTGA GGATGGCGCG G 
GGCCGTACAG AACAGCAAGA ACGCCAAGGA CAAGAACCTG AAGCGCCACT CCATCATCTC 



GCAGCCTAAC AGCAGCTACC AGAACAACAT CACGCACCTC AACAATGAGA A 

GTCGCTGTCG TGCGCCAACC TGTCCACATT CGCCCAGCCC CCACCGGCCC AGCCGCCTGC 420 

ACCCCCGGCC AGCCAGCTCT CGGGTTCCCA GACCGGGGGC TCCTCCTCAG TCAAGAAAGC 480 

CCCTCACCCT GCCGTCACCT CCGCAGGGAC GCCCAAACGG GTCATCGTCC AGGCGTCCAC 540 

CAGTGAGCTG CTTCGCTGCC TGGGTGAGTT TCTCTGCCGC CGGTGCTACC GCCTGAAGCA 600 

CCTCTCCCCC ACGGACCCCG TGCTCTGGCT GCGCAGCGTG GACCGCTCGC TGCTTCTGCA 660 

GGGCTGGCAG GACCAGGGCT TCATCACGCC GGCCAACGTG GTCTTCCTCT ACATGCTCTG 720 

CAGGGATGTT ATCTCCTCCG AGGTGGGCTC GGATCACGAG CTCCAGGCCG TCCTGCTGAC 780 

ATCCCTCTAC CTCTCCTACT CCTACATGGG CAACGAGATC TCCTACCCGC TCAAGCCCTT 840 

CCTGGTGGAG AGCTGCAAGG AGGCCTTTTG GGACCGTTGC CTCTCTGTCA TCAACCTCAT 900 

GAGCTCAAAG ATGCTGCAGA TAAATGCCGA CCCACACTAC TTCACACAGG TCTTCTCCGA 960 

CCTGAAGAAC GAGAGCGGCC AGGAGGACAA GAAGCGGCTC CTCCTAGGCC TGGATCGGTG 1020 

G CCTGCGTCAT GGCTCAAGGA TTCAATGCAT TTTTAAGAAT TTATTATTAA 1080 



1 11 21 31 41 51 

| | | | | | 

IG i. = LSPS YRKATLFEDG AATVGHYTAV QNSKNAKDKN LKRHSIISVL PWKRIVAVSA 
KKKNSKKVQP NSSYQNNITH LNNENLKKSL SCANLSTFAQ PPPAQPPAPP ASQLSGSQTG 
GSSSVKKAPH PAVTSAGTPK RVIVQASTSE LLRCLGEFLC RRCYRLKHLS PTDPVLWLRS 
VDRSLLLQGW QDQGFITPAN VVFLYMLCRD VISSEVGSDH ELQAVLLTCL YLSYSYMGNE 
ISYPLKPFLV ESCKEAFWDR CLSVINLMSS KMLQINADPH YFTQVFSDLK NESGQEDKKR 
LLLGLDR 
Unique Eos probeset identifier number 



UnigeneID: Unigene number 
Unigene Title: Unigene gene title 



447377 
446921 
418888 
432179 



408243 
426088 
439738 
419741 






on in Table 79A to sequences in Table 80 

Seq ID No. 



Pkey ExAccn UnigeneID Unigene Title 

424212 NM_005814 Hs.143131 glycoprotein A33 (tn 

424503 NM_002205 Hs.149609 integrin, alpha 5 (fibronectin rec 

418007 M13509 Hs.83169 matrix metalloproteinase 1 (interstitial 

418007 M13509 Hs.83169 matrix metalloproteinase 1 (interstitial 

418738 AW388633 Hs.6682 solute carrier family 7, (cationic amino 

443646 AI085198 Hs.164226 Thrombospondin 1 

409956 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 

444381 BE387335 Hs.283713 hypothetical protein BC014245 

421582 AI910275 Hs.350470 trefoil factor 1 (breast cancer, estroge 

411789 AF245505 Hs.72157 Adlican 

452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 

428698 AA852773 Hs.334838 KIAA1866 protein 

421552 AF026692 Hs.105700 secreted frizzled-related protein 4 



447033 

411975 
434206 
423936 
447400 
449032 
415214 



410418 
446342 
411274 
104978 
422260 
409041 
420344 



X77343 

AB012113 

AU076801 

X75208 

AF239666 

AW630041 

AI357412 

AI357412 

AI916058 

AW136973 



AB033025 
BE463721 
AF027208 



Hs.298241 
Hs.334334 
Hs.16530 
Hs.89436 
Hs.2913 
Hs.1545 
Hs.56937 
Hs.157601 
Hs.157601 
Hs.144583 
Hs.362915 
Hs.135639 
Hs.18457 
Hs.22900 
Hs.125124 
Hs.333893 
Hs.288126 
Hs.63325 
Hs.14846 
Hs.69423 
Hs.19322 
Hs.105484 
Hs.50081 
Hs.97101 



ne3 

transcription factor AP-2 alpha 
small inducible cytokine subfamily A (Cy 
cadherin 17, LI cadherin (liver-intestin 
EphB3 

caudal type homeo box transcription fact 
suppression of tumorigenicity 14 (colon 
Predicted gene: Eos cloned; secreted wW 
Predicted gene: Eos cloned; secreted w/V 
3'UTR of: dead ringer (Drosophila)-like 
ESTs, Weakly similar to S59890 mitogen i 
achaete-scute complex (Drosophila) homol 
hypothetical protein FLJ20315 
nuclear factor (erythroid-derived 2)-lik 



solute carrier family 7 (cationic amino 
kallikrein 10 

Homo sapiens, Similar to RIKEN cDNA 2010 
regenerating gene type IV 
Hypothetical protein, XP_051860 (KIAA119 
putative G protein-coupled receptor 
prominin (mouse)-like 1 



AW190902 

BE001836 

Y00787 

AF038007 

BE246502 

NM_007019 

AA305384 

AA804698 



AI636208 
NM_005408 
AI005163 
AW204600 



Hs.256897 putative GPCR 
Hs.624 interleukin 8 
Hs.166196 ATPase, Class I, type 8B, member 1 



442275 
431745 
431745 
453142 
421659 



NM_006433 

AW842182 

C17898 

AA251131 

AU077260 



Hs.93002 
Hs.25740 
Hs.82547 
Hs.103982 
Hs.372210 
Hs.75182 
Hs.164225 
Hs.96901 
Hs.96901 
Hs.11383 
Hs.201378 
Hs.355452 
Hs.160435 
Hs.54795 
Hs.163425 



Hs.106511 
Hs.10306 
Hs.105806 
Hs.241392 



ubiquitin carrier protein E2-C 
ERO1 (S. cerevisiae)-like 
retinoic acid receptor responder (tazaro 
small inducible cytokine subfamily B (Cy 



hypothetical protein FLJ23049 

hypothetical protein FLJ23049 

small inducible cytokine subfamily A (Cy 

Homo sapiens cDNA FLJ40427 fis 

HUMPSPBA Human pulmonary surfactant-as 

MDAC1 

Homo sapiens secretoglobin, family 3A, m 



Novel FGENESH predicted cadherin repeat 
Homo sapiens gap junction protein, alpha 
protocadherin 17 

natural killer cell group 7 sequence 
granulysin 

small inducible cytokine A5 (RANTES) 
Homo sapiens up-regulated by BCG-CWS (LO 
Homo sapiens tryptophanyl-tRNA synthetas 
death-associated protein kinase 1 



Seq ID No. C1 & C217 
Seq ID No. C2 & C218 
Seq ID No. C3 & C219 
Seq ID No. C4 & C220 
Seq ID No. C5 & C221 
Seq ID No. C6 & C222 
Seq ID No. C7 & C223 
Seq ID No. C8 & C224 
Seq ID No. C9 & C225 
Seq ID No. C10 & C226 
Seq ID No. C11 & C227 
Seq ID No. C12 
Seq ID No. C13 & C228 
Seq ID No. C14 & C229 
Seq ID No. C15 & C230 
Seq ID No. C16 & C231 
Seq ID No. C17 & C232 
Seq ID No. C18 & C233 
Seq ID No. C19 & C234 
Seq ID No. C20 & C235 
Seq ID No. C21 & C236 
Seq ID No. C22 & C237 
Seq ID No. C23 & C238 
Seq ID No. C24 & C239 
Seq ID No. C25 & C240 
Seq ID No. C26 & C241 
Seq ID No. C27 & C242 
Seq ID No. C28 & C243 
Seq ID No. C29 & C244 
Seq ID No. C30 & C245 
Seq ID No. C31 & C246 
Seq ID No. C32 & C247 
Seq ID No. C33 & C248 
Seq ID No. C34 & C249 
Seq ID No. C35 & C250 
Seq ID No. C36 & C251 
Seq ID No. C37 & C252 
Seq ID No. C38 & C253 
Seq ID No. C39 & C254 
Seq ID No. C40 & C255 
Seq ID No. C41 & C256 
Seq ID No. C42 & C257 
Seq ID No. C43 & C258 
Seq ID No. C44 & C259 
Seq ID No. C45 & C260 
Seq ID No. C46 & C261 
Seq ID No. C47 & C262 
Seq ID No. C48 & C263 
Seq ID No. C49 & C264 
Seq ID No. C50 & C265 
Seq ID No. C51 & C266 
Seq ID No. C52 & C267 



Seq ID No. C54 & C269 
Seq ID No. C55 & C270 
Seq ID No. C56 & C271 
Seq ID No. C57 & C272 
Seq ID No. C58 & C273 
Seq ID No. C59 & C274 
Seq ID No. C60 & C275 
Seq ID No. C61 & C276 
Seq ID No. C62 & C277 
Seq ID No. C63 & C278 
Seq ID No. C64 & C279 
Seq ID No. C65 & C280 
Seq ID No. C66 & C281 
Seq ID No. C67 & C282 
Seq ID No. C68 & C283 
Seq ID No. C69 & C284 
Seq ID No. C70 & C285 
Seq ID No. C71 & C286 
Seq ID No. C72 & C287 



430890 


X54232 


Hs.2699 


glypican 1 


Seq ID No. C73 & C288 


452401 


NM_007115 


Hs.29352 


tumor necrosis factor, alpha-induced pro 


Seq ID NO.C74&C289 






Hs.199067 


v-erb-b2 avian erythroblastic leukemia v 


Seq IDNO.C75&C290 




X66839 


Hs.63287 


carbonic anhydrase IX 


Seq ID No. C76 & C291 


418526 


BE019020 


Hs.85838 


solute carrier family 16 (monocarboxylic 


Seq ID No. C77 & C292 


422627 


BE336857 


Hs.118787 


transforming growth factor, beta-induced 


Seq ID No. C78 & C293 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


Seq ID No. C79 & C294 


423673 


BE003054 




matrix metalloproteinase 12 (macrophage 


Seq ID No. C80 & C295 


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


Seq ID No. C81 & C296 


431846 


BE019924 


Hs.271580 


uroplakin 1B 


Seq ID No. C82 & C297 


431958 


X63629 




cadherin 3, type 1, P-cadherin (placenta 


Seq ID No. C83 & C298 


448733 


NM_005629 


Hs.187958 


solute carrier family 6 (neurotransmitte 


Seq ID No. C84&C299 


426440 


BE382756 


Hs.169902 


solute carrier family 2 (facilitated glu 


Seq ID No. C85 & C300 


428434 


AF104032 


Hs.184601 


solute carrier family 7 (cationic amino 


Seq ID No. C86 & C301 


429211 


AF052693 


Hs.198249 


gap junction protein, beta 5 (connexin 3 


Seq ID No. C87 & C302 


423634 


AW959908 




heparin-binding growth factor binding or 


Seq ID No. C88&C303 


457819 


M057484 


Hs.35406 


FLJ20522 Hypothetical protein FLJ20522 


Seq ID No. C89 & C304 


424687 


J05070 


Hs.151738 


matrix metalloproteinase 9 (gelatinase B 


Seq ID No. C90 & C305 


418462 


BE001596 


Hs.85266 


integrin, beta 4 


Seq ID No. C91 & C306 


439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


Seq ID No. C92 & C307 


407720 


AB037776 


Hs.38002 


immunoglobulin superfamily, member 9 


Seq ID No. C93 & C308 


418543 


NM_005329 


Hs.85962 


hyaluronan synthase 3 


Seq ID No. C94 & C309 


417512 


X76534 


Hs.82226 


glycoprotein (transmembrane) nmb 


Seq ID No. C95 & C310 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


Seq ID NO.C96&C311 


415817 


U88967 


Hs.78857 


protein tyrosine phosphatase, receptor-t 


Seq ID NO.C97&C312 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


Seq ID No. C98 & C313 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


Seq ID No. C99 & C314 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


Seq ID NO.C100&C315 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


Seq ID No. C101 & C316 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


Seq ID No. C102&C317 


415817 


U88967 


Hs.78867 


Seq ID No. C103&C318 


421817 




Hs.108660 


ATP-binding cassette, sub-family C(CFTR 


Seq IDNO.C104&C319 




AF146074 


Hs.108660 


ATP-binding cassette, sub-family C(CFTR 


Seq ID No. C105&C320 


409420 


Z15008 


Hs.54451 


laminin, gamma 2 (nicein (100kD), kalini 


Seq ID No. C106&C321 




AF134160 


Hs.7327 


claudin 1 


Seq ID No. C107 & C322 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, 


Seq ID No. C108 & C323 


408790 




Hs.47860 


neurotrophic tyrosine kinase, receptor, 


Seq ID No. C109 & C324 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, 


Seq IDNo. C110&C325 


450701 


H39960 




hypothetical protein XP_098151 (leucine- 


Seq ID No. C111 & C326 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


Seq ID No. C112 & C327 


413691 


AB023173 




ATPase, Class VI, type 11B 


Seq ID No. C113 & C328 




AL080235 


Hs.35861 


Ras-induced senescence 1 (RIS1) 


Seq ID No. C114 & C329 


449101 


AA205847 


Hs.23016 


G protein-coupled receptor 


Seq ID No. C115&C330 


429263 




Hs.198396 


ATP-binding cassette, sub-family A (ABC1 


Seq ID No. C116 & C331


421474 


U76362 


Hs.104637 


solute carrier family 1 (glutamate trans


Seq ID No. C117&C332 


421753 


BE314828 




ATP-binding cassette, sub-family B (MDR/ 


Seq ID No. C118 & C333


408482 


NM_000676


Hs.45743 


adenosine A2b receptor 


Seq ID No. C119 & C334








PORIMIN Pro-oncosis receptor inducing me 


Seq ID No. C120 & C335 


429736 


AF125304 


Hs.212680 


tumor necrosis factor receptor superfami 


Seq ID No. C121&C336 


430985 


AA490232 


Hs.27323 


ESTs, Weakly similar to 178885 serine/th 


Seq ID No. C122 & C337 


431890 


X17033 


Hs.271986 


integrin, alpha 2 (CD49B, alpha 2 subuni


Seq ID NO.C123&C338 


432583 


AW023624 


Hs.162282 


potassium channel TASK-4; potassium chan 


Seq ID No. C124&C339 


446872 


X97058 


Hs.16362 


pyrimidinergic receptor P2Y, G-protein c 


Seq ID No. C125 & C340


453102 


NM_007197


Hs.31664 


frizzled (Drosophila) homolog 10


Seq ID No. C126 & C341


428513 


BE220806 


Hs.184697 


plexin C1

interleukin 7 receptor 


Seq ID No. C127&C342 


430280 


AA361258 


Hs.237868 


Seq ID No. C128 & C343


428486 


AW583497 


Hs.184604 


pancreatic polypeptide 


Seq ID No. C129 & C344


457489 


AI693815 


Hs.127179




Seq ID No. C130 & C345


432874 


W94322 


Hs.279651 


melanoma inhibitory activity 


Seq ID No. C131 & C346



429547 
404287 
404287 
404287 
418318 
444754 
432596 
444006 
428505 
448844 
448844 
428392 
448030 
422109 



AW009166 Hs.99376 



U47732 
T83911 
AJ224741 



FGENESH predicted novel secreted protein 
FGENESH predicted novel CUB-domain conta 
FGENESH predicted novel CUB-domain conta 
FGENESH predicted novel CUB-domain conta 



transmembrane 4 superfamily member 4 



Hs.334762 type I transmembrane protein Fn14 
Hs.2281 chromogranin B (secretogranin 1) 
Hs.177164 FGENESH predicted novel cell surface pr
' :oll surface pr 



N30714 Hs.325960 membrane-spanning 4-domains, subfamily A
S73265 Hs.1473 gastrin-releasing peptide 
Z45051 Hs.22920 similar to S68401 (cattle) glucose induc
W95642 Hs.82961 trefoil factor 3 (intestinal) 

small inducible cytokine subfamily B



Seq ID No. C132 & C347
Seq ID No. C133 & C348
Seq ID No. C134 & C349
Seq ID No. C135 & C350
Seq ID No. C136 & C351
Seq ID No. C137 & C352
Seq ID No. C138 & C353
Seq ID No. C139 & C354
Seq ID No. C140 & C355
Seq ID No. C141 & C356
Seq ID No. C142 & C357
Seq ID No. C143 & C358
Seq ID No. C144 & C359
Seq ID No. C145 & C360



Seq ID No. C146 & C361
Seq ID No. C147 & C362
Seq ID No. C148 & C363
Seq ID No. C149 & C364
Seq ID No. C150 & C365
Seq ID No. C151 & C366
Seq ID No. C152 & C367
Seq ID No. C153 & C368
Seq ID No. C154 & C369
Seq ID No. C155 & C370



Seq ID 

Seq ID No. 
Seq ID No. 
Seq ID No. 
Seq ID No. 
Seq ID No. 






413719 
431462 
415498 



421340 
428187 
428187 
422278 



BE439580 Hs.75498 small inducible cytokine subfamily A (Cy 
AW583672 Hs.256311 granin-like neuroendocrine peptide precu
U33632 Hs.79351 potassium channel, subfamily K, member 1 







421481 
444151 
426174 
410037 
425071 
421829 
418676 
419693 
419693 



448988 
448988 
430144 



AI091277 

AF017986 

AA284679 

AA316181 

N90470 

AA861697 

AW391972 

AW972917 

AA547959 

AB020725 

NM_013989



Y09763 
AI732722 
AW612232 
AF109302 



Hs.44197 
Hs.1369 
Hs.285529 
Hs.285529 
Hs.114218



Hs.72290 

Hs.108219
Hs.302634 
Hs.31386 
Hs.25540 
Hs.61635 
Hs.203697 
Hs.120591
Hs.104696
Hs.128749
Hs.115838
Hs.58009
Hs.154424



hypothetical protein DKFZp564D0462 
decay accelerating factor for complement
G protein-coupled receptor 49 



PTK7 protein tyrosine kinase 7 
leukemia inhibitory factor (cholinergic 
leucine-rich repeat-containing G protein
solute carrier family 39 (zinc transport
G protein-coupled receptor 27 (GPR27) (S 
smoothened (Drosophila) homolog 
transmembrane receptor Unc5H2 mRNA 
wingless-type MMTV integration site fami 
ATP-binding cassette, sub-family C (CFTR 
wingless-type MMTV integration site fami 
frizzled (Drosophila) homolog 8 

2(str 



d:n(LOC1 



Hs.301350 
Hs.22785 
Hs.22785 
Hs.22785 
Hs.22785 
Hs.98927 
Hs.254835 
Hs.27495 
Hs.145807 
Hs.145807
Hs.10082 
Hs.334514 



KIAA1324 protein 
alpha-methylacyl-CoA
Homo sapiens similar
KIAA0918 protein
deiodinase, iodothyronine, type II
calcium/calmodulin-dependent protein kin 
Epithelial calcium channel 2, CaT-likeA 
FXYD domain-containing ion transport reg 
FXYD domain-containing ion transport reg 
gamma-aminobutyric acid (GABA) A recepto 
gamma-aminobutyric acid (GABA) A recepto 
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto 
ERGL protein; ERGIC-53-like protein
ESTs 

prostate cancer associated protein 7 
hypothetical protein FLJ13593
hypothetical protein FLJ13593
potassium intermediate/small conductance 




409041 AE 
TABLE 79B 



Pkey CAT Number Accession 

414991 1785136J D78831 C17898 D78863 

TABLE 79C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Seq ID No. C156 & C371
Seq ID No. C157 & C372
Seq ID No. C158 & C373
Seq ID No. C159 & C374
Seq ID No. C160 & C375
Seq ID No. C161 & C376
Seq ID No. C162 & C377
Seq ID No. C163 & C378
Seq ID No. C164 & C379
Seq ID No. C165 & C380
Seq ID No. C166 & C381
Seq ID No. C167 & C382
Seq ID No. C168 & C383
Seq ID No. C169 & C384
Seq ID No. C170 & C385
Seq ID No. C171 & C386
Seq ID No. C172 & C387
Seq ID No. C173 & C388
Seq ID No. C174 & C389
Seq ID No. C175 & C390
Seq ID No. C176 & C391
Seq ID No. C177 & C392
Seq ID No. C178 & C393
Seq ID No. C179 & C394
Seq ID No. C180 & C395
Seq ID No. C181 & C396
Seq ID No. C182 & C397
Seq ID No. C183 & C398
Seq ID No. C184 & C399
Seq ID No. C185 & C400
Seq ID No. C186 & C401
Seq ID No. C187 & C402
Seq ID No. C188 & C403
Seq ID No. C189 & C404
Seq ID No. C190 & C405
Seq ID No. C191 & C406
Seq ID No. C192 & C407
Seq ID No. C193 & C408
Seq ID No. C194 & C409
Seq ID No. C195 & C410
Seq ID No. C196 & C411
Seq ID No. C197 & C412
Seq ID No. C198 & C413
Seq ID No. C199 & C414
Seq ID No. C200 & C415
Seq ID No. C201 & C416
Seq ID No. C202 & C417
Seq ID No. C203 & C418
Seq ID No. C204 & C419
Seq ID No. C205 & C420
Seq ID No. C206 & C421
Seq ID No. C207 & C422
Seq ID No. C208 & C423
Seq ID No. C209 & C424
Seq ID No. C210 & C425
Seq ID No. C211 & C426
Seq ID No. C212 & C427
Seq ID No. C213 & C428
Seq ID No. C214 & C429
Seq ID No. C215 & C430
Seq ID No. C216 & C431
Seq ID No. C432 & C433
Seq ID No. C434 & C435



Pkey Ref Strand Nt_position

404682 9797231 Minus 40977-41150
404287 2326514 Plus 53134-53281
404287 2326514 Plus 53134-53281
404287 2326514 Plus 53134-53281
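
As an illustration of how the Table 79C columns relate to one another, the following minimal Python sketch parses rows made up of a Pkey, a Ref (GI number), a Strand, and an Nt_position range, and computes the length of each predicted exon. The names `ExonRow` and `parse_exon_row` are hypothetical and do not appear in the specification. The sketch assumes whitespace-separated fields and that the Nt_position range is inclusive at both ends.

```python
from typing import NamedTuple


class ExonRow(NamedTuple):
    """One Table 79C row (hypothetical container, for illustration only)."""
    pkey: int      # Eos probeset identifier
    ref: str       # sequence source, e.g. a Genbank GI number
    strand: str    # "Plus" or "Minus"
    start: int     # first nucleotide of the predicted exon
    end: int       # last nucleotide of the predicted exon

    @property
    def length(self) -> int:
        # Assumes the range is inclusive at both ends.
        return self.end - self.start + 1


def parse_exon_row(row: str) -> ExonRow:
    """Parse 'Pkey Ref Strand Start-End' into an ExonRow."""
    pkey, ref, strand, span = row.split()
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand value: {strand!r}")
    start, end = (int(part) for part in span.split("-"))
    if start > end:
        raise ValueError(f"start exceeds end in range: {span!r}")
    return ExonRow(int(pkey), ref, strand, start, end)


rows = [
    "404682 9797231 Minus 40977-41150",
    "404287 2326514 Plus 53134-53281",
]
exons = [parse_exon_row(r) for r in rows]
for exon in exons:
    print(exon.pkey, exon.strand, exon.length)
```

Under the inclusive-range assumption, the two example rows describe exons of 174 and 148 nucleotides respectively.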
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Coding sequence: 3 



AGTAGGTGAC ACATGAGCCC AGCCCCAGCT CACCTGCCAA TCCAGCTGAG GAGCTCACCT 
GCCAATCCAG CTGAGGCTGG GCAGAGGTGG GTGAGAAGAG GGAAAATTGC AGGGACCTCC 
AGTTGGGCCA GGCCAGAAGC TGCTGTAGCT TTAACCAGAC AGCTCAGACC TGTCTGGAGG 
CTGCCAGTGA CAGGTTAGGT TTAGGGCAGA GAAGAAGCAA GACCATGGTG GGGAAGATGT 
GGCCTGTGTT GTGGACACTC TGTGCAGTCA GGGTGACCGT CGATGCCATC TCTGTGGAAA 
CTCCGCAGGA CGTTCTTCGG GCTTCGCAGG GAAAGAGTGT CACCCTGCCC TGCACCTACC 
ACACTTCCAC CTCCAGTCGA GAGGGACTTA TTCAATGGGA IAAGCTCCTC CTCACTCATA 
CGGAAAGGGT GGTCATCTGG CCGTTTTCAA ACAAAAACTA CATCCATGGT GAGCTTTATA 
AGAATCGCGT CAGCATATCC AACAATGCTG AGCAGTCCGA TGCCTCCATC ACCATTGATC 
AGCTGACCAT GGCTGACAAC GGCACCTACG AGTGTTCTGT C 
AGGGCAACAC CAAGTCACGT GTCCGCCTGT T 

GCGGCATCGA GGGAGAGACC ATAATTGGGA ACAACATCCA GCTGACCTGC CAATCAAAGG 
AGGGCTCACC AACCCCTCAG TACAGCTGGA AGAGGTACAA CATGCTGAAT CAGGAGCAGC 
CCCTGGCCCA GCCAGCCTCA GGTCAGCCTG TCTCCCTGAA GAATATCTCC ACAGACACAT 
CGGGTTACTA CATCTGTACC TCCAGCAATG AGGAGGGGAC GCAGTTCTGC AACATCACGG 
TGGCCGTCAG ATCTCCCTCC ATGAACGTGG CCCTGTATGT GGGCATCGCG G 
TTGCAGCCCT CATTATCATT GGCATCATCA TCTACTGCTG C 

ACAAGGAG GATGCAAGGC CGAACCGGGA AGCCTATGAG GAGCCACCAG 
&ACTTTCC AGAGAGAGGG AGGAGGAGGA TGACTACAGG CAAGAAGAGC 
AGAGGAGCAC TGGGCGTGAA TCCCCGGACC ACCTCGACCA G 
GGCGGCGGAG GAAGGGTTAG GGGTTCATTC T 
AAGCCCTGTT CTCCTGTCCC TCCATCCCAG ACATTGATGG GGACATTTCT TCCCCAGTGT 1440 
CAGCTGTGGG GAACATGGCT GGCCTGGTAA GGGGGTCCCT GTGCTGATCC TGCTGACCTC 1500 
ACTGTCCTGT GAAGTAACCC CTCCTGGCTG TGACACCTGG TGCGGGCCTG GCCCTCACTC 1560 
AAGACCAGGC TGCAGCCTCC ACTTCCCTCG TAGTTGGCAG GAGCTCCTGG AAGCACAGCG 1620
CTGAGCATGG GGCGCTCCCA CTCAGAACTC TCCAGGGAGG CGATGCCAGC CTTGGGGGGT 1680 
GGGGGCTGTC CTGCTCACCT GTGTGCCCAG CACCTGGAG- / ~l - A n L74 0 

CA< I KACAC ATCTTTCTTG AATGAATGAA AGAATAAGTG AGTATGCTTG GGCCCTGCAT 1800 
TGGCCTGGCC TCCACCTCCC ACTCCCTTTC CAACCTCACT TCCCGTAGCT GCCAGTATGT 1860 
TCCAAACCCT CCTGGGAAGG CCACCTCCCA CTCCTGCTGC ACAGGCCCTG GGGAGCTTTT 1920
GCCCACACAC TTTCCATCTC TGCCTGTCAA TATCGTACCT GTCCCTCCAG GCCCATCTCA 1980
AATCACAAGG ATTTCTCTAA CCCTATCCTA ATTGTCCACA TACGTGGAAA CAATCCTGTT 2040 
ACTCTGTCCC ACGTCCAATC ATGGGCCACA AGGCACAGTC TTCTGAGCGA GTGCTCTCAC 2100 
TGTATTAGAG CGCCAGCTCC TTGGGGCAGG GCCTGGGCCT CATGGCTTTT GCTTTCCCTG 2160 
AAGCCCTAGT AGCTGGCGCC CATCCTAGTG GGCACTTAAG CTTAATTGGG GAAACTGCTT 2220 
TGATTGGTTG TGCCTTCCCT TCTCTGGTCT CCTTGAGATG ATCGTAGACA CAGGGATGAT 2280
TCCCACCCAA ACCCACGTAT TCATTCAGTG AGTTAAACAC GAATTGATTT AAAGTGAACA 2340 
CACACAAGGG AGCTTGCTTG CAGATGGTCT GAGTTCTTGT GTCCTGGTAA TTCCTCTCCA 2400 
GGCCAGAATA ATTGGCATGT CTCCTCAACC CACATGGGGT TCCTGGTTGT TCCTGCATCC 2460
CGATACCTCA GCCCTGGCCC TGCCCAGCCC ATTTGGGCTC TGGTTTTCTG GTGGGGCTGT 2520
CCTGCTGCCC TCCCACAGCC TCCTTCTGTT TGTCGAGCAT TTCTTCTACT CTTGAGAGCT 2580
CAGGCAGCGT TAGGGCTGCT TAGGTCTCAT GGACCAGTGG CTGGTCTCAC CCAACTGCAG 2640
TTTACTATTG CTATCTTTTC TGGATGATCA GAAAAATAAT TCCATAAATC TATTGTCTAC 2700 
TTGCGATTTT TTAAAAAATG TATATTTTTA TATATATTGT TAAATCCTTT GCTTCATTCC 2760 
AAATGCTTTC AGTAATAATA AAATTGTGGG TGG 2793 
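
The sequence listings in this section print nucleotides in ten-base blocks, up to sixty bases per line, followed by a running count of the bases printed so far. The following minimal Python sketch shows one way such lines could be joined into a continuous string while checking each running count. The function name `read_listing` and its `start` parameter are hypothetical, and the sketch assumes that every line passed in ends with its running count.

```python
def read_listing(lines, start=0):
    """Join listing lines into one sequence, validating the running counts.

    `start` is the number of bases that precede the first line given, so a
    fragment from the middle of a listing can be checked on its own.
    """
    total = start
    pieces = []
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            raise ValueError(f"line has no sequence or no count: {line!r}")
        *blocks, count = fields
        bases = "".join(blocks).upper()
        if set(bases) - set("ACGTN"):
            raise ValueError(f"unexpected character in line: {line!r}")
        total += len(bases)
        if total != int(count):
            raise ValueError(
                f"running count {count} does not match {total} bases"
            )
        pieces.append(bases)
    return "".join(pieces)


# The last two lines of the Seq ID NO: C1 listing above, which follow base 2700.
tail = read_listing(
    [
        "TTGCGATTTT TTAAAAAATG TATATTTTTA TATATATTGT TAAATCCTTT GCTTCATTCC 2760",
        "AAATGCTTTC AGTAATAATA AAATTGTGGG TGG 2793",
    ],
    start=2700,
)
print(len(tail))
```

A check of this kind flags lines in which a block has lost or gained a base, because the running count will then disagree with the number of bases actually read.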

Seq ID NO: C2 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3150 

1 11 21 31 

I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC A 
CGCCGACCCC CGCTSSTGCC GCTGCTGTTG C 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG C 
GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG T 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 
"l GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 
: CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 
TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC T
AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA C 
GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTICAG CTGGGCAGCA 
GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 
" C CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 
WTACCC CGAGTACCTG AT CAACCTGG TTCAGGGGCA GCTGCAGACT 
CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC T 
TTCAGTGGTG A 

TCCTTAA TGGCTCAGAC ATTCGATCCC T 
T CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG G 
GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG G 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC C 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 
GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 
CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC
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CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 13 80 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 
CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC S 

3 ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT G 



GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 
CAGAATGTGG GTGAGGGTGG CGCCTATGAG G 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG C 
TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 
GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC T 



TCCTTTCGGC 

GAGGCAGTGC TATTCCCAGT 
GACCTGGGAC CTGCTGTCCA 




GTGACCAGAG TTACGGGACT C 
GAGTTGGATC CCGAGGGTTC O 
TCTGCTTCCT CGGGACCTCA G 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 

CTCAAGCCTC CAGCCACCTC TGATGCCTGA 3150 
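
Each listing carries a "Coding sequence" annotation such as 1..3150, giving the first and last nucleotide of the coding region. The following minimal Python sketch shows how such an annotation could be turned into a slice of the sequence and given a basic sanity check (length divisible by three, terminal stop codon). The function names are hypothetical. The sketch assumes 1-based inclusive coordinates and tolerates stray spaces in the annotation, as occur in extracted text.

```python
from typing import Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_cds_range(annotation: str) -> Tuple[int, int]:
    """Parse a 'start..end' annotation, ignoring stray spaces."""
    start, end = annotation.replace(" ", "").split("..")
    return int(start), int(end)


def extract_cds(sequence: str, annotation: str) -> str:
    """Return the coding region, assuming 1-based inclusive coordinates."""
    start, end = parse_cds_range(annotation)
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"range {annotation!r} lies outside the sequence")
    return sequence[start - 1:end]


def looks_like_orf(cds: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop."""
    return len(cds) % 3 == 0 and cds[-3:].upper() in STOP_CODONS


# Toy example: two flanking bases on each side of a nine-base coding region.
toy = "GG" + "ATGAAATGA" + "CC"
cds = extract_cds(toy, "3..11")
print(cds, looks_like_orf(cds))

# The annotation 1..3150 spans 3150 bases, i.e. 1050 codons.
first, last = parse_cds_range("1..3150")
print((last - first + 1) // 3)
```

For Seq ID NO: C2, the annotated range of 3150 bases is divisible by three, and the final line of the listing ends in TGA, which is consistent with a coding region that terminates at a stop codon.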

Seq ID NO: C3 DNA Sequence



| | | | | |

IACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 
CCACCCACTC TACAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 
TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 
GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCACAT 
GCTGAAACCC TGAAGGTGAT CAACCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 
GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGIA CCCTAGCTAC 
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC A 
T AACTACGATT CGGGGAGAAG T 



TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCIA TTTCTTTCAT 
Z AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 
G GAAAAATTAG 



| | | | | |

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 
CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 
TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 
GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 
GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 
GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 
T TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 
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TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TOVATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 



I I 

ATGGTCAGAA AGCCTGTTGT G 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

3 CATGTCTCTG 240 

A CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360

T CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

T CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

C CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

A AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260

1320 
1380 



A TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680

AACTCTATCT AGTTATACAA AGTGAATATC CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 
ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA T 
GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC A< 
TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT A 
AAAAATCCTT GAGAATTTAT TATGTCAGAT G 



AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC T 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 
CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC A 



GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 

CCATCTCCAA AAAAAAAAAA AAAA 3144 



TCTGGGCGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATCGAGGAT 
GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 
GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGG3GCACG 
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CTGCTGGCCC TGGAGCGGAA AGACCACTCT GGCCAGGTCT TCAGCGTGGT GTCCAATGGC 480 

AAGGCGGGCA CCCTGGACCT CAGCCTGACC GTCCAAGGAA AGCAGCACGT GGTGTCTGTG 540

GAAGAAGCTC TCCTGGCAAC CGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600

AGGGCCCAGC TGTACATCGA CTGTGAAAAG ATGGAGAATG CTGAGTTGGA CGTCCCCATC 660

CAAAGCGTCT TCACCAGAGA CCTGGCCAGC ATCGCCAGAC TCCGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTGAGGT TTGTCTTTGG AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA GTGTCCTCCT CACCCTTGAC 840 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCCGCACTA ACTACATTGG CCACAAGACA 900 

AAGGACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

CTCAGGGGCC TGCGCACCAT TGTGACCACG CTGCAGGACA GCATCCGCAA AGTGACTGAA 1020

GAGAACAAAG AGTTGGCCAA TGAGCTGAGG CGGCCTCCCC TATGCTATCA CAACGGAGTT 1080 

CAGTACAGAA ATAACGAGGA ATGGACTGTT GATAGCTGCA CTGAGTGTCA CTGTCAGAAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTGCTCCAA TGCCACAGTT 1200

CCTGATGGAG AATGCTGTCC TCGCTGTTGG CCCAGCGACT CTGCGGACGA TGGCTGGTCT 1260

CCATGGTCCG AGTGGACCTC CTGTTCTACG AGCTGTGGCA ATGGAATTCA GCAGCGCGGC 1320

CGCTCCTGCG ATAGCCTCAA CAACCGATGT GAGGGCTCCT CGGTCCAGAC ACGGACCTGC 1380

CACATTCAGG AGTGTGACAA AAGATTTAAA CAGGATGGTG GCTGGAGCCA CTGGTCCCCG 1440

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTGATCA CAAGGATCCG GCTCTGCAAC 1500 

TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG AAGCGCGGGA GACCAAAGCC 1560 

TGCAAGAAAG ACGCCTGCCC CATCAATGGA GGCTGGGGTC CTTGGTCACC ATGGGACATC 1620 

TGTTCTGTCA CCTGTGGAGG AGGGGTACAG AAACGTAGTC GTCTCTGCAA CAACCCCGCA 1680 

CCCCAGTTTG GAGGCAAGGA CTGCGTTGGT GATGTAACAG AAAACCAGAT CTGCAACAAG 1740 

CAGGACTGTC CAATTGATGG ATGCCTGTCC AATCCCTGCT TTGCCGGCGT GAAGTGTACT 1800 

G ATGGCAGCTG GAAATGTGGT GCTTGTCCCC CTGGTTACAG TGGAAATGGC 1860 

■ — -VTGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

3TGAGAA CACGGACCCC GGCTACAACT GCCTGCCCTG CCCCCCACGC 1980 

T CACAGCCCTT CGGCCAGGGT GTCGAACATG CCACGGCCAA CAAACAGGTG 2040 

C GTAACCCCTG CACGGATGGG ACCCACGACT GCAACAAGAA CGCCAAGTGC 2100 

AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGCG AGTGCAAGCC TGGCTACGCT 2160 

GGCAATGGCA TCATCTGCGG GGAGGACACA GACCTGGATG GCTGGCCCAA TGAGAACCTG 2220 

GTGTGCGTGG CCAATGCGAC TTACCACTGC AAAAAGGATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGGGCAGG AAGACTATGA CAAGGATGGA ATTGGTGATG CCTGTGATGA TGACGATGAC 2340 

AATGATAAAA TTCCAGATGA CAGGGACAAC TGTCCATTCC ATTACAACCC AGCTCAGTAT 2400 

GACTATGACA GAGATGATGT GGGAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 2460 

GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTGATGGA 2520 

GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT ACAATGTGGA CCAGAGAGAC 2580 

ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC CCTTGGAACA CAATCCGGAT 2640

CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760 

GACCATGACA AACATGGCAA GGGAGATGCC TGTGACCACG ATGATGACAA CGATGGCATT 2820 

CCTGATGACA AGGACAACTG CAGACTCGTG CCCAATCCCG ACCAGAAGGA CTCTGACGGC 2880 

GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG ACAGTGTGCC AGACATCGAT 2940 

GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT TCCGCCCATT CCAGATGATT 3000 

CCTCTCGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTCG CTGTAGGTTA TGATGAGTTT 3120 

AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG AAAGGGACGA TGACTATGCT 3180 

GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG TTGTGATGTG GAAGCAAGTC 3240 

ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG GATACTCGGG CCTTTCTGTG 3300 

AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAGCACCTGC GGAACGCCCT GTGGCACACA 3360

GGAAACACCC CTGGCCAGGT GCGCACCCTG TGGCATGACC CTCGTCACAT AGGCTGGAAA 3420 

GATTTCACCG CCTACAGATG GCGTCTCAGC CACAGGCCAA AGACGGGTTT CATTAGAGTG 3480 

GTGATGTATG AAGGGAAGAA AATCATGGCT GACTCAGGAC CCATCTATGA TAAAACCTAT 3540 

GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA TGGTGTTCTT CTCTGACCTG 3600 

AAATACGAAT GTAGAGATCC CTAATCATCA AATTGTTGAT TGAAAGACTG ATCATAAACC 3660

AATGCTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGAA AACCCCCAGG ATCACTTCTC 3720

CTTGGCTTCC TTCTTTTCTG TGCTTGCATC AGTGTGGACT CCTAGAACGT GCGACCTGCC 3780

TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT CAGCCTCCAA TGAATAAGAC 3840

ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG AAAAAGCATC TACTTGCTTC 3900

AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA GCAGGGTGCT ATTGTGAGGC 3960

CATCTCT 3967

Seq ID NO: C7 DNA Sequence 



TCCACACACA CAAAAAACCT GCGCGTGAGG G 

AATCACAACA ACTTTTGCTG CCAGGATGCC CTTGCTTTGG CTGAGAGGAT T 
AAGTTGCTGG ATTATAGTGA GGAGTTCCCC CACCCCAGGA TCCGAGGGGC ACAGCGCGGC 
CCCCGACTGT CCGTCCTGTG CGCTGGCCGC CCTCCCAAAG GATGTACCCA ACTCTCAGCC 
AGAGATGGTG GAGGCCGTCA AGAAGCACAT TTTAAACATG CTGCACTTGA AGAAGAGACC 
CGATGTCACC CAGCCGGTAC CCAAGGCGGC GCTTCTGAAC GCGATCAGAA AGCTTCATGT 
GGGCAAAGTC GGGGAGAACG GGTATGTGGA GATAGAGGAT GACATTGGAA GGAGGGCAGA 
AATGAATGAA CTTATGGAGC AGACCTCGGA GATCATCACG TTTGCCGAGT CAGGAACAGC 
CAGGAAGACG CTGCACTTCG AGATTTCCAA GGAAGGCAGT GACCTGTCAG TGGTGGAGCG 
TGCAGAAGTC TGGCTCTTCC TAAAAGTCCC CAAGGCCAAC AGGACCAGGA CCAAAGTCAC 
CATCCGCCTC TTCCAGCAGC AGAAGCACCC GCAGGGCAGC TTGGACACAG GGGAAGAGGC 
CGAGGAAGTG GGCTTAAAGG GGGAGAGGAG TGAACTGTTG CTCTCTGAAA AAGTAGTAGA 
CGCTCGGAAG AGCACCTGGC ATGTCTTCCC TGTCTCCAGC AGCATCCAGC GGTTGCTGGA 
CCAGGGCAAG AGCTCCCTGG ACGTTCGGAT TGCCTGTGAG CAGTGCCAGG AGAGTGGCGC 
CAGCTTGGTT CTCCTGGGCA AGAAGAAGAA GAAAGAAGAG GAGGGGGAAG GGAAAAAGAA 
GGGCGGAGGT GAAGGTGGGG CAGGAGCAGA TGAGGAAAAG GAGCAGTCGC ACAGACCTTT 
CCTCATGCTG CAGGCCCGGC AGTCTGAAGA C 
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GGAGTGTGAT GGCAAGGTCA ACATCTGCTG TAAGAAACAG T 

CATCGGCTGG AATGACTGGA TCATTGCTCC CTCTGGCTAT CATGCCAACT ACTGCGAGGG 1140 

TGAGTGCCCG AGCCATATAG CAGGCACGTC CGGGTCCTCA CTGTCCTTCC ACTCAACAGT 1200 

CATCAACCAC TACCGCATGC GGGGCCATAG CCCCTTTGCC AACCTCAAAT CGTGCTGTGT 1260

GCCCACCAAG CTGAGACCCA TGTCCATGTT GTACTATGAT GATGGTCAAA ACATCATCAA 1320 

AAAGGACATT CAGAACATGA TCGTGGAGGA GTGTGGGTGC TCATAGAGTT GCCCAGCCCA 1380 

GGGGGAAAGG GAGCAAGAGT TGTCCAGAGA AGACAGTGGC AAAATGAAGA AATTTTTAAG 1440

GTTTCTGAGT TAACCAGAAA AATAGAAATT AAAAACAAAA CAAAACAAAA AAAAAAACAA 1500 

AAAAAAACAA AAGTAAATTA AAAACAAACC TGATGAAACA GATGAAACAG ATGAAGGAAG 1560

ATGTGGAAAT CTTAGCCTGC CTTAGCCAGG GCTCAGAGAT GAAGCAGTGA AGAGACAGAT 1620 

TGGGAGGGAA AGGGAGAATG GTGTACCCTT TATTTCTTCT GAAATCACAC TGATGACATC 1680 

AGTTGTTTAA ACGGGGTATT GTCCTTTCCC CCCTTGAGGT TCCCTTGTGA GCTTGAATCA 1740 

ACCAATCTGA TCTGCAGTAG TGTGGACTAG AACAACCCAA ATAGCATCTA GAAAGCCATG 1800 

AGTTTGAAAG GGCCCATCAC AGGCACTTTC CTAGCCTAAT 1840



I I I I I 

G CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC 1TCTGCTCAC 60 

CCTGGCTGCC CICGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGC1G CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGOGGCG GCCGCTGCGG 360 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020

GAACCCACAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 

AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 2439



Seq ID NO: C9 DNA Sequence 

Nucleic Acid Accession #: XM_057014 

Coding sequence: 143..874

i i 1 T r t 1 T 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAAIGCATG 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 
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CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTT1TAGT 
CTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 
TTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 
I GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 



I I I I I I 

ATCCCTGACT CGGGGTCGCC TTTGGAGCAG AGAGGAGGCA ATGGCCACCA TGGAGAACAA 60

GACAGAGACG TGTACAGTGG CCCCCCGTGA AAGACAGAAT TGTGGTTTTC CTGGTGTCAC 180 

GCCCTCCCAG TGTGCAAATA AGGGCTGCTG TTTCGACGAC ACCGTTCGTG GGGTCCCCTG 240 

GTGCTTCTAT CCTAATACCA TCGACGTCCC TCCAGAAGAG GAGTGTGAAT TTTAGACACT 300 

TCTGCAGGGA TCTGCCTGCA TCCTGACGGG GTGCCGTCCC CAGCACGGTG ATTAGTCCCA 360 

GAGCTCGGCT GCCACCTCCA CCGGACACCT CAGACACGCT TCTGCAGCTG TGCCTCGGCT 420 

CACAACACAG ATTGACTGCT CTGACTTTGA CTACTCAAAA TTGGCCTAAA AATTAAAAGA 480 

GATCGATATT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 540 

Seq ID NO: C11 DNA Sequence

Coding sequence: 1..8487 



ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 
CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 
TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 
AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 
TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 
CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCIGCACA TTGACCACAA CAAGATCGAG 
TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 
AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 
AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 
TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 
TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 
AACTTCTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 
GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 
AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 
GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 
GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 
AIAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA
GTGAGAGCCC AGATICTIGC AGAACCAGAA TGGGTCATGC Al 
CTGAACCGAC GTCAGAGIAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 



CTGTGCAAAG AGATCAGACT GTCCTGGAAG G 



CTGAAAGCGC CCATGGATGA CCCAGACAGC 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC 



A CAGTGACAAT TGGCAAGAAC CCAGGGGAGT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 




C TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC C 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

CACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 

3 GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060
TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACGATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600

GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260

GAGTTTTTGT CCTCTTTGAC AGTCTCCAGA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040

CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTC 5340

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400

ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580

CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700

AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760

CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 

GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 

TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360

GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 

GGAGGAACCC TCAAGCTGGA CTGCAGCGGC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 

TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080

GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 

AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 

CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 

GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGC-GT CCTTCCCAAT 7740 

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATG AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 
GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTCG

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 
AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA G 
ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA T 
GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGT1 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 

TTTCTTCTTT TGCAAATGCC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 
TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAG2 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAG 
CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT T 
TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAAC1 

T TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC T 

A ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 
AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 9645 



Coding sequence: none 

1 11 21 31 41 51 

| | | | | |

TATCATC r GTG I -AAGGT GGGTGTGGTG AGAAAAGTTT TAAGGCAAGA GTAGATGGCC 
ATGTTCAACT TTACAAAATT TCTTGGAAAA CTGGCAGTAT TTTGAACTGC ATCTTCTTTG 
GTACCGGAAC CTGCAGAAAC AGTGTGAGAA ATTAAGTCCT GGTTCACTGC GCAGTAGCAA 
AGATGGTCAA GGCCATGGAA AAAGCAGAAA TTTACCAAGA AAGCTGATAC CCATGTATAG 
TTCCCACTCA TCTCAAATAC ATCTGCTATC TTTTTAAGCT AAGTCCTAGA CATATCGGGG 
ATAACATGGG GGTTGATTAG TGACCACAGT TATCAGAAGC AGAGAAATGT AATTCCATAT 
TTTATTTGAA ACTTATTCCA TATTTTAATT GGATATTGAG TGATTGGGTT ATCAAACACC 
CACAAACTTT AATTTTGTTA AATTTATATG GCTTTGAAAT AGAAGTATAA GTTGCTACCA 
TTTTTTGATA ACATTGAAAG ATAGTATTTT ACCATCTTTA ATCATCTTGG AAAATACAAG 
TCCTGTGAAC AACCACTCTT TCACCTAGCA GCATGAGGCC AAAAGTAAAG GCTTTAAATT 
ATAACATATG GGATTCTTAG TAGTATGTTT TTTTCTTGAA ACTCAGTGGC TCTATCTAAC 
CTTACTATCT CCTCACTCTT TCTCTAAGAC TAAACTCTAG GCTCTTAAAA ATCTGCCCAC 

TTTTATGGAC CAAATTGACA TTTTCGACTA TTTTTTCCAA AAAAGTCAGG TGAATTTCAG 
CACACTGAGT TGGGAATTTC TTATCCCAGA AGACCAACCA ATTTCATATT TATTTAAGAT 
TGATTCCATA CTCCGTTTTC AAGGAGAATC CCTGCAGTCT CCTTAAAGGT AGAACAAATA 
CTTTCTATTT TTTTTTTCAC CATTGTGGGA TTGGACTTTA AGAGGTGACT CTAAAAAAAC 
AGAGAACAAA TATGTCTCAG TTGTATTAAG CACGGACCCA TATTATCATA TTCACTTAAA 
AAAATGATTT CCTGTGCACC TTTTGGCAAC TTCTCTTTTC AATGTAGGGA AAAACTTAGT
CACCCTGAAA ACCCACAAAA TAAATAAAAC TTGTAGATGT GGGCAGAAGG TTTGGGGGTG 
GACATTGTAT GTGTTTAAAT TAAACCCTGT ATCACTGAGA AGCTGTTGTA TGGGTCAGAG
AAAATGAATG CTTAGAAGCT GTTCACATCT TCAAGAGCAG AAGCAAACCA CATGTCTCAG 
CTATATTATT ATTTATTTTT TATGCATAAA GTGAATCATT TGTTCTGTAT TAATTTCCAA
AGGGTTTTAC CCTCTATTTA AATGCTTTGA AAAACAGTGC ATTGACAATG GGTTGATATT 
TTTCTTTAAA AGAAAAATAT AATTATGAAA GCCAAGATAA TCTGAAGCCT GTTTTATTTT
AAAACTTTTT ATGTTCTGTG GTTGATGTTG TTTGTTTGTT TGTTTCTATT TTGTTGGTTT 
TTTACTTTGT TTTTTGTTTT GTTTTGTTTT GTTTTGCATA CTACATGCAG TTCTTTAACC 
AATGTCTGTT TGGCTAATGT AATTAAAGTT GTTAATTTAT ATGAGTGCAT TTCAACTATG 
TCAATGGTTT CTTAATATTT ATTGTGTAGA AGTACTGGTA ATTTTTT 
TTTAAAGAGA TAACAGTTTG ATATGTTTTC ATGTGTTTAT AGCAGAB 
ATGGCATTCC AGCGGATATT TTGGTGTTTG CGAGGCATGC AGTCAATATT TTGTACAGTT 
AGTGGACAGT ATTCAGCAAC GCCTGATAGC TTCTTTGGCC TTATGTTAAA TAAAAAGACC 
TGTTTGGGAT GT 



Coding sequence: 1..5001

1 11 21 31 41 51 

| | | | | |

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG
ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 
CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 
TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG
CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 
GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 
ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 
AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGC1ACCAG AGACTGAGGG GAAAGTGAAA 
" C TGCCAAATCA 
tttcagaata cattctttca tacgccccgg ctctcaaacc atttggagca aagtccctca
cctatcctgg agacactact tctgccctgg tggatggtct gcagcctggg gaacgctatc 
a aacaggagag gcctgggacc tcactccaaa gccttcattg 
:tgtacc cagaaggatt tcagttgtct agcttacctg 
atcgatatcc aaaccaaaca agttaataaa gatccacaac tggaagggag tgtttttgga 
CCATGTTTTC ttttctactt CCTCACATTT ATGCTGGATA ttggcggctt ttccttcatt 
ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 
GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 

2 CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 
5GAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 

3 GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 



TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGGACCCCG GCGCAAAGCC AGCCTCGGCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 

GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCGCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GCGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 
CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG G 
GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA G 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 
TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC C 
AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG T. 

1ACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 
C TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGCTACCT C 



GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG G 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 
TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 
CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC A 
ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTC 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 
GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 
TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA G 
:gagggt GATCCCAGAG GAAGGCGCCA Ti 

C TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

A TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

G TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TCAGAACCTA 4800

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920

CTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980

G GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 
CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA
AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA
TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 
TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 
ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 
TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 
TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 
AAAAAAA 

Seq ID NO: C14 DNA Sequence 
Nucleic Acid Accession #: NM_003014
Coding sequence: 238..1278

i i 1 r r r i 1 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 
GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 
AGCCTGGCCT G 



ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720 

CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780 

AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 

GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020 

CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 

AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140

AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320 

GATGAGGCTG GGCATTGCCT GGGACAGCCT ATCTAAGGCC ATGTGCCCCT TGCCCTAACA 1380 

ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440 

GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 

GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560 

CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 1620 

AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 1680 

TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740 



TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA A
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTC
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAAO 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATAC- - 
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC A 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAC 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 

Seq ID NO: C15 DNA Sequence



I I I 



CGCCCTCCTG CCCCCGATGC T 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC C 
AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 
CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 
CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 
GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCACAGGC
CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC
TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 
TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 
CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 
AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 
GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 
TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTCCGCCC 
CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 
3 CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAARGC 960

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

3 CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140

C TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500 

C CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

3 CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800

~~ G GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860

Z CCATGGCCTT CAGCCCTGGC 1920 

C AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 

TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 2260 

Seq ID NO: C16 DNA Sequence
Nucleic Acid Accession #: NM_024022
Coding sequence: 202..1563

1 11 21 31 41 51 

| | | | | |

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 
TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 
GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 
ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 
TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480
GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT G 
AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA C 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 
GATAACCTCA GAGTGAGCTC GCTGGAGGGG CACTTCCGGG AGGAGTTTGT GTCCATCGAT 
CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 
TGTGCCTCTG GGCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 
AGCTCACGCA TCGTGGGTGG AAACATGTCC TTCCTCTCCC AGTGGCCCTG GCAGGCCAGC 
CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 
ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT
CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC
AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 
CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC
GATGGAAAAG TGTGCTGGAC GTGAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 
CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 
GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG
GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 
TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 
A CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC
A GTTCCTGAGG TGATGAAGAC AGCCCGATCC 
C TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 
G AAAGAGGCAC CCTTCCATCT GATTCCAGCA C 

GCAGTGGCGA A 
CCTCAGCTTC CCCAGTAGCT G 
TATTTTTAGT AGAGACAGGG TTTCACCATG T 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 
CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 
GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG
AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGC& 
CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA
AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 
ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 
TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 



| | | | | |

GAATTCCGGC TCTCTGGGTG AGAGACCGAG AGGGGCATAT CCGTTCACGC CGATCCATGA
AAATGCTTTG GAAATTGACG GATAATATCA AGTACGAGGA CTGCGAGGAC CGTCACGACG 
GCACCAGCAA CGGGACGGCA CGGTTGCCCC AGCTGGGCAC TGTAGGTCAA TCTCCCTACA 
CGAGCGCCCC GCCGCTGTCC CACACCCCCA ATGCCGACTT COAGCCCCCA TACTTCCCCC 
C CCCCAGTCGC AAGATCCTTA CTCCCACGTC AACGACCCCT 
A CCCCCTGCAC GCCCAGCCGC AGCCGCAGCA CCCAGGCTGG CCCGGCCAGA

GGCAGAGCCA GGAGTCTGGG CTCCTGCACA CGCACCGGGG GCTGCCTCAC CACCTGTCGG

GCCTGGATCC TCGCAGGGAC TACAGGCGGC ACGAGGACCT CCTGCACGGC CCACACGCGC 

G ACTCGGAGAC CTCTCGATCC ACTCCTTACC TCACGCCATC GAGGAGGTCC 




TAGCCAGGGA CTTTGGGTAC GTGTGCGAAA CCGAATTTCC T 

3 ACAACATTCC GATCCCAATG AGCAAGTGAC AAGAAAAAAC ATGCTCCTGG 
A GATATGCAAA GAGTTCACCG ACCTGCTGGC TCAGGACCGA TCTCCCCTGG 
3 GCCCAACCCC ATCCTGGAGC CCGGCATCCA GAGCTGCTTG ACCCACTTCA 
ACCTCATCTC CCACGGCTTC GGCAGCCCCG CGGTGTGTGC CGCGGTCACG GCCCTGCAGA 



GCCACACGGA CAACAACGCC A 

CTCCTCCCGC CCCGCCCCTC CCACGCCTCA CCAGCCCCCC GCGCGCCCAC C 
GTGACAGCTC CGGGATCAGC AACCCTTCCT GCTGCTGCTA C 
GCCGCCGCCG CCGCTGCCCT TGGGTCCCCC CGAGTCTCCG GGACTGCCCT CTCGACTGTC
AGTGGGGCAG CCTCTCCGAC TCTGCACCCG CCTCGACCTC CCCACCCGCT CCCACACCCC 
TGTGCCCCCG GAATTC 

Seq ID NO: C18 DNA Sequence 
Nucleic Acid Accession #: NM_002988 
Coding sequence: 71..340

1 11 21 31 41 51 

| | | | | |

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGOCTCG1CT ATACCTCCTG 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 
CCACCCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 
TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG
CAGACATTCT GCCATATGTA TCAAATCACA AATCTTTATT GAATGGTTTT GCTCAGCACC 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 



Seq ID NO: C19 DNA Sequence 
Coding sequence: 121..2619



TATGGCCAAG AGGGGAAGTT TAGTGGACCC CTGAAACCCA TGACATTTTC TATTTATGAA 
GGCCAAGAAC CGAGTCAAAT TATATTCCAG TTTAAGGCCA ATCCTCCTGC TGTGACTTTT
GAACTAACTG GGGAGACAGA CAACATATTT GTGATAGAAC GGGAGGGACT TCTGTATTAC 
AACAGAGCCT TGGACAGGGA AACAAGATCT ACTCACAATC TCCAGGTTGC AGCCCTGGAC 
GCTAATGGAA TTATAGTGGA GGGTCCAGTC CCTATCACCA TAGAAGTGAA GGACATCAAC 
GACAATCGAC CCACGTTTCT CCAGTCAAAG TACGAAGGCT CAGTAAGGCA GAACTCTCGC 
CCAGGAAAGC CCTTCTTGTA TGTCAATGCC ACAGACCTGG ATGATCCGGC C
GGCCAGCTTT ATTACCAGAT TGTCATCCAG CTTCCCATGA TCAACAATGT CATGT 
\ ACAAAACGGG AGCCATCTCT CTTACCCGAG A 



GAGAATTCCT TCAGTGATAC CACATCTGTG GATATCATAG TGACAGAGAA TATTTGGAAA 
GCACCAAAAC CTGTGGAGAT GGTGGAAAAC TCAACTGATC CTCACCCCAT CAAAATCACT 
CAGGTGCGGT GGAATGATCC CGGTGCACAA TATTCCTTAG TTGACAAAGA GAAGCTGCCA 
AGATTCCCAT TTTCAATTGA CCAGGAAGGA GATATTTACG TGACTCAGCC CTTGGACCGA 
GAAGAAAAGG ATGCATATGT TTTTTATGCA GTTGCAAAGG ATGAGTACGG AAAACCACTT 
TCATATCCGC TGGAAATTCA TGTAAAAGTT AAAGATATTA ATGATAATCC ACCTACATGT 
CCGTCACCAG TAACCGTATT TGAGGTCCAG GAGAATGAAC GACTGGGTAA CAGTATCGGG 
ACCCTTACTG CACATGACAG GGATGAAGAA AATACTGCCA ACAGTTTTCT AAACTACAGG
ATTGTGGAGC AAACTCCCAA ACTTCCCATG GATGGACTCT TCCTAATCCA AACCTATGCT 
GGAATGTTAC AGTTAGCTAA ACAGTCCTTG AAGAAGCAAG ATACTCCTCA GTACAACTTA 
ACGATAGAGG TGTCTGACAA AGATTTCAAG ACCCTTTGTT TTGTGCAAAT CAACGTTATT 
GATATCAATG ATCAGATCCC CATCTTTGAA AAATCAGATT ATGGAAACCT GACTCTTGCT
GAAGACACAA ACATTGGGTC CACCATCTTA ACCATCCAGG CCACTGATGC TGATGAGCCA 
TTTACTGGGA GTTCTAAAAT TCTGTATCAT ATCATAAAGG GAGACAGTGA GGGACGCCTG 
GGGGTTGACA CAGATCCCCA TACCAACACC GGATATGTCA TAATTAAAAA GCCTCTTGAT 
TTTGAAACAG CAGCTGTTTC CAACATTGTG TTCAAAGCAG AAAATCCTGA GCCTCTAGTG
TTTGGTGTGA AGTACAATGC AAGTTCTTTT GCCAAGTTCA CGCTTATTGT GACAGATGTG 
AATGAAGCAC CTCAATTTTC CCAACACGTA TTCCAAGCGA AAGTCAGTGA GGATGTAGCT 
ATAGGCACTA AAGTGGGCAA TGTGACTGCC AAGGATCCAG AAGGTCTGGA CATAAGCTAT 
TCACTGAGGG GAGACACAAG AGGTTGGCTT AAAATTGACC ACGTGACTGG TGAGATCTTT 
AGTGTGGCTC CATTGGACAG AGAAGCCGGA AGTCCATATC GGGTACAAGT GGTGGCCACA
GAAGTAGGGG GGTCTTCCTT GAGCTCTGTG TCAGAGTTCC ACCTGATCCT TATGGATGTG 
AATGACAACC CTCCCAGGCT AGCCAAGGAC TACACGGGCT TGTTCTTCTG CCATCCCCTC 
AGTGCACCTG GAAGTCTCAT TTTCGAGGCT ACTGATGATG ATCAGCACTT ATTTCGGGGT
CCCCATTTTA CATTTTCCCT CGGCAGTGGA AGCTTACAAA ACGACTGGGA AGTTTCCAAA 
ATCAATGGTA CTCATGCCCG ACTGTCTACC AGGCACACAG AGTTTGAGGA GAGGGAGTAT
GTCGTCTTGA TCCGCATCAA TGATGGGGGT CGGCCACCCT TGGAAGGCAT TGTTTCTTTA 
G TGTGGAAGGA AGTTGTTTCC GGCCAGCAGG TCACCAGACT 



3 CAGTTGTGTT TATCCGCATA AAGAAGGATA AAGGCAAAGA TAATGTTGAA 
AGTGCTCAAG CATCTGAAGT CAAACCTCTG AGAAGCTGAA TTTGAAAAGG AATGTTTGAA
TTTATATAGC AAGTGCTATT TCAGCAACAA CCATCTCATC CTATTACTTT TCATCTAACG 
TGCATTATAA TTTTTTAAAC AGATATTCCC TCTTGTCCTT TAATATTTGC TAAATATTTC 
TTTTTTGAGG TGGAGTCTTG CTCTGTCGCC CAGGCTGGAG TACAGTGGTG T 



TGGGTTTACA GGCACCCACC ACCATGCCCA GCTAATTTTT GTATTTTTAA TAGAGACGGG 2940 

GTTTCGCCAT TTGGCCAGGC TGGTCTTGAA CTCCTGACGT CAAGTGATCT GCCTGCCTTG 3000 

ACATTAGAGA GATTTTTCAT TTTTCCATGA GATTTTTCCT CTCTGCAAAT GGCTTAGCTA 3120 

CTTGTGTTTT TCCCTTTTGG GGCAAGACAG ACTCATTAAA TATTCTGTAC ATTTTTTCTT 3180 

TATCAAGGAG ATATATCAGT GTTGTCTCAT AGAACTGCCT GGATTCCATT TATGTTTTTT 3240 

CTGATTCCAT CCTGTGTCCC CTTCATCCTT GACTCCTTTG GTATTTCACT GAATTTCAAA 3300 

CATTTGTCAG AGAAGAAAAA CGTGAGGACT GAGGAAAAAT AAATAAATAA AAGAACAGCC 3360 

TTTTCCCTTA GTATTAACAG AAATGTTTCT GTGTCATTAA CCATCTTTAA TCAATGTGAC 3420 

ATGTTGCTCT TTGGCTGAAA TTCTTCAACT TGGAAATGAC ACAGACCCAC AGAAGGTGTT 3480 

CAAACACAAC CTACTCTGCA AACCTTGGTA AAGGAACCAG TCAGCTGGCC AGATTTCCTC 3540 

ACTACCTGCC ATGCATACAT GCTGCGGATG TTTTCTTCAT TCGTATGTTA GTAAAGTTTT 3600

GGTTATTATA TATTTAACAT GTGGAAGAAA ACAAGACATG AAAAGAGTGG TGACAAATCA 3660 

AGAATAAACA CTGGTTGTAG TCAGTTTTGT TTGTTAA 3697 



GCCGG GGCTTCTGCC GCTGCTCCCT CCGCTGCTGC TGCTGCCGCT CCTGCTGCTC 
CCCGCCGCCT GCCGGGCGCT GGAAGAGACC CTCATGGACA CAAAATGGGT AACATCTGAG
TTGGCGTGGA CATCTCATCC AGAAAGTGGG TGGGAAGAGG TGAGTGGCTA CGATGAGGCC 
ATGAATCCCA TCCGCACATA CCAGGTGTGT AATGTGCGCG AGTCAAGCCA GAACAACTGG 
CTTCGCACGG GGTTCATCTG GCGGCGGGAT GTGCAGCGGG TCTACGTGGA GCTCAAGTTC 
ACTGTGCGTG ACTGCAACAG CATCCCCAAC ATCCCCGGCT CCTGCAAGGA GACCTTCAAC 
CTCTTCTACT ACGAGGCTGA CAGCGATGTG GCCTCAGCCT CCTCCCCCTT CTGGATGGAG 
AACCCCTACG TGAAAGTGGA CACCATTGCA CCCGATGAGA GCTTCTCGCG GCTGGATGCC 
GGCCGTGTCA ACACCAAGGI GCGCAGCTTT GGGCCACTTT CCAAGGCTGG CTTCTACCTG 
GCCTTCCAGG ACCAGGGCGC CTGCATGTCG CTCATCTCCG TGCGCGCCTT CTACAAGAAG 
TGTGCATCCA CCACCGCAGG CTTCGCACTC TTCCCCGAGA CCCTCACTGG GGCGGAGCCC 
ACCTCGCTGG TCATTGCTCC TGGCACCTGC ATCCCTAACG C 
CTCAAGCICT ACTGCAACGG CGATGGGGAG T 
GCCACCGGCC ATGAGCCAGC TGCCAAGGAG T 
TACAAGGCGA AGCAGGGAGA GGGGCCCTGC CTCCCATGTC CCCCCAACAG CCGTACCACC 
TCCCCAGCCG CCAGCATCTG CACCTGCCAC AATAACTTCT ACCGTGCAGA CTCGGACTCT 
GCGGACAGTG CCTGTACCAC CGTGCCATCT CCACCCCGAG GTGTGATCTC CAATGTGAAT 
GAAACCTCAC TGATCCTCGA GTGGAGTGAG CCCCGGGACC TGGGTGGCCG GGATGACCTC 
CTGTACAATG TCATCTGCAA GAAGTGCCAT GGGGCTGGAG GGGCCTCAGC CTGCTCACGC 
TGTGATGACA ACGTGGAGTT TGTGCCTCGG CAGCTGGGCC TGACGGAGCG CCGGGTCCAC 
ATCAGCCATC TGCTGGCCCA CACGCGCTAC ACCTTTGAGG TGCAGGCGGT CAACGGTGTC 
TCGGGCAAGA GCCCTCTGCC GCCTCGTTAT GCGGCCGTGA ATATCACCAC AAACCAGGCT 
GCCCCGTCTG AAGTGCCCAC ACTACGCCTG CACAGCAGCT CAGGCAGCAG CCTCACCCTA 
TCCTGGGCAC CCCCAGAGCG GCCCAACGGA GTCATCCTGG ACTACGAGAT GAAGTACTTT 
GAGAAGAGCG AGGGCATCGC CTCCACAGTG ACCAGCCAGA TGAACTCCGT GCAGCTGGAC
GGGCTTCGGC CTGACGCCCG CTATGTGGTC CAGGTCCGTG CCCGCACAGT AGCTGGCTAT 
GGGCAGTACA GCCGCCCTGC CGAGTTTGAG ACCACAAGTG AGAGAGGCTC TGGGGCCCAG 
CAGCTCCAGG AGCAGCTTCC CCTCATCGTG GGCTCCGCTA CAGCTGGGCT TGTCTTCGTG 
GTGGCTGTCG TGGTCATCGC TATCGTCTGC CTCAGGAAGC AGCGACACGG CTCTGATTCG 
GAGTACACGG AGAAGCTGCA GCAGTACATT GCTCCTGGAA TGAAGGTTTA TATTGACCCT 
TTTACCTACG AGGACCCTAA TGAGGCTGTT CGGGAGTTTG CCAAGGAGAT CGACGTGTCC 
TGCGTCAAGA TCGAGGAGGT GATCGGAGCT GGGGAATTTG GGGAAGTGTG CCGTGGTCGA 
CTGAAACAGC CTGGCCGCCG AGAGGTGTTT GTGGCCATCA AGACGCTGAA GGTGGGCTAC 
ACCGAGAGGC AGCGGCGGGA CTTCCTAAGC GAGGCCTCCA TCATGGGTCA GTTTGATCAC 
CCCAATATAA TCCGGCTCGA GGGCGTGGTC ACCAAAAG1C GGCCAGTTAT GATCCTCACT 
GAGTTCATGG AAAACTGCGC CCTGGACTCC TTCCTCOSGC TCAACGATGG GCAGTTCACG 
GTCATCCAGC TGGTGGGCAT GTTGCGGGGC ATTGCTGCCG GCATGAAGTA CCTGTCCGAG
ATGAACTATG TGCACCGCGA CCTGGCTGCT CGCAACATCC TTGTCAACAG CAACCTGGTC 
TGCAAAGTCT CAGACTTTGG CCTCTCCCGC TTCCTGGAGG ATGACCCCTC CGATCCTACC 
TACACCAGTT CCCTGGGCGG GAAGATCCCC ATCCGCTSGA CTGCCCCAGA GGCCATAGCC 
TATCGGAAGT TCACTTCTGC TAGTGATGTC TGGAGCTACG GAATTGTCAT GTGGGAGGTC 
ATGAGCTATG GAGAGCGACC CTACTGGGAC ATGAGCAACC AGGATGTCAT CAATGCCGTG 
GAGCAGGATT ACCGGCTGCC ACCACCCATG GACTGTCCCA CAGCACTGCA CCAGCTCATG 
CTGGACTGCT GGGTGCGGGA CCGGAACCTC AGGCCCAAAT TCTCCCAGAT TGTCAATACC 
CTGGACAAGC TCATCCGCAA TGCTGCCAGC CTCAAGGTCA TTGCCAGCGC TCAGTCTGGC 
ATGTCACAGC CCCTCCTGGA CCGCACGGTC CCAGATTACA CAACCTTCAC GACAGTTGGT 
GATTGGCTGG ATGCCATCAA GATGGGGCGG TACAAGGAGA GCTTCGTCAG TGCGGGGTTT 2880

GCATCTTTTG ACCTGGTGGC CCAGATGACG GCAGAAGACC TGCTCCGTAT TGGGGTCACC 2940 

CTGGCCGGCC ACCAGAAGAA GATCCTGAGC AGTATCCAGG ACATGCGGCT GCAGATGAAC 3000 

CAGACGCTGC CTGTGCAGGT CTGACACCGG CTCCCACGGG GACCCTGAGG ACCGTGCAGG 3060

GATGCCAAGC AGCCGGCTGG ACTTTCGGAC TCTTGGACTT TTGGATGCCT GGCCTTAGGC 3120 

TGTGGCCCAG AAGCTGGAAG TTTGGGAAAG GCCCAAGCTG GC-ACTTCTCC AGGCCTGTGT 3180 

TCCCTCCCCA GGAAGTGCGC CCCAAACCTC TTCATATTGA AGATGGATTA GGAGAGGGGG 3240 

TGATGACCCC TCCCCAAGCC CCTCAGGGCC CAGACCTTCC TGCTCTCCAG CAGGGGATCC 3300 

CCACAACCTC ACACTTGTCT GTTCTTCAGT GCTGGAGGTC CTGGCAGGGT CAGGCTGGGG 3360 

TAAGCCGGGG TTCCACAGGG CCCAGCCCTG GCAGGGGTCT GGCCCCCCAG GTAGGCGGAG 3420 

AGCAGTCCCT CCCTCAGGAA CTGGAGGAGG GGACTCCAGG AATGGGGAAA TGTGACACCA 3480 

CCATCCTGAA GCCAGCTTGC ACCTCCAGTT TGCACAGGGA TTTGTCCTGG GGGCTGAGGG 3540 

CCCTGTCCCC ACCCCCGCCC TTGGTGCTGT CATAAAAGGG CAGGCAGGGG CAGGCTGAGG 3600

T TGCCCCCCAG AGACTGACTC TCAGAGCCAG AGATGGGATG TGTGAGTGTG 3660 

G TGTGCGCGCG CGCGCGCGTG TGTGTGTGCA CGCACTGGCC TGCACAGAGA 3720 

GCATGGGTGA GCGTGTAAAA GCTTGGCCCT GTGCCCTACA ATGGGGCCAG CTGGGCCGAC 3780 

AGCAGAATAA AGGCAATAAG ATGAA 3805 



1 11 21 31 41 51 

I I I I I I 

AGGTGAGCGG TTGCTCGTCG TCGGGGCGGC CGGCAGCGGC GGCTCCAGGG CCCAGCATGC 
GCGGGGGACC CCGCGGCCAC CATGTATGTG GGCTATGTGC TGGACAAGGA TTCGCCCGTG 
TACCCCGGCC CAGCCAGGCC AGCCAGCCTC GGCCTGGGCC CGGCAAACTA CGGCCCCCCG 
GCCCCGCCCC CGGCGCCCCC GCAGTACCCC GACTTCTCCA GCTACTCTCA CGTGGAGCCG 
GCCCCCGCGC CCCCGACGGC CTGGGGGGCG CCCTTCCCTG CGCCCAAGGA CGACTGGGCC 
GCCGCCTACG GCCCGGGCCC CGCGGCCCCT GCCGCCAGCC CAGCTTCGCT GGCATTCGGG 
CCCCCTCCAG ACTTTAGCCC GGTGCCGGCG CCCCCTGGGC CCGGCCCGGG CCTCCTGGCG 
CAGCCCCTCG GGGGCCCGGG CACACCGTCC TCGCCCGGAG CGCAGAGGCC GACGCCCTAC 
GAGTGGATGC GGCGCAGCGT GGCGGCCGGA GGCGGCGGTG GCAGCGGTAA GACTCGGACC 
AAGGAOAAGT ACCGCGTGGT CTACACCGAC CACCAACGCC TGGAGCTGGA GAAGGAGTTT 
CATTACAGCC GTTACATCAC AATCCGGCGG AAATCAGAGC TGGCTGCCAA TCTGGGGCTC 
ACTGAACGGC AGGIGAAGAT CTGGTTCCAA AACCGGCGGG CAAAGGAGCC CAAAGTGAAC 
AAGAACAAAC AGCAGCAGCA ACAGCCCCCA CAGCCGCCGA TGGCCCACGA CATCACGGCC 
ACCCCAGCCG GGCCATCCCT GGGGGGCCTG TGTCCCAGCA ACACCAGCCT CCTGGCCACC 
1 < i 1 ' f.TJIA AGAGGAGTTT CTGCCATAGC CCCATGCCCA GCCTGTGCGC 

CGGGGGACCT GGGGACTCGG GTGCTGGGAG TGTGGCTCCT GTGGGCCCAG GAGGTCTGGT 
CCGAGTCTCA GCCCTGACCT TCTGGGACAT GGTGGACAGT CACCTATCCA CCCTCTGCAT 
XCCTTt i ITTGTGTGC AGTAAGCCTG TTGGATAAAG ACCTTCCAGC TCCTGTGTTC 

TAGACCTCTG GGGGATAAGG GAGTCCAGGG TGGATGATCT CAATCTCCCG TGGGCATCTC 
AAGCCCCAAA TGGTTGGGGG AGGGGCCTAG ACAAGGCTCC AGGCCCCACC TCCTCCTCCA 
TACGTTCAGA GGTGCAGCTG GAGGCCTGTG TGGGGACCAC ACTGATCCTG GAGAAAAGGG 
ATGGAGCTGA AAAAGATGGA ATGC1TGCAC AGCATGACCT GAGGAGGGAG GAACGTGGTC 
AACTCACACC TGCCTCTTCT GCAGCCTCAC CTCTACCTGC CCCCATCATA AGGGCACTGA 
GCCCTTCCCA GGCTGGATAC TAAGCACAAA GCCCATAGCA CTGGGCTCTG ATGGCTGCTC 
CACTGGGTTA CAGAATCACA GCCCTCATGA TCATTCTCAG TGAGGGCTCT GGATTGAGAG 
GGAGGCCCTG GGAGGAGAGA AGGGGGCAGA GTCTTCCCTA CCAGGTTTCT ACACCCCCGC 
CAGGCTGCCC ATCAGGGCCC AGGGAGCCCC CAGAGGACTT TATTCGGACC AAGCAGAGCT 
CACAGCTGGA CAGGTGTIGT ATATAGAGTG GAATCTCTTG GATGCAGCTT CAAGAATAAA 
TTTTTCTTCT CTTTTCAAA 



2 DNA Sequence 

NM_021978 

. .2603 



GACGCCTGTG AGACCCGCGA GCGGCCTCGG GGACCATGGG GAGCGATCGG GCCCGCAAGG 
GCGGAGGGGG CCCGAAGGAC TTCGGCGCGG GACTCAAGTA CAACTCCCGG CACGAGAAAG 
TGAATGGCTT GGAGGAAGGC GTGGAGTTCC TGCCAGTCAA CAACGTCAAG AAGGTGGAAA
AGCATGGCCC GGGGCGCTGG GTGGTGCTGG CAGCCGTGCT GATCGGCCTC CTCTTGGTCT 
TGCTGGGGAT CGGCTTCCTG GTGTGGCATT TGCAGTACCG GGACGTGCGT GTCCAGAAGG 
— 3 CTACATGAGG A ' 



ACTCCACTGA GTTTGTAAGC CTGGCCAGCA AGGTGAAGGA CGCGCTGAAG C 

GCGGAGTCCC ATTCCTGGGC CCCTACCACA AGGAGTCGGC TGTGACGGCC TTCAGCGAGG 
GCAGCGTCAT CGCCTACTAC TGGTCTGAGT TCAGCATCCC GCAGCACCTG GTGGAGGAGG 
CCGAGCGCGT CATGGCCGAG GAGCGCGTAG TCATGCTGCC CCCGCGGGCG CGCTCCCTGA 

CCCAGGACAA CAGCTGCAGC TTTGGCCTGC ACGCCCGCGG TGTGGAGCTG ATGCGCTTCA 
CCACGCCCGG CTTCCCTGAC AGCCCCTACC CCGCTCATGC CCGCTGCCAG TGGGCCCTGC 
GGGGGGACGC CGACTCAGTG CTGAGCCTCA CCTTCCGCAG CTTTGACCTT GCGTCCTGCG 
ACGAGCGCGG CAGCGACCTG GTGACGGTGT ACAACACCCT GAGCCCCATG GAGCCCCACG 
CCCTGGTGCA GTTGTGTGGC ACCTACCCTC CCTCCTACAA CCTGACCTTC CACTCCTCCC 
AGAACGTCCT GCTCATCACA CTGATAACCA ACACTGAGCG GCGGCATCCC GGCTTTGAGG 
CCACCTTCTT CCAGCTGCCT AGGATGAGCA GCTGTGGAGG CCGCTTACGT AAAGCCCAGG 
GGACATTCAA CAGCCCCTAC TACCCAGGCC ACTACCCACC CAACATTGAC TC-CACATGGA 
ACATTGAGGT GCCCAACAAC CAGCATGTGA AGGTGCGCTT CAAATTCTTC TACCTGCTGG 
AGCCCGGCGT GCCTGCGGGC ACCTGCCCCA AGGACTACGT GGAGATCAAT GGGGAGAAAT 
ACTGCGGAGA GAGGTCCCAG TTCGTCGTCA CCAGCAACAG CAACAAGATC ACAGTTCGCT
TCCACTCAGA TCAGTCCTAC ACCGACACCG GCTTCTTAGC TGAATACCTC TCCTACGACT 
CCAGTGACCC ATGCCCGGGG CAGTTCACGT G
TGCGCTGTGA TGGCTGGGCC GACTGCACCG A 
ACGCCGGCCA CCAGTTCACG TGCAAGAACA AGTTCTGCAA G 
ACAGTGTGAA CGACTGCGGA GACAACAGCG ACGAGCAGGG GTGCAGTTGT CCGGCCCAGA 



A CGGGTCCGAC GAGGCCTCCT G 

AACACACCTA CCGCTGCCTC AATGGGCTCT GCTTGAGCAA GGGCAACCCT GAGTGTGACG 1800

GGAAGGAGGA CTGTAGCGAC GGCTCAGATG AGAAGGACTG CGACTGTGGG CTGCGGTCAT 1860

TCACGAGACA GGCTCGTGTT GTTGGGGGCA CGGATGCGGA TGAGC-GCGAG TGGCCCTGGC 1920 

AGGTAAGCCT GCATGCTCTG GGCCAGGGCC ACATCTGCGG TGCTTCCCTC ATCTCTCCCA 1980 

ACTGGCK3GT CTCTGCCGCA CACTGCTACA TCGATGACAG AGGATTCAGG TACTCAGACC 2040 

CCACGCAGTG GACGGCCTTC CTGGGCTTGC ACGACCAGAG CCAGCGCAGC GCCCCTGGGG 2100 

TGCAGGAGCG CAGGCTCAAG CGCATCATCT CCCACCCCTT CTTCAATGAC TTCACCTTCG 2160

CCATCTGCCT GCCGGACGCC ICCCATGTCT TCCCTGCCGG CAAGGCCATC TGGGTCACGG 2280 

GCTGGGGACA CACCCAGTAT GGAGGCACTG GCGCGCTGAT CCTGCAAAAG GGTGAGATCC 2340 

GCGTCATCAA CCAGACCACC TGCGAGAACC TCCTGCCGCA GCAGATCACG CCGCGCATGA 2400 

TGTGCGTGGG CTTCCTCAGC GGCGGCGTGG ACTCCTGCCA GGGTGATTCC GGGGGACCCC 2460 

TGTCCAGCGT GGAGGCGGAT GGGCGGATCT TCCAGGCCGG TGTGGTGAGC TGGGGAGACG 2520 

GCTGCGCTCA GAGGAACAAG CCAGGCGTGT ACACAAGGCT CCCTCTGTTT CGGGACTGGA 2580 

TCAAAGAGAA CACTGGGGTA TAGGGGCCGG GGCCACCCAA ATGTGTACAC CTGCGGGGCC 2640 

ACCCATCGTC CACCCCAGTG TGCACGCCTG CAGGCTGGAG ACTGGACCGC TGACTGCACC 2700 

AGCGCCCCCA GAACATACAC TGTGAACTCA ATCTCCAGGG CTCCAAATCT GCCTAGAAAA 2760 

CCTCTCGCTT CCTCAGCCTC CAAAGTGGAG CTGGGAGGTA GAAGGGGAGG ACACTGGTGG 2820 

TTCTACTGAC CCAACTGGGG GCAAAGGTTT GAAGACACAG CCTCCCCCGC CAGCCCCAAG 2880 

CTGGGCCGAG GCGCGTTTGT GTATATCTGC CTCCCCTGTC TGTAAGGAGC AGCGGGAACG 2940

GAGCTTCGGA GCCTCCTCAG TGAAGGTGGT GGGGCTGCCG GATCTGGGCT GTGGGGCCCT 3000 

TGGGCCACGC TCTTGAGGAA GCCCAGGCTC GGAGGACCCT GGAAAACAGA CGGGTCTGAG 3060 

ACTGAAAATG GTTTACCAGC TCCCAGGTGA CTTCAGTGTG TGTATTGTGT AAATGAGTAA 3120 

AACATTTTAT TTCTTTTTAA AAAAAAAAA 3149

Seq ID NO: C23 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2268 

1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAAGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240

CTGGACATCA GCCCCGAGAC GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG T 
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC C 
AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 
ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 
CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG C
C GGCGGACCCT TGCGGTGCTG GCTGCACACT G 

C TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 
TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG G 



CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 
CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC C 
GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACf 

G TGTAGGCAGT 



TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920

" " "G GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

G TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTCGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CGGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGA TTCTTGAGAC GCCCCTGA 2268 

Seq ID NO: C24 DNA Sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..2424 
AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC
AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT
CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 
CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 
ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 
GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC C 
AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG G 
ACCCTCAGCA GCTCGGCCAT C 
CCCTGTGAGC ACAGGACGCT GGAGATGGTC C
AGAGGATCGC GGCGGACCCT T 






CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG C 
CGAGTGGGTG TGGCCACATA CAGCAGGGAG C 
CAGGATGTGC CTGACCTGGT CTGGAGCCTC G 

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 
GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 
GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 
GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGGAGCC CAAAGCATGT GATGGTCTAC 
TCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCOGG 
G GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 
TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 
TGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 
G GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 
GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 
GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 
GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 
TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 
CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 
CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 
TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTGGGGAT 
GGCT ii ( A TG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 
CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 
AATGTCTGTG CCCCAGGTCC TTAG

Seq ID NO: C25 DNA Sequence 
Nucleic Acid Accession #: XM_097386.3 
Coding sequence: 142..795

1 11 21 31 41 51 

CTCGCAGAAC CACCTGGACT CTGTCCGTGT CTCTCCCCCG GCCTCCAGGG CTCCTCTCCC 
GGGACCCCCG TCCCACGCCT GGGCCCCGCG CCGGGGGAAG C 
CTACCTCAGG TCTGACTTTT GATGCCAAAA TCIGAGCCCC T( 

TCCCGTGCAC CAGGGTCTGC AGCAGCCACT GGGGCCTGGC TGCCTGCTGC ATCTGGCGGC 
CCTGGACCCC TGGGGCCCCC GTGCACCTGC CCACCTCGGA GCCTGGGGAG GGGCCGTGCA 
GGGTCGAGGG CTGGGTCGTC TCCCTCGGGC TGCGTGTGTG TGTCGGGAAT CCTGCGTGTC 
GTGTCTGTGG GCGATCCGGC CTCCCGGCGG TGGGTGGACC TGGATTCTAA CTCAGAGGAC 
TTGAGCCTGC TGTTAACTCC GATGATTGTA GGGACAGGCG GGGTGGGTGG GGGGTGGGCG 
CGAGGCTGGG TCCCGGCCCA GGAGAAGGAA GTCGCTGAAG GCAGTGGCCA TGCTGGCCGT 
GGAAATGGGA GGCGGTTGCA GAGGGTCTAT GGGGCCCGGT CCTGGATACT CGGCAGGAAG 
CCGTGTCTGC AGAGGCTCCT CCCTGCCTCA GGTGGCCCCG TTCAACCCCA GCCGTGCCCA 
TCTCCTGCCA CCGCCTGTCG GTGGGGGTTT AAATT CGGTG TGGCTTTCTG GGGTGCAGCT 
CAGCACCCCC CCTTATGCAG ACTGGGAGGG GGTCGGGCAG TCCCCTCAGC C 
CTGGATGGGT TCTAGTTCAC TTGGGACCGT GGGGCCTGGC TGCGTACTGA G 
CACAGTCAAG GCCAACGGGG GCTCCCCCTG CTCTGAGATG TTGGGAGAAA GGCGGCTTCT 900 
GGAACCTTCC GTGGGACCCG TAAGTGGCTG TCCAGAAAGG CGGGAGGGTG GGCACGGGGC 960 
ACGGGGGGCA GCTGGGGTCG TTGTTAAGGG TCACGCATCT GTACAGTTGA ATTTCCTTTC 1020 
TCTTATCATG TTTTACCCAC CTTGTCCCTT TTTTCCCCAA TTGTGCTTTT GCATTTTTTT 1080 
CCTTGGCAAA TGTAAACTCA GCCTTTCATT CATGACGTGT GAAATTTCAG TTTCTCTGGA 1140 
GTTTGTCAGA CGGCGTGGGA ACCACGCCTG AAACTCAGGT AATAGGAGGA AAAAAAAAAA 1200 
AACTTAAAAA AATTTTTAAA AAACATAAAA CTACTCTCTA CCTCTGGCTG GGCCCAGCCT 1260 
T GGCCGCGGCA GGGTGGCCIG TAACAATTTC AGTTTTCGCA GAACATTCAG 1320 
IAAA 1337 



Coding sequence: 95..2128

i i 1 i 1 r i 1 r 

GGGGTAGTTT GTAGGGACGC AGCTCTCCAC GTGCGCGACT GCGAGGCTGG ACGCTACGGG 
CTCCTGGAAA GGAGAGACAC CAGCATTTGC CACAATGCTG TCATCCACTG ACTTTACATT 
TGCTTCCTGG GAGCTTGTGG TCCGCGTTGA CCATCCCAAT GAAGAGCAGC AGAAAGACGT
CACACTGAGA GTATCTGGAG ACCTTCATGT TGGAGGAGTG ATGCTCAAGT TAGTAGAACA
GATCAATATA TCCCAAGACT GGTCAGACTT TGCTCTTTGG TGGGAACAGA AGCATTGCTG
GCTTCTGAAA ACCCACTGGA CCCTGGACAA ATATGGGGTC CAGGCAGATG CAAAGCTTCT 
CTTCACCCCT CAGCATAAAA TGCTGCGCCT TCGTCTGCCG AATTTGAAGA TGGTGAGGTT 
GCGAGTCAGC TTCTCAGCTG TGGTTTTTAA AGCTGTCAGT G 






WO 03/042661 PCT/US02/36810 



TATTAGAAGA TCAGAAGAGC TTTCCTTGTT AAAGCCGTCT GGTGACTATT TTAAGAAGAA 
GAAGAAAAAA GACAAAAATA ATAAGGAACC CATAATTGAA GATVTTCTAA Ac ( A 
TTCTCCAACA GCTTCAGGTT CATCAGTAAG TCCTGGTTTA TACAGTAAAA CCATGACCCC 
TATATATGAC CCCATCAATG GAACACCAGC ATCATCCACC ATGACTTGGT TCAGTGACAG 
CCCTTTGACG GAACAAAACT GCAGCATCCT CGCATTCAGC CAACCCCCCC AGTCCCCAGA 
AGCACTTGCG GATATGTACC AGCCTCGGTC TCTGGTTGAT AAAGCCAAGC TCAATGCAGG 
TTGGCTAGAC TCCTCACGCT CCCTTATGGA ACAAGGCATC CAAGAGC 
CTTACGATTT AAATATTATT CTTTCTTCGA CTTGAATCCT AAATATGATG C 
AAACCAACTC TATGAGCAAG CCAGGTGGGC CATTCTCTTA GAAGAAf

GGAAGAAATG TTGATCTTTG CAGCTCTACA GTACCACATT AGCAAACTGT C 

TGAAACACAG GATTTTGCAG GCGAGTCCGA GGTTGATGAA ATAGAAGCGG CGCTTTCTAA 
TTTGGAAGTA ACCCTAGAAG GTGGAAAAGC GGACAGCCTT TTGGAGGACA TTACTGATAT 
CCCTAAACTT GCAGATAATC TCAAATTATT TAGGCCCAAG AAGTTACTAC CAAAAGCTTT 
CAAACAATAT TGGTTTATCT TTA^AGACAC ATCCATAGCA TACTTTAAAA ATAAGGAACT 
TGAACAAGGA GAACCACTAG AAAAACTAAA TCTTAGAGGC TGCGAAGTTG TGCCCGATGT 
AAATGTAGCA GGAAGAAAAT TTGGAATCAA GTTACTAATC CCTGTTGCCG ATGGTATGAA
TGAAATGTAT TTGAGATGTG ACCATGAGAA TCAATACGCC CAATGGATGG CTGCCTGCAT 
" \ CCATGGCAGA CAGCTCCTAC CAGCCAGAGG T 



r ATGAACCCAG AATGTTTTGT GTCACCACGG TGTGCAAAAA G 

G GCCGCCCGGA TCCTGGAGGC GCACCAGAAC GTGGCCCAC-A TGCCCCTGGT 
CGAAGCCAAG CTGCGGTTCA TCCAGGCGTG GCAGTCACTG CCTGAGTTTG GCCTCACCTA 
CTACCTTGTC AGATTTAAAG GAAGCAAAAA AGATGACATT CTGGGAGTTT CATATAACAG 
GTTGATTAAA ATTGATGCAG CCACCGGGAT TCCAGTGACA ACATGGAGAT TCACAAATAT 

CAAACAGTGG AATGTAAACT GGGAAACCCG GCAGGTGGTC ATCGAGTTTG ACCAAAACGT 

CTTTACTGCT TTCACCTGCC TGAGTGCAGA TTGCAAGATT GTGCACGAGT ACATTGGCGG 2040

CTACATTTTC TTGTCCACCC GCTCCAAGGA CCAGAATGAA ACACTCGATG AGGACTTGTT 2100 

CCACAAATTG ACCGGCGGTC AGGATTGAAA CAAGCACGCG TGCTCGGCTC ACACCAACAA 2160 

GGCAAGCCAA AGGCGCCCCT CCCCAGAGGG ATCCCTAACG TGCCCAGCAT GTAGATTCTG 2220 

GACTAACAGA CAACATACAT TCACCGCTGG TCACCCAGAT CCTCATTCAA ACCCACTGCT 2280 

GGCACATCCC TTTCCTTACT TTGCCCTGTG CTACCAGCCA CGGAAGGAGC CTCTCTTGTT 2340 

TTTTCTATAA AATGGGTAGG CAGGAGAAAA GCAGGTGCCC TAAGATTGCT CTAAGGCCCA 2400 

GCATGTGGTT ACAGTTCTCT GACTTGCAGA ACCTGCCAGG TGTATGGCTA CAAGTTATCC 2460 

TCGTGCTGAT CTGTCTCATT ACTAAGTCAA TGGAGAAGAC AGAAAGGTAA AAATCACGTG 2520 

TAGCAAGAAC AACTCTTATT TCACAAACTC AGGTATGAAA CGAAACGCCT GTCCTTCATG 2580 

GAACTGCTTT TAGCTCCTGT CTTTTCAAAA TGGCAGAGGG AGTTCCTACA CACACTTTTT 2640 

CCCTGGAGGC CAAGGTCTAG GGGTAGAAAG GGGAGGGGTG GGGCTACCAG GTAGCAGTTG 2700 

ACAACCCAAG GTCAGAGGAG TGGCCCTCAG TGTCATCTGT CCACAGTGAT ACCTGCCAAG 2760

ATGACCACTG ACCCACATCT GGTCTTAGTC ATTGGTCTCC TCAGATTTCT GGGGCCACCT 2820

GCAAGCCCCA TTCCATTCCT ACAGATCTCT CAGCCACCTG TAAGTCCTTT GTGAAGATGT 2880 

GGGTGACACA GGGGGACAGG AAAACCCATT TCTCAACCCA GATCCATGTC TCCACTGCTT 2940 

CTACTCTGGG TTGGGATTCA GGAAGACAGG CACAGTCCTC TCTGTTCATA GAAACACCTG 3000 

CCAGTGTCAA GGATTCCAGT CAGGTGTCTA TCCCAACTGG TCAGGGAGAG AAGGGCAGAC 3060

CCATTCTCAA AGACCACCAT GTCCAAGGTC TGACAGCTCC CCACTGGCTG CCCCCACAGG 3120 

GGCTTTAGGC TGGTCTGGGT CATGGGGAAG CGTCCCTCTT ATCGCTGGTC TGTGTTCTCC 3180 

TGGATTTGGT ATCTATGTTG GTACGACTCC TGGCCTTTTA TCTAAAGGAC TTTGGCTTTT 3240 

GTAAATCACA AGCCAATAAT AGACTTTTTT CTCCCCCTCT GTTTTTTGCT GTGTCATCTC 3300

TGCCTTGAGA CTGCCTTGAG ACAGTGCTTG CCTTGAGAGA GTGAGCCAAT TAACAGCTGC 3360

CTGAATTGTC ATTTTCCATT TTGGTTTGTT AGAGGTGGGA GGGGTGGGTT TTGAGAAGGT 3420 

CAAAAGCAAT ACCAGAAGTA AAGGGAAATA TCAGACAATA TTTTATTATT TTTTCATAGA 3480

TGTTCTGCCA CACAAAGAAC TTGGGGTGTA AGGATAAGGC AAAAGCTCCA ATCCCATTTT 3540 

TCAGTTCTCC TAGGATGCAC CCCTCAGGGA GCCTGGCCAG AGTTCCGAGG CCCGTGAGCG 3600 

TCAGCTGTTG CTTTATTTTC CATCAAAGCC CTCTGAGAAG TGAGACCTCA GCAATTCCGG 3660 

GAGCCACATA GAGACAGACT TGGCAAGGGA CCCCCTGGTT CTGAGCCAGT AGCTGCCATC 3720 

TGGAAATTCC TCTTTTAGCC TCTCCTTAGA GGTGAATGTG AATGAAGCCT CCCAGGCACC 3780 

CGCTGAATTT CTGAGGCCTT GCTTAAAGCT CAGAAGTGGT TTAGGCATTT GGAAAATCTG 3840 

GTTCACATCA TAAAGAACTT GATTTGAAAT GTTTTCTATA GAAACAAGTG CTAAGTGTAC 3900 

CGTATTATAC TTGATGTTGG TCATTTCTCA GTCCTATTTC TCAGTTCTAT TATTTTAGAA 3960 

CCTAGTCAGT TCTTTAAGAT TATAACTGGT CCTACATTAA AATAATGCTT CTCGATGTCA 4020 

GATTTTACCT GTTTGCTGCT GAGAACATCT CTGCCTAATT TACCAAAGCC AGACCTTCAG 4080 

TTCAACATGC TTCCTTAGCT TTTCATAGTT GTCTGACATT TCCATGAAAA CAAAGGAACC 4140 

AACTTTGTTT TAACCAAACT TTGTTTGGTT ACAGTTTTCA GGGGAGCGTT TCTTCCATGA 4200

CACACAGCAA CATCCCAAAG AAATAAACAA GTGTGACAAA AAABAAAAAA AACAAACCTA 4260

AATGCTACTG TTCCAAAGAG CAACTTGATG GTTTTTTTTA ATACTGAGTG CAAAAGGTCA 4320 

CCCAAATTCC TATGATGAAA TTTTAAATTA ATGGGCACCT TTCAACATCA TTTGCTTCCT 4380 
TATCTACAGT TGATTCAGAA ATCTGCATTT TTTATTCTTT T 
GATTTATATG GATTTGAAAA AAAAAAAAAA A 



MDGGTLPRSA PPAPPVPVGC AARRRPASPE LLRCSRRRRP ATAETGGGAA A 
NRVKLVNLGF QALRQHVPHG GASKKLSKVE TLRSAVEYIR ALQRLLAEHD AVRNALAGGL 
RPQAVRPSAP RGPPGTTPVA ASPSRASSSP GRGGSSEPGS PRSAYSSDDS GCEGALSPAE 
RELLDFSSWL GGY 
3 CCAAAACTAC GACTTGGCTT TCTGAAACGG
AAGCATAAAT GTTCTTTTCC TCCATTTGTC TGGATCTGAG AACCTGCATT TGGTATTAGC
TAGTGGAAGC AGTATGTATG GTTGAAGTGC ATTGCTGCAG CTGGTAGCAT GAGTGGTGGC 
CACCAGCTGC AGCTGGCTGC CCTCTGGCCC TGGCTGCTGA TGGCTACCCT G 
ft CAGGACTGGT ACTGGCAGCA GCGGTGGAGT CTGAAAC 

T CCCCTTGAAA ATGGACCCCA CAGGAAAACT GAATCTCACT 
TTGGAAGGTG TGTTTGCTGG TGTTGCTGAA ATAACTCCAG CAGAAGGAAA A 






TGGGGTAATG ACGCTGAGAA GCTGATGGAG TTTGTGTACA AGAACCAAAA GGCCCATGTG 
AGGATTGAGC TGAAGGAGCC CCCGGCCTGG CCAGATTATG ATGTGTGGAT CCTAATGACA 
GTGGTGGGCA CCATCTTTGT GATCATCCTG GCTTCGGTGC TGCGCATCCG GTGCCGCCCC 
CGCCACAGCA GGCCGGATCC GCTTCAGCAG AGAACAGCCT GGGCCATCAG CCAGCTGGCC
ACCAGGAGGT ACCAGGCCAG CTGCAGGCAG GCCCGGGGTG AGTGGCCAGA CTCAGGGAGC 
AGCTGCAGCT CAGCCCCTGT GTGTGCCATC TGTCTGGAGG AGTTCTCTGA GGGGCAGGAG 
CTACGGGTCA TTTCCTGCCT CCATGAGTTC CATCGTAACT GTGTGGACCC CTGGTTACAT 
CAGCATCGGA CTTGCCCCCT CTGCGTGTTC AACATCACAG AGGGAGATTC ATTTTCCCAG 
TCCCTGGGAC CCTCTCGATC TTACCAAGAA CCAGGTCGAA GACTCCACCT CATTCGCCAG
CATCCCGGCC ATGCCCACTA CCACCTCCCT GCTGCCTACC TGTTGGGCCC TTCCCGGAGT 
GCAGTGGCTC GGCCCCCACG ACCTGGTCCC TTCCTGCCAT CCCAGGAGCC AGGCATGGGC 
CCTCGGCATC ACCGCTTCCC CAGAGCTGCA CATCCCCGGG CTCCAGGAGA GCAGCAGCGC 
CTGGCAGGAG CCCAGCACCC CTATGCACAA GGCTGGGGAA TGAGCCACCT CCAATCCACC 
TCACAGCACC CTGCTGCTTG CCCAGTGCCC CTACGCCGGG CCAGGCCCCC TGACAGCAGT
GGATCTGGAG AAAGCTATTG CACAGAACGC AGTGGGTACC TGGCAGATGG GCCAGCCAGT 
GACTCCAGCT CAGGGCCCTG TCATGGCTCT TCCAGTGACT CTGTGGTCAA CTGCACGGAC 
ATCAGCCTAC AGGGGGTCCA TGGCAGCAGT TCTACTTTCT GCAGCTCCCT AAGCAGTGAC 
TTTGACCCCC TAGTGTACTG CAGCCCTAAA GGGGATCCCC AGCGAGTGGA CATGCAGCCT 
AGTGTGACCT CTCGGCCTCG TTCCTTGGAC TCGGTGGTGC CCACAGGGGA AACCCAGGTT
TCCAGCCATG TCCACTACCA CCGCCACCGG CACCACCACT ACAAAAAGCG GTTCCAGTGG 
CATGGCAGGA AGCCTGGCCC AGAAACCGGA GTCCCCCAGT CCAGGCCTCC TATTCCTCGG 
ACACAGCCCC AGCCAGAGCC ACCTTCTCCT GATCAGCAAG TCACCGGATC CAACTCAGCA 
GCCCCTTCGG GGCGGCTCTC TAACCCACAG TGCCCCAGGG CCCTCCCTGA GCCAGCCCCT 
GGCCCAGTTG ACGCCTCCAG CATCTGCCCC AGTACCAGCA GTCTGTTCAA CTTGCAAAAA
TCCAGCCTCT CTGCCCGACA CCCACAGAGG AAAAGGCGGG GGGGTCCCTC CGAGCCCACC 
CCTGGCTCTC GGCCCCAGGA TGCAACTGTG CACCCAGCTT GCCAGATTTT TCCCCATTAC 
ACCCCCAGTG TGGCATATCC TTGGTCCCCA GAGGCACACC CCTTGATCTG TGGACCTCCA 
GGCCTGGACA AGAGGCTGCT ACCAGAAACC CCAGGCCCCT GTTACTCAAA TTCACAGCCA 



ACAGAGTCCT GCTCCTGGGA AAGGAAAGGA CCACAGCAAA CACCATTCTT TTTGCCGTAC 2640 

TTCCTAGAAG CACTGGAAGA GGACTGGTGA TGGTGGAGGG TGAGAGGGTG CCGTTTCCTG 2700 

CTCCAGCTCC AGACCTTGTC TGCAGAAAAC ATCTGCAGTG CAGCAAATCC ATGTCCAGCC 2760 

AGGCAACCAG CTGCTGCCTG TGGCGTGTGT GGGCTGGATC CCTTGAAGGC TGAGTTTTTG 2820

AGGGCAGAAA GCTAGCTATG GGTAGCCAGG TGTTACAAAG GTGCTGCTCC TTCTCCAACC 2880 

CCTACTTGGT TTCCCTCACC CCAAGCCTCA TGTTCATACC AGCCAGTGGG TTCAGCAGAA 2940

CGCATGACAC CTTATCACCT CCCTCCTTGG GTGAGCTCTG AACACCAGCT TTGGCCCCTC 3000 

CACAGTAAGG CTGCTACATC AGGGGCAACC CTGGCTCTAT CATTTTCCTT TTTTGCCAAA 3060 

AGGACCAGTA GCATAGGTGA GCCCTGAGCA CTAAAAGGAG GGGTCCCTGA AGCTTTCCCA 3120 

CTATAGTGTG GAGTTCTGTC CCTGAGGTGG GTACAGCAGC CTTGGTTCCT CTGGGGGTTG 3180 

AGAATAAGAA TAGTGGGGAG GGAAAAACTC CTCCTTGAAG ATTTCCTGTC TCAGAGTCCC 3240 

AGAGAGGTAG AAAGGAGGAA TTTCTGCTGG ACTTTATCTG GGCAGAGGAA GGATGGAATG 3300 

AAGGTAGAAA AGGCAGAATT ACAGCTGAGC GGGGACAACA AAGAGTTCTT CTCTGGGAAA 3360 

AGTTTTGTCT TAGAGCAAGG ATGGAAAATG GGGACAACAA AGGAAAAGCA AAGTGTGACC 3420 

CTTGGGTTTG GACAGCCCAG AGGCCCAGCT CCCCAGTATA AGCCATACAG GCCAGGGACC 3480 

CACAGGAGAG TGGATTAGAG CACAAGTCTG GCCTCACTGA GTGGACAAGA GCTGATGGGC 3540 

3660 



CCACAAAAAA AAAAAAAAAA AA 



MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFL2 LNSHTTNPEQ 
TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDEINL MSLATEDNFD PIDVSQLFDE 
PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHE SSSHHDLEGA VGGYYPEPSK 
LCHLDQSDSD FHGDLTFQHV FHNHTYHLQP TAPESTSEPF PWPGKSQKIR SRYLEDTDRN 
LSRDEQRAKA LHIPFSVDEI VGMPVDSFNS MLSRYYLTDL QVSblRDIRR RGKNKVAAQN 
CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 
RPVNPNHYAL QCTHDGSILI VPKELVASGH KKETQKGKRK 



Seq ID NO: C30 DNA Sequence 
Nucleic Acid Accession #: NM_004442 
Coding sequence: 19..2982



1 11 21 31 41 51 
I I I I I I

GCCCCGGGAA GCGCAGCCAT GGCTCTGCGG AGGCTGGGGG CCGCGCTGCT GCTGCTGCCG 
CTGCTCGCCG CCGTGGAAGA AACGCTAATG GACTCCACTA CAGCGACTGC TGAGCTGGGC 
TGGATGGTGC ATCCTCCATC AGGGTGGGAA GAGGTGAGTG GCTACGATGA GAACATGAAC 
ACGATCCGCA CGTACCAGGT GTGCAACGTG TTTGAGTCAA GCCAGAACAA CTGGCTACGG 
ACCAAGTTTA TCCGGCGCCG TGGCGCCCAC CGCATCCACG TGGAGATGAA GTTTTCGGTG 
CGTGACTGCA GCAGCATCCC CAGCGTGCCT GGCTCCTGCA AGGAGACCTT CAACCTCTAT 
TACTATGAGG CTGACTTTGA CTCGGCCACC AAGACCTTCC CCAACTGGAT GGAGAATCCA 
TGGGTGAAGG TGGATACCAT TGCAGCCGAC GAGAGCTTCT CCCAGGTGGA CCTGGGTGCC
CGCGTCATGA AAATCAACAC CGAGGTGCGG AGCTTCGGAC CTGTGTCCCG CAGCGGCTTC 
TACCTGGCCT TCCAGGACTA TGGCGGCTGC ATGTCCCTCA TCGCCGTGCG TGTCTTCTAC 
CGCAAGTGCC CCCGCATCAT CCAGAATGGC GCCATCTTCC AGGAAACCCT GTCGGGGGCT 
GAGAGCACAT CGCTGGTGGC TGCCCGGGGC AGCTGCATCG CCAATGCGGA AGAGGTGGAT 
GTACCCATCA AGCTCTACTG TAACGGGGAC GGCGAGTGGC TGGTGCCCAT CGGGCGCTGC 
ATGTGCAAAG CAGGCTTCGA GGCCGTTGAG AATGGCACCG TCTGCCGAGG TTGTCCATCT 
GGGACTTTCA AGGCCAACCA AGGGGATGAG GCCTGTACCC ACTGTCCCAT CAACAGCCGG 
3 AAGGGGCCAC CAACTGTGTC TGCCGCAATG GCTACTACAG AGCAGACCTG 



GTCAATGAGA C

T ACAACATCAT C 

C ACGCATTTAC 

ATCAGTGACC TGCTGGCCCA CACCCAGTAC ACCTTCGAGA TCCAGGCTGT GAACGGCGTT 

ACTGACCAGA GCCCCTTCTC GCCTCAGTTC GCCTCTGTGA ACATCACCAC CAACCAGGCA 1320 

GCTCCATCGG CAGTGTCCAT CATGCATCAG GTGAGCCGCA CCGTGGACAG CATTACCCTG 1380

TCGTGGTCCC AGCCAGACCA GCCCAATGGC GTGATCCTGG ACTATGAGCT GCAGTACTAT 1440 

GAGAAGGAGC TCAGTGAGTA CAACGCCACA GCCATAAAAA GCCCCACCAA CACGGTCACC 1500 

GTGCAGGGCC TCAAAGCCGG CGCCATCTAT GTCTTCCAGG TGCGGGCACG CACCGTGGCA 1560

GGCTACGGGC GCTACAGCGG CAAGATGTAC TTCCAGACCA TGACAGAAGC CGAGTACCAG 1620 

ACAAGCATCC AGGAGAAGTT GCCACTCATC ATCGGCTCCT CGGCCGCTGG CCTGGTCTTC 1680

CTCATTGCTG TGGTTGTCAT CGCCATCGTG TGTAACAGAA GACGGGGGTT TGAGCGTGCT 1740 

GACTCGGAGT ACACGGACAA GCTGCAACAC TACACCAGTG GCCACATGAC CCCAGGCATG 1800 

AAGATCTACA TCGATCCTTT CACCTACGAG GACCCCAACG AGGCAGTGCG GGAGTTTGCC 1860

AAGGAAATTG ACATCTCCTG TGTCAAAATT GAGCAGGTGA TCGGAGCAGG GGAGTTTGGC 1920 

GAGGTCTGCA GTGGCCACCT GAAGCTGCCA GGCAAGAGAG AGATCTTTGT GGCCATCAAG 1980

ACGCTCAAGT CGGGCTACAC GGAGAAGCAG CGCCGGGACT TCCTGAGCGA AGCCTCCATC 2040 

ATGGGCCACT TCGACCATCC CAACGTCATC CACCTGGAGG GTGTCGTGAC CAAGAGCACA 2100 

CCTGTGATGA TCATCACCGA GTTCATGGAG AATGGCTCCC TGGACTCCTT TCTCCGGCAA 2160 

AACGATGGGC AGTTCACAGT CATCCAGCTG GTGGGCATGC TTCGGGGCAT CGCAGCTGGC 2220 

ATGAAGTACC TGGCAGACAT GAACTATGTT CACCGTGACC TGGCTGCCCG CAACATCCTC 2280 

GTCAACAGCA ACCTGGTCTG CAAGGTGTCG GACTTTGGGC TCTCACGCTT TCTAGAGGAC 2340 

GATACCTCAG ACCCCACCTA CACCAGTGCC CTGGGCGGAA AGATCCCCAT CCGCTGGACA 2400

GCCCCGGAAG CCATCCAGTA CCGGAAGTTC ACCTCGGCCA GTGATGTGTG GAGCTACGGC 2460

ATTGTCATGT GGGAGGTGAT GTCCTATGGG GAGCGGCCCT ACTGGGACAT GACCAACCAG 2520

GATGTAATCA ATGCCATTGA GCAGGACTAT CGGCTGCCAC CGCCCATGGA CTGCCCGAGC 2580 

GCCCTGCACC AACTCATGCT GGACTGTTGG CAGAAGGACC GCAACCACCG GCCCAAGTTC 2640

GGCCAAATTG TCAACACGCT AGACAAGATG ATCCGCAATC CCAACAGCCT CAAAGCCATG 2700 

GCGCCCCTCT CCTCTGGCAT CAACCTGCCG CTGCTGGACC GCACGATCCC CGACTACACC 2760 

AGCTTTAACA CGGTGGACGA GTGGCTGGAG GCCATCAAGA TGGGGCAGTA CAAGGAGAGC 2820 

TTCGCCAATG CCGGCTTCAC CTCCTTTGAC GTCGTGTCTC AGATGATGAT GGAGGACATT 2880 

CTCCGGGTTG GGCTCACTTT GGCTGGCCAC CAGAAAAAAA TCCTGAACAG TATCCAGGTG 2940

ATGCGGGCGC AGATGAACCA GATTCAGTCT GTGGAGGTTT GACATTCACC TGCCTCGGCT 3000 

CACCTCTTCC TCCAAGCCCC GCCCCCTCTG CCCCACGTGC CGGCCCTCCT GGTGCTCTAT 3060 
3 GCCAGCCACT CGCCAGGAGG CCACGGGCCA CGGGAAGAAC C 



GGAAAAAAGA AAACAGATCC TGGGAGGGGG CGGGAAATAC AAGGAATATT TTTTAAAGAG 
GATTCTCATA AGGAAAGCAA TGACTGTTCT TGCGGGGGAT AAAAAAGGGC TTGGGAGATT 
CATGCGATGT GTCCAATCGG AGACAAAAGC AGTTTCTCTC CAACTCCCTC TGGGAAGGTG 
ACCTGGCCAG AGCCAAGAAA CACTTTCAGA AAAACAAATG TGAAGGGGAG AGACAGGGGC 
C TCAACAACCA A 



CCGCCAGCCC CTGCCTCGAG GACTGATACT GCAGTGACTG CCGTCAGCTC CGACTGCCGC 
TGAGAAGGGT TGATCCTGCA TCTGGGTTTG TTTACAGCAA TTCCTGGACT CGGGGGTATT 
TTGGTCACAG GGTGGTTTTG GTTTAGGGGG TTTGTTTGTT GGGTTGTTTT TTGTTTTTTG
GTTTTTTTTA ATGACAATGA AGTGACACTT TGACATTTCC TACCTTTTGA GGACTTGATC 
CTTCTCCAGG AAGAAGGTGC TTTCTGCTTA CTGACTTAGG CAATACACCA AGGGCGANAT 
A CATTTCTGGA TTTTTTTATA CGGTTTTCAT TGACACTCTT CCCTCCTCCC 
2 AGGCCTCACC AAAGCCCACT GCCATGGGGC CATCTGGGCC AT T CAGAGAC 
TGGAGTGAGA TTTGGGTGTG GAGGGGGAGG CGCCAAGGTG GAGGAGCTTC CCACTCCAGG 
ACTGTTGATG AAAGGGACAG ATTGAGGAGG AAGTGGGCTC TGAGGCTGCA GGGCTGGAAG 
TCCTTGCCCA CTTCCCACTC TCCTGCCCCA ATCTATCTAG TACTTCCCAG GCAAATAGGC 
CCCTTTGAGG CTCCTGAGTG CCCTCAGATG GTCAAAACCC AGTTTTCCCT CTGGGAGCCT 
AAACCAGGCT GCATCGGAGG CCAGGACCCG GATCATTCAC TGTGATACCC TGCCCTCCAG 
AGGGTGCGCT CAGAGACACG GGCAAGCATG CCTCTTCCCT TCCCTGGAGA GAAAGTGTGT 
GATTTCTCTC CCACCTCCTT CCCCCCACCA GACCTTTGCT GGGCCTAAAG GTCTTGGCCA 
TGGGGACGCC CTCAGTCTAG GGATCTGGCC ACAGACTCCC TCCTGTGAAC CAACACAGAC 
ACCCAAGCAG AGCAATCAGT TAGTGAATTG GAATTCCCCA AGTCTTTGCT ATTGTGAATA
GTGCTGCAAT AAACATACGT GTGCATGTGT CTTTATAGTA GAATGATCTA TAATCCTCTG 
GGTATGTACC CAGTAATGGG ATTGCTGGGT CAAATGGTTT TTCTGGTTCT AGATCCTTGA 
GGAATTGCCA CACTGTCTTT CACAATGGTT GAACTAATTT ACACTCCTAC CAACAGTGTA 
AAAGTCTTCC TGTTTCTCCA CATCCTCTCC AGCATCTGTT G 
TTGCCATTCT AACTGGTGTG AGATGGTATC TCATTGTGGG T

TGACCAGTGA AGATGAGCTT TTTTTCATAT GTTTGTTGGC CACATGTTTG TTGTTTCTTT 

A GGTTGTTIGT T 
TAAACTTAAT GAAATAAAGC ATGAAGACAA GATTAGAAGA A 
ACAAAGCGTC CAAGAAATAT GGGACTATGT G. 

& CTCTTCAGGA T 

GAGAACTTCT CCAACCTAGC AAGACAGACC A 




CCAAACTAAG CTTCATAAGC AAAGGAGAAA TAAAATCCTT TACAGACAAG CAAATCCTGG 

T CACCACCAGG CCTGCCTTAC AAGACATCCT GAAGGAAGCA CTAAATATGG 5760 
A AAATACCAAA TTGTAGAGAC CATTGACACT 
A ATAACCAGCT AGTATCATAA TGACAGGATC 
C ATAACAATAT TAACCTTATA TCTAAATGGG CTAAATCCCC CAATTAAAAG 5940 
ACGCAGACTG GCAAATTGGT TAAAGAGTCA AGACTCATTG GTGTGCOGTA TTCAGGAGAC 6000 
CCATCTCACG TGCAAAGACA CACATAGGCT CAGAGTAAAA GGGATACAGG GGAATTC 6057 

Seq ID NO: C31 DNA Sequence 

Nucleic Acid Accession #: NM_031942.1 

Coding sequence: 145..1260



CCCGAGCCCC GCCCCTCCGG GCCCGGGTCG GCGCGCCCAG CCTGCCAGCC GCGCTGCTGC 
TGCTCCTCCT GCTGTGGGAC CGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 
CCGATCTGGG CACCCGCCAC CAGCATGGAC GCTCGCCGCG TGCCGCAGAA AGATCTCAGA 
GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 
TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 
TCAGTTCGGG AAGGCTGTAG GACCCGCAGC CAGTGCAGGC ACTCTGGACC TCTCAGGGTG
GCGATGAAGT TTCCAGCGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 
CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 
AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 
ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 
CACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 
CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 
GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG A 
ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC C 
GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA C
GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC T 
TGCCGTCAGA AGACTATTGA TACCAAAACA A 
CGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG G 
CTGCTGGATC CGAACTGGCA TTG CCGCCT TGTCGAGGAi r 
CGGCAGCGAG ATGGACCGTG T 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG A 
TATCTGGAAA ATTTGCTGCC T 

TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG C 
AATCAAGTTA ATCTTAGCAG ACATGTGTTT C 
TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC O 




I GCAGTTTCTT GTAAATTTAT A 

I TTTTGGAGAT 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 

T ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 
T ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 
GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 2510 

Seq ID NO: C32 DNA Sequence 



TCTCGCTGGA GGCCAGGCCG TGCAGCATCG AAGACAGGAG GAACTGGAGC CTCATTGGCC
GGCCCGGGGC GCCGGCCTCG GGCTTAAATA GGAGCTCCGG GCTCTGGCTG GGACCCGACC 
GCTGCCGGCC GCGCTCCCGC TGCTCCTGCC GGGTGATGGA AAACCCCAGC CCGGCCGCCG 
CCCTGGGCAA GGCCCTCTGC GCTCTCCTCC TGGCCACTCT CGGCGCCGCC GGCCAGCCTC 
TTGGGGGAGA GTCCATCTGT TCCGCCAGAG CCCCGGCCAA ATACAGCATC ACCTTCACGG 
GCAAGTGGAG CCAGACGGCC TTCCCCAAGC AGTACCCCCT GTTCCGCCCC CCTGCGCAGT 
GGTCTTCGCT GCTGGGGGCC GCGCATAGCT CCGACTACAG CATGTGGAGG AAGAACCAGT 
ACGTCAGTAA CGGGCTGCGC GACTTTGCGG AGCGCGGCGA GGCCTGGGCG CTGATGAAGG



AGATCGAGGC GGCGGGGGAG GCGCTGCAGA GCGTGCACGC GGTGTTTTCG GCGCCCGCCG 660

TCCCCAGCGG CACCGGGCAG ACGTCGGCGG AGCTGGAGGT GCAGCGCAGG CACTCGCTGG 720 

TCTCGTTTGT GGTGCGCATC GTGCCCAGCC CCGACTGGTT CGTGGGCGTG GACAGCCTGG 780 

ACCTGTGCGA CGGGGACCGT TGGCGGGAAC AGGCGGCGCT GGACCTGTAC CCCTACGACG 840 

CCGGGACGGA CAGCGGCTTC ACCTTCTCCT CCCCCAACTT CGCCACCATC CCGCAGGACA 900 

CGGTGACCGA GATAACGTCC TCCTCTCCCA GCCACCCGGC CAACTCCTTC TACTACCCGC 960 

GGCTGAAGGC CCTGCCTCCC ATCGCCAGGG TGACACTGGT GCGGCTGCGA CAGAGCCCCA 1020 

GGGCCTTCAT CCCTCCCGCC CCAGTCCTGC CCAGCAGGGA CAATGAGATT GTAGACAGCG 1080 

CCTCAGTTCC AGAAACGCCG CTGGACTGCG AGGTCTCCCT GTGGTCGTCC TGGGGACTGT 1140 

GCGGAGGCCA CTGTGGGAGG CTCGGGACCA AGAGCAGGAC TOGCTAOGTC CGGGTCCAGC 1200 

CCGCCAACAA CGGGAGCCCC TGCCCCGAGC TCGAAGAAGA GGCTGAGTGC GTCCCTGATA 1260 

ACTGCGTCTA AGACCAGAGC CCCGCAGCCC CTGGGGCCCC CGGAGCCATG GGGTGTCGGG 1320 

GGCTCCTGTG CAGGCTCATG CTGCAGGCGG CCGAGGCACA GGGGGTTTCG CGCTGCTCCT 1380 

GACCGCGGTG AGGCCGCGCC GACCATCTCT GCACTGAAGG GCCCTCTGGT GGCCGGCACG 1440 

GGCATTGGGA AACAGCCTCC TCCTTTCCCA ACCTTGCTTC TTAGGGGCCC CCGTGTCCCG 1500 

TCTGCTCTCA GCCTCCTCCT CCTGCAGGAT AAAGTCATCC CCAAGGCTCC AGCTACTCTA 1560 

AATTATGGTC TCCTTATAAG TTATTGCTGC TCCAGGAGAT TGTCCTTCAT CGTCCAGGGG 1620 

CCTGGCTCCC ACGTGGTTGC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTGAGCCCA 1680 

CTCTCCCGAG GGCGCATCCA AGCGGGGGCC ACTTGAGAAG TGAATAAATG GGGCGGTTTC 1740 



:CATCAT CATAGCACTA 
CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 
TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG
CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 
GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 
GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 
.1(4 AGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 
GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 
GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 
AAGACCCCCC GTCTGGT0K3G TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 
AGCATCCAGT ACGACAAACA GCACGTCTGT CGAGGGAGCA TCCTGGACCC CCACTGGGTC
CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA
GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 
TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 
ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 
GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 
GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 
GCGTACCACG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 
GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 
GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG A 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 




CGTCATATTC CAGCTCTGAA CAGCAACATG GGGTGCAAAG 
CAGATGCTGC GGCGGAAGGT GGTGGACTGT AGCCGGGAGG AGACGCGGCT 
r GGCCCTCGGG GTGGGCAGCA CACTGGGTGC 
2 CCGTGAGAAT GCAGGCCCTG CCATTGTCAT CTCCTTCCTG 
_ GC ctqjqct^q GCGAGTTTGG TGCTCGGGTC 

A CCTCTACAGC TATGTCACCG TTGGAGAGCT CTGGGCCTTC 
ATCACCGGCT GGAACTTAAT CCTCTCCTAC ATCATCGGTA CTTCAAGCGT AGCGAGGGCC 
TGGAGCGCCA CCTTCGACGA GCTGATAGGC AGACCCATCG GGGAGTTCTC ACGGACACAC 
ATGACTCTGA ACGCCCCCGG CGTGCTGGCT GAAAACCCCG ACATATTCGC AGTGATCATA 
ATTCTCATCT TGACAGGACT TTTAACTCTT GGTGTGAAAG AGTCGGCCAT GGTCAACAAA 
ATATTCACTT GTATTAACGT CCTGGTCCTG GGCTTCATAA TGGTGTCAGG ATTTGTGAAA 
GGATCGGTTA AAAACTGGCA GCTCACGGAG GAGGATTTTG GGAACACATC AGGCCGTCTC 
TGTTTGAACA ATGACACAAA AGAAGGGAAG CCCGGTGTTG GTGGATTCAT GCCCTTCGGG 
TTCTCTGGTG TCCTGTCGGG GGCAGCGACT TGCTTCTATG CCTTCGTGGG CTTTGACTGC
ATCGCCACCA CAGGTGAAGA GGTGAAGAAC CCACAGAAGG CCATCCCCGT GGGGATCGTG 
GCGTCCCTCT TGATCTGCTT CATCGCCTAC TTTGGGGTGT CGGCTGCCCT CACGCTCATG 
ATGCCCTACT TCTGCCTGGA CAATAACAGC CCCCTGCCCG ACGCCTTTAA GCACGTGGGC
TGGGAAGGTG CCAAGTACGC AGTGGCCGTG GGCTCCCTCT GCGCTCTTTC CGCCAGTCTT 
CTAGGTTCCA TGTTTCCCAT GCCTCGGGTT ATCTATGCCA TGGCTGAGGA TGGACTGCTA 
TTTAAATTCT TAGCCAACGT CAATGATAGG ACCAAAACAC CAATAATCGC CACATTAGCC 
TCGGGTGCCG TTGCTGCTGT GATGGCCTTC CTCTTTGACC TGAAGGACTT GGTGGACCTC
ATGTCCATTG GCACTCTCCT GGCTTACTCG TTGGTGGCTG CCTGTGTGTT GGTCTTACGG 
TACCAGCCAG AGCAGCCTAA CCTGGTATAC CAGATGGCCA GTACTTCCGA CGAGTTAGAT 
CCAGCAGACC AAAATGAATT GGCAAGCACC AATGATTCCC AGCTGGGGTT TTTACCAGAG 
GCAGAGATGT TCTCTTTGAA AACCATACTC TCACCCAAAA ACATGGAGCC TTCCAAAATC
TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATCAT CACCTTCTGC
3 TGCTTGGAAG GGAGGCTCTC ACCAAAGGGG CGCTGTGGGC AGTCTTTCTG 
I CTGCCCTCCT CTGTGCCGTG GTCACGGGCG TCATCTGGAG GCAGCCCGAG 
AGCAAGACCA AGCTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT C 
G TCTATCTCAT GATGCAGCTG GACCAGGGCA CCTGGG1 

A TAGGCTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAG CGAGGAGGCG 1980 
TCCCTGGATG CCGACCAAGC AAGGACTCCT GACGGCAACT TGGACCAGTG CAAGTGACGC 2040 



CACCCCACCC TCCCCACCAG TGCAACAGAA ACCACCTGCG TCCACACCCT CACTGCA 2157 
Seq ID NO: C35 DNA Sequence 



AGGGCAGAGG CACGTCTGGG TCCCCTCCCT CCTTCCTATC 



GTCCTGGTGG ACCAGAGTTG GGTGCTGACG GCCGCGCACT GCGGAAACAA G 

GCTCGAGTAG GGGATGATCA CCTGCTGCTT CTTCAGGGCG AGCAGCTCCG CCGGACGACT 420

CGCTCTGTTG TCCATCCCAA GTACCACCAG GGCTCAGGCC CCATCCTGCC AAGGCGAACG 480 

GATGAGCACG ATCTCATGTT GCTAAAGCTG GCCAGGCCCG TAGTGCCGGG GCCCCGCGTC 540

CGGGCCCTGC AGCTTCCCTA CCGCTGTGCT CAGCCCGGAG ACCAGTGCCA GGTTGCTGGC 600 

TGGGGCACCA CGGCCGCCCG GAGAGTGAAG TACAACAAGG GCCTGACCTG CTCCAGCATC 660 

ACTATCCTGA GCCCTAAAGA GTGTGAGGTC TTCTACCCTG GCGTGGTCAC CAACAACATG 720 

ATATGTGCTG GACTGGACCG GGGCCAGGAC CCTTGCCAGA GTGACTCTGG AGGCCCCCTG 780 

GTCTGTGACG AGACCCTCCA AGGCATCCTC TCGTGGGGTG TTTACCCCTG TGGCTCTGCC 840 

CAGCATCCAG CTGTCTACAC CCAGATCTGC AAATACATGT CCTGGATCAA TAAAGTCATA 900

CGCTCCAACT GATCCAGATG CTACGCTCCA GCTGATCCAG ATGTTATGCT CCTGCTGATC 960 

CAGATGCCCA GAGGCTCCAT CGTCCATCCT CTTCCTCCCC AGTCGGCTGA ACTCTCCCCT 1020

TGTCTGCACT GTTCAAACCT CTGCCGCCCT CCACACCTCT AAACATCTCC CCTCTCACCT 1080

CATTCCCCCA CCTATCCCCA TTCTCTGCCT GTACTGAAGC TGAAATGCAG GAAGTGGTGG 1140 

CAAAGGTTTA TTCCAGAGAA GCCAGGAAGC CGGTCATCAC CCAGCCTCTG AGAGCAGTTA 1200 

CTGGGGTCAC CCAACCTGAC TTCCTCTGCC ACTCCCCGCT GTGTGACTTT GGGCAAGCCA 1260 

AGTGCCCTCT CTGAACCTCA GTTTCCTCAT CTGCAAAATG GGAACAATGA CGTGCCTACC 1320 

TCTTAGACAT GTTGTGAGGA GACTATGATA TAACATGTGT ATGTAAATCT TCATGTGATT 1380

GTCATGTAAG GCTTAACACA GTGGGTGGTG AGTTCTGACT AAAGGTTACC TGTTGTCGTG 1440



Seq ID NO: C36 DNA Sequence
Nucleic Acid Accession #:
Coding sequence:

i i 1 

ATGACCCGGG CCGCGACAGC CGAGCCGGGC CGCGTGTCTC C 
ACGGCGGGCC TGCCGCGCGC CTTCCTGCAG A 
.GACTGGCAGC GCGC 

GAGCTGCCAA GCGGGGTGCT GGAGGGCCTG AGCCAGCGGC GCGGGCCGCA GCCGGGAGCA 
GCTGTACGCT CCCGCCGAGG CGGCGCAGTG CCCCGCGGGG CCCGAGCGGT TCCAGAGCGC 
TGCGCTGGAA CGGAGACCCG GCGCGGACGC AGGTGCAGCG GCCTGCAGCG CCTTGGAGGC 
GGATTCAGGG GATGCCCGGC GGACCCCTGT GCCCGAGGGG AACACCGGAG GCACACCATC 
ACCAGCGGCG TGGACTGCGG CCTGCTGAAG CAGATGAAGG AGCTGGAGCA GGAGAAGGAG 
GTGCTGCTAC AGGGTTTGGA GATGATGGCG CAGGGCCGCG ATTGGTACCA GCAGCAGCTG 
CAACAAGTGC AGGAGCGCCA GTGCCGCCTG GGCCAGAGCA GAGCCAGCGC CGACTTTGGG 
GCCGTGGGGA GCCCTCGCCC ACTGGGACGG CTACTGCCCA AGGTACAGGA GGTGGCCCGG 
AGCTGCTGGC TGAGGCCTGT GCCGGTCGGG CCCTGCCCAC



CGGCTTCCGG TGCTGACATG G 

CTGCCGCTGT CAGCGGACAG TGGGGGACCC CTCCACGAGT TACCAGATAC CTGGTTTCCA 

GCGGTGCTGC TTTGGGTCCC ATCTCCAGGG AAAAGAACTG CTCACGCCAG GCTGCACTTC 1140 

C TGCGCCCGAG 1200 

St CTTTGAGTCC CACAAAACAA CCTGTGAGCC TGACTCCCTC 1260 

GGAGGGCCCT GTCCCCAGGA GGGGGATCGC AGCTGGAGCC ACCTGGGCGC AGCGTTTGAT 1320 

GTGGCACCTG CAGTGGCCAA AGTGACACCC AACCGTGAGG ACGCTGCAGG GAGTCGGCAC 1380

GGAGACATCT GTCCCCTCTG TCCCAAGGGA CTGTTGACAT TCAGAGACAT AGCTATAGAA 1440 

TTCTCTCTGG CGGAGTGGCA ATGCCTGGAT CATGCTCAGC AGAATTTATA TAGAGATGTG 1500 

ATGTTAGAGA ACTACAGAAA CCTGTTCTCC CTGGGTATGA CTGTCTCTAA GCCAGACTTG 1560 

ATCGCCTGTC TGGAGCAAAA TAAAGAGCCC CAGAATATAA AGAGAAATGA GATGGCAGCC 1620 



TTAAAAAAAT GTTGTAAAAG AGTAGATGAG TGTGAGGTGC ACAAAGGAGG TTATAATGAC 1800 

CTTAACCAAT GTTTGTCAAA TACCCAAAAC AAAATATTTC AGACTCATAA ATGTGTCAAA 1860 

GTCTTCAGTA AATTTTCAAA TTCCAATAGA CACAATGCAA GATATACTGG AAAGAAACAT 1920 

TTGAAATGTA AAAAATATGG CAAATCATTT TGCATGTTTT CACACCTAAA TCAACATCAG 1980

ATAATTCATA CTAAGGAGAA GTCCTACAAA TGTGAAGAAT GTGGCAAATC CTTTAACCAC 2040 

TCCTCAAGCG GTACTACACA TAAAAGAATT CTTACTGGAG AGAAACCCTA CAGATGTGAG 2100 

GAATGTGGCA AAGCCTTTAG GTGGCCCTCA AACCTTACTA GACATAP 






I I I | | | 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 

GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC

GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTG^GG 

CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT

AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 

TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 

GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG



GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 
TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC
AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 
CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 
GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC
TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT




TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT
CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC
AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 
GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TC?TC ^ TGTG 3CTGQ 
GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 

C CCTACGCATG TGAAGAATGT GGCCAAGCCT TTAGGCGCTC CTCAACACTT 2220
ACTAACCACA AGAGAATTCA TACTGGAGAG AGACCCTACA AATGTGAAGA ATGTGGCAAA 2280
GCCTTTAGCG TATCCTCAGC CCTCATTTAC CACAAGAGAA TTCATACTGG AGAGAAACCC 2340
TACACATGTG AAGAATGTGG CAAAGCCTTT AACTGCTCCT CGACTCTTAA GACACATAAG 2400
ATAATTCATA CTGGAGAGAA ACCCTACACA TGTGAAGAAT GTGGCAGAAC CTTTAACTGC 2460
TCCTCAACTG TAAAGGCACA TAAGAGAATT CATACTGGAG AGAAACCATA CAAATGTGAA 2520 
GAATGTGACA AAGCTTTTAA GTGGCATTCA AGTCTTGCTA AACATAAGAT AATTCACACT
GGAGAGAAAC CCTACAAATG CAGTGACAGC AAAGCCTTAG CCAAATCATC AGAAGTGCAA 
AAGGTCTACT CTGGAGATGG GGAAAATGGA ATCCGTGTAC ATAAGAAAAA GGAGACACAG
GGCTGGCTTG TGAGAAACAA GAACGAAAAT AGAACAGGGC TGTTCCAGAT CCGGGCTGCC
GTGAGACCTA ACAGGGACCC TTCATGGGGA CAGCAAGAAG GTTCACTGAC TGACCCAATT 
CAGAGGAAGG AGGAACCTGA CCTTCAAAAT CACTATGACC ATCAGAATGC CTTAGAAGAT 
CAAAGAAATA CTGGAGTGGG TGGACTGTTG ACATTCAGAG ATGTAGTCAT AGAATTCTCT 
CTGGAGGAGT GGCAATGCCT GGATCACGCT CAGCAGAATT TATATAGAGA TGTGATGTTA 
GAGAACTACA GAAACCTGGT CTCCCTGGGT ATTGCTGTCT CTAAGCCAGA CTTGATCACC

TGTCTGGAGC AAAATAAAGA GCCTTGGAAT ATAAAGAGAA ATGAGATGGT AACCAAACAC 
CCAGACCTTC CGCCAGAGCT AGGCATAAAA GATTCACTCC AAAAAGTAAT ACCAAGAAGA 
TATGGAAAAA GTGGACATGA CAATTTACAA GTAAAAACAT GTAAAAGCAT GGGTGAGTGT 

GAGGTGCAAA AAGGAGGTTG TAATGAAGTT AACCAATGTT TGTCAACTAC CCAAAACAAA
ATATTTCAGA CTCATAAATG TGTCAAAGTC TTCGGCAAAT TTTCAAATTC CAATAGACAT

AAGACAAGAC ATACTGGAAA GAAACATTTC AAATGTAAAA AATATGGCAA ATCATTTTGC 
ATGGTTTCAC AACTACATCA ACATCAGATA ATTCATACTA GGGAGAATTC CTACCAATGT 
GAAGAATGCG GCAAACCCTT CAACTGCTCT TCAACCCTTT CTAAACATAA AAGAATTCAT 

ACTGGAGAGA AACCCTACAG ATGTGAGGAA TGTGGCAAAG CTTTTACCTG GTCCTCAACC
CTTACTAAAC ATAGGAGAAT TCATACTGGA GAAAAACCCT ACACATGTGA AGAATGTGGC

CAAGCCTTTA GCCGCTCCTC AACACTTGCT AACCACAAGA GAATTCATAC TGGAGAGAAA 
CCATACACAT GTGAAGAATG TGGCAAAGCC TTTAGCTTAT CCTCATCCCT CACTTACCAC 
AAGAGAATTC ATACTGGAGA GAAACCCTAC ACATGTGAAG AATGTGGCAA AGCCTTTAAC 
TGCTCCTCAA CCCTTAAGAA ACATAAGATA ATTCATACTG GAGAGAAACC CTACAAATGT

AAAGAATGTG GGAAAGCCTT TGCCTTCTCC TCAACTCTTA ATACTCATAA GAGGATTCAT
ACTGGAGAGG AACCCTACAA ATGTGAAGAA TGTGACAAAG CTTTTAAGTG GTCCTCAAGT 
CTTGCTAATC ATAAGAGTAT GCATACTGGA GAGAAACCCT ACAAATGTGA ATAA 



1          11         21         31         41         51

AAGATATAAA AGCTCCAGAA ACGTTGACTG GGACCACTGG AGACACTGAA GAAGGCAGGG
CCCCTTAGAG TCTTGGTTGC CAAACAGATT TGCAGATCAA GGAGAACCCA GGAGTTTCAA
AGAAGCGCTA GTAAGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCAGG GTAGGAGGAA
GATGGCTTCC AGAAGCATGC CGCTGCTCCT ATTGCTGAGC TGCCTGGCCA AAACAGGAGT
CCTGGGTGAT ATCATCATGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACAAGTCCAA

TTGCTATGGT TACTTCAGGA AGCTGAGGAA CTGGTCTGAT GCCGAGCTCG AGTGTCAGTC

TTACGGAAAC GGAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA GCACCATAGC 
AGAGTACATA AGTGGCTATC AGAGAAGCCA GCCGATATGG ATTGGCCTGC ACGACCCACA 
GAAGAGGCAG CAGTGGCAGT GGATTGATGG GGCCATGTAT CTGTACAGAT CCTGGTCTGG 
CAAGTCCATG GGTGGGAACA AGCACTGTGC TGAGATGAGC TCCAATAACA ACTTTTTAAC 

TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTCCTG TGCAAGTACC GACCATAGAG
CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 
GCTAAATCTG CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 
ACTACACTGG CTTTTTTAGG CTTAGAGACA GAAACTTTAG CATTGGCCCA GTAGTGGCTT 
CTAGCTCTAA ATGTTTGCCC CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 
GTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 
CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAAGCTTCT 
AGACCCCTTC AGCTTCTACA CCCTTCTGCC CTCTCTCCAT TGCCTGCACC CCACCCCAGC 
CACTCAACTC CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 
TAGGTTGATG TGGGCCATAC ATTCCTTTAA TAAACCATTG TGTACATAAG AAAAAAAAAA 
Seq ID NO: C38 DNA Sequence
Nucleic Acid Accession #: Eos sequence
GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA
GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 
TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 
GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 
AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 
GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 
AAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA
\ ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 







ACATTCTCCA T 
GTCTTCGTGA GGACCTTGCA G 
TACTACTGGG A 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA Ai 

CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 

ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT OGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGCC CGTGCTGGCC AATGCTCGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATCGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 
TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT G 
AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG C 
TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC T 
: CTCGAAGGAA AGGAAATGAC TAGAGTAGAA T 
: CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG T 
CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC A 
TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC A 



CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280
CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA G 



ft. TGTTTCACAG TACAGGATCT G 
ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT O
TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 
ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 
GTTAGATGTA TAGAGTGTTC GTATGTAAAC ATTTCTTGTA G3CATCACCA TGAACAAAGA
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 
A AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA

Seq ID NO: C39 DNA Sequence
Nucleic Acid Accession #: NM_014373
Coding sequence: 322..1338
GTGGCCTCGA GGTGGTGGCA



ATGTCTCCGA GCTTACTCAC ATAGCATATT 
AAATAACATA ATTGAAGGCA GTAAAAGTGA 
GACCCACTGG AGAGGACAGA AAATGAAGCA 



\ GCCCCTAGAC GTTAACTATC TGCTATTCTT GATCATACTT 420 

GGGAAAATAT TATTAAATAT CCTTACACTA GGAATGAGAA GAAAAAACAC CTGTCAAAAT 480 

TTTATGGAAT ATTTTTGCAT TTCACTAGCA TTCGTTGATC TTTTACTTTT GGTAAACATT 540

TCCATTATAT TGTATTTCAG GGATTTTGTA CTTTTAAGCA TTAGGTTCAC TAAATACCAC 600 

T TTACTCAAAT TATTTCCTTT ACTTATGGCT TTTTGCATTA TCCAGTTTTC 660 

T GTATAGATTA TTGCCTGAAT TTCTCTAAAA CAACCAAGCT TTCATTTAAG 720 

TGTCAAAAAT TATTTTATTT CTTTACAGTA ATTTTAATTT GGATTTCAGT CCTTGCTTAT 780 

r CTACCAAAGC CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

C TGGCTGTCAT TTTTCATGGT GATGATTTTA 900 

2 TTGGGAAGAA GTTACTACTT TGGTACAGGC TATCAGGATA 960 

ACTTCCTATA TGAATGAAAC TATCTTATAT TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

GTGAGATCTA AAAAAATATT CTTATCCAAG CTCATTGTCT GTTTTCTCAG TACCTGGTTA 1080

: TACTTCAGGT AATCATTGTT TTACTTAAAG TTCAGATTCC AGCATATATT 1140 

A TTCCCTGGTT ATACTTTGTC AATAGTTTTC TCATTGCTAC AGTGTATTGG 1200 

TTTAATTGTC ACAAGCTTAA TTTAAAAGAC ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260 

TGGAAGTGCT GCTTCATTCC ACTTACAATT CCTAATCTTG AGCAAATTGA AAAGCCTATA 1320 

TCAATAATGA TTTGTTAATA TTATTAATTA AAAGTTACAG CTGTCATAAG ATCATAATTT 1380 

TATGAACAGA AAGAACTCAG GACATATTAA AAAATAAACT GAACTAAAAC AACTTTTGCC 1440

CCCTGACTGA TAGCATTTCA GAATGTGTCT TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500 

GTTTTATTTT AAAAACAAAA TAATTCCAAG AAGTTTTTAT AGTTATTCAG GGACACTATA 1560 

TTACAAATAT TACTTTGTTA TTAACACAAA AAGTGATAAG AGTTAACATT TGGCTATACT 1620

GATGTTTGTG TTACTCAAAA AAACTACTGG ATGCAAACTG TTATGTAAAT CTGAGATTTC 1680 

ACTGACAACT TTAAGATATC AACCTAAACA TTTTTATTAA ATGTTCAAAT GTAAGCAAGA 1740 



Seq ID NO: C40 DNA Sequence 
Nucleic Acid Accession #: BC012089 
Coding sequence: 1..2571



ATGGCCCTCG TACTCGGCTC CCTGTTGC 

GGGCAGCCTT CATCCACAGA TGCTCCTAAG GCTTGGAATT ATGAATTGCC T 
TATGAGACCC AAGACTCCCA TAAAGCTGGA CCCATTGGCA TTCTCTTTGA ACTAGTGCAT 




^ AATTGATTAT GACAAGATTG T 
r GTGTCCTGGG GCTGCTGTTT ATTATTCTGA T 
TTTTGTATGT GTCGTTGCTG TAACAAATGT G 

AATGGGI CT TCCI AGGAA ATGCTTTGCA ATCTCCCTGT TGGTGATTTG TATAATAATA 
AGCATTGGCA TCTTCTATGG TTTTGTGGCA AATCACCAGG TAAGAACCCG G 
AGTCGGAAAC TGGCAGATAG CAATTTCAAG GACTTGCGAA C
GAGCAAATCA AATATATATT GGCCCAGTAC A 

CTGAACAGTA TCAATTCAGT GCTAGGAGGC GGAATTCTTG ACCGACTGAG ACCCAACATC 
ATCCCTGTTC TTGATGAGAT TAAGTCCATG GCAACAGCGA TCAAGGAGAC CAAAGAGGCG 
TTGGAGAACA TGAACAGCAC CTTGAAGAGC TTGCACCAAC AAAGTACACA GCTTAGCAGC 
AGTCTGACCA GCGTGAAAAC TAGCCTGCGG T 
CATCCATCAA GTGAAACCTG CAACAGCATC AGATTGTCTC T
CCTGAACTGA GGCAGCTTCC A 

T TGGATGGCCT GGTCCAACAG GGCTATCAAT CCCTTAATGA TATACCTGAC 
C GCCAAACCAC GACTGTCGTA GCAGGTATCA AAAGGGTCTT GAATTCCATT 1140 
GGTTCAGATA TCGACAATGT AACTCAGCGT CTTCCTATTC AGGATATACT CTCAGCATTC 1200 
: TGAAAGTTAC ATCCACAGAA ATTTACCTAC ATTGGAAGAG 1260 
T ACTGGTGGCT GGGTGGCCTG GTCATCTGCT CTCTGCTGAC CCTCATCGTG 1320 
ATTTTTTACT ACCTGGGCTT ACTGTGTGGC GTGTGCGGCT ATGACAGGCA TGCCACCCCG 1380 
ACCACCCGAG GCTGTGTCTC CAACACCGGA GGCGTCTTCC TCATGGTTGG AGTTGGATTA 1440 
T TTTGCTGGAT ATTGATGATC ATTGTGGTTC TTACCTTTGT CTTTGGTGCA 1500 
A AACTGATCTG TGAACCTTAC ACGAGCAAGG AATTATTCCG GGTTTTGGAT 1560 
ACACCCTACT TACTAAATGA AGACTGGGAA TACTATCTCT CTGGGAAGCT ATTTAATAAA 1620
TCAAAAATGA AGCTCACTTT TGAACAAGTT TACAGTGACT GCAAAAAAAA TAGAGGCACT 1680 
TACGGCACTC TTCACCTGCA GAACAGCTTC AATATCAGTG AACATCTCAA CATTAATGAG 1740 
CATACTGGAA GCATAAGCAG TGAATTGGAA AGTCTGAAGG TAAATCTTRA TATCTTTCTG 1800 
TTGGGTGCAG CAGGAAGAAA AAACCTTCAG GATTTTGCTG CTTGTGGAAT AGACAGAATG 1860 
AATTATGACA GCTACTTGGC TCAGACTGGT AAATCCCCCG CAGGAGTGAA TCTTTTATCA 1920 
TTTGCATATG ATCTAGAAGC AAAAGCAAAC AGTTTGCCCC CAGGAAATTT GAGGAACTCC 1980 
CTGAAAAGAG ATGCACAAAC TATTAAAACA ATTCACCAGC AACGAGTCCT TCCTATAGAA 2040 
CAATCACTGA GCACTCTATA CCAAAGCGTC AAGATACTTC AACGCACAGG GAATGGATTG 2100 
TTGGAGAGAG TAACTAGGAT TCTAGCTTCT CTGGATTTTG CTCAGAACTT CATCACAAAC 2160 
AATACTTCCT CTGTTATTAT TGAGGAAACT AAGAAGTATG GGAGAACAAT AATAGGATAT 2220 
TTTGAACATT ATCTGCAGTG GATCGAGTTC TCTATCAGTG AGAAAGTGGC ATCGTGCAAA 2280 
CCTGTGGCCA CCGCTCTAGA TACTGCTGTT GATGTCTTTC TGTGTAGCTA CATTATCGAC 2340 
CCCTTGAATT TGTTTTGGTT TGGCATAGGA AAAGCTACTG TATTTTTACT TCCGGCTCTA 2400 
ATTTTTGCGG TAAAACTGGC TAAGTACTAT CGTCGAATGG ATTCGGAGGA CGTGTACGAT 2460 
GATGTTGAAA CTATACCCAT GAAAAATATG GAAAATGGTA ATAATGGTTA TCATAAAGAT 2520 
CATGTATATG GTATTCACAA TCCTGTTATG ACAAGCCCAT CACAACATTG A 2571 

Seq ID NO: C41 DNA Sequence
CCACGCGTCC GAGCAAGAAC AGCTAAAATG A

CTCCTTTCTG TAAACACAGC CACCAACCAA GGCAACTCAG CTGATGCTG1 A
GAAACTGCGA CTAGTGGTCC TACAGTAGCT G 

A GCACCACAGC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC 
3 TTCCTCCACA ATTCCTACAC C 



T TAGCTACCTC TGACATAATC ACCGCTTCAT CTCCAAATGA TGGATTAATC 
ACAATGGTTC CTTCTGAAAC ACAAAGTAAC AATGAAATGT CCCCCACCAC AGAAGACAAT 
CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 

GGTCCCAGCA ATCCTTGCCA AGATGATCCC TGTGCAGATA ATTCGTTATG TGTTAAGCTG

CATAATACAA GTTTTTGCCT GTGTTTAGAA GGGTATTACT ACAACTCTTC TACATGTAAG 
AAAGGAAAGG TATTCCCTGG GAAGATTTCA GTGACAGTAT CAGAAACATT TGACCCAGAA 
GAGAAACATT CCATGGCCTA TCAAGACTTG CATAGTGAAA TTACTAGCTT GTTTAAAGAT 
GTATTTGGCA CATCTGTTTA TGGACAGACT GTAATTCTTA CTGTAAGCAC ATCTCTGTCA 

CCAAGATCTG AAATGCGTGC TGATGACAAG TTTGTTAATG TAACAATAGT AACAATTTTG
GCAGAAACCA CAAGTGACAA TGAGAAGACT GTGACTGAGA AAATTAATAA AGCAATTAGA 
AGTAGCTCAA GCAACTTTCT AAACTATGAT TTGACCCTTC GGTGTGATTA TTATGGCTGT 
AACCAGACTG CGGATGACTG CCTCAATGGT TTAGCATGCG ATTGCAAATC TGACCTGCAA 
AGGCCTAACC CACAGAGCCC TTTCTGCGTT GCTTCCAGTC TCAAGTGTCC TGATGCCTGC 

AACGCACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCCCTGA GTGTGCGTGC
GTGCCCGGCT ACCAGGAAGA TGCTAATGGG AACTGCCAAA AGTGTGCATT TGGCTACAGT 
GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGGC 
ATTGTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACG 
AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 

ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC
TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAGACACA GCAGCATGCC CCGCCCTGAC 
TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 
TTAGAGTGTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 
TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AGTTGTGAAT GTTTGCAACG 

AATCCAGCTC ACTTGCTAAA TAAGAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGCG
CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG 
TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 
CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 
GGAGACATAC GCATCAACTG TGGAGGTCCG AGGGGATGAG AAGGGATACC CACCACCTTT

CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA
ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA
AGACTGGAAA GAGACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC
TTTTTGATGC TAATTAGAAG GCCTGGACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 
TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 

AGAGAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC



r CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 2760 
G CCTTGTCCTC TTAAAAGTTA TTTTATTTGT TATTATTATT TGTTCTTGAC 2820 
TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA A 



I I I I I I 

TCACTTGCCT GATATTTCCA GTGTCAGAGG GACACAGCCA ACGTGGGGTC CCTTCTAGGC
TGACAGCCGC TCTCCAGCCA CTGCCGCGAG CCCGTCTGCT CCCGCCCTGC CCGTGCACTC 
TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG CCCGCTCCCA TCGCCGATGA CCGCGGGGAG 
GAGGATGGAG ATGCTCTGTG CCGGCAGGGT CCCTGCGCTG CTGCTCTGCC TGGGTTTCCA 
TCTTCTACAG GCAGTCCTCA GTACAACTGT GATTCCATCA TGTATCCCAG GAGAGTCCAG 
TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA CGTGTGGCTC AAGTGTCAAT 
AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT GGACAGTGCA TCTATCTGGT 
GGACATGAGT CAAAACTACT GCAGGTGTGA AGTGGGTTAT ACTGGTGTCC GATGTGAACA
CTTCTTTTTA ACCGTCCACC AACCTTTAAG CAAAGAGTAT GTGGCTTTGA CCGTGATTCT 
TATTATTTTG TTTCTTATCA CAGTCGTCGG TTCCACATAT TATTTCTGCA GATGGTACAG 
AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA GTTACCTCAG GGGATCCAGA
GTTGCCGCAA GTCTGAATAT GAGAGAGTTA CCTCAGGGGA TCCAGAGTTG CCGCAAGTCT 
GAATGGCGCC ATCAAACTTA TGGGCAGGGA TAACAGTGTG CCTGGTTAAT ATTAATATTC 
CATTTTATTA ATAATATTTA TGTTGGGTCA AGTGTTAGGT CAATAACACT GTATTTTAAT 
GTACTTGAAA AATGTTTTTA TTTTTGTTTT ATTTTTGACA GACTATTTGC TAATGTATAA 
TGTGCAGAAA ATATTTAATA TCAAAAGAAA ATTGATATTT TTATACAAGT AATTTCCTGA 
GCTAAATGCT TCATTGAAAG CTTCAAAGTT TATATGCCTG GTGCACAGTG CTTAGAAGTA
AGCAATTCCC AGGTCATAGC TCAAGAATTG TTAGCAAATG ACAGATTTCT GTAAGCCTAT 
ATATATAGTC AAATCGATTT AGTAAGTATG TTTTTTATGT TCCTCAAATC AGTGATAATT 
GGTTTGACTG TACCATGGTT TGATATGTAG TTGGCACCAT GGTATCATAT ATTAAAACAA 
TAATGCAATT AGAATTTGGG AGAAGCAAAT ATAGGTCCTG TGTTAAACAC TACACATTTG 
AAACAAGCTA ACCCTGGGGA GTCTATGGTC TCTTCACTCA GGTCTCAGCT ATAATTCTGT 
TATATGAGGG GCAGTGGACA GTTCCCTATG CCAACTCACG ACTCCTACAG GTACTAGTCA
CTCATCTACC AGATTCTGCC TATGTAAAAT GAATTGAAAA ACAATTTTCT GTAATCTTTT
ATTTAAGTAG TGGGCATTTC ATAGCTTCAC AATGTTCCTT TTTTGTATAT TACAACATTT 
ATGTGAGGTA ATTATTGCTC AACAGACAAT TAGAAAAAAG TCCACACTTG AAGCCTAAAT 
TTGTGCTTTT TAAGAATATT TTTAGACTAT TTCTTTTTAT AGGGGCTTTG CTGAATTCTA 
ACATTAAATC ACAGCCCAAA ATTTGATGGA CTAATTATTA TTTTAAAATA TATGAAGACA 
ATAATTCTAC ATGTTGTCTT AAGATGGAAA TACAGTTATT TCATCTTTTA TTCAAGGAAG 
TTTTAACTTT AATACAGCTC AGTAAATGGC TTCTTCTAGA A 
AGTTGTATCT TGACACAGGA AATGGGAAAA A
AAATGAAAAA TCTCAATTGA AAGCTTTTAA AATGTAGAAA C 
GAGGCTGAGA TGAAAACTAG GGCTCATTTT CCTGACATTT GTTTATTTTT TGGAAGAGAC 
AAAGATTTCT TCTGCACTCT GAGCCCATAG GTCTCAGAGA GTTAATAGGA GTATTTTTGG 
Z TGCTGCCACC ACTTTTGGAT TTTATGGGAG GCTCCTTCAT 
A ACCTTTGAGT AGAGTCTCCC TGGATCACAT ACCAGGTCAG GGAGGATCTG 
T ACGTTTATCC TGGCATGTGC TAGGGTAAAC GAAGGCATAA TAAGCCATGG 
CTGACCTCTG GAGCACCAGG TGCCAGGACT T 

Z TGGCCCTTAC TATTAGGAAA 




TACAAACATG T. 
TGCTAACCAT T 

AGTTATTTCT GTTTCCTATT TAATATAGCT GAAGTCAAAA TATGTAAGAA CACATTTTAA 

ATACTCTACT TACAGTTGGC CCTCTGTGGT TAGTTCCACA TCTGTGGATT CAACCAACCA 2640

A ATGCTTAAAA AATAATACAA CAACAACAAA AAATACATTA TAACAACTAT 2700 

T TTTTTTCTTT TTGAGATGGA GTCTCGCTCT GTTGCCCAGG TTGGAGTGCA 2760 

• " 1G,CTCAC TGCAACCTCA CCTCCCGGGT TCAAGAGATC CTCCTGCCTC 2820

AGCCTCCTGA GCAGCTGGGA CTACAGGCGC ATGCCACCAT GCCCAGCTAA TTTTTGTATT 2880

TTTAGTAGAG GCGGGGTTTC ACCATGTTGG CCAGGATGGT CTCAATCTCC TAACCTTGAG 2940

ATCCACCCTC CACAGCCTCC CAAACTGCTG GGATTACAGG CGTGAGCCAC CGCACGTAGC 3000 

ATTTACATTA GGTATTACAA GTAATGTAAA GATGATTTAA GTATACAGGA GGATGTGAAT 3060 

AGGTTATATG CAAGCACTAT GCCCTTTTAT ATAAGTGACT TGAACATCTG TGCCCGATTT 3120 

TAGTATGTGC AGGGGGGCGA TCTGGGAATC AGTCCCCTGT GGATACCAAG GTACAACTGT 3180 

ATTTATTAAC GCTTACTAGA TGTGAGGAGA GTCTGAATAT TTTCAGTGAT CTTGGCTGTT 3240 

TCAAAAAAAT CTATTGACTT TTCAATAAAT CAGCTGCAAT CCATTTATTT CATTTACAAA 3300

AGATTTATTG TAAGCCTCTC AATCTTGGTT TTTCAGTTGA TCTTAAGCAT GTCAATTCAT 3360 

AAAAACAAGT CATTTTTGTA TTTTTCATCT TTAAGAATGC TTAAAAAAGC TAATCCCTAA 3420 

AATAGTTAGA TCTTTGTAAA TGCATATTAA ATAATAAAGT ATGACCCACA TTACTTTTTA 3480 

TGGGTGAAAA TAAGACAAAA ATAATAGTTT TAGTGAGGAT GGTGCTGAGT AAACATAAAA 3540

ACTGATTTGC TCTCAGCTGA TGTGTCCTGT ACACAGTGGG AAGATTTTAG TTCACACTTA 3600 

GTCTAACTCC CCCATTTTAC AGATTTCTCA CTATATATAT TTCTAGAAGG GGCTATGCAT 3660 

ATTCAATGTA TTGAGAACCA AAGCAACCAC AAATGCATAA ATGCATAATT TATGGTCTTC 3720 

AACCAAGGCC ACATAATAAC CCAGTTAACT TACTCTTTAA CCAGGAATAT TAAGTTCTAT 3780 

AACTAGTACT CAAGGTTTAA CCTTAAAATT AAGATTTCCT TAACCTTAAC CTTAAAATTG 3840

ATATTATATT AAACATACAT AATACAATGT AACTCCACTG TTCTCCTGAA TATTTTTTGC 3900 

TCTAATCTCT CTGCCGAAAG TCAAAGTGAT GGGAGAATTG GTATACTGGT ATGACTACGT 3960 

CTTAAGTCAG ATTTTTATTT ATGAGTCTTT GAGACTAAAT TCAATCACCA CCAGGTATCA 4020 

AATCAACTTT TATGCAGCAA ATATATGATT CTAGTGTCTG ACTTTTGTTA AATTCAGTAA 4080 

TGCAGTTTTT AAAAACCTGT ATCTGACCCA CTTTGTAATT TTTGCTCCAA TATCCATTCT 4140 

GTAGACTTTT GAAAAAAAAG TTTTTAATTT GATGCCCAAT ATATTCTGAC CGTTAAAAAA 4200 

TTCTTGTTCA TATGGGAGAA GGGGGAGTAA TGACTTGTAC AAACAGTATT TCTGGTGTAT 4260

ATTTTAATGT TTTTAAAAAG AGTAATTTCA TTTAAATATC TGTTATTCAA ATTTGATGAT 4320 

GTTAAATGTA ATATAATGTA TTTTCTTTTT ATTTTGCACT CTGTAATTGC ACTTTTTAAG 4380 

TTTGAAGAGC CATTTTGGTA AACGGTTTTT ATTAAAGATG CTATGGAACA TAAAGTTGTA 4440 

TTGCATGCAA TTTAAAGTAA CTTATTTGAC TATGAATATT ATCGGATTAC TGAATTGTAT 4500 

CAATTTGTTT GTGTTCAATA TCAGCTTTGA TAATTGTGTA CCTTAAGATA TTGAAGGAGA 4560 
AAATAGATAA TTTACAAGAT ATTATTAATT TTTATTTATT TTTCTTGGGA A 
ATTGAAATAA ATAAAAATGC ATTGAACATC TTGCATTCAA AATCTTCACT GAC 

Seq ID NO: C43 DNA Sequence 
Nucleic Acid Accession #: AF011468.1 
Coding sequence: 257..1468

i r r i 1 i 1 r 

GGAAGACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC 0~ 
ATCTCAGTGG CGGACGAGGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 
CAAGTCCCCT GTCGGTTCCT CCGTCCCTGA GTGTCCTTGG CGCTGCCTTG TGCCCGCCCA
GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTGTA GGATACTGCT TGTTACTTAT 
TACAGCTAGA GGCATCATGG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC
TACAGCTCCA GTTGGAGGTC CAAAACGTGT TCTCGTGACT CAGCAAATTC CTTGTCAGAA 
TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 
GCGCGTTCCT TTGCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 
GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 
CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 
ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 
TGGTCGCCCT CTGGGTAAAG GAAAGTTTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 
CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTGGAGAAAG CCGGAGTGGA
GCATCAGCTC AGAAGAGAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 
ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 
TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAGA GAACTGCTAC 
TTATATAACA GAATTGGCAA ATGCCCTGTC TTACTGTCAT TCGAAGAGAG TTATTCATAG 
AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 
TGGGTGGTCA GTACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGGCA CCCTCGACTA 
CCTGCCCCCT G
TAACAGAGGG
GCATAATCCC AGCCAGAGGC CAATGCTCAG 
ATTGCCAAAA 






GGCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT A 
CCTCAATCTA GAACGCTACA CAAGAAATAT TTGTTT1 

C ATCAATAAAC ATGACACTCT G 
A CTTATACTGG TTCATAATCT GGAGGCAAGG TTCGACTGCA GCCGCCCCGT 

T GTCTTCACAG GAGGCAf 

AGTGACCACT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC T 

G GAATGAGTAT 

A AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 
G ATTGGGTTTC 
A AAAATAGGAA CACGTGCTCT 
T AGGGATTTGC TTGGGATACA GAAGAGGCCA TGTGTCTCAG AGCTGTTAAG 
GGCTTATTTT TITAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATGCAAATA 
AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA

Seq ID NO: C44 DNA Sequence 
Nucleic Acid Accession #: NM_013372 
Coding sequence: 63.. 617 



GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 
GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 
CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 
AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 
GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG
CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 
AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 
GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 
CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 
AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 
CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 
AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 
AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 
ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 
CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG



•JiGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 
GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 
CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 
ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 
TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 
TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 
GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 
TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 
ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 
ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 
, Qjy^Q^^ GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 
. GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 
: TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGAT CCT 
I AGTGTACCTG ACAGTAAGT C TAAAGATGAR AGAGTTTAGG GACTACTCTG 
TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 
RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 
GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 
AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 
CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 
TCTGATTAAA CTTGGCCTAC TGGCAATGGC TAC1TAGGAT TGATCTAAGG GCCAAAGTGC 2160
AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 
TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 
AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 
TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TIATTTCGTT TTTGTTTTGA 2400 
TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460
ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520
CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580
CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 
ACACAAATGC TAAGGCAGAA TITTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 
ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760
TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820
AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880
CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940
TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 
GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 
TTCTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 
TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 
TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 
GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 
TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 
ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 
TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 
TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 
GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTI AACATTTCTA 3600
I TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660

A GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT 

TAAATTAAAC 



ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG C 
ATACTTTCTT TCCAAACAGC ATAAGAAGTG AT TGAGCCAC A 
TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 
GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 
CACTGACGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 
CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 
GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 
TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA CCACAGACTG
CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACAOCT GGTTTCCTCC 
CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA
ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 
CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 
CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 
TTTTGAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCIG GGTATGAAGT 
TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 
TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 
AGCCCAGTGT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 
CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT
CATCAGGGAG ACTTGTGTGC TCTCTCTGCT TGAAGAACTG AACAAGAATT TCAGTATGAT 
TGTAGGCAAT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 
TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGGCTTCG GTGGTGT03A TTCTGAGCAA 
TATTTCATCT CTGTCACTGG CCAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGATGT 
CATCACTATA GCTGACAATA TCCTTAATTC AGCCTCAGTA ACCAACTGGA CAGTCTTACT 
GCGGGAAGAA AAGTATGCCA GCTCACGGTT ACTAGAGACA T 

C TCAAAAGGGG TTACAGCTAT CAGATTAAAA TGTGTCCCCA 
AAATACATCT ATTCCCATCA GAGGCCGTGT GTTAATTGGG TCAGACCAAT TCCAGAGATC 
CCTTCCAGAA ACTATTATCA GCATGGCCTC GTTGACTCTG GGGAACATTC TACCCGTTTC 
CAAAAATGGA AATGCTCAGG TCAATGGACC TGTGATATCC ACGGTTATTC AAAACTATTC 
TTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 
TTTTGG GATTTCAGTC A 




CCAGAATCTG GCTTGGCATG TTATTTTTGC TTTACTCAAT GCATTCCAGG GATTTTTTAT 2700 

CTTATGCTTT GGAATACTCT TGGACAGTAA GCTGCGACAA CTTCTGTTCA ACAAGTTGTC 2760

TGCCTTAAGT TCTTGGAAGC AAACAGAAAA GCAAAACTCA TCAGATTTAT CTGCCAAACC 2820 

CAAATTCTCA AAGCCTTTCA ACCCACTGCA AAACAAAGGC CATTATGCAT TTTCTCATAC 2880

TGGAGATTCC TCCGACAACA TCATGCTAAC TCAGTTTGTC TCAAATGAAT AAGGCAAGGA 2940

ATCATAAAAT CAAGAAAAAA TTTCCAGAAC AACTTGACAT TTAGAGACAA ATGTCAATGA 3000

AGAAATTATG CTCAGTATTC GATCGGGTTT TCTGATTTAG GGGTCTGGGA ATAAAACAAG 3060 

AATGTCTCAG TGGCTTCA 3078

Seq ID NO: C46 DNA Sequence 

Nucleic Acid Accession #: NM_000584.1

Coding sequence: 75.. 374 

1 11 21 31 41 51 

| | | | | |

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60 

CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120 

CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180

GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240 

AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300 

TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360 

CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420

GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540 

CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600
CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA
TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 
GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 
TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 
CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTC1CCTTTA 
TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG TGAGGACATG 
TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 
CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT GTGCAAGAAT TTGGAAAAAT 
AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA
GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 
ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 
AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 
AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 
AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTA 

A TATTATTTAA A 



ATGAAACATT 



1 11 21 31 41 51

| | | | | |

3 AAAGAGACT C AGAAACGACA TTTGACGAGG ATTCTCAGCC TAATGACGAA 
GTGGTTCCCT ACAGTGATGA TGAAACAGAA GATGAACTTG ATGACCAGGG GTCTGCTGTT 
GAACCAGAAC AAAACCGAGT CAACAGGGAA GCAGAGGAGA ACCGGGAGCC ATTCAGAAAA 
GAATGTACAT GGCAAGTCAA AGCAAACGAT CGCAAGTACC ACGAACAACC TCACTTTATG 
AACACAAAAT TCTTGTGTAT TAAGGAGAGT AAATATGCGA ATAATGCAAT TAAAACATAC 
AAGTACAACG CATTTACCTT TATACCAATG AATCTGTTTG AGCAGTTTAA GAGAGCAGCC 
AATTTATATT TCCTGGCTCT TCTTATCTTA CAGGCAGTTC CTCAAATCTC TACCCTGGCT 
TGGTACACCA CACTAGTGCC CCTGCTTGTG GTGCTGGGCG TCACTGCAAT CAAAGACCTG 
GTGGACGATG TGGCTCGCCA TAAAATGGAT AAGGAAATCA ACAATAGGAC GTGTGAAGTC 
ATTAAGGATC GCAGGTTCAA AGTTGCTAAG TGGAAAGAAA TTCAAGTTGG AGACGTCATT 
CGTCTGAAAA AAAATGATTT TGTTCCAGCT GACATTCTCC TGCTGTCTAG CTCTGAGCCT
AGCCTI i \1 TUjfl AACAGCAGAA CTGGACGGAG AAACCAATTT AAAATTTAAG 

ATGTCACTTG AAATCACAGA CCAGTACCTC CAAAGAGAAG ATACATTGGC TACATTTGAT 
GGTTTTATTG AATGTGAAGA ACCCAATAAC CGACTAGATA AGTTTACAGG AACACTATTT 
TGGAGAAACA CAAGTTTTCC TTTGGATGCT GATAAAATTT TGTTACGTGG CTGTGTAATT 
AGGAACACCG ATTTCTGCCA CCGCTTACTC ATTTTTGCAG GTGCTGACAC T 
AAGAATAGTG G 

GTTTACACGA TCTTTGTTGT TCTTATTCTG CTTTCTGCTG G 
TATTGGGAAG CACAGGTGGG CAATTCCTCT TGGTACCTCT A 
CCCTCCTACC GTGGATTCCT CATTTTCTGG G 

CCCATCTCTC TCTATGTCAG CGTGGAAGTG ATTCGTCTTG GACAGAGTCA CTTCATCAAC 
TGGGACCTGC AAATGTACTA TGCTGAGAAG GACACACCCG CAAAAGCTAG AACCACCACA 
CTCAATGAAC AGCTCGGGCA GATCCATTAT ATCTTCTCTG ATAAGACGGG GACACTCACA 
CAAAATATCA TGACCTTTAA AAAGTGCTGT ATCAACGGGC AGATATATGG GGACCATCGG 
GATGCCTCTC AACACAACCA CAACAAAATA GAGCAAGTTG ATTTTAG 
GCTGATGGGA AGCTTGCATT TTATGACCAC T 
GAGCCAGAAG TACGACAGTT CTTCTTCTTG C
AGGACTGATG GTCAGCTCAA CTACCAGGCA G 

A ACTTTGGCTT 
GAACTGGGCA C 

AAGCGAATGT CTATCATTGT AAGAACCCCA GAAGGCAATA TCAAGCTTTA C 

GCTGACACTG TTATTTATGA ACGGTTACAT CGAATGAATC CTACTAAGCA AGAAACACAG 
GATGCCCTGG ATATCTTTGC AAATGAAACT CTTAGAACCC TATGCCTTTG CTACAAGGAA 1980 
ATTGAAGAAA AAGAATTTAC AGAATGGAAT AAAAAGTTTA TGGCTGCCAG TGTGGCCTCC 2040 
ACCAACCGGG ACGAAGCTCT GGATAAAGTA TATGAGGAGA TTGAAAAAGA CTTAATTCTC 2100 
CTGGGAGCTA CAGCTATTGA AGACAAGCTA CAGGATGGAG TTCCAGAAAC CATTTCAAAA 2160 
CTTGCAAAAG CTGACATTAA GATCTGGGTG CTTACTGGAG ACAAAAAGGA AACTGCTGAA 2220 
AATATAGGAT TTGCTTGTGA ACTTCTGACT GAAGACACCA CCATCTGCTA TGGGGAGGAT 2280 
ATTAATTCTC TTCTTCATGC AAGGATGGAA AACCAGAGGA ATAGAGGTGG CGTCTACGCA 2340
AAGTTTGCAC CTCCTGTGCA GGAATCTTTT TTTCCACCCG GTGGAAACCG TGCCTTAATC 2400 
ATCACTGGTT CTTGGTTGAA TGAAATTCTT CTCGAGAAAA AGACCAAGAG AAATAAGATT 2460 
CTGAAGCTGA AGTTCCCAAG AACAGAAGAA GAAAGACGGA TGCGGACCCA AAGTAAAAGG 2520 
AGGCTAGAAG CTAAGAAAGA GCAGCGGCAG AAAAACTTTG TGGACCTGGC CTGCGAGTGC 2580 
AGCGCAGTCA TCTGCTGCCG CGTCACCCCC AAGCAGAAGG CCATGGTGGT GGACCTGGTG 2640 
AAGAGGTACA AGAAAGCCAT CACGCTGGCC ATCGGAGATG GGGCCAATGA CGTGAACATG 2700 
ATCAAAACTG CCCACATTGG CGTTGGAATA AGTGGACAAG AAGGAATl C ( rCATG 2760 
TCGAGTGACT ATTCCTTTGC TCAGTTCCGA TATCTGCAGA GGCTAC7GCT GGTGCATGGC 2820
CGATGGTCTT ACATAAGGAT GTGCAAGTTC CTACGATACT TCTTTTACAA AAACTTTGCC 2880 
TTTACTTTGG TTCATTTCTG GTACTCCTTC TTCAATGGCT ACTCTGCGCA GACTGCATAC 2940 
GAGGATTGGT TCATCACCCT CTACAACGTG CTGTACACCA GCCTGCCCGT GCTCCTCATG 3000 
GGGCTGCTCG ACCAGGATGT GAGTGACAAA CTGAGCCTCC GATTCCCTGG GTTATACATA 3060 
GTGGGACAAA GAGACTTACT ATTCAACTAT AAGAGATTCT TTGTAAGCTT GTTGCATGGG 3120 
GTCCTAACAT CGATGATCCT CTTCTTCATA CCTCTTGGAG CTTATCTGCA AACCGTAGGG 3180
CAGGATGGAG AGGCACCTTC CGACTACCAG TCTTTTGCCG TCACCATTGC CTCTGCTCTT 3240
GTAATAACAG TCAATTTCCA GATTGGCTTG GATACTTCTT ATTGGACTTT TGTGAATGCT 3300 
TTTTCAATTT TTGGAAGCAT TGCACTTTAT TTTGGCATCA TGTTTGACTT TCATAGTGCT 3360 
GGAATACATG TTCTCTTTCC ATCTGCATTT CAATTTACAG GCACAGCTTC AAACGCTCTG 3420 
AGACAGCCAT ACATTTGGTT AACTATCATC CTGACTGTTG CTGTGTGCTT ACTACCCGTC 3480
GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAGTGATAA GATCCAGAAG

CATCGCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG

GGCGTGTCAA CGCGGCGCTC GGCCTACGCC TTCTCGCACC AGCGGG3CTA CGCGGACCIC 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCCGC TTGATGCCAT CGTGGCGGAT

3 AGTACAGGCG CACCGGGGAC AGCTGA 

Seq ID NO: C48 DNA Sequence



| | | | | |

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 
AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 
GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 
AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 
CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 
CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 
TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 
TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 
CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 
GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 
TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 
AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC
AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC
AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 
CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 
AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 
GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 
TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 
ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 
TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 
CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 
GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 
ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 
CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 
AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 
TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 
CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 
GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 
CGG ■ ACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 
CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 
CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC
L i AGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT
TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 
CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 
TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 
GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 
GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280
CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 
GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400
TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460
AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 
CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580
CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 
GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 
CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760
TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820
TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880
GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940
ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000
TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060
GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 
G GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 



CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 

ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 3766 



GGCACGAGCG AGTTCCTGTC TCTCTGCCAA CGCCGCCCGG ATGGCTTCCC AAAACCGCGA
CCCAGCCGCC ACTAGCGTCG CCGCCGCCCG TAAAGGAGCT GAGCCGAGCG GGGGCGCCGC
CCGGGGTCCG GTGGGCAAAA GGCTACAGCA GGAGCTGATG ACCCTCATGA TGTCTGGCGA 
TAAAGGGATT TCTGCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG GGACCATCCA 
TGGAGCAGCT GGAACAGTAT ATGAAGACCT GAGGTATAAG CTCTCGCTAG AGTTCCCCAG 
TGGCTACCCT TACAATGCGC CCACAGTGAA GTTCCTCACG CCCTGCTATC ACCCCAACGT 
GGACACCCAG GGTAACATAT GCCTGGACAT CCTGAAGGAA AAGTGGTCTG CCCTGTATGA 
TGTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 
CTTGAACACA CATGCTGCCG AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACCTGCA 
AGAAACCTAC TCAAAGCAGG TCACCAGCCA GGAGCCCTGA CCCAGGCTGC CCAGCCTGTC 
CTTGTGTCGT CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 
GACTCTTTAT CTTGAGCTGT GGTATTTTTG ITTTGTTTTT GTCTTTTAAA TTAAGCCTCG 
GTTGAGCCCT TGTATATTAA ATAAATGCAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 

Seq ID NO: C50 DNA Sequence 

Nucleic Acid Accession #: NM_014584.1

Coding sequence: 227.. 1633 

1 11 21 31 41 51

| | | | | |

GCACGAGCCC CGGGCTGCCG GCGCGGGCGC CGCGGCACGT CCACAGGCTG GGTCGCGAGG 
TGGCGATCGC TGAGAGGCAG GAGGGCCGAG GCGGGCCTGG GAGGCGGCCC GGAGGTGGGG 
CGCCGCTGGG GCCGGCCCGC ACGGGCTTCA TCTGAGGGCG CACGGCCCGC GACCGAGCGT 
GCGGACTGGC CTCCCAAGCG TGGGGCGACA AGCTGCCGGA GCTGCAATGG GCCGCGGCTG 
GGGATTCTTG TTTGGCCTCC TGGGCGCCGT GTGGCTGCTC AGCTCGGGCC ACGGAGAGGA 
GCAGCCCCCG GAGACAGCGG CACAGAGGTG CTTCTGCCAG GTTAGTGGTT ACTTGGATGA 
TTGTACCTGT GATGTTGAAA CCATTGATAG ATTTAATAAC TACAGGCTTT T 
ACAAAAACTT CTTGAAAGTG ACTACTTTAG GTATTACAAG GTAAACC 
TCCTTTCTGG AATGACATCA GCCAGTGTGG AAGAAGGGAC TGTGCTGTCA AACCATGTCA 
ATCTGATGAA GTTCCTGATG GAATTAAATC TGCGAGCTAC AAGTATTCTG AAGAAGCCAA 
TAATCTCATT GAAGAATGTG AACAAGCTGA ACGACTTGGA GCAGTGGATG AATCTCTGAG 
TGAGGAAACA CAGAAGGCTG TTCTTCAGTG GACCAAGCAT GATGATTCTT CAGATAACTT 
CTGTGAAGCT GATGACATTC AGTCCCCTGA AGCTGAATAT GTAGATTTGC T 



T TTTAAGCCAC AGACAATTAA AAGACCTTTA AATCCTTTGG CTTCTGGTCA 
AGGGACAAGT GAAGAGAACA CTTTTTACAG TTGGCTAGAA G 
AGCATTCTAC AGACTTATAT CTGGCCTACA T 

ATATCTTTTA CAAGAGACCT GGTTAGAAAA GAAATGGGGA CACAACAT1A 
ACAGCGATTT GATGGAATTT TGACTGAAGG AGAAGGTCCA AGAAGGCTTf 
TTTTCTCTAC TTAATAGAAC TAAGGGCTTT ATCCAAAGTG TTACCAT7CT 
AGATTTTCAA CTCTTTACTG GAAATAAAAT TCAGGATGAG GAAAACAAAA T 
GGAAATACTT CATGAAATCA AGTCATTTCC TTTGCATTTT GATGAGAATT CATTTTTTGC 
TGGGGATAAA AAAGAAGCAC ACAAACTAAA GGAGGACTTT CGACTGCATT TTAGAAATAT 1380 
TTCAAGAATT ATGGATTGTG TTGGTTGTTT TAAATGTCGT CTGTGGGGAA AGCTTCAGAC 1440 
TCAGGGTTTG GGCACTGCTC TGAAGATCTT ATTTTCTGAG AAATTGATAG CAAATATGCC 1500 
AGAAAGTGGA CCTAGTTATG AATTCCATCI AACCAGACAA GAAATAGTAT CATTATTCAA 1560 
CGCATTTGGA AGAATTTCTA CAAGTGTGAA AGAATTAGAA AACTTCAGGA ACTTGTTACA 1620
GAATATTCAT TAAAGAAAAC AAGCTGATAT GTGCCTGTTT CTGGACAATG GAGGCGAAAG 1680 
AGTGGAATTT CATTCAAAGG CATAATAGCA ATGACAGTCT TAAGCCAAAC ATTTTATATA 1740 
AAGTTGCTTT TGTAAAGGAG AATTATATTG TTTTAAGTAA ACACATTTTT AAAAATTGTG 1800 
TTAAGTCTAT GTATAATACT ACTGTGAGTA AAAGTAATAC TTTAATAATG TGGTACAAAT 1860 
TTTAAAGTTT AATATTGAAT AAAAGGAGGA TTATCAAATT CATATATGAT AAAAGTGAAT 1920
GTTCTAAGTC TCTCAAACTA GCGTTTTA1G TAATAATATG TAATATAAAT AAAACTATGG 1980 
TAAATGTGAC AAGCATTTAA TAGGAAAATG CTAAGGAGGC CTCATAAATG ACCCATAATT 2040 
ACCAACGTAG AATTTTTCAG TACATTTAGG GTTGCTGGAT TTAGCAAATA AAAATAAAGA 2100 
TTGCCCAGTT AGATTTGAAT TTCAGATAAA CAATTAGTTT TTTAATATTT TACATGGAAT 2160 
AITTGGAAAA TACTTATACT AAAAAATTAT TTGTTTGAAA TTCACATTTA ACTGGGAGTC 2220 
TTGTATTTTA TCTGGCAATC CTAAAATACA TTGGTATGAA ACAAATCACT TTTAGAAGTA 2280 
TATTGCTATT TTGATTGGGT TGTTTTTGTG TGTAGAAACG TACAATAACA ACTCAAAGGC 2340 
ACAGGAGATT TCTAAACATT GTGAAAAGTT GAATAGATTA TATATTTATT CTCATAATAC 2400 
TTTCACTAAT ACTAAATAAA ATTTGGGGAA CACTTTTTAT TTTTATATAA TTTCCAATTT 2460 
ACAGAAAAGT TTCAAAAATA GTACAAAGAG CTCTCTTACC CAGATTCACT AATTGTTCAT 2520 
ACGTGCTTTA TCTTTCA 

ATCATTTGAA AGTCAGTTAT AGGCATCATG C 

ATACTGAATA ATTACTAAAA ATGATTTTCT CAGAAAAAAA AACTCCCACA A 
TATAATACTG TAAGCCTTAG AATAAATAAT ACTTTCAAGT T 
GAGTTTTGTT G 

TTAAAATTTT TTTTAGAGAC AAGGTTTTGC TGTGTTGCCC AAGCTGGAAC TTGAACGACT 
GGGCTGAAGT GATCTTCCCA CCTCAGCCIC CCAAGTAGCT G 
CCATACCCAG TTTCATTTTT GTTTTTTATA CCCGAAGTTC A' 

ACTGAACTGT AATTTTGGGA GGTTTTCATT AGTGGAAGCT CTTCATTTAT AAAGCTATTT 
GAAGGGGTTT AGGAATTTAT ATCACATGGT AATTGTAGAG AAAAAGAAGC TATATACCTC 
AAAATCGTGC CCTCTTTACA TATGTCTTAT CAGGTATAAC ATGTTGAAAT GTCACATTAG 
TAGTAAAGTG GGGTTTATTT ATATAGTGGT TAAGAAATGT CAGTTTACAC TGCTGTATAC 
TTCTTCTTCT GTGTCCCTAA GGCCTGGTAC AGTGCCAAGC ACAIACTIGG TATCCAATAA 
ATATTTGTTG GATGAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: C51 DNA Sequence 
Coding sequence: 37.. 723
CCACGTCCGG GGTGCCGAGC CAACTTTCCT GCGTCCATGC AGCCCCGCCG GCAACGGCTG 60



CTGTTGCTCG CCCCGGTGGC GGCGCCCGCG GGGTCCGGGG GCCCCGACGA CCCTGGGCAG 
CCTCAGGATG CTGGGGTCCC GCGCAGGCTC CTGCAGCAGA AGGCGCGCGC GGCGCTTCAC 
TTCTTCAACT TCCGGTCCGG CTCGCCCAGC GCGCTGCGAG TGCTGGCCGA GGTGCAGGAG 
GGCCGCGCGT GGATTAATCC AAAAGAGGGA TGTAAAGTTC ACGTGGTCTT CAGCACAGAG 
CGCTACAACC CAGAGTCTTT ACTTCAGGAA GGTGAGGGAC GTTTGGGGAA ATGTTCTGCT 
CGAGTGTTTT TCAAGAATCA GAAACCCAGA CCAACCATCA ATGTAACTTG TACACGGCTC 
ATCGAGAAAA AGAAAAGACA ACAAGAGGAT TACCTGCTTT ACAAGCAAAT GAAGCAACTG
AAAAACCCCT TGGAAATAGT CAGCATACCT GATAATCATG GACATATTGA TCCCTCTCTG 
AGACTCATCT GGGATTTGGC TTTCCTTGGA AGCTCTTACG TGATGTGGGA AATGACAACA 
CAGGTGTCAC ACTACTACTT GGCACAGCTC ACTAGTGTGA GGCAGTGGGT AAGAAAAACC 
TGAAAATTAA CTTGTGCCAC AAGAGTTACA ATCAAAGTGG TCTCCTTAGA CTGAATTCAT 
GTGAACTTCT AATTTCATAT CAAGAGTTGT AATCACATTT ATTTCAATAA ATATGTGAGT 
TCCTGC 

Seq ID NO: C52 DNA Sequence 

Nucleic Acid Accession #: NM_005409.3

Coding sequence: 94.. 378 



TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 
CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 



CTTAAAA ATATCAAAAC A 
AACCTAGAAC AAGTTTAACT GTGACIf 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG T 
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT G 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 
GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT A 
GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC A 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT A 
C1GTGGTTAC AGTGGAGACA TTGACA1 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 
CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 
TITATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140
GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260
TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 



Seq ID NO: C53 DNA Sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..609 

1 11 21 31 41 51 

| | | | | |

ATGCTGCGGC AGGTGCTTCG CAGAGGGCTC CAGTCGTTCT GCCACAGGCT GGGTTTGTGC 
GTGAGCCGGC ACCCGGTCTT TTTCCTCACC GTGCCCGCAG TCCTGACAAT CACCTTCGGC 
CTCAGCGCGC TCAACCGCTI CCAGCCCGAG GGCGACCTGG AGCGCCTGGT CGCTCCCAGC 
CACAGCCTGG CCAAGATCGA GCGCAGCCTG GCCAGCAGCC TTTTCCCCCT GGACCAGTCC
AAAAGCCAGC TCTATTCGGA CTTACACACC CCTGGGAGGT ATGGCAGGGT GATCCTCCIC 
TCCCCAACCG GGGACAATAT TTTGCTCCAG GCTGAGGGGA TCCTGCAGAC CCACCGAGCC 
GTGCTGGAAA TGAAGGTGAA CCACAAGGGC TATAATTATA CTTTTTCCCA TCTGTGTGTG 
TTGAGAAATC AGGATAAGAA ATGCGTGCTG GATGATATTA TTTCAGTGCT AGAGGATCTC 
AGGCAGGCTG CCGTCTCCAA TAAGACAACA GCCAGGGTGC AAGTGAGGTA TCCCAACACT 
AAATTAAAGG TATGCTCCTT CTGCATGCTT CTGCCAATTA AAGAGGCAGC ACTTCATTTC 
TTGCCCTAA 




TATTATGCTC TACAAGGGAT CGGGTTTATG GAGCAGGTGG AAGATCTATG GAACCACAGA
CAATCTGTGC TCCAGAGGTT ATGAAGCCAT GTAIACGCIA CIAGGCAATG CCAATGGAGC 
AACCTGTGCA TTCCCGTTCA AGTTTGAAAA CAAGTGGTAC GCAGATTGCA CGAGTGCTGG 
GCGGTCGGAT GGATGGCTCT GGTGCGGAAC CACTACTGAC TATGACACAG ACAAGCTATT
TGGATATTGT CCATTGAAAT TTGAGGGCAG TGAAAGCTTA TGGAATAAAG ACCCGCTGAC
CAGCGTTTCC TACCAGATAA ACTCCAAATC CGCTTTAACG TGGCACCAAG CGAGGAAAAG 
CTGCCAACAA CAGAACGCTG AGCTCCTGAG CATCACAGAG ATACATGAGC AAACATACCT 
GACAGGATTA ACCAGTTCCT TGACCTCAGG ACTCTGGATT GGACTTAACA GTCTGAGCTT
CAACAGCGGT TGGCAGTGGA GTGACCGCAG TCCTTTCCGA TATTTGAACT GGTTACCAGG 
AAGTCCATCA GCTGAACCTG GAAAAAGCTG TGTGTCACTA AATCCTGGAA AAAATGCTAA 
ATGGGAAAAT CTGGAATGTG TTCAGAAACT GGGCTATATT TGCAAAAAGG GCAACACCAC 
TTTAAATTCT TTTGTTATTC CCTCAGAAAG TGATGTGCCT ACTCACTGTC CTAGTCAGTG 
GTGGCCGTAT GCCGGTCACT GTTACAAGAT TCACAGAGAT GAGAAAAAAA TCCAGAGGGA 
TGCTCTGACC ACCTGCAGGA AGGAAGGCGG TGACCTCACA AGTATCCACA CCATCGAGGA 
ATTGGACTTT ATTATCTCCC AGCTAGGATA TGAGCCAAAT GACGAATTGT GGATCGGCTT 
AAATGACATT AAGATTCAAA TGTACTTTGA GTGGAGTGAT GGGACCCCTG TAACGTTTAC 
CAAATGGCTT CGTGGAGAAC CAAGCCATGA AAACAACAGA CAGGAGGATT GTG1 3 3 
GAAAGGCAAG GATGGGTACT GGGCAGATCG GGGCTGTGAG TGGCCTCTTG GCTACATCTG 
CAAGATGAAA TCACGAAGCC AAGGTCCAGA AATAGTGGAA GTCGAAAAAG G 
AGGCTGGAAA AAACATCACT TTTACTGCTA TATGAT1 
AGAAGCAAAC CAAACCTGTA ATAATGAGAA TGCTTAT 
TGAACAAGCC TTCCTGACTA GTTTCGTTGG CTTAAGGCCT G 
ACTTTCAGAT ATACAAACCA AAGGGACTTT TCAGTGGACC ATCGAGGAAG AGGTTCGGTT 



K ATGTGATGAA A 

3 ACGACTCCCG AACCCAAATG 
T TTCAAGCTGT A 
: TCGAGATTTT T 

CTTAGCTAGC ATCAATAACA AAGAGGAACA GCAAACAATA TGGCGA1TAA TAACAGCTAG 
TGGAAGCTAC CACAAACTGT TTTGGTTGGG ATTGACATAT GGAAGCCCTT CAGAAGGTTT 
TACTTGGAGT GATGGTTCTC CTGTTTCATA TGAAAACTGG GCTTATGGAG AACCTAATAA 
TTATCAAAAT GTTGAATACT GTGGTGAGCT GAAAGGTGAC CCTACTATGT CTTGGAATGA 
TATTAATTGT GAACACCTTA ACAACTGGAT TTGCCAGATA CAAAAAGGAC AAACACCAAA 
ACCTGAGCCA ACACCAGCTC CTCAAGACAA TCCACCAGTT A 
TTACAAAGAC TACCAGTATT ATTTCAGCAA AGAGAAGGAA A 

GTTTTGCAAG AGGAATTTTG GTGATCTTGT TTCTATTCAA AGTGAAAGTG AAAAGAAGTT 
TCTATGGAAA TATGTAAACA GAAATGATGC ACAGTCTGCA TATTTTATTG GTTTATTGAT 
CAGCTTGGAT AAAAAGTTTG CTTGGATGGA TGGAAGCAAA GTGGATTACG TGTCTTGGGC 
CACAGGTCAA CCCAATTTTG CAAATGAAGA TGAAAACTGT GTGACCATGT ATTCAAATTC 
AGGGTTTTGG AATGACATTA ACTGTGGCTA TCCAAACGCC TTCATTTGCC AGCGACATAA 

CAGTAGTATC AATGCTACCA CAGTTATGCC TACCATGCCC TCGGTCCCAT CAGGGTGCAA 2940

GGAAGGTTGG AATTTCTACA GCAACAAGTG TTTCAAAATC TTTGGAT7TA TGGAAGAAGA 3000

AAGAAAAAAT TGGCAAGAGG CACGAAAAGC TTGTATAGGC TTTGGAGGGA ATCTGGTCTC 3060 

CATACAAAAT GAAAAAGAGC AAGCATTTCT TACCI'ATCAC ATGAAGGACT CCACTTTCAG 3120 

TGCCTGGACT GGGCTGAATG ATGTCAATTC AGAACACACG TTCCTTTGGA CGGATGGACG 3180

AGGAGTCCAT TACACAAACT GGGGGAAAGG TTACCCTGGT GGAAGAAGAA GCAGTCTTTC 3240 

TTATGAAGAT GCTGACTGTG TTGTTATTAT TGGAGGTGCA TCAAATGAAG CAGGAAAATG 3300

GATGGATGAT ACCTGCGACA GTAAACGAGG CTACATATGC CAGACACGAT CCGACCCTTC 3360

CTTGACTAAT CCTCCAGCAA CGATTCAAAC AGATGGCTTT GTTAAATATG GCAAAAGCAG 3420 

CTATTCACTC ATGAGACAAA AATTTCAATG GCATGAAGCG GAGACATACT GCAAGCTTCA 3480

CAATTCCCTT ATAGCCAGCA TTCTGGATCC CTACAGTAAT GCATTTGCGT GGCTGCAGAT 3540 

GGAAACATCT AATGAACGTG TGTGGATCGC CCTGAACAGT AACTTGACTG ATAATCAATA 3600 

CACTTGGACT GATAAGTGGA GGGTGAGGTA CACTAACTGG GCTGCTGATG AGCCCAAATT 3660

GAAATCAGCA TGTGTTTATC TGGATCTTGA TGGCTACTGG AAGACAGCAC ATTGCAATGA 3720 

AAGTTTTTAC TTTCTCTGTA AAAGATCAGA TGAAATCCCT GCTACTGAAC CCCCACAACT 3780 

GCCTGGCAGA TGCCCGGAGT CAGATCACAC AGCATGGATT CCTTTCCATG GTCACTGTTA 3840

CTATATTGAG TCCTCATATA CAAGAAACTG GGGCCAAGCT TCTCTGGAAT GTCTTCGAAT 3900 

GGGTTCCTCT CTGGTTTCCA TTGAAAGTGC TGCAGAATCC AGTTTTCTGT CATATCGGGT 3960 

TGAGCCACTT AAAAGTAAAA CCAATTTTTG GATAGGATTG TTCAGAAATG "TGAAGGGAC 4020 

GTGGCTGTGG ATAAATAACA GTCCGGTCTC CTTTGTCAAC TGGAACACAG GAGATCCCTC 4080 

G AATGATTGTG TAGCTTTACA TGCGTCTTCT GGGTTTTGGA GTAATATTCA 4140 

C TACAAAGGAT ATATTTGTAA AAGACCAAAA ATTATTGATG CTAAACCTAC 4200 

TCATGAATTA CTTACAACAA AAGCTGACAC AAGGAAGATG GACCCTTCTA AACCGTCTTC 4260 

CAACGTGGCC GGAGTAGTCA TCATTGTGAT CCTCCTGATT TTAACGGGTG CTGGCCTTGC 4320 

CGCCTATTTC TTTTATAAGA AAAGACGTGT GCACCTACCT CAAGAGGGCG CCTTTGAAAA 4380 

CACTCTGTAT TTTAACAGTC AGTCAAGCCC AGGAACTAGT GATATGAAAG ATCTCGTGGG 4440 

A CAGAATGAAC ACTCGGTCAT CTAGTACCTC AATGCGATTC TGAGATATTT 4500 

RATTTTTAG" TCAATGTGAT TGTTTTCTTT 4560 

CTTTTTTCCT TTGCCTAATT GAAGAAATAA 4620 

T TCTAGCCTGG CAAGATATTT TCATAAAAGA GGGATAACAA TGCTGATTAC 4680

A AATATTTTAG ATAAATGCAC AGCACCACAG CACCACATCT AAGCATTAGT 4740

GATGGGTAGC TGATGTCAGC TTCATGTGGA TTTTAAGCAC TCTAGAAACA ATGAAGCTTC 4800 

TTGGCATATT TTAAGGAGCT CCCAAAATGT GTTACCTATT AAATTGTAAC TCAGCAAGTA 4860 

GAAGACCATT TGAAAAGTCA GGTACAAATT TCCTCAAGTG GCATAAAAAT GTAGTCAGTT 4920 

A CCAGTTTTTA TTTCCACTCC AATTATTTAG AACTTTATTT GTACATGTGC 4980 

h GGCAGCTGAG AATCTTGTTT CCCCCAAGAG AGTTTTACAG GCTGAGTGTT 5040 

T TCTTTGTCCT GTTATATGTA TATCAGGAAT ACAAGGATGT GAAATAAAAC 5100 

5160
CGCTGCCCCG GGATTCCCCA GCCCCGCGCG GCTCCCTACT CCACTTCGCA GCAACTTCGG
CGACCGCGCG CCGCCCGCCC TCGCCCGCCT TTGAAGTTTG CTGTGCCGAC CGCAAAGTTG 
GGACACTTCA GCGGATTGAA TTTTTCTCTT TTATCTGCCT CCGTCCCCGC CCTCCAGGCT 
TCTCGTTCCT GGATATTGGT GCTTAGCATC TTGGCAGGGT CCGGGGAOGT GGACTATTTC 
GCACACCACA CCACGGGGAG GGATTTTTTT CTATTTTCCC TACGAAAAAC AGATCTTTTT 
AAGGATGGTG CTGCTCCACT GGTGCCTGCT GTGGC7CCTG TTTCCACTCA GCTCAAGGAC 
CCAGAAGTTA CCCACCCGGG ATGAGGAACT TTTTCAGATG CAGATCCGGG ACAAGGCATT 
TTTTCATGAT TCGTCAGTAA TTCCAGATGG AGCTGAAATT AGCAGTTATC TCTTTAGAGA 
TACACCTAAA AGGTATTTCT TTGTGGTTGA AGAAGACAAT ACTCCATTAT CAGTCACAGT 
GACGCCCTGT GATGCGCCTT TGGAGTGGAA GCTGAGCCTC CAGGAGCTGC CAGAGGACAG 
GAGCGGGGAA GGCTCAGGTG ATCTGGAACC TCTTGAGCAG CAGAAGCAGC AGATCATTAA 
TGAGGAAGGC ACTGAGTTAT TCTCCTACAA AGGCAATGAT GTTGAGTATT TTATATCGTC 
TAGTTCCCCA TCCGGTTTAT ATCAGTTGGA TCTTCTTTCA ACAGAGAAAG ACACACATTT 
CAAAGTATAT GCCACCACAA CTCCAGAATC TGATCAGCCA TACCCTGAGT TACCCTATGA 
CCCAAGAGTA GATGTGACCT CACTGGGGCG CACCACGGTC ACTTTGGCCT GGAAACCAAG 
CCCCACTGCC TCTTTGCTGA AACAACCCAT TCAGTACTGT GTGGTCATCA ACAAAGAGCA 
CAATTTCAAA AGTCTCTGTG CAGTGGAAGC AAAACTGAGT GCAGATGATG CTTTTATGAT 
GGCACCGAAA CCTGGTCTGG ACTTCAGCCC CTTTGACTTT GCCCACTTTG GATTTCCTTC 
TGATAATTCA GGTAAAGAAC GCAGTTTCCA GGCAAAGCCT TCTCCAAAAC TGGGGCGTCA 
TGTCTACTCC AGGCCCAAGG TTGATATTCA GAAAATCTGC ATAGGAAACA AGAACATCTT 
CACCGTCTCT GATCTGAAAC CCGACACGCA GTACTACTTT GACGTATTTG TGGTCAACAT 
CAACAGCAAC ATGAGCACCG CTTATGTAGG TACCTTTGCC AGGACCAAGG A 
h GTCGAGCTCA AAGATGGGAA GA1AACAGAT G 



CTCTTGTCTG GATGCTGTCC A 
GAATGTGGAA GGCATTCAGC A 
TCGACIGAAA GGAAACAAGA AAGGAGCATC T 

TACTAAGCAG TCATTTCCCT CTCTTCCTGA AGACACAAGA ATCAAAGCCT TTGACAAGCT 
CCGTACCTGT TCCTCGGCCA CCGTGGCTTG GCTAGGCACT CAGGAAAGGA A 
CATCTACAAA AAAGAAG 

CCAATGTCTA GGACCAGATA TAAGGAAGAA GTCAGAAAAG GTCCTCTGTA AATATTTCCA 1980 

CAGTCAAAAC CTGCAGAAAG CAGTGACCAC AGAAACAATT AAAGGTCTTC AGCCTGGCAA 2040 

ATCTTACCTG CTGGATGTTT ATGTCATAGG ACATGGGGGG CACTCTGTAA AGTATCAGAG 2100 

- - 1CTAGAA AGTTCTGTTA GTTACCTTCT TATAGAGATA TATTATGTAG 2160 

A GGGACATTAA ATCACTTTAA GTATAAACTG ACTACTCCCA CAGTTGAGAG 2220 

AAGTTGTGAC CTGTACTTGT ACTATGGAAG GAAGGATATC AACGTGTGTA TATTGATGTT 2280 

TATATAAGTA ACTCTTGAAG GAGACTTGTT CTAGCGTGCC CCATGGTACC TAGTGTGTGT 2340 

CTGATGCCGG TTGGTGTCAA AGATAGAGGG CTTCTTGAAG GAACTTGCCA TTCCTTGCTT 2400 

TGACCACTGC ATGAACTGCT TCTAAATTAT TTTATTACCT AAAAATTTAA AATATGCCAT 2460 

TCATTGCACA CACCCACAAA TGCAAATCAT TCCTCTCTAT AGATGCTAGG ATATATATAA 2520 

ATTATTTTAT AAATTCTTGT TTTAAATGTC AGTGTTTCTA TGATTGTAAA CTATTAAATT 2580 

CTTTTCCTAT TAAAGTACAG ATCTAATCTA AGTATTATTA AGTTGATAGC CCTCTAGTCA 2640 

GTTATATTGC TATTGTAAAT TCTTGTTTGT TGAGTAAAAT GTTTAAATAC TATATGTATC 2700 

TCATGTACAA AGTTGACATA CATTATATTC ATGTACATAA AATTAAAGAG ATTAGATTAT 2760 

ATACTGTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2808 



1 11 21 31 41 51 

I I I I I I 

ATCCGGCGGT GGTGACGCGA GAGGCTGGGG TCTCCAGGAC CAAOTCCTCT TCS 
CTTCCTCAGC CTGCTCAATG TGAAGCCCCT GATCATGATT CACTTCCACT TAATAAATAA 
AGTGTTTACA AATCAGAATA ACTTTTAGAC AATATTAAGG TGGTAATCAT GAACAGAAAA 
GATTTTGTAG TTCTTCCATG GGGAAAACCT GGAAATTCTG TAAAGCTAAA ATATAGCAAT 
GTAATAATTA AAACAAAAGT CTAAGATTTG AAGAGATAAT TTGCTTCAGG A 
AAGGCAAATG CTAACTTTAA AAACCAGATT TCGGAGAAGT ACAAAAGAAA T 

CAAGAACTGC GAATGGAGAA AGTACAGITA GAGTTTGAGA ACCAAGAGAT G 

CTGCAAGAAT TCCGATCCAC AAGAAACAAA GAAAAGGAAG ATAGAGAGTC AAGCGAGTAT 
TACTGGAAAT CTGGAAAAGT GGGCAAAITG G 



ATTCAAGAGC CAGTGAAACC AACAGTTAAT TATAAAATGG CAAATTCTTC AGAATGTGAA 
AAACCCAAGA TAAATGGGAA AGTTTGTGGA CAGTGTGAGA ACAAAGCTGC TCTACTGGIA 
TGCCTTGAAT GTGGAGAAGA TTATTGTTCA GGATGCTTTG CTAATGTTCA CCAGAAAGGG 
GCACTAAAGC TCCACAGAAC AACTCTTTTG CAGGCAAGAT CTCAAATATT ATTCAATGTA 
TTGGATGTTG CCCATCAGTT TATAAAGGAT GTTAATCCAG ATGAACCCAA AGAGGAGAAT 



AGCAGCTCTG AGGTAGAAAT TACAACGATG AAAAGAGCAC AACGTACAAA ACCAAGAAAG 
AGTCTGTTGT GTGAAGGGTC ATTCGATGAA GAAGCTTCTG CACAGTCCTT TCAGGAAGTG 
TTAAGTCAAT GGAGAACCGG AAATCATGAT GACAACAAGA AACAGAATTT ACATGCAGCA 
GTAAAAGACT CATTGGAAGA ATGCGAAGTA CAGACTAATC TGAAAATTTG GAGAGAACCA 
CTTAATATTG AACTTAAAGA AGACATTCTA TCCTATATGG AAAAATTATG GCTTAAAAAA 
CACAGGAGAA CTCCACAAGA GCAACTTTTT AAATGCTACC AGATACGTTC CCACATCCAC 
ATGAAACCAC TGGTGATGCA CAGTGTTCTC AAAATGAAAA CGATGAAGAT AGTGATGGTG 
AGGAGACCAA AGTACAACAC ACAGCTCTTT TATTGCCAGT AGAAACATTA AACATAGAGA 
GACCTGAACC ATCTCTGAAG ATAGTCGAAC TGGATGATAC TTATGAAGAG GAATTTGAAG 
AAGCAGAAAA TATTGTGCCT TACAAAGTTA AATTAGCTGA TGCAGACAGT CAACGAAGTT 
GTGCTTTTCA TGATTGTCAG AAGAATAGCT TTCCATATGA AAATGGCATC CATCAACATC 
ATGTTTTCGA TAAGGGAAAG AGAGACTTCT TAAATCTTTG TCTGAGAAAC AGCTCTACTT 
ATTATAAAGA TAATTCAAAA GGAGAAACTT CAAACACAGA TTTTGAC 
TTGAAAGCAA GGACACTTTG CTAGGTAGAG ATTTAGAAAA AGCTCCCATT GAGGAGAAAT 1920 

TATCTCAAGA CATCAAAGAA TCCTTGGAAT TGAGCAATCT GTATAAGAGG CCAAGCTTTG 1980 

AAGAATCAAA AACTACAAAG TCATCACTGT TGTTACAAGA AATAGCCTGC AGAAGTAAGC 2040 

CTATAACAAA ACAATATCAA GGACTTGAGA GATTCTTTAT TTTTGATACA AATGAAAGAC 2100 

TCAACTTACT TCCTTCTCAT CGTTTAGAAT GCAACAATTC CAGTACTAGG ATTACACTTG 2160 

CAGGTCAGAA ATCACAGAGA CCTTCAACAG CAAATTTTCC ACTTTCCAAC TCTGTTAAAG 2220 

AAAGCTCCAG TTGCCTTTCA TCCTCTCATC CTCGATCAAG AAGTGCAGCT GCTCAATCAT 2280 

CATCTAGAGC TGCTTCTGAA ATTTCAGAAA TTGAATATAT TGATATTACT GACCAGAATG 2340 
AGCTTTCCTT AGATGACACT ACTGATCAAC ATACTTTAGA CAATTTGGA/ , lT TA io 

AAGTGCTGAG ATCTCTTGCA GATACTTCAG AAAAGCTTTA CAGCTTAACC TCAGAAGAGT 2460 

TCCCAGATTT CAGCAGCCAA TCACTGAATA TAAGTCAGAT TTCCACAGAT TTCCTTAAGA 2520 

CCTCACATGT GAGGGGTCCC TGTGGAGTTG AGGAATTGAG CTGTTCTGGA AGAGATACCA 2580 

AAATTCAGTC TTTGCTGTCA CTTTCTGAGA GCAGTACAGA TGAGGAGGAG GAAGATTTTC 2640 

TCAACAAGCA ACATGTCATC ACACTACCGT GGTCAAAGAG TACTTAAAGA TTATTTGTTC 2700 

ATTACTGTTT CCATTTTGTA CCCAGAGTAA AGCAAACAAC TGAGAAAAGT AACCAAGTGA 2760 

TTACCTATCC AAGTGCTGGA GATTTTGATT ACTAATGTCT TTGATGTTTC AAGGCTACAA 2820 

ACTAATAAAA GTAAAATTAT AAGTTCAAAA AAATTTTTAA AAAAAAAAAT AAAAAA 2876 

Seq ID NO: C57 DNA Sequence 

Nucleic Acid Accession #: NM_024687.1 
Coding sequence: 138.. 1706 

1 11 21 31 41 51 

| | | | | | 

AAAAACATGA TGACAACAAG AAACAGAATT TACATGCAGC AGTAAAAGAC TCATTGGAAG 60 
AATGCGAAGT ACAGACTAAT CTGAAAATTT GGAGAGAACC ACTTAATATT GAACTTAAAG 
AAGACATTCT ATCCTATATG GAAAAATTAT GGCTTAAAAA ACACAGGAGA ACTCCACAAG 
AGCAACTTTT TAAAATGCTA TCAGATACGT TCCCACATCC ACATGAAAOC P 




CACAGTGTTC TCAAAATGAA AACGATGAAG ATAGTGATGG TGAGGAGACC AAAGTACAAC 
ACACAGCTCT TTTATTGCCA GTAGAAACAT TAAACATAGA GAGACCTGAA CCATCTCTGA 
AGATAGTCGA ACTGGATGAT ACTTATGAAG AGGAATTTGA AGAAGCAGAA AATATTGTGC 
CTTACAAAGT TAAATTAGCT GATGCAGACA GTCAACGAAG TTGTGCTTTT CATGATTGTC 
AGAAGAATAG CTTTCCATAT GAAAATGGCA TCCATCAACA TCATGTTTTC GATAAGGGAA 
AGAGAGACTT CTTAAATCTT TGTCTGAGAA ACAGCTCTAC TTATTATAAA GATAATTCAA 
AAGGAGAAAC TTCAAACACA GATTTTGACA ACATCGTGGA TCCTGATGTG "ATTCTTCTG 
ACATTGAAAA AATTGAGGAA AGCACCTCCT TTGAAAGAAA TTTAAAGGAG AAAAATATAG 
GTTTAGAAAG TAATCAAAAG TCTGATGATT CCTGTGTATC ACTTGAAAGC AAGGACACTT 
TGCTAGGTAG AGATTTAGAA AAAGCTCCCA TTGAGGAGAA ATTATCTCAA GACATCAAAG 
AATCCTTGGA ATTGAGCAAT CTGTATAAGA GGCCAAGCTT TGAAGAATCA AAAACTACAA 

AGTCATCACT GTTGTTACAA GAAATAGCCT GCAGAAGTAA GCCTATAACA AAACAATATC 
AAGGACTTGA GAGATTCTTT ATTTTTGATA CAAATGAAAG ACTCAACTTA CTTCCTTCTC 
ATCGTTTAGA ATGCAACAAT TCCAGTACTA GGATTACACT TGCAGAAGAC AGAGAATGGA 
TTCCAGACCA TAGCTTAAGT GAATATGCTG ATAATGCAAT TGTCTTGGGT GTTCTGCAGG 
GTGCTCAGAG TCCATCATCA AGTAGAAAAC AGCAAAAGAT GGGTCAGAAA TCACAGAGAC 
CTTCAACAGC AAATTTTCCA CTTTCCAACT CTGTTAAAGA AAGCTCCAGT TGCCTTTCAT 1260 
CCTCTCATCC TCGATCAAGA AGTGCAGCTG CTCAATCATC ATCTAGAGCT GCTTCTGAAA 1320 
TTTCAGAAAT TGAATATATT GATATTACTG ACCAGAATGA GCTTTCCTTA GATGACACTA 
CTGATCAACA TACTTTAGAC AATTTGGAAA AAGAATTACA AGTGCTGAGA TCTCTTGCAG 




ATACTTCAGA AAAGCTTTAC AGCTTAACCT CAGAAGAGTT CCCAGATTTC AGCAGCCAAT 
CACTGAATAT AAGTCAGATT TCCACAGATT TCCTTAAGAC CTCACATGTG AGGGGTCCCT 




v.w^WA GGAATTGAGC TGTTCTGGAA GAGATACCAA AATTCAGTCT TTGCTGTCAC 
TTTCTGAGAG CAGTACAGAT GAGGAGGAGG AAGATTTTCT CAACAAGCAA CATGTCATCA 
CACTACCGTG GTCAAAGAGT ACTTAAAGAT TATTTGTTCA TTACTGTTTC CATTTTGTAC 
CCAGAGTAAA GCAAACAACT GAGAAAAGTA ACCAAGTGAT TACCTATCCA AGTGCTGGAG 
ATTTTGATTA CTAATGTCTT TGATGTTTCA AGGCTACAAA CTAATAAAAG TAAAATTATA 
AGTTCAAAAA AAAAAAAAAA AAAA 

Seq ID NO: C58 DNA Sequence 

Nucleic Acid Accession #: NM_005408.1 

Coding sequence: 76.. 372 

1 11 21 31 41 51 

I I I I I | 

AAAAGGCCGG CGGAACAGCC AGAGGAGCAG AGAGGCAAAG AAACATTGTG AAATCTCCAA 
CTCTTAACCT TCAACATGAA AGTCTCTGCA GTGCTTCTGT GCCTGCTGCT CATGACAGCA 
GCTTTCAACC CCCAGGGACT TGCTCAGCCA GATGCACTCA ACGTCCCATC TACTTGCTGC 
TTCACATTTA GCAGTAAGAA GATCTCCTTG CAGAGGCTGA AGAGCTATGT GATCACCACC 
AGCAGGTGTC CCCAGAAGGC TGTCATCTTC AGAACCAAAC TGGGCAAGGA GATCTGTGCT 
GACCCAAAGG AGAAGTGGGT CCAGAATTAT ATGAAACACC TGGGCCGGAA AGCTCACACC 
CTGAAGACTT GAACTCTGCT ACCCCTACTG AAATCAAGCT GGAGTACGTG AAATGACTTT 
TCCATTCTCC TCTGGCCTCC TCTTCTATGC TTTGGAATAC TTCTACCATA ATTTTCAAAT 
AGGATGCATT CGGTTTTGTG ATTCAAAATG TACTATGTGT TAAGTAATAT TGGCTATTAT 
TTGACTTGTT GCTGGTTTGG AGTTTATTTG AGTATTGCTG ATCTTTTCTA AAGCAAGGCC 
TTGAGCAAGT AGGTTGCTGT CTCTAAGCCC CCTTCCCTTC CACTATGAGC TGCTGGCAGT 
GGGTTTGTAT TCGGTTCCCA GGGGTTGAGA GCATGCCTGI GGGAGTCATG GACATGAAGG 
GATGCTGCAA TGTAGGAAGG AGAGCTCTTT GTGAATGTGA GGTGTTGCTA AAT£ 
TGTGGAAAGA TGAATGCAAT AGTAGGACTG CTGACATTTT GCAGAAAPTA CATTTTATT 
AAAATCTCCA AAAAAAAAAA 

Seq ID NO: C59 DNA Sequence 

Nucleic Acid Accession #: AK097746.1 

Coding sequence: 185.. 2224 
I I I I I I 

CTTTCATGAC AGTAACAAAT CCAAGATTTT GGAAAAGCGC CTfCC T( A ^_t\ 

CTTCACATAC AACTTATATT GTAATATATG CCGATCACTA TTTGAGAAGG ACAAGCTGTT 
ATTTTCCTTT TTATTATGTG CCAATCTTCT ICTGGCAAGG AAAGAGATTG AATACCAGGA 

AACTTGGCTA CAGGACAAAA GCTGGGAGGA AATCTG7CGG GCAAGIGAAT TTCCTGCCTT 
CAGAGGACTC AGGCAACATT TTTGTGAACA TATATATGAA TGGOGAGAAA TCTATGACAG 
TAAAGAGCCA CATAATGCTA AATTTCCAGC ACCAATGGAT AAGAACCTAA ATGAACTACA 
GAAAATAATA ATTCTTCGGT GTTTAAGACC TGATAAGATA ACCCCAGCTA TAACAAACTA 
TGTAACTGAC AAACTAGGGA AAAAGTTTGT AGAGCCTCCA CCATTTGATT TGACAAAGAG 
TTACTTGGAT TCAAATTGCA CCATTCCCTT AATTTTTGTT CTATCTCCAG GAGCAGATCC 
TATGGCCAGC CTGCTGAAAT TTGCAAATGA TAAATCTATG TCTGGAJ 

i^AGGGAC AAGGACCGAT TGCAGCAAAA ATGATTAAAG C 
T TGGGTGTGCC TACAGAATTG CCAICTTGCA GTGTCCTGGA TGCCCATGTT 
GGAAAAAATA TGTGAAGATT TTACCTCTGA AACCTGTAAC TCATCCTTTA G 
GACAAGCTAT CCATCTTCAA AATTCCCAGT AACAATTCTA CAGAATGGAG T 
TAATGAACCT CCCACGGGIC TTCGGCTGAA TCTCCTTCAA TCATATCTCA CTGATCCAGT 



A CTCAACACCC 



TGAATACTGT 



CAGTAGCTTA CTTGTTGGAA AGGTTCCAGA AATATGGGCC AAACGTTCAT ACCCAAGCCT 
TAAGCCCCTG GGAAGTTACA TCACAGATTT CCTAGCCCGG T 
GTATAATTCA GGAAAACCTT GTGTGTTTTG GCTGTCA 

TTTAACTGGA GCTATGCAGA ATTATGCCAG AAAATATACC ACCCCTATTG ATTTGCTAGG 
ATATGAATTT GAGGTTATCC CATCTGATAC ATCTGACACA TCACCAGAAG ATGGTGTTTA 
TATCCACGGA CTGTATCTCG ATGGCGCACG CTGGGACCGA G 
ACAATATCCC AAACTTCTGT TTGACCTGAT GCCCATCATA TGGATAAAAC C 
ATCTCGGATT ATAAAGTCGG ATGCCTATGT CTGTCCCCTC TACAAGACAA GTGAACGTAA 2100 
AGGAACTCTT TCCACTACGG GACATTCTAC TAACTTTGTC ATTGCAATGT TGTTAAAAAC 2160 
AGACCAACCT ACTCGGCACT GGATCAAGCG CGGGGTTGCT TTGCTTTGTC AGTTGGATGA 2220 
CTAAATTGGA CAAATTTATA AAACATCCAA AAGTTT 2256 

Seq ID NO: C60 DNA Sequence 
Nucleic Acid Accession #: J02761.1 
Coding sequence: 14.. 1159 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGT GCCATGGCTG AGTCACACCT GCTGCAGTGG CTGCTGCTGC TGCTGCCCAC 60 

GCTCTGTGGC CCAGGCACTG CTGCCTGGAC CACCTCATCC TTGGCCTGTG CCCAGGGCCC 120 

TGAGTTCTGG TGCCAAAGCC TGGAGCAAGC ATTGCAGTGC AGAGCCCTAG GGCATTGCCT 180 

ACAGGAAGTC TGGGGACATG TGGGAGCCGA TGACCTATGC CAAGAGTGTG AGGACATCGT 240 

CCACATCCTT AACAAGATGG CCAAGGAGGC CATTTTCCAG GACACGATGA GGAAGTTCCT 300 

GGAGCAGGAG TGCAACGTCC TCCCCTTGAA GCTGCTCATG CCCCAGTGCA ACCAAGTGCT 360 

TGACGACTAC TTCCCCCTGG TCATCGACTA CTTCCAGAAC CAGACTGACT CAAACGGCAT 420 

CTGTATGCAC CTGGGCCTGT GCAAATCCCG GCAGCCAGAG CCAGAGCAGG AGCCAGGGAT 480 

GTCAGACCCC CTGCCCAAAC CTCTGCGGGA CCCTCTGCCA GACCCTCTGC TGGACAAGCT 540 

CGTCCTCCCT GTGCTGCCCG GGGCCCTCCA GGCGAGGCCT GGGCOTCACA CACAGGATCT 600 



G GATGACAGCG CTGGCCCAAG GTCGCCGACA GGAGAATGGC "GCCGCGAGA 
:tctgca TGTCCGTGAC CACCCAGGCC GGGAACAGCA GCGAGCAGGC 
CATACCACAG GCAATGCTCC AGGCCTGTGT TGGCTCCTGG C 



: TCCAGTGTAT 

CCACAGCCCC GACCTTTGAT GAGAACTCAG CTGTCCAGCT G 



AAAATGGGAA CATACAGAAC TCTAAAAGAT AGACATCAGA AATIGTTAAG TTAAGCTTTT 
TCAAAAAATC AGCAATTCCC CAGCGTAGTC AAGGGTGGAC ACTGCACGCT CTGGCATGAT 
GGGATGGCGA CCGGGCAAGC TTTCTTCCTC GAGATGCTCT GCTC-C7TGAG AGC7AITGCT 



GATTGAAAGT CCTAGGGTGA TTCTATTTCT GCTGTGATTT ATCTGCTGAA A 
GGGTTGTGCA AGCTAGGGAC CCATTCCTGT GIAATACAAT GTCTGCACCA ATGCTAATAA 

AGTCCTATTC TCTTTTAA 

Seq ID NO: C61 DNA Sequence 



GGGGTCTGGG GAGGTGACAT GTTGGGCTGT GGGATCCCAG CGCTGGGCCT GCTCCTGCTG 
CGGCAGACGG AAATGGAATC CAGGGATTCT TCTACCCATG GAGCTGTGAG 
GGGACCGGGA GAGCTGTGGG GGCCAGGCGG CCATCGATAG CCCCAACCTC 
TGCCTGCGTC TCCGGTGCTG CTACCGCAAT GGGGTCTGCT ACCACCAGCG TCCAGACGAA 
AACGTGCGGA GGAAGCACAT GTGGGCGCTG GTCTGGACGT GCAGCGGCCT CCTCCICCTG 
AGCTGCAGCA TCTGCTTGTT CTGGTGGGCC AAGCGCCGGG ACGTGCTGCA TATGCCCGGT 
TTCCTGGCGG GTCCGTGTGA CATGTCCAAG TCCGTCTCGC TGCTCTCCAA GCACCGAGGG 
ACCAAGAAGA CGCCGTCCAC GGGCAGCGTG CCAGTCGCCC 7GTCCAAAGA GTCCAGGGAT 
GTGGAGGGAG GCACCGAGGG GGAAGGGACG GAGGAGGGTG AGGAGACAGA GGGCGAGGAA 
I AGGGGAGTCC CCGGGGGACT GCTCAATACA GATACGGTGG ACG 



I I I I I I 

GGGGACACTT TGTATGGCAA GTGGAACCAC TGGCTTGGTG GATTTIC-CTA G" 
ATTTTTAAAC TCCTGAAAAA TATCCCAGAT AACTGTCATG AAGCTGGTAA C™^^ 

GCTGGTGACC ATCAGCCTTT GTAGTTACTC TGCTACTGCC TTCCTCATCA ACAAAGTGCC 
CCTTCCTGTT GACAAGTTGG CACCT1TACC TCTGGACAAC ATTCTTCCCT TTATGGATCC 
ATTAAAGCTT CTTCTGAAAA CTCTGGGCAT TTCTGTTGAG CACCTTGTGG AGGGGCTAAG 
GAAGTGTGTA AATGAGCTGG GACCAGAGGC TTCTGAAGCT GTGAAGAAAC TGCTGGAGGC 
GCTATCACAC TTGG1GTGAC ATCAAGATAA AGAGCGGAGG TGGATGGGGA TGGAAGATGA 

TGCTCCTATC CTCCCTGCCT GAAACCTGTT CTACCAATTA TAGATCAAAT GCCCTAAAAT 

GTAGTGACCC GTGAAAAGGA CAAATAAAGC AATGAATACT AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA 

Seq ID NO: C63 DNA Sequence 
Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1.. 2374 



ATGCCCCTGT CCTATGCCTA TAAAAAOGCT GAGACCCTAG CAGGCAGACA CACAAGCAGC 
TCGATGTCGA GAGGAGCATA TCAGCGGAGG AACACACGGG CAGCTGGACG TCCAGAGGAA 
TGCACTGACA GAAACTGGCA TGCTGGCAGA ACACGTGGAA TTTGGCTGGG GCAGTTGGAG 
(,11 CAGATGTGTT CGGAGTTTCT TTCTTCTGGT GGGTTCGTGG TCTCGCTGGC 
TCAGGAGCGA AGCTGCAGAC CTTCACGCCA GCCCAGGAAG GGGCTCCCAC AGTGCAGCGG 
CAGGCTGAAG CGCTCCTCAA GTGCCGCCAG AGTGGGCGTC CAGGCAGAGG AGGCGCCGAG 



TACTCTC CAACCGAAAT TGCTCACAAG 480 

AGTTACTCCT GCAGCCTTCC AGACATGAAA ATCTCCATGG CAGAATCTGG CCCCTCCTTG 540 

GATAGCCTTG ACATTCTGGA GGATGGCGAG TCTGGGTCAC CATTTCTTGT GACTCATTTG 600 

TACTTTCTGG GGGTTGTCAC CACTGGGATG GAACAACTAG ATTTTGAAAC AGGACCAAAC 660 

ATATTTGATT TGCAGATTTA TGTGAAGGAT GAGGTTGGTG TCACAGACC7 GCAAGTCCTG 720 

ACTGTCCAGG TAACAGATGT GAACGAGCCA CCTCAGTTTC AAGGCAACTT GGCAGAAGAT 780 

CATCTCCGTG CAGACCAGCC ACATTTCAAT GCTCATAGTC ACACGTACGT GAGGGTAGTG 840 

GCTACTGCAT TGGCCAGGCA CAGGCTTAGA TCTAGCATTG GTTCCCCCTT CCTGGGCACC 900 

TTCIGTGTTG TGGTGGGCAT GCAGTATTTC CTGATTTCTC CCCCAAAGAG CTTCAGAATG 960 

TCTGCTAATG GCACCCTCTT CTCCACAACA GAATTGGACT TTGAAGCAGG ACACAGAAGT 1020 

TTCCATCTCA TCGTGGAGGT GAGGGACAGT GGAGGCCTCA AAGCCTCCAC AGAGCTCCAG 1080 

GTGAACATCG TGAACCTCAA CGACGAAGTC CCTCGCTTTA CCAGCCCGAC ACGAGTGTAC 1140 

ACAGTCCTGG AGGAACTGAG TCCAGGAACC ATCGTGGCCA ATATCACAGC GGAGGATCCT 1200 

3 CCACCTCCTC TACAGCATTA CCACTGTTAG CAAATATTTC 1260 

C AGTTGACTGG TACAATCCAA GTGGCCCAAA GGATAGACCG AGATGCAGGT 1320 

GAATTGAGAC AAAATCCCAC CATTTCCCTG GAAGTTCTAG TGAAGGACAG ACCATATGGG 1380 



CTCACAATCA TGGCAGAAGG CAAGGAGGAG CAAGTCACAT CTTACATGGA TGGCAGCAGG 
:ttgtgt AGGGAAACTC CTCCTTATAA AGCCATCAGA T CTCATG AG A 
r ATCACGAGAA CAACTCAGGA AAGACTTGCC CCCAIGATTC CATTTCCTCC 
:CACAAC ATGTAGGAAT TCAAGAATCC AGGCCACCAA CAACGAAGAC 
ACAAGCTCTG TCACTGTTAC TGTGAACATC CTTGAAGAAA ATGAIGAAAA GCCAATTTGT 
T CTTATTTCCT GGCCCTCCCA GTGGATCTGA AAGTTGGCAC AAATATTCAG 
^ATGTAC CGACCTTGAT TCCAGCCCCA GATCTTTCCG TTATTCCATT 
GGCCCAGGTA ACGTCAACAA TCATTTCACC TTCTCTCCCA ATGCTGGTTC CAATGTCACA 
CGCCTGCTGC TTACATCTCG CTTTGACTAT GCTGGTGGGT TTGATAAGAT CTGGGACTAC 
iCGTAAC TGATGACAAC TTGATGTCTG ACAGGAAGAA AGCGGAGGCT 
h CAGGAACAGT GACACTGAGT ATTAAAGTCA TTCCCCACCC AACCACTATC 
ATCACCACGA CCCCCAGGCC CAGGGTCACC TATCAGGTCC TGAGGAAAAA CGTTTACTCT 
CCATCTGCAT GGTACGTGCC GTTTGTCATC ACTTTGGGCT CCAIATTGCT TCTGGGTCTC 



GGGAAGAACA AGGAACCTCT GACAAAGAAA GGAGAAACGA AGACTGCAGA GAGAGACGTC 

GTGGTGGAAA CTATCCAGAT GAACACTATC TTTGATGGAG AAGCCATAGA TCCAGAGCCT 

GAGCAAGCTT CACTCGAGCT CTATGCCCTG CTGCCCAGCT GCTC-CGACCC TAG7CCAGTA 

ACCCTAAGAA AGGTCCAGGT GTGTGGGGAG AGTGAAGAGA CCGGTCAGTG TTCCGGCCAC 
ATCACACTTC CCGGCAAGAT TCCAGTCGAT GACCCAAGGA A 
GGTGATTTCG AGGTCTGGAC TCTATGCCCC GCTGTGAAGG TGGTTGTAGG CAGCCCTCAA 
GCTGAACGGT GCATTCGATT GGCTCTCAGT CTGAAAAAGT ACAGTTCTGA TTAA 

Seq ID NO: C64 DNA Sequence 

Nucleic Acid Accession #: XM_168571.1 

Coding sequence: 155.. 988 

1 11 21 31 41 51 

I I I I I I 

TACACAGTCC TGGAGGAACT GAGTCCAGGA ACCATCGTGG CCAATATCAC AGCGGAGGAT 

CCTGATGATG AAGGTTTTCC CAGCCACCTC CTCTACAGCA TTACCACTGT TAGCAAATAT 

TTCATGATAA ATCAGTTGAC TGGTACAATC CAAGTGGCCC AAAGGATAGA CCGAGATGCA 

GGTGAATTGA GACAAAATCC CACCATTTCC CTGGAAGTTC TAGTGAAGGA CAGACCATAT 

GGGGGTCAGG AGAATCGCAT CCAGATAACC TTCATTGTGG AAGACGTCAA CGACAATCCT 

GCCACATGCC AAAAGTTCAC CTTCAGCATT ATGGTGCCGG AAAGAACAGC CAAGGGGACG 

TTGCTTCTTG ACCTAAACAA GTTCTGCTTT GATGATGACA GTGAGGCACC AAACAACAGA 

TTCAACTTCA CCATGCCATC TGGAGTGGGG AGCGGCAGCA GATTTTTACA GGATCCAGCT 

GGCTCTGGGA AGATTGTGCT GATTGGTGAT CTAGACTACG AAAATCCAAG TAACCTAGCA 

GCCGGCAATA AATATACGGT G 



Pl TGTGTCAGAA AGAAGGCCCG C 

k AACGTTTACT CTCCATCTGC ATGGTACGTG CCGTTTGTCA TCACTTTGGG 

CTCCATATTG CTTCTGGGTC TCCTCGTGTA CCTGGTCGTC CTAT7GGCCA AAGCCATCCA 

CAGACACTGC CCCTGCAAGA CTGGGAAGAA CAAGGAACCT CTGACAAAGA AAGGAGAAAC 




CTGCCCTCGT TTTAACATCA CCCAGATTTC TTCAGTTATA AATATGCCAT ACACCTTTGT 
AAGTCACCTC AAATCTTCTT CAAAAGAAGC AGAACAGTGA AAAAAACAGA T 
AAGAGTTGGT CATCTGGAAA GAAGAAAACT C 
GGTGTCCGGA TCAGCATCCT GCATGTGAGA TTCATCCACG TTGTCCTGTC TAGCAGTAGT 
TCAGTTCTCT TCATGGTTAT GTCTGGTTTC ATTCTATGAT TATATCACAA TTTATCTATT 



ACTACAGGCA GGGCATGATG GCTCATGCCT GTAATGCCAG CACTTTGCCA AGGTGGGCAG 2100 

ATCACCTAAG GTCAGGAGTT CAAGATCAGC CTGGCCTAGA TGGCAAAACC CTGTCTCTAC 2160 

TAAAAAATAC AAAAATTAGC TTGGTGTGGT GGTGGGCACA TGTAATCCCA GCTACTTGGG 2220 

AGGCTGAGGT AGGGAGAACT GCTTAAACCT GAGAGGTGGA GGTTACAGTG AGTTGAGATT 2280 

GTGCCACTGC ACTCTAGCCT GGGTGACAAA GCAAGACTCC ATCTCAGAAA AAAAAAATAA 2340 

AAGTGAATTA CAACACT 2357 

Seq ID NO: C65 DNA Sequence 

Nucleic Acid Accession #: NM_005266.3 

Coding sequence: 122.. 1198 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGC CATTTTCAAA CAGTCCCTCC TGGGAGAACA CAGACAGGCA GAGGATTACA 60 
ACACAAGGCA GCAAGCACTG GGAGACGAAA GTTTTGGCAT CTGTTCCCTG GCTGTGCCAA 120 
GATGGGCGAT TGGAGCTTCC TGGGAAATTT CCTGGAGGAA GTACACAAGC ACTCGACCGT 180 
GGTAGGCAAG GTCTGGCTCA CTGTCCTCTT CATATTCCGT ATGCTCGTGC TGGGCACAGC 240 
TGCTGAGTCT TCCTGGGGGG ATGAGCAGGC TGATTTCCGG TGTGATACGA TTCAGCCTGG 300 
CTGCCAGAAT GTCTGCTACG ACCAGGCTTT CCCCATCTCC CACATTCGCI A 
GCAGATCATC TTCGTCTCCA CGCCCTCTCT GGTGTACATG GGCCACGCCA T 
GCGCATGCAG GAGAAGCGCA AGCTACGGGA GGCCGAGAGG GCCAAAGAGG T 
TGGCTCTTAC GAGTACCCGG TGGCAGAGAA GGCAGAACTG T 
TGGAAGGATT GCCCTCCAGG GCACTCTGCT C 

3TGGGCT TCATTGTGGG CCAGTACTTC ATCTACGGAA TCTTCCTGAC 
I GTCTGCCGCA GGAGTCCCTG TCCCCACCCG GTCAACTGIT ACGTATCCCG 
GCCCACAGAG AAGAATGTCT TCATTGTCTT TATGCTGGCT GTGC-CTGCAC TGTCCCTCCT 

ACCGCGGCAG CACATGGCTA AGTGCCAGCT TTCTGGCCCC TCTGTGGGCA TAGTCCAGAG 
CTGCACACCA CCCCCCGACT TTAATCAGTG CCTGGAGAAT GGCCCTGGGG GAAAAITCTT 
CAATCCCTTC AGCAATAATA TGGCCTC 

AGTACGAGGT CAGGAGCAGA CTCCTGGGGA AGGTTTCATC CAGGTTCGTT A 
GCCTGAGGTG CCCAATGGAG TCTCACCAGG TCACCGCCTT CCCCA7G3CT AT CATAGTGA 



G GGAGGAT CAG G 

ICCCTTCTG AACTGATGCT TTCTCACTGT CATCACTGCT TGGCTCCITT GAGCCCCGGG 
C GTTGCTCATT AATTCTAGAA ACTATAACCA GGGCTCTGGG A 



TTCAGATTAC TCATGAAACA GGGTAGAGGG AAAGAAGGGA AGCATGGCAA AAGCTGGCCT 
GGAAGGGATA GCCAGAGGGA TAGAATGACT CTCTCICTAC ATACCAGCAG C 
GCGTTCTCTA AGTTCCTACC TCCTTGACCT GATCACCCTC CCTCCTC 
TTCCGATCfiG ATGGCAAGGG TCTTTGCAGA TGTGCATTCC AGAGCCAGCC GGGATTCCAG 
TGAGATGGGT GCTGTTCTTG AGCAGCTTGA CCACCCCAAC AGGGATCTGG GCAGAGAGTC 
TGTGGATGCA GAGGAAGTTG TGAGAGAAAT TGATAAGCTT TTGCAAGACT GCCGGGGAAA 
CGACCCTGTG GCTGTGAGAA AGTGAAAAAA GAAAAAAAAA AAGGCATTGG CATTTTCTTG 
TCTCTTCTGT TGATTTAAAA ATGATCCCTC CTGGTGATAA CCCATTTTAC AGGGATGAAG 
AAAGACCAAT GCTGCTTTAA GGCTTTTAGT GAACATCTGA AGTGCCCACA AGTATGTTCT 
TTCCACTGCT GATTTCTTTT TCAGAGATAA CAATGGTTTC GTTTTGACCA AACTTGTATT 
AGGACAGAAT TAATGATGCT TAAAGAGAAA AGAAAAAAAG AGAGAAGAAA AAGGAGAGAT 
GAAAAAGGAG GATGAGGAGA AGAATTACCT TTTGACAATC TGTTAG3AAG GTATGCAGTG 
TGAGAACTGA AGTATTTCTG ATCACTCTCA GACTGTCCTC CGTGATTTAT GCTGACTTAA 
CTGTTTACCT ATAAACCCCA TACAAAGCAG GGTCATAATT TGTGATCTGT GGTGGATTTC 
TAGCAGTCAT CACAGGCTTC TACTGAAAGT CCTGAAAAGA CCTTGCAGTA GTCCAAGCTA 
CACCAAACAT TAACACATAT TTGTGGTAAA CATTTCTGTA TAAAGTTACC TGACACACAT 
ATAAACACAA GGAACATTCC ATATCATTAG TCGAAAACAA AAACAAAAAA AAAACCTTTG 
GTCATTTGTA AGACATCTCA TGTCATATAA AAGTTAAATG TAAAAAGATA CAGTCCATTT 
TGTCCTGCAC ACACGTAGAC TAATTCACGT CAAAAAAAAA AAAAAA 



G GCTGTGGGTC C 
TGGGGACATC ATATCAGGCT A 
GTGGGCCCTG GTGTCCGTGA GCTTCCTGGT CCTGTCCTGC TTCCCCTCAC TGTTCCCCCC 
C CCGCTTGTCT CAACCACCGC AGCCTTTGCT GCAGCCATCT CCATGGTGGT 
G GTGTACACCA GCGAGCGGTG GGACCAGCCT CCACACCCCC AGATCCAGAC 
CTTCTTCTCC TGGTCCTTCT ACCTGGGCTG GGTCTCAGCT ATCCTCTTGC TCTGTACAGG 
TGCCCTGAGC CTGGGTGCTC ACTGTGGCGG TCCCCGTCCT GGCTATGAAA CCTTGTGAGC 
AGAAGGCAAG AGCGGCAAGA TGAGTTTTGA GCGTTGTATT CCAAAGGCCT CATCTGGAGC 
CTCGGGAAAG TCTGGTCCTA CATTTGCCCG CCCTTCCAGC CCTTCCCCAG CCCCTCCTCT 
TGTTTCTTCA 1TCATTCAAC AAAATTTGGC TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA 



GTATCTGTGG TAAACCCAGT GACACGGGGG AGATGACATA CAAAAAGGGC AGGACCTGAG 
AAAGATTAAG CTGCAGGCTC CCTGCCCATA AAACAGGGTG TGAAAGGCAT CTCAGCGGCT 
GCCCCACCAT GGCTACCTGG GCCCTCCTGC TCCTTGCAGC CATGCICCTG GGCAACCCAG 
GTCTGGTCTT CTCTCGTCTG AGCCCTGAGT ACTACGACCT GGCAAGAGCC CACCTGCGTG 
ATGAGGAGAA ATCCTGCCCG TGCCTGGCCC AGGAGGGCCC CCAGGGTGAC CTGTTGACCA 
AAACACAGGA GCTGGGCCGT GACTACAGGA CCTGTCTGAC GATAGTCCAA AAACTGAAGA 
AGATGGTGGA TAAGCCCACC CAGAGAAGTG TTTCCAATGC TGCGACCCGG GTGTGTAGGA 
CGGGGAGGTC ACGATGGCGC GACGTCTGCA GAAATTTCAT GAGGAGGTAT CAGTCTAGAG 
TTACCCAGGG CCTCGTGGCC GGAGAAACTG CCCAGCAGAT CTGTGAGGAC CTCAGGTTGT 
GTATACCTTC TACAGGTCCC CTCTGAGCCC TCTCACCTTG TCCTGTGGAA GAAGCACAGG 
CTCCTGTCCT CAGATCCCGG GAACCTCAGC AACCTCTGCC GGCTCCTCGC TTCCTCGATC 
CAGAATCCAC TCTCCAGTCT CCCTCCCCTG ACTCCCTCTG CrGTCCTCCC CTCTCACGAG 
AATAAAGTGT C 



I I I I I I 

GCTGCAGAGG ATTCCTGCAG AGGATCAAGA CAGCACGTGG ACCTCGCACA GCCTCTCCCA 
CAGGTACCAT GAAGGTCTCC GCGGCAGCCC TCGCTGTCAT CCTCATTGCT ACTGCCCTCT 
GCGCTCCTGC ATCTGCCTCC CCATATTCCT CGGACACCAC ACCCTGCTGC TTTGCCTACA 

CCAACCCAGC AGTCGTCTTT GTCACCCGAA AGAACCGCCA A 
AGAAATGGGT TCGGGAGTAC ATCAACTCTT TGGAGATGAG C 
ACCTGAACTT ACACAAATTT GCCTGTTTCT GCTTGCTCTT G 
CCCTCACTAT CCTACCCCAC CCGCTCCTTG AAGGGCCCAG ATTCTACCAC ACAGCAGCAG 
TTACAAAAAC CTTCCCCAGG CTGGACGTGG TGGCTCACGC CTGTAATCCC AGCACITTGG 
GAGGCCAAGG TGGGTGGATC ACTTGAGGTC AGGAGTTCGA GACCAGCCTG GCCAACATGA 
TGAAACCCCA TCTCTACTAA AAATACAAAA AATTAGCCGG GCGTGGTAGC GGGCGCCTGT 
AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT GAACCCGGGA GGCGGAGCTT 
GCAGTGAGCC GAGATCGCGC CACTGCACTC C 



AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA G 

GCAGTGAGCT GAGATTGTGC CACITCACTC CAGCCTGGGT GACAAAGIGA GACTCCGTCA 

CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA AGAGCTTCTG AGGCGCTGCT 

TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG CCTTGGCTTT GCCAGGGCTC 
TGTGACCAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGG AGGC-GAGGAA CACTGCACTC 
TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA AAAATCGGCT 
TACCATTAAA GTTCTCAATG CAACCATAAA AAAAAAA 



Seq ID NO: C70 DNA Sequence 




T GGCTATTGGG ACTCTTTTTT CAAATGCAAT T 
3 CATTTGGATT TGATCCCAAA GTOGACAGTT ATGTTGAGAA G 
GTGTTTGGTG GATTTTACCT ACTTTTCTTI TTTGAAAGAA TGCTAAAGAT G 
ACATATGGTC AGAATGGTCA TACCCACTTT GGAAATGATA ACTTTGGTC 
ACTCATCAAC CTAAAGCATT ACCTGCCATC AATGGTGTGA C 
GTCACAGAAG CTAATGGACA TATCCATTTT GATAATGTCA G 

GGAAAAAAAG AGCCAAGTTC ATGTACCTGT TTGAAGGGGC CCAAACTGTC AGAAATAGGG 1380 

ACGATTGCCT GGATGATAAC GCTCTGCGAT GCCCTCCACA ATTTCATCGA TGGCCTGGCG 1440 

ATTGGGGCTT CCTGCACCTT GTCTCTCCTT CAGGGACTCA GTACTTCCAT AGCAATCCTA 1500 

TGTGAGGAGT TTCCCCACGA GTTAGGAGAC TTTGTGATCC TACTCAATGC AGGGATGAGC 1560 

ACTCGACAAC CCTTGCTATT CAACTTCCTT TCTGCATGTT CCTGCTATGT TGGGCTAGCT 1620 

TTTGGCATTT TGGTGGGCAA CAATTTCGCT CCAAATATTA TATTTGCACT TGCTGGAGGC 1680 

ATGTTCCTCT ATATTTCTCT GGCAGATATG TTTCCAGAGA TGAATGATAT GCTGAGAGAA 1740 

AAGGTAACTG GAAGAAAAAC CGATTTCACC TTCTTCATGA TTCAGAATGC TGGAATGTTA 1800 

ACTGGATTCA CAGCCATTCT ACTCATTACC TTGTATGCAG GAGAAATCGA ATTGGAGTAA 1860 

TAGAAAATGG AAGATGGTGT TGTTAATAAA GGCATTTAAT AGATAAAAAC ATCTCCAAAA 1920 

AGGATTTTGA AGCTGATCCT ATTTAGTTAA AAAGATAATT TTGCTTTCAA CTGTAGGTCC 1980 

AGAAAACTAA TTATTGGCAT CAGTCTGTGA AATAGTCCAT TATTTGTTGT TAAAAATGCT 2040 

TCAAAAGGTT TTCAGTGTCA GTCTGAGATG CCTGGTATAT AGGAGCCTTT GGGAAATACT 2100 

TATTTTTCAG TATTCCATGC ATATTAGATA TCACCATGAA GCAAGAGACA TGCATTCTAT 2160 

AATCATGTAG ACACTCAGAC TCAGGGGAAA ATACAAGTTA TATCCTGAAA GCCTTTAAAA 2220 

CTCTATGGTA GGATCAAAGA TTCAAATGGT TTCAGAGAGG TTTTATTTCA ATTAATTTGT 2280 

TCTAGTGCTT TCAAGAGCAA GTACATCAAA ATGTAGAAGG TAAAATGTAT GCAACACTAA 2340 

TATAAATTAT TCCAAGTCTT TAAGGAGCCA AAGAAAAAAA AGATTTCTCA CAGCTTTTTG 2400 

TTCTGTTTTG TATTTCAATT AGGAACTTGC AGTATTATTT TGAAAACCAT TCTAAAATAA 2460 

TAGGAGTTAG GAAATAAATA AAGTTTTGCT AGCCC1GCTA AGTTCAGGCT TAGAGGCTTA 2520 

TCGCTAAGTN TAAACTTCAC CAGATTCCAC GAAAAGCTGG ATAGCTTTTT TTCTGACTTA 2580 

TGTTGTGGTT GCACCCCTCA CAAATGGCAG AACAGTATGT AAAGCTGGTA ACACCICGGT 2640 

TTCAGTGCAC CATGTGTTTG CTTTGTGAAG GTGAAGAATA TGTTGGTTTA GAGAAAGAAA 2700 

TTGGATGTAA T1TTATGCAA TTTACTT1TA AAGACAAACA TAACTATTTA GCAGAGAATA 2760 

TTTTAATAAA TGCAAAACAA CAGCTGGACT GCTGTACATC AAGGACAGAT TAACTGGAAA 2820 



A AAATTAGCAC ACCATGGITA T' 

TTACTCAT TTAGAAGATA GAGAAAATTG GTCTAAAATT 3000 

I CCCAAGTCAC TTAAGGTGAT TTGATGGTGA GGAAAATGAT 3060 

TGACAAAGCC CAACAATGAT CTCAGGAATT ACATTTTCCA ACAGACCAAA AAATGTTTTC 3120 

ATGTAGCAGC AATGCAGATT TGGTGAATAT TTAATATATA TTTTAGTATG TATTTCACTT 3180 

Z AATTAAAAAA TATTGTTTGG CCAAATAGTA AACACCCITT TGAAACCATG 3240 

3246 




T GAACAAATCC TCTGACCTCA GGCCGGCTGT GAACGTAGTT CCTGAGAGAT 
AGCAAACATG CCCAACAGTG AGCCCGCATC TCTGCTGGAG CIGTTCAACA GCATCGCCAC 
ACAAGGGGAG CTCGTAAGGT CCCTCAAAGC GGGAAATGCG TCAAAGGATG AAATTGATTC 
TGCAGTAAAG ATGTTGGTGT CATTAAAAAT GAGCTACAAA GCTGCCGCGG GGGAGGATTA 
CAAGGCTGAC TGTCCTCCAG GGAACCCAGC ACCTACCAGT AATCATGGCC CAGATGCCAC 
AGAAGCTGAA GAGGATTTTG TGGACCCATG GACAGTACAG ACAAGCAGTG CAAAAGGCAT 
AGACTACGAT AAGCTCATTG TTCGGTTTGG AAGTAGTAAA ATTGACAAAG AGCTAATAAA 
CCGAATAGAG AGAGCCACCG GCCAAAGACC ACACCACTTC CTGCGCAGAG GCATCTTCTT 






CTCACACAGA GATATGAATC AGGTTCTTGA TGCCTATGAA AATAAGAAGC CATTTTATCT 
GTACACGGGC CGGGGCCCCT CTTCTGAAGC AATGCATGTA GGTCACCTCA TTCCATTTAT 
TTTCACAAAG TGGCTCCAGG ATGTATTTAA CGTGCCCTTG GTCATCCAGA TGACGGATGA 
CGAGAAGTAT CTGTGGAAGG ACCTGACCCT GGACCAGGCC IATGGCGATG CTGTTGAGAA 
TGCCAAGGAC ATCATCGCCT GTGGCTTTGA CATCAACAAG ACTTTCATAT TCTCTGACCT 
GGACTACATG GGGATGAGCT CAGGTTTCTA CAAAAATGTG GTGAAGATTC AAAAGCATGT 
TACCTTCAAC CAAGTGAAAG GCATTTTCGG CTICACTGAC AGCGACTGCA TTGGGAAGAT 
CAGTTTTCCT GCCATCCAGG CTGCTCCCTC CTTCAGCAAC TCATTCCCAC AGATCTTCCG 
AGACAGGACG GATATCCAGT GCCTTATCCC ATGTGCCATT GACCAGGATC CTTACTTTAG 
AATGACAAGG GACGTCGCCC CCAGGATCGG CTATCCTAAA CCAGCCCTGT TGCACTCCAC 
CTTCTTCCCA GCCCTGCAGG GCGCCCAGAC CAAAATGAGT GCCAGCGACC CAAACTCCTC 
CATCTTCCTC ACCGACACGG CCAAGCAGAT CAAAACCAAG GTCAATAAGC ATGCGTTTTC 
TGGAGGGAGA GACACCATCG AGGAGCACAG GCAGTTTGGG GGCAACTGTG ATGTGGACGT 
GTCTTTCATG TACCTGACCT TCTTCCTCGA GGACGACGAC AAGCTCGAGC AGATCAGGAA 
GGATTACACC AGCGGAGCCA TGCTCACCGG TGAGCTCAAG AAGGCACTCA TAGAGGTTCT 
GCAGCCCTTG ATCGCAGAGC ACCAGGCCCG GCGCAAGGAG GTCACGGATG AGATAGTGAA 
AGAGTTCATG ACTCCCCGGA AGCTGTCC1T CGACTTTCAG TAGCACTCGT TTTACATATG 
CTTATAAAAG AAGTGATGTA TCAGTAATGT ATCAATAATC CCAGCCCAGT CAAAGOACCG 
CCACCTGTAG GCTTCTGTCT CATGGTAATT ACTGGGCCTG GCCTCTGTAA GCCTGTGTAT 
GTTATCAATA CTGTTTCTTC CTGTGAGT1C CATTATTTCT ATCTCTTATG GGCAAAGCAT 
TGTGGGTAAT TGGTGCTGGC TAACATTGCA TGGTCGGATA GAGAAGTCCA GCTGTGAGTC 
TCTCCCCAAA GCAGCCCCAC AGTGGAGCCT TCGGCTGGAA GTCCATGGGC CACCCTGTTC 
TTGTCCATGG AGGACTTCCG AGGGTTCCAA GTATAC1CTT AAGACCC 
ATATATATTC TATGTATGCG TATATGGAAT TGAAATGTCA TTATTGT 
CTTTGAAATA TTGATGTGGG GAGGTTTATT GAGCACAAGA TGTATTTCAG C 
CTCCCAAAAA GAAATTGATA AGTAAAAGCT TCGTTATACA TTTGACTAAG AAATCACCCA 
GCTTTAAAGC TGCTTTTAAC AATGAAGATT GAACAGAGTT CAGCAATTTT GATTAAATTA 
AGACTTGGGG GTGAAACTTT CCAGTTTACT GAACTCCAGA CCATGCATGT AGTCCACTCC 
AGAAATCATG CTCGCTTCCC TTGGCACACC AGTGTTCTCC TGCCAAATGA CCCTAGACCC 
TCTGTCCTGC AGAGTCAGGG TGGCTTTTCC CCTGACTGTG TCCGATGCCA AGGAGTCCTG 
GCCTCCGCAG ATGCTTCATT TTGACCCTTG GCTGCAGTGG AAGTCAGCAC AGAGCAGTGC 
CCTGGCTGTG TCCTGGACGG GTGGACTTAG CTAGGGAGAA AGTCGAGGCA GCAGCCCTCG 
AGGCCCTCAC AGATGTCTAG GCAGGCCTCA TTTCATCACG CAGCATGTGC AGGCCTGGAA 



Coding sequence: 337.. 4632 

1 11 21 31 41 51 

I I I I I I 

CGGAGGACAG CCGGACCGAG CCAACGCCGG GGACTTTGTT CCCTCCACGG AGGGGACTCG 
GCAACTCGCA GCGGCAGGGT CTGGGGCCGG CGCCI'GGGAG GGATCTGCGC CCCCCACTCA 
CTCCCTAGCT GTGTTCCCGC CGCCGCCCCG GCTAGTCTCC GGCGCTGGCG CCTATGGTCG 
GCCTCCGACA GCGCTCCGGA GGGACCGGGG GAGCTCCCAG GCGCCCGGGA CTGGAGACTG 
ATGCATGAGG GGCCTACGGA GGCGCAGGAG CGGTGGTGAT GGTCTGGGAA GCGGAGCTGA 
AGTCCCCTGG GCTTTGGTGA GGCGTGACAG TTTATCATGA CCGTGTTCAG GCAGGAAAAC 
GTGGATGATT ACTACGACAC CGGCGAGGAA CTTGGCAGTG GACAGTTTGC GGTTGTGAAG 
AAATGCCGTG AGAAAAGTAC CGGCCTCCAG TATGCCGCCA AATTCATCAA GAAAAGGAGG 
ACTAAGTCCA GCCGGCGGGG TGTGAGCCGC GAGGACATCG AGCGGGAGG7 CAGCATCCTG 
AAGGAGATCC AGCACCCCAA TGTCATCACC CTGCACGAGG TCTATGAGAA CAAGACGGAC 
GTCATCCTGA TCTTGGAACT CGTTGCAGGT GGCGAGCTGT TTGACTTCTT AGCTGAAAAG 
GAATCTTTAA CTGAAGAGGA AGCAACTGAA TTTCTCAAAC AAATTCTTAA TGGTGTTTAC 
TACCTGCACT CCCTTCAAAT CGCCCACTTT GATCTTAAGC CTGAGAACAT AATGCTTTTG 
GATAGAAATG TCCCCAAACC TCGGATCAAG ATCATTGACT TTGGGTTGGC CCATAAAATT 
GACTTTGGAA ATGAATTTAA AAACATATTT GGGACTCCAG AGTTTGTCGC TCCTGAGATA 
GTCAACTATG AACCTCTTGG TCTTGAGGCA GATATGTGGA GTATCGGGGT AATAACCTAT 
ATCCTCCTAA GTGGGGCCTC CCCATTTCTT GGAGACACTA AGCAAGAAAC GTTAGCAAAT 
GTATCCGCTG TCAACIACGA ATTTGAGGAT GAATACTTCA GTAATACCAG TGCCCTAGCC 
AAAGATTTCA TAAGAAGACT TCTGGTCAAG GATCCAAAGA AGAGAATGAC AATTCAAGAT 
AGTTTGCAGC ATCCCTGGAT CAAGCCTAAA GATACACAAC AGGCACTTAG TAGAAAAGCA 
TCAGCAGTAA ACATGGAGAA ATTCAAGAAG TTTGCAGCCC GGAAAAAATG GAAACAATCC 
GTTCGCTTGA TATCACTGTG CCAAAGATTA TCCAGGTCAT TCCTGTCCAG AAGTAACATG 
AGTGTTGCCA GAAGCGATGA TACTCTGGAT GAGGAAGACT CCTTTGTGAT GAAAGCCATC 
ATCCATGCCA TCAACGATGA CAATGTCCCA GGCCTGCAGC ACCTTCTGGG CTCATTATCC 
AACTATGATG TTAACCAACC CAACAAGCAC GGGACACCTC CATTACTCAT TGCTGCTGGC 
TGTGGGAATA TTCAAATACT ACAGTTGCTC ATTAAAAGAG GCTCGAGAAT CGATGTCCAG 
GATAAGGGCG GGTCCAATGC CGTCTACTGG GCTGCTCGGC ATGGCCACGT CGATACCTTG 
AAATTTCTCA GTGAGAACAA ATGCCCTTTG GATGTGAAAG ACAAGTCTGG AGAGATGGCC 
CTCCACGTGG CAGCTCGCTA TGGCCATGCT GACGTGGCTC AAGTTACTTG TGCAGCTTCG 
GCTCAAATCC CAATATCCAG GACAAAGGAA GAAGAAACCC CCCTGCACTG TGCTGCTTGG 
CACGGCTATT ACTCTGTGGC CAAAGCCCTT TGTGAAGCCG GCTGTAACGT GAACATCAAG 
AACCGAGAAG GAGAGACGCC CCTCCTGACA GCCTCTGCCA GGGGCTACCA CGACATCGTG 
GAGTGTCTGG CCGAACATGG AGCCGACCTT AATGCTTGCG ACAAGGACGG ACACATTGCC 
CTTCATCTGG CTGTAAGACG GTGTCAGATG GAGGTAATCA AGACTCTCCT C 



GGCAACATGC CTATCGTGGT GGCCCTCTGT GAAGCAAACT GCAATTTGGA CATCTCCAAC 
AAGTATGGGC GAACGCCTCT GCACCTTGCG GCCAACAACG GAATCCTAGA CGTGGTCCGG 
TATCTCTGTC TGATGGGAGC CAGCGTTGAG GCGCTGACCA CGGACGGAAA GACGGCAGAA 
GATCTTGCTA GATCGGAACA GCACGAGCAC GTAGCAGGTC TCCTTGCAAG ACTTCGAAAG 
GATACGCACC GAGGACTCTT CATCCAGCAG CTCCGACCCA CACAGAACCT GCAGCCAAGA 
ATTAAGCTCA AGCTGTTTGG CCACTCGGGA TCCGGGAAAA CCACCCTTGT AGAATCTCTC 
C TGCTGAGGAG CTTTTTCAGA AGGCGTCGGC CCAGACTGTC TTCCACCAAC 
TCCAGCAGGT TCCCACCTTC ACCCCTGGCT TCTAAGCCCA CAGTCTCAGT GAGCATCAAC 
AACCTGTACC CAGGCTGCGA GAACGTGAGT GTGAGGAGCC GCAGCATGAT GTTCC-AGCCG 
GGTCTTACCA AAGGGATGCT GGAGGTGTTT GTGGCCCCGA CCCACCACCC GCACTGCTCG 
GCCGATGACC AGTCCACCAA GGCCATCGAC ATCCAGAACG CTTATTTGAA TGGAGTTGGC 
GATTTCAGCG TGTGGGAGTT CTCTGGAAAT CCTGTGTATT TCTGCTGTTA TGACTATTTT 
GCTGCAAATG ATCCCACGTC AATCCATGTT GTTGTCTTTA GTCTAGAAGA GCCCTATGAG 
ATCCAGCTGA ACCCAGTGAT TTTCTGGCTC AGTTTCCTGA AGTCCCTTGT CCCAGTTGAA 
GAACCCATAG CCTTCGGTGG CAAGCTGAAG AACCCACTCC AAGTTGTCCT GGTGGCCACC 
CACGCTGACA TCATGAATGT TCCTCGACCG GCTGGAGGCG AGTTTGGATA TGACAAAGAC 
ACATCGTTGC TGAAAGAGAT TAGGAACAGG TTTGGAAATG ATCTTCACAT TTCAAATAAG 
CTGTTTGTTC TGGATGCTGG GGCTTCTGGG TCAAAGGACA TGAAGGTACT TCGAAATCAT 
CTGCAAGAAA TACGAAGCCA GATTGTTTCG GTCTGTCCTC CCATGACTCA CCTGTGTGAG 
AAAATCATCT CCACGCTGCC TTCCTGGAGG AAGCTCAATG GACCCAACCA GCTGATGTCG 
CTGCAGCAGT TTGTGTACGA CGTGCAGGAC CAGCTGAACC CCCTGGCCAG CGAGGAGGAC 
CTCAGGCGCA TTGCTCAGCA GCTCCACAGC ACAGGCGAGA TCAACATCAT G 
ACAGTTCAGG ACGTGCTGCT CCTGGACCCC C 
TTGCTGTCCG TGGAGACCCC ACGGGCGCTG C 

GACATCCAGC GCCTGGTGCC CGACAGCGAC GTGGAGGAGC TGCTGCAGAT C 
ATGGACATCT GCGCCCGGGA CCTGAGCAGC GGGACCATGG TGGACGTCCC A 
AAGACAGACA ACCTGCACCG CTCCTGGGCT GATGAGGAGG ACGAGGTGAT GGTGTATGGT 
GGCGTGCGCA TCGTGCCCGT GGAACACCTC ACCCCCTTCC CATGTGGCAT CTTTCACAAG 
GTCCAGGTGA ACCTGTGCCG GTGGATCCAC C 



GTCAACCACG GCCAGGGCAT TGAGGTCCAG GTCCGTGGCC TGGAGACGGA GAAGATCAAG 
TGCTGCCTGC TGCTGGACTC G 
CCAGGGCTCC TGACCGTGAA GCATTACCTG A 
CCCGTCATGA TCTACCAGCC ACGGGACTTC T 
CTGACCAACA CCATGGGGGG G 

CACGACGTCT ACTCACAGGC CAGCCTCGGC ATGGACATCC A 
CTCACTCGGA GGAAACTGAG TCGCCTGCTG GACCCGCCCG A 
TGJCCTTCTCG CCATGAACTT AGGCCTCCCT GACCTCGTGG CAAAGTACAA CACCAATAAC 
A AGGATTTCCT CCCCAGCCCC CTCCACGCCC TGCTGCGGGA ATGGACCACC 
AGTGGG CACCCTCATG TCCAAACTGA GGGAGCTGGG TCGCCGGGAT 
C TTTTGCTGAA GGCATCCTCT GTGTTCAAAA TCAACCTGGA TGGCAATGGC 
CAGGAGGCCT ATGCCTCGAG CTGCAACAGC GGCACCTCTT ACAATTCCAT TAGCTCTGTT 
GTATCCCGGT GAGGGCAGCC TCTGGCTTGG ACAGGGTCTG TTTGGACTGC AGAACCAAGG 
A GCCCATCCTT CCCTTTGGAG ATGCTGAGGG TGTTTCTTCC TGCACCCACA 
A TGCCACTCCT CCCTCCGGCT TGACCTGTTT CTCTGCCGCT ACCTCCCTCC 
CCGTCTCATT CCGTTGTCTG TGGATGGTCA TTGCAGTTTA AGAGCAGAAC AGATCTTTTA 
C TTGAAAAGCT AGTGTACCTC CTCTCAGTGT TTTGGACTCC ATCTCTCATC 
2 TTGCTTCTTA CTGATAATTT TGCTGGAATT CCTAACTTTT CAATGACATT 
TTTTTTAACT ATCATATTGA TTGTCCTTTA AAAAAGAAAA GTGCATATTT ATCCAAAATG 
TGTATTTCTT ATACGCTTTT CTGTGTTATA CCATTTCCTC AGCTTATCTC TTTTATATTT 
GTAGGAGAAA CTCCCATGTA TGGAATCCCA CTGTATGATT TATAAACAGA CAATATGTGA 
GTGCCTTTTG CAGAAGAGGG TGTGTTTGAA ATCATCGGAG TCAGCCAGGA GCTGTCACCA 
AGGAAACGCT ACCTCTCTGT CCCTTGCTGT ATGCTGATCA TCGCCAGAGG TGCTTCACCC 
TGAGTTTTGT TTTGTATTGT TTTCTGACAG TTTTTCTGTT TTGTTTGGCA AGGAAAGGGG 
AGAAGGGAAT CCTCCTCCAG GGTGATTTTA TGATCAGTGT TGTTGCTCTA GGAAGACATT 
TTTCCGTTTG CTTTTGTTCC AATGTCAATG TGAACGTCCA CATGAAACCT ACACACTGTC 
ATGCTTCATC ATTCCCTCTC ATCTCAGGTA GAAGGTTGAC ACAGTTGTAG GGTTACAGAG 
ACCTATGTAA GAATTCAGAA GACCCCTGAC TCATCATTTG TGGCAGTCCC TTATAATTGG
TGCATAGCAG ATGGTTTCCA CATTTAGATC CTGGTTTCAT AACTTCCTGT ACTTGAAGTC 
TAAAAGCAGA AAATAAAGGA AGCAAGTTTT CTTCCATGAT TTTAAATTGT GATCGAGTTT 
TAAATTGATA GGAGGGAACA TGTCCTAATT CTTCTGTCCT GAGAAGCATG TAATGTTAAT
GTTATATCAT ATGTATATAT ATATATGCAC TATGTATATA CATATATATT AATACTGGTA 
TTTTTACTTA ATCTATAAAA TGTCGTTAAA AAGTTGTTTG TTTTTTTCTT TTTTTATAAA 
TAAACTGTTG CTCGTTAAAA AAAAAAAAAA 

Nucleic Acid Accession #: NM_002081.1
Coding sequence: 222..1898

1 11 21 31 41 51 

| | | | | |

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC C 
GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT G 
CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 
AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 
GCTGGTGGCT GCTATGTGCG G 
GCAAGAGCCG GAGCTGCGGC G 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG G 
CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 
CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 
TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 
CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 
AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 
GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG C 
ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG G 
GAGAGCTGCG CCTGCGGGCC ACCCGTGGCT TCGTGGCTGC TCGCTCCTTT G 
TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 
CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC G
CCTGCCCTGA CTATTGCCGA A 
G GAGGAACCTC C 
CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920



GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT T 

\ GGCCT CAAAG CAACCCGCTG GAGCCCACAG CGAGCOTGTG CCTTCCTCCC 

\ AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAG3GTC TGGCAGAGCC 
A TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGS CCCCACCTCC 



GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 



CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120

CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGCTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 



TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG C 



CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 



Seq ID NO: C74 DNA Sequence 

Nucleic Acid Accession #: BC030205.1

Coding sequence: 45..878



GTGAGCAGCC CCTAACAGGC TGTTACTTCA CTACAACTGA CGATATGATC ATCTTAATTT 60 
ATAACTCCAT ATGGCTTGAA CGAGCAGCCG GTGTGTACCA CAGAGAAGCA CGGTCTGGCA 180 
T TCATGTCTGT GCTGCTGGAT 300 



GAAAAACTGG CATTATTGAT TATGGAATCC GTCTCAATAG G 
ATTGCTACAA CCCACACGCA AAGGAGTGTG GTGGCGTCTT TACAGATCCA AAGCAAATTT 480 

GACTCAAGTA TGGTCAGCGT ATTCACCTGA GTTTTTTAGA TTTTGACCTT GAAGATGACC 600 
CAGGTTGCTT GGCTGATTAT GTTGAAATAT ATGACAGTTA C 



CCTTGAAGTT 
ATAAAAACTT TTTAGCTGGA AGATTTAGCC 



ATGTACCTAT ATGTATTTGC ATTTGAAATT TTGGAATCCT GCTCTATGTA CAGTTTTGTA 
TTATACTTTT TAAATCTTGA ACTTTATAAA CATTTTC 
AAACATGATT TTAAACAGCT GTAAAATATT CTATGATATG A 
AGCCTGTCTC TATTGTTGGA ATTTCAGGTC ATTTTCATAA ATATTGTTGC AATAAATATC 
A AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



Coding sequence:
GTGGCTCTTG CCT03ATGTC CTAGCCTAGG GGCCCCCGGG CCGGACTTGG CTGGGCTCCC 180

TTCACCCTCT GCGGAGTCAT GAGGGCGAAC GACGCTCTGC AGGTGCTGGG CTTGCTTTTC 240 

AGCCTGGCCC GGGGCTCCGA GGTGGGCAAC TCTCAGGCAG TGTGTCCTGG GACICTGAAT 300 

GGCCTGAGTG TGACCGGCGA TGCTGAGAAC CAATACCAGA CACTGTACAA GCTCTACGAG 360 

AGGTGTGAGG TGGTGATGGG GAACCTTGAG ATTGTGCTCA CGGGACACAA TGCCGACCTC 420 

TCCTTCCTGC AGTGGATTCG AGAAGTGACA GGCTATGTCC TCGTGGCCAT GAATGAATTC 480 

TCTACTCTAC CATTGCCCAA CCTCCGCGTG GTGCGAGGGA CCCAGGTCTA CGATGGGAAG 540 

TTTGCCATCT TCGTCATGTT GAACTATAAC ACCAACTCCA GCCACGCTCT GCGCCAGCTC 600 

CGCTTGACTC AGCTCACCGA GATTCTGTCA GGGGGTGTTT ATATTGAGAA GAACGATAAG 660 



TGGGGTCCTG G 
TGCTCAGGCC C 

TGTGTACCTC GCTGTCCACA GCCTCTTGTC TACAACAAGC TAACTTTCCA GCTGGAACCC 

AATGGGCTCA AGATGTGTGA GCCTTGTGGG GGACTATGTC 
TCGAGCAACA 



AAGATCCCTG CCCTGGACCC AGAGAAGCTC AATGTCTTCC GGACAGTACG GGAGATCACA 1380 
Z CACATGCACA A 



AACTTGAATG TCACATCTCT GGGCTTCCGA TCCCTGAAGG AAATTAGTGC Ti 

1620 



GCAGAGGGCA AAGTGTGTGA CCCACTGTGC TCCTCTGGGG G 

GGTCAGTGCT TGTCCTGTCG AAATTATAGC CGAGGAGGTG TCTGTGTGAC OCACTGCAAC 
G GGGAGCCTCG AGAATTTGCC CATGAGGCCG AATGCTTCTC C 
ATGGCT CGGGCTCTGA T 
CAATGTGCCC ATTTTCGAGA TGGGCCCCAC TGTGTGAGCA GCTGCCCCCA T 
GGTGCCAAGG GCCCAATCTA CAAGTACCCA GATGTTC 

GAGAACTGCA CCCAGGGGTG TAAAGGACCA GAGCTTCAAG ACTGTTTAGG ACAAACACTG 2100 

GTGCTGATCG GCAAAACCCA TCTGACAATG GCTTTGACAG TGATAGCAGG ATTGOTAGTG 2160

ATTTTCATGA TGCTGGGCGG CACTTTTCTC TACTGGCGTG GGCGCCGGAT TCAGAATAAA 2220 

AGGGCTATGA GGCGATACTT GGAACGGGGT GAGAGCATAG AGCCTCTGGA CCCCAGTGAG 2280 

AAGGCTAACA AAGTCITGGC CAGAATCTTC AAAGAGACAG AGCTAAGGAA GCTTAAAGTG 2340 

CTTGGCTCGG GTGTCTTTGG AACTGTGCAC AAAGGAGTGT GGATCCCTGA GGGTGAATCA 2400

ATCAAGATTC CAGTCTGCAT TAAAGTCATT GAGGACAAGA GTGGACGGCA GAGTTTTCAA 2460 

GCTGTGACAG ATCATATGCT GGCCATTGGC AGCCTGGACC ATGCCCACAT TGTAAGGCTG 2520

CTGGGACTAT GCCCAGGGTC ATCTCTGCAG CTTGTCACTC AATATTTGCC TCTGGGTTCT 2580 

CTGCTGGATC ATGTGAGACA ACACCGGGGG GCACTGGGGC CACAGCTGCT GCTCAACTGG 2640 

GGAGTACAAA TTGCCAAGGG AATGTACTAC CTTGAGGAAC ATGGTATGGT GCATAGAAAC 2700 

CTGGCTGCCC GAAACGTGCT ACTCAAGTCA CCCAGTCAGG TTCAGGTGGC AGATTTTGGT 2760 

GTGGCTGACC TGCTGCCTCC TGATGATAAG CAGCTGCTAT ACAGTGAGGC CAAGACTCCA 2820 

ATTAAGTGGA TGGCCCTTGA GAGTATCCAC TTTGGGAAAT ACACACACCA GAGTGATGTC 2880 

TGGAGCTATG GTGTGACAGT TTGGGAGTTG ATGACCTTCG GGGCAGAGCC CTATGCAGGG 2940 

CTACGATTGG CTGAAGTACC AGACCTGCTA GAGAAGGGGG AGCGGTTGGC ACAGCCCCAG 3000 

ATCTGCACAA TTGATGTCTA CATGGTGATG GTCAAGTGTT GGATGATTGA TGAGAACATT 3060

CGCCCAACCT TTAAAGAACT AGCCAATGAG TTCACCAGGA TGGCCCGAGA CCCACCACGG 3120 

TATCTGGTCA TAAAGAGAGA GAGTGGGCCT GGAATAGCCC CTGGGCCAGA GCCCCATGGT 3180 

CTGACAAACA AGAAGCTAGA GGAAGTAGAG CTGGAGCCAG AACTAGACCT AGACCTAGAC 3240 

TTGGAAGCAG AGGAGGACAA CCTGGCAACC ACCACACTGG GCTCCGCCCT CAGCCTACCA 3300

GTTGGAACAC TTAATCGGCC ACGTGGGAGC CAGAGCCTTT TAAGTCCATC ATCTGGATAC 3360 

ATGCCCATGA ACCAGGGTAA TCTTGGGGGG TCTTGCCAGG AGTCTGCAGT TTCTGGGAGC 3420 
C AGTCTCTCTA CACCCAATGC CACGGGGATG CCTGGCATCA 
T GAGGCTGAGC TCCAGGAGAA AGTGTCAATG 

3 CCGGAGCCCA CGGCCACGCG GAGATAGCGC CTACCATTCC 3600 




Seq ID NO: C76 DNA Sequence 
Coding sequence:



GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 
CCTCCTGGGC TAGAGATGGC T 




C CAAAGGGGGT 1380
GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 
TGTGAGAAGC C 






GGCGAGAGGC GGGCTGAGGC O 

CCATGGGAGG GGCCGTGGTG GACGAGGGCC CCACAGGCGT CAAGGCCCCT GACGGCGGCT 120

GGGGCTGGGC CGTGCTCTTC GGCTGTTTCG TCATCACTGG CTTCTCCTAC GCCTTCCCCA 180 

AGGCCGTCAG TGTCTTCTTC AAGGAGCTCA TACAGGAGTT TGGGATCGGC TACAGCGACA 240

CAGCCTGGAT CTCCTCCATC CTGCTGGCCA TGCTCTACGG GACAGGTCCG CTCTGCAGTG 300 

TGTGCGTGAA CCGCTTTGGC TGCCGGCCCG TCATGCTTGT GGGGGGTCTC TTTGCGTCGC 360 

TGGGCATGGT GGCTGCGTCC TTTTGCCGGA GCATCATCCA GGTCIACCTC ACCACTGGGG 420 

TCATCACGGG GTTGGGTTTG GCACTCAACT TCCAGCCCTC GCTCATCATG CTGAACCGCT 480

ACTTCAGCAA GCGGCGCCCC ATGGCCAACG GGCTGGCGGC AGCAGGTAGC CCTGTCTTCC 540 

TGTGTGCCCT GAGCCCGCTG GGGCAGCTGC TGCAGGACCG CTACGGCTGG CGGGGCGGCT 600 

TCCTCATCCT GGGCGGCCTG CTGCTCAACT GCTGCGTGTG TGCCGCACTC ATGAGGCCCC 660 

TGGTGGTCAC GGCCCAGCCG GGCTCGGGGC CGCCGCGACC CTCCCGGCGC CTGCTAGACC 720 

TGAGCGTCTT CCGGGACCGC GGCTTTGTGC TTTACGCCGT GGCCGCCTCG GTCATGGTGC 780 

TGGGGCTCTT CGTCCCGCCC GTGTTCGTGG TGAGCTACGC CAAGGACCTG GGCGTGCCCG 840 
ACACCAAGGC CGCCTTCCTG CTCACCATCC TGGGCTTCAT TGACATCTTC G 



TCTOCATGTT CTTCAACGGC CTCGCGGACC TGGCGGGCTC TACGGCGGGC GACTACGGCG 1020

GCCTCGTGGT CTTCTGCATC TTCTTTGGCA TCTCCTACGG CATGGTGGGG GCCCTGCAGT 1080 

TCGAGGTGCT CATGGCCATC GTGGGCACCC ACAAGTTCTC CAGTGCCATT GGCCTGGTGC 1140 

TGCTGATGGA GGCGGTGGCC GTGCTCGTCG GGCCCCCTTC GGGAGGCAAA CTCCTGGATG 1200 

CGACCCACGT CTACATGTAC GTGTTCATCC TGGCGGGGGC CGAGGTGCTC ACCTCCTCCC 1260 

TGATTTTGCT GCTGGGCAAC TTCTTCTGCA TTAGGAAGAA GCCCAAAGAG CCACAGCCTG 1320

AGGTGGCGGC CGCGGAGGAG GAGAAGCTCC ACAAGCCTCC TGCAGACTCG GGGGTGGACT 1380

1AGGCTG AGCCTGAGAA AAACGGGGAG GTGGTTCACA 1440 
A GTGGCTGGGC Gl 



CTCAAGGACC TGGAAACCCA 
CCGCAGACAG GCTGGCAGGG CAGGTGCTGC 
GGCTCACATG 



Coding sequence 



TTGGCACTAA TAGGAAGTAC 7TCACCAACT 
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3 GT AC CAAAGG A 

A TGAAAAGGTC CCTGGGGAGA AGGGCTGTCC AGCAGCCCTA C

ACCTTTACGA GACCCTGGGA GTCGTTGGAT CCACCACCAC TCAGCTGTAC ACGGACCGCA 

CGGAGAAGCT GAGGCCTGAG ATGGAGGGGC CCGGCAGCTT CACCATCTTC GCCCCTAGCA 480 

TTGAGCTGCT CAATGCCCTC CGCTACCATA TGGTGGGCAG GCGAGTCCTG ACTGATGAGC 600

TGAAACACGG CATGACCCTC ACCTCTATGT ACCAGAATTC CAACATCCAG ATCCACCACT 660 

ATCCTAATGG GATTGTAACT GTGAACTGTG CCCGGCTCCT GAAAGCCGAC CACCATGCAA 720 

CCAACGGGGT GGTGCACCTC ATCGATAAGG TCATCTCCAC CATCACCAAC AACATCCAGC 780 

AGATCATTGA GATCGAGGAC ACCTTTGAGA CCCTTCGGGC TGCTGTGGCT GCATCAGGGC 840

TCAACACGAT GCTTGAAGGT AACGGCCAGT ACACGCTTTT GGCCCCGACC AATGAGGCCT 900 

TCGAGAAGAT CCCTAGTGAG ACTTTGAACC GTATCCTGGG CGACCCAGAA GCCCTGAGAG 960 

ACCTGCTGAA CAACCACATC TTGAAGTCAG CTATGTGTGC TGAAGCCATC GTTGCGGGGC 1020 

TGTCTGTAGA GACCCTGGAG GGCACGACAC TGGAGGTGGG CTGCAGCGGG GACATGCTCA 1080 

CTATCAACGG GAAGGCGATC ATCTCCAATA AAGACATCCT A3CCACCAAC GGGGTGATCC 1140 

ACTACATTGA TGAGCTACTC ATCCCAGACT CAGCCAAGAC ACTATTTGAA TTGGCTGCAG 1200 

AGTCTGATGT GTCCACAGCC ATTGACCTTT TCAGACAAGC CGGCCTCGGC AATCATCTCT 1260

CTGGAAGTGA GCGGTTGACC CTCCTGGCTC CCCTOAATTC TGTATTCAAA GATGGAACCC 1320 

CTCCAATTGA TGCCCATACA AGGAATTTGC TTCGGAACCA CATAATTAAA GACCAGCTGG 1380 

CCTCTAAGTA TCTGTACCAT GGACAGACCC TGGAAACTCT GGGCGGCAAA AAACTGAGAG 1440 

TTTTTGTTTA TCGTAATAGC CTCTGCATTG AGAACAGCTG CATCGCGQCC CACGACAAGA 1500 

GGGGGAGGTA CGGGACCCTG TTCACGATGG ACCGGGTGCT GACCCCCCCA ATGGGGACTG 1560 

TCATGGATGT CCTGAAGGGA GACAATCGCT TTAGCATGCT GGTAGCTGCC ATCCAGTCTG 1620

CAGGACTGAC GGAGACCCTC AACCGGGAAG GAGTCTACAC AGTCTTTGCT CCCACAAATG 1680 

AAGCCTTCCG AGCCCTGCCA CCAAGAGAAC GGAGCAGACT CTTGGGAGAT GCCAAGGAAC 1740 

TTGCCAACAT CCTGAAATAC CACATTGGTG ATGAAATCCT GGTTAGCGGA GGCATCGGGG 1800 

CCCTGGTGCG GCTAAAGTCT CTCCAAGGTG ACAAGCTGGA AGTCAGCTTG AAAAACAATG 1860 

TGGTGAGTGT CAACAAGGAG CCTGTTGCCG AGCCTGACAT CATGGCCACA AATGGCGTGG 1920 

TCCATGTCAT CACCAATGTT CTGCAGCCTC CAGCCAACAG ACCTCAGGAA AGAGGGGATG 1980 

AACTTGCAGA CTCTGCGCTT GAGATCTTCA AACAAGCATC AGCGTTTTCC AGGGCTTCCC 2040 

AGAGGTCTGT GCGACTAGCC CCTGTCTATC AAAAGTTATT AGAGAGGATG AAGCATTAGC 2100 

TTGAAGCACT ACAGGAGGAA TGCACCACGG CAGCTCTCCG CCAATTTCTC TCAGATTTCC 2160 

ACAGAGACTG TTTGAATGTT TTCAAAACCA AGTATCACAC TTTAATGTAC ATGGGCCGCA 2220 

CCATAATGAG ATGTGAGCCT TGTGCATGTG GGGGAGGAGG GAGAGAGATG TACTTTTTAA 2280 

ATCATGTTCC CCCTAAACAT GGCTGTTAAC CCACTGCATG CAGAAACTTG GATGTCACTG 2340 

CCTGACATTC ACTTCCAGAG AGGACCTATC CCAAATGTGG AATTGACTGC CTATGCCAAG 2400

TCCCTGSAAA AGGAGCTTCA GTATTGTGGG GCTCATAAAA CATGAATCAA GCAATCCAGC 2460 

CTCATGGGAA GTCCTGGCAC AGTTTTTGTA AAGCCCTTGC ACAGCTGGAG AAATGGCATC 2520 

ATTATAAGCT ATGAGTTGAA ATGTTCTGTC AAATGTGTCT CACATCTACA CGTGGCTTGG 2580 

AGGCTTTTAT GGGGCCCTGT CCAGGTAGAA AAGAAATGGT ATGTAGAGCT TAGATTTCCC 2640 

TATTGTGACA GAGCCATGGT GTGTTTGTAA TAATAAAACC AAAGAAACAT A 2691 

Seq ID NO: C79 DNA Sequence 

Nucleic Acid Accession #: NM_006536.2

Coding sequence: 109..2940

1 11 21 31 41 51 

| | | | | |

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTA7TA TTRCTASSC? TT7TAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATT3TTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA 7TATATTAT7 TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATC7AA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

™™™ TG T^AACA^S ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

. __ ., TGTTACGCTG rgACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 
GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGOTGC AAATGGTAGA 2160

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACG3TAATAT TCAGATGAAT 2280

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGA3CGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCIATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420
CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 
TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 
TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 
TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 
TACCTAGGAA A 



1 11 21 31 41 51 

| | | | | |

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 
AGCTCTACAA GCCTCGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 
TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACT^AATG 
AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACGGAC 
ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 
AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 
TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 
TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 
GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 
CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 
GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 




GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 

AAATATGTTG ACATCAACAC ATTTGGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 




TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 
ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 
AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 
GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 
GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 
TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 
AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 
TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 



1 11 21 31 41 51 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 
AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 
TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 
AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 
ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCA^CAT^TC 
AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 
TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 
TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 
GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 
CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 
GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGCCATT 
GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 
AAATATGTTG ACATCAACAC ATTTGGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 
CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCA ;CT 
CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 
TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTT. ATTt? 
ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 
AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 
GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 
3TTCGTTGC TTCCAGGGCC TGCTGATTTT
TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 
ATCTGACCAA CACAGCCTCT ACCCACTGCT T 
GGCTGCCTGG ATCGGCATAT TTGTGGGCAT C 
TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 
AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 



TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 
CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 
TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 
AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 
CTGCTATGAA CTGATCTCTG G 
T CTCTGCTGGA C 




TGGCGAGACA GTCCAGGAAA G 
ATCCAAACGT ATCTTACGAA G 
TGAAAATGGC AAGGGTCCCT T 
AGACACCAAG ATTTTCTACA G 
CTTCGCTGTA GAGAAGGAGA C 
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG A 
CCCCATGAAC A 



CAGTAGTGGA 1020 



CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 
GCTGAAGCTC CCAACCICCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCAICC CCACTGGGGA 
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC A 
TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT A 
GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 
ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 
CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 
CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA G 
GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG G 
CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC ITTTGTTGGT 
GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA
CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 
GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG T 



TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 
CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 
3CATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 
A AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG
; TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 
AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 
TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 









TAAAGAAACT 

Seq ID NO: C84 DNA Sequence
Nucleic Acid Accession #: NM_
Coding sequence: 639..2546



CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC a 
CGGGGCGGGG GGCGCGGGCC ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 
CCGATGTCGC CCGCGCCCCG TTAGGATGAG TCTCGGGTCG GGCGAGGAGC CGCCGCAGCC 
GCCGCCGCCC GAGCCGCGGG CAGGAGCCTC GGGAGCCGCC GCCGCCGCCG CCGCCGCCCG 
GCCGGGCCCC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 
CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCGCC GCCCGCGCCC CGCACCCCGC 
CGTCCGCCCG CCGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCCGCGC 
CCTCGGGGCC CTCCCCGGTG CCGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 
GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGCGCCGAGA 
ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGCG CCCGGGCCCG 
ACGGGGCCCC GGCCAAGGGC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 
CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTCG TGCGTGGGCT 
TCGCCGTGGG CTTGGGCAAC GTGTGGCGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 
GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 
TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 
CCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 
ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCACGCTGC 
CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCCGCCATG 
AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 
CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 
CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 
TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 
CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 
GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG 
ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 
GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 

iCAG-i ;gac CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG

CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 

CCTACCCGCG GGCTGTCACG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT

rcATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 
GCCTCCTCGA CCTCCTCCCG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 
CCCTCTGTTG TGCCCTCTGC TTTGTCATCG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 

ACGTCTTCCA GCTGTTTGAC TACTACTCGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 

TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 

CCTGTATGAT CGGGTACCGA CCITGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220

CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280

ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 

CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 

CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 

CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 

TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 

GCCATAGCAG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 

CACTTTTGGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 

ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 

TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820

CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 

CTCTGCAGCA CACCCGTGGG TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940

TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 

GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 

TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAAICGCCA 3120 

CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGCGT GCGTGAGTAC GGAGAGTATA 3180 

TATAGATCTC TATCTCTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 

GGAGGCTTGT ATTTTGCACA TITTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 

TTTCTAAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 

CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420

GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGC-ATG TGCGTGTGTG 3480 

GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCCCAC 3540 

GCTGTCCCTT TGCCACAAGT CIGTGGGGCA AGAGGCTGCA ATATTCCGTC CTGGGTGTCT 3600 

GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 

GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 

CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 

TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 

A CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 



Seq ID NO: C85 DNA Sequence
Nucleic Acid Accession #: NM
Coding sequence: 180..1658



TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA
GTCAGAGTCG CAGTGGGAGT CCCCGGACCG GAGCACGAGC CTGAGCGGGA GAGCGCCGCT
CGCACGCCCG TCGCCACCCG CGTACCCGGC GCAGCCAGAG CCACCAGCGC AGCGCTGCCA 
TGGAGCCCAG CAGCAAGAAG CTGACGGGTC GCCTCATGCT GGCTGTGGGA GGAGCAGTGC 
TTGGCTCCCT GCAGTTTGGC TACAACACTG GAGTCATCAA TGCCCCCCAG AAGGTGATCG 
AGGAGTTCTA CAACCAGACA TGGGTCCACC GCTATGGGGA GAGCATCCTG CCCACCACGC 
TCACCACGCT CTGGTCCCTC TCAGTGGCCA TCTTTTCTGT TGGGGC-CATG ATTGGCTCCT 
TCTCTGTGGG CCTTTTCGTT AACCGCTTTG GCCGGCGGAA TTCAATGCTG AT3ATGAACC 
TGCTGGCCTT CGTGTCCGCC GTGCTCATGG GCTTCTCGAA ACTGGGCAAG TCCTTTGAGA 
TGCTGATCCT GGGCCGCTTC ATCATCGGTG TGTACTGCGG CCTGACCACA GGCTTCGTGC 
CCATGTATGT GGGTGAAGTG TCACCCACAG CCTTTCGTGG GGCCCTGGGC ACCCTGCACC 
AGCTGGGCAT CGTCGTCGGC ATCCTCATCG CCCAGGTGTT CGGCCTGGAC TCCATCATGG 
GCAACAAGGA CCTGTGGCCC CTGCTGCTGA GCATCATCTT CATCCCGGCC CTGCTGCAGT 
GCATCGTGCT GCCCTTCTGC CCCGAGAGTC CCCGCTTCCT GCTCATCAAC CGCAACGAGG 
AGAACCGGGC CAAGAGTGTG CTAAAGAAGC TGCGCGGGAC AGCTGACGTG ACCCATGACC 
TGCAGGAGAT GAAGGAAGAG AGTCGGCAGA TGATGCGGGA GAAGAAGGTC ACCA7CCTGG 
AGCTGTTCCG CTCCCCCGCC TACCGCCAGC CCATCCTCAT CGCTGTGGTG CTGCAGCTGT 
CCCAGCAGCT GTCTGGCATC AACGCTGTCT TCTATTACTC CACGAGCATC TTCGAGAAGG 



C GCTAGCACTG 

TGGGTCCTGG CCCCATCCCA TGGTTCATCG TGGCTGAACT CTTCAGCCAG 
CAGCTGCCAT TGCCGTTGCA GGCTTCTCCA ACTGGACCTC AAATTTCATT GTGGGCATGT 
GCTTCCAGTA TGTGGAGCAA CTGTGTGGTC CCTACGTCTT CATCATCTTC A 
TGGTTCTGTT CTTCATCTTC ACCTACTTCA AAGTTCC 
ATGAGATCGC TTCCGGCTTC CGGCAGGGGG GAGCCAG 

A TCCCCTGGGG GCTGATTCCC AAGTGTGAGT C 



ATGTCAGCCG A 

CAAAAGCAAG ACTGTTGCTC 



I GAGTCTCCTG TGCCCACATC C 

IAGCCCT GTCGAGACAC TTGCCTTCTT CACCCAGCTA ATCTGTAGGG 
CTGGACCTAT GTCCTAAGGA C 
GAGGTGGCTA TGGCCACCCG T 
CATTAGGATT TGCCCCTTCC CATCTCTTCC T 
CCTGAGACCA GTTGGGAGCA C 
GCCGGGTTCT AGTCTCCTTT GCACTGAGGG CCACACTATT ACCATGAGAA GAGGGCCTGT 2340 
GGGAGCCTGC AAACTCACTG CTCAAGAAGA CATGGAGACT CCTGCCCTGT TGTGTATAGA 2400 
TGCAAGATAT TTATATATAT TTTTGGTTGT CAATATTAAA TACAGACACT AAGTTATAGT 2460

TATAAATGGC TGGTTTTTAG AAACATGGTT TTGAAATGCT TGTGGATTGA GGGTAGGAGG 25 GO 

TTTGGATGGG AGTGAGACAG AAGTAAGTGG GC-TTGCAACC ACTGCAACGG CTTAGACTTC 2640 

GACTCAGGAT CCAGTCCCTT ACAGGTACCT CTCATCAGTG TCCTCTTGCT CAAAAATCTG 2700

TTTGATCCCT GTTACCCAGA GAATATATAC ATTCTTTATC TTGACATTCA AGGCATTTCT 2760 

ATCACATATT TGATAGTTGG TGTTCAAAAA AACACTAGTT TTGTGCCAGC CGTGATGCTC 2820 

AGGCTTGAAA TCGCATTATT TTGAATGTGA AGGGAA 2856

Seq ID NO: C86 DNA Sequence

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53.. 1576 



GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA
CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 
CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 
ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC

GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC

CAAGCTCCTG GCCCTGGCCC TGATCATCCT G 

TGTGTCCAAT CTAGATCCCA ACTTCTCATT T 

TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 

CACAGAGGAA ATGATCAACC C 
CATCGTGACG CTGGTGTACG TGCTGACCAA C



GTCCTGGATC ATCCCCGTCT T 
GTTCACATCC TCCAGGCTCT T 
Z CACCCACAGC T 

GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 
C TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 
T GGCCCTGCCT G 

3 ATCGCCGTCT C 

CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 
C CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 
Coding sequence:



TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGQCTT CCCCGAAAAC CTTCCCCGCT 
TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 
AGCCCTGASG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 
TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC
TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 
GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 
TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 
CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 
ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 
GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 
TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 
ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 
TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC
T GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 
3 TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 
CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 
T GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 
G GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 
CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 
C CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 
GAAATAG TGAGGGCCAA TGC 

Seq ID NO: C88 DNA Sequence



CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT G 
CGTGTGCTCA GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 
GCTCTCCTTC CTCCTACTGG CTGCTCAGGT GCTCCTGGTG GAGGGGAAAA A 
GAATGGACTT CACAGCAAAG TGGTCTCAGA ACAAAAGGAC A 
TAAGCAGAAA AGCAGGCCCG GGAACAAAGG CAAGTTTGTC ACCAAAGACC AAGCCAACTG 



GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA A 
TGAGAGAGTC TATTGGAAAC AAGTTGCCCG GAATCTGCGC TCACAGAAAG ACATCTGTAG 
ATATTCCAAG ACAGCTGTGA AAACCAGAGT GTGCAGAAAG GATTTTCCAG AATCCAGTCT 
TAAGCTAGTC AGCTCCACTC TATTTGGGAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 
GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACCCA 
GACCATGGCC ACCAAAGCTC CCGAGTGTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 
GACTGCCCTG GAGTTCTGTG GAGAGACTTG GAGCTCTCTC TGCACATTCT TCCTCAGCAT
AGTGCAGGAC ACGTCATGCT AATGAGGTCA AAAGAGAACG GGTTCCTTTA AGAGATGTCA 
TGTCGTAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACAGT CCCCCCAAAA TATGAACTTT 
TGTGCTTAGT GAGTGCAACG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 
TGGAATTTGC CTTATTTTTC TTGGATGCGA TGTTCAGAGG CTGTTTCCTG CAGCATGTAT 
TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGCGAAGAG TCTTTGAGCT GAATGAGCCA
GAGTGATAAT TTCAGTGCAA CGAACTTTCT GCTGAATTAA TGGTAATAAA ACTCTGGGTG 
TTTTTCAAAA AAAAAAAAAA AAA 

Seq ID NO: C89 DNA Sequence 
Nucleic Acid Accession #: BC022542
Coding sequence: 274.. 927

1 11 21 31 41 51 

| | | | | |

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 
CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 
TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 
GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 
CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 
TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 
TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 
AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 
ATTTGTCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 
GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 
AAGCCTCGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 
TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 
TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 
CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 
TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 
TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG
TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 
TAGAGGAAAT TTGGGATCAT TCTCAGCTAA T 
CCTTGGTAAT CCTCAAGCAT CAGATGCCAT A 
TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 



A ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 1320 

C CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 1380 

C TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 1440 

CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 1500
T TTAAATAGTA

ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 
CTAAGAAATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 
ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 
GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 
TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 







CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TCCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 

CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC A( 



CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280

TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 2334

Seq ID NO: C91 DNA Sequence

Nucleic Acid Accession #: NM_000213

1 11 21 31 41 51

| | | | | |

GCGCTGCCCG CCTCGTCCCC ACCCCCCAAC CCCCCGCGCC CGCCCTCGGA CAGTCCCTGC 60
TCGCCCGCGC GCTGCAGCCC CATCTCCTAG CGGCAGCCCA GGCGCGGAGG GAGCGAGTCC 120 
GCCCCGAGGT AGGTCCAGGA CGGGCGCACA GCAGCAGCCG AGGCTGGCCG G 
C GCCCCAGCCC ATGGGCCAGG CTGCTCCTGG C 

CGGCAAA CCGCTGCAAG AAGGCCCCAG TGAAGAGCTG 
CACGGAGTGT GTCCGTGTGG ATAAGGACTG CGCCTACTGC ACAGACGAGA V 



CAGCCAGATG TCCCCCCAAG GCCTGCGGGT CCGTCTGCGG CCCGGTGAGG AGCGGCATTT 540 

CTCCAACTCC ATGTCCGATG ATCTGGACAA C 
GGTCCTGAGC CAGCTCACCA GCGACTACAC I 



3 CCTGACAGAA GATGTGGATG A 

TAAACTGCAG GGAGAGCGGA TCTCAGGCAA CCTGGATGCT CCTGAGGGCG GCTTCGATGC 900 

rGGCTC-G CGCCCGGACA GCACCCACCT 960 

A CTATGAGGCT GATGGCGCCA ACGTGCTGGC 1020 

G ATGAACGGTG CCACCTGGAC ACCACGGGCA CCTACACCCA 1080 

GTACAGGACA CAGGACTACC CGTCGGTGCC CACCCTGGTG CGCCTGCTCG CCAAGCACAA 1140 

CATCATCCCC ATCTTTGCTG TCACCAACTA CTCCTATAGC TACTACGAGA AGCTTCACAC 1200

CTATTTCCCT GTCTCCTCAC TGGGGGTGCT GCAGGAGGAC TCGTCCAACA TCGTGGAGCT 1260

GCTGGAGGAG GCCTTCAATC GGATCCGCTC CAACCTGGAC ATCCGGGCCC TAGACAGCCC 1320 

CCGAGGCCTT CGGACAGAGG TCACCTCCAA GATGTTCCAG AAGACGAGGA CTGGGTCCTT 1380 

TCACATCCGG CGGGGGGAAG TGGGTATATA CCAGGTGCAG CTGCGGGCCC TTGAGCACGT 1440 
GCTGCAAAAA GAGGTGCGGT CAGCTCGCTG CAGCTTCAAC GGAGACTTCG TGTGCGGACA
GTGTGTGTGC AGCGAGGGCT GGAGTGGCCA GACCTGCAAC TGCTCCACCG GCTCTCTGAG 
TGACATTCAG CCCTGCCTGC GGGAGGGCGA GGACAAGCCG TGCTCCGGCC GTGGGGAGTG
CCAGTGCGGG CACTGTGTGT GCTACGGCGA AGGCCGCTAC GAGGGTCAGT TCTGCGAGTA
TGACAACTTC CAGTGTCCCC GCACTTCCGG GTTCCTGTGC AATGACCGAG GACGCTGCTC 
CATGGGCCAG TGTGTGTGTG AGCCTGGTTG GACAGGCCCA AGCTGTGACT G 
CAATGCCACC TGCATCGACA GCAATGGGGG CATCTGTAAT GGACGTGGCC A 
TGGCCGCTGC CACTGCCACC AGCAGTCGCT CTACACGGAC ACCATCTGCG AGATCAACTA 2040 
CTCGGCGATC CACCCGGGCC TCTGCGAGGA CCTACGCTCC TGCGTGCAGT GCCAGGCGTG 2100
GGGCACCGGC GAGAAGAAGG GGCGCACGTG TGAGGAATGC AACTTCAAGG TCAAGATGGT 2160 
GGACGAGCTT AAGAGAGCCG AGGAGGTGGT GGTGCGCTGC TCCTTCCGGG ACGAGGATGA 2220
CGACTGCACC TACAGCTACA CCATGGAAGG TGACGGCGCC CCTGGGCCCA ACAGCACTGT 2280 
CCTGGTGCAC AAGAAGAAGG ACTGCCCTCC GGGCTCCTTC TGGTGGCTCA TCCCCCTGCT 2340 
CCTCCTCCTC CTGCCGCTCC TGGCCCTGCT ACTGCTGCTA TGCTGGAAGT ACTGTGCCTG 2400 
CTGCAAGGCC TGCCTGGCAC TTCTCCCGTG CTGCAACCGA GGTCACATGG TGGGCTTTAA 2460 



GGGAACCTCA AGGGCCGTGA CGTGGTCCGC TGGAAGGTCA CCAACAACAT 2580 



CGGCTACTAC ACCCTCACTG CAGACCAGGA CGCCCGGGGC ATGGTGGAG? 



A AGGAGCAAGC CAGAGACGTG 
GTTCTCGGTC AGCCGCGGGG ACCAGGTGGC CCGCATCCCT 



TCAACTGGCT GCCCCCTTCT GGCAAGCCAA TGGGGTACAG 3660 



GAGCTGGGCT GAGCCGGCTG AGACCAACGG TGAGATCACA G 

CCTGGTCAAC GATGACAACC GACCTATTGG GCCCATGAAG AAAGTGCTGG TTGACAACCC 4020

TAAGAACCGG ATGCTGCTTA TTGAGAACCT TCGGGAGTCC CAGCCCTACC GCTACACGGT 4080

GAAGGCGCGC AACGGGGCCG GCTGGGGGCC TGAGCGGGAG GCCATCATCA ACCTGGCCAC 4140 

G AGGCCCATGT CCATCCCCAT CATCCCTGAC ATCCCTATCG TGGACGCCCA 4200



CGGAGAGCAC CTGGTGAATG GCCGGATGGA CTTTGCCTTC CCGGGCAGCA CCAACTCCCT
GCACAGGATG ACCACGACCA GTGCTGCTGC CTATGGCACC CACCTGAGCC CACACGTGCC 
CCACCGCGTG CTAAGCACAT CCTCCACCCT CACACGGGAC TACAACTCAC TGACCCGCTC 
AGAACACTCA CACTCGACCA CACTGCCCAG GGACTACTCC ACCCTCACCT CCGTCTCCTC
CCACGACTCT CGCCTGACTG CTGGTGTGCC CGACACGCCC ACCCGCCTGG TGTTCTCTGC 
CCTGGGGCCC ACATCTCTCA GAGTGAGCTG GCAGGAGCCG CGGTGCGAGC GGCCGCTGCA 
GGGCTACAGT GTGGAGTACC AGCTGCTGAA CGGCGGTGAG CTGCATCGGC TCAACATCCC 
CAACCCTGCC CAGACCTCGG TGGTGGTGGA AGACCTCCTG CCCAACCACT CCTACGTGTT
CCGCGTGCGG GCCCAGAGCC AGGAAGGCTG GGGCCGAGAG CGTGAGGGTG TCATCACCAT
TGAATCCCAG GTGCACCCGC AGAGCCCACT GTGTCCCCTG CCAGGCTCCG CCTTCACTTT 
GAGCACTCCC AGTGCCCCAG GCCCGCTGGT GTTCACTGCC CTGAGCCCAG ACTCGCTGCA
GCTGAGCTGG GAGCGGCCAC GGAGGCCCAA TGGGGATATC GTCGGCTACC TGGTGACCTG
TGAGATGGCC CAAGGAGGAG GGCCAGCCAC CGCATTCCGG GTGGATGGAG ACAGCCCCGA 
GAGCCGGCTG ACCGTGCCGG GCCTCAGCGA GAACGTGCCC TACAAGTTCA AGGTGCAGGC
CAGGACCACT GAGGGCTTCG GGCCAGAGCG CGAGGGCATC ATCACCATAG AGTCCCAGGA 
TGGAGGACCC TTCCCGCAGC TGGGCAGCCG TGCCGGGCTC TTCCAGCACC CGCTGCAAAG 



GCACTTAATA AATGGTTTTG CTACTGCTAA AAAAAAAAAA



Nucleic Acid Accession #: NM_023915
Coding sequence: 250.. 1326



GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC G 
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACGGTATGAG 120

= AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

Z AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC C 
TGTAGGTCAT T 

& TTACACTGAT 

T GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 
C TTAAAAAAAA AA 



T GGTTCTCCGG AGAGCAGCTT 
CCTTGGGTGT TACATGAGCC AAGCCCTCAC TGTACAGAAG AGTGAGAGCT GAAACCTGTT
CCCTGAGCTG ATCAGAAGGA CATCCCTTGG CCCCTCCATC TGGGCTCCTG TGGATAGGAG 



CTGGTCATCA GCCAGGGGGC TGACGGTCGA GGGAAGCCTG AGGTGGTATC G 

CGGGCTGAGG AGAGTGTGGT GCTGGGCTGT GACCTGCTGC CCCCGGCCGG CCGGCCCCCC 360

CTGCATGTCA TCGAGTGGCT GCGCTTTGGA TTCCTGCTTC CCATCTTCAT CCAGTTCGGC 420 

CTCTACTCTC CCCGAATTGA CCCTGATTAC GTGGGACGAG TCCGGCTGCA GAAGGGGGCC 480 

TCTCTCCAGA TTGAGGGTCT CCGGGTGGAA GACCAGGGCT GGTACGAGTG CCGCGTGTTC 540 

TTCCTGGACC AGCACATCCC TGAAGACGAT TTTGCTAACG GCTCCTGGGT GCATCTGACA 600 

C CCCCTCAATT CCAGGAGACA CCTCCTGCTG TGTTGGAAGT GCAGGAACTG 660 

A CCCTGCGTTG TGTGGCCCGT GGCAGCCCCC TGCCTCATGT GACGTGGAAG 720

CTCCGAGGAA AGGACCTTGG CCAGGGCCAG GGCCAGGTGC AAGTGCAGAA CGGGACGCTG 780

CGGATCCGCC GGGTAGAGCG AGGCAGCTCT GGGGTCTACA CCTGCCAAGC CTCCAGCACT 840 

GAGGGCAGCG CCACCCACGC CACCCAGCTG CTAGTGCTAG GACCCCCAGT CATCGTGGTG 900 

CCCCCCAAGA ACAGCACAGT CAATGCCTCC CAGGATGTTT CATTGGCCTG CCATGCTGAG 960 

GCATACCCTG CTAACCTCAC CTACAGCTGG TTCCAGGACA ACATCAATGT CTTCCACATT 1020

AGCCGCCTGC AGCCCCGGGT GCAGATCCTG GTGGACGGGA GCCTGCGGCT GCTGGCCACC 1080

CAGCCTGATG ATGCCGGCTG CTACACCTGT GTGCCCAGCA ATGGCCTCCT GCATCCACCC 1140 

TCAGCCTCTG CCTACCTCAC TGTGCTCTGC ATGCCGGGGG TGATCCGCTG CCCGGTTCGT 1200

GCCAACCCCC CACTGCTCTT TGTCAGCTGG ACCAAGGATG GAAAGGCCCT GCAGCTGGAC 1260

AAGTTCCCTG GCTGGTCCCA GGGCACAGAA GGCTCACTGA TCATCGCCCT GGGGAACGAG 1320 

GATGCCCTGG GAGAATACTC CTGCACCCCC TACAACAGTC TTGGTACCGC CGGGCCCTCT 1380

CCTGTGACCC GCGTGCTGCT CAAGGCTCCC CCAGCTTTTA TAGAGCGGCC CAAGGAAGAA 1440 

TATTTCCAAG AAGTAGGGCG GGAGCTGCTC ATCCCCTGCT CCGCCCAAGG GGACCCTCCT 1500

CCTGTTGTCT CTTGGACCAA GGTGGGCCGG GGGCTGCAAG GCCAGGCCCA GGTGGACAGC 1560

AACAGCAGCC TCATCCTGCG ACCATTGACC AAGGAGGCCC ACGGGCACTG GGAATGCAGT 1620

GCCAGCAATG CTGTGGCCCG AGTGGCCACC TCCACGAACG TCTACGTGCT GGGCACTAGC 1680 

CCTCATGTTG TCACCAATGT GTCCGTGGTG GCTTTGCCCA AGGGTGCCAA TGTCTCCTGG 1740 

GAGCCTGGCT TTGATGGTGG TTATCTGCAG AGATTCAGTG TCTGGTACAC CCCACTGGCC 1800

G ACCGAATGCA CCATGACTGG GTGTCCTTGG CAGTGCCTGT GGGGGCTGCT 1860 

G TGCCAGGGCT GCAGCCCCAC ACCCAGTACC AGTTCAGCGT GCTAGCTCAG 1920

AACAAGCTGG GGAGTGGTCC CTTCAGCGAA ATCGTCTTGT CTGCTCCGGA AGGGCTTCCT 1980 



G TGGCAGTGAG GACACCCCGG GGGGTACTCC TGCATTGGGA TCCCCCAGAG
CTGGTCCCTA AGAGACTGGA TGGCTACGTC TTGGAAGGCC GGCAAGGCTC CCAGGGCTGG 
GAGGTGCTGG ACCCGGCTGT GGCAGGCACA GAAACAGAGC TGCTGGTGCC AGGCCTCATC 
AAGGATGTTC TCTACGAGTT CCGCCTCGTG GCCTTCGCGG GCAGCTTCGT CAGCGACCCC 
AGCAACACGG CCAACGTCTC CACTTCCGGT CTGGAGGTCT ACCCTTCGCG CACGCAGCTG 



CGCCGCCGCA AGCGCCTCCG CCAAGATCCA CCTCTTATCT TCTCTCCGAC CGGGAAGTCA 2520 

GCTGCACCCT CTGCTCTGGG CTCAGGCAGT CCTGACAGCG TGGCGAAGCT GAAGCTCCAG 2580 

GGATCCCCAG TCCCCAGCCT GCGCCAGAGT CTGCTCTGGG GGGATCCTGC CGGAACTCCC 2640
AGCCCCCACC CGGATCCTCC AT 

AGCAGCAGCC CCAGTGGGGC ACCCCAGCCC CTCTGCATTG AAGACATCAG CCCTGTGGCA 2880

CCCCCTCCAG CAGCCCCACC CAGTCCCTTG CCAGGTCCTG GACCCCTGCT CCAGTACCTG 2940

AGCCTGCCCT TCTTCCGAGA GATGAATGTG GATGGGGACT GGCCCCCGCT TGAGGAGCCC 3000 

AGCCCTGCTG CACCCCCAGA TTACATGGAT ACCCGGCGCT GTCCCACCTC ATCTTTCCTT 3060

CGTTCTCCAG AAACCCCTCC TGTATCCCCC AGGGAATCAC TTCCTGGGGC TGTGGTAGGG 3120 

GCTGGGGCCA CTGCAGAGCC CCCTTACACA GCCCTGGCTG ACTGGACACT GAGGGAGCGG 3180 

CTGCTGCCAG GCCTTCTCCC TGCTGCCCCT CGAGGCAGCC TCACCAGCCA GAGCAGCGGG 3240
T GCGGCCCCCC TCCACAGCCC CCTCTGCAGG AGGCAGCTAC
G CTCCAGGAGA CACCAGCAGC TGGGCCAGTG GCCCTGAGAG ATGGCCCCGA 
AGGGAGCATG TGGTGACAGT CAGCAAGAGG AGGAACACAT CTGTGGACGA GAACTATGAG
TGGGACTCAG AATTCCCTGG GGACATGGAA TTGCTGGAGA CTTTGCACCT GGGCTTGGCC 
AGCTCCCGGC TCAGACCTGA AGCTGAGACA GAGCTAGGTG TGAAGACTCC AGAGGAGGGC 
TGCCTCCTGA ACACTGCCCA TGTTACTGGC CCTGAGGCCC GCTGTGCTGC CCTTCGGGAG 
GAATTCCTGG CCTTCCGCCG CCGCCGAGAT GCTACTAGGG CTCGGCTACC AGCCTATCGA 
CAGCCAGTCC CCCACCCCGA ACAGGCCACT CTGCTGTGAA CATCCCTAAT GTGAGGCTGT 
GAAAAGGCAT ATGGACCTGC AAAGGAGGCC CCCAACCAGA CAGACCTAGT TTCAAACGAG 
GGCACTGCCC CTGCCTGCCC CTTTGGTGCC CAGGCACAGA CCCTGATAGT GGGTTTGGGT 
CACCTTGGTA TGGAATGTAT GTGCTGACCC CCTAGGTGAG TCTGGGGATT GGAACAGGGA 
TCTTAGGTCT GCCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTGT GTGTGTGTGT 
GTGTGTGAAG TTTTTTACAG GTGAATAAAC AAAGTTTGAA AGAAAAAAAA AAAAAAAAAA 



Seq ID NO: C94 DNA Sequence 
Nucleic Acid Accession #: NM_006875
Coding sequence: 186.. 1190 




CATGCCCACT CGAAGTCGCA C 

TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT C 
CTTTGCCCGC CCAGGATCTC T 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 720

GGACAAGGGT GTACAGCCCC CCAGAGTGGA TCTCTCGACA CCAGTACCAT GCACTCCCGG 840 

CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900 

AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020

AGATCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCCCTCC 1080

AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200 

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260

ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 1320

CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA CTTCTTGCTT 1380 

CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 1440 

TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500 

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620

TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680

TTTTTTTTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740 

CTGGTGAGAA GAACCTTAAT TCCATAATTT GGGAAGGAAT GGAAGATGGA CACCACCGGA 1800

CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG 1860 

GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920

CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1980 

TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040

CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 2088



1 11 21 31 41 51 

| | | | | |

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 
CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT
TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 
CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 
TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA
AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 
GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 
CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 
ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 
TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC T 
A TGTTCCCATC G 



r GATGTCCTGA 



CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC
CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 
TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 
G GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 
GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA GCATTCCCAC
GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 
CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 
GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 
GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 
CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 
CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 
TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 
ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 
TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 
TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGTCAT 
TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA
T AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG 



AGCCTAACCC 



TGACAACCTA 
GGACATTTAG 
ATTTCCAAAT 

h TCTACTATAT 



ATGCGAATCC T 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG C 
ACAGGAGCAC T 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 
AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 
ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 
TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 
GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCA 
GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 
ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 
GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT
CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 
ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 
GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 



GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440 

CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560 

CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620

GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680 

AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740

ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800 

GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860

GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 

A AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 

C CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 

ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 

JTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 

A CACCTCATGC TTTTACCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220

GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 



TGTCAGCCCT GACTTT1 
TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 



T TTCCAAAGCA TGTTGCAGAT T 
TTTGAGGAAG TGCAGAGCTG TACTGTTGAC TTAGGTATTA CAGCAGACAG CTCCAACCAC 
CCAGACAACA AGCACAAGAA TCGATACATA AATATCGTTG C 
AAGCTAGCAC AGCTTGCTGA AAAGGATGGC A 
GTTGATGGCT ACAACAGACC AAAAGCTTAT ATTGCTGCCC AAGGCCCACT GAAATCCACA 
GCTGAAGATT TCTGGAGAAT GATATGGGAA CATAATGTGG AAGTTATTGT CATGATAACA 



GAGTACGGGA ACTTTCTGGT CACTCAGAAG AGTGTGCAAG TGCTTGCCTA 
AGGAATTTTA CTCTAAGAAA CACAAAAATA AAAAAGGGCT CCCAGAAAGG AAGACCCAGT 
GGACGTGTGG TCACACAGTA TCACTACACG CAGTGGCCTG ACATGGGAGT ACCAGAGTAC 
TCCCTGCCAG TGCTGACCTT TGTGAGAAAG GCAGCCTATG CCAAGCGCCA TGCAGTGGGG 









ATCCGTTCAC AAAGAAATTA TTTGGTACAA ACTGAGGAGC AATATGTCTT CATTCATGAT
ACACTGGTTG AGGCCATACT TAGTAAAGAA ACTGAGGTGC TGGACAGTCA TATTCATGCC 
TATGTTAATG CACTCCTCAT TCCTGGACCA GCAGGCAAAA CAAAGCTAGA GAAACAATTC 

CAGCTCCTGA GCCAGTCAAA TATACAGCAG AGTGACTATT CTGCAGCCCT AAAGCAATGC 3540

AACAGGGAAA AGAATCGAAC TTCTTCTATC ATCCCTGTGG AAAGATCAAG GGTTGGCATT 3600 

TCATCCCTGA GTGGAGAAGG CACAGACTAC ATCAATGCCT CCTATATCAT GGGCTATTAC 3660 

CAGAGCAATG AATTCATCAT TACCCAGCAC CCTCTCCTTC ATACCATCAA GGATTTCTGG 3720

AGGATGATAT GGGACCATAA TGCCCAACTG GTGGTTATGA TTCCTGATGG CCAAAACATG 3780

GCAGAAGATG AATTTGTTTA CTGGCCAAAT AAAGATGAGC CTATAAATTG TGAGAGCTTT 3840 

AAGGTCACTC TTATGGCTGA AGAACACAAA TGTCTATCTA ATGAGGAAAA ACTTATAATT 3900

CAGGACTTTA TCTTAGAAGC TACACAGGAT GATTATGTAC TTGAAGTGAG GCACTTTCAG 3960

TGTCCTAAAT GGCCAAATCC AGATAGCCCC ATTAGTAAAA CTTTTGAACT TATAAGTGTT 4020

ATAAAAGAAG AAGCTGCCAA TAGGGATGGG CCTATGATTG TTCATGATGA GCATGGAGGA 4080 

GTGACGGCAG GAACTTTCTG TGCTCTGACA ACCCTTATGC ACCAACTAGA AAAAGAAAAT 4140 

TCCGTGGATG TTTACCAGGT AGCCAAGATG ATCAATCTGA TGAGGCCAGG AGTCTTTGCT 4200

GACATTGAGC AGTATCAGTT TCTCTACAAA GTGATCCTCA GCCTTGTGAG CACAAGGCAG 4260

GAAGAGAATC CATCCACCTC TCTGGACAGT AATGGTGCAG CATTGCCTGA TGGAAATATA 4320 

GCTGAGAGCT TAGAGTCTTT AGTTTAA 4347 






TGAGAAGCAG AGGAGCCGCA 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT A 



CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 



GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT



TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTCAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 



GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 




ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC C 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC A 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA T 
TTTCGCCATC T 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG G 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC A 
TTTGGTAGTG A 
AGCAGTGATG CCATGATGCA T 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC T 
CCTAAGTCTT C 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 



TCTGTGTTTG GTGATGATAA TAAGGCGCTT T 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC C 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 



A CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

A TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380

G GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 



TCCCAAAAGC A 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA G 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA
GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 
GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT
A TAGTAGCCAT GAGTCTCGTA T 



CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 

GTGGGTTTAC T 
CAGAGCTGTA CTGTTGJ 
CACAAGAATC GATACATAAA T 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCACAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGCCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 
TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT C 
GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT A 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320

G AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

ITGTTTG TTTTTAGTGT CAAATTTTTA 7440

T CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

A TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560

^TAGTGT CTCCATGGAC CAAATTTATA 7620 

A CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800

T CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

I AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 



Seq ID NO: C98 DNA Sequence 
Nucleic Acid Accession #: NM_002851
Coding sequence: 77..4518



CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTCCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460

TTGGAATCCC AGAAGAACGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATCCTTCCA GACTGCACAC 2580

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 
CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860
TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 
TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980
AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040
ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 
CTAAAATTAG GCAGGAAAAT CAGTCTA 
AGTAACTTTC ATGACATAGG ATTCTGCCGC C 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT T 



GTATTTGTAG CAATTATCAG GTTTGCTAGA A 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 



\ TTAGCTGGTC 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA 



I I I I I 

C TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 6 0 

G GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTC CAAAACACAT TCATTCATAA CACTGGGAAA 300

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCG 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AGGCCAGTAA TAGTAGCCAT 1260

GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG GAATCCGAGA AGAAGGCAGT TATACCCCTT 1320

GTGATCGTGT CAGCCCTGAC TTTTATCTGT CTAGTGGTTC TTGTGGGTAT TCTCATCTAC 1380 

r GCTTCCAGAC TGCACACTTT TACTTAGAGG ACAGTACATC CCCTAGAGTT 1440 

C CTCCAACACC TATCTTTCCA ATTTCAGATG ATGTCGGAGC AATTCCAATA 1500

AAGCACTTTC CAAAGCATGT TGCAGATTTA CATGCAAGTA GTGGGTTTAC TGAAGAATTT 1560

A CTGTTGACTT AGGTATTACA 1620 

TATGATCATA GCAGGGTTAA GCTAGCACAG CTTGCTGAAA AGGATGGCAA ACTGACTGAT 1740 
TATATCAATG CCAATTATGT TGATGGCTAC AACAGACCAA AAGCTT? 
GGCCCACTGA AATCCACAGC TGAAGATTTC TGGAGAATGA TATGGGAACA T 
GTTATTGTCA TGATAACAAA CCTCGTGGAG AAAGGAAGGA GAAAATGTGA TCAGTACTGG 
A GTACGGGAAC TTTCTGGTCA CTCAGAAGAG T 



CAGAAAGGAA GACCCAGTGG ACGTGTGGTC ACACAGTATC ACTACACGCA GTGGCCTGAC 
: CCTGCCAGTG CTGACCTTTG TGAGAAAGGC A 



3 TATGTTGCAG CAGATTCAAC ACGAAGGAAC TGTCAACATA 
TTTGGCTTCT TAAAACACAT CCGTTCACAA AGAAATTATT TGGTACAAAC TGAGGAGCAA 



7AATGCA CTCCTCATTC CTGGACCAGC A 1 
AAGCTAGAGA AACAATTCCA GCTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT C 
AG AT CAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAG 
TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CTCAGCACGC TCTCCTTCAT



C AAAACATGGC AGAAGATGAA TTTGTTTACT GGCCAAATAA AGATGAGCCT 2820 
3 AGAGCTTTAA GGTCACTCTT ATGGCTGAAG AACACAAATG TCTATCTAAT 2880
GAGGAAAAAC TTATAATTCA GGACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 2940



TTTGAACTTA TAAGTGTTAT AAAAGAAGAA GCTGCCAATA GGGATGGGCC TATGATTGTT 3060 

CATGATGAGC ATGGAGGAGT GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC 3120

CAACTAGAAA AAGAAAATTC CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG 3180 

AGGCCAGGAG TCTTTGCTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 3240

CTTGTGAGCA CAAGGCAGGA AGAGAATCCA TCTACCTCTC TGGACAGTAA TGGTGCAGCA 3300 

TTGCCTGATG GAAATATAGC TGAGAGCTTA GAGTCTTTAG 3340 



I I I I I I 

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCC 
TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 
ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 
CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 
AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 
ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 



T TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCA 
3TTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 
ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGAT 
GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCA1 
CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGTT CATTGACATC T 
ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTC 
GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 
TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT T ~ 



GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT C

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200

AGCGACCAAC TGATTGTCGA CATCCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380



; ACCTCACACT GTGGAAGGTA ( TTt ( 
GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC A 
AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT
ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 
GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT
GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 
GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC T 
ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA T 
ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC A 
CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160
ACTGAGGTAA CACCTCATGC TTTTATCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220
GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 
CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 
CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 
TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460
GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520
ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580 
TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640
ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 
GCCCATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760
GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 
CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880 
GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 
TGGCCTGCCG ATGGGAGTGA GGAGTACGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000
GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060
TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120 
GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180 
GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACACGAAGG AACTGTCAAC 3300 
ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360
CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480 
ACAAAGCTAG AGAAACAATT CCAGGGTCTC ACTCTGTCAC CCAGGCTGGA GTGCAGAGGC 3540
ACAATCTCGG CTCACTGCAA CCTTCCTCTC CCTGGCTTAA CTGATCCTCC TACCTCAGCC 3600
TCCCGAGTGG CTAGGACTAT ACTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 3660
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT CCCTGTGGAA 3720
AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT CAATGCCTCC 3780
TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CCCAGCACCC TCTCCTTCAT 3840 
■ ■ 3 GACCATAATG CCCAACTGGT GGTTATGATT 3 900 
CCTGATGGCC AAAACATGGC AGAAGATGAA TTTGTTTACT GGCCAAATAA A
ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATGGCTGAAG AACACAAATG TCTATCTAAT 
GAGGAAAAAC TTATAATTCA GGACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 
GAAGTGAGGC ACTTTCAGTG TCCTAAATGG CCAAATCCAG ATAGCCCCAT TAGTAAAACT 
TTTGAACTTA TAAGTGTTAT AAAAGAAGAA GCTGCCAATA GGGATGGGCC TATGATTGTT
CATGATGAGC ATGGAGGAGT GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC
CAACTAGAAA AAGAAAATTC CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG
AGGCCAGGAG TCTTTGCTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 
A CAAGGCAGGA AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA
C TGAGAGCTTA GAGTCTTTAG 

A Sequence 



I I I I I I 

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 
TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 
ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA
CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 
AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 
ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 
TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 
GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCG
GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 
ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 
GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT G
CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT C
ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA G

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 
A ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 
3 AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 
ft ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 



ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 
AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 
AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 
TTAATTGGAA CTCAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 
CGATCCCCAA CAAGACGAAG TGAATTCTCT GCAAA^r^ I A TACATCTTTA 

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 
CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 

GGCTCTAAAA CTCTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC A

AATACAGTTT CTATAACAGA ATATGAGGAG CAGAGTTTAT TGACCAGTTT ' 



C CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920

GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 

ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040

ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 

CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 

ACTGAGGTAA CACCTCATGC TTTTACCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 

GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280

CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 

CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 

TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460 

GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520 

ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580

TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640

ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 

GCCTATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760

GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820

CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880

GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 

TGGCCTGCCG ATGGGAGTGA GGAGTACGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000 

GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060 

TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120

GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180

GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 

GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACACGAAGG AACTGTCAAC 3300

ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360 

CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420

CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480

ACAAAGCTAG AGAAACAATT CCAGCTCCTG AGCCAGTCAA ATATACAGCA GAGTGACTAT 3540 

TCTGCAGCCC TAAAGCAATG CAACAGGGAA AAGAATCGAA CTTCTTCTAT C " 
G AAAGAT CAA GGGTTGGCAT TTCATCCCTG AGTGGAGAAG G ~ 
TCCTATATCA TGGGCTATTA CCAGAGCAAT GAATTCATCA T
CATACCATCA AGGATTTCTG GAGGATGATA TGGGACCATA ATGCCCAACT GGTGGTTATG
ATTCCTGATG GCCAAAACAT GGCAGAAGAT GAATTTGTTT ACTGGCCAAA TAAAGATGAG 
CCTATAAATT GTGAGAGCTT TAAGGTCACT CTTATGGCTG AAGAACACAA 
AATGAGGAAA AACTTATAAT TCAGGACTTT ATCTTAGAAG CTACACAGGC ATGGAGGAGT 
GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC C 
Nucleic Acid Accession #: XM_002914.6



1 11 21 31 41 51 

I I I I I I 

ATGAAGGATA TCGACATAGG AAAAGAGTAT ATCATCCCCA GTCCTGGGTA TAGAAGTGTG 

AGGGAGAGAA CCAGCACTTC TGGGACGCAC AGAGACCGTG AAGATTCCAA GTTCAGGAGA 

ACTCGACCGT TGGAATGCCA AGATGCCTTG GAAACAGCAG CCCGAGCCGA GGGCCTCTCT 

CTTGATGCCT CCATGCATTC TCAGCTCAGA ATCCTGGATG AGGAGCATCC CAAGGGAAAG 

TACCATCATG GCTTGAGTGC TCTGAAGCCC ATCCGGACTA CTTCCAAACA CCAGCACCCA 

GTGGACAATG CTGGGCTTTT TTCCTGTATG ACTTTTTCGT GGCTTTCTTC TCTGGCCCGT 

GTGGCCCACA AGAAGGGGGA GCTCTCAATG GAAGACGTGT GGTCTCTGTC CAAGCACGAG 

TCTTCTGACG TGAACTGCAG AAGACTAGAG AGACTGTGGC AAGAAGAGCT GAATGAAGTT 

GGGCCAGACG CTGCTTCCCT GCGAAGGGTT GTGTGGATCT TCTGCCGCAC CAGGCTCATC

CTGTCCATCG TGTGCCTGAT GATCACGCAG CTGGCTGGCT TCAGTGGACC AGCCTTCATG 

TGGAGTA TACCCAGGCA ACAGAGTCTA ACCTGCAGTA CAGCTTGTTG 

G GCCTCCTCCT GACGGAAATC GTGCGGTCTT GGTCGCTTGC A 




ATCACTGTGG G 
CATATGACCC TGGGCTTCGA T 
AATTCCATGA C 

TCAGTGGCTG TTGACAGATT TAAGAGTTTG TTTCTAATGG AAGAGGTTCA CATGATAAAG 
AACAAACCAG CCAGTCCTCA CATCAAGATA GAGATGAAAA ATGCCACCTT G 
TCCTCCCACT CCAGTATCCA GAACTCGCCC AAGCTGACCC C 

AGGGCTTCCA GGGGCAAGAA AGAGAAGGTG AGGCAGCTGC AGCGCACTGA GCATCAGGCG 1620

GTGCTGGCAG AGCAGAAAGG CCACCTCCTC CTGGACAGTG ACGAGCGGCC CAGTCCCGAA 1680 

GAGGAAGAAG GCAAGCACAT CCACCTGGGC CACCTGCGCT TACAGAGGAC ACTGCACAGC 1740 

ATCGATCTGG AGATCCAAGA GGGTAAACTG GTTGGAATCT GCGGCAGTGT GGGAAGTGGA 1800

AAAACCTCTC TCATTTCAGC CATTTTAGGC CAGATGACGC TTCTAGAGGG CAGCATTGCA 1860

ATCAGTGGAA CCTTCGCTTA TGTGGCCCAG CAGGCCTGGA TCCTCAATGC TACTCTGAGA 1920 

GACAACATCC TGTTTGGGAA GGAATATGAT GAAGAAAGAT ACAACTCTGT GCTGAACAGC 1980 

TGCTGCCTGA GGCCTGACCT GGCCATTCTT CCCAGCAGCG ACCTGACGGA GATTGGAGAG 2040

CGAGGACCCA ACCTGAGCGG TGGGCAGCGC CAGAGGATCA GCCTTGCCCG GGCCTTGTAT 2100

AGTGACAGGA GCATCTACAT CCTGGACGAC CCCCTCAGTG CCTTAGATGC CCATGTGGGC 2160 

AACCACATCT TCAATAGTGC TATCCGGAAA CATCTCAAGT CCAAGACAGT TCTGTTTGTT 2220

ACCCACCAGT TACAGTACCT GGTTGACTGT GATGAAGTGA TCTTCATGAA AGAGGGCTGT 2280 

ATTACGGAAA GAGGCACCCA TGAGGAACTG ATGAATTTAA ATGGTGACTA TGCTACCATT 2340 

TTTAATAACC TGTTGCTGGG AGAGACACCG CCAGTTGAGA TCAATTCAAA AAAGGAAACC 2400 

AGTGGTTCAC AGAAGAAGTC ACAAGACAAG GGTCCTAAAA CAGGATCAGT AAAGAAGGAA 2460

AAAGCAGTAA AGCCAGAGGA AGGGCAGCTT GTGCAGCTGG AAGAAAAAGG GCAGGGTTCA 2520 

GTGCCCTGGT CAGTATATGG TGTCTACATC CAGGCTGCTG GGGGCCCCTT GGCATTCCTG 2580 

GTTATTATGG CCCTTTTCAT GCTGAATGTA GGCAGCACCG CCTTCAGCAC CTGGTGGTTG 2640

AGTTACTGGA TCAAGCAAGG AAGCGGGAAC ACCACTGTGA CTCGAGGGAA CGAGACCTCG 2700

GTGAGTGACA GCATGAAGGA CAATCCTCAT ATGCAGTACT ATGCCAGCAT CTACGCCCTC 2760 

TCCATGGCAG TCATGCTGAT CCTGAAAGCC ATTCGAGGAG TTGTCTTTGT CAAGGGCACG 2820 

CTGCGAGCTT CCTCCCGGCT GCATGACGAG CTTTTCCGAA GGATCCTTCG AAGCCCTATG 2880 

AAGTTTTTTG ACACGACCCC CACAGGGAGG ATTCTCAACA GGTTTTCCAA AGACATGGAT 2940

GAAGTTGACG TGCGGCTGCC GTTCCAGGCC GAGATGTTCA TCCAGAACGT TATCCTGGTG 3000 

TTCTTCTGTG TGGGAATGAT CGCAGGAGTC TTCCCGTGGT TCCTTGTGGC AGTGGGGCCC 3060 

CTTGTCATCC TCTTTTCAGT CCTGCACATT GTCTCCAGGG TCCTGATTCG GGAGCTGAAG 3120 

CGTCTGGACA ATATCACGCA GTCACCTTTC CTCTCCCACA TCACGTCCAG CATACAGGGC 3180 

CTTGCCACCA TCCACGCCTA CAATAAAGGG CAGGAGTTTC TGCACAGATA CCAGGAGCTG 3240 

CTGGATGACA ACCAAGCTCC TTTTTTTTTG TTTACGTGTG CGATGCGGTG GCTGGCTGTG 3300 
CGGCTGGACC TCATCAGCAT CGCCCTCATC A 



A TCTGAGACAG AAGCTCGATT CACCTCGGTG 
A AT CACTACAT TAAGACTCTG TCCTTGGAAG CACCTGCCAG AATTAAGAAC 
AAGGCTCCCT CCCCTGACTG GCCCCAGGAG GGAGAGGTGA CCTTTGAGAA CGCAGAGATG 
T CCTTCACGAT C 

3 GCATTGTGGG GCGGACAGGA TCAGGGAAGT CCTCGCTGGG G 

3 AATCAGTGAT 
C CTCAAGAGGC GGTGCTGTTC 
AGTGGCACTG TCAGATCAAA TTTGGACCCC TTCAACCAGT ACACTGAAGA CCAGATTTGG 



C TCAGTGGGGG 
& CTGTAAGATT CTGATTTTAG 
GACACAGAGA CAGACTTATT GATTCAAGAG ACCATCCGAG AAGCAT' 
ATGCTGACC7A TTGCCCATCG CCTGCACACG GTTCTAGGCT 
GCCCAGGGAC AGGTGGTGGA GTTTGACACC CCATCGGTCC TTCTGTCCAA 
CGATTCTATG CCATGTTTGC TGCTGCAGAG AACAAGGTCG CTGTCAAGGG 
Seq ID NO: C105 DNA Sequence

Nucleic Acid Accession #: NM_005688.1

Coding sequence: 126..4439

1 11 21 31 41 51 

1 I I I I I 

I GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCC-CGGTT GTCCTGGAGC 
T GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 
A CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 
A TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 
^ AGCCCATCCG GACTACTTCC AAACACCAGC 



C CTGATGATCA 
A ACACCTCTTG GAGTATACCC AGGCAACAGA 
TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG
CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA 
TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 

C TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 
TTTACCCAGC AATGATGTTT GCATCACGGC 
CCGCCACGGA TGAACGTGTC CAGAAGATGA 



GAGGAGGAGC GTCGGATATT GGAAAAAGCC 
TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 



GGGACTCCTC CCACTCCAGT ATCCAGAACT C .. 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC A
AGGCGGTGCT GGCAGAGCAG AAAGGCCACC T 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTCGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 192 0 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040

TGAGACACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGG A J G CACAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600
CGGTGGAGAG GATCAATCAC T. 



3 ACACACATGA AAGAATGTAT TGCTCAGCTA C 
TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 
GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGCT"" :T t ^ 7 C - TT : 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 
GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT T. 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT A 
TCTTACTGGA GCAGAAGCTT TCCCGAGCCA



AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT G 
ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT A 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 
CGAGTAC CTACTGTGTG CCAGGGGCTG 
T CATAGAGTTG ATTGTCTAGT G 
ATTTTTA AAAAATAAAT TTAAACTTAC A 



CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 



Coding sequence: 221..856

i r r r r 

GAATTCGGCA CGAGGCCTCG TGCCGGGGAG CAACCTCAGC TTC 
CGCCGCCCCG GGCGCGGACC CCAACCCCGA CCCAGAGCTT CTCCAGCGGC GGCGCAGCGA 
GCAGGGCTCC CCGCCTTAAC TTCCTCCGCG GGGCCCAGCC ACCTTCGGGA GTCCGGGTTG 
CCCACCTGCA AACTCTCCGC CTTCTGCACC TGCCACCCCT GAGCCAGCGC G 
GCGAGTCATG GCCAACGCGG GGCTGCAGCT GTTGGGCTTC ATTCTCGCCT T 

jTCACCA CTCCCCTCCC CCAGTGGAGG ATTTACTCCT ATGCCGGCGA 
G ACCGCCCAGG CCATGTACGA GGGGCTGTGG ATGTCCTGCG TGTCGCAGAG 
yiGTGCA AAGTCTTTGA CTCCTTGCTG AATCTGAGCA GCACATTGCA 
AGCAACCCGT CCCTTCATCC TGGTTGCCAT CCTCCTGGGA GTGATAGCAA TCTTTGTGGC 
CACCGTTGGC ATGAAGTGTA TGAAGTGCTT GGAAGACGAT GAGGTGCAGA AGATGAGGAT 
GGCTGTCATT GGGGGCGCGA TATTTCTTCT TGCAGGTCTG GCTATTTTAG TTGCCACAGC 
ATGGTATGGC AATAGAATCG TTCAAGAATT CTATGACCCT ATGACCCCAG TCAATGCCAG
GTACGAATTT GGTCAGGCTC TCTTCACTGG CTGGGCTGCT GCTTCTCTCT GCCTTCTGGG
AGGTGCCCTA CTTTGCTGTT CCTGTCCCCG AAAAACAACC TCTTACCCAA CACCAAGGCC 
CTATCCAAAA CCTGCACCTT CCAGCGGGAA AGACTACGTG TGACACAGAG GCAAAAGGAG
AAAATCATGT TGAAACAAAC CGAAAATGGA CATTGAGATA CTATCATTAA CATTAGGACC 
TTAGAATTTT GGGTATTGTA ATCTAAAGTA TGTTATTACA AAACAAACAA ACAAACAAAA
AACCCATGTG TTAAAATACT CAGTGCTAAA CATGGCTTAA TCTTATTTTA TCTTCTTTCC 
TCAATATAGG AGGGAAGATT TTTCCATTTG TATTAGTGCT TCCCATTGAG TAATCATACT 
CAAATGGGGG AAGGGGTGCT CCTTAAATAT ATATAGATAT GTATATATAC ATGTTTTTCT
ATTAAAAATA GCCAGTAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: C108 DNA Sequence 
Nucleic Acid Accession #: AF508964.1 
Coding sequence: S8..1531 

1 11 21 31 41 51 

I I I I I I 

CAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGTGGG GGAAAGCGGC CGGTGCAGCG 
CGGGGACAGG CACTCGGGCT GGCACTGGCT GCTAGGGATG TCGTCCTGGA TAAGGTGGCA
TGGACCCGCC ATGGCGCGGC TCTGGGGCTT CTGCTGGCTG G 
CGCTTTCGCC TGTCCCACGT CCTGCAAATG CAGTGCCTCT CGGATCTGGT G 
TTCTCCTGGC ATCGTGGCAT TTCCGAGATT GGAGCCTAAC AGTGTAGATC CTGAGAACAT 
CACCGAAATT TTCATCGCAA ACCAGAAAAG G 
AGCTTATGTG GGACTGAGAA ATCTGACAAT TGTGGATTCT G 

GAGTTTGTCT AGGAAACATT TCCGTCACCT T 



TCCAGACACT CAGGATTTGT ACTGCCTGAA TGAAAGCAGC A 
CCTGCAGATA CCCAATTGTG GTTTGCCATC TGCAAATCTG GCCGCACCTA ACCTCACTGT
GGAGGAAGGA AAGTCTATCA CATTATCCTG TAGTGTGGCA G 
GTATTGGGAT GTTGGTAACC TGGTTTCCAA ACATATGAAT GAAACAAGCC A 
CTCCTTAAGG ATAACTAACA TTTCATCCGA TGACAGTGGG AAGCAGATCT CTTGTGTGGC 
GGAAAATCTT GTAGGAGAAG ATCAAGATTC TGTCAACCTC ACTGTGCATT TTGCACCAAC 
TATCACATTT CTCGAATCTC CAACCTCAGA CCACCACTGG TGCATTCCAT TCACTGTGAA 
AGGCAACCCC AAACCAGCGC TTCAGTGGTT CTATAACGGG GCAATATTGA ATGAGTCCAA 
ATACATCTGT ACTAAAATAC ATGTTACCAA TCACACGGAG TACCACGGCT GCCTCCAGCT 
GGATAATCCC ACTCACATGA ACAATGGGGA CTACACTCTA ATAGCCAAGA A

GAAGGATGAG AAACAGATTT CTGCTCACTT CATGGGCTGG CCTGGAATTG ACGATGGTGC 1260 

AAACCCAAAT TATCCTGATG TAATTTATGA AGATTATGGA ACTGCAGCGA ATGACATCGG 1320 

GGACACCACG AACAGAAGTA ATGAAATCCC TTCCACAGAC GTCACTGATA AAACCGGTCG 1380 

GGAACATCTC TCGGTCTATG CTGTGGTGGT GATTGCGTCT GTGGTGGGAT TTTGCCTTTT 1440

GGTAATGCTG TTTCTGCTTA AGTTGGCAAG ACACTCCAAG TTTGGCATGA AAGGTTTTGT 1500

TTTGTTTCAT AAGATCCCAC TGGATGGGTA GCTGAAATAA AGGAAAAGAC AGAGAAAGGG 1560 

GCTGTGGTGC TTGTTGGTTG ATGCTGCCAT GTAAGCTGGA CTCCTGGGAC TGCTGTTGGC 1620 

TTATCCCGGG AAGTGCTGCT TATCTGGGGT TTTCTGGTAG ATGTGGGCGG TGTTTGGAGG 1680

CTGTACTATA TGAAGCCTGC ATATACTGTG AGCTGTGATT GGGGAACACC AATGCAGAGG 1740 

TAACTCTCAG GCAGCTAAGC AGCACCTCAA GAAAACATGT TAAATTAATG CTTCTCTTCT 1800 

TACAGTAGTT CAAATACAAA ACTGAAATGA AATCCCATTG GATTGTACTT CTCTTCTGAA 1860 

AAGTGTGCTT TTTGACCCTA CTGGACATTT ATTGACTTAA TTGCTTCTGT TTATTAAAAT 1920 

TGACCTGCAA AGTTAAAAAA AAATTAAAGT TGAGAACAGG TATAAGTGCA CACTGAATAG 1980 

TCTAATCTAC ATGTAACACA TATTTTAGTA TGATTTTCTA TACTCTAATC AGCACTGAAT 2040 

TCAGAGGGTT TGACTTTTTC ATCTATAACA CAGTGACTAA AAGAGTTAAG GGTATATATA 2100 

CCATCACTTT GGGACTTGGT AGTATTATTA AAAGGTTATT TCCTTCACTG TCAATAAAAG 2160 

2280 

A CAAGAACAAG CAGCAACAGC TGTTTTGTTG GGGCTATAGA TTTAAGTTAG 2340



AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 



3 AAGAGAGGTT TGGCTATCCC CCACCCCACC CACCCACCCT 
GTTCCTTTTT TATCAGGAGG ACTTCAGAGC CAGGCCTGCA GCATTTTGTT TGAAAACACA 
ATCAGCTCTG ACAGTTAGAC ATGCACACAG ACGCCATAGC TGGATTGGAA ACATTGATGT 
TTTAAAAATT TATTTTTTTT G 
CTATAGATTT TTAACTAGTC C 
AAGGCATTAA TCTTAATAAA CCAGGATCCA T 
CCATAATGAA TATTTTATAC TGCATCCTTT A 

TGAAGACCTT TCACAGAATC CTATGGATTG CAGCATTTCA CTTGGCTACT TCATACCCAT 
r CTCAAAAGCA GAAACATGCC GCCAGTTCTC AAGTTTTCCT 
^ AGGGCAGCTG GCCCCCAATG TGGGGAGGTC CGAACATTTT 
C ATTTTCTTGT TCCCGGCTAA ATGACAGTTT CTGTCATTAC TTAGATTCCG 
k TTTACAAAGA GGCCAGCTAA TAGCAGAAAT C 
AAAGAGAGAT GAAATTCAAG CTGTGAGCCA GGCAGGAGCT CAGTATGGCA A

ACAATCACCC ATTTCGTACA AAAAACATTT TTAAAGCTTT TATGTTATAC CATGGAGCCA 3660

TAGAAAGGCT ATGGATTGTT TAAGAACTAT TTTAAAGTGT TCCAGACCCA AAAAGGAAAA 3720 

ATAAAAAAAA AGGAATATTT GTACCCAACA GCTAGAAGGA TTGCAAGGTA GATTTTTGTT 3780 

TTAAAATCGA CAGAAGTCCA CAGATAAGGC CATTTAATAT ATCAAAGATC AGTTGACATC 3840 

TCTAGGGAAT GATGAAACAG CAGGCTATTA GAAAATTATT TCATATAGTT CTCGTGTTCT 3900

TTTCTTTTTT TTAATCCCTG AAGGGAGATC AGTAACATAG CTTCTCTTTT CTGTACTCTA 3960 

GACCACCCCT TTTCATCATT TTGCTTTTTA TGTCTCCCAT AAGAAATGTG CTTTTTAGAG 4020

CTTCCTAATG CATGTGTTGC ATTATTGCAG CATTAGAAAA GGAGAGGTAG CATTTTTGCT 4080

GAAATCGGGC CTGTCACTCT CCAATAAAGG TTCTGGCACT TCAATGCCAG GCAGGTCTCC 4140

TAAATGAACA GAATGATCTG TGTGAGCCGA TGCCTGCCCT TCCAGAGGGG CCACTGTCCC 4200 

CAGCCGCAGC CAACTGTGTC CCACAGGAAT GGGAGCCTAG GTTTCCAAAT CTTGTGATTC 4260 

TTTAGGAGAA ACATGAAACC TGGATTTCGT GTGAAATGTC CCGATTGTTA AAAAGTTGGC 4320 

TCAATTATTT TTAAAACATT TTGTAAGCCA ACAAAAGTCT GTGGGCTGCC AGTTTATTAC 4380 

TTTTGTCTTA AAACATGATC ATTGTTCTCT CACGGTATCC TTCTGTCTTC CCGTTGCAAA 4440 

TTCACTTTTC TTTCTTCCTG ACATTGCCAT TGAGGGCTTT GTTACCACAA GCTAAGAAAC 4500 

TGAGTTTAAC AGCCCAGTTA TCTGCAACAT GTCAATTACC TTTGCTCCTC TCCTGTGATT 4560

CCCACCATGC TGTGACCCTC AGCTGTCTCC CTTTGCTGGG AATTCTGCAC CAATGTCTTC 4620 

CCCTCAACCC ATTCCCTGGT TGGTCCTACT CCCGTGTGGC CAGAGACATC CTAGCAAATC 4680 

CTTCCTCCTA TTATATCTGA CACTAATTTC TTTTCAACAG CGCTCATGTC TCTTGGCCCA 4740 

GTCAGGTGCT GCCAGGTTTA GATAGGAAAG TACATGTCCC ATTTTCATGG GTGCCCTTAA 4800

TGTGGTCCAC GTCCTATATC TTATTATATT TACTCATGGC TCAATGGGGG CCTCCAGAGA 4860

CCCTCTCAGG CTGCTGAGCT AGACTAAGGA ATGCATCCAC TGTCATCACA TGAGACACTG 4920 

ACTCTGTGAC GACAAAAGTA CAAACAGTCT GAGGCTAAGA AAGGTTCATC TCACAACAGG 4980 

AAAAACAAAT CTCAACACAC ATTAGAGATA ATTGATTCAG GGGTTTTCTC TCCCAGTCTC 5040 

CCAGCAGGGA CTGATTTCAT TTCTGACCCA CTAGGTTTTC TTTCCAGAAA TAGGTAGCAA 5100 

GGACAAGAAC TAAACAATCC CAGCCCCACC CAGCAACACA GAACACAGGA GTTTGCTTTT 5160 

TTGGAGCATC 5220 

AGCTCGAAGA 5280 

GAGCATOATG GAGGGAAGCC 5340 



TCTCTCTCTA GTATTCTAAA CTGACCCCAT GACCAACTGA GAATTTATTT TTGTTTCATT
GGTTGTTTCA CAGAATTAGA ACACACACGA CTTTTTATTC CTCCATTGCA AAATGGAATC 
AAGATACTAC ACAAGACCTG TGCTTTCTTC CTTTGCATGA TTTACACCTC CGCCTGTTTT
GGTGCTAGCT GTCTAGAACT TCTCTCTTGG TTTGAATCTG ATTCCTTCAC ACTACACTAG 
r CATCTTGTTT TGTCTAGACT CCAGATACAG AGGGACAGCT GGACTGAGGA 



GCCAGTCACA TCTAGTCTAT GTCCCCAGAG CCCTTGGAGT TGCGCAGCTT A 
ACTCCAAGGA AATTAGTACA GAAGTAACCA CTCTATTAAG TGTGTTCTGC TATGTTCACA 
TGCCTGTAGT ACCTGCAAAC CATGCCAGGT TCATCTAAAG ACATAGGGGA AGATTAAGGA 



GACAGAACTC CCAGCCCAGG AAAGTTCCAT GACAATGACT GTCGCAGAAG G 
ACTAAAGTCA GTCCATTTTC AAGTTTTGGT CTTCAGAGAC AAAAGAACGT CCCAGCCACC 
TGATTTTGAT GGTGAGGTAA CTCTAAGTTG AATTCAGGCT AGTGTTGCAG TATAGCTTTG
GCATGTTCAT GAGTGAGCAC CCAGAATGTG TTGAACCAAC CCCCACCCCT AACTACTGAC
TATGACTGCA GTGGGTTTTT ATGGGGAAAA AAAGTGTGAA AAGCAAAAAG AAAGGAACAG 
AGATTTTTTA TCACCTTTAT TGTAAGACAG TCCATTTATG AATTGAGTAT AAACACATAC 
AAAGTAACAA GAGATTCCTA AGAAACGCAA ATCCTTGAGT TTCACGCACT TCATGTTCAA
CCATTTGCTG TAATCCAGAG GCAGCCTGTG AATCATTCTC ATGCCCTGTT TTTTTTTTTT
TTTTCCTATA ATGTTCTGGG TTTAAAAGCC ATCTTTTCCA CATTTTCTGT AAATAATGGA 
TAATCATTTT AAAAAGTTTT ATTTTTAGTG CTGTTTTAAC AATGTAGATA GAT CAT AAA? 
GTACTTGCTG AATTCAATCA TTTTTAACAA GCCAATAAAG TTTGATAATT CAAAAAAAAA 
AAAAAAAAAA AAAAAAAAA 

Seq ID NO: C109 DNA Sequence 

Nucleic Acid Accession #: NM_006180.1 

Coding sequence: 352.. 2820 

1 11 21 31 41 51 

I I I I I I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 
TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 
CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 
CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 
ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 
CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 
TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 
GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 
TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 
GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 
GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 
AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC
CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 
ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA
GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT
ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 
CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 
CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 
AGCCACACAC AGGGCTCCTT AAGGATAACT A 



rCTAATAGCC 1440 



CATTTTGCAC CAACTATCAC ATTTCTCGAA T 
CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT G 
TTGAATGAGT CCAAATACAT CTGTACTAAA A 
JC AGCTGGATAA TCCCACTCAC A 
AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC A 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA T 

5 GARTGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 

GATAAAACCC CTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

^TC AAA! G GT TATCAI CAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 

TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 3707 
AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC
AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 
CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG
CGCGGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACAOGC GCAAGAGCTG 
AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 
CTGTGGTGTG AATTAGGGAC CGGGAGGCGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 
AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 
GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTOG CTGGC7GCTT 
CGCTCGCGCT CTACGCGCTC AGTCCCCGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC
GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT
AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 
GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC
GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 
ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGC 
GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG C
GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG A



T GCAGTGCCTC TCGGATCTGG TGCAGCGACC 
CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 
TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 



ATAAAGCATT TCTGAAAAAC 



ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 
GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT C 
ACCTGCAGAT A



GATAACTAAC ATTTCATCCG ATGACAGTGG



CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 
AAGGCAACCC C 
AATACATCTG T 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 
GGAAGGATGA G 
CAAACCCAAA T 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 

CCCAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400

GGCTCTCGTG CTTCTTGCTT CATCCTGCCA TCTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580

GTAACTCTCA GGCACCTAAG CAGCACCTCA ACAAAACATG TTAAATTAAT CCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 
G CTGCAATTTA O 



CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 
TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG
ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 
K3CAGTT TCTCAAAAGC AGAAACATGC CGCCAGTTCT CAAGTTTTCC 
C ATTTGAATGT AAGGGCAGCT GGCCCCCAAT G 
TCTGAATTCC C 

GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TCATGACCCT 
GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC TCAGTATGGC AAAGGTTCTT 
GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 
ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 
AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 
TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 
ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG
GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG COTCCCAGGT GGAG7GCACC 
GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 
CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC
GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG
GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC
TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 
CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG
GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 
TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 
AACCAGCCTA GGTTAGGGAC GGACACTCTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 
GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGOGTCCA TGTCCCTGAG 
GTGCCTAGTT ACCCAGAAAC ACCATGGTAG CCAGACACAC CCAGTTACCC TGACACCACA
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 
TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 

Seq ID NO: C112 DNA Sequence
Nucleic Acid Accession #: NM_002658.1
Coding sequence: 77.. 1372 

1 11 21 31 41 51

I I I I I I

GTCCCCGCAG CGCCGTCGCG CCCTCCTCCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 
CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 
GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAAC1 
TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTCCTGCA A 
GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCT 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCT 

;tctgccact GTCCTTCAGC AAACGTACCA TGCCCACAGA ICTGATGCTC TTCAGCTGGG 
CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 
GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA
AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 
CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 
CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA G
CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA A 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGC 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA
CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960
ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 
CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 
TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140
CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 
CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 
CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 
CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 
AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 
TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500
CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 
CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620
TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 
GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 
AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 
AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 
ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920
GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 
AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 
CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 
ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 
ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 
ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280
ATCAATAAAA TGTGATTTTT CTGA 2304

Seq ID NO: C113 DNA Sequence 

Nucleic Acid Accession #: XM_087254.1 
AGAGTACGTG TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTCGGGA 60

ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 120

AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 180 

CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 240

AATTAAAGAA CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300 

CAATGTTCAA ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACGTGG CACCATCGCA 360 

GTTGGAGTAC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420 

TGGTATTGTG TTTATTGGCA ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480 

GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT 540 

AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600

TCTCCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660 

TTTGAAAGGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720 

GGAAATAGAT AAACGCATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAGAGAAATT 780 

GGCAGCTGTT TTCCAGTTCA TAGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840 

^™^ A ^ATAAAG TTCGAGAAAC TATTGAAGCA TTGAGAATGG CIGGTATCAA 900 

AGTATGGGTA CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960 

AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020 

TGCTGAACAA TTGAGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TTCAGCATGG 1080 

GCTGGTAGTG GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140 

GGAAGTTTGC AGAAATTGTT CAGCTGTATT ATGCTGTCGT ATGGCTCCAC TGCAGAAAGC 1200 

AAAAGTAATA AGACTAATAA AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260 

TGGTGCTAAT GACGTAAGCA TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320 

CAGGCTGCAA GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCTCTC 1380 

CAAATTGCTT TTTGTTCATG GTCATTTTTA TTATATTAGA ATAGCTACCC TTGTACAGTA 1440 
TTTTTTTTAT AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTGTTT 
GTTTTCTCAG CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 
TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 



1740 




AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA TTAAAACATT 1680 
TCTTTATTGG ACCATCCTGG GCTTCAGTCA TGCCTTTATT TTCTTTTTTG C 
ACTAATAGGG AAAGATACAT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA I 
TGGCACTTTG GTCTTCACAG TCATGGTTAT TACAGTCACA GTAAAGATGG C 
TCATTTTTGG ACTTGGATCA ACCATCTCGT TACCTGGGGA T CTATTATAT 1 
ATTTTCCTTG TTTTATGGAG GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT
TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT CCCATAATCC -CATGGTTG- 

TACATGTCTA TTTCTTGATA TCATAAACAA GGTCTTTGAC CGACACC-CC 

TACTGAAAAC CCACAGCTTA CTGAAACAAA TGCAGGTATC AAGTGCT"GG 
. CTGTTTCCCG GAAGGAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAG' 

W AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTC"ATAC 

CAACGACAGG AGCATCTTCA CTCTCTCCAC AATGGACTCA TCTACTTGTT AAAGGGGCAG 
TACTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 
ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 
GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAAT" 
TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG 
TTTAATATAA ATGTAGAAAA AACAGAGAAA TCTIAGTAAA GAG"ATTTTT 
TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA 

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820
ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880 
TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940
ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000
CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCCGT TCACCGAACT TCAAATTGAT 3060
GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240
CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 
It™™! AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 



TAGTATTAGC 




CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

AATTTATCAA GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGTC 3720




TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020 

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGo AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTA-AAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

°^ TCAAAT ^CTGAGTA CTGACACCAG ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA TGGTGCTTAA HGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 
GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4980

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TAGGGTTAGT ACCAGGACTG 5220

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460 

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640

AATCATGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTG? AG CAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTT" AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 

GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 5855 

Seq ID NO: C114 DNA Sequence

Nucleic Acid Accession #: XM_087461.1

Coding sequence: 236.. 1138 



3 GAACCCCAAA CGCAACCGGG TCTGGAGGGA TCCCCGCGCC 
MCGCCG CCCCTGCGGG CTTGGCAGGC GCCCGGCGCG 
C GCCCGGCCGC CGGCTCCCGC GGTCCCACCG TGAGCTCGCC GGCCCGTCGC 
CCGCTCGCCA TGCAACCGCC GCCGGCCTCG CGCGCGTAGG CGCCCGCCGC AGGCCATGCT 



3 CCCGGCCTCC TCGGGGTGCC CTCCAATGCT TCAGTCAACG C 

C TCGGCGGCCC CCGGGCCCCC CGAGCGCCCG 
GGCCCGGAGG AGGCGGCGGC GGCGGCGGCG C 

GCTGAGCTCG CTGCTCGTGC GCTGGGGCCG CCCGCGGGGC TTCCAGTGCG A 
CTTCTCCACC AACGCGCACG GCCGCGCTTT CTTCGCCG 
GCCGCTGCTC ATCGAGCACC TGGGGCTGGC GGCGGGCGGC GCGCAGCAGG ACCTGCGCCT 

CCCCAGCGCC GCCGCCGCCA CCGCCGGGGC GCCCACCGCG CTGCCAGCCT ACCCCGCGGC 
CGAGCCGCCC GGGCCGCTGT GGCTGCAGGG CGAGCCGCTG CATTTCTGCT GCCTAGACTT 
; CTGGCGGCTG AACCGTAAGC CCATTGAGTC 
CATCGTGGTG TGGAGCGTGG CCGCCCTCAT 
CTGCCCGGTG CCCATCATCG CCGGCTTCCT GCCCAACGGC ATGGAACAGC GCCGGACCAC 
CGCCAGCACC ACCGCAGCCA CCCCCGCCGC AGTGCCCGCA GGGACCACCG CAGCCGCCGC 
CGCCGCCGCC GCTGCCGCCG CCGCCGCGGC CGTCACTTCG GGGGTGGCGA CCAAGTGACC 
CCCTCCGCTC CTCCCTCTGT CCGTCCTGTG TCCGCGCGCG CGGGTGCCTT TCCCGCCGGA 
GACTCGGCCG GTGTGCTTCG TGCTGTAGTT ATCGTTAGTT CCTCTTCCCG AGATI 
GCCGAGAGAC CCCAGCGCCT TTGAAAAGCA AGGTTTGTGC TGCGCTTCCA GTTCCGAAAA 
GCAGATCTTT AAGCCCTTGG ACTGAGGGTG GGATCGCAGC TCCGAAGACG GAGAGGAGGG 
AAATGGGGCC CTTTCCCCTC TATTGCATCC CCCTGCCCGA C 

GCCCTAGATT CATGGCAGAA AATGACCAAA T

CTGTTTTAAA TGAAAGTTTT AGTAAAAAAA ATACAAAACA AAAAGATTAA ATTGCTATTG 
CTGTAGTAAG AGAAGCTCTT TGTATCTGAA CATAGTTGTA TTTGAAATTT GTGGTTTTTT 
AATTTATTTA AAATTGGGGG GAGGGCATGG GAAGGATTTA ACACCGATAT ATTGTTACCG
CTGAAAATGA ACTTTATGAA CCTTTTCCAA GTTGATCTAT CCAGTGACGT GGCCTGGTGG 
GCGTTTCTTC TTGTACTTAT GTGGTTTTTT GGCTTTTAAT ACAGACATTT TCCTCC 

Seq ID NO: c: 
Nucleic Acid 



1 11 21 31 41 51 

I I I I I I 

ACAGTTGTTG CAAAGTGCTC AGCACTAAGG GAGCCAGCGC ACAGCACAGC CAGGAAGGCG 
AGCGAGCCCA GCCAGCCCAG CCAGCCCAGC CAGCCCGGAG GTCATTTGAT TGCCCGCCTC 
AGAACGATGG ATCTGCATCT CTTCGACTAC TCAGAGCCAG GGAACTTCTC GGACATCAGC 
TGGCCATGCA ACAGCAGCGA CTGCATCGTG GTGGACACGG TGATGTGTCC CAACATGCCC
AACAAAAGCG TCCTGCTCTA CACGCTCTCC TTCATTTACA TTTTCATCTT CGTCATCGGC 
ATGATTGCCA ACTCCGTGGT GGTCTGGGTG AATATCCAGG CCAAGACCAC AGGCTATGAC

1 ACATCTTGAA CCTGGCCATT GCCGACCTGT GGGTTGTCCT CACCATCCCA 

! TCAGTCTCGT GCAGCACAAC CAGTGGCCCA TGGGCGAGCT CACGTGCAAA 
, TCATCTTCTC CATCAACCTC TTCGGCAGCA TTTTCTTCCT CACGTGCATG 
AGCGTGGACC GCTACCTCTC CATCACCTAC TTCACCAACA CCCCCAGCAG CAGGAAGAAG 

TTCTACCCCG AGCACAGCAT CAAGGAGTGG CTGATCGGCA TGGAGCTGGT CTCCGTTGTC 

ATCTCGGCGT CCAGTGACCA GGAGAAGCAC AGCAGCCGGA AGATCATCTT CTCCTACGTG 
GTGGTCTTCC TTGTCTGCTG GCTGCCCTAC CACGTGGCGG TGCTGCTGGA CATCTTCTCC 
T ACATCCCTTT CACCTGCCGG CTGGAGCACG CCCTCTTCAC GGCCCTGCAT 



GGGTGATGGG CCCTATGGTT TTCTAGAGCA AAGCAAAGTA GCTTCGGGTC TTGATGCTTG 
AGTAGAGTGA AGAGGGGAGC ACGTGCCCCC TGCATCCATT CTCTCTTTCT CTTGATGACG 
CAGCTGTCAT TTGGCTGTGC GTGCTGACAG TTTTGCAACA GGCAGAGCTG TGTCGCACAG 
CAGTGCTGTG CGTCAGAGCC AGCTGAGGAC AGGCTTGCCT GGACTTCTGT AAGATAGGAT 1500

TTTCTGTGTT TCCTGAATTT TTTATATGGT GATTTGTATT TAAATTTTAA GACTTTATTT 1560

TCTCACTATT GGTGTACCTT ATAAATGTAT TTGAAAGTTA AATATATTTT AAATATTGTT 1620

TGGGAGGCAT AGTGCTGACA TATATTCAGA GTGTTGTAGT TTTAAGGTTA GCGTGACTTC 1680 

AGTTTTGACT AAGGATGACA CTAATTGTTA GCTGTTTTGA AATTATATAT ATATAAATAT 1740

ATATAAATAT ATAAATATAT GCCAGTCTTG GCTGAAATGT TTTATTTACC ATAGTTTTAT 1800 

ATCTGTGTGG TGTTTTGTAC CGGCACGGGA TATGGAACGA AAACTGCTTT GTAATGCAGT 1860 

TTGTGACATT AATAGTATTG TAAAGTTACA TTTTAAAATA AACAAAAAAC TGTTCTGGAC 1920 

TGCAAATCTG CACACACAAC GAACAGTTGC ATTTCAGAGA GTTCTCTCAA TTTGTAAGTT 1980 

Seq ID NO: C116 DNA Sequence
Nucleic Acid Accession #: K 
Coding sequence: 82.. 6903



CTGGCTCTTA ACGGCGTTTA TGTCCTTTGC TGTCTGAGGG GCCTCAGCTC 



TGCCATTTCC 

A AGGGTATATC GAGATTTTCA 



3 ACCTGGCGCT 

CAGGGCACCC TACAGTGGAT AGAAGACACT CTGTATGCCA ACGTGGACTT 



CAGGACTTGC TGTGGGTGAC CAGGCCCCTC A 
GTGCTCTCCT TCAACTGGTA TGAAGACAAT AACTATAAGG CCTTTCTGGG GATTGACTCC 



ATCCACACCC TGGACTCAAA TCCTTTAACC AAAATCGCTT GGAGGGCCCC AAACCCTTTC 
CTGATGGGAA AAATCCTGTA CACTCCTGAT TCACCTGCAG CACGAAGGAT ACTGAAGAAT 
GCCAACTCAA CTTTTGAAGA ACTGGAACAC GTTAGGAAGT TGGTCAAAGC CTGGGAAGAA 
CTAGGGCCCC AGATCTGGTA CTTCTTTCAC AACACCACAC AGATGAACAT GATCAGAGAT 
ACCCTGGGGA ACCCAACAGT AAAAGACTTT TTGAATAGGC AGCTTGGTGA AGAAGGTATT 



ATGGCCAACT TCGACTGGAG GGACATATTT AACATCACTG ATCGCACCCT CCGCCTGGTC 
AATCAATACC TGGAGTGCTT GGTCCTGGAT AAGTTTGAAA GCTACAATGA TGAAACTCAG 
CTCACCCAAC GTGCCCTCTC TCTACTGGAG GAAAACATGT TCTGGGCCGG AGTGGTATTC 
CCTGACATGT ATCCCTGGAC CAGCTCTCTA CCACCCCACG TGAAGTATAA GATCCGAATG 
GACATAGAGG TGGTGGAGAA AACCAATAAG ATTAAAGACA GGTATTGGC-A TTCTGGTCCC 



ATGGTTGAAC AGGGGATCAC AAGGAGCCAG GTGCAGGCGG AGGCTCCAGT TGGAATCTAC 1980 

CTCCAGCAGA TGCCCTACCC CTGCTTCGTG GACGATTCTT TCATGATCAT CCTGAACCGC 2040 

TGTTTCCCTA TCTTCATGGT GCTGGCATGG ATCTACTCTG TCTCCATGAC TGTGAAGAGC 2100

ATCGTCTTGG AGAAGGAGTT GCGACTGAAG GAGACCTTGA AAAATCAGGG TGTCTCCAAT 2160 

GCAGTGATTT GGTGTACCTG GTTCCTGGAC AGCTTCTCCA TCATGTOGAT GAGCATCTTC 2220 

CTCCTGACGA TATTCATCAT GCATGGAAGA ATCCTACATT ACAGCGACCC ATTCATCCTC 2280

TTCCTGTTCT TGTTGGCTTT CTCCACTGCC ACCATCATGC TGTGCTTTCT GCTCAGCACC 2340 

TTCTTCTCCA AGGCCAGTCT GGCAGCAGCC TGTAGTGGTG TCATCTATTT CACCCTCTAC 2400 
T CGCCTGGCAG GACCGCATGA CCGCTGAGCT G 



GAGCAAGGCC TGGGGCTGCA GTGGAGCAAC ATCGGGAACA GTCCCACGGA A 
3 CTCCTTGATG CTGCGTGCTA I 
T GTTTCCAGGA G ACT AT GG AA CCCCACTTCC T 
CTTCTACAAG AGTCGTATTG GCTTAGCGGT GAAGGGTGTT CAACCAC 
CTGGAAAAGA CCGAGCCCCT AACAGAGGAA ACGGAGGATC CAGAGCACCC AGAAGGAATA 
3 CGTGAAGAAT 



T ATGCCCAGCT GAAAGGAAAG 

TCCCAGGAGG AGGCCCAGCT GGAGATGGAA GCCATGTTGG AGGACACAGG CCTCCACCAC 3240 

A GGACCTATCA GGTGGCATGC AGAGAAAGCT GTCGGTTGCC 3300 

; CAAGGTGGTG ATTCTGGACG AACCCACCTC TGGGGTGGAC 3360

A GACGCTCAAT CTGGGATCTG CTCCTGAAGT ATCGCTCAGG CAGAACCATC 3420 

ATCATGCCCA CTCACCACAT GGACGAGGCC GACCACCAAG GGGACCGCAT TGCCATCATT 3480 

Z CCACTCTTCC TGAAGAACTG CTTTGGCACA 3540 

T TAACCTTGGT GCG CAAGATG AAAAACATCC AGAGCCAAAG GAAAGGCAG7 3600 

GAGGGGACCT GCAGCTGCTC GTCTAAGGGT TTCTCCACCA CGTGTCCAGC CCACGTCGAT 3660 
GACCTAACTC CAGAACAAGT CCTGGATGGG G. 
CACCATGTTC CAGAGGCAAA GCTGGTGGAG T 

CCAAATAAGA ACTTCAAGCA CAGAGCATAT GCCAGCCTTT TCAGAGAGCT GGAGGAGACG 3840 

CTGGCTGACC TTGGTCTCAG CAGTTTTGGA ATTTCTGACA CTCCCCTGGA AGAGATTTTT 3900 

CTGAAGGTCA CGGAGGATTC TGATTCAGGA CCTCTGTTTG CGGGTGGCGC TCAGCAGAAA 3960

AGAGAAAACG TCAACCCCCG ACACCCCTGC TTGGGTCCCA GAGAGAAGGC TGGACAGACA 4020 
ACCGAAACTG ACTTGCTCAC CTGGAACACC TGATGGTCAA ACCAAACAAA TACAAAATrr
TTCTCCAGAC CCCAGAACTA GAAACCCCGG GCCATCCCAC TAGCAGCTTf GGCCTCCATA 
TTCAAGCAGA TCTGCTTT1C TGCATGTTTG TCTGTGTGTC TG CGTTGTGT 
GTGATTTTCA TGGAAAAATA AAATGCAAAT GCACTCATCA CAAAAAAAAA AAAAAAAA 
Seq ID NO: C117 DNA Sequence
Nucleic Acid Accession #: NM_006671.2
Coding sequence: 138..1820

1 11 21 31 41 51 

| | | | | |

GGCACGAGGC TGGTGTTTAG CAACTCCGAC CACCTGCCTG CTGAGGGGCT AGAGCCCTCA 
GCCCAGACCC TGTGCCCCCG GCCGGGCTCT CATGCGTGGA ATGGTGCTGT GCCCCTTGCC 
AGCAGGCCAG GCTCACCATG GTGCCGCATG CCATCTTGGC ACGGGGGAGG GACG-GTGCA 
GGCGGAATGG ACTCCTCATC CIGTC1GTGC TGTCTGTCAT CGTGGGCTGC CTCC-CGGCT 
TCTTCTTGAG qacccGGCGC CTCTCACCAC AGGAAATTAG TTACTTCCAG TTCCCTGGAG 
AGCTCCTGAT GAGGATGCTG AAGATGATGA TCCTGCCACT GGTGGTCTCC AGCT-GATGT
CCGGACTTGC CTCCCTGGAT GCCAAGACCT CTAGCCGCC? GGGCGTCCTC ACCGTGGCGT 
ACTACCTGTG GACCACCTTC ATGGCTGTCA TCGTGGGCAT CTTCATGGTC TCCATCATCC 
ACCCAGGCAG CGCGGCCCAG AAGGAGACCA CGGAGCAGAG TGGGAAGCCC ATCATGAGCT 
CAGCCGATGC CCTGTTGGAC CICATCCGGA ACATGTTCCC AGCCAACCTA GTAGAAGCCA 
CATTCAAACA GTACCGCACC AAGACCACCC CAGTTGTCAA GICCCCCAAG GTGGCACCAG 
AGGAGGCCCC TCCTCGGCGG ATCCTCATCT ACGGGGTCCA G3AGGAGAAT GGCTCCCATG 
* G f AGAACTT CGCC CTGGAC CTGACCCCGC CGCCCGAGGT CGTTTACAAG TCAGAGCCGG 
GCACCAGCGA TGGCATGAAT GTGCTGGGCA TCGTCTTCTT CTCTGCCACC ATGGGCAICA 
TGCTGGGCCG CATGGGTGAC AGCGGGGCCC CCCTGGTCAG CTTCTGCCAG TGCCTCAATG 
AGTCGGTCAT GAAGAICGTG GCGGTGGCTG TGTGGTATTT CCCCTTCGGC A"TGTGTTCC
l^J? 1 ? 30 TAAGATCCTG GAGATGGACG ACCCCAGGGC CGTCGGCAAG AAGCTGGGCT 
TCTACTCAGT CACCGTGGTG TGCGGGCTGG TGCTCCACC-G GCTCTTTATC CTGCCCCTGC 
TCTACTTCTT CATCACCAAG AAGAATCCCA TCGTCT-CAT CCGCGGCATC CTGCAGGCTC 
TGCTCATCGC GCTGGCCACC TCCTCCAGCT CAGCCACACT GCCCATCACC TTCAAGTGCC 



4200 
4260 
4320 



CCAATGTCTG CTCCCCAGGG GCGCCGGCTG CTCACCCAGA GGGCCAGCCT 
CCCCCAGAGC CAGAGTGCCC AGGCCCGCAG CTCAACACGG GGACACAGCT GGTCCTCCAG 
CATGTGCAGG CGCTGCTGGT CAAGAGATTC CAACACACCA TCCGCAGCCA CAAGGACTTC 
CTGGCGCAGA TCGTGCTCCC GGCTACCTTT GTGTTTTTGG CTCTGATGCT TTCTATTCTT 
t"ac™^ ™GCGAATA CCCCGCTTTG ACCCTTCACC CCTGGATATA TGGGCAGCAG 
TACACCTTCT TCAGCATGGA TGAACCAGGC AGTGAGCAGT TCACGGTACT TGCAGACGTC 

CTCCTGAATA AGCCAGGCTT TGGCAACCGC TGCCTGAAGG AAG 1 AGTAC Z7r, 

, CCCTGTGGCA ACTCAACACC CTGGAAGACT CCTTCTGTGT CCCCAAACA" CACCCAGCTG !™ 

TTCCAGAAGC AGAAATGGAC ACAGGTCAAC CCTTCACCAT CCTGCAGGTG CAGCA^SgG 4560 

==ss=== till 

TIGGTAAAAA CGTATCCTGC TCTTATAAGA AGCAGCTTAA AGAG CAAAT T CIGGGICAAT 4740 

GAACAGAGGT ATGGAGGAAT TTCCATTGGA GGAAAGCTCC CAGTCGTCCC CATCACGGGG 4800 

GAAGCACTTG TTGGGTTTTT AAGCGACCTT GGCCGGATCA TGAATGTGAG CGGGGGCCCT 4860

ATCACTAGAG AGGCCTCTAA AGAAATACCT GATTTCCTTA AACATCTAGA AACTGAAGAC 4920

AACATTAAGG TGTGGTTTAA TAACAAAGGC TGGCATGCCC TGGTCAGCTT TCTCAATGTG 4980 

AtSotS ?™lcll*rr rr^™ COT ^CA GGAGCCCCGA GGAGTATGGA 5040 

AICACCGTCA TTAGCCAACC CCTGAACCTG ACCAAGGAGC AGCTCTCAGA GATTACAGTG 5100 

20 S^^™ TGTGGTTGGC ATCTGCGTGA TTTTCTCCAT GTCCTTCGTC 5160 

lK) E™ C <? GCT TT ^CCTTTA TTTGATCCAG GAGCGGGTGA ACAAATCCAA GCACCTCCAG 5220 

TTTATCAGTG GAGTGAGCCC CACCACCTAC TGGGTGACCA ACTTCCTCTG GGACATCATG 5280 

AATTATTCCG TGAGIGCTGG GCTGGTGGTG GGCATCTTCA TCGGGTTTCA G^Sa^CC 5340 

TACACTTCTC CAGAAAACCT TCCTGCCCTT GTGGCACTGC TCCTGCTGTA TGGATGGGCG 5400 

GTCATTCCCA TGATGTACCC AGCATCCTTC CTGTTTGATG TCCCCAGCAC AGCCTATGTG 5460

GOTTA TCTT GTGCTAATCT GTTCATCGGC ATCAACAGCA GTGCTATTAC CTTCATCTTG 5520 

GAATTATTTG ATAATAACCG GACGCTGCTC AGGTTCAACG CCGTGCTGAG GAAGCTGCTC 5580 

ATTGTCTTCC CCCACTTCTG CCTGGGCCGG GGCCTCATTG ACCTTGCACT GAGCCAGGCT 5640

GTGACAGATG TCTATGCCCG GTTTGGTGAG GAGCACTCTG CAAATCCGTT CCACTGGGAC 5700 

^0 ^™ GGA AGAA ^ G " TGCCATGGTG GTGGAAGGGG TGGTGTACTT cScCT^CC 57 

JU ™" GGTCC AGCGCCACTT CTTCCTCTCC CAATGGATTG CCGAGCCCAC TAAGGAGCCC 5820 

^S TGATGA ™ CIGAA GAAAGACAAA GAATTATTAC TGGTGGAAAT 5880 

AAAACTGACA TCTTAAGGCT ACATGAACTA ACCAAGATTT ATCTGGGCAC CCCAGCCCA 5940 

GCAGTGGACA GGCTGTGTGT CGGAGTTCGC CCTGGAGAGT GCTTTGGCCT CCTGGGAGTG 6000 

AATGGTGCCG GCAAAACAAC CACATTCAAG ATGCTCACTG GGGACACCAC AGTGACCTCA 6060 

T^l GCCA 0CGTAGCAGG CAAGAGTATT TTAACCAATA TTTCTGAAGT CCATCAAAAT 6120 

ATGGGCTACT GTCCTCAGTT TGATGCAATC GATGAGCTGC TCACAGGACG AGAACATCTT 6180 

TACCTTTATG CCCGGCTTCC AGGTGTACCA GCAGAAGAAA T CGAAAAGGT TGCAAACTGG 6240 

T-lzll^ GCCTGGGCCT GACTGTCTAC GCCGi c GCC 3GCTGGCAC GTACAGTGGG 6300 

40 & : CAACAAGC GGAAACTCTC CACAGCCATC GCACTCATTG GCTGCCCACC GCTGGTGGTG 6360 

W CTGGATGAGC CCACCACAGG GATGGACCCC CAGGCACGCC GCATGCTG-G GAACGTCATC 6420 

GTGAGCATCA TCAGAAAAGG GAGGGCTGT S 5TCCI A CAT CCCACAGCAT 4 

CAGGCACTGT GTACCCGGCT GGCCATCATG GTAAAGGGCC CCTT"CGATG TA-GGGCACC 6540 

ATTCAGCATC TCAAGTCCAA ATTTGGACAT GGCTATATCG TCACAATGAA GATCAAATCC 6600 

CCGAAGGACG ACCTCCTTCC TGACCTGAAC CCTGTGGAGC AGTTCTTCCA GGGGAACTTC 6660 

CCAGGCAGTG TGCAGAGGGA GAGGCACTAC AACATGCTCC AGTTCCAGGT CTCCTCCTCC 6720

TCCCTGGCGA GGATCTTCCA GCTCCTCCTC TCCCACAAGG ACAGCCTGCT CATCGAGGAG 6780 

<S££ GTGTTTGTAA ATTITGCTAA ACAGCAGACT 6840 

50 ====== s 

GC £ GAAAACA 5*5^^? CGAATTCAGA AAGAGGTCTT TCAGAAGGAA 7080 

7260 
7318 




TGCTGGAGAA CAACCACATC GACCGGCGCA TCGCTCGCTT CGTGCTGCCC GTGGGTGCCA
CCATCAACAT GGACGGCACT GCGCTCTACG AGGCTGTG3C CGCCATCTTC ATCGCCCAGG 
TCAACAACTA CGAGCTGGAC TTTGGCCAGA TCATCACCAT CAG7ATCACA G
CCAGCATTGG GGCAGCTGGC ATCCCCCAGG CCGGCCTCGT CACCATGGTC A' 

CCTCCGTGGG ACTGCCCACC GATGACATCA CCCTCATCAT TGCCGTTGAC TGGGOTCIGG 1500

ACCGTTTCCG CACCATGATT AACGTGCTGG GTGATGCGCT GGCAGGGGGG ATCATGGCCC 1560

ATATATGTCG GAAGGATTTT GCCCGGGACA CAGGCACCCA AAf il ( TGCGAGA 1620

CCAAGCCAGT GAGCCTCCAG GAGATCGTGG CAGCCCAGCA GAATGGCTGT GTGAAGAGTG 1680

TAGCCGAGGC CTCCGAGCTC ACCCTGGGCC CCACCTGCCC CCACCACGTC CCGGTTCAAG 1740 

TGGAGCGGGA TGAGGAGCTG CCCGCTGCGA GTCTGAACCA CTGCACCATC CAGATCAGCG 1800 

AGCTGGAGAC CAATGTCTGA GCCTGCGGAG CTGCAGGGGC AGGCGAGGCC TCCAGGGGCA 1860

GGGTCCTGAG GCAGGAACTC GACTCTCCAA CCCTCCTGAG CAGCCGGCAG GG3CCAGGAT 1920 

CACACATTCT TCTCACCCTT GAGAGGCTGG AATTAACCCC GCTTGACGGA AAATGTATCT 1980 

CAGAGAAGGG AAAGGCTGCA TGGGGGAGCC CCATCCAGGG AGTGATGGGC CCGGCATTGC 2040 

CTGAGGCCCC GCTGTGACAG TTTCCCCGGT GTGAGCCCGG TGAGGGCGGC AGG CAGGGGT 2100 

TATCCGGCCC CACTTTCTGG ATGACAGACT TGAGGCTCTG AGAGCTGAAA ACACTTGTCC 2160 

AAGGTCTCAC GTTAAGGTCA AGACACTAAC TCAAATCTTT CAAGCCCCGC CTCTCCTCTT 2220 

GGAGGACAGG GCAGCCTGCA GCTGTGTCCA GGCCCAGGCC CCACCCCATA ACAGGTGGCC 2280 

TCAGCCACAC AGTTCTCCCC AAGGGGAGCA GCCCAGGGCC AAGCCCCGCT GCCTTCCCCA 2340 

GGCCACAGTG CGTCCAGTCT CCTGTCCTGC CACGTGTCTT TTGCAAAGCT CCTTGGATGT 2400

GGAGACAGAT GTCTTTACTA GAGCTGAAAG GCCCCCTTGA CACATCCAGG CCAACCTCCC 2460 

ATGGAATAGG TAGGCAAGCC AGGACTCCGG GAAGGAGGTG CAGCCAGGAT GCTCTGGTGG 2520

AGCTGCCGAT GGGGCCCTGG TGTCAGAACT CCCCAAAGGC CTGTGCGTCC AAGTGGAGTC 2580 

AGGTTTTCTA TTCCTTTCTG TGTTTGCAAA TTCAGTGTTA ACTAAATAAA GGTATTTTGT 2640 



GTACGTGCCC TGCGTGAGTG CGTGGCGGCG GCGCGTGCGC TAGGGGAGTG GGCGGTGAGG 



C GGCGTCCACG C 
A GGGCCCGTGG GTCCGGCCTG GATGCAGGAT GGCCTGAGTC C 
C GTGCCCTCGA CGCGGATGGC TCTAGGGACT CTGGCCTTGG TGCTGGCTCT 
TCCCTGCAGA CGCCGGGAGC GGCCCGCTGG TGCTGATTCG CTGTCTTGGG GGGCCGGCCC 
TCGCATCTCT CCCTACGTGC TGCAGCTGCT TCTGGCCACA CTTCAGGCGG CGCTGCCCCT 
CCGGG TGGGCACTGC CCGGGGGGCC CCACTGCCAA GCTATCTACT 
TCTGGCCTCC GTGCTGGAGA GTCTGGCCGG CGCCTGTGGC CTGTGGCTGC TTGTCGTGGA 
GCGGAGCCAG GCACGGCAGC GTCTGGCAAT GGGCATCTGG ATCAACTTCA GGCACAGCCC 
TGGTCTCCTG CTCCTCTGGA CTGTGGCGTT TGCAGCTGAG AACTTGGCCC TGGTGTCTTG 
GAACAGCCCA CAGTGGTGGT GGGCAAGGGC AGACTTGGGC CAACAGGTTC AGTTTAGCCT 
GTGGGTGCTG CGGTATGTGG TCTCIGGAGG C 
ACTTCGTCCC CAGTC( 



G TGCTCATCTG 1080 

CCTGGGGCTC ATGGGTTTGG AACGGGCACT CAATGTGTTG GTGCCTATAT TCTATAGGAA 1140 

CATTGTGAAC TTGCTGACTG AGAAGGCACC TTGGAACTCT CTGGCCTGGA CTGTTACCAG 1200 

TTACGTCTTC CTCAAGTTCC TCCAGGGGGG TGGCACTGGC AGTACAGGCT TCGTGAGCAA 1260 

CCTGCGCACC TTCCTGTGGA TCCGGGTGCA GCAGTTCACG TCTCGGCGGG TGGAGCTGCT 1320 

CATCTTCTCC CACCTGCACG AGCTCTCACT GCGCTGGCAC CTGGGGCGCC GCACAGGGGA 1380

GGTGCTGCGG ATCGCGGATC GGGGCACATC CAGTGTCACA GGGCTGCTCA GCTACCTGGT 1440 

GTTCAATGTC ATCCCCACGC TGGCCGACAT CATCATTGGC ATCATCTACT TCAGCATGTT 1500 

CTTCAACGCC TGGTTTGGCC TCATTGTGTT CCTGTGCATG AGTCTTTACC TCACCCTGAC 1560 

CATTGTGGTC ACTGAGTGGA GAACCAAGTT TCGTCGTGCT ATGAACACAC AGGAGAACGC 1620 

TACCCGGGCA CGAGCAGTGG ACTCTCTGCT AAACTTCGAG ACGGTGAAGT ATTACAACGC 1680 

CGAGAGTTAC GAAGTGGAAC GCTATCGAGA GGCCAICATC AAATATCAGG GTTTGGAGTG 1740 

GAAGTCGAGC GCTTCACTGG TTTTACTAAA TCAGACCCAG AACCTGGIGA TTGGGCTCGG 1800 

GCTCCTCGCC GGCTCCCTGC TTTGCGCATA CTTTGTCACT GAGCAGAAGC TACAGGTTGG 1860 

A CCTACATTAT CCAGCTGTAC ATGCCCCTCA ATTGGTTTGG 1920 

T CATTGACATG GAGAACATGT TTGACTTGCT 1980 
2040 

G TTTGAGAACG TGCACTTCAG CTATGCCGAT GGGCGGGAGA CTCTGCAGGA 2100 

CGTGTCTTTC ACTGTGATGC CTGGACAGAC ACTTGCCCTG GTGGGCCCAT CTGGGGCAGG 2160 

A ATTTTGCGCC TGCTGTTTCG CTTCTACGAC ATCAGCICTG GCTGCATCCG 2220 

G CAGGACATTT CACAGGTGAC CCAGGCCTCT CTCCGGTCTC ACATTGGAGT 2280 

TGTGCCCCAA GACACTGTCC TCTTTAATGA CACCATCGCC GACAATATCC GTTACGGCCG 2340 

T GGGAATGATG AGGTGGAGGC TGCTGCTCAG GCTGCAGGCA TCCATGATGC 2400 

T TTCCCTGAAG GGTACAGGAC ACAGGTGGGC GAGCGGGGAC TGAAGCTGAG 2460

CGGCGGGGAG AAGCAGCGCG TCGCCATTGC CCGCACCATC CTCAAGGCTC CGGGCATCAT 2520 

T CAGCGCTGGA TACATCTAAT GAGAGGGCCA TCCAGGCTTC 2580 

A GTCTGTGCCA ACCGCACCAC CATCGTAGTG GCACACAGGC TCTCAACTGT 2640 

GGTCAATGCT GACCAGATCC TCGTCATCAA GGATGGCTGC ATCGTGGAGA GGGGACGACA 2700 

CGAGGCTCTG TTGTCCCGAG GTGGGGTGTA TGCTGACATG TGGCAGCTGC AGCAGGGACA 2760 

GGAAGAAACC TCTGAAGACA CTAAGCCTCA GACCATGGAA CGGTGACAAA AGTTTGGCCA 2820 

CTTCCCTCTC AAAGACTAAC CCAGAAGGGA ATAAGATGTG TCTCCTTTCC CTGGCTTATT 2880 

TCATCCTGGT CTTGGGGTAT GGTGCTAGCT ATGGTAAGGG AAAG3GACCT TTCCGAAAAA 2940 

CATCTTTTGG GGAAATAAAA ATGTGGACTG TGAAAAAAAA AAAAAAAAAA AAA 2993 



| | | | | |

GGGCAATTTG TTAGTTATCC GCCGCCACCA AGACGCGGCA CGGCGCCTGG ACCGGAGGGG 
CCCCGCGCGG GCGCGAACTT TGGGCTCGGG CGAGTGGGTG GTGCTCCGCC CAGCCCGAGA 
CGGGCGGGCG CGCGG3CCAA TGGGTGCCGC CTCTTCrPi CGGGG f 
GTCCCGGCCA CCAGCGCCCC AGCCCCGAGG CTCAGAAGCG GCAGGCGGAG GCGCGGTCCG 
GGCGCTATGG CCATGCCCGG CGGGTCTCAC GCGGCTGCCC CTCGCCCGGC GCGCCTTCGG 
TAGGGGGCGC CCGGGGCCCA GCTGGCCCGG CCATGCTGCT GGAGACACAG GACGCGC7GT 
ACGTGGCGCT GGAGCTGGTC ATCGCCGCGC TTTCGGTGGC GGGCAACGTG CTGGTGTGCG 
CCGCGGTGGG CACGGCGAAC ACTCTGCAGA CGCCCACCAA CTACTTCCTG GIGTCCCTGG 
CTGCGGCCGA CGTGGCCGTG GGGCTCTTCG CCATCCCCTT TGCCATCACC ATCAGCCTGG 
GCTTCTGCAC TGACTTCTAC GGCTGCCTCT TCCTCGCCTG CTTCGTGCTG GTGCTCAOGC 
AGAGCTCCAT CTTCAGCCTT CTGGCCGTGG CAGTCGACAG ATACCTGGCC ATCTGTGTCC 
CGCTCAGGTA TAAAAGTTTG GTCACGGGGA CCCGAGCAAG AGGGGTCATT GCTGTCCTCT 
GGGTCCTTGC CTTTGGCATC GGATTGACTC CATTCCTGGG GTGGAACAGT AAAGACAGTG 
CCACCAACAA CTGCACAGAA CCCTGGGATG GAACCACGAA TGAAAGCTGC TGCCTTGTGA 
AGTGTCTCTT TGAGAATGTG GTCCCCATGA GCTACATGGT ATATTT CAAT TTCTTTGGGT 
GTGTTCTGCC CCCACTGCTT ATAATGCTGG TGATCTACAT TAAGATCTTC CTGGTGGCCT 
GCAGGCAGCT TCAGCGCACT GAGCTGATGG ACCACTCGAG GACCACCCTC CAGCGGGAGA 
TCCATGCAGC CAAGTCACTG GCCATGATTG TGGGGATTTI TGCCCTGTGC TC-GTTACCTG 
TGCATGCTGT TAACTGTGTC ACTCTTTTCC AGCCAGCTCA GGGTAAAAAT AAGCCCAAGT 
GGGCAATGAA TATGGCCATT CTTCTGTCAC ATGCCAATTC AGTTGTCAAT CCCATTGTCT 
ATGCTTACCG GAACCGAGAC TTCCGCTACA CTTTTCACAA AATTATCTCC AGGTATCTTC 
TCTGCCAAGC AGATGTCAAG AGTGGGAATG GTCAGGCTGG GGTACAGCCT GCTCTCGGTG 
TGGGCCTATG ATCTAGGCTC TCGCCTCTTC CAGGAGAAGA TACAAATCCA CAAGAAACAA 
AGAGGACACG GCTGGTTTTC ATTGTGAAAG ATAGCTACAC CTCACAAGGA AATGGACTGC 
CTCTCTTGAG CACTTCCCTG GAGCTACCAC GTATCTAGCT AATATGTATG TGTCAGTAGT 
AGGCTCCAAG GATTGACAAA TATATTTATG ATCTATTCAG CTGCTTTTAC TGTGTGGATT 
ATGCCAACAG CTTGAATGGA TTCTAACAGA CTCTTTTGTT TTTAAAAGTC TGCCTTGTTT 
ATGGTGGAAA ATTACTGAAA CTATTTTACT GTGAAACAGT GTGAACTATT ATAATGCAAA 
TACTTTTTAA CTTAGAGGCA ATGGAAAAAT AAAAGTTGAC T 



CCCAGCCCGG CCCCGCCGCC CCGGCTGCGC ACGCGACGCC CCCTCCAGGC CCCGCTCCTG 60 

CCCCCTATTT GGTCATTCGG GGGGCAAGCG GCGGGAGGGG AAACGTGCGC GGCCGAAGGG 120

GAAGCGGAGC CGGCGCCGGC TGCGCAGAGG AGCCGCTCTC GCCGCCGCCA CCTCGGCTGG 180 

GAGCCCACGA GGCTGCCGCA TCCTGCCCTC GGAACAATGG GACTCGGCGC GCGAGGTGCT 240

TGGGCCGCGC TGCTCCTGGG CACCCTGCAG GTGCTAGCGC TGCTGGGGGC CGCCCATGAA 300

AGCGCAGCCA TGGCGGAGAC TCTCCAACAT GTGCCTTCTG ACCATACAAA TGAAACTTCC 360

AACAGTACTG TGAAACCACC AACTTCAGTT GCCTCAGACT CCAGTAATAC AACGGTCACC 420

ACCATGAAAC CTACAGCGGC ATCTAATACA ACAACACCAG GGATGGTCTC AACAAATATG 480 

ACTTCTACCA CCTTAAAGTC TACACCCAAA ACAACAAGTG TTTCACAGAA CACATCTCAG 540 

ATAICAACAT CCACAATGAC CGTAACCCAC AATAGTTCAG TGACATCTGC TGCTTCATCA 600

GTAACAATCA CAACAACTAT GCATTCTGAA GCAAAGAAAG GATCAAAATT TGATACTGGG 660 

AGCITTGTTG GTGGTATTGT ATTAACGCTG GGAGTTTTAT CTATTCTTTA CATTGGATGC 720 

AAAATGTAIT ACT CAAG AAG AGGCATTCGG TATCGAACCA TAGATGAACA TGATGCCATC 780 

ATTTAAGGAA ATCCATGGAC CAAGGATGGA ATACAGATTG ATGCTGCCCT ATCAATTAAT 840 

TTTGGTTTAT TAATAGTTTA AAACAATATT CTCTTTTTGA AAATAGTATA AACAGG CCAT 900 

GCATATAAIG TACAGTGTAT TACGTAAATA TGTAAAGATT CTTCAAGGTA ACAAGGGTTT 960 

GGGTTTTGAA ATAAACATCT GGATCTTATA GACCGTTCAT ACAATGGTTT TAGCAAGTTC 1020 

ATAGTAAGAC AAACAAGTCC TATCTTTTTT TTTTTGGCTG GGGTGGG3GC ATTGGTCACA 1080 

TATGACCAGT AATTGAAAGA CGTCATCACT GAAAGACAGA ATGCCATCTG GGCATACAAA 1140 

TAAGAAGTTT GTCACAGCAC TCAGGATTTT GGGTATCTTT TGTAGCTCAC ATAAAGAACT 1200 

TCAGTGCTTT TCAGAGCTGG ATATATCTTA ATTACTAATG CCACACAGAA ATTATACAAT 1260 

CAAACTAGAT CTGAAGCATA ATTTAAGAAA AACATCAACA TTTITTGIGC TTTAAACTGT 1320 

AGTAGTTGGT CTAGAAACAA AATACTCCAA GAAAAAGAAA ATTTTCAAAT AAAACCCAAA 1380 

ATAATAGCTT TGCTTAGCCC TGTTAGGGAT CCATTGGAGC ATTAAGGAGC ACATATTTTT 1440 

ATTAACTTCT TTTGAGCTTT CAATGTTGAT GTAATTTTTG TTCTCTGTGT AATTTAGGTA 1500 

AACTGCAGTG TTTAACATAA TAATGTTTTA AAGACTTAGT TGTCAGTATT AAATAATCCT 1560 

GGCATTATAG GGAAAAAACC TCCTAGAAGT TAGATTATTT GCTACTGTGA GAATATTGTC 1620 

ACCACTGGAA GTTACTTTAG TTCATTTAAT TTTAATTTTA TATTTTGTGA ATATTTTAAG 1680 

AACTGTAGAG CTGCTTTCAA TATCTAGAAA TTTTTAATTG AGTGTAAACA CACCTAACTT 1740 

TAAGAAAAAG AACCGCTTGT ATGATTTTCA AAAGAACATT TAGAATTCTA TAGAGTCAAA 1800 
ACTATAGCGT AATGCTGTGT TTATTAAGCC AGGGATTGTG GGACTTCCCC C 
AACCTGCAGG ATGAAAATGC TATATTTTCT TTCATGCACT GTCGATATTA C 

GGGAAATGAC ATTTTTATAC TAAAACAAAC ACCAAAATAT TTTAGAATAA A 

AGTTTTGAGA GGAATTTTTA GAGAGGACAT TTCCTCCTTC CTGATTTGGA TATTCCCTCA 
AATCCCTCCT CTTACTCCAT GCTGAAGGAG AAGTACTCTC AGATGCATTA T 
GAGAAAAAGC ACAGTATTGT AGAGACACCA ATATTAGCTA A 

CATTTTACAG TTTATATTCC AGCACTCAAA ACTCAGGGTC AAGTTTTAAC AAAAGAGGTA 
TGTAGTCACA GTAAATACTA AGATGGCATT TCTATCTCAG AGGGCCAAAG TGAATCACAC 
CAGTTTCTGA AGGTCCTAAA AATAGCTCAG ATGTCCTAAT GAACATGCAC C 

TAGGAGTACA ATAAAACTGT TGTCAGCTTT TGTTTTACAG AGAACGCTAG A 

TTTTGAAATG GATCATTTCT ACTTGCTGTG CATTTTAACC AATAATCTGA TGAATATAGA 
AAAAAATGAT CCAAAATATG GATATGATTG GA7GTATGTA ACACATACAT GGAGTATGGA 2520 
GGAAATTTTC TGAAAAATAC ATTTAGAT"A G I AGA3GT GGGCTGATGG 2580 
.. ... 2 ^ 

2700 
TTCTTGGTAT TTCACTACTT CAAGTCAGTC AGAACTTCGT AGACCGACCT GAAGTTTCTT 2760

TTTGAATACT TGTTTCTTTA GCACTTTGAA GATAGAAAAA CCACTTTTTA AGTACTAAGT 2820 

CATCATTTGC CTTGAAAGTT TCCTCTGCAT TGGGTTTGAA GTAGTTTAGT TATGTCTTTT 2880 

TCTCTGTATG TAAGTAGTAT AATTTGTTAC TTTCAAATAC CCGTACTTTG AATGTAGGTT 2940 

TTTTTGTTGT TGTTATCTAT AAAAATTGAG GGAAATGGTT ATGCAAAAAA ATATTTTGCT 3000 

TTGGACCATA TTTCTTAAGC ATAAAAAAAT GCTCAGTTTT GCTTGCATTC CTTGAGAATG 3060 

TATTTATCTG AAGATCAAAA CAAACAATCC AGATGTATAA GTACTAGGCA GAAGCCAA1T 3120

TTAAAATTTC CTTGAATAAT CCATGAAAGG AATAATTCAA ATACAGATAA ACAGAGTTGG 3180 

CAGTATATTA TAGTGATAAT TTTGTATTTT CAAMAAAAAA AAAGTTAAAC TCTTCTTTTC 3240 

TTTTTATTAT AATGACCAGC TTTTGGTATT TCATTGTTAC CAAGTTCTAT TTTTAGATAA 3300 

AATTGTTCTC CTTCTAAAAA AAAAAAAAAA AAAAAAAA 3338 

Seq ID NO: C121 DNA Sequence 

Nucleic Acid Accession #: NM_004195

Coding sequence:







AGGAGTGCTG T 
TGCGGAGACC CTTGCTGCAC G 
CAGTCCCAGG GGAAATT 

TCCGGGGGCC ACGAAGGGCA CTGCAAACCT TGGACAGACT GCACCCAGTT CGGGTTTCTC 
ACTGTGTTCC CTGGGAACAA GACCCACAAC GCTGTGTGCG TCCCAGGGTC C 
GAGCCGCTTG GGTGGCTGAC CGTCGTCCTC CTGGCCGTGG C 

ACCTCGGCCC AGCTTGGACT GCACATCTGG CAGCTGAGGA GTCAGTGCAT GTGGCCCCGA 
GAGACCCAGC TGCTGCTGGA GGTGCCGCCG TCGACCGAAG ACGCCAGAAG CTGCCAGTTC 
CCCGAGGAAG AGCGGGGCGA GCGATCGGCA GAGGAGAAGG GGCGGCTGGG AGACCTGTGG 

Seq ID NO: C122 DNA Sequence
Coding sequence: 



CAGATCTCCT GGGTCTTCTT CTCCCACCAC CTCTCCCTCC TCCTGGGCAG CGCCCTCGGG 
GGCGTCTTCA AAAGGACCCT GGCCCAGTCA CTATGGGCCC TGTTCACCTC CTCTCTGGCC 
ATCTCCCTGG TGTTTGCCGT CATCCCCTTC TGCCGCGACG TGAAGGTGCT GGCCTCAGTC 
ATGGCGCTGG CGGGCTTGGC CATGGGCTGC ATCCACACCC TGGCCAACAT GCAGCTGGTA 
AGGATGTACC AGAAGGACTC GGCCGTCTTC CTCCAGGTGC TCCATTTCTT CGTGGGCTTT 
GGTGCTCTGC TGAGCCCCCT TATTGCTGAC CCTTTCCTGT CTGAGGCCAA CTGCTTGCCT 
i M CA CGGCCAACAC CACCTCCCGA GGCCACCTGT TCCATGTCTC CAGGGTGCTG 

GGCCAGCACC ACGTAGATGC CAAGCCTTGG TCCAACCAGA CGTTCCCAGG GCIGACTCCA 
AAGGACGGGG CAGGGACCCG AGTGTCCTAT GCCTTCTGGA TCATGGCCCI CATCGATCTT 
CCAGTGCCCA TGGCTGTGCT GATGCTGCTG TCCAAGGAGC GG CTGCTG AC CTGCTGTCCC 
CAGAGGAGGC CCCTGCTICT GTCTGCTGAT GAGCTTGCCT TGGAGACACA GCCTCCTGAG 
AAGGAAG AT G CCTCCTCACT GCCCCCAAAG TTTCAGTCAC ACCTAGGGCA TGAGGACCTG 
TTCAGCTGCT GCCAAAGGAA GAACCTCAGA GGAGCCCCTT ATTCCTTCTT TGCCATCCAC 
ATCACGGGCG CCCTGGTACT GTTCATGACG GATGGGTTGA CGGGTGCCTA TTCCGCCTTC 
GTGTACAGCT ATGCTGTGGA GAAGCCCCTG TCTGTGGGAC ACAAGGTGGC TGGCTACCTC 
CCCAGCCTCT TCTGGGGCTT CATCACACTG GGCCGGCTCC TCTCCATTCC CATATCCTCA 
AGAATGAAGC CGGCCACCAT GGTTTTCATC AACGTGGTTG GCGTGC-TGGT GACGTTCCTG 
GTGCTGCTTA TTTTCTCCTA CAACGTCGTC TTCCTGTTCG TGGGGACGGC AAGCCTGGGC 
CTGTTTCTCA GCAGCACCTT CCCCAGCATG CTGGCCTACA CGGAGGACTC GCIGCAGTAC 
AAAGGCTGTG CAACCACAGT GCTGGTGACA GGGGCAGGAG TTGGCGAGAT GGTGCTGCAG 
ATGCTGGTTG GTTCGATATT CCAGGCTCAG GGCAGCTATA GTTTCCTGGT CTGTGGCGTG 
ATCTTTGGTT GTCTGGCTTT IACCITCTAT ATCTTGCTCC TGTTITTCCA CAGGATGCAC 
CCTGGACTCC CATCAGTTCC TACCCAAGAC AGATCAATTG GAATGGAAAA CTCTGAGTGC 
TACCAGAGGT AAAACTGGGT GAAGAAGGCA AGAGAAGACT TTCAGCCTCT TGATCACCAG 
CACGACCATA CTGITTCAGA AAGCTGGGTG GTGGTGGAGG CGCTCTCTCA AIGGCTATTC 
AAGTCTTCTC CACIAAAACT TGGTTGGGTA GAGGAAATTA AATTGAGTCC TGGTACCTGG 
TCAAAATCAI TAGAAGTTTA CCTGGCTTCT CAAGTTATCT TCTTCCCTGG TTCAGACTGT 
TGGTAAGAGC TGTCCAGATA CCCAGATGGG AAGGAAGGAG ACAGCCGCGC GCTTCACTCC 
ATTTGICACC TCATG CATGG ACCATACTCT GGGTTTGAGA TCATTCTTCA T 
3 TTGAAATTGT AAAGCTCCAT GATCACIGCT ATATGTAGAT A 
A CAAGCTGCAA GTTATTCCCT GGCATGCTCA AAGGATTTTC GTGCTITTCA 
CTTAATAGTC CAAAGTCTCT TAAATTCCTG CTGCAGATAT C 

A AAGGAAAAGT TGAATCTTGC TC < ATA TAATGT A 

I CATACCAACT TAGAAAAGCT GATTGTAAAA TTTTCATTTT GACAGCTGGT 
TATTAAATGC AGCCATTATT AAAAATCAAA TCATAC? 
TTTAAAACAA AGGTAATAAA TATTCAAAGC ATATCACTTC C 

Seq ID NO: C123 DNA Sequence 
Nucleic Acid Accession #: NM_002203.2 
Coding sequence: 43..3588
CTGCAAACCC AGCGCAACTA CGGTCCCCCG GTCAGACCCA GGATGGGGCC AGAACGGACA 60

GGGGCCGCGC CGCTGCCGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 

TGTTTGGCCT ACAATGTTGG TCTCCCAGAA GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 

CAGTTTGGGT ATGCAGTGCA GCAGTTTATA AATCCAAAAG GCAACTGGTT ACTGGTTGGT 240 

TCACCCTGGA GTGGCTTTCC TGAGAACCGA ATGGGAGATG TGTATAAATG TCCTGTTGAC 300

CTATCCACTG CCACATGTGA AAAACTAAAT TTGCAAACTT CAACAAGCAT TCCAAATGTT 360

ACTGAGATGA AAACCAACAT GAGCCTCGGC TTGATCCTCA CCAGGAACAT GGGAACTGGA 420

GGTTTTCTCA CATGTGGTCC TCTGTGGGCA CAGCAATGTG GGAATCAGTA TTACACAACG 480

GGTGTGTGTT CTGACATCAG TCCTGATTTT CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 540

CAGCCCTGCC CTTCCCTCAT AGATGTTGTG GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600

CCTTGGGATG CAGTAAAGAA TTTTTTGGAA AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660

ACAAAGACAC AGGTGGGGTT AATTCAGTAT GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

AACACATATA AAACCAAAGA AGAAATGATT GTAGCAACAT CCCAGACATC C CAA7 ATGGT 780 

GGGGACCTCA CAAACACATT CGGAGCAATT CAATATGCAA GAAAATATGC CTAT7CAGCA 840 

GCTTCTGGTG GGCGACGAAG TGCTACGAAA GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

CATGATGGTT CAATGTTGAA AGCTGTGATT GATCAATGCA ACCATGACAA TATACTGAGG 960 

TTTGGCATAG CAGTTCTTGG GTACTTAAAC AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

AAAGAAATAA AAGCGATCGC TAGTATTCCA ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

GAAGCAGCTC TACTAGAAAA GGCTGGGACA TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

ACTGTTCAAG GAGGAGACAA CTTTCAGATG GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

TACTCTTCTC AAAATGATAT TCTGATGCTG GGTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 

ACCATTGTCC AGAAGACATC TCATGGCCAT TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

ATTCTGCAGG ACAGAAATCA CAGTTCATAT TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380 

GGAGAAAGCA CTCACTTTGT TGCTGGTGCT CCTCGGGCAA AT7ATACCGG CCAGATAGTG 1440 

CTATATAGTG TGAATGAGAA TGGCAATATC ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

GACGTGCTCT TGGTAGGTGC ACCAATGTAC ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620

GTCTACCTGT TTACTATCAA AAAGGGCATT TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GAGGGCATTG AAAACACTCG ATTTGGTTCA GCAATTGCAG CTCTTTCAGA CATCAACATG 1740

GATGGCTTTA ATGATGTGAT TGTTGGTTCA CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800

TACATTTACA ATGGTCATCA GGGCACTATC CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860

TCCGATGGAG CCTTTAGGAG CCATCTCCAG TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920

GATTTAAATG GGGATTCCAT CACCGATGTG TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

CTCTGGTCAC AAAGTATTGC TGATGTAGCT ATAGAAGCTT CATTCACACC AGAAAAAATC 2040

ACTTTGGTCA ACAAGAATGC TCAGATAATT CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100

CCTACTAAGC AAAACAATCA AGTGGCCATT GTATATAACA TCACACTTGA TGCAGATGGA 2160

TTTTCATCCA CACTAACCTC CAGGGGGTTA TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 2220

AAGAATATGG TAGTAAATCA AGCACAGAGT TGCCCCGAGC ACATCATTTA TATACAGGAG 2280

CCCTCTGATG TTGTCAACTC TTTGGATTTG CGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340

ACTAGCCCTG CCCTTCAAGC CTATTCTGAG ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

AAAGACTGTG GTGAGGATGG ACTTTGCATT TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

CCAGCTGCTC AAGAACAACC CTTTATTGTC AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

GTAACACTGA AAAATAAAAC CCAAACTCCA TACAACACTG GAATTGTTGT TGATTTTTCA 2580

GAAAACTTGT TTTTTGCATC ATTCTCCCTA CCGGTTGATG GGACAGAAGT AACATGCCAG 2640

GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC GAIGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700

CAACAGGTGA CTTTTACTAT TAACTTTCAC TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

AACCTCAAAA TTCCTCTCCT GTATGATGCT GAAATTCACT TAACAAGATC TACCAACATA 2880 

AATTTTTATG AAATCTCTTC GGATGGGAAT GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 

GTTGGTCCAA AATTCATCTT CTCCCTGAAG GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

GCAACTGTAA TCATCCACAT CCCTCAGTAT ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 

ACTGGGGTGC AAACAGACAA GGCTGGTGAC ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120

AAAATAGGAC AAACATCTTC TTCTGTATCT TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

GAATTGAACT GCAGAACTGC TTCCTGTAGT AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

ATGAAAGGAG AATACTTTGT TAATGTGACT ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300

TCAACGTTCC AGACAGTACA GCTAACGGCA GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 

ATATATGTGA TTGAAGATAA CACTGTTACG ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420

AAAGCCGAAG TACCAACAGG AGTTATAATA GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

TTAGCTCTGG TTGCAATTTT ATGGAAGCTC GGCTTCTTCA AAAGAAAATA TGAAAAGATG 3540 

ACCAAAAATC CAGATGAGAT TGATGAGACC ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

ACCTGCAGTG GGAACCGGCA GCATCCCAGC CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

TTTTTAAATC CCATATTTTT TTTATCATGT CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

AAAACTGCAG GTCAGTTTGG ATGAAGAAAT TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

GTAGGGAAAT AATAGGGAAA ATACCTATTT TATATGATGG GGGAAAAAAA GTAATCTTTA 3840 

AACTGGCTGG CCCAGAGTTT ACATTCTAAT TTGCATTG-G TCAGAAACAT GAAATGCTTC 3900 

CAAGCATGAC AACTTTTAAA GAAAAATATG ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

TCTCTTTAAA ATATTTGTCT TTAAACAGCA ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020

GTACTTCCAC TTGTGTATAT TTTAATGAAT ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 
ACAGGTTTTT TCAATTTATG CTGCTCATCC AAAGTTGCCA C 
TAATTTTATT TATAAACTAG GTAAAATTTG TTGTTGGTTC C 
CTTCCACACC CCATCTTGCT CTAATGATCA AAACATC 

TACCTCCTAT ATGTCCATTT AAGTTAGGAG AGGGGGCGAT ATAGAGACTA AGGCACAAAA 
TTTTGTTTAA AACTCAGAAT ATAACATTTA TGTAAAATCC CATCTGCTAG AAGCCCATCC 
TGTGCCAGAG GAAGGAAAAG GAGGAAATTT CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 
CTTCTAGGAT TTGTTTGGCT GACTGGCAGT AACCTAGTGA ATTTTTGAAA GATGAGTAAT 
TTCTTTGGCA ACCTTCCTCC TCCCTTACTG AACCACTCTC CCACCTCCTG GTGGTACCAT 
TATTATAGAA GCCCTCTACA GCCTGACTTT CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 
TTACCCCTCA TCCAAAGTTC CCACTCCTTC AGGACAGCTG CTGTGCATTA GATATTAGGG 
GGGAAAGTCA TCTGTTTAAT T TACACACTT GCATGAATTA CTGTATATAA ACTCCTTAAC 
TTCAGGGAGC TATTTTCATT TAGTGCTAAA CAAGTAAGAA AAATAAGCTA GAGTGAATTT 
CTAAATGTTG GAATGTTATG GGATGTAAAC AATGTAAAGT AAAACACTCT CAGGATTTCA 
CCAGAAGTTA CAGATGAGGC ACTGGAAACC ACCACCAAAT TAGCAGGTGC ACCTTCTGTG 
GCTGTCTTGT TTCTGAAGTA CTTTTTCTTC CACAAGAGTG AATTTGACCT AGGCAAGTTT 
GTTCAAAAGG TAGATCCTGA GATGATTTGG TCAGATTGGG ATAAGGCCCA G 
TTTTAACAAG CACCCCAGTC ACTAGGATGC AGATGGACCA CACTTTGAGA AACACCACCC 5100

ATTTCTACTT TTTGCACCTT ATTTTCTCTG TTCCTGAGCC CCCACATTCT CTAGGAGAAA 5160 

CTTAGATTAA AATTCACAGA CACTACATAT CTAAAGCTTT GACAAGTCCT TGACCTCTAT 5220 

AAACTTCAGA GTCCTCATTA TAAAATGGGA AGACTGAGCT GGAGTTCAGC AGTGATGCTT 5280

TTTAGTTTTA AAAGTCTATG ATCTGATCTG GACTTCCTAT AATACAAATA CACAATCCTC 5340

CAAGAATTTG ACTTGGAAAA G 5361 







A GAGACATGTC 
G TCGGGGCAGC ATGCCCTTCT C 

T ATGATGGCTG GGAGCTCTGG CAGCCATACG 1380 

G GCACAATTAG ATAGGAAGAG TCTGGATCTC 1440 

TGATGATCAC AGAGCCATCC TAACAAACGG AATATCACCG ACCCTCCTTT ATGTGAGAGA 1500 

GAAATAAACA TCTATGAAA 1519 

Seq ID NO: C125 DNA Sequence
Nucleic Acid Accession #: NM_004154
Coding sequence: 309..1295



AAGGACAGAG GAGGGGCCCT TCCTGTCAGC TGGCTGGGAG CAGAGGTGGC TTTGTCTTTT 
CGGAAGAACT GGTTCTGTGG AATTTGTGCT TATTTCCCAT CAAGGATCAA GGACCTGCTC 
TGGGGCTACC TCAGGGCCCC ACAGGATGAG GGGCTGGTTT T CAGATGAGT TTTCTGCTTG 
CCTGTCATCT GGATAGTGTC TAAAAATTTG CAAACTGCCT TCTTGTCAGT GTCTTGCTCA 
TTCTTCATGA CACTCCTGAT ATGTCTCTCA GTTTCCTCAT CTGCTGCCTC TCCAGACTTC 
TGCCAGAACA TTGCACGCGA CAGTTTCAGG CACAGAACTG ACTGGCAGCA GGGGCTGCTC 
CACGAGTGGG AATTTGCTCC AGCACTTCAC GGACTGCAAG CGAGGCACTT GCTAACTCTT 
GGATAACAAG ACCTCTGCCA GAAGAACCAT GGCTTTGGAA GGCGGAGTTC AGGCTGAGGA 
GATGGGTGCG GTCCTCAGTG AGCCCCTGCC TCCCTGAACA TAGGAAACCC ACCTGGGCAG 
CCATGGAATG GGACAATGGC ACAGGCCAGG CTCTGGGCTT GCCACCCACC ACCTGTGTCT 
ACCGCGAGAA CTTCAAGCAA CTGCTGCTGC CACCTGTGTA TTCGGCG3TG CTGGCGGCTG 
GCCTGCCGCT GAACATCTGT GTCATTACCC AGATCTGCAC GTCCCGCCGG GCCCTGACCC 
GCACGGCCGT GTACACCCTA AACCTTGCTC TGGCTGACCT GCTATATGCC TGCTCCCTGC 
CCCTGCTCAT CTACAACTAT GCCCAAGGTG ATCACTGGCC CTTTGGCGAC TTCGCCTGCC 
GCCTGGTCCG CTTCCTCTTC TATGCCAACC TGCACGGCAG CATCCTCTTC CTCACCTGCA 
TCAGCTTCCA GCGCTACCTG GGCATCTGCC ACCCGCTGGC CCCCTGGCAC AAACGTGGGG 
GCCGCCGGGC TGCCTGGCTA GTGTGTGTAG CCGTGTGGCT GGCCGTGACA ACCCAGTGCC 
TGCCCACAGC CATCTTCGCT GCCACAGG CA TCCAGCGTAA CCGCACTGTC TGCTATGACC 
TCAGCCCGCC TGCCCTGGCC ACCCACTATA TGCCCTATGG CATGGCTCTC ACTGTCATCG 
GCTTCCTGCT GCCCTTTGCT GCCCTGCTGG CCTGCTACTG TCTCCTGGCC TGCCGCCTGT 
GCCGCCAGGA TGGCCCGGCA GAGCCTGTGG CCCAGGAGCG GCGTGGCAAG GCGGCCCGCA 
TGGCCGTGGT GGTGGCTGCT GCCTTTGCCA TCAGCTTCCT GCCTTTTCAC ATCACCAAGA 
CAGCCTACCT GGCAGTGCGC TCGACGCCGG GCGTCCCCTG CACTGTATTG GAGGCCTTTG 

TCTTCTACTT CACCCAGAAG AAGTTCCGCC GGCGACCACA TGAGCTCCTA CAGAAACTCA 
CAGCCAAATG GCAGAGGCAG GGTCGCTGAG TCCTCCAGGT CCTGGGCAGC C 



TCAGCTCAGC TGGGCATGGA GTTAAGATCC CTCACAGGAC CCAGAAGCTC ACCAAAAACT 
ATTTCTTCAG CCCCTTCTCT GGCCCAGACC CTGTGGGCAT GGAGATGGAC AGACCTGGGC 
CTGGCTCTTG AGAGGTCCCA GTCAGCCATG GAGAGCTGGG GAAACCACAT TAAGGTGCTC 
ACAAAAATAC AGTGTGACGT G 
:CCGCCT GTGGCTGGTC CTGCAGGTGA
C AGCTCCAT GG ACATGGAGCG CCCGGGCGAC GGCAAATGCC 
AGCCCATCGA GATCCCGATG TGCAAGGACA TCGGCTACAA CATGACTCGT ATGCCCAACC 
TGATGGGCCA CGAGAACCAG CGCGAGGCAG CCATCCAGTT GCACGAGTTC GCGCCGCIGG 
TGGAGTACGG CTGCCACGGC CACCTCCGCT TCTTCCTGTG CTCGCTGTAC GCGCCGATGT 
GCACCGAGCA GGTCTCTACC CCCATCCCCG CCTGCCGGGT CATGTGCGAG CAGGCCCGGC 
TCAAGTGCTC CCCGATTATG GAGCAGTTCA ACTT CAAGTG GCCCGAOTCC CTGGAOTGCC 
GGAAACTCCC CAACAAGAAC GACCCCAACT ACCTGTGCAT GGAGGCGCCC AACAACGGCT 
CGGACGAGCC CACCCGGGGC TCGGGCCTGT TCCCGCCGOT GTTCCGGCCG CAGCGGCCCC 
ACAGCGCGCA GGAG CACCCG CTGAAGGACG GGGGCCCCGG GCGCGGCGGC TGCGACAACC 
CGGGCAAGTT CCACCACGTG GAGAAGAGCG CGTCGTGCGC GCCGCTCTGC ACGCCCGGCG 
TGGACGTGTA CTGGAGCCGC GAGGACAAGC GCTTCGCAGT GGTCTGGCTG GCCATCTGGG 
CGGTGCTGTG CTTCTTCTCC AGCGCCTTCA CCGTGCTCAC CTTCCTCATC GACCCGGCCC 
GCTTCCGCTA CCCCGAGCGC CCCATCATCT TCCTCTCCAT GTGCTACTGC GTCTACTCCG 
TGGGCTACCT CATCCGCCTC TTCGCCGGCG CCGAGAGCAT CGCCTGCGAC CGGGACAGCG 
GCCAGCTCTA TGTCATCCAG GAGGGACTGG AGAGCACOGG CTGCACGCTG GTCTTCCTGG 
TCCTCTACTA CTTCGGCATG GCCAGCTCGC TGTGGTGGGT GGTCCTCACG CTCACCTGGT 
TCCTGGCCGC CGGCAAGAAG TGGGGCCACG AGGCCATCGA AGCCAACAQC AGCTACTTCC 
ACCTGGCAGC CTGGGCCATC CCGGCGGTGA AGACCATCCT GATCCTGGTC ATGCGCAGGG 
TGGCGGGGGA CGAGCTCACC GGGGTCTGCT ACGTGGGCAG CATGGACGTC AACGCGCTCA 
CCGGCTTCGT GCTCATTCCC CTGGCCTGCT ACCTGGTCAT CGGCACGTCC TTCATCCTCT 
CGGGCTTCGT GGCCCTGTTC CACATCCGGA GGGTGATGAA GACGGGCGGC GAGAACACGG 
ACAAGCTGGA GAAGCTCATG GTGCGTATCG GGCTCTTCTC TGTGCTGTAC ACCGTGCCGG 
CCACCTGTGT GATCGCCTGC TACTTTTACG A 

CCGCCTCCAT CCCCGCCGTG GAGATCTTCA TGGTGAAGAT CTTTATGCTG CTGGTGGTGG 



TAAGATGTGG TGCTTTTCTT GGTTGTGTTT TTCTTTCTTC T 



TTTGGTTTTC CAGCGAAGGG 



T TCAAGAGCCT 

3 CCATCCCCAG GGCCCACCCC CAGGAAGGCC ACAGTGCTGG 
GCGGCATCCC TGCAGAGGAA AGACAGGACC CGGGGCCOGC CTCACACCCC AGTGGATTTG 
GAGTTGCTTA AAATAGACTC TGGCCTTCAC CAATAGTCTC TCTGCAAGAC AGAAACCTCC 
ATCAAACCTC ACATTTGTGA ACTCAAACGA TGTGCAATAC ATTTTTTTCT CTTTCCTTGA 
AAATAAAAAG AGAAACAAGT ATTTTGCTAT ATATAAAGAC AACAAAAGAA ATCTCCTAAC 
A CCCTTCAGTG CTACATTTTG T 
I ATAGACGTTT GGACTGATTT GTGGAAAGGA G 
GGGAGAAGGA TCATTCAAAA GTTACCCAAA GGGCTTATTG ACTCTTTCTA TTGTTAAACA 2640 
AATGATTTCC ACAAACAGAT CAGGAAGCAC TAGGTTGGCA GAGACACTTT GTCTAGTGTA 2700 
TCTGCGTGTG TATATTTGTA ATATATGATA 2760
AAAATATGTT CT^TAAAAAA A 2811 

Seq ID NO: C127 DNA Sequence

Nucleic Acid Accession #: NM_005761.1

Coding sequence: 250..4956

1 11 21 31 41 51 

| | | | | |

GCGAGGAGGA AACGGTGCCG GAGCGCGCAG GGCTTGCTGC CGCCACCGCC GCTGCACAGG 60 
CTGCCGGAGC GAGCCTGCCG CGCGCCGCCC TCCCCGCTCT CCTTCCTGGG CGAGCTGCGG 120 



CTCCGTTGCC GCGCGCCTGA G 

CCCAGCCCCA TGGAGGTCTC CCGGAGGAAG GCGCCGCCGC GCCCCCCGCG CCCCGCAGCG 

CCACTGCCCC TGCTCGCCTA TCTGCTGGCA CTGGCGGCTC CCGGCCGGGG CGCGGACGAG 

CCCGTGTGGC GGTCGGAGCA AGCCATCGGA GCCATCGCGG CGAGCCAGGA GGACGGCGTG 

TTTGTGGCGA GCGGCAGCTG CCTGGACCAG CTGGACTACA GCCTGGAGCA CAGCCTCTCG 

CGCCTGTACC GGGACCAAGC GGGCAACTGC ACAGAGCCGG TCTCGCTGGC GCCCCCCGCG 

CGGCCCCGGC CCGGGAGCAG CTTCAGCAAG CTGCTGCTGC CCTACCGCGA GGGGGCGGCC 

GGCCTCGGGG GGCTGCTGCT CACCGGCTGG ACCTTCGACC GGGGCGCCTG CGAGGTGCGG 

CCCCTGGGCA ACCTGAGCCG CAACTCCCTG CGCAACGGCA CCGAGGTGGT GTCGTGCCAC 

CCGCAGGGCT CGACGGCCGG CGTGGTGTAC CGCGCGGGCC GGAACAACCG CTGGTACCTG 

GCGGTGGCCG CCACCTACGT GCTGCCTGAG CCGGAGACGG CGAGCCGCTG CAACCCCGCG 

C ACGACACGGC CATCGCGCTC AAGGACACGG AGGGGCGCAG CCTGGCCACG 

G GGCGCCTCAA GCTGTGCGAG GGCGCGGGCA GCCTGCACTT CGTGGACGCC 

TTTCTCTGGA ACGGCAGCAT CTACTTCCCC TACTACCCCT ACAACTATAC GAGCGGCGCT 

GCCACCGGCT GGCCCAGCAT GGCGCGCATC GCGCAGAGCA CCGAGGTGCT GTTCCAGGGC 

CAGGCATCCC TCGACTGCGG CCACGGCCAC CCCGACGCCC GCCGCCTGCT CCTCTCCTCC

AGCCTAGTGG AGGCCCTGGA CGTCTGGGCG GGAGTGTTCA GCGCGGCCGC TGCAGAGGGC

GCGCGCGCCA AGAGGGTCAG CTGGGACTTC AAGACG3CCG AGAGCCACTG CAAAGAAGGG 
GATCAACCTG AAAGAGTCCA ACCAATCGCA TCATCTACCT TGATCCATTC CGACCTGACA
TCCGTTTATG GCACCGTGGT AATGAACAGG ACTGTTTTAT 1CTTGGGGAC TGGAGATGGC 
CAGTTACTTA AGGTTATTCT TGGTGAGAAT TTGACTTCAA ATTGTCCAGA GGTTATCTAT 
GAAATTAAAG AAGAGACACC TGTTTTCTAC AAACTCGTTC CTGATCCTGT GAAGAATATC 
TACATTTATC TAACAGCTGG GAAAGAGGTG AGGAGAATTC GTGTTGCAAA CTGCAATAAA 
CATAAATCCT GTTCGGAGTG TTTAACAGCC ACAGACCCTC ACTGCGGTTG GTGCCATTCG
CTACAAAGGT GCACTTTTCA AGGAGATTGT GTACATTCAG AGAACTTAGA AAACTGGCTG 
GATATTTCGT CTGGAGCAAA AAAGTGCCCT AAAATTCAGA TAATTCGAAG CAGTAAAGAA 
AAGACTACAG TGACTATGGT GGGAAGCTTC TCTCCAAGAC ACTCAAAGTG CATGGTGAAG
AATGTGGACT CTAGCAGGGA GCTCTGCCAG AATAAAAGTC AGCCCAACCG GACCTGCACC 
TGTAGCATCC CAACCAGAGC AACCTACAAA GATGTTTCAG TTGTCAACGT GATGTTCTCC 
TTCGGTTCTT GGAATTTATC AGACAGATTC AACTTTACCA ACTGCTCATC ATTAAAAGAA 
TGCCCAGCAT GCGTAGAAAC TGGCTGCGCG TGGTGTAAAA GTGCAAGAAG GTGTATCCAC 
CCCTTCACAG CTTGCGACCC TTCTGATTAT GAGAGAAACC AGGAACAGTG TCCAGTGGCT 
h CATCAGGAGG AGGAAGACCC AAGGAGAACA AGGGGAACAG A 



R GTACCTGTGA TAAGGATGTG ATACAGGTTA G 

C ATCAAGCCGG AAAGAAATGA AGGATGTGTG TATCCAGTTT 
GATGGTGGGA ACTGCTCTTC TGTGGGATCC TTATCCTACA TTGCTCTGCC A 



T GTGTGGCGAC TTACTGCGGG TTTTTAGCCC CCAGTTTAAA GAGTTCAAAA 

CTGCAGTATC GGGAGGACCC CAGATTCACG GGGTATCGGG TGGAATCCGA GGTGGACACA 
ATATTTCCAA AAAAGACATT 



GCCCTAATCT G 



3240 



3660 

3840 
3900
3960 
4020 



ATTCTTGAAG ATGGAATCAC CAAGCTAAAC ACCATTGGCC ACIATGAGAT ATCAAATGGA 4140 

TCCACTATAA AAGTCTTTAA GAAGATAGCA AATTTTACTT CAGATGTGGA GTACTCGGAT 4200 

GACCACTGCC ATTTGATTTT ACCAGATTCG GAAGCATTCC AAGATGTGCA AGGAAAGAGA 4260

CATCGAGGGA AGCACAAGTT CAAAGTAAAA GAAATGTATC TGACAAAGCT GCTGTCGACC 4320 

AAGGTGGCAA TTCATTCTGT GCTTGAAAAA CTTTTTAGAA GCATTTGGAG TTTACCCAAC 4380

AGCAGAGCTC CATTTGCTAT AAAATACTTT TTTGACTTTT TGGACGCCCA GGCTGAAAAC 4440

AAAAAAATCA CAGATCCTGA CGTCGTACAT ATTTGGAAAA CAAACAGCCT TCCTCTTCGC 4500 

TTCTGGGTAA ACATCCTGAA GAACCCTCAG TTTGTCTTTG ACATTAAGAA GACACCACAT 4560 

ATAGACGGCT GTTTGTCAGT GATTGCCCAG GCATTCATGG ATGCATTTTC TCTCACAGAG 4620 

CAGCAACTAG GGAAGGAAGC ACCAACTAAT AAGCTTCTCT ATGCCAAGGA TATCCCAACC 4680

TACAAAGAAG AAGTAAAATC TTATTACAAA GCAATCAGGG ATTTGCCTCC ATTGTCATCC 4740 

TCAGAAATGG AAGAATTTTT AACTCAGGAA TCTAAGAAAC ATGAAAATGA ATTTAATGAA 4800 

GAAGTGGCCT TGACAGAAAT TTACAAATAC ATCGTAAAAT ATTTTGATGA GATTCTAAAT 4860

AAACTAGAAA GAGAACGAGG GCTGGAAGAA GCTCAGAAAC AACTCTTGCA TGTAAAAGTC 4920

TTATTTGATG AAAAGAAGAA ATGCAAGTGG ATGTAAGCAC TCTGGGGCCT GGCTTAATCT 4980 

T TGAGCTACTC TGTGTCGTTA 5040 
5100 
5121 

Seq ID NO: C128 DNA Sequence 

Nucleic Acid Accession #: NM_002185.1

Coding sequence: 23..1402



TTTACTTCAA GTCGTTTCTG GAGAAAGTGG CTATGCTCAA AATGGAGACT TGGAAGATGC 
AGAACTGGAT GACTACTCAT TCTCATGCTA TAGCCAGTTG GAAGTGAATG GATCGCAGCA
TTCACTGACC TGTGCTTTTG AGGACCCAGA TGTCAACACC ACCAATCTGG AATTTGAAAT 
C CTCGTGGAGG TAAAGTGCCT GAATTTCAGG AAACTACAAG AGATATATTT 



AAAGAGTCTA ACCTGCAAAA AAATAGACCT AACCACTATA GTTAAACCTG AGGCTCCTTT
TGACCTGAGT GTCATCTATC GGGAAGGAGC C 
ACACTTGCAA AAGAAGT AT G TAAAAG1 

GGATGAAAAC AAATGGACGC ATGTGAMTT AT CCAGCACA AAGCTGACAC T 
AAAGCTCCAA CCGGCAGCAA TGTATGAGAT TAAAGTTCGA TCCATCCCTG ATCACTATTT 
TAAAGGCTTC TGGAGTGAAT GGAGTCCAAG TTATTACTTC AGAACTCCAG AGATCAATAA
TAGCTCAGGG GAGATGGATC CTATCTTACT AACCATCAGC ATTTTGAGTT TTTTCTCTGT 
CGCTCTGTTG GTCATCTTGG CCTGTGTGTT ATGGAAAAAA AGGATTAAGC CTATCGTATG 
GCCCAGTCTC CCCGATCATA AGAAGACTCT GGAACATCTI TGTAAGAAAC CAAGAAAAAA 
TTTAAATGTG AGTTTCAATC CTGAAAGTTT CCTGGACTGC CAGATTCATA GGGTGGATGA 
CATTCAAGCT AGAGATGAAG TGGAAGGTTT TCTGCAAGAT ACGTTTCCTC AGCAACTAGA 
AGAATCTGAG AAGCAGAGGC TTGGAGGGGA TGTGCAGAGC CCCAACTGCC CATCTGAGGA 
TGTAGTCGTC ACTCCAGAAA GCTTTGGAAG AGATTCATCC CTCACATGCC TGGCTGGGAA 
TGTCAGTGCA TGTGACGCCC CTATTCTCTC CTCTTCCAGG TCCCTAGACT GCAG3GAGAG 
TGGCAAGAAT GGGCCTCATG TGTACCAGGA CCTCCTGCTT AGCCTTGGGA CTACAAACAG 



CACGCTGCCC CCTCCATTTT CTCTCCAATC TGGAATCCTG ACATTGAACC CAGTTGCTCA 1320 

GGGTCAGCCC ATTCTTACTT CCCTGGGATC AAATCAAGAA GAAGCATATG TCACCATGTC 1380 

CAGCTTCTAC CAAAACCAGT GAAGTGTAAG AAACCCAGAC TGAACTTACC GTGAGCGACA 1440 

AAGATGATTT AAAAGGGAAG TCTAGAGTTC CTAGTCTCCC TCACAGCACA GAGAAGACAA 1500 

AATTAGCAAA ACCCCACTAC ACAGTCTGCA AGATTCTGAA ACATTGCTTT GACCACTCTT 1560

CCTGAGTTCA GTGGCACTCA ACATGAGTCA AGAGCAT OCT GCTTCTACCA TGTGGATTTG 1620 

GTCACAAGGT TTAAGGTGAC CCAATGATTC AGCTATTT 1658

Seq ID NO: C129 DNA Sequence
Nucleic Acid Accession #:

Coding sequence: 15..302

1 11 21 31 41 51
| | | | | |

ACTCTGGACT CCGGATGGCT GCCGCACGCC TCTGCCTCTC CCTGCTGCTC CTGTCCACCT
GCGTGGCTCT GTTACTACAG CCACTGCTGG GTGCCCAGGG AGCCCCACTG GAGCCAGTGT 
ACCCAGGGGA CAATGCCACA CCAGAGCAGA TGGCCCAGTA TGCAGCTGAT CTCCGTAGAT 
ACATCAACAT GCTGACCAGG CCTAGGTATG GGAAAAGACA CAAAGAGGAC ACGCTGGCCT 
TCTCGGAGTG GGGGTCCCCG CATGCTGCTG TCCCCAGGGA GCTCAGCCCG CTGGACTTAT 

AATGCCACCT TCTGTCTCCT ACGACTCCAT GAGCAGCGCC AGCCCAGCTC TCCCCTCTGC
ACCCTTGGCT CTGGCCAAAG CTTGCTCCCT GCTCCCACAC AGGCTCAATA AAGCAAGTCA 
AAGCC 

Seq ID NO: C130 DNA Sequence 
Nucleic Acid Accession #: NM_032545.1
Coding sequence: 47..718



3 CTTCTGTTTA CGGTCAGT 
CTATCAAAGA GAGAAACATA ACGGCGGT 

A ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 
3 AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 
^ GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 
CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 
CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 
CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 
CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 
ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 
CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 
TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 
TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAA 

Seq ID NO: C131 DNA Sequence 



AGGGAGAGAG GGAGGGGAGG AAATTGGAGA CCCCAGCACC CCCTTGCTCA CTCTCTTGCT 60 

CACAGTCCAC GATGGCCCGG TCCCTGGTGT GCCTTGGTGT CATCATCTTG CTGTCTGCCT 120 

TCTCCGGACC TGGTGTCAGG GGTGGTCCTA TGCCCAAGCT GGCTGACCGG AAGCTGTGTG 180 

CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTCAGGAC TACATGGCCC 240 

CCGACTGCCG ATTCCTGACC ATTCACCGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 300

AGGGCCGTGG GCGGCTCTTC TGGGGAGGCA GCGTTCAGGG AGATTACTAT GGAGATCTGG 360

CTGCTCGCCT GGGCTATTTC CCCAGTAGCA TTGTCCGAGA GGACCAGACC CTGAAACCTG 420
3 ATTTCTACTG CCAGTGAGCT CAGCC7ACCG 

CTGGCCCTGC C 



Coding sequenc 



TCACAGAAAA 



A CACCAGTCCC AGAAAAGCCT A 
A TAAAAGCCCC AGTAAAC 
ACAAAGACTA TAAAACCTTC AGTCAAGGTC ACAGGAGACA fl 
TCTCATCTAA ATAAAACTGA AGTTACTCAT C 
A CGAAGCTGAG TTCTATCACA T 

A AAGATGGCT C ACAGAAAGGT ATCCACGCTG GACAGATGGG A 
TCATTCCCTG CATGGGCCAT AGTTATTGTG GTCCTGGTGG CTGTGATTCT O 
TTCCTTGGCC TGATCTTCTT GGTCTCCTAT ATGATGCGGA CACGCCGCAC ACTAACCCAG 
AACACCCAGT ACAATGATGC AGAGGATGAG GGTGGCCCCA ATTCCTACCC GGTCTACCTG 




Seq ID NO: C133 DNA Sequence 
Nucleic Acid Accession #: NM_o 
Coding sequence: 3..710
| | | | | |

AGATGACACA AGTCACAGAA AAGTCCACAG AACACCCAGA AAAGACCACG TCAACCACAG
AGAAAACCAC AAGAACCCCA GAAAAGCCTA CGCTATACTC AGAGAAGACC ATATGCACCA
AAGGGAAAAA CACACCAGTC CCAGAAAAGC CTACAGAAAA CCTGGGGAAC ACCACACTGA 
CCACTGAGAC CATAAAAGCC CCAGTAAAGT CCACAGAAAA CCCAGAAAAA ACAGCAGCAG 
TCACAAAGAC TATAAAACCT TCAGTCAAGG TCACAGGAGA CAAATCTCTC ACTACTACCT 
CITCTCATCT AAATAAAACT GAAGTTACTC ATCAGGTGCC CACTGGTTCT TTCACCCTCA 
TTACATCTAG AACGAAGCTG AGTTCTATCA CATCAGAAGC CACAGGAAAC GAGAGCCATC 
CATACCTCAA TAAAGATGGC TCACAGAAAG GTATCCACGC TGGACAGATG GGAGAGAATG 
ATTCATTCCC TGCATGGGCC ATAGTTATTG TGGTCGTGGT GGCTGTGATT CTCCTCCTGG 
TGTTCCTTGG CCTGATCTTC TTGGTCTCCT ATATGATGCG GACACGCCGC ACACTAACCC 
AGAACACCCA GTACAATGAT GCAGAGGATG AGGGTGGCCC CAATTCCTAC CCGGTCTACC 
TGATGGAGCA GCAGAATCTT GGCATGGGCC AGATCCCTTC CCCACGGTGA TCTTGGAGTA 
GGCGCCCAGC CCTGGCTCTT CCATGCTCTG CCCCTTTCCT GGATGAGGAA CCGGACTCAC 
AATTTCTATT TCCGGGACTA CAGGAAGGGC AGAGAATACT GACGGTTACC AGTATTAACC 
CTTCATCTGT TCTTGAAACT GGTTGGGGAA TGAGGTGATA AGCAAGGAGG GTGTAAGTTT 
AGGGGACAAA GAAGAAAGAA TGAATAATAC GAGCAGACAT TCTCTGTAGA A 
CTGAGAATGA AAAGGTGTTT GATGGACATG T 

TTATTTCCAT TCACTATTAC TTAAGAG 
AGGACCCTTG CCCCCACCCC CATTTTTTTA ATGAAAAAAA AAAACAAAAA AAACGGATCC 
LATITAT TTCCTTCTCC ACTCTCTCCA TGCCCTGGAG AAAAAAAAGT 
:aaatat CTCTCATCTA CATGGTTGCT T 
ITCC TAAATGTCTA CAGTGGACGC C 



Seq ID NO: C134 
Nucleic Acid Ac 
Coding sequence



ATGTGGCCTC GCCTGGCCTT TTGTTGCTGG GGTCTGGCGC TCGTTTCGGG CTGGGCGACC 
TTTCAGCAGA TGTCCCCGTC GCGCAATTTC AGCTTCCGCC TCTTCCCCGA GACCGCGCCC 
GGGGCCCCCG GGAGTATCCC CGCGCCGCCC GCTCCTGGCG ACGAAGCGGC GGGGAGCAGA 
GTGGAGCGGC TGGGCCAGGC GTTCCGGCGA CGCGTGCGGC TGCTGCGGGA GCTCAGCGAG 
CGCCTGGAGC TTGTCTTCCT GGTGGATGAT TCGTCCAGCG TGGGCGAAGT CAACTTCCGC 
AG< 'i TCA TGTTCGTCCG CAAGCTGCTG TCCGACTTCC CCGTGGTGCC CACGGCCACG 
CGCGTGGCCA TCGTGACCTT CTCGTCCAAG AACTACGTGG TGCCGCGCGT CGATTACATC
TCCACCCGCC GCGCGCGCCA GCACAAGTGC GCGCTGCTCC TCCAAGAGAT CCCTGCCATC 
TCCTACCGAG GTGGCGGCAC CTACACCAAG GGCGCCTTCC AGCAAGCCGC GCAAATTCTT 
CTTCATGCTA GAGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 
GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGCGAGATT CAGGAGTGGA GATCTTCACI 
TTTGGCATAT GGCAAGGGAA CATTCGAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG 
GAGCACTGTT ACCTGCTACA CAGTTTTGAA GAATTTGAGG CTTTAGCTCG CCGGGCATTG 
CATGAAGATC TACCTTCTGG GAGTTTTATT CAAGATGATA TGGTCCACTG CTCATATCTT 
TGTGATGAAG GCAAGGACTG CTGTGACCGA ATGGGAAGCT GCAAATGTGG GACACACACA 
GGCCATTTTG AGTGCAT CTG TGAAAAGGGG TATTACGGGA AAGGTOTGCA GTATGAATGC 
ACAGCTTGCC CATCGGGGAC ATACAAACCT GAAGGCTCAC CAGGAGGAAT CAGCAGTTGC 
ATTCCATGTC CTGATGAAAA TCACACCTCT CCACCTGGAA GCACATCCCC TGAAGACTGT 
GTCTGCAGAG AGGGATACAG GGCATCTGGC CAGACCTGTG AACTTGTCCA CTGCCCTGCC 
CTGAAGCCTC CCGAAAATGG TTACTTTATC CAAAACACTT GCAACAACCA CTTCAATGCA 
GCCTGTGGGG TCCGATGTCA CCCTGGATTT GATCTTGTGG GAAGCAGCAT CATCTTATGT 
CTACCCAATG GTTTGTGGTC CGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT 
CTCCGCCAGC CGAAACATGG CCACATCAGC TGTTCTACAA GGGAAATGTT ATATAAGACA 
ACATGTTTGG TTGCCTGTGA TGAAGGGTAC AGACTAGAAG GCAGTGATAA GCTTACTTGT 
CAAGGAAACA GCCAGTGGGA TGGGCCAGAA CCCCGGTGTG TGGAGCGCCA CTGTTCCACC 
TTTCAGATGC CCAAAGATGT CATCATATCC CCCCACAACT GTGGCAAGCA GCCAGCCAAA 
TTTGGGACGA TCTGCTATGT AAGTTGCCGC CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 
ATGCTGAGAT GTACCACTTC TGGAAAATGG AATGTCGGAG TTCAGCCAGC TGTGTGTAAA
GACGTGGAGG CTCCTCAAAT CAACTGTCCT AAGGACATAG Bl 
CAAGATTCTG CCAATGTTAC CTGGCAGATT CCAACAGCTA AAGACAACTC T 
GTGTCAGTCC ACGTTCATCC AGCTTTCACC CCACCTTACC TTTTCCCAAT T 1 
GCTATCGTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT T 
AAGGTTATTG ATGCAGAACC ACCTGTCATA GACTGGTGCA GATCICCACC TCCCGTCCAG 
GTCTCGGAGA AGGTACATGC CGCAAGCTGG GATGAGCCTC AGTTCTCAGA CAACTCAGGG 
GCTGAATTGG TCATTACCAG AAGTCATACA CAAGGAC 
ATAGTACAGT ATACAGCCAC TGACCCCTCA GGCAATAACA G 
GTCATAAAAG GTTCTCCCTG TGAAATTCCA TTCACACCTG TAAATGGGGA TTTTATATGC 
ACTCCAGATA ATACTGGAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 
GAAGGGT CTA CTGACAAGTA TTATTGTGCT TATGAAGATG G 
ACCACTGAAT GGCCAGACTG TGCCAAAAAA CGTTTTGCAA A 1 
GAGATGTTCT ACAAAGCAGC TCGTTGTGAT GACACAGAT C TGATGAAGAA GTTTTCTGAA 
GCATTTGAGA CGACCCTGGG AAAAATGGTC CCATCATTTT GTAGTGATGC AGAGGACATT 






GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA T 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACGATGACT TCCTGGACAC TGTGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 

TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

ACAGCTAGTG TGCCATTACC CGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA 2820 

CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC 2880

CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940

ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGCG TATGTGTGTC 3000 

AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCCGGATC 3060 

GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120 

ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180

A GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 

3CTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 

2ATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 3360 

ATGCCCTGTC ACCCATGTCC TCGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 



TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 



CTAGTTGGGG A 

= CAGTCCTTGT T 

C TGCATTGTGA AACAGAAGTC 



TAAATTCATA CAGTTGTAAA 



CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 



TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 

AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4560 

GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 

GGTGCGTTAG TT CTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 

TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 

GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800 

CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 4860

TCTGATTGCC CACGCTTAGG AGGGTCAGTG COTCATCTGA GAACTGCATC TGAAGATTTA 4920

AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 4980 






TT CAGCCGATGA CTTCTATGCT 5100 

GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160 

TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG 5220

TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 5280 

ATATGTTCAT GTGTCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 

TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 

GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460

TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520

AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 

GTGACATATA GGTGTAATAA AGGATATACT CTGGCCGGTG ATAAAGAATC ATCCTGTCTT 5640 

GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 5700 

GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 

TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CACGGCTTCT 5820 

GGCAT CTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 

ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 

TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC CGACGGCAAG 6000 

TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 

CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 

GATGGTTACA GCCTAGCAGA CAATT CCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 
CCCCCAGAAG GTCAAGACAT GCCCCGTTGT AT AGCT CATT TCTGTG? 

\ ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG C 



GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCG 
3 CTATGCAAGT GGATCAAACT 



GACATTTTCC 

GTACTGTCAA GTATTCTTGT 
I TTTTCCTAAG AGGAAATTCT ACCACCCTCT G 
Z TGCCAGAATG TGTTCCAGTA GAATGTCCCC A 
3 ATGTGCAAGG CCTTGCCTAT CTCAGCACAG C 
GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 7440 
GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 
TTCTCTTACA CGGACCTACA CTATGGACAG ACCGTTACCT ACTCTTGCAA CCGAGGCTTT 7560 
CGGCTCGAAG GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 7620

CCATCTTGCA ATGCCATCCA CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 7680

GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 7740 

GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 7800

ATGCCAATAG ACTGTGGCCT CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 7860 

GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 7920



CTAGCAGGCT CTGACTTAAG 
CGCTGTGAAG 

GGCACTGAGA 
AGGAGTCGGG 



GTGCAGTATA GCTGTAAACC TGGACACATT 
GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 

TATGAGTGTG ACCCCGGATA TGTGCTGAAT 



CGGAAGGGCT GGACTATGGC 

TTCCCTAATG GTTTTTCCTT TATTCATGGG 
TATAAGCTCC ATGGAAATTC ATCAAGAAGG 

TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 



AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 9300
I ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 9480



AGCTTGATGG AACCTGGGAG 



CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA T 

CCTGGCTATG AACTAGAGGG GAACAGGGAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840

GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 9900

AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 9960

TACAGTCTTG AAGGGCCATC TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 10020 

GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATGCTCTG 10080

CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 10140

CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCCGACG AGACGTGGAC ACAGACAAGC 10200 

GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 10260

GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 10320 

GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 10380

AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 10440 

TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 10500 

CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 10560

TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 10620 

CCAAACCGAT GTCACTGTCT TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 10674 

Seq ID NO: C135 DNA Sequence 
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..390 



Nucleic Acid Accession #:



GGCAGCGACT GCGCCCCGTC CCGGCGCCGC GCTCGTCCGC AGAGGAG3CG GCCCGGCCCG 
GGCAGCTGCG GCTCGGGATC CGTCGAGGGG AGGCCGAGCT TGCCAAGCTG GCGCCCAGCG 
GGGTCATGGT GCCCGGCGCC CGCGGCGGCG GCGCACTGGC GCGGGCTGCC GGGCGGGGCC 
TCCTGGCTTT GCTGCTCGCG GTCTCCGCCC CGCTCCGGCT GCAGGCGC-AG GAGC1GGGTG 
ATGGCTGTGG ACACCTAGTG ACTTATCAGG ATAGTGGCAC AATGACATCT AAGAATTATC
CCGGGACCTA CCCCAATCAC ACTGTTTGCG AAAAGACAAT TACAGTACCA AAGGGGAAAA 
GACTGATTCT GAGGTTGGGA GATTTGGATA TCGAATCCCA GACCTGTGCT TCTGACTATC
TTCTCTTCAC CAGCTCTTCA GATCAATATG GTCCATACTG T 
AAGAACTCTX- GTTGAACACA AGTGAAGTAA CCGTCCGCTT T 
CTGGCCGGGG TTTTTTGCTG ACCTATGCGA GCAGCGACCA TCCAGATTTA fl 
TGGAACGAGC TAGCCATTAT TTGAAGACAG AATACAGCAA ATTCTGCCCA GCTGGTTGTA
GAGACGTAGC AGGAGACATT TCTGGGAATA TGGTAGATGG ATATAGAGAT ACCTCTTTAT 
TGTGCAAAGC TGCCATCCAT GCAGGAATAA TTGCTGATGA ACTAGGTGGC CAGATCAGTG 
TGCTTCAGCG CAAAGGGATC AGTCGATATG AAGGGATTCT GGCCAATGGT GTTCTTTCGA 
GGGATGGTTC CCTGTCAGAC AAGCGATTTC TGTTTACCTC CAATGGTTGC AGCAGATCCT 
TGAGTTTTGA ACCTGACGGG CAAATCAGAG CTTCTTCCTC ATGGCAGTCG GTCAATGAGA 
GTGGAGACCA AGTTCACTGG TCTCCTGGCC AAGCCCGACT TCAGGACCAA GGCCCATCAT 
GGGCTTCGGG CGACAGTAGC AACAACCACA AACCACGAGA GTGGCTGGAG ATCGATTTGG 
GGGAGAAAAA GAAAATAACA GGAATTAGGA CCACAGGATC TACACAGTCG AACTTCAACT 
TTTATGTTAA GAGTTTTGTG ATGAACTTCA AAAACAATAA TTCTAAGTGG AAGACCTATA 
AAGGAATTGT GAATAATGAA GAAAAGGTGT TTCAGGGTAA CTCTAACTTT CGGGACCCAG 
\ TTTCATCCCT CCCATCGTGG CCAGATATGT GCGGGTTGTC CCCCAGACAT 
G GATAGCCTTG AAGGTGGAGC TCATTGGTTG O 






AGACAATCAC 



ACTGTTGGAA GCAGATTAAA TATCCCTTTG CCAGACATCA GTCAGCTGAG TTTACCATCA 
A AGTTAGATCT C 



AGGTTGCCCC GGATGGATCT CAGAGATGAG GATCGGAACA CCATGTTCTT TCCCACCCTA 1860

ACAACAACAA AGGGCAGTAA ATTAAAGTAC TCTTTGTAAG GTACAGTTAC CGATTAATCT 1920 

AGAGATAAAA TATTTTCTTA AAAATATATT TCATTAAACA CCTATGCTGT CTCTATAAAA 1980

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2010 

Seq ID NO: C137 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1761 



C CCCGTCCCGG CGCCGCGCTC GTCCGCAGAG 60 

GAGGCGGCCC GGCCCGGGCA GCTGCGGCTC GGGATCCGTC GAGGGGAGGC CGAGCTTGCC 120 

AAGCTGGCGC CCAGCGGGGT CATGGTGCCC GGCGCCCGCG GCGGCGGCGC ACTGGCGCGG 180

GCTGCCGGGC GGGGCCTCCT GGCTTTGCTG CTCGCGGTCT CCGCCCCGCT CCGGCTGCAG 240 

GCGGAGGAGC TGGGTGATGG CTGTGGACAC CTAGTGACTT ATCAGGATAG TGGCACAATG 300 

C AATCACACTG TTTGCGAAAA GACAATTACA 360

3 GGAAAAGACT GATTCTGAGG TTGGGAGATT TGGATATCGA ATCCCAGACC 420 

TGTGGTTCTG ACTATCTTCT CTTCACCAGC TCTTCAGATC AATATGGTCC ATACTGTGGA 480 

G TTCCCAAAGA ACTCTTGTTG AACACAAGTG AAGTAACCGT CCGCTTTGAG 540 

G CCGGGGTTTT TTGCTGACCT ATGCGAGCAG CGACCATCCA 600 

GATTTAATAA CATGTTTGGA ACGAGCTAGC CATTATTTGA AGACAGAATA CAGCAAATTC 660 

TGCCCAGCTG GTTGTAGAGA CGTAGCAGGA GACATTTCTG GGAATATGGT AGATGGATAT 720 

AGAGATACCT CTTTATTGTG CAAAGCTGCC ATCCATGCAG GAATAATTGC TGATGAACTA 780 

GGTGGCCAGA TCAGTGTGCT TCAGCGCAAA GGGATCAGTC GATATGAAGG GATTCTGGCC 840 

AATGGTGTTC TTTCGAGGGA TGGTTCCCTG TCAGACAAGC GATTTCTGTT TACCTCCAAT 900 

GGTTGCAGCA GATCCTTGAG TTTTGAACCT GACGGGCAAA TCAGAGCTTC TTCCTCATGG 960 

CAGTCGGTCA ATGAGAGTGG AGACCAAGTT CACTGGTCTC CTGGCCAAGC CCGACTTCAG 1020 

GACCAAGGCC CATCATGGGC TTCGGGCGAC AGTAGCAACA ACCACAAACC ACGAGAGTGG 1080 

CTGGAGATCG ATTTGGGGGA GAAAAAGAAA ATAACAGGAA TTAGGACCAC AGGATCTACA 1140

CAGTCGAACT TCAACTTTTA TGTTAAGAGT TTTGTGATGA ACTTCAAAAA CAATAATTCT 1200 

AAGTGGAAGA CCTATAAAGG AATTGTGAAT AATGAAGAAA AGGTGTTTCA GGGTAACTCT 1260 

AACTTTCGGG ACCCAGTGCA AAACAATTTC ATCCCTCCCA TCGTGGCCAG ATATGTGCGG 1320

GTTGTCCCCC AGACATGGCA CCAGAGGATA GCCTTGAAGG TGGAGCTCAT TGGTTGCCAG 1380 

ATTACACAAG GTAATGATTC ATTGGTGTGG CGCAAGACAA GTCAAAGCAC CAGTGTTTCA 1440 

ACTAAGAAAG AAGATGAGAC AATCACAAGG CCCATCCCCT CGGAAGAAAC ATCCACAGGA 1500 

ATAAACATTA CAACGGTGGC TATTCCATTG GTGCTCCTTG TTGTCCTGGT GTTTGCTGGA 1560 

ATGGGGATCT TTGCAGCCTT TAGAAAGAAG AAGAAGAAAG GAAGTCCGTA TGGATCAGCA 1620

GAGGCTCAGA AAACAGACTG TTGGAAGCAG ATTAAATATC CCTTTGCCAG ACATCAGTCA 1680 

GCTGAGTTTA CCATCAGCTA TGATAATGAG AAGGAGATGA CACAAAAGTT AGATCTCATC 1740 

ACAAGTGATA TGGCAGGTTA A 1761



GGCAGGAAAG ATTTCTTGAC 



G AATTATCCCG 

GTTTGCGAAA AGACAATTAC AGTACCAAAG GGGAAAAGAC TGATTCTGAG 

CAAT AT GGAA TGCAGAAGGA GGAGGAGACA GAAGTGCTTT G 

CAGAGAGTGG ACATTCCTGT GCAGCTGTTG CCCAGC1 

GCTGATGCAA GAGGTCCATA CTGTGGAAGT ATGACTGTTC CCAAAGAACT CTTGTTGAAC 

ACAAGTGAAG TAACCGTCCG CTTTGAGAGT GGAT CCCACA TTTCTGGCCG GGGTTTTTTG 

CTGACCTATG CGAGCAGCGA CCATCCAGAT TTAATAACAT G 

A CAGAATACAG CAAATTCTGC CCAGCTC 

A GATACCTCTT TATTGTGCAA AGCTGCCATC 720 

CATGCAGGAA TAATTGCTGA TGAACTAGGT GGCCAGATCA GTGTGCTTCA GCGCAAAGGG 780 

ATCAGTCGAT ATGAAGGGAT TCTGGCCAAT GGTGTTCTTT CGAGGGATGG TTCCCTGTCA 840 

GACAAGCGAT TTCTGTTTAC CTCCAATGGT TGCAGCAGAT CCTTGAGTTT TGAACCTGAC 900 
GGGCAAATCA GAGCTTCTTC CTCATGGCAG TCGGTCAATG AGAGTGGAGA CCAAGTTCAC 960 






TGGTCTCCTG GCCAAGCCCG ACTTCAGGAC CAAGGCCCAT CATGGGCTTC GGGCGACAGT 1020 

AGCAACAACC ACAAACCACG AGAGTGGCTG GAGATCGATT TGGGGGAGAA AAAGAAAATA 1080 

ACAGGAATTA GGACCACAGG ATCTACACAG TCGAACTTCA ACTTTTATGT TAAGAGTTTT 1140

GTGATGAACT TCAAAAACAA TAATTCTAAG TGGAAGACCT ATAAAGGAAT TGTGAATAAT 1200

GAAGAAAAGG TGTTTCAGGG TAACTCTAAC TTTCGGGACC CAGTGCAAAA CAATTTCATC 1260

CCTCCCATCG TGGCCAGATA TGTGCGGGTT GTCCCCCAGA CATGGCACCA GAGGATAGCC 1320 

TTGAAGGTGG AGCTCATTGG TTGCCAGATT ACACAAGGTA ATGATTCATT GGTGTGGCGC 1380 

AAGACAAGTC AAAGCACCAG TGTTTCAACT AAGAAAGAAG ATGAGACAAT CACAAGGCCC 1440

ATCCCCTCGG AAGAAACATC CACAGATGCC ATGCCAGTGC AGATTGTCGG AGACCATACC 1500

CAGATGATCT CACAAAGGGA GAATCTGGGA CCTGATGAGG GCAAAATACC TTTTAAAGGC 1560 

ACAGCGGAAA GCATGGTTAG AGTAGTGTTT GCTGTTGTGG TTAATGACCT TGGCATGCTG 1620

E ACACACCTGA GGAGGACATT GATCACTACT GTTGGAAGCA GATTAAATAT 1680 
A GAAGGAGATG 




CGGCCCAAAG CTGTCAGCGC O 
CCAACGCATC CCGGGACGAG TGACAGCTAT TCTGCCCCCA GAGACTGCCT CACACCCCTC 
AACCAGACGG CCATGACTGC CCTTTTGTGA 



AGTGCCCCAG GAGCTATGAC AAGCAAAGGA ACATACTTGC CTGGAGATAG CCTTTGCGAT 
GACAAGCCTG TAACGAATAG T 



GTCGCTGCAT GCTTCTGTTG TTTTTCATAG GCTTGCTTCT GATCCTGCTC CTGCAGGTGG 
CGACAGGTAT CCTAGGAGCT GTTTTCAAAT CTAAGTCTGA TCGCATTGTG AATGAAACTC 
TCTATGAAAA CACAAAGCTT TTGAGCGCCA CAGGGGAAAG TGAAAAACAA TTCCAGGAAG 
CCATAATTGT GTTTCAAGAA GAGTTTAAAT GCTGCGGTTT GGTCAATGGA GCTGCTGATT 
GGGGAAATAA TTTTCAACAC TATCCTGAAT TATGTGCCTG TCTAGATAAG CAGAGACCAT 



CCAGAT CGGGAACAAA TGAATCTGTG 
GATGCATCAA CCTATCGTCA GTCAAACCCC TTTAAAATGT TGCTTTGGCT TTGTAAATTT 
AAATATGTAA GTGCTATATA AGT CAGGAGC AGCTGTCTTT TTAAAATGTC TCGGCTAGCT 
AGACCACAGA TATCTTCTAG ACATATTGAA CACATTTAAG ATTTGAGGGA TATAAGGGAA 
A ATGTGTATTT TTACTCAAAA TAAAAGTAAC TGTTTACGTT AAAAAAAAAA 



Seq ID NO: C140 DNA Sequence 

Nucleic Acid Accession #: NM_004617.2

Coding sequence: 232..840



CTTCAGGTCA GGGAGAATGT ATAAATGTCC ATTGCCATCG AGGTTCTGCT ATTTTTGAGA 
AGCTGAAGCA ACTCCAAGGA CACAGTTCAC AGAAATTTGG TTCTCAGCCC CAAAATACTG 
ATTGAATTGG AGACAATTAC AAGGACTCTC TGGCCAAAAA CCCITGAAGA GGCCCCGTGA 
AGGAGGCAGT GAGGAGCTTT TGATTGCTGA CCTGTGTCGT ACCACCCCAG AATGTGCACT 
GGGGGCTGTG CCAGATGCCT GGGGGGGACC CTCATTCCCC TTGCTTT1TT TGGCTTCCTG 



CAAGAGATCT G 

CTGGTGTTCT TGGGCCTGAA GAACAATGAC TGCTGTGGGT GCTGCGGCAA C 
GGGAAGCGAT TTGCGATGTT CACCTCCACG ATATTTGCTG TGGTTGGATT CTTGGGAGCT 
GGATACTCGT T 

AATAGTACAT GGGGCTACCC CTTCCACGAC GGGGATTATC T 
AACAAGTGCC GAGAGCCTCT CAATGTGGTT CCCTGGAATC TGACCCTCTT CTCCATCCTG 



ACCTCCGAGA TGAGCTGCTC AGACTCTACA 







CTTCTTTGTA TATAAAGATG TTTATATCTT 
, GCACATTTAA AATGAGAAAC TAAGACCAAT 
TTCTGTTTTT AAGAGGAAAA AGAATGATTG ATGTAT CCT A AGTATTGTTA TTTGTTGTCT 
TTTTTTGCTG CCTTGCTTGA GTTGCTTGTG ACTGATCTTT TGAGGCTGTC ATCATGGCTA 
GGGTTCTTTT ATGTATGTTA AAT TAAAACC TGAATTCAGA GGTAACGT 

Seq ID NO: C141 DNA Sequence 

Nucleic Acid Accession #: NM_002381.2
CACGGAGCCC 60






f TTGCAAGAGC 300 



AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GCGTACGGCC CCTGGAATTC 360 

ACCAAAGTGA AAACTTTTGT CTCCCGGATA ATCGACACTC TGGACATTGG GCCAGCCGAC 420 

ACGCGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 480

TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 540 



A TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GGCGGCTCGG GCCCAAGCAT C 



G GTCATTGAGA AACTTTCCTC TAGATTCCAG 

GAAACCTTCT G 



k ACACCCACGG A 
ATCTGTGTGA ATGACAGAAG TGGCTCTTAT CATTGTGAGT GCTATG7 

3 CTTTGGGTAC CCATGGGTGT 



ACATTTGTGT GAGTGATGGG GCCGCATCCT 
AATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT 

ATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1S20 
CTGCAGAATC AGCATGATTT TTCCAAGGAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 



2460 
2599 



C GGCTCCTCGT G 

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180 

G CGACCGCACA GCGACITCTG CCTGGGCTGC 240 

G CTTTGGCCCA TCCTTGGGG3 CGCTCTGAGC 300 

3 TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 

A CCACCCCCAT AGAGGAGACC GGOSGAGAGG GCTGCCCAGC TGTGGCGCTG 420 

C CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480 

C TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540 

3 GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGC- GAACCTTCCA 600 



Seq ID NO: C142 
Nucleic Acid 
Coding sequence



CCAGGAGGCA CGCTGGTTTT CCGGGGCCGC TCCATCGCGC CTTCCTCCTG CGCCTCGCTT 60 

CTCCGGTCCA GCCGCCATCT TCCTTTCCGC ACAGG3GCCG CCGAGCG3GG CCATGCAGCC 120 

AACGCTGCTT CTCAGCCTCC TGGGAGCCGT GGGGCTGGCG GCTGTCAATT CCATGCCAGT 180 

GGATAACAGG AACCACAATG AAGGAATGGT GACTCGCTGC ATCATTGAGG TCCTCTCAAA 240 
TGCCTTGTCG AAGTCCAGCG CTCCACCCAT CACCCCTGAG TGCCGCCAAG TCCTGAAGAC
GAGTAGAAAA GACGT CAAAG ACAAAGAGAC AACTGAAAAT GAAAACACAA AGTTTGAAGT 
AAGATTGTTA AGAGACCCAG CTGATGCCTC GGAAGCCCAC GAGTCCTCCA GCAGGGGAGA 
GGCAGGAGCC CCAGGGGAGG AGGACATCCA AGGCCCAACA AAGGCAGACA CAGAGAAATG 
GGCAGAGGGA GGCGGGCACA GCCGAGAGCG AGCGGATGAG CCCCAGTGGA GCCTCTATCC 
CTCCGACAGC CAAGTCTCTG AAGAAGTGAA GACACGCCAT TCTGAGAAGA GCCAGAGAGA 
GGATGAGGAG GAGGAGGAGG GAGAGAACTA TCAAAAAGGG GAGCGAGGGG AAGATAGCAG 
TGAAGAGAAA CACCTTGAAG AGCCAGGAGA GACACAAAAC GCTTTTCTCA ATGAAAGAAA 
GCAGGCTTCA GCTATAAAAA AAGAGGAGTT AGTGGCCAGA TCGGAAACAC ATGCTGCCGG 
GCATTCTCAG GAGAAGACAC ATAGCCGAGA GAAGAGTAGC CAGGAGAGTG GAGAGGAGGC 
AGGGAGCCAG GAGAATCACC CCCAGGAGTC TAAAGGCCAA CCCCGAAGCC AGGAAGAATC 
TGAGGAAGGT GAGGAAGAT G CCACCTCTGA GGTGGACAAA CGACGCACGA GGCCCAGACA 
CCACCACGGG AGGAGCAGGC CCGACAGGTC CTCTCAAGGA GGGAGTCTTC CCTCTGAGGA 
AAAGGGACAC CCCCAGGAGG AATCTGAGGA GTCAAACGTC AGCATGGCCA GITTAGGGGA 
AAAGAGGGAC CACCATTCAA CCCACTACAG GGCTTCAGAG GAAGAACCTG AATATGGAGA 
AGAAATAAAG GGTTATCCAG GCGTCCAGGC CCCTGAGGAC CTGGAGTGGG A 






GGAGGACAAG AGAAACTACC CCAGCTTAGA GCTTGATAAG ATGGCACATG GATATGGTGA 
AGAAAGTGAG GAAGAGAGGG GCCTTGAGCC GGGAAAGGGA CGCCATCACA GAGGCAGGGG 
AGGGGAGCCA CGTGCCTATT TCATGTCTGA CACCAGAGAA GAGAAAAGGT TCTTGGGTGA 
AGGACACCAC CGTGTCCAAG AAAAC CAGAT GGACAAGGCA AGGAGGCATC CACAAGGTGC 
LTCTCAA CTACGGTGAG GAAGGAGCCC C 
= AGGACACTAA A 



G CCAGGGTCCC CAAGCTGGAC CTGAAAAGGC AATATGACAG 1920 

GGTGGCCCAA CTGGACCAGC TCCTTCACTA CAGGAAGAAG TCAGCTGAGT TTCCAGACTT 1980 

CTATGATTCT GAGGAGCCGG TGAGCACCCA CCAGGAGGCA GAAAATGAAA AGGACAGGGC 2040

TGACCAGACA GTCCTGACAG AGGACGAGAA AAAAGAACTC GAAAACTTGG CTGCAATGGA 2100 

TTTGGAACTA CAGAAGATAG CTGAGAAATT CAGCCAAAGG GGCTGACTGT CATTGGAGCG 2160 

A GCCATCACAT GATCTGTTTT TCACCACTTC ACTGAAAGAC 2220 

A TACCCAAGGG CAGAAAGTAG AACTTACTAT TCATTAAATG TTTGACACAA 2280 

TTGGAATTGT CTTTAATTTG TGTCAGAATG CTATTGAAAA TGTGAATTGC ATGACTTGTA 2340 

GCATATTCTT TTCTGCAAAA TAGACATATT AACATGCTTA TGACAATGAC TGTGCTACTG 2400

TCTTTGGAAA AATGTTTGTC TCAGTTGGAA ATAATAAAAG ATTCACCTGA GACC 2454 

Seq ID NO: C144 DNA Sequence 

Nucleic Acid Accession #: XM_093052.1

Coding sequence: 93..1988



CTTCTTGTGG TAGGGACCTC TCCTCAGTAT TTGAAACTAA CCAGCATCTG ACAGATTTCG 
AATTTGTAAA AAATACCCTC GAAGATTCAG GAATGAAGCT TCTGTGTGAA GGATTAAAAC 
A< CCAACTG TGTATTACAG ACATTGAGGT GGTACCGGTG CCTTATCTCT TCTGCTTCTT 
GTGGGGCTCT AGCAGCTGTT CTTAGCACCA GTCAGTGGCT CACTGAACTG GAATTTAGTG 
AGACAAAACT GGAAGCTTCA GCTTTGAAAT TGCTCTATGG AGGCTTAAAA GATCCAAATT 



TTGGAATGGT TGGAAATTGT TCTGGTTTCT CGGGATCATT GGTGCAATCT CATTTTGGCT 
ACTGTCAGGA CAGTTCTTTC AAATGTG AT C TTTGTAAGCT GCTCTGGCCT TCCACCAGAG 
TTGCTGCTGC AAAGGATTGT GGGAGTCCTA AGTCCTTCCT ATCAGAAGGG CTGAACTGGG 
CAGGAAGACT TGAGGCAGTG GAGGAGGTTT TGGGGTTGGG GGTGCTTGTA CAGCCCGGTG 
ACCCAGCATC TCAGGGTGGG GGGCATTGTG AAAACTATGG GTCTTTTAGA GACTTGGTGG 
ACTTAGAAGT CAAGGCAGAA CCAAGCCTGA GAAAAGGTGG TATGGATCTC CAGAGACCCA 
CCCTACAAGT TGTCCTCCTT TGCAAAATCT TCTCCCTCAA ACTATTTCTC TTTATTGCAT 
TGCCTAATTC TCCTGGTCAG GTTAGTGTGG TGCAAGTGAC CATCCCAGAC GGTTTCGTGA 
ACGTGACTGT TGGATCTAAT GTCACTCTCA TCTGCATCTA CACCACCACT GTGGCCTCCC 




GCCAAAACCA AGGCATCCTC AACGTCAGTG T 
GCGTTCAAGG AAGACCAGAA ACTGGCCACA CTATTTCCCT T 

C CCCTGTGTAC TACTGGCATA AACTTGAGGG AAGAGACATC GTGCCAGTGA 
T CAACCCAACC ACCGGGATTT TGGTCATTGG AAATCTGACA AATTTTGAAC 
AAGGTTATTA CCAGTGTACT GCCATCAACA GACTTGGCAA TAGTTCCTGC GAAATCGATC 
TCACTTCTTC ACATCCAGAA GTTGGAATCA TTGTTGGGGC CTTGATTGGT AGCCTGGTAG 
GTGCCGCCAT CATCATCTCT GTTGTGTGCT TCGCAAGGAA TAAGGCAAAA GCAAAGGCAA 
AAGAAAGAAA TTCTAAGACC ATCGCGGAAC TTGAGCCAAT GACAAAGATA AACCCAAGGG 
GAGAAAGCGA AGCAATGCCA AGAGAAGACG CTACCCAACT AGAAGTAACT C 
CCATTCATGA GACTGGCCCT GATACCATCC AAGAACCAGA C 
AGGAGCCTGC CCCAGAGCCT GCCCCAGGAT CAGAGCCTAT GGCAGTGCCT GACCTTGACA 
TCGAGCTGGA GCTGGAGCCA GAAACGCAGT CGGAATTGGA GCCAGAGCCA GAGCCAGAGC 
CAGAGTCAGA GCCTGGGGTT GTAGTTGAGC CCTTAAGTGA AGATGAAAAG GGAGTGGTTA 
AGGCATAG 

Nucleic Acid A 






| | | | | |

ATGGTGTTCG CATTTTGGAA GGTCTTTCTG ATCCTAAGCT GCCTTGCAGG TCAGGTTAGT 
GTGGTGCAAG TGACCATCCC AGACGGTTTC GTGAACGTGA CTGTTGGATC TAATGTCACT 
CTCATCTGCA TCTACACCAC CACTGTGGCC TCCCGAGAAC AGCTTTCCAT CCAGTGGTCT 
TTCTTCCATA AGAAGGAGAT GGAGCCAATT TCTTCTCCTT GGGAG3AGGG GAAGTGGCCA 
GATGTTGAGG CTGTGAAGGG CACTCTTGAT GGACAGCAGG CTGAACTCCA GATTTACTTT 
TCTCAAGGTG GACAAGCTGT AGCCATCGGG CAATTTAAAG ATCGAATTAC AGGGTCCAAC 
GATCCAGGTA ATGCATCTAT CACTATCTCG CATATGCAGC CAGCAGACAG TGGAATTTAC
ATCTGCGATG TTAACAACCC CCCAGACTTT CTCGGCCAAA ACCAAGGCAT CCTCAACGTC 
AGTGTGTTAG TGAAACCTTC TAAGCCCCTT TGTAGCGTTC AAGGAAGACC AGAAACTGGC 
CACACTATTT CCCTTTCCTG TCTCTCTGCG CTTGGAACAC CTTCCCCTGT GTACTACTGG 
CATAAACTTG AGGGAAGAGA CATCGTGCCA GTGAAAGAAA ACTTCAACCC AACCACCGGG 
I GACAAATTTT GAACAAGGTT ATTACCAGTG TACTGCCATC 






CTCTGTTGTG 

TGCTTCGCAA G 



ATCCAAGAAC CAGACTATGA GCCAAAGCCT ACTCAGGAGC CTGCCCCAGA GCCTGCCCCA 
CAGTCGGAAT TGGAGCCAGA GCCAGAGCCA G 



Seq ID NO: C146 DNA Sequence 

Nucleic Acid Accession #: NM_003020.1

Coding sequence: 23..664



CCAGAGCATT GAAGGTGGAG CTCATGAAGG ACTTCAGCAT TTGGGTCCTT TTGGCAACAT 
CCCCAACATC GTGGCAGAGT TGACTGGAGA CAACATTCCT AAGGACTTTA GTGAGGATCA
GGGGTACCCA GACCCTCCAA ATCCCTGTCC TGTTGGAAAA ACAGATGATG GATGTCTAGA 
AAACACCCCT GACACTGCAG AGTTCAGTCG AGAGTTCCAG TTGCACCAGC ATCTCTTTGA 
TCCGGAACAT GACTATCCAG GCTTGGGCAA GTGGAACAAG AAACTCCTTT ACGAGAAGAT 
GAAGGGAGGA GAGAGACGAA AGCGGAGGAG TGTCAATCCA TATCTACAAG GACAGAGACT 
GGATAATGTT GTTGCAAAGA AGTCTGTCCC CCATTTTTCA GATGAGGATA AGGATCCAGA 
GTAAAGAGAA GATGCTAGAC GAAAACCCAC ATTACCTGTT AGGCCTCAGC ATGGCTTATG 
TGCACGTGTA AATGGAGTCC CTGTGAATGA CAGCATGTTT CTTACATAGA TAATTATGGA 
TACAAAGCAG CTGTATGTAG ATAGTGTATT GTCTTCACAC CGATGATTCT GCTTTTTGCT 
AAATTAGAAT AAGAGCTTTT TTGTTTCTTG GGTTTTTAAA ATGTGAATCT GCAATGATCA 
TAAAAATTAA AATGTGAATG TCAACAATAA AAAGCAAGAC TATGAAAGGC TCAGATTTCT 
TGCAGTTTAA AATGGTGTCT GAGGTTGTAC TATTTTGGCC AAGTCTGTAG AAAGCTGTCA 
TTTGATTTTG ATTATGTAGT TCATCCAGCC CTTGGGCATT GTTATACACC AGTAAAGAAG 
GCTGTACTCA AGAGGAGGAG CTGACACATT TCACTTGGCT GCGTCTTAAT AAACATGAAT 
GCAAGCATTG GC 



Seq ID NO: C147 DNA Sequence
Nucleic Acid Accession #: NM_o 
Coding sequence: 144.. 806 



GGGCTGGCCC TGGTGTGCCC CAGCTGGGAA ACATGGCTGT CATACATTCA CATCTGTGGA 240 
AAGGATTGCA AGAGAAGTTC TTGAAGGGAG AACCCAAAGT CCTTGGGGTT GTGCAGATTC 300 

ATGGAAGTAA CCCTATTTCC GTGTATf 
TTATTTCAGG ATCCTTGTCA ATTGCAGCAG G 
GTAGTCTAGG AATGAATATC ACCAGCTCTG TACTGGCTGC A 




GAGGCC ACCAAAAGAT C 

T CCAACTTCGA 
T GTGTGTGTAT 

A TAAATTCAAA ATTATGTTCT CATTTTTTTC CCTGGAACTC AATAACTCAT 
TTCACTGGCT CTTTATCGAG AGTACTAGAA GTTAAATTAA TAAATAATGC ATTTAATGAG 
C TTGAAAGTTT TTCATTCATC ATAAGAACTT TATATAAAGG CATTACATTG 
G TTTGGAAGCA GAAGAGCAAA AAAAAGATAT TGTTAAAATG AGGCCTCCAT 
C CCATTTATTT AACTTTTTTT TTCTCCTACC TATGGGGACC 
AAAGTGCTTT TTCCTTCAGG AAGTGGAGAT GCATGGCCAT CTCCCCCTCC CTTTTTCCTT 
CTCCTGCTTT TCTTTCCCCA TAGAAAGTAC CTTGAAGTAG CACAGTCCGT CCTTGCATGT 
GCACGAGCTA TCATTTGAGT AAAAGTATAC ATGGAGTAAA AATCATATTA AGCATCAGAT 
TCAACTTATA TTTTCTATTT CATCTTCTTC CTTTCCCTTC TCCCACCTTC TACTGGGCAT
AATTATATCT TAATCATATA TGGAAATGTG CAACATATGG TATTTGTTAA ATACGTTTGT
TTTTATTGCA GAGCAAAAAT AAATCAAATT AGAAGCAATA AAAAAAAAAA AAAAAAAAA 






Nucleic Acid Accession #:
Coding sequence: 56.. 502 



AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG 
CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 
AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 
CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT C 



A ACAAAACCCC 

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 
A CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 



TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780



Seq ID NO: C149 DNA Sequence
Nucleic Acid Acce 
Coding sequence: 



TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540 



GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 
CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 
CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 
TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 
TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 
GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 
CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 
ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 
CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 
CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 
AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 
ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 
TGACICTCCA A 
TTGAAAACAT GCTTCTTTGA G 
TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 
TCATGCTCCC T 



GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740 
TTCTCTGGC 1749

Seq ID NO: C150 DNA Sequence 



GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGIG GACTGCGGCT ACCCCCATGT 
CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 
TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 
CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 



GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC T 

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTG1GC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT G 







Seq ID NO: C152 DNA Sequence
Coding sequence: 148.. 1341 

1 11 21 31 41 51 

I I I I I I 

CGGCCCGCCC TGGGGAGGCG CGCAGCAGAG GCTCCGATTC GGGGCAGGTG AGAGGCTGAC 
TTTCTCTCGG TGCGTCCAGT GGAGCTCTGA GTTTCGAATC GGTGGCGGCG GATTCCCCGC 
GCGCCCGGCG TCGGGGCTTC CAGGAGGATG CGGAGCCCCA GCGCGGCGTG GCTGCTGGGG
GCCGCCATCC TGCTAGCAGC CTCTCTCTCC TGCAGTGGCA CCATCCAAGG AACCAATAGA 
TCCTCTAAAG GAAGAAGCCT TATTGGTAAG GTTGATGGCA CATCCCACGT CACTGGAAAA 
GGAGTTACAG TTGAAACAGT CTTTTCTGTG GATGAGTTTT CTGCATCTGT CCTCACTGGA 
AAACTGACCA CGGTCTTCCT TCCAATTGTC TACACAATTG TGTTTGTGGT GGGTTTGCCA 
AGTAACGGCA TGGCCCTGTG GGTCTTTCTT TTCCGAACTA AGAAGAAGCA CCCTGCTGTG 
ATTTACATGG CCAATCTGGC CTTGGCTGAC CTCCTCTCTG TCATCTGGTT CCCCTTGAAG 
ATTGCCTATC ACATACATGC CAACAACTGG ATTTATGGGG AAGCTCTTTG TAATGTGCTT 
ATTGGCTTTT TCTATGGCAA CATGTACTGT TCCATTCTCT TCATGACCTG CCTCAGTGTG 
CAGAGGTATT GGGTCATCGT GAACCCCATG GGGCACTCCA GGAAGAAGGC AAACATTGCC



GTGAAGCAGA CCATCTTCAT TCCTGCCCTG AACATCACGA CCTGTCATGA TGTTTTGCCT 
GAGCAGCTCT TGGTGGGAGA CATGTTCAAT TACTTCCTCT CTCTGGCCAT TGGGGTCTTT
ITGTTI _ < J rTCCTCAC AGCCTCTGCC TATGTGCTGA TGATCAGAAT GCTGCGATCT 
TCTGCCATGG ATGAAAACTC AGAGAAGAAA AGGAAGAGGG CCATCAAACT CATTGTCACT 
GTCCTGGCCA TGTACCTGAT CTGCTTCACT CCTAGTAACC TTCTGCTTGT GGTGCATTAT 



G GTCCTCAGAT GGGAATTGCA CAGTAGGATG 1380 

TGGAACCTGT TTAATGTTAT GAGGACGTGT CTGTTATTTC CTAATCAAAA AGGTCTCACC 1440 

ACATACCACC G 1451 

Seq ID NO: C153 DNA Sequence 



GATGAGAATA ATACTCGAAG CTTTGAGACA GGCTGAAAAT G

AGAAAATAAG CCCTATGCCT TGAATTCAGA AAAGAACTTT CCAATGGACA TGAGTGATGA 
TTAT GAGACA CAGCAGTGGC CAGAAAGAAA GCTTAAGCAC A* 
TGAAGAGAAT TCCAGGGATA ACCCCTTTAA ACGCACAAAT G 
TACTCCTCAA AGCCTTGCTA CATTGGAATC TGTCTTCCAA GAGCTGGGGA A 
ACCAAACAAC CAGAAACGTG AGAGGATGGA TGAGGAGCAA A, 
AGATGATATC TACAAGGCTA ATAACATTGC CTATGAAGAT G 
GAACCCAGTA GAGGAGAAAA TAGAGAGTCA AACCCAGGAA GAGGTGAGAG A 
GAATATAGGA AAAAATGAAC AAATCAACGA TGAGATGAAA CGCTCAGGGC AGCTTGGCAT 

AATTGCCTAT TTGAAAAGGT TAGTAAATGC TGCAGGAAGT GGGAGGTTAC A 
AAATGGGGAA AGGGCCACCA GGCTTTTTGA GAAACCTCTT GATTCTCAGT C 
GCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCCCCAGAA GACTTAATTG AGATGCTCAA 1200
AACTGGGGAG AAGCCGAATG GATCAGTGGA ACCGGAGCGG GAGCTTGACC TTCCTGTTGA 1260
C ATCTCAGAGG CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 1320






CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 1380 

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGC-AG AGTGCAGCAA ATCAGAAAAC 1440 

GTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500

TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1560 

ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680 

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740 

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800
TGTGGGGCCC CCGAAGAATG ATGATACCCC A 






T GTTAACACTG 

AATGATGTA CAGGCAGATG AAACCAGGTC ACTGGGGAG 
GTTATCTTG TGTATGGATA TGTGTAAATG TTATGACTC 

ATTTATTATG T 



CCTTTTAGGA GGAGGGAGGG 



CTTTTCTTCT CCTCAACATC TGAATCATGC CATGTGCCCA GAGGAGCTGG C 
TTTCCGTGGT GGCTCAGCTC CTTAACTTTG GGGCGCTTTG CTATGGGAGA CAGCCTCAGC 
CAGGCCCGGT TCGCTTCCCG GACAGGAGGC AAGAGCATTT TATCAAGGGC CTGCCAGAAT 
ACCACGTGGT GGGTCCAGTC CGAGTAGATG CCAGTGGGCA TTTTTTGTCA TATGGCTTGC 
ACTATCCCAT CACGAGCAGC AGGAGGAAGA GAGATTTGGA TGGCTCAGAG GACTGGGTGT 
ACTACAGAAT TTCTCACGAG GAGAAGGACC TGTTTTTTAA CTTGACGGTC AATCAAGGAT 
TTCTTTCCAA TAGCTACATC ATGGAGAAGA GATATGGGAA CCTCTCCCAT GTTAAGATGA 
TGGCTTCCTC TCCCCCCCTC TCCCATCTCA GTGGCACGGT TCTACAGCAG GGCACCAGAG 
TTGGGACGGC AGCCCTCAGT GCCTGCCATG GACTGACTGG ATTTTTCCAA CTACCACATG 
GAGACTTTTT CflTTGAACCC GTGAAGAAGC ATCCACTGGT TGAGGGAGGG TACCACCCGC 
ACATCCTTTA CAGGAGGCAG AAAGTTCCAG AAACCAAGGA GCCAACCTGT GGATTAAAGG 
ACAGTGTTAA CATCTCCCAG AAGCAAGAGC TATGGCGGGA GAAGTGGGAG AGGCACAACT 
TGCCAAGCAG AAGCCTCTCT CGGCGTTCCA TCAGCAAGGA GAGATGGGTG GAGACACTGG 
TGGTGGCCGA CACAAAGATG ATTGAATACC ATGGGAGTGA GAATGTGGAG TCCTACATCC 
TCACCATCAT GAACATGGTC ACTGGGTTGT TCCATAACCC AAGCATTGGC AATGCAATTC 
ACATTGTTGT GGTTCGGCTC ATTCTACTCG AAGAAGAAGA GCAAGGACTG AAAATAGTTC 
ACCATGCAGA AAAGACACTG TCTAGCTTCT GCAAGTGGCA GAAGAGTATC AATCCCAAGA 
GTCACCTCAA TCCTGTTCAT CACGACGTGG CTGTCCTTCT CACCAGAAAG GACATCTGTG 
CTGGTTTCAA TCGCCCCTGC GAGACCCTGG GCCTGTCTCA CCTTTCAGGA ATGTGTCAGC
CTCACCGCAG TTGTAACATC AATGAAGATT CGGGACTCCC TCTGGCTTTC ACAATTGCCC
ATGAGCTAGG ACACAGCTTC GGCATCCAGC ATGATGGGAA AGAAAATGAC TGTGAGCCTG
TGGGCAGACA TCCGTACATC ATGTCCCGCC AGCTCCAGTA CGATCCCACT CCGCTGACAT 
GGTCCAAGTG CAGCGAGGAG TACATCACCC GCTTCTTGGA CCGAGGCTGG GGGTTCTGTC 
TTGATGACAT ACCTAAAAAG AAAGGCTTGA AGTCCAAGGT CATTGCCCCC GGAGTGATCT 
ATGATGTTCA CCACCAGTGC CAGCTACAAT ATCGACCCAA TGCTACCTTC TGCCAGGAAG 
TAGAAAACGT CTGCCAGACA CTGTGGTGCT CCGTGAAGGG CTTTTGTCGC TCTAAGCTGG 
ACGCTGCTGC AGATGGAACT CAATGTGGTG AGAAGAAGTG GTGTATGGCA GGCAAGTGCA 
TCACAGTGGG GAAGAAACCA GAGAGCATTC CTGGAGGCTG GGGCCGCTGG T 
CCCACTGTTC CAGGACCTGT GGGGCTGGAG TCCAGAGCGC AGAGAGGCTC T 
CCGAGCCAAA GTTTGGAGGG AAATATTGCA CTGGAGAAAG A 
ACGTCCACCC CTGTCGCTCA GAGGCACCAA CATTTCGGCA GATGCAGTGC AGTGAATTTG 
ACACTGTTCC CTACAAGAAT GAACTCTACC ACTGGTTTCC CATTTTTAA 
CTTGTGAGCT CTACTGCCGA CCCATAGATG GCCAGTTTTC TGAGAAAAT 
TCATTGATGG TACCCCTTGC TTTGAAGGCG GCAACAGCAG AAATGTCTGT ATTAATGGCA 
TATGTAAGAT GGTTGGCTGT GACTATGAGA TCGATTCCAA TGCCACCGAG GATCGCTGCG 

AAGGATCTGG TTATGTTGAC ATTGGGCTCA T 
TGGAAATTGA GGGAGCTGGA AACTTCCTGG CCATCAGGAG TGAAGATCCT GAAAAATATT 



TTCAGTATGA CAGGAAAGGA GACCTGGAAA AGCTGATGGC CACAGGTCCC ACCAATGAGT 
CTGTGTGGAT CCAGCTTCTA TTCCAGGTGA CTAACCCTGG CATCAAC 
TCCAGAAAGA TGGCCTTGAC AATGATGTTG AGCAGATGTA CTTCTGGCAG T. 
GGACAGAGTG CAGTGTGACC TGCGGGACAG GTATCCGCCG CCAAACTGCC CATTGCATAA
AGAAGGGCCG CGGGATGGTG AAAGCTACAT TCTGTGACCC AGAAACACAG CCCAATGGGA 
A GTGCCATGAA AAGGCTTGTC CACCCAGGTG GTGGGCAGGG GAGTGGGAAG 
VTGCGGG CCCCACGGGG AGAAGAAGCG AACCGTGCTG TGCATCCAGA 
CCATGGTCTC TGACGAGCAG GCTCTCCCGC CCACAGACTG CCAGCACCTG CTGAAGCCCA 
AGACCCTCCT TTCCTGCAAC AGAGACATCC TGTGCCCCTC GGACTGGACA GTGGGCAACT 
GGAGTGAGTG TTCTGTTTCC TGTGGTGGTG GAGTGCGGAT TCGCAGTGTC ACATGTGCCA 
AGAACCATGA TGAACCTTGC GATGTGACAA GGAAACCCAA CAGCCGAGCT CTGTGTGGCC 
TCCAGCAATG CCCTTCTAGC CGGAGAGTTC TGAAACCAAA CAAAGGCACT ATTTCCAATG 
GAAAAAACCC ACCAACACTA AAGCCCGTCC CTCCACCTAC ATCCAGGCCC AGAA1GCTGA 
CCACACCCAC AGGGCCTGAG TCTATGAGCA CAAGCACTCC AGCAATCAGC AGCCCTAGTC 
CTACCACAGC CTCCAAAGAA GGAGACCTGG GTGGGAAACA GTGGCAAGAT AGCTCAACCC 
AACCTGAGCT GAGCTCTCGC TATCTCATTT CCACTGGAAG CACTTCCCAG CCCATCCTCA
CTTCCCAATC CTTGAGCATT CAGCCAAGTG AGGAAAATGT TTCCAGTTCA GATACTGGTC 
CTACCTCGGA GGGAGGCCTT GTAGCTACAA CAACAAGTGG TTCTGGCT1G TCATCTTCCC 
GCAACCCTAT CACTTGGCCT GTGACTCCAT TTTACAATAC CTTGACCAAA GGTCCAGAAA 
TGGAGATTCA CAGTGGCTCA GGGGAAGAAA GAGAACAGCC TGAGGACAAA GATGAAAGCA 



AAATGCCACT 
CACCCAGAGT 

G AGACCAC CA GCCAGAACCC 

TGATTACTGA GGGCTTTTTG CTAAATGCCT CCAATTACAA GCAGCTCACA AACGGCCACG 
GCTCTGCACA CTGGATCGTC GGAAACTGGA GCGAGTGCTC CACCACATGT GGCCTGGGGG 
CCTACTGGAA AAGGGTGGAG TGCACCACCC AGATGGATTC TGACTGTGCG GCCATCCAGA
GACCTGACCC TGCAAAAAGA TGCCACCTCC GTCCCTGTGC TGGCTGGAAA GTGGGAAACT 
GGAGCAAGTG CTCCAGAAAC TGCAGTGGGG GCTTCAAGAT ACGCGAGATT CAGTGCGTGG 
ACAGCCGGGA CCACCGGAAC CTGAGGCCAT TTCACTGCCA GTTCCTGGCC GGCATTCCTC 
CCCCATTGAG CATGAGCTGT AACCCGGAGC CCTGTGAGGC GTGGCAGGTG GAGCCTTGGA 



GCCTCTGTGA TTGGACAAAA AGACCCACAT CCACCATGTC TTGCAATGAG C

GTCACTGGGC CACTGGGAAC TGGGACCTGT GTTCCACTTC CTGTGGAGGT GGCTTTCAGA 4800 

AGAGGATTGT CCAATGTGTG CCCTCAGAGG GCAATAAAAC TGAAGACCAA GACCAATGTC 4860 

TATGTGATCA CAAACCCAGA CCTCCAGAAT TCAAAAAATG CAACCAGCAG GCCTGCAAGA 4920 

AAAGTGCCGA TTTACTTTGC ACTAAGGACA AACTGTCAGC CAGTTTCTGC CAGACACTGA 4980

AAGCCATGAA GAAATGTTCT GTGCCCACCG TGAGGGCTGA GTGCTGCTTC TCGTGTCCCC 5040

AGACACACAT CACACACACC CAAAGGCAAA GAAGGCAACG GTTGCTCCAA AAGTCAAAAG 5100

AACTCTAAGC CCAAA 5115

Seq ID NO: C155 DNA Sequence
Nucleic Acid Accession #: NM_001062.1
Coding sequence: 76.. 1380

1 11 21 31 41 51 

I I I I I I

gctctcatta ccttctgccc atcacttaat AAATAGCCAG CCAATTCATC AACATTCTGG 60

TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 120 

TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 180 

CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 240 

AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 300 

ATCCAACAAA TCAAATACAA TCTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 360

GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 420

TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 480

CACAATGGCA CTCCCCTCAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTGT 540 

CTGTTCAATG GGAACTACTC AACCGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 600

AACTATTATT TTGGTAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 660

ACCTGTGTGA AGAAGAGTCT AATAAATGGG CAGATCAAAG CAGATGAAGG CAGTTTAAAG 720 

A TTTATACAAA GTCACTGGTA G AAAAGATT C TGTCIGAGAA AAAAGAAAAT 780 

G G AAACA CAT T TAGCACAGGA GAAGCCATGC AGGCCCTCTT TGTATCATCA 840 

GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 900 

ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGCCC AGGTCTTACC TGCCCTGATG 960 

GGAAAGACCT TCTTGGATAT TAACAAAGAC TCTTCTTGCG TCTCTGCTTC AGGTAACTTC 1020 
AACATCTCCG CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC 



AATAATGACA GAACCTACTG GGAACTTCTG AGTGGAGGCG AACCACTGAG CCAAGGAGCT 
GGTAGTTACG TTGTCCGCAA TGGAGAAAAC TTGGAGGTTC GCTGGAGCAA ATACTAATAA 
GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 
TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA G 
TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 

Seq ID NO: C156 DNA Sequence 



I I 

GTGCTGTACC AAGAGTTTGC 

CGGCGAATCA GAAGCAGCAA GCAACTTTGA CTGCTGTCTT GGATACACAG A 
TCATCCTAAA TTTATTGTGG GCTTCACACG GCAGCTGGCC AATGAAGGCT GTGACATCAA 
TGCTATCATC TTTCACACAA AGAAAAAGTT GTCTGTGTGC GCAAATCCAA AACAGACTTG 
GGTGAAATAT ATTGTGCGTC TCCTCAGTAA AAAAGTCAAG AACATGTAAA AACTGTGGCT 
TTTCTGGAAT GGAATTGGAC ATAGCCCAAG AACAGAAAGA ACCTTGCTGG GGTTGGAGGT 
TTCACTTGCA CATCATGGAG GGTTTAGTGC TTATCTAATT TGTGCCTCAC TGGACTTGTC 
CAATTAATGA AGTTGATTCA TATTGCATCA TAGTTTGCTT TGTTTAAGCA TCACATTAAA 
GTTAAACTGT ATTTTATGTT ATTTATAGCT GTAGGTTTTC TGTGTTTAGC TATTTAATAC 
TAATTTTCCA TAAGCTATTT TGGTTTAGTG CAAAGTATAA AATTATATTT GGGGGGGAAT 
AAGATTATAT GGACTTTCTT GCAAGCAACA AGCTATTTTT TAAAAAAACT ATTTAACATT 
CTTTTGTTTA TATTGTTTTG TCTCCTAAAT TGTTGTAATT GCATTATAAA A 
CATTAATAAG ACAAATATT 
ACGCGCGAGC CCGGCGAGGC CCCGGCAGGC CCGTCCCTGC TCGGGGGCGC GCTGAGACGG 120

CGGGTGAGCT CCACGAGAGC GCCGTCGCCA CTTCGGGCCA ACTTTGCGAT TCCCGACAGT 180 

TAAGCAATGG GGAGACATTT GGCTTTGCTC CTGCTTCTGC TCCTTCTCTT CCAACATTTT 240

GGAGACAGTG ATGGCAGCCA ACGACTTGAA CAGACTCCTC TGCAGTTTAC ACACCTCGAG 300 

TACAACGTCA CCGTGCAGGA GAACTCTGCA GCTAAGACTT ATGTGGGGCA TCCTGTCAAG 360

ATGGGTGTTT ACATTACACA TCCAGCGTGG GAAGTAAGGT ACAAAATTGT TTCCGGAGAC 420 

AGTGAAAACC TGTTCAAAGC TGAAGAGTAC ATTCTCGGAG ACTTTTGCTT TCTAAGAATA 480

AGGACCAAAG GAGGAAATAC AGCTATTCTT AATAGAGAAG TGAAGGATCA CTACACATTG 540

ATAGTGAAAG CACTTGAAAA AAATACTAAT GTGGAGGCGC GAACAAAGGT CAGGGTGCAG 600

GTGCTGGATA CAAATGACTT GAGACCGTTA TTCTCACCCA CCTCATACAG CGTTTCTTTA 660

CCTGAAAACA CAGCTATAAG GACCAGTATC GCAAGAGTCA GCGCCACGGA TGCAGACATA 720

GGAACCAACG GGGAATTTTA CTACAGTTTT AAAGATCGAA CAGATATGTT TGCTATTCAC 780 

GAGATGGAAA TCCTCGCTGC GGACCGTGGC ATGAAGTTGT ATGGGAGCAG TGGCATCAGC 900 

AGCATGGCCA AGCTAACGGT GCACATCGAA CAGGCCAATG AATGTGCTCC GGTGATAACA 960

GATGACTGCG ATCAGGGTGC CAATGGTGAC ATAGCATCTT TAAGCATCGT GGCAGGTGAC 1080 

CTTCTCCAGC AGTTTAGAAC AGTGAGGTCC TTTCCAGGGA GTAAGGAGTA TAAAGTCAAA 1140 

GCCATCGGTG ACATTGATTG GGACAGTCAT CCTTTCGGCT ACAATCTCAC ACTACAGGCT 1200

AAAGATAAAG GAACTCCGCC CCAGTTCTCT TCTGTTAAAG TCAnCACGT GACTTCTCCA 1260

CAGTTCAAAG CCGGGCCAGT CAAGTTTGAA AAGGATGTTT ACAGAGCAGA AATAAGTGAA 1320

TTTGCTCCTC CCAACACACC TGTGGTCATG GTAAAGGCCA TTCCTGCTTA TTCCCATTTG 1380 

AGGTATGTTT TTAAAAGGAC ACCTGGAAAA GCTAAATTCA GTTTAAATTA CAACACTGGT 1440 

CTCATTTCTA TTTTAGAACC AGTTAAAAGA CAGCAGGCAG CCCATTTTGA ACTTGAAGTA 1500 

ACAACAAGTG ACAGAAAAGC GTCCACCAAG GTCTTGGTGA AAGTCTTAGG TGCAAATAGC 1560

AATCCCCCTG AATTTACCCA GACAGCGTAC AAAGCTGCTT TTGATGAGAA CGTGCCCATT 1620

GGTACTACTA TCATGAGCCT GAGTGCCGTA GACCCTGATG AGGGTGAGAA TGGGTACGTG 1680 

ACATACAGTA T03CAAATTT AAATCATGTG CCGTTTGCGA TTGACCATTT CACTGGTGCC 1740 

GTGAGTACGT CAGAAAACCT GGACTACGAA CTGATGCCTC GGGTTTATAC TCTGAGGATT 1800 

CGTGCATCAG ACTGGGGCTT GCCGTACCGC CGGGAAGTCG AAGTCCTTGC TACAATTACT 1860

CTCAATAACT TGAATGACAA CACACCTTTG TTTGAGAAAA TAAATTGTGA AGGGACAATT 1920

CCCAGAGATC TAGGCGTGGG AGAGCAAATA ACCACTGTTT CTGCTATTGA TGCAGATGAA 1980 

CTTCAGTTGG TACAGTATCA GATTGAAGCT GGAAATGAAC TGGATTTGTT TAGTTTAAAC 2040

CCCAACTCGG GGGTATTGTC ATTAAAGCGA TCGCTAATGG ATGGCTTAGG TGCAAAGGTG 2100

TCTTTCCACA G1CTGAGAAT CACAGCTACA GATGGAGAAA ATTTTGCCAC ACCATTATAT 2160

ATCAACATAA CAGTGGCTGC CAGTCACAAG CTGGTAAACT TGCAGTGTGA AGAGACTGGT 2220

GTTGCCAAAA TCCTGOCAGA GAAGCTCCTG CAGGCAAATA AATTACACAA CCAGGGAGAG 2280

GTGGAGGATA TTTTCTTCGA TTCTCACTCT GTCAATGCTC ACATACCGCA GTTTAGAAGC 2340

ACTCTTCCGA CIGGTATTCA GGTAAAGGAA AACCAGCCTG TGGGTTCCAG TGTAATTTTC 2400 

ATGAACTCCA CTGACCTTGA CACTGGCTTC AATGGAAAAC TGGTCTATGC TGTTTCTGGA 2460

GGAAATGAGG ATAGTTGCTT CATGATTGAT ATGGAAACAG GAATGCTGAA AATTTTATCT 2520 

CCTCTTGACC GTGAAACAAC AGACAAATAC ACCCTGAATA TTACCGTCTA TGACCTTGGG 2580 

ATACCCCAGA AGGCTGCGTG GCGTCTTCTA CATGTCGTGG TTGTCGATGC CAATGATAAT 2640 

CCACCCGAGT TTTTACAGGA GAGCTATTTT GTGGAAGTGA GTCAACACAA GGAGCTACAT 2700

AGTGAAATCA TCCAGGTTGA AGCCACAGAT AAAGACCTGG GGCCCAACGG ACACGTGACG 2760

TACTCAATTC TTACAGACAC AGACACATTT TCAATTGACA GCGTGACGGG TGTTGTTAAC 2820 

ATCGCACGCC CTCTCGATCC AGACCTCCAG CATGAGCACT CCTTAAAGAT TGAGGCCAGG 2880 

GACCAAGCCA GAGAAGAGCC TCAGCTGTTC TCCACTGTCG TTGTGAAAGT ATCACTAGAA 2940

GATGTTAATG ACAACCCACC TACATTTATT CCACCTAATT ATCGTGTGAA AGTCCGAGAG 3000

GATCTTCCAG AAGGAACCGT CATCATGTGG TTAGAAGCCC ACGATCCTGA TTTAGGTCAG 3060

TCTGGTCAGG TGAGATACAG CCTTCTGGAC CACGGAGAAG GAAACTTCGA TGTGGATAAA 3120 

CTCAGTGGAG CAGTTAGGAT CGTCCAGCAG TTGGACTTTG AGAAGAAGCA AGTGTATAAT 3180 

CTCACTGTGA GGGCCAAAGA CAAGGGAAAG CCAGTTTCTC TGTCTTCTAC TTGCTATGTT 3240 

GAAGTTGAGG TGGTTGATGT GAATGAGAAC CTGCACCCAC CCGTGTTTTC CAGCTTTGTG 3300 

GAAAAGGGGA CAGTGAAAGA AGATGCACCT GTTGGTTCAT TGGTAATGAC GGTGTCGGCT 3360

CATGATGAGG ACGCCGGAAG AGATGGGGAG ATCCGATACT CCATTAGAGA TGGCTCTGGC 3420

GTTGGTGTTT TCAAAATAGG TGAAGAGACA GGTGTCATAG AGACGTCAGA TCGACTGGAC 3480 

CGTGAATCGA CCTCCCATTA TTGGCTAACA GTCTTTGCAA CCGATCAGGG TGTCGTGCCT 3540 

CTTTCATCGT TCATAGAGAT CTACATAGAG GTTGAGGATG TCAATGACAA TGCACCACAG 3600 

ACATCAGAGC CTGTTTATTA CCCAGAAATC ATGGAAAATT CTCCTAAAGA TGTATCTGTG 3660

GTCCAGATCG AGGCATTTGA TCCAGATTCG AGCTCTAATG ACAAGCTCAT GTACAAAATT 3720 

ACAAGTGGAA ATCCACAAGG ATTCTTTTCA ATACATCCTA AAACAGGTCT CATCACAACT 3780 

ACGTCAAGGA AGCTAGACCG AGAACAGCAA GATGAACACA TATTAGAGGT TACTGTGACA 3840 

GACAATGGTA GTCCCCCCAA ATCAACCATT GCAAGAGTCA TTGTGAAAAT CCTTGATGAA 3900 

AATGACAACA AACCTCAGTT TCTGCAAAAG TTCTACAAAA TCAGACTCCC TGAGCGGGAA 3960

AAGCCAGACC GAGAAAGAAA TGCCAGACGG GAGCCGCTCT ATCGCGTCAT AGCCACCGAC 4020

AAGGATGAGG GCCCCAATGC AGAAATCTCC TACAGCATCG AAGACGGGAA TGAGCATGGC 4080

AAATTTTTCA TCGAACCGAA AACTGGAGTG GTTTCGTCCA AGAGGTTTTC AGCAGCTGGA 4140 

GAATATGATA TTCTTTCAAT TAAGGCAGTT GACAATGGTC GCCCTCAAAA GTCATCAACC 4200

ACCAGACTCC ATATTGAATG GATCTCCAAG CCCAAACAGT CCCTGGAGCC CATTTCATTT 4260

GAAGAATCAT TTTTTACCTT TACTGTGATG GAAAGTGACC CCGTTGCTCA CATGATTGGA 4320

GTAATATCTG TGGAGCCTCC TGGCATACCC CTTTGGTTTG ACATCACTGG TGGCAACTAC 4380

GACAGTCACT TCGATGTGGA CAAGGGAACT GGAACCATCA TTGTTGCCAA ACCTCTTGAT 4440

GCAGAACAGA AGTCAAACTA CAACCTCACA GTCGAGGCTA CAGATGGAAC CACCACTATC 4500
CTCACTCAGG TATTCATCAA AGTAATAGAC ACAAATGACC ATCGTCCTC=i GTTTTCTACA 4560

AGTGCTGTGG ATCAGGATGA GAAAAACAAA CTAATCTACA CTCTGCAGAG CAGTAGAGAT 4680
CCACTGAGTC TCAAGAAATT TCGTCTTGAT CCTGCAACCG GCTCTCTCTA TACTTCTGAG 4740
AAACTGGATC ATGAAGCTGT TTCACCAGCA CACCTCACGG TCATGGTACG AGATCAAGAT 4800

GTGCCTGTAA AACGCAACTT TGCAAGGATT GTGGTCAATG TCAGCGACAC GAATGACCAC 4860
GCCCCGTGGT TCACCGCTTC CTCCTACAAA GGGCGGGTTT ATGAATCGGC AGCCGTTGGC 4920
TCAGTTGTGT TGCAGGTGAC GGCTCTGGAC AAGGACAAAG GGAAAAATGC TGAAGTGCTG 4980
TACTCGATCG AGTCAGGAAA TATTGGAAAT ATTGGAAATT CTTTTATGAT TGATCCTGTC 5040 
TTGGGCTCTA TTAAAACTGC CAAAGAATTA GATCGAAGTA ACCAAGCGGA GTATGATTTA 5100 
ATGGTAAAAG CTACAGATAA GGGCAGTCCA CCAATGAGTG AAATAACTTC TGTGCGTATC 5160

TTTGTCACAA TTGCTGACAA CGCCTCTCCG AAGTTTACAT CAAAAGAATA TTCTGTTGAA 5220 

CTTAGTGAAA CTGTCAGCAT TGGGAGTTTC GTTGGGATGG TTACAGCCCA TAGTCAATCA 5280

TCAGTGGTGT ATGAAATAAA AGATGGAAAT ACAGGTGATG CTTTTGATAT TAATCCACAT 5340 

TCTGGAACTA TCATCACTCA GAAAGCCCTG GACTTTGAAA CTTTGCCCAT TTACACATTG 5400 

ATAATACAAG GAACTAACAT GGCTGGTTTG TCCACTAATA CAACGGTTCT AGTTCACTTG 5460

CAGGATGAGA ATGACAACGC GCCAGTTTTT ATGCAGGCAG AATATACAGG ACTCATTAGT 5520 

GAATCAGCCT CAATTAACAG CGTGGTCCTA ACAGACAGGA ATGTCCCACT GGTGATTCGA 5580 

GCAGCTGATG CTGATAAAGA CTCAAATGCT TTGCTTGTAT ATCACATTGT TGAACCATCT 5640 

GTACACACAT ATTTTGCTAT TGATTCTAGC ACTGGTGCTA 1TCA1ACAG1 ACTAAGTCTG 5700 

GACTATGAAG AAACAAGTAT TTTTCACTTT ACCGTCCAAG TGCATGAC^T GGGAACCCCA 5760

CGTTTATTTG CTGAGTATGC AGCGAATGTA ACAGTACATG TAATTGACAT TAATGACTGC 5820 

CCCCCTGTGT TTGCCAAGCC ATTATATGAA GCATCTCTTT TGTTACCAAC ATACAAAGGA 5880 

GTAAAAGTCA TCACAGTAAA TGCTACAGAT GCTGATTCAA GTGCATTCTC ACAGTTGATT 5940

TACTCCATCA CCGAAGGCAA CATCGGGGAG AAGTTTTCTA TGGACTACAA GACTGGTGCT 6000



CACCGAGGCC 



ATCCTCAACC CAGATCGCAG ATTTAAAATA AGCCGCACTT C 

GGCACGCCCT TCGATCGTGA GCAGCAGGAG GCGTTTGATG TGGTTGTAGA AGTGATAGAG 6360 
GAACATAAGC CTTCTGCAGT GGCCCACGTT GTCGTGAAGG T 
GATAATGCGC CGGTGTTTGT CAACCTTCCC TACTACGCCG TTGTTAAAGT G 
GTGGGCCATG TCATTCGCTA TGTCACTGCT GTAGACAGAG ACAGTGGCAG AAACGGGGAA 
GTGCATTACT ACCTCAAGGA ACATCATGAA CACTTTCAAA T 
TCACTGAAAA AGCAATTTGA GCTTGACACC TTAAATP 

GCAAAAGATG GAGGGAACCC GGCCTTTTCA GCGGAAGTTA TCGTTCCGAT C 

AATAAAGCCA TGCCTGTGTT TGAAAAACCT TTCTACAGTG CAGAGATTGC AGAGAGCATC 6780 

CAGGTGCACA GCCCTGTGGT CCACGTGCAG GCTAACAGCC CGGAAGGCCT GAAAGTGTTC 6840 

TACAGCATCA CAGACGGAGA CCCTTTCAGC CAGTTCACTA TTAACTTCAA TACTGGAGTT 6900 

ATCAATGTCA TAGCTCCTCT GGACTTTGAG GCCCACCCGG CATATAAGCT GAGCATACGC 6960 

GCAACTGACT CCTTGACGGG CGCTCATGCT GAAGTATTTG TGGACATCAT AGTAGACGAC 7020

ATCAATGATA ACCCTCCTGT GTTTGCTCAG CAGTCTTATG CGGTGACCCT GTCTGAGGCA 7080 

TCTGTAATTG GAACGTCTGT TGTTCAAGTT AGAGCCACCG ATTCTGATTC AGAACCAAAT 7140 

AGAGGAATCT CATACCAGAT GTTTGGGAAT CACAGCAAGA GTCATGATCA TTTTCATGTA 7200

GACAGCAGCA CTGGCCTCAT CTCACTACTC AGAACCCTGG ATTACGAGCA GTCCCGGCAG 7260 

CACACGATTT TTGTGAGGGC AGTTGATGGT GGTATGCCCA CGCTGAGCAG TGATGTGATT 7320

GTCACGGTGG ACGTTACCGA CCTCAATGGT AATCCACCAC TCTTTGAACA ACAGATTTAT 7380 

GAAGCCAGAA TTAGCGAGCA CGCCCCTCAT GGGCATTTCG TGACCTGTGT AAAAGCCTAT 7440 

CATCCACACA CTTCAGACAT AGACAAGTTG CAGTATTCCA TTCTGTCTGG CAATGATCAT 7500

AAACATTTTG TCATTGACAG TGCAACAGGG ATTATCACCC TCTCAAACCT GCACCGGCAC 7560

GCCCTGAAGC CATTTTACAG TCTTAACCTG TCAGTGTCTG ATGGAGTTTT TAGAAGTTCC 7620

ACCCAGGTTC ATGTAACTGT AATTGGAGGC AATTTGCACA GTCCTGCTTT CCTTCAGAAC 7680 

GAATATGAAG TGGAACTAGC TGAAAACGCT CCCCTACATA CCCTGGTGAT GGAGGTGAAA 7740 

ACTACGGATG GGGATTCTGG TATTTATGGT CACGTTACTT ACCATATTGT AAATGACTTT 7800 

GCCAAAGACA GATTTTACAT AAATGAGAGA GGACAGATAT TTACTTTGGA AAAACTTGAT 7860 

CGAGAAACCC CGGCGGAGAA AGTGATCTCA GTCCGTTTAA TGGCTAAGGA TGCTGGAGGA 7920 

T TCTGCACCGT GAATGTCATC CTTACAGATG ACAATGACAA TGCACCACAA 7980 

A AGTGAATATC GGGTCCAGTG CTGCTAAAGG GACTTCAGTC 8040 

GTAAAGTCTG CAAGTGATGC CGATGAGGGC TCCAATGCCG ACATCACCTA TGCCATTGAA 8100 

GCAGACTCTG AAAGTGTAAA AGAGAATTTG GAAATTAACA AACTGTCCGG CGTAATCACT 8160 

ACAAAGGAGA GCCTCATTGG CTTGGAAAAT GAATTCTTCA CTTTCTTTGT TAGAGCTGTG 8220 

GATAATGGGT CTCCATCAAA AGAATCTGTT GTTCTTGTCT ATGTTAAAAT CCTTCCACCG 8280 

GAAATGCAGC TTCCAAAATT TTCAGAACCT TTCTATACCT TTACAGTGTC AGAGGACGTG 8340 

CCTGTTGGAA CAGAGATAGA TCTCATCCGA GCAGAACATA GTGGGACTGT TCTTTACAGC 8400 
CTGGTCAAAG GGAATACTCC AGAAAGCAAT AGGGAT 
AGCGGGAGAC TGAAGTTGGA GAAGAGTCTT GATCATGAGA C 

C TCAAGATGAC CATGAGATGG TGGCTTCTGT A 




TTCATAACAG GAGGGGATCC 

AAGGTATATG TGAAGAAACC TCTAGACAGG GAAAAAAGGG 
ACGGCAACTG ATGGCACCTT CTCATCAAAA GCGATAGTTG 
AATGACAACA GTCCAGTTTG TGAAAAGACT TTATATTCAG 

3 CAGACATCOG 



A TCATCTTCTC 9480

A GTATTGTCGT CACGCTAGAA 9540 

G ATAACGCCCC CGAATTCTCT GCCGATCCTT ATGCCATCAC CGTGTTTGAA 9600 

AACACAGAGC CGGGAACGCT GCTGACAAGA GTGCAGGCCA CAGATGCCGA CGCAGGAITA 9660 

AATCGGAAGA TTTTATACTC ACTGATTGAC TCTGCTGATG GC-CAGITCTC CATTAACGAA 9720

TTATCTGGAA TTATTCAGTT AGAAAAACCT TTGGACAGAG AACTCCAGGC AGTATACACC 9780 

CTCTCTTTGA AAGCTGTGGA TCAAGGCTTG CCAAGGAGGC TGACTGCCAC TGGCACTGTG 9840 

ATTGTATCAG TTCTTGACAT AAATGACAAC CCCCCTGTGT TTGAGTACCG TGAATATGGT 9900 

GCCACCGTGT CTGAGGACAT TCTTGTTGGA ACTGAAGTTC TTCAAGTGTA TGCAGCAAGT 9960 

CGGGATATTG AAGCAAATGC AGAAATCACC TACTCAATAA TAAGTGGAAA TGAACATGGG 10020 

AAATTCAGCA TAGATTCTAA AACAGGGGCC GTATTTATCA TTGAGAATCT GGATTATGAG 10080 

AGCTCTCATG AGTATTACCT AACAGTAGAG GCCACTGATG GAGGCACGCC TTCACTGAGC 10140 
GACGTTGCCA CTGTGAACGT TAATGTAACA GATATCAACG ATAATACCCC TGTGTTCAGC 10200
CAAGACACCT ACACGACAGT CATCAGTGAA GATGCCGTTC TTGAGCAGTC TGTCATCACG 10260
GTTATGGCCG ATGATGCCGA TGGACCTTCC AACAGCCACA TCCACTACTC AATTATAGAT 10320
GGCAACCAAG GAAGCTCGTT CACAATTGAC CCCGTCAGGG GAGAAGTCAA AGTGACCAAA 10380
CTTCTCGACC GAGAAACGAT TTCAGGTTAC ACGCTCACGG TTCAAGCTTC TGATAATGGC 10440 
AGTCCACCCA GAGTCAACAC GACGACCGTG AACATCGATG TGTCCGATGT CAATGACAAC 10500 
GCGCCCGTCT TCTCCAGGGG AAACTACAGT GTCATTATCC AGGAAAATAA GCCAGTGGGC 10560 
TTCAGCGTGC TGCAGCTGGT AGTAACAGAT GAGGATTCTT CCCATAACGG TCCACCCITC 10620
iACTGG AAATGATGAG AAGGCTTTTG AAGTTAACCC GCAAGGAGTC 10680 
T CATCTGCCAT CAAGAGGAAG GAGAAAGATC ATTACTTACT GCAGGTGAAG 10740 
GTGGCAGATA ATGGAAAGCC TCAGTTGTCA TCTTTGACAT ACATTGACAT TAGGGTAATT 10800 
ft TCTATCCGCC TGCGATTTTG CCCCTGGAGA TTTTCATCAC CTCTTCTGGA 10860 
I CAGGTGGCGT CATTGGGAAG ATCCATGCCA CAGACCAGGA CGTGTATGAT 10920 
ACTCTAACCI ACAGTCTCGA CCCTCAGATG GACAACCTGT TCTCTGTTTC CAGCACAGGG 10980 
GGCAAGCTGA TAGCACACAA AAAGCTAGAC ATAGGGCAAT ACCTTCTCAA TGTCAGCGTA 11040 
ACAGATGGGA AGTT CACGAC GGTGGCCGAC 1 
GAGATGTTGA ACCACACCAT CGCGATCCGC 1 

GGTGACTACT GGCGCAACTT CCAGCGAGCT TTACGGAACA TCCTGGGTGT GAGGAGGAAC 
GACATACAGA TTGTTAGTTT GCAGTCCTCT GAACCTCACC CACATCTGGA CGTCTTACTT 
TTTGTAGAGA AACCAGGTAG TGCTCAGATC TCAACAAAAC AACTTCTGCA CAAGATTAAC 
A CTGACATTGA GGAAATCATT GGAGTTAGGA TACTGAATGT ATTCCAGAAA 
GGACTG CCCCTGGAAG TTCTGCGATG AAAAGGTGTC TGTGGATGAA 
AGTGTGATGT CAACACACAG CACAGCCAGA CTGAGTTTTG TGACTCCCCG CCACCACAGG 
I GTCTCTGCAA AGAGGGAAGG TGCCCACCTG TCCACCATGG CTGTGAAGAT 
C CTGAGGGATC CGAATGTGTG TCTGATCCCT GGGAGGAGAA ACACACCTGT 
GTCTGTCCCA GCGGCAGGTT TGGTCAGTGC CCAGGGAGTT CATCTATGAC ACTGACTGGA 
AACAGCTACG TGAAATACCG TCTGACGGAA AATGAAAACA AATTAGAGAT GAAACTGACC 
ATGAGGCTCA GAACATATTC CACGCATGCG GTTGTCATGT ATGCTCGAGG AACTGACTAT 
AGCATCTTGG AGATTCATCft TGGAAGGCTG CAGTACAAGT TTGACTGTGG AAGTGGCCCT 
GGAATTGTCT CTGTTCAGAG CATTCAGGTC AATGATGGGC AGTGGCACGC AGTGGCCCTG 
GAAGTGAATG GAAACTATGC TCGCTTGGTT CTAGACCAAG TTCATACTGC ATCGGGCACA 
GCCCCAGGGA CTCTGAAAAC CCTGAACCTG GATAACTATG TGTTTTTTGG TGGCCACATC 
CGTCAGCAGG GAACAAGGCA TGGAAGAAGT CCTCAAGTTG GTAATGGTTT CAGGGGTTGT 
ATGGACTCCA TTTATTTGAA TGGGCAGGAG CTCCCTTTAA ACAGCAAACC CAGAAGCTAT 
GCACA C'ATCG AAGAGTCGGT GGATGTATCT CCAGGCTGCT TCCTGACGGC CACGGAAGAC 
TGCGCCAGCA ACCCTTGCCA GAATGGAGGC GTTTGCAATC CGTCACCTGC TGGAGGTTAT 
TACTGCAAAT GCAGTGCCTT GTACATAGGG ACCCACTGTG AGATAAGCGT CAATCCGTGT 
TCCTCCAACC CATGCCTCTA TGGGGGCACG TGTGTTGTCG ACAACGGAGG CTTTGTTTGC 
CAGTGTAGAG GATTATATAC TGGTCAGAGG TGTCAGCTTA GTCCATACTG CAAAGATGAA 
CCCTGTAAGA ATGGCGGAAC ATGCTTTGAC ACTTTGGATG GCGCCGTTTG TCAGTGTGAT 
TCGGGTTTTA GGGGAGAAAG GTGTCAGAGT GATATCGACG AGTGCTCTGG AAACCCTTGC 
CTGCACGGGG CCCTCTGTGA GAACACGCAC GGCTCCTATC ACTGCAACTG CAGCCACGAG 
TACAGGGGAC CTCACTCCGA CCATCCTCCG CCCAACCAGT ATGTGTCCAC GCCGTGGAAC 
ATTGGGTTGG CGGAAGGAAT TGGAATCGTT GTGTTTGTTG CAGGGATATT TTTACTGGTG 
GTGGTGTTTG TTCTCTGCCG TAAGATGATT AGTCGGAAAA AGAAGCATCA GGCTGAACCT 
AAAGACAAGC ACCTGGGACC CCCTACGGCI TTCTTGCAAA GACCGTATTT TGATTCCAAG 
CTAAATAAGA ACATTTACTC AGACATACCA CCCCAGGTGC CTGTCCGGCC TATTTCCTAC 
ACCCCGAGTA TTCCAAGTGA CTCAAGAAAC AATCTGGACC GAAATTCCTT ( 
GCTATCCCAG AGCATCCCGA ATTCAGCACT TTTAACCCCG ; 

AAAGCAGTGG CGGTCTGCAG CGTGGCGCCA AACCTGCCTC CCCCACCCCC TTCAAACTCC 
ft GCGACTCCAT CCAGAAGCCT AGCTGGGACT TTGACTATGA CACAAAAGTG 



12480 
12540 

12660 



13020 



C TGAAGTGCAG TCCCTGAGCT CCTTCCAGTC CGAATCGTGC 
GATGACAATG GGT AT CACT G GGATACATCA GATTGGATGC CAAGCGTTCC TCTGCCGGAC 
A TGAGGTGATT GATGAGCAGA CACCCCTGTA CTCAGCAGAT 
A CTATTACCCT GGAGGCTACG ACATCGAAAG TGATTTTCCT 
G AAGACTTCCC CGCAGCTGAT GAGCTACCAC CGTTACCGCC CGAATTCAGC 
\ CCCTCCTAGA GACATGCCTG CCGCGGGTAG CTTGGGTTCT 
G GTTCAACTTG AATCAGTATT TGCCCAATTT TTATCCCCTC 
GATATGTCTG AACCTCAAAC AAAAGGCACT GGTGAGAATA GTACTTGTAG AGAACCCCAT 
GCCCCTTACC CGCCAGGGTA TCAAAGACAC TTCGAGGCGC CCGCTGTCGA GAGCATGCCC 

TCCGAGGTCA TGATGAGTGA CTATGAGAGC GGGGACGACG GCCACTTCGA AGAGGTGACG 
ATCCCGCCCC TGGATTCCCA GCAGCACACG GAAGTCTGAC TCTCAACTCC CCCCAAAGTG 
CCTGACTTTA GTGAACCTAG AGGTGATGTG AGTAATCCGC GCTGTTCTTT GCAGCAGTGC 
TTCCAAGCTT TTTTTGGTGA GCCGAATGGG CATGGCTGCG CTGGATCCTG CGCCICTGGA 
CGTGCTAGCC ATTTCCAGTG TCCCAACTAC TGTCATCGTG AGC-TTTTCAT CGGCTGTGCC 



AGTCCTGTGT TGTCAGCATG ATTCATGTAT TTATATAGAT TTGATTATTT TAATTTTCCT 
GTCTCTTTTT TTTGTAAATT TTATGTACAG ATTTGATTTT TCATAGTTTT AACTAGATTT 
CCAAGATATT TTGTGCATTT GTTTCAACTG AATTTTGGTG GTGTCAGTGC CATTATCTAG 
CACCCTGATT TTTTTTTTTT TACTATAACC AGGGTTTCAT TCTGTCTTTT TCCACTGAAG 
TGTGACATTT TGTTAGTACA TTTCAGTGTA GTCATTCATT TCTAGCTGTA CATAGGATGA 
: AGATACATGA ACATGTCTTA CATGGGTTGC TGTATTTAGA ATTATAAACA 
3AAAGTG TAACGGGGAC CTTCTGCATA CCTGTTTAGA ACCAAAACCA 
CCATGACACA GTTTTTATAG TGTCTGTATA TTTGTGATGC AATGGTCTTG TAAAGGTTTT 
TAATGAAAAC TACCATTAGC CAGTCTTTCT TACTGACAAT AAATTATTAA TAAAAT 

Seq ID NO: C161 DNA Sequence

Nucleic Acid Accession #: NM_014220.1

Coding sequence: 102.. 710 
GTGGTGTTTG CTTTCTCCAC CAGAAGGGCA CACTTTCATC TAATTTGGGG TATCACTGAG
CTGAAGACAA AGAGAAGGGG GAGAAAACCT AGCAGACCAC CATGTGCTAT GGGAAGIGTG 
CACGATGCAT CGGACATTCT CTGGTGGGGC TCGCCCTCCT GTGCATCGCG GCTAATATTT 
TGCTTTACTT TCCCAATGGG GAAACAAAGT ATGCCTCCGA AAACCACCTC AGCCGCTTCG 
TGTGGTTCTT TTCTGGCATC GTAGGAGGTG GCCTGCTGAT GCTCCTGCCA GCATTTGTCT 
r GACTGCTGTG GCTGCTGTGG CCATGAAAAC TGTGGCAAAC 
T GTATTGGCTG CTCTCATTGG AATTGCAGGA TCTGGCTACT 
GTGTCATTGT GGCAGCCCTT GGCTTAGCAG AAGGACCACT ATGTCTTGAT T 




TTACAGACTG AGTGACAGTA CTCAGTATAT CTGAGATAAA CTCTATAATG 



AGTTGCTTTT TATAAGACCA AGAAGGAGAA AATCCGACAA C
CACTGCTTGT ATGATGTTTC CCATTCATAC ACCTATAAAT CTCTAACAAG AGGCCCTTTG 
AACTGCCTTG TGTTCTGTGA GAAACAAATA TTTACTTAGA GTGGAAGGAC TGATTGAGAA 
C CAAATGAATG CATCACAACT TACAATGC1G CTCATTGTTG TGAGTACTAT 
TGGAAAG CCTTTTGTCC TCCAAAGATG AGTACTAGGG 
^ AAGGCTACGA TGACTGGGCA AGAAGAAAGA TGGGAAACTG 
AATAAAGCAG TTGATCAGCA TCATTGGAAC ATGGGGACGA GTGACGGCAG GAGGACCACG 
C CTCAAAACTA ACTTGTTTAC AACAAAATAA AGTATTCACT ACGAAAAAAA 



Seq ID NO: C162 DNA Sequence 

Nucleic Acid Accession #: NM_003759.1 

Coding sequence: 150..3257

i i 1 i 1 r I 1 i 1 

GTTCTTTGTG ACACATCACA CAGAATTGGA GTGCTGTCCT TCTGGAGAGT GGTGGAGAAC 
CAAGATACAG TTCAGAACCA AAGGAATAGA GAAGGGCTTT GATTTCTTTT TGGCTTTAGA 
11, ATTT SA( Jl TTA GCAGGAAAGA TGTCCACTGA AAATGTGGAA GGG AAG CCCA 
GTAACCTTGG GGAGAGAGGA AGAGCCCGGA GCTCCACTTT CCTCAGGGTT GTCCAGCCAA 
TGTTTAACCA CAGTATTTTC ACTTCTGCAG TCTCTCCTGC TGCAGAACGC ATCCGATTCA 
TCTTGGGAGA GGAGGATGAC AGCCCAGCTC CCCCTCAGCT CTTCACGGAA CTGGATGAGC 
TGCTGGCCGT GGATGGGCAG GAGATGGAGT GGAAGGAAAC AGCCAGGTGG ATCAAGTTTG 
AAGAAAAAGT GGAACAGGGT GGGGAAAGAT GGAGCAAGCC CCATGTGGCC ACATTGTCCC 
TTCATAGTTT ATTTGAGCTG AGGACATGTA TGGAGAAAGG ATCCATCATG CTTGATCGGG 
AGGCTTCTTC TCTCCCACAG TTGGTGGAGA TGATTGTTGA CCATCAGATT GAGACAGGCC 
TATTGAAACC TCAACTTAAC GATAAGGTGA CCTATACTTT GCTCCGGAAG CACCGGCATC 
AAACCAAGAA ATCCAACCTT CGGTCCCTGG CIGACATTGG GAAGACAGTC TCCAGTGCAA 
GTAGGATGTT TACCAACCCT GATAATGGTA GCCCAGCCAT GACCCATAGG AATCTGACTT 
CCTCCAGTCT GAATGACATT TCTGATAAAC CGGAGAAGGA CCAGCTGAAG AATAAGTTCA 
TGAAAAAATT GCCACGTGAT GCAGAAGCTT CCAACGTGCT TGTTGGGGAG GTTGACTTTT 



AGAGTCTTCC A 
AATTGCAGCG A 

CATTTTTTGC CAGTGATTTT TATGATGCTT TAAATATTCA AGCTCTTTCG GCAATTCTCT 



TCTTTATCTA TGATGCTTTC AAGAAGATGA TCAAGCTTGC AGATTACTAC C 



CTACTGACAT GTACCATAAT ACTACCTTTG ACTGGGCATT TTTGTCGAAG A 

CAAAATACGG AGGAAACCTT GTCGGGAACA ACTGTAATTT TGTTCCTGAT ATCACACTCA 2100 

TGTCTTTTAT CCTCTTCTTG GGAACCTACA CCTCTTCCAT GGCTCTGAAA AAATTCAAAA 2160 

CTAGTCCTTA TTTTCCAACC ACAGCAAGAA AACTGATCAG TGATTTTGCC ATTATCTTGT 2220 

CCATTCTCAT CTTTTGTGTA ATAGATGCCC TAGTAGGCGT GGACACCCCA AAACTAATTG 2280 

TGCCAAGTGA GTTCAAGCCA ACAAGTCCAA ACCGAGGTTG GTTCGTTCCA CCGTTTGGAG 2340 

AAAACCCCTG GTGGGTGTGC CTTGCTGCTG CTATCCCGGC TTTGTTGGTC ACTATACTGA 2400 

TTTTCATGGA CCAACAAATT ACAGCTGTGA TTGTAAACAG GAAAGAACAT AAACTCAAGA 2460 

AAGGAGCAGG GTATCACTTG GATCTCTTTT GGGTGGCCAT CCTCATGGTT ATATGCTCCC 2520 

TCATGGCTCT TCCGTGGTAT GTAGCTGCTA CGGTCATCTC CATTGCTCAC ATCGACAGTT 2580 

TGAAGATGGA GACAGAGACT TCTGCACCTG GAGAACAACC AAAGTTTCTA GGAGTGAGGG 2640 

AACAAAGAGT CACTGGAACC CTTGTGTTTA TTCTGACTGG TCTGTCAGTC TTTATGGCTC 2700 

CCATCTTGAA GTTTATACCC ATGCCTGTAC TCTATGGTGT GTTCCTGTAT ATGGGAGTAG 2760 

CATCCCTTAA TGGTGTGCAG TTCATGGATC GTCTGAAGCT GCTTCTGATG CCTCTGAAGC 2820 

ATCAGCCTGA CTTCATCTAC CTGCGTCATG TTCCTCTGCG CAGAGTCCAC CTGTTCACTT 2880 

TCCTGCAGGT GTTGTGTCTG GCCCTGCTTT GGATCCTCAA GTCAACGGTG GCTGCTATCA 2940 

TTTTTCCAGT AATGATCTTG GCACTTGTAG CTGTCAGAAA AGGCATGGAC TACCTCTTCT 3000 
CCCAGCATGA CCTCAGCTTC CTGGATGATG TCATTCCAGA AAAGGACAAG AAAAAGAAGG 3060 
AGGATGAGAA GAAAAAGAAA AAGAAGAAGG GAAGTCTGGA CAGTGACAAT GATGATTCTG 3120 
ACTGCCCATA CTCAGAAAAA GTTCCAAGTA TTAAAATTCC AATGGACATC ATGGAACAGC 3180 
AACCTTICCT AAGCGATAGC AAACCTTCTG ACAGAGAAAG ATCACCAACA TTCCTTGAAC 3240 

3300 
3360 



T TTGTTTTTGT TTGGCTGTTT GTTTATTTTT TAACTTTTAT 
5CCAAAT AATACAGCGC TCTCTCTGCT TCTCTCTTGC 
T AGTGCACCGT TCCTTAAAAA CAGCATCTGA GGAATCCCCC 
PGTCCTT TGATAACCAA ATTCTGTCAC TCAAGACACA 
GACACCCACA GACCCTGTCC TTTGCCTCTA TTAAGCAGAG GATGGAAGTA TTAAGGATTT 
TGTAACACCT TTTATGAAAA TGTTGAAGGA ACTTAAAACT TTAGCTTTGG AGCTGTGCTT 
ACTGGCTTGT CTTTGTCTGG TAGAACAAAC CTTGACCTCC AGACAGAGTC C 
TATAGAGCTC TCCAGGACTG GAAAAAGTGC TGCTATTTTA ACTTGCTCTT G 
A AAGAAGAAAA AACTGAAGGT A 



A TATTTTTCTT C 

GTTGTTTAAA TTATGGGGCC TAACCTGCCA CTTATTTTTT GTCAATTTTT A 

TCTACCTCTT ATTTGTAGCT GCCAGGCTTT CTGTAAAAAT TGTATTC 
TTTTACACAT ACATACACAC ACAAATACAC AATCTCTAGG GTAAGCCAGA AGGCAAGATC 
; ATTTTTCCCT TTCTTTAAAA G 



GTTCTGATAA TAGTAGCATC TAGGTACAGA TGCTGGTTGT ATTACCACGT CAATGTCCTA 4500 
TGCAGTATTG TTAGACATTT TCTCATTTTG AAATATTTGT GTGTTTGTGT ATGTGCTCTG 4560 
TGCCATGGCT GGTGTATATA TGTGCAATGT TAGAAGGCAA AAGAGTGATG GTAGGCAGAG 4620 
GGCAAAGTCA TTGAATCTCT TATGCCAGTT TTCATAAAAC CCAAACCACA TATGAAAAAA 4680 
TCCATTAAGG GTCCAAGAAG TCTGTCCATA TGAAAATGAG GGTAAATATA GTTTATTTCC 4740 
CAGGTATCAG TCATTATAAT TGATATAATA GCTCTAACAT GCAATATAAA ATTCATAGGA 4800 
GTATTAATAG CCCATTTACA CATCTATAAA ATGTAATGGG ATTGCAGAGC TGCAGAGTAC 4860 
AGTGTAACAG TACTCTCATG CAATTTTTTT CAGGATGCAA AGGCAATTAT TCTTTGTAAG 4920 
CGGGACATTT AGATATATTT GTGTACATAT TATATGTATG TATATTTCAA A 
TGAAAATTAG ACATTTATTA ACCAAATTTA ACGTGGTATT T 
TGATACATTA CATATTGTGA ATGTATACTA AAAAAACATT TTAAATGTTA A 
TTCAGATTCA TATAACCACA ACTGTGATAT ATCCTAACTA TAACCAGTTG TTGAGGGGTA 5160 
TACTAGAAGC AGAATGAAAC CACATTTTTT GGTTTGATAA TATGCACITA TTGACTCCCA 5220 
CTCATTGTTA TGTTAATTAA GTTATTATTC TGTCTCCTTG TAATTTTGAT TACAAAAATT 5280 
TTATTATCCT CACTTACCTC TTACTTTTAC AGTACCTGAT ACTCCTAAAA CTTTTAACTT 5340 
ATACAAATTA GTCAATAATG ACCCCAATTT TTTCATTAAA ATAATAGTGG TGAATTATAT 5400 
GTTATTGTGT TAAAACCTCA CTTGCCAAAT TCTGGCTTCA CATTTGTATT TAGGGCTATC 5460 
CTTAAAATGA TGAGTCTATA TTATCTAGCT TTCTATTACC CTAATATAAA CTGGTATAAG 5520 
AAGACTTTCC TTTTTTCTTT ATGCATGGAA GCATCAATAA ATTGTTTAAA AACCATGTAT 5580 
AGTAAATTCA GCTTAACCCG TGATCTTCTT AAGITAAAGG TACTTTTGTT TTATAAAAGC 5640 
TCTAGATAAA ACTTTCTTTT CTGATCATGA ATCAAGTATC TGTGGTTTCA TGCCCCTCTC 5700 
TATACCTTTC AAAGAACTCC TGAAGCAACT TAACTCATCA TTTCAGCCTC TGAGTAGAGG 5760 
TAAAACCTAT GTGTACTTCT GTTTATGATC CATATTGATA TTTATGACAT GAACACAGAA 5820 
TAGTACCTTA CATTTGCTAA ACAGACAGTT AATATCAAAT CCTTTCAATA TTCTGGGAAC 5880 
CCAGGGAAGT TTTTAAAAAT GTCATTACTT TCAAAGGAAC AGAAGTAGTT AACCAAACTA 5940 
ACAAGCAAAA CCTGAGGTTT ACCTAGTGAC ACCAAATTAT CGGTATTTTA ACTGAATTTA 6000 
CCCATTGACT AAGAATGAAC CGGATTTGGT GGTGGTTTTG TTTCTATGCA AACTGGACAC 6060 
AAATTACAAC AGTAAATTTT TTTATAAGTG CTTCTCCCTT CTCCATGATG TGACTTCCGG 6120 
AGATAAAGGA TTCAAAAGAT AAAGACAAAG TACGCTCAGA GTTGTTAACC AGAAAGTCCT 6180 
GGCTGTGGTT GCAGAAACAC TGTTGGAAGA AAAGAGATGA CTAAGTCAAG TGTCTGCCTT 6240 
ATCAAAAGAG CAAAAATGCC TCTGGTTTTG TGTTTGGGAG AAAAATATCT TGGACGCACT 6300 
GTTTTCCTTG ATAAAAGTCA TCTTCTCTAC TGTGTGAAAT GAATACTTGG AATTCTAATT 6360 
GTTTTGTGTG CCAGGGGCAG TAATGTCCCT GCCTCTTCTC CCAATCAAGG TTGAGGAGTG 6420 
GGGCTGGGGA GAGGACTTAA CTGACTTAAG AAGTAGGAAA ACAAAAACCT CTCTCCTCAG 6480 
CCTTCCACCT CCAAGAGAGG AGGAAAAACA GTTGTCTGCT GTCTGTAATT CAGTTTGCGT 6540 
GTATTTTATG CTCATGCACC AACCCATACA G 
A GAATGATTGT CTTCCGAGTT T 

3 TATTGACTGC TGACTATATT TTAAAAACAA AATGAAATAA T 
ATTACAGAGG TTGACATTGT TCAGGGATGG GACAAAGCCT TCTTCAATCC TTTTCATACT 
ACTTAATGAT TTTGGTGCAG GAACCTGAGA TTTTCTGATT TATATTTCAT GATATTTCAC 
?GAAGCC CAGTGGCACC AAA1GGCTGG G 
T GTAGCACCTC ACTATCTGAA AGGCCATGAG TTTTCAGATG A 
GCTTCATTGC AGCCTGAAAT TTTAAAAAAG TTGTGTAATA CGCCAACCAG TCAAGTTGTG 
T ATGTCCAATT TCCTGGCTCA TTTCATTGTG CTCTATGGGT 
TTTCCTA GGCAAACATT GCAACTCAGG GCTAAAGTCA 
TCCAGTGAAA CTTTTAGAGC CAGAAGTAAC TTTGTCCCAG TCCTACAATG TGAAAAGAGT 
3 CCATTTTCAT GGCTGGTACA TATTCGTACG CATTACTTTT 
T ACGCACTTTC AGATATTCTT ATTTTTATTC TCTTAAG1CT TTATTAACTT 
TGGAGAGAGA AATGATGCAT CTTTTTATTT TAAATGAAGT AGATCAACAT GGTGGAACAA 
2 ATTTCAATAT ATTACTAATA ACTTTTTCCA ATATAAATCC 
TTACAGT TTTATGAAGC TTTCTATTGT GACTTTTATG 
GAATTAAGAG ATGAAGAAGA TGAGATATTT TAGCATTTAT ATTTTTCAAA ATTATATGIA 
TACTTAAAAA TAAAGTAACT TTATGC 
CGGCACAGCC TCACACCTGA ACGCTGTCCT CCCGCAGACG AGACCGGCGG GCACTGCAAA 
GCTGGGACTC GTCTTTGAAG GAAAAAAAAT AGCGAGTAAG AAATCCAGCA CCATTCTTCA 
CTGACCCATC CCGCTGCACC TCTTGTTTCC CAAGTTTTTG AAAGCTGC-CA ACTCTGACCT 
CGGTGTCCAA AAATCGACAG CCACTGAGAC CGGCTTTGAG AAGCCGAAGA TTTGGCAGTT 
TCCAGACTGA GCAGGACAAG GTGAAAGCAG GTTGGAGGCG GGTCCAGGAC ATCTGAGGGC 
TGACCCTGGG GGCTCGTGAG GCTGCCACCG CTGCTGCCGC TACAGACCCA GCCTTGCACT 
CCAAGGCTGC GCACCGCCAG CCACTATCAT GTCCACTCCC GGGGTCAATT CGTCCGCCTC 
CTTGAGCCCC GACCGGCTGA ACAGCCCAGT GACCATCCCG GCGGTGATGT TCATCTTCGG 
GGTGGTGGGC AACCTGGTGG CCATCGTGGT GCTGTGCAAG TCGCGCAAGG AGCAGAAGGA 
GACGACCTTC TACACGCTGG TATGTGGGCT GGCTGTCACC GACCTGTTGG GCACTTTGTT 
GGTGAGCCCG GTGACCATCG CCACGTACAT GAAGGGCCAA TGGCCCGGGG GCCAGCCGCT 
GTGCGAGTAC AGCACCTTCA TTCTGCTCTT CTTCAGCCTG TCCGGCCTCA G 


C GCTCCTTCAT CTCCCGGGAG C 
3 ACCTCTCACT GCCAGACCTC A 
CAGGAATTTG CTTCCAGGTG TGCCTGGCAT GGGCCTGGCC CAGGAAGACA C 
GAGGACTTTG CGAATATCAG AGACCTCAGA CTCTTCACAG GGTCAGGACT CAGAGAGTGT 
CTTACTGGTG GATGAGGCTG GTGGGAGCGG CAGGGCTGGG CCTGCCCCTA AGGGGAGCTC 

CCTGCAAGTC ACATTTCCCA GTGAAACACT GAACTTATCA GAAAAATGTA TATAATAGGC 1860 
AAGGAAAGAA ATACAGTACT GTTTCTGGAC CCTTATAAAA TCCTGTGCAA TAGACACATA 1920 
CATGTCACAT TTAGCTGTGC TCAGAAGGGC TATCATCA 1958 




Seq ID NO: C164 DNA Sequence 
Coding sequence: 427..1434 



CAGTATCCCT CCTGACAAAA CTAACAAAAA TCCTGTTAGC CAAATAATCA GCCACATTCA 
TATTTACCGT CAAAGTTTTT ATCCTCATTT TACAGCAGTG GAGAGCGATT GCCCCGGGTC 
CCACGTTAGG AAGAGAGAGA ACTGGGATTT GCACCCAGGC AATCTGGGGA CAGAGCTGTG 
ATCACAACTC CATGAGTCAG GGCCGAGCCA GCCCCTTCAC CACCAGCCGG CCGCGCCCCG 
GGAAGGAAGT TTGTGGCGGA GGAGGTTCGT ACGGGAGGAG GGGGAGGCGC CCACGCATCT 
GGGGCTGACT CGCTCTTTCG CAAAACGTCT GGGAGGAGTC CCTGGGGCCA CAAAACTGCC 
TCCTTCCTGA GGCCAGAAGG AGAGAAGACG TGCAGGGACC CCGCGCACAG GAGCTGCCCT 
CGCGACATGG GTCACCCGCC GCTGCTGCCG CTGCTGCTGC TGCTCCACAC CTGCGTCCCA 
GCCTCTTGGG GCCTGCGGTG CATGCAGTGT AAGACCAACG GGGATTGCCG TGTGGAAGAG 
TGCGCCCTGG GACAGGACCT CTGCAGGACC ACGATCGTGC GCTTGTGGGA AGAAGGAGAA 
GAGCTGGAGC TGGTGGAGAA AAGCTGTACC CACTCAGAGA AGACCAACAG GACCCTGAGC 
TATCGGACTG GCTTGAAGAT CACCAGCCTT ACCGAGGTTG TGTGTGGGTT AGACTTGTGC 
AACCAGGGCA ACTCTGGCCG GGCTGTCACC TATTCCCGAA GCCGTTACCT CGAATGCATT 
TCCTGTGGCT CATCAGACAT GAGCTGTGAG AGGGGCCGGC ACCAGAGCCT GCAGTGCCGC 
\GTGCCT GGATGTGGTG ACCCACTGGA TCCAGGAAGG TGAAGAAGGG 




TCCCAGCTAT GAAAACAGCT ATCTCACAAA GTTGTGTGAA GCAGAAGAGA AAAGCTGGAG 
GAAGGCCGTG GGCAATGGGA GAGCTCTTGT TATTATTAAT ATTGTTGCCG CTGTTGTGTT 
GTTGTTATTA ATTAATATTC ATATTATTTA TTTTATACTT ACATAAAGAT TTTGTACCAG 



I I I 

T CATCTTCTGA AGCTTTAAAA ACAATTGATG AATTGGCCTT CAAGATAGAC 
A ACTCGGAACT TGGCTCTCAG CGTATCATCC 
T TTTAGCATTG GTCTTCCAAG CAATAATGAA 
TCGTATTTCC AGATGGATTT TGAGAGTGGA CAAGTGGATC CACTGGCATC TGTAATTTTG 
CCTCCAAACT TACTTGAGAA TTTAAGTCCA GAAGATTCTG TATTAGTTAG A 
TTTACTTTCT TCAACAAAAC TGGACTTTTC CAGGATGTAG GACCCCAAAG A 
GTGAGTTATG TGATGGCGTG CAGTATTGGA AACATTACTA TCCAGAATCT GAAGGATCCT 
GTTCAAATAA AAATCAAACA TACAAGAACT CAGGAAGTGC ATCATCCCAT CTGTGCCTTC 
TGGGATCTGA ACAAAAACAA AAGTTTTGGA GGATGGAACA CGTCAGGATG TGTTGCACAC 
AGAGATTCAG ATGCAAGTGA GACAGTCTGC CTGTGTAACC ACTTCACACA CTTTGC-AGTT 
CTGATGGACC TTCCAAGAAG TGCCTCACAG TTAGATGCAA GAAACACTAA AGTCCTCACT 
TTCATCAGCT ATATTGGGTG TGGAATATCT GCTATTTTTT CAGCAGCAAC TCTCCTGACA 
TATGTTGCTT TTGAGAAATT GCGAAGGGAT TATCCCTCCA AAATCTTGAT GAACCTGAGC 
ACAGCCCTGC TGTTCCTGAA TCTCCTCTTC CTCCTAGATG GCTGGATCAC CTCCTTCAAT 
GTGGATGGAC TTTGCATTGC TGTTGCAGTC CTGTTGCATT TCTTCCTTCT GGCAACCTTT 
ACCTGGATGG GGCTAGAAGC AATTCACATG TACATTGCTC TAGTTAAAGT ATTTAACACT 
TACATTCGCC GATACATTCT AAAATTCTGC ATCATTGGCT GGGGTTTGCC TGCCTTAGTG 
GTGTCAGTTG TTCTAGCGAG CAGAAACAAC AATGAAGTCT ATGGAAAAGA AAGTTATGGG 
AAAGAAAAAG GTGATGAATT CTGTTGGATT CAAGATCCAG TCATATTTTA TGTGACCTGT 

CAGATCTGTG GGAGGAATGG CAAGAGAAGC AACCGGACCC TGAGAGAAGA AGTGTTAAGG 
T CTGTTGGGCA TGACATGGGG TTTTGCATTC 



TT CAGACAAC TTCTATAAAA ATATTATCAT GTCAGACACC T 
TTAATGTCTT TAAGAAAAAG AAATCAATCT GCAGAAATGT GAAGATTTGC A 

AACTGCAACT AGTGATGTAA ATGTGCTATT ACCTAGGTAA CTGCATATAT ATAAGGAATG 
TATTTTGTTA AGAAGGCTTT TGTGAAATTC AGAATTTTTC T 
TGGAAGAGTT GTCATCACTA AAACTTCAGT ACTGAGAGTA ACATGACTCA G 
AAGCTATGAT TTGTAAAATA TATAATTGAA TCAGAGTAAT CATAATGCAG GGGAGACATT 
CAAATTAGAG ACAAGGGAGA AGCAATGCTG AGGAAGACCC TAGATAGAGC 
CCACCTAATC GTTATATCTG GATATACCCA TTTTCTGCAT CTTCTTTCTC 
TGTCCTTGCT TIGGAGACTT TAAGACATTT CCTAAAGCAC AAATAAAAGC CTCGTATTTC 
CCCATTGAGA GTTTTGTTCC AAGGAATATG AAGTGAGACA TATGGGTGAG TCATAATAAT 
CAAAATAATT TATGAAGAGC TGGGTCTGCA ATAGCTAGTC TAAAAACTAC TIGTGTGTCA 
GTCCTCTGGT TATAGTATAT AAGAGCCTGA GGAGGTCTGG CAAGATAGAT GGTGTATTAT 
TTATGGATCA GGCTGCTGCA TACAAACCTT GCATACTATT ATGCAGCTTA CCTAACTCTC 
AGACTATTCT CACTAATGCT TGCTTGCTAA TGAATGTATA GGAGACCACA TTGTAATTGT 
TCTTAGATGA TGGAGTCCAT GCAGTTTCTT AGAAATCGGT CTCAGTGCAT GCTGTGCTTT 
TTCACATTTG CTCTGGGTTA TCTGGGAAGT ATCAGGTTCT GGGAGGCAAC AGCATTAAGT 
GATAAGAAAA GGAGACATTC TGGCAAAGCC AATCTGCTTA AAGGCAAAGT CCAGAACCTG 

GAACCTAGAG GGCTTTCTCT CTGCACGAAA AACAGGTAGT TTGCAGTCTG AGATATGGGA 
GAGCTTTTAG GCTACACAGC AACCCAAGGG ACCTCTCACC TTTTGCTGAG CTTCAATCAG 
GAAGCTATTT GCCTGGCTCC AGCAGATGAT GAGATAATGA GGTAGTGGGT TTTTTATTAC 
TGTTCCATTT TGCAACATCC TGCAACACCA TCCTGGGAGA CAAGAGCATT ACCCAGCTTG 
GCTTTCACGG GGGAGGGTTG TATTCAGT 

Seq ID NO: C166 DNA Sequence 
Nucleic Acid Accession #: NM_000574.1 
Coding sequence: 66.. 1211 

1 11 21 31 41 51 

I I I I I I 

CCGCTGGGCG TAGCTGCGAC TCGGCGGAGT CCCGGCGGCG CGTCCTTGTT CTAACCCGGC 
GCGCCATGAC CGTCGCGCGG CCGAGCGTGC CCGCGGCGCT GCCCCTCCTC GGGGAGCTGC 
CCCGGCTGCT GCTGCTGGTG CTGTTGTGCC TGCCGGCCGT GTGGGGTGAC TGTGGCCTTC 
CCCCAGATGT ACCTAATGCC CAGCCAGCTT TGGAAGGCCG TACAAGTTTT C 
CTGTAATAAC GTACAAATGT GAAGAAAGCT TTGTGAAAAT TCCTGGCGAG A 
TGATCTGCCT TAAGGGCAGT CAATGGTCAG ATATTGAAGA GTTCTGCAAT CGTAGCTGCG 
AGGTGCCAAC AAGGCTAAAT TCTGCATCCC TCAAACAGCC TTATATCACT C 
TTCCAGTCGG TACTGTTGTG GAATATGAGT GCCGTCCAGG TTACAGAAGA G 
TATCACCAAA ACTAACTTGC CTTCAGAATT TAAAATGGTC CACAGCAGTC GAATTTTGTA 



T TOGTGCAACC ATCTCCTTCT CATGTAACAC A 
CGACTTCTAG TTTTTGTCTT ATTTCAGGCA GCTCTGTCCA GTGGAGTGAC CCGTTGCCAG 
AGTGCAGAGA AATTTATTGT CCAGCACCAC CACAAATTGA C 



AAACCACCAC ACCAAATGCT CAAGCAACAC GGAGTACACC TGTTTCCAGG ACAACCAAGC 
ATTTTCATGA AACAACCCCA AATAAAGGAA GTGGAACCAC TTCAGGTACT ACCCGTCTTC 
TATCTGGGCA CACGTGTTTC ACGTTGACAG GTTTGCTTGG GACGCTAGTA ACCATGGGCT 
TGCTGACTTA GCCAAAGAAG AGTTAAGAAG AAAATACACA CAAGTATACA GACTG1TCCT 
AGTTTCTTAG ACTTATCTGC ATATTGGATA AAATAAATGC AATTGTGCTC TTCATTTAGG 
ATGCTTTCAT TGTCTTTAAG ATGTGTTAGG AATGTCAACA GAGCAAGGAG AAAAAAGGCA 
GTCCTGGAAT CACATTCTTA GCACACCTAC ACCTCTTGAA AATAGAACAA CTTGCAGAAT 
TGAGAGTGAT TCCTTTCCTA AAAGTGTAAG AAAGCATAGA GATTTGTTCG TATTTAGAAT 
GGGATCACGA GGAAAAGAGA AGGAAAGTGA TTTTTTTCCA CAAGATCTGT AATGTTATTT 
CCACTTATAA AGGAAATAAA AAATGAAAAA CATTAT] 
CCCAATTCAG TCTCTTCTAA GCAAAATTGC TAAAGAGAGA T_ 
TCTTTGGCTG TAAGGCATTT TCATCTTTCC TTCGGGTTGG CAAAATATTT TAAAGGTAAA 1740 
ACATGCTGGT GAACCAGGGG TGTTGATGGT GATAAGGGAG GAATATAGAA TGAAAGACTG 1800 
AATCTTCCTT TGTTGGACAA ATAGAGTTTG GAAAAAGCCT GTGAAAGGTG TCTTCTTTGA 1860 
CTTAATGTCT TTAAAAGTAT CCAGAGATAC TACAATATTA ACATAAGAAA AGATTATATA 1920 
TTATTTCTGA ATCGAGATGT CCATAGTCAA ATTTGTAAAT CTTATTCTTT T 

T ATTTATGACA GTGAACATTC TGATTTTACA TGTAAAACAA GAAAAGTTGA 
G TGAAGAAAAA TGTATTTTTC CTAAATAGAA ATAAATGATC CCATTTTTTG 



Coding sequence: 1 

1 11 21 31 41 51 

I I I I I I 

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 
GGGGGCAGCT CICCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 
GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 
CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 
CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 
GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 
CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 
CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 
CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 
GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 
ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 
AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 
TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC C 
GAACTACATT TCTATGACAA T 
CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 
ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC A 
GCAATCAGTT ACCTAATCTC CAAGTGCTAG A 
CCAGTTTTTC AGTCTGCCAA A 
GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 
CCCAATGCAT TTTCCACTTT GCCATCCCTA 



G ATGAACGTGA C 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTC TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 

ATTTTTGCTT CAGAATCATC TCTTTTCCTC CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCIACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 

CCATGTCTTA A 2651 

Seq ID NO: C168 DNA Sequence 

Nucleic Acid Accession #: NM_003667.2 

Coding sequence: 49..2772 



TGCTGCTCTC CGCCCGCGTC CGGCTCGTGG CCCCCTACTT CGG3CACCAT GGACACCTCC 
CGGCTCGGTG TGCTCCTGTC CTTGCCTGTG CTGCTGCAGC TGGCGACCGG GGGCAGCTCT 
CCCAGGTCTG GTGTGTTGCT GAGGGGCTGC CCCACACACT GTCATTGCGA GCCCGACGGC 
AGGATGTTGC TCAGGGTGGA CTGCTCCGAC CTGGGGCTCT CGGAGCTGCC TTCCAACCTC 
AGCGTCTTCA CCTCCTACCT AGACCTCAGT ATGAACAACA TCAGTCAGCT GCTCCCGAAT 
CCCCTGCCCA GTCTCCGCTT CCTGGAGGAG TTACGTCTTG CGGGAAACGC TCTGACATAC 
ATTCCCAAGG GAGCATTCAC TGGCCTTTAC AGTCTTAAAG TTCTTATGCT GCAGAATAAT 
CAGCTAAGAC ACGTACCCAC AGAAGCTCTG CAGAATTTGC GAAGCCTTCA ATCCCTGCGT 
CTGGATGCTA ACCACATCAG CTATGTGCCC CCAAGCTGTT TCAGTGGCCT GCATTCCCTG 
AGGCACCTGT GGCTGGATGA CAATGCGTTA ACAGAAATCC CCGTCCAGGC TTTTAGAAGT 
TTATCGGCAT TGCAAGCCAT GACCTTGGCC CTGAACAAAA T. 
GCCTTTGGAA ACCTCTCCAG CTTGGTAGTT CTACATC 
CTGGGAAAGA AATGCTTTGA TGGGCTCCAC AGCCTAC 

AACCTTGATG AATTCCCCAC TGCAATTAGG ACACTCTCCA ACCTTAAAGA A 
CATAGCAACA ATATCAGGTC GATACCTGAG AAAGCAT 
ACAATACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCITT TCAACATTTA 
CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT T 
ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC A 
CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG A 
GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 
GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG A 
TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1320 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1380 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTC-ATATC ATCTGAAAAC 1440 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1500 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1560 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1620 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1680 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1740 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1800 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1860 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1920 

ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA G 




CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG 



CTGAAAATGT 



Seq ID NO: C169 DNA Sequence 
Nucleic Acid Accession # 
Coding sequence: 259.. 2379 

1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 
TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 
CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 
ATCTTTGCAT CGGGATCTTC TCAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 
ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 
u . „ CA AUTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 
ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 
GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 
ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 
TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 
ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 
GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 
AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 
TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 
CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 



ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 
GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 
CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 
GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 
TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 
CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 
GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 
CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGC-CATTAT TCCCTTAAAT 



AAAGCACAAT 
CTCACCAGAA 



AGAAAAGAGC AGGGAGGTGG T 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

T TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

PATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 
TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 

AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 

TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 3342 

Seq ID NO: C170 DNA Sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 88..990 



GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 
GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 
CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 
CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 
GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 
G ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 
I TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 
TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG T< 
T CAAAATCTAA GAAGTTTCGC AGACCTGAi 

\ CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 
T GATTGGGACA G 
C AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA G 

A TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 
3AATTTC ACAGCCATGA AGATATGCTG 
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 
TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 
CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 
CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTG 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA A 
ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 
TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 1524 

Seq ID NO: C171 DNA Sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: 150.. 3362 

1 11 21 31 41 51 

I I I I I I 

AACTCCCGCC TCGGGACGCC TCGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 60 
GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 120 
CCTCAGCTCC TTTTCCTGAG CCCGCCGCGA TGGGAGCTGC GCGGGGATCC CCGGCCAGAC 180 
CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGC-CGGT ACCCAGACAG 240 
CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 
TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 
CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTI GCAGCTGTGG 420 
ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 
AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGC-CAGGT CCTGTGGTCC 540 
TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 
ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 
ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGACGCTC CGGCCAGCTG 720 
GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 
GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 
CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 
AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 960 
GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 
TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 
. A AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 
TTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 
AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 
AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 
GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 
TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 
T GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 
A GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC A' 
TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC O 
AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 
AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 
AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 
ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 
TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 
TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 
TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 
TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 
TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 
ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 
AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGC-TGGC-TG 
CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT C 
AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2520 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 
CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATG 
CAGATGATGA AGTACTGGCA GA 



AG CACOGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 

TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 4187 




CAATGCCCTC TTTATTCTCT ATTACACAGC CCAGGGGGAG CCGTTCCCCA ACAACCTGGA 300 

CAAGCTATGT GGCCCCAACG TGACGGACTT CCCGCCCTTC CACGCCAACG GCACGGAGAA 360 

GGCCAAGCTG GTGGAGCTGT ACCGCATAGT CGTGTACCTT GGCACCTCCC TGGGCAACAT 420 

CACCCGCGAC CAGAACATCC TCAACCCCAG TGCCCTCAGC CTCCACAGCA AGCTCAACGC 480 

CACCGCCGAC ATCCTGCGAG GCCTCCTTAG CAACGTGCTG TGCCGCCTGT GCAGCAAGTA 540 

;tggacg tgacctacgg ccctgacacc tcgggtaagg atgtcttcca 600 

3 CTGGGCTGTC AACTCCTGGG GAAGTATAAG CAGATCATCG CCGTGTTGGC 660 

CCAGGCCTTC TAGCAGGAGG TCTTGAAGTG TGCTGTGAAC CGAGGGATCT CAGGAGTTGG 720 

GTCCAGATGT GGGGGCCTGT CCAAGGGTGG CTGGGGCCCA GGGCATCGCT AAACCCAAAT 780 

GGGGGCTGCT GGCAGACCCC GAGGGTGCCT GGCCAGTCCA CTCCACTCTG GGCTGGGCTG 840 

TGATGAAGCT GAGCAGAGTG GAAACTTCCA TAGGGAGGGA GCTAGAAGAA GGTGCCCCTT 900 

CCTCTGGGAG ATTGTGGACT GGGGAGCGTG GGCTGGACTT CTGCCTCTAC TTGTCCCTTT 960 

GGCCCCTTGC ICAC TTTGTG CAGTGAACAA ACTACACAAG TCATCTACAA GAGCCCTGAC 1020 

CACAGGGTGA GACAGCAGGG CCCAGGGGAG TGGACCAGCC CCCAGCAAAT TATCACCATC 1080 

TGTGCCTTTG CTGCCCCTTA GGTTGGGACT TAGGTGGGCC AGAGGGGCTA GGATCCCAAA 1140 

GGACTCCTTG TCCCCTAGAA GTTTGATGAG TGGAAGATAG AGAGGGGCCT CTGGGATGGA 1200 

AGGCTGTCTT CTTTTGAGGA TGATCAGAGA ACTTGGGCAT AGGAACAATC TGGCAGAAGT 1260 

TTCCAGAAGG AGGTCACTTG GCATTCAGGC TCTTGGGGAG GCAGAGAAGC CACCTTCAGG 1320 

CCTGGGAAGG AAGACACTGG GAGGAGGAGA GGCCTGGAAA GCTTTGGTAG GTTCTTCGTT 1380 

CTCTTCCCCG TGATCTTCCC TGCAGCCTGG GATGGCCAGG GTCTGATGGC TGGACCTGCA 1440 

GCAGGGGTTT GTGGAGGTGG GTAGGGCAGG GGCAGGTTGC TAAGTCAGGT GCAGAGGTTC 1500 

TGAGGGACCC AGGCTCTTCC TCTGGGTAAA GGTCTGTAAG AAGGGGCTGG GGTAGCTCAG 1560 

AGTAGCAGCT CACATCTGAG GCCCTGGGAG GTCTTGTGAG GTCACACAGA GGTACTTGAG 1620 

GGGGACTGGA GGCCGTCTCT GGTCCCCAGG GCAAGGGAAC AGCAGAACTT AGGGTCAGGG 1680 

TCTCAGGGAA CCCTGAGCTC CAAGCGTGCT GTGCGTCTGA CCTGGCATGA TTTCTATTTA 1740 

TTATGATATC CTATTTATAT TAACTTATTG GTGCTTTCAG TGGCCAAGTT AATTCCCCTT 1800 

TCCCTGGTCC CTACTCAACA AAATATGATG ATGGCTCCCG ACACAAGCGC CAGGGCCAGG 1860 

GCTTAGCAGG GCCTGGTCTG GAAGTCGACA ATGTTACAAG TGGAATAAGC TTACGGGTGA 1920 

AGCTCAGAGA AGGGTCGGAT CTGAGAGAAT GGGGAGGCCT GAGTGGGAGT GGGGGGCCTT 1980 

GCTCCACCCC CATCCCCTAC TGTGACTTGC TTTAGCGTGT CAGGC-TCCAC- GCTGCAGGGG 2040 

CTGGGCCAAT TTGTGGAGAG GCCGGGTGCC TTTCTGTCTT GCTTCCAGGG GGCTGGTTCA 2100 

CACTGTTCTT GGGCGCCCCA GCATTGTGTT GTGAGGCGCA CTGTTCCTGG CAGATATTGT 2160 

GCCCCCTGGA GCAGTGGGCA AGACAGTCCT TGTGGCCCAC CCTGTCCTTG TTTCTGTGTC 2220 

CCCATGCTGC CTCTGAAATA GCGCCCTGGA ACAACCCTGC CCCTGCACCC AGCATGCTCC 2280 
GACACAGCAG GGAAGCTCCT CCTGTGGCCC GGACACCCAT ft 
CTGGGCCAGA CCCCAGGAAG GTGGGGTAGA CTGGGGC 

GAGGAGGAGA GGGAGGCTGC AGACGCCTGG GACTCAGACC AGGAAGCTGT GGGCCCTCCT 
GCTCCACCCC CATCCCACTC CCACCCATGT CTGGGCTCCC AGGCAGGGAA CCCGATCTCT 
TCCTTTGTGC TGGGGCCAGG CGAGTGGAGA AACGCCCTCC AGTCTGAGAG CAGGGGAGGG 
AAGGAGGCAG CAGAGTTGGG GCAGCTGCTC AGAGCAGTGT TCTGGCTTCT TCTCAAACCC 
TGAGCGGGCT GCCGGCCTCC AAGTTCCTCC GACAAGATGA TGGTACTAAT TATGGTACTT 
TTCACTCACT TTGCACCTTT CCCTGTCGCT CTCTAAGCAC TTTACCTGGA TGGCGCGTGG 
GCAGTGTGCA GGCAGGTCCT GAGGCCTGGG GTTGGGGTGG AGGGTGCGGC CCGGAGTTGT 2820 






CCCTTGTCCA GGATGCAGGG ACTGCCTTCT CCTTCCTGCT T 

CTGGCTGCAT TCCCCCAGGA TGGGCTTCGA GAAAGACAAA CTTGTCTGGA AACCAGAGTT 3000 

GCTGATTCCA CCCGGGGGGC CCGGCTGACT CGCCCATCAC CTCATCTCCC TGTGGACTTG 3060 

GGAGCTCTGT GCCAGGCCCA CCTTGCGGCC CTGGCTCTGA GTCGCTCTCC CACCCAGCCT 3120 

GGACTTGGCC CCATGGGACC CATCCTCAGT GCTCCCTCCA GATCCCGTCC GGCAGCTTGG 3180 

CGTCCACCCT GCACAGCATC ACTGAATCAC AGAGCCTTTG CGTGAAACAG CTCTGCCAGG 3240 

CCGGGAGCTG GGTTTCTCTT CCCTTTTTAT CTGCTGGTGT GGACCACACC TGGGCCTGGC 3300 

CGGAGGAAGA GAGAGTTTAC CAAGAGAGAT GTCTCCGGGC CCTTATTTAT TATTTAAACA 3360 

TTTTTTTAAA AAGCACTGCT AGTTTACTTG TCTCTCCTCC CCATCGTCCC CATCGTCCTC 3420 

CTTGTCCCTG ACTTGGGGCA CTTCCACCCT GACCCAGCCA GTCCAGCTCT GCCTTGCCGG 3480 
CTCTCCAGAG TAGACATAGT G' 
CCCTGCAGAT GGTACAGATG T' 

GAGGTTTCCC TCCTTCCTTT CCACTGAAAA GCACATCGCC TTGGGTGACA AATTCCTCTT 3720 

TGATGAATGT ACCCTGTGGG GATGTTTCAT ACTGACAGAT TATTTTTATT TATTCAATGT 3780 

CATATTTAAA ATATTTATTT TTTATACCAA ATGAATCACT TTTTTTTTTA AGAAAAAAAA 3840 

GAGAAATGAA TAAAGAATCT ACTCTTCG 3868

Seq ID NO: C173 DNA
Nucleic Acid Accessi 



CTCTGCCCGG GACCTCAGCA TGAACAACCT 
CCTGCGCTTC TTCGAGGAGC TGCGTCTCTC 
CTCTCACACA TCCCAGGACA AGCATTCTCT GGTCTCTACA GCCTGAAAAT 
CAGAACAATC AGCTGGGAGG AATCCCCGCA GAGGCGCTGT G 



TCCTCCCTCC GCCACCTCTG GCTGGACGAC AATGCACTCA CGGAGATCCC 
TGTCAGGGCC CTCAACAACC TCCCTGCCCT GCAGGCCATG A 
CAGCCACATC CCCGACTACG CGTTCCAGAA TCTCACCAGC C 
TAACAACCGC ATCCAGCATC TGGGGACCCA CAGCTTCGAG G 
ACTAGACCTG AATTATAACA AGCTGCAGGA G 
ACTGCAGGAA CTGGGGTTCC ATAACAACAA CATCAAGGCC A 
GGGGAACCCT CTGCTACAGA CGATACACTT TTATGATAAC C 
ATCGGCATTC CAGTACCTGC CTAAAC1 

CCAGGAGTTT CCAGATCTCA AAGGCACCAC CAGCCTGGAG ATCCTGACCC T 
AGGCATCCGG CTGCTCCCAT CGGGGA1 
ACTGTCTCAC AATCAAATTG AGGAGCTGCC C
AATCGGCCTC CAACACAACC GCATCTGGGA AATTGGAGCT GACACCTTCA GCCAGCTGAG 
rCCCTGl GCi rGGATl [B rGG.a 1 ULCATCCGG TCCATCCACC CCGAGGCCTT 
CTCCACCCTG CACTCCCTGG TCAAGCTGGA CCTGACAGAC AACCAGCTGA CCACACTGCC 
CCTGGCTGGA CTTGGGGGCT TGATGCATCT GAAGCTCAAA GGGAACCTTG CTCTCTCCCA 
GGCCTTCTCC AAGGACAGTT TCCCAAAACT GAGGATCCTG GAGGTGCCTT ATGCCTACCA 
GTGCTGTCCC TATGGGATGT GTGCCAGCTT CTTCAAGGCC TCTGGGCAGT GGGAGGCTGA 
AGACCTTCAC CTTGATGATG AGGAGTCTTC AAAAAGGCCC CTGGGCCTCC TTGCCAGACA 
AGCAGAGAAC CACTATGACC AGGACCTGGA TGAGCTCCAG CTGGAGATGG AGGACTCAAA 
GCCACACCCC AGTGTCCAGT GTAGCCCTAC TCCAGGCCCC TTCAAGCCCT GTGAGTACCT 



AGCCTCAGTC GATGCCCTGA CCTTTGGTCA GTTCTCTGAG TACGGAGCCC GCTGGGAGAC 1800 

GGGGCTAGGC TGCCGGGCCA CTGGCTTCCT GGCAGTACTT GGGTCGGAGG CATCGGTGCT 1860 

GCTGCTCACT CTGGCCGCAG TGCAGTGCAG CGTCTCCGTC TCCTGTGTCC GGGCCTATGG 1920 

GAAGTCCCCC TCCCTGGGCA GCGTTCGAGC AGGGGTCCTA GGCTGCCTGG CACTGGCAGG 1980 

GCTGGCCGCC GCGCTGCCCC TGGCCTCAGT GGGAGAATAC GGGGCCTCCC CACTCTGCCT 2040 

GCCCTACGCG CCACCTGAGG GTCAGCCAGC AGCCCTGGGC TTCACCGTGG CCCTGGTGAT 2100

GATGAACTCC TTCTGTTTCC TGGTCGTGGC CGGTGCCTAC ATCAAACTGT ACTGTGACCT 2160

GCCGCGGGGC GACTTTGAGG CCGTGTGGGA CTGCGCCATG GTGAGGCACG TGGCCTGGCT 2220 

CATCTTCGCA GACGGGCTCC TCTACTGTCC CGTGGCCTTC CTCAGCTTTG CCTCCATGCT 2280

GGGCCTCTTC CCTGTCACGC CCGAGGCCGT CAAGTCTGTC CTGCTGGTGG TGCTGCCCCT 2340 

GCCTGCCTGC CTCAACCCAC TGCTGTACCT GCTCTTCAAC CCOCACTTCC GGGATGACCT 2400 

TCGGGGGCTT CGGCCCCGCG CAGGGGACTC AGGGCCCCTA GCCTATGCTG CGGCCGGGGA 2460 

GCTGGAGAAG AGCTCCTGTG ATTCTACCCA GGCCCTGGTA GCCTTCTCTG ATGTGGATCT 2520 

CATTCTGGAA GCTTCTGAAG CTGGGCGGCC CCCTGGGCTG GAGACCTATG GCTTCCCCTC 2580

AGTGACCCTC ATCTCCTGTC AGCAGCCAGG GGCCCCCAGG CTGGAGGGCA GCCATTGTGT 2640 

AGAGCCAGAG GGGAACCACT TTGGGAACCC CCAACCCTCC ATGGATGGAG AACTGCTGCT 2700

GAGGGCAGAG GGATCTACGC CAGCAGGTGG AGGCTTGTCA GGGGGTGGCG GCTTTCAGCC 2760

CTCTGGCTTG GCCTTTGCTT CACACGTGTA AATATCCCTC CCCATTCTTC TCTTCCCCTC 2820 

TCTTCCCTTT CCTCTCTCCC CCTCGGTGAA TGATGGCTGC TTCTAAAACA AATACAACCA 2880 

AAACTCAGCA GTGTGATCTA TAGCAGGATG GCCCAGTCCC TGCCTCCACT GATCACCTCT 2940

CTCCTGTGAC CATCACCAAC GGGTGCCTCT TGGCCTGGCT TTCCCTTGGC CTTCCTCAGC 3000 

TTCACCTTGA TACTGGGCCT CTTCCTTGTC ATGTCTGAAG CTGTGGACCA GAGACCTGGA 3060 

CTTTTGTCTG CTTAAGGGAA ATGAGGGAAG TAAAGACAGT GAAGGGGTGG AGGGTTGATC 3120 

AGGGCACAGT GGACAGGGAG ACCTCACAGA GAAAGGCCTG GAAGGTGATT TCCCGTGTGA 3180 

CTCATGGATA GGATACAAAA TGTGTTCCAT GTACCATTAA TCTTGACATA TGCCATGCAT 3240 
Coding sequence: 101..2044

i r f i 1 i 1 i 1 

AGTCTGGCCC TGGACAACCC CAGCAAAGCC GCCCTCAGCC AGCCCAGAAG CACTGGGCCT 
TGGCCACAGC AACACCCACT GAGCACGCTG GGAGCTGAGT ATGGCG1CCC TGGTCTCGCT 
GGAGCTGGGG CTGCTTCTGG CTGTGCTGGT GGTGACGGCG ACGGCGTCCC CGCCTGCTGG 
TCTGCTGAGC CTGCTCACCT CTGGCCAGGG CGCTCTGGAT CAAGAGGCTC TGGGCGGCCT 
GTTAAATACG CTGGCGGACC GTGTGCACTG CACCAACGGG CCGTGTGGAA AGTGCCTGTC 
TGTGGAGGAC GCCCTGGGCC TGGGCGAGCC TGAGGGGTCA GGGCTGCCCC CGGGCCCGGT 
CCTGGAGGCC AGGTACGTCG CCCGCCTCAG TGCCGCCGCC GTCCTGTACC TCAGCAACCC 
CGAGGGCACC TGTGAGGACA CTCGGGCTGG CC 
GGCCCTGCTC GAGAGCCCCA AGGCCCTGAC CO 
GCAGGCCCGG GCTGCCGGCC AGACCCCCAA GACGGCCTGC GTAGATATCC CTCAGCTGCT 
GGAGGAGGCG GTGGGGGCGG GGGCTCCGGG CAGTGCTGGC GGCGTCCTGG CTGCCCTGCT 
GGACCATGTC AGGAGCGGGT CTTGCTTCCA CGCCTTGCCG AGCCCTCAGT ACTTCGTGGA
CTTTGTGTTC CAGCAGCACA GCAGCGAGGT CCCTATGACG CTGGCCGAGC TGTCAGCCTT 
GATGCAGCGC CTGGGGGTGG GCAGGGAGGC CCACAGTGAC CACAGTCATC GGCACAGGGG 



ACTGGCTGCA GGGGGGTCGC CCACTACATC CTGCAGACCT TCCTGAGCCT 



GAGACTGAGC CCAGAGTTGA GGCTACTGCC CTATATGATC ACTCTGGGCG ACGCCGTGCA 



CCTCCGCGCT 



CTGGATCCTG GCAGTGGCCA CCGGCCTGTT CCTCTACGTA GCACTCTGCG ACATGCTCCC 1920 

GGCGATGTTG AAAGTACGGG ACCCGCGGCC CTGGCTCCTC TTCCTGCTGC ACAACGTGGG 1980 

CCTGCTGGGC GGCTGGACCG TCCTGCTGCT GCTGTCCCTG TACGAGGATG ACATCACCTT 2040 

CTGATACCCT GCCCTAGTCC CCCACCTTTG ACTTAAGATC CCACACCTCA CAAACCTACA 2100 

CCCCAGAAAC CAGAAGCCCC TATAGAGGCC CCAGTCCCAA CTCCAGTAAA GACACTCTTG 2160 

TCCTTGGAAA AAAAAAAAAA AAAAAAAAAA AA 2192 

Seq ID NO : 

Coding sequence: 



ATGGCGAACG CGAGCGAGCC GGGTGGCAGC GGCGGCGGCG AGGCGGCCGC CCTGGGCCTC 60 

AAGCTGGCCA CGCTCAGCCT GCTGCTGTGC GTGAGCCTAG CGGGCAACGT GCTGTTCGCG 120 

CTGCTGATCG TGCGGGAGCG CAGCCTGCAC CGCGCCCCGT ACTACCTGCT GCTCGACCTG 180 

TGCCTGGCCG ACGGGCTGCG CGCGCTCGCC TGCCTCCCGG CCGTCATGCT GGCGGCGCGG 240 

CGTGCGGCGG CCGCGGCGGG GGCGCCGCCG GGCGCGCTGG GCTGCAAGCT GCTCGCCTTC 300 

CTGGCCGCGC TCTTCTGCTT CCACGCCGCC TTCCTGCTGC TGGGCGTGGG CGTCACCCGC 360

TACCTGGCCA TCGCGCACCA CCGCTTCTAT GCAGAGCGCC TGGCCGGCTG GCCGTGCGCC 420 

GCCATGCTGG TGTGCGCCGC CTGGGCGCTG GCGCTGGCCG CGGCCTTCCC GCCAGTGCTG 480 

GACGGCGGTG GCGACGACGA GGACGCGCCG TGCGCCCTGG AGCAGCGGCC CGACGGCGCC 540 

CCCGGCGCGC TGGGCTTCCT GCTGCTGCTG GCCGTGGTGG TGGGCGCCAC GCACCTCGTC 600 
TACCTCCGCC TGCTCTTCTT CATCCACGAC 



AACTGGACGG CGGGCTTCGG CCGCGGGCCC ACGCCGCCCG CGCTTGTGGG CATCCGGCCC 780
GCAGGGCCGG GCCGCGGCGC GCGCCGCCTC CTCGTGCTGG A 
AGGCTGTGCA AGATGTTCTA CGCCGTCACG CTGCTCTTCC T 

GTCGTGGCCA GCTACCTGCG GGTCCTGGTG CGGCCCGGCG CCGTCCCCCA GGCCTACCTG 
ACGGCCTCCG TGTGGCTGAC CTTCGCGCAG GCCGGCATCA ACCCCGTCGT GTGCTTCCTC 
TTCAACAGGG AGCTGAG3GA CTGCTTCAGG GCCCAGTTCC CCTGCTGCCA GAGCCCCCGG 
ACCACCCAGG CGACCCATCC CTGCGACCTG AAAGGCATTG GTTTATGA 




3 CCTCCGCAGC CCAACATGGG CCCCGGGTTC CAAAGTTTGC GAAGTTGGGC 
GCCGAGGGGC CGGGGCGCGC GGAGCGTCCG GGGGGGCCCG G 
ACAGGTCGCC TGAGCCGCCT CCGCGGCCGC C 
AGGAGCAGGC GGGGGCGCCG GGGCTTTTGC TGAGTTGGCG GGGTTGGCCA TGGCCGCTGC 
CCGCCCAGCG CGGGGGCCGG AGCTCCCGCT CCTGGGGCTG CTGCTGCTGC TGCTGCTGGG 
GGACCCGGGC CGGGGGGCGG CCTCGAGCGG GAACGCGACC GGGCCTGGGC CTCGGAGCGC 
GGGCGGGAGC GCGAGGAGGA GCGCGGCGGT GACTGGCCCT CCGCCGCCGC TGAGCCACTG 
CGGCCGGGCT GCCCCCTGCG AGCCGCTGCG CTACAACGTG TGCCTGGGCT CGGTGCTGCC 
CTACGGGGCC ACCTCCACAC TGCTGGCCGG AGACTCGGAC TCCCAGGAGG A 
CAAGCTCGTG CTCTGGTCGG GCCTCCGGAA TGCCCCCCGC TGCTGGGCAG TGATCCAGCC 660

CCTGCTGTGT GCCGTATACA TGCCCAAGTG TGAGAATGAC CGGGTGGAGC TGCCCAGCCG 720 

TACCCTCTGC CAGGCCACCC GAGGCCCCTG TGCCATCGTG GAGAGGGAGC GGGGCTGGCC 780 

TGACTTCCTG CGCTGCACTC CTGACCGCTT CCCTGAAGGC TGCACGAATG AGGTGCAGAA 840 

CATCAAGTTC AACAGTTCAG GCCAGTGCGA AGTGCCCTTG GTTCGGACAG ACAACCCCAA 900 

GAGCTGGTAC GAGGACGTGG AGGGCTGCGG CATCCAGTGC CAGAACCCGC TCTTCACAGA 960 

GGCTGAGCAC CAGGACATGC ACAGCTACAT CGCGGCCTTC GGGGCCGTCA CGGGCCTCTG 1020 

CACGCTCTTC ACCCTGGCCA CATTCGTGGC TGACTGGCGG AACTCGAATC GCTACCCTGC 1080 

TGTTATTCTC TTCTACGTCA ATGCGTGCTT CTTTGTGGGC AGCATTGGCT GGCTGGCCCA 1140 

GTTCATGGAT GGTGCCCGCC GAGAGATCGT CTGCCGTGCA GATGGCACCA TGAGGCTTGG 1200 

GGAGCCCACC TCCAATGAGA CTCTGTCCTG CGTCATCATC TTTGTCATCG TGTACTACGC 1260

CCTGATGGCT GGTGTGGTTT GGTTTGTGGT CCTCACCTAT GCCTGCCACA CTTCCTTCAA 1320

AGCCCTGGGC ACCACCTACC AGCCTCTCTC GGGCAAGACC TCCTACTTCC ACCTGCTCAC 1380 

CTGGTCACTC CCCTTTGTCC TCACTGTGGC AATCCTTGCT GTGGCGCAGG TGGATGGGGA 1440 

CTCTGTGAGT GGCATTTGTT TTGTGGGCTA CAAGAACTAC CGATACCGTG CGGGCTTCGT 1500 

GCTGGCCCCA ATCGGCCTGG TGCTCATCGT GGGAGGCTAC TTCCTCATCC GAGGAGTCAT 1560 

GACTCTGTTC TCCATCAAGA GCAACCACCC CGGGCTGCTG AGTGAGAAGG CTGCCAGCAA 1620 



CCGGGACTAT GTGCTATGTC AGGCCAATGT 

CATGTTTGGA ACTGGCATCG 
CTGGAGGCGT ACCTGGTGCA 






A TGCACACTGT GTCCCACGAC GGGCCCGTGG CSGGCTTGGC 
CTTTGACCTC AATGAGCCCT CAGCTGATGT CTCCTCTGCC TGGGCGC 

I CGGAGAGGAG CCATACTGCC CCAGGATATT TCTGTCACCC C 



G CGCCTGGGCC GGAAGAAGAA GAGGAGGAAG AGGAAGAAGG AGGTGTGCCC 
GCTGGCGCCG CCCCCTGAGC TTCACCCCCC TGCCCCTGCC C 



3 GGAGCGTGGA CCCTGGTCTC CAACCCATTC TGCCCAGAGC CCAGTCCCCC 
TCAGGATCCA TTTCTGCCCA GTGCACCGGC CCCCGTGGCA TGGGCTCATG GCCGCCGACA 
GGGCCTGGGG CCTATTCACT CCCGCACCAA CCTGATGGAC ACAGAACTCA TGGATGCAGA 
CTCGGACTTC TGAGCCTGCA GAGCAGGACC TGGGACAGGA AAGAGAGGAA CCAATACCTT 
CAAGGCTCTT CTTCCTCACC GAGCATGCTT CCCTAGGATC CCGTCTTCCA GAGAACCTGT 



; AGCTGCAGCC TGCTTGCCAG CATCTGCTCC ATCGGGGCAG 
G CAGAGGTGAC AGTTCCCAGA 

GTGGGCTTTG GTGGCCAGGG AGGCAGCCTA Q 
CGTTTTCTGG GCTGATGGGT GCCCTTTCCT GGCAGTCTCA GTCCAAAAGT GTTGACTGTG 
TCATTAGTCC TTTGTCTAAG TAGGGCCAGG GCACCGTATT CCTCTCCCAG GTGTTTGTGG 
GGCTGGAAGG ACCTGCTCCC ACAGGGGCCA TGTCCTCTCT TAATAGGTGG CACTACCCCA 
AACCCATCTT TTGTTCTCCT ATATCCTCCT TCTCCTGTTC CATTTCAGTT CAGTTTCAGC 
GGTGCCAACC TCTTTGCGTT TCCTTTTTGT TGATGAGGAC CCAGAGCTGC TGCACACACT 
CACCTCTAAC CCCCTCCCCT CGCTGCTGGG CCCCATCTCC A 



3480 
3540 



CTGGGAAGCT GCCGTAGCTT CAGCCAGGCA AGAAAGCTTC CTTCAACCTG CATAGCCGGT 3600
GGGTGAGGAG ATTCCCACCT TCCATAGCCT CCAAACATGT TCCCAAGGCC CCACTTTCAA 
GAATCAGACA GCAGGAAGCC ATAGATGCTG GCTGGGTTCC AGGTTATGGG GAGAAGAAAT 
ACAGTCAATA AAAGGTTTTT GTATAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: C177 DNA Sequence
Nucleic Acid 
Coding sequenc 



ATGGGGGCCC GGAGCGGAGC TCGGGGCGCG CTGCTGCTGG CACTGCTGCT CTGCTGGGAC 
CCGAGGCTGA GCCAAGCAGG TAGGAAGCGA TCGGGTGAGG T 
TCAGCGCCAG CAGAGCCGCT GCCCTAC 

AAGAACAAGC CTGTGGAGCT CCGCTGCCGC GCCTTCCCCG CCACACAGAT CTACTTCAAG 
TGCAACGGCG AGTGGGTCAG CCAGAACGAC C 




CAGGTGGAGG AGCTCTTTGG GCTGGAGGAT TACTGGTGCC AGTGCGTGGC CTGGAGCTCC 
K3CCTAC GTCCGCATCG CCTACCTGCG CAAGAACTTC 
A GGAGGTGCCC CK3GACCATG AGGTTCTCCT GCAGTGCCGC 
T GGCCGAGGTG GAATGGCTCA AGAATGAGGA TGTCATCGAC 
CCCACCCAGG ACACCAACTT CCTGCTCACC ATCGACCACA ACCTCATCAT CCGCCAGGCC 
CGCCTGTCGG ACACTGCCAA CTATACCTGC GTGGCCAAGA ACATCGTGGC CAAACGCCGG 
AGCACCACTG CCACCGTCAT CGTCTACGTG AATGGCGGCT GGTCCAGCTG GGCAGAGTGG 
TCACCCTGCT CCAACCGCTG TGGCCGAGGC TGGCAGAAGC GCACCCGGAC CTGCACCAAC 
CCCGCTCCAC TCAACGGAGG GGCCTTCTGC GAGGGCCAGG CATTCCAGAA GACCGCCTGC 



ACTGAGTGTG CCCACTGGCG TAGCCGCGAG TGCATGGCGC C 
CGTGACTGCA GCGGGACGCT GCTCGACTCT AAGAACTGCA CAGATGGGCT GTGCATGCAA 
AATAAGAAAA CTCTAAGCGA CCCCAACAGC CACCTGCTGG AGGCCTCAGG GGATGCGGCG 
CTGTATGCGG GGCTCGTGGT GGCCATCTTC GTGGTCGTGG CAATCCTCAT GGCGGTGGGG 
GTGGTGGTGT ACCGCCGCAA CTGCCGTGAC TTCGACACAG ACATCACTGA CTCATCTGCT 
GCCCTGACTG GTGGTTTCCA CC 
GTGTATGCCC TGCAGGACTC CACCGACAAA ATCCCCATGA C
CCCTTACCCA GCCTTAAGGT CAAGGTCTAC AGCTCCAGCA CCACGGGCTC TGGGCCAGGC 
CTGGCAGATG GGGCTGACCT GCTGGGGGTC TTGCCGCCTG GCACATACCC TAGCGATTTC 
GCCCGGGACA CCCACTTCCT GCACCTGCGC AGCGCCAGCC TCGGTTCCCA GCAGCTCTTG 
GGCCTGCCCC GAGACCCAGG GAGCAGCGTC AGCGGCACCT TTGGCTGCCT GGGTGGGAGG 
CTCAGCATCC CCGGCACAGG GGTCAGCTTG CTGGTGCCCA ATGGAGCCAT TCCCCAGGGC 

GGGACCCAGA CAGTATTGAG CCCCTCGGTG ACCTGTGGAC CCACAGGCCT CCTGCTGTGC 
CGCCCCGTCA TCCTCACCAT GCCCCACTGT GCCGAAGTCA GTGCCCGTGA CTGGATCTTT 
CAGCTCAAGA CCCAGGCCCA CCAGGGCCAC TGGGAGGAGG TGGTGACCCT GGATGAGGAG 
ACCCTGAACA CACCCTGCTA CTGCCAGCTG GAGCCOGGG CCTGTCACAT CCTGCTGGAC 
CAGCTGGGCA CCTACGTGTT CACGGGCGAG TCCTATTCCC GCTCAGCAGT CAAGCGGCTC 
CAGCTGGCCG TCTTCGCCCC CGCCCTCTGC ACCTCCCTGG AGTACAGCCT CCGGGTCTAC 
TGCCTGGAGG ACACGCCTGT AGCACTGAAG GAGGTGCTGG AGCTGGAGCG GACTCTGGGC 
GGATACTTGG TGGAGGAGCC GAAACCGCTA ATGTTCAAGG ACAGTTACCA CAACCTGCGC 
CTCTCCCTCC ATGACCTCCC CCATGCCCAT TGGAGGAGCA AGCTGCTGGC CAAATACCAG 
GAGATCCCCT TCTATCACAT TTGGAGTGGC AGCCAGAAGG CCCTCCACTG CACTTTCACC 
CTGGAGAGGC ACAGCTTGGC CTCCACAGAG CTCACCTGCA AGATCTGCGT GCGGCAAGTG 
GAAGGGGAGG GCCAGATATT CCAGCTGCAT ACCACTCTGG CAGAGACACC TGCTGGCTCC 
CTGGACACTC TCTGCTCTGC CCCTGGCAGC ACTGTCACCA CCCAGCTGGG ACCTTATGCC 
TTCAAGATCC CACTGTCCAT CCGCCAGAAG ATATGCAACA GCCTAGATGC CCCCAACTCA 
CGGGGCAATG ACTGGCGGAT GTTAGCACAG AAGCTCTCTA TGGACCGGTA CCTGAATTAC
TTTGCCACCA AAGCGAGCCC CACGGGTGTG ATCCTGGACC TCTGGGAAGC TCTGCAGCAG 
GACGATGGGG ACCTCAACAG CCTGGCGAGT GCCTTGGAGG AGATGGGCAA GAGTGAGATG 
CTGGTGGCTG TGGCCACCGA CGGGGACTGC TGAGCCTCCT G 
ACTGGCAGGA GGCAGGTGCA GGGAGGCCTG GGGCAGCCTC CTGATGGGGA T 
CTGCTTCCTC CCAGTTCACA GCCAGAGTTG CCTCTCCTCC TCCTCTTCCC CAACCCCCAG 
ACCATGACCA GCCTTAGAAA ATCCATGTAC TCTGTTGTTA GAGGGCCCAG AGTTCCTTCT 
CCACCCCCGC TCTCTCTCTC TTGGCCTGAG ATCTCIGTGC AGGAACCAAG ATGGGGCTGA 
AGCCTCTGGA GGCAGTTGGT TGGGGGCGGG CAGGCAGGAG GCCCTCCCTC CACCCCCCCA 
CCCTCAGCCC GGCAACTTCT GGGTTCCATG GGTTTTAGTT CCGTTCTCGT TTTCTTCCTC 
CGTTATTGAT TTCTCCTTTC TCCCTAAGCC CCCTTCTGCT TCCACGCCCT TTTCCTCTTT 
GAAGAGTCAA GTACAATTCA GACAAACTGC TTTCTCCTGT CCAAAAGCAA AAAGGCAAAG
GAAAGAAAGA AAGCTTCAGA CCGCTAGTAA GGCTCAAAGA AGAAGAAAAA CACCAAAACC 
ACAAGGGAAA AGAAAAACCC AGTTTCTTAG GAAACGCAAA OGATTTATTA TCCAGATTAT 

Seq ID NO: C178 DNA Sequence
Coding sequence: 310..1359



GCACCGGTCG 




CAAAGAGAAG 



TACGGCATCG GCTTCGCCAA GGTCTTTGTG GATGCCCGGG AGATCAAGCA GAATGCCCGG 840 

ACTCTCATGA ACTTGCACAA CAACGAGGCA GGCCGAAAGA TCCTGGAGGA GAACATGAAG 900 

CTGGAATGTA AGTGCCACGG CGTGTCAGGC TCGTGCACCA CCAAGACGTG CTGGACCACA 960

CTGCCACAGT TTCGGGAGCT GGGCTACGTG CTCAAGGACA AGTACAACGA GGCCGTTCAC 1020

GTGGAGCCTG TGCGTGCCAG CCGCAACAAG CGGCCCACCT TCCTGAAGAT CAAGAAGCCA 1080 

CTGTCGTACC GCAAGCCCAT GGACACGGAC CTGGTGTACA TCGAGAAGTC GCCCAACTAC 1140 



GCTCCCCAGG CCAGCGGCTG TGACCTCATG TGCTGTGGGC GTGGCTACAA CACCCACCAG 
TACGCCCGCG TGTGGCAGTG CAACTGTAAG TTCCACTGGT GCTGCTATGT CAAGTGCAAC 
ACGTGCAGCG AGCGCACGGA GATGTACACG TGCAAGTGAG CCCCGTGTGC ACACCACCCT 
CCCGCTGCAA GTCAGATTGC TGGGAGGACT GGACCGTTTC CAAGCTGCGG GCTCCCTGGC 
AGGATGCTGA GCTTGTCTTT TCTGCTGAGG AGGGTACTTT TCCTGGGTTT CCTGCAGGCA 
TCCGTGGGGG AAAAAAAATC TCTCAGAGCC CTCAACTATT CTGTTCCACA CCCAATGCTG 
CTCCACCCTC CCCCAGACAC AGCCCAGGTC CCTCCGCGGC TG



Coding sequence 
TGTCCTGCTG TGGTGCGTCT CCTGGGCGGA CCTTTTTTAC TCCTTCCATG G
TGGCCGGGCC CCTGCCCCTG TTTTCTTTGT CACCCCCTTG GTGGTGGGGG T' 
GCTGGCCACC CTGCTGATAC AGTATGAGCG GCTGCAGGGC GTACAGTCTT CGGGGGTCCT 
CATTATCTTC TGGTTCCTGT GTGTGGTCTG CGCCATCGTC CCATTCCGCT CCAAGATCCT 
TTTAGCCAAG GCAGAGGGTG AGATCTCAGA CCCCTTCCGC TTCACCACCT TCTACATCCA 
CTTTGCCCTG GTACTCTCTG CCCTCATCTT GGCCTGCTTC AGGGAGAAAC CTCCATTTTT 
CTCCGCAAAG AATGTCGACC CTAACCCCTA CCCTGAGACC AGCGCTGGCT TTCTCTCCCG 
CCTGTTTTTC TGGTGGTTCA CAAAGATGGC CATCTATGGC TACCGGCATC CCCTGGAGGA 
GAAGGACCTC TGGTCCCTAA AGGAAGAGGA CAGATCCCAG ATGGTGGTGC AGCAGCTGCT 
GGAGGCATGG AGGAAGCAGG AAAAGCAGAC GGCACGACAC AAGGCTTCAG CAGCACCTGG 
GAAAAATGCC TCCGGCGAGG ACGAGGTGCT GCTGGGTGCC CGGCCCAGGC CCCGGAAGCC
CTCCTTCCTG AAGGCCCTGC TGGCCACCTT CGGCTCCAGC TTCCTCATCA GTGCCTGCTT 
CAAGCTTATC CAGGACCTGC TCTCCTTCAT CAATCCACAG CTGCTCAGCA TCCTGATCAG
GTTTATCTCC AACCCCATGG CCCCCTCCTG GTGGGGCTTC CTGGTGGCTG GGCTGATGTT 
CCTGTGCTCC ATGATGCAGT CGCTGATCTT ACAACACTAT TACCACTACA TCTTTGTGAC 
TGGGGTGAAG TTTCGTACTG GGATCATGGG TGTCATCTAC AGGAAGGCTC TGGTTATCAC 
CAACTCAGTC AAACGTGCGT CCACTGTGGG GGAAATTGTC AACCTCATGT CAGTGGATGC 
CCAGCGCTTC ATGGACCTTG CCCCCTTCCT CAATCTGCTG TGGTCAGCAC C 
CATCCTGGCG ATCTACTTCC TCTGGCAGAA CCTAGGTCCC TCTGTCCTGG C 

C TTGCTGATTC CACTCAACGG AGCTGTGGCC GTGAAGATGC GCGCCTTCCA 
WlTTGA AGGACTCGCG CATCAAGCTG ATGAGTGAGA TCCTGAACGG 
CATCAAGGTG CTGAAGCTGT ACGCCTGGGA GCCCAGCTTC CTGAAGCAGG TGGAGGGCAT 



TCTGTGTCCT TGTTTAATAT 
AACCTGACTC AGGCCAGTGT 
CGGATCCAGC AATTCCTGAG CCAAGAGGAA CTTGACCCCC AGAGTGTGGA 



GCGGGTCAGT CTGGCTCGAG CTGTTTACAG TGATGCCGAT ATTTTCTTGC T 



AGGCGTGCTG GCAGGCAAGA CGCGAGTGCT GGTGACGCAC GGCATTAGCT TCCTGCCCCA 2520 

GACAGACTTC ATCATTGTGC TAGCTGATGG ACAGGTGTCT GAGATGGGCC CGTACCCAGC 2580

CCTGCTGCAG CGCAACGGCT CCTTTGCCAA CTTTCTCTGC AACTATGCCC CCGATGAGGA 2640 

CCAAGGGCAC CTGGAGGACA GCTGGACCGC GTTGGA^GGT GCAGAGGATA AGGAGGCACT 2700 

GCTGATTGAA GACACACTCA GCAACCACAC GGATCTGACA GACAATGATC CAGTCACCTA 2760 

TGTGGTCCAG AAGCAGTTTA TGAGACAGCT GAGTGCCCTG TCCTCAGATG GGGAGGGACA 2820 

GGGTCGGCCT GTACCCCGGA GGCACCTGGG TCCATCAGAG AAGGTGCAGG TGACAGAGGC 2880 

GAAGGCAGAT GGGGCACTGA CCCAGGAGGA GAAAGCAGCC ATTGGCACTG TGGAGCTCAG 2940 

TGTGTTCTGG GATTATGCCA AGGCCGTGGG GCTCTGTACC ACGCTGGCCA TCTGTCTCCT 3000 

GTATGTGGGT CAAAGTGCGG CTGCCATTGG AGCCAATGTG TGGCTCAGTG CCTGGACAAA 3060 

TGATGCCATG GCAGACAGTA GACAGAACAA CACTTCCCTG AGGCTGGGCG TCTATGCTGC 3120 

TTTAGGAATT CTGCAAGGGT TCTTGGTGAT GCTGGCAGCC ATGGCCATGG CAGCGGGTGG 3180 

CATCCAGGCT GCCCGTGTGT TGCACCAGGC ACTGCTGCAC AACAAGATAC GCTCGCCACA 3240 

GTCCTTCTTT GACACCACAC CATCAGGCCG CATCCTGAAC TGCTTCTCCA AGGACATCTA 3300 

TGTCGTTGAT GAGGTTCTGG CCCCTGTCAT CCTCATGCTG CTCAATTCCT TCTTCAACGC 3360 

CATCTCCACT CTTGTGGTCA TCATGGCCAG CACGCCGCTC TTCACTGTGG TCATCCTGCC 3420

CCTGGCTGTG CTCTACACCT TAGTGCAGCG CTTCTATGCA GCCACATCAC GGCAACTGAA 3480 

GCGGCTGGAA TCAGTCAGCC GCTCACCTAT CTACTCCCAC TTTTCGGAGA CAGTGACTGG 3540 

TGCCAGTGTC ATCCGGGCCT ACAACCGCAG CCGGGATTTT GAGATCATCA GTGATACTAA 3600 

3720 

Z CTGAACCCGG GGCTGGTGGG CCTTTCTGTG TCCTACTCCT TGCAGGTGAC 3780 

ATTTGCTCTG AACTGGATGA TACGAATGAT GTCAGATTTG GAATCTAACA TCGTGGCTGT 3840 

GGAGAGGGTC AAGGAGTACT CCAAGACAGA GACAGAGGCG CCCTGGCTGG TGGAAGGCAG 3900

CCGCCCTCCC GAAGGTTGGC CCCCACGTGG GGAGGTGGAG TTCCGGAATT ATTCTGTGCG 3960

CTACCGGCCG GGCCTAGACC TGGTGCTGAG AGACCTGAGT CTGCATGTGC ACGGTGGCGA 4020

GAAGGTGGGG ATCGTGGGCC GCACTGGGGC TGGCAAGTCT TCCATGACCC TTTGCCTGTT 4080 

CCGCATCCTG GAGGCGGCAA AGGGTGAAAT CCGCATTGAT GGCCTCAATG TGGCAGACAT 4140 

CGGCCTCCAT GACCTGCGCT CTCAGCTGAC CATCATCCCG CAGGACCCCA TCCTGTTCTC 4200 

GGGGACCCTG CGCATGAACC TGGACCCCTT CGGCAGCTAC TCAGAGGAGG ACATTTGGTG 4260 



3 AGAATCTCAG CGTGGGCCAG AGGCAGCTCG T 
CCGAGCCCTG CTCCGCAAGA GCCGCATCCT GGTTTTAGAC GAGGCCACAG CTGCCATCGA
CCTGGAGACT GACAACCTCA TCCAGGCTAC CATCCGCACC CAGTTTGATA CCTGCACTGT 
CCTGACCATC Gl 
CAAAGGAGTA G 

3 GCCAGAGATG CTGGACTTGC CTAAAATATA T 




TAGCAAACAC TGGGGGCACC TTAAGATTTT GCACCTGTAA AGTGCCTTAC AGGGTAACTG 
TGCTGAATGC TTTAGATGAG GAAATGATCC CCAAGTGGTG A 
AGCTAGTTTG AGCCAGTTAG ACTAGTCCCC CGGTCTCCCG A 

TGCACACTGC ACTGTTTTCA AATAACGATT TTATGAAATG ACCTCTGTCC TCCCTCTGAT 

TTTTCATATT TTCCTAAAGT TTCGTTTCTG TTTTTTAATA AAAAGCTTTT TCCTCCTGGA 

ACAGAAGACA GCTGCTGGGT CAGGCCACCC CTAGGAACTC AGTCCTGTAC TCTGGGGTGC 

TGCCTGAATC CATTAAAAAT GGGAGTACTG ATGAAATAAA ACTACATGGT CAACAGTAAA 
CAGGCGGAGG ACTTCGGCGC GGCTCCTCCT GGGTGTGACC CCGGGCGCGC CCGCCGCGCG 120

ACGATGAGGG CGCGGCCGCA GGTCTGCGAG GCGCTGCTCT TCGCCCTGGC GCTCCAGACC 180 

GGCGTGTGCT ATGGCATCAA GTGGCTGGCG CTGTCCAAGA CACCATCGGC CCTGGCACTG 240 

AACCAGACGC AACACTGCAA GCAGCTGGAG GGTCTGGTGT CTGCACAGGT GCAGCTGTGC 300 

CGCAGCAACC TGGAGCTCAT GCACACGGTG GTGCACGCCG CCCGCGAGGT CATGAAGGCC 360 

TGTCGCCGGG CCTTTGCCGA CATGCGCTGG AACTGCTCCT CCATTGAGCT CGCCCCCAAC 420 

TATTTGCTTG ACCTGGAGAG AGGGACCCGG GAGTCGGCCT TCGTGTATGC GCTGTCGGCC 480 

GCCGCCATCA GCCACGCCAT CGCCCGGGCC TGCACCTCCG GCGACCTGCC CGGCTGCTCC 540 

TGCGGCCCCG TCCCAGGTGA GCCACCCGGG CCCGGGAACC GCTGGGGAGG ATGTGCGGAC 600

AACCTCAGCT ACGGGCTCCT CATGGGGGCC AAGTTTTCCG ATGCTCCTAT GAAGGTGAAA 660

AAAACAGGAT CCCAAGCCAA TAAACTGATG CGTCTACACA ACAGTGAAGT GGGGAGACAG 720 

GCTCTGCGCG CCTCTCTGGA AATGAAGTGT AAGTGCCATG GGGTGTCTGG CTCCTGCTCC 780 

ATCCGCACCT GCTGGAAGGG GCTGCAGGAG CTGCAGGATG TGGCTGCTGA CCTCAAGACC 840 

CGATACCTGT CGGCCACCAA GGTAGTGCAC CGACCCATGG GCACCCGCAA GCACCTGGTG 900 

CCCAAGGACC TGGATATCCG GCCTGTGAAG GACTCGGAAC TCGTCTATCT GCAGAGCTCA 960 

CCTGACTTCT GCATGAAGAA TGAGAAGGTG GGCTCCCACG GGACACAAGA CAGGCAGTGC 1020 

AACAAGACAT CCAACGGAAG CGACAGCTGC GACCTTATGT GCTGCGGGCG TGGCTACAAC 1080 

CCCTACACAG ACCGCGTGGT CGAGCGGTGC CACTGTAAGT ACCACTGGTG CTGCTACGTC 1140 

ACCTGCCGCA GGTGTGAGCG TACCGTGGAG CGCTATGTCT GCAAGTGAGG CCCTGCCCTC 1200 

CGCCCCACGC AGGAGCGAGG ACTCTGCTCA AGGACCCTCA GCAACTGGGG CCAGGGGCCT 1260

GGAGACACTC CATGGAGCTC TGCTTGTGAA TTCCAGATGC CAGGCATGGG AGGCGGCTTG 1320 

1380 
1440 
1500 



TAGCTTGGGT TGTAAATGTT TTCTTTTGTT TTTTGCTTTT TCTTCCTTTG GGATGTGGAA 
3 TGGCACTTCT CAATTCCTCT 
ft ATGATCTCTA TTTTAAAACT 
k GCAG CTGT AT GAAATAAATG CTGAGTGAGC CCCAGCCCGC CCCTGCAGTT 
C TCGGCAGACC CTGGGGCTGG CAGAGGGAGC TCTCCAGTTT 



ACAGCATGGA GTGGGGTTAC CTGTTGGAAG TGACCTCGCT GCTGGCCGCC TTGGCGCTGC 
TGCAGCGCTC TAGCGGCGCT GCGGCCGCCT CGGCCAAGGA GCTGGCATGC CAAGAGATCA 
CCGTGCCGCT GTGTAAGGGC ATCGGCTACA ACTACACCTA CATGCCCAAT CAGTTCAACC 



AGTGCTCGCC CGATCTCAAG TTCTTCCTGT GCAGCATGTA CACGCCCATC TGCCTAGAGG 
ACTACAAGAA GCCGCTGCCG CCCTGCCGCT CGGTGTGCGA GCGCGCCAAG GCCGGCTGCG 
CGCCGCTCAT GCGCCAGTAC GGCTTCGCCT GGCCCGACCG CATGCGCTGC GACCGGCTGC 
CCGAGCAAGG CAACCCTGAC ACGCTGTGCA TGGACTACAA CCGCACCGAC CTAACCACCG 
CCGCGCCCAG CCCGCCGCGC CGCCTGCCGC CGCCGCCGCC CGGCGAGCAG CCGCCTTCGG 



TCTGGATCGG CCTGTGGTCG GTGCTCTGCT TCGTGTCCAC CTTCGCCACC GTCTCCACCT 
TCCTTATCGA CATGGAGCGC TTCAAGTACC CGGAGCGGCC CATTATCTTC CTCTCGGCCT 
GCTACCTCTT CGTGTCGGTG GGCTACCTAG TGCGCCTGGT GGCGGGCCAC GAGAAGGTGG 



TGGGCGCGGT GGAGCAGCAC GTGCGCTACG AGACCACCGG CCCCGCGCTG TGCACCGTGG 1200 

TCTTCTTGCT GGTCTACTTC TTCGGCATGG CCAGCTCCAT CTGGTGGGTG ATCTTGTCGC 1260 

TCACATGGTT CCTGGCGGCC GGTATGAAGT GGGGCAACGA AGCCATCGCC GGCTACTCGC 1320 

AGTACTTCCA CCTGGCCGCG TGGCTTGTGC CCAGCGTCAA GTCCATCGCG GTGCTGGCGC 1380

TCAGCTCGGT GGACGGCGAC CCGGTGGCGG GCATCTGCTA CGTGGGCAAC CAGAGCCTGG 1440 

ACAACCTGCG CGGCTTCGTG CTGGCGCCGC TGGTCATCTA CCTCTTCATC GGCACCATGT 1500 

TCCTGCTGGC CGGCTTCGTG TCCCTGTTCC GCATCCGCTC GGTCATCAAG CAACAGGACG 1560 



TCTACACCGT GCCCGCCGCG GTGGTGGTCG CCTGCCTCTT CTACGAGCAG CACAACCGCC 1680 

CGCGCTGGGA GGCCACGCAC AACTGCCCGT GCCTGCGGGA CCTGCAGCCC GACCAGGCAC 1740 

GCAGGCCCGA CTACGCCGTC TTCATGCTCA AGTACTTCAT GTGCCTAGTG CTGGCCATCA 1800

CCTCGGGCGT GTGGGTCTGG TCCGGCAAGA CGCTGGAGTC CTGGCGCTCC CTGTCCACCC 1860

GCTGCTGCTG GGCCAGCAAG GGCGCCGCGG TGGGCGGGGC- CGCGGGCGCC ACGGCCGCGG 1920 

GGGGTGGCGG CGGGCCGGGG GGCGGCGGCG GCGGGGGACC CGGCGGCGGC GGGGGGCCGG 1980 

GCGGCGGCGG GGGCTCCCTC TACAGCGACG TCAGCACTGG CCTGACGTGG CGGTCGGGCA 2040 

CGGCGAGCTC CGTGTCTTAT CCAA&GCAGA TGCCATTGTC CCAGGTCTGA GCGGAGGGGA 2100 

GGGGGCGCCC AGGAGGGGTG GGGAGGGGGG CGAGGAGACC CAAGTGCAGC GAAGGGACAC 2160

TTGATGGGCT GAGGTTCCCA CCCCTTCACA GTGTTGATIG CTATTAGCAT GATAATGAAC 2220 

TCTTAATGGT ATCCATTAGC TGGGACTTAA ATGACTCACT TAGAACAAAG TACCTGGCAT 2280 

TGAAGCCTCC CAGACCCAGC CCCTTTTCCT CCATTGATCT GCGGGGAGCT CCTCCCGCCA 2340
CGCGTTAATT TCTGTTGGCT GAGGAGGGTG GACTCTGCGG CGTTTCCAGA ACCCGAGATT
TGGAGCCCTC CCTGGCTGCA CTTGGCTGGG TTTGCAGTCA GATACACAGA TTTCACCTGG 
GAGAACCTCT TTTTCTCCCT CGACTCTTCC TACGTAAACT CCCACCCCTG ACTTACCCTG 
GAGGAGGGGT GACGGCCACC TGATGGGATT GCACGGTITG GGTATTCTTA ATGACCAGGC 
AAATGCCTTA AGTAAACAAA CAAGAAATGT CTTAATTATA CACCCCACGT AAATACGGGT 
TTCTTACATT AGAGGATGTA TTTATATAAT TATTTGTTAA ATTGTAAAAA AAAAAAGTGT 
AAAATATGTA TATATCCAAA GATATAGTGT GTACATTTTT TTGTAAAAAG TTTAGAGGCT 
TACCCCTGTA AGAACAGATA TAAGTATTCT ATTTTGTCAA TAAAATGACT TTTGATAAAT 
GATTTAACCA TTGCCCTCTC CCCCGCCTCT TCTGAGCIGT CACCTTTAAA GTGCTTGCTA 
AGGACGCATG GGGAAAATGG ACATTTTCTG GCTTGTCATT CTGTACACTG ACCTTAGGCA 
TGGAGAAAAT TACTTGTTAA ACTCTAGTTC TTAAGTTGTT AGCCAAGTAA ATATCATTGT 
' ' ' " — TTTTGCAC CTTCCCCAAA GACGGTGTTT TTCATGGGAG 

CAACTCTC ACTTTAGTGG ATGTAAATGG AACTTCIC-CA 
AGGCAGTAAT TCCCCTTAGG CCTTGTTATT TATCCTGCAT GGTATCACTA AAGGTTTCAA 

Seq ID NO: C182 DNA Sequence 
Nucleic Acid Accession XM_050625 
Coding sequence: 222..1109

1 11 21 31 41 51

CCGGGTCGGA GCCCCCCGGA GCTGCGCGCG GGCTTGCAGC GCCTCGCCCG CGCTGTCCTC 

CCGGTGTCCC GCTTCTCCGC GCCCCAGCCG CCGGCTGCCA GCTTTTCGGG GCCCCGAGTC 

GCACCCAGCG AAGAGAGCGG GCCCGGGACA AGCTCGAACT CCGGCCGCCT CGCCCTTCCC 

CGGCTCCGCT CCCTCTGCCC CCTCGGGGTC GCGCGCCCAC GATGCTGCAG GGCCCTGGCT 

CGCTGCTGCT GCTCTTCCTC GCCTCGCACT GCTGCCTGGG CTCGGCGCGC GGGCTCTTCC 

TCTTTGGCCA GCCCGACTTC TCCTACAAGC GCAGCAATTG CAAGCCCATC CCTGCCAACC 

TGCAGCTGTG CCACGGCATC GAATACCAGA ACATGGGGCT GCCCAACCTG CTGGGCCACG 

AGACCATGAA GGAGGTGCTG GAGCAGGCCG GCGCTTGGAT CCCGCTGGTC ATGAAGCAGT 

GCCACCCGGA CACCAAGAAG TTCCTGTGCT CGCTCTTCGC CCCCGTCTGC CTCGATGACC 

TAGACGAGAC CATCCAGCCA TGCCACTCGC TCTGCGTGCA GGTGAAGGAC CGCTGCGCCC 

CGGTCATGTC CGCCTTCGGC TTCCCCTGGC CCGACATGCT TGAGTGCGAC CGTTTCCCCC 
AGGACAACGA CCTTTGCATC CCCCTCGCTA GCAGCGACCA CCTCCTGCCA GCCACCGAGG 

AAGCTCCAAA GGTATGTGAA GCCTGCAAAA ATAAAAATGA TGATGACAAC GACATAATGG 

AAACGCTTTG TAAAAATGAT TTTGCACTGA AAATAAAAGT GAAGGAGATA ACCTACATCA 

ACCGAGATAC CAAAATCATC CTGGAGACCA AGAGCAAGAC CATTTACAAG CTGAACGGTG 

TGTCCGAAAG GGACCTGAAG AAATCGGTGC TGTGGCTCAA AGACAGCTTG CAGTGCACCT 

1 GAGAT GAACGACATC AACGCGCCCT ATCTGGTCAT GGGACAGAAA CAGGGTGGGG 
AGCTGGTGAT CACCTCGGTG AAGCGGTGGC AGAAGGGGCA GAGAGAGTTC 

CCCGCAGCAT CCGCAAGCTG CAGTGCTAGT CCCGGCATCC TGATGGCTCC 

CTCCAGAGCA CGGCTGACCA TTTCTGCTCC GGGATCTCAG CTCCCGTTCC CCAAGCACAC 

TCCTAGCTGC TCCAGTCTCA GCCTGGGCAG CTTCCCCCTG CCTTTTGCAC GTTTGCATCC 

CCAGCATTTC CTGAGTTATA AGGCCACAGG AGTGGATAGC TGTTTTCACC TAAAGGAAAA 1320

GCCCACCCGA ATCTTGTAGA AATATTCAAA CTAATAAAAT CATGAATATT TTTATGAAGT 1380
TT 1382 



1 11 21 31 41 51

AATTCGGCAC GAGGGCAGGT GCAGGCGCAC GCGGCGAGAG CGTATGGAGC CGAGCCGTTA 60

GCGCGCGCCG TCGGTGAGTC AGTCCGTCCG TCCGTCCGTC CGTCGGGGCG CCGCAGCTCC 120 

CGCCAGGCCC AGCGGCCCCG GCCCCTCGTC TCCCCGCACC CGGAGCCACC CGGTGGAGCG 180 

GGCCTTGCCG CGGCAGCCAT GTCCATGGGC CTGGAGATCA CGGGCACCGC GCTGGCCGTG 240

CTGGGCTGGC TGGGCACCAT CGTGTGCTGC GCGTTGCCCA TGTGGCGCGT GTCGGCCTTC 300 

ATCGGCAGCA ACATCATCAC GTCGCAGAAC ATCTGGGAGG GCCTGTGGAT GAACTGCGTG 360

GTGCAGAGCA CCGGCCAGAT GCAGTGCAAG GTGTACGACT CGCTGCTGGC ACTGCCACAG 420 

GACCTTCAGG CGGCCCGCGC CCTCATCGTG GTGGCCATCC TGCTGGCCGC CTTCGGGCTG 480 

CTAGTGGCGC TGGTGGGCGC CCAGTGCACC AACTGCGTGC AGGACGACAC GGCCAAGGCC 540 

AAGATCACCA TCGTGGCAGG CGTGCTGTTC CTTCTCGCCG CCCTGCTCAC CCTCGTGCCG 600 

GTGTCCTGGT CGGCCAACAC CATTATCCGG GACTTCTACA ACCCCGTGGT GCCCGAGGCG 660 

CAGAAGCGCG AGATGGGCGC GGGCCTGTAC GTGGGCTGGG CGGCCGCGGC GCTGCAGCTG 720 

CTGGGGGGCG CGCTGCTCTG CTGCTCGTGT CCCCCACGCG AGAAGAAGTA CACGGCCACC 780 

AAGGTCGTCT ACTCCGCGCC GCGCTCCACC GGCCCGGGAG CCAGCCTGGG CACAGGCTAC 840 

GACCGCAAGG ACTACGTCTA AGGGACAGAC GCAGGGAGAC CCCACCACCA CCACCAGCAC 900 

CAACACCACC ACCACCACCG CGAGCTGGAG CGCGCACCAG GCCATCCAGC GTGCAGCCTT 960 

GCCTCGGAGG CCAGCCCACC CCCAGAAGCC AGGAAGCCCC CGCGCTGGAC TGGGGCAGCT 1020

TCCCCAGCAG CCACGGCTTT GCGGGCCGGG CAGTCGACTT CGGGGCCCAG GGACCAACCT 1080 

GCATGGACTG TGAAACCTCA CCCTTCTGGA GCACGGGGCC TGGGTGACCG CCAATACTTG 1140 
ACCACCCCGT CGAGCCCCAT CGGGCCGCTG CCCCCATGTC GCGCTGGGCA G 
GCCCTGGAAG GGGCACTTGA TATTTTTCAA TAAAAGCCTC TCGTTTTAGC 



Seq ID NO: C184 DNA Sequence 
Nucleic Acid Accession #: NM_0
Coding s 



CCGAGACTCA CGGTCAAGCT AAGGCGAAGA GTGGGTGGCT GAAGCCATAC TATTTTATAG 
AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 
GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 
AAAGACCTGT GCTTTTGCAT TTGCACCAAA CAGCCCATGC TGATGAATTT GACTGCCCTT
CAGAACTTCA GCACACACAG GAACTCTTTC CACAGTGGCA CTTGCCAATT AAAATAGCTG 
CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAG GGAAGTAATT CACCCTTTAG 
CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC AAAGTCTTGC 
CAATGGTTTC CATCACTCTC TTGGCATTGG TTTACCTGCC AGGTGTGATA GCAGCAATTG 
TCCAACTTCA TAATGGAACC AAGTATAAGA AGTTTCCACA TTGGTTGGAT AAGTGGATGT 
TAACAAGAAA GCAGTTTGGG CTTCTCAGTT TCTTTTTTGC TGTACTGCAT GCAATTTATA 
GTCTGTCTTA CCCAATGAGG CGATCCTACA GATACAAGTT GCTAAACTGG GCATATCAAC
AGGTCCAACA AAATAAAGAA GATGCCTGGA TTGAGCATGA TGTTTGGAGA ATGGAGATTT 
ATGTGTCTCT GGGAATTGTG GGATTGGCAA TACTGGCTCT GTTGGCTGTG ACATCTATTC 
CATCTGTGAG TGACTCTTTG ACATGGAGAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 
TTGTTTCCCT TCTACTGGGC ACAATACACG CATTGATTTT TGCCTGGAAT AAGTGGATAG 
ATATAAAACA ATTTGTATGG TATACACCTC CAACTTTTAT GATAGCTGTT TTCCTTCCAA 
TTGTTGTCCT GATATTTAAA AGCATACTAT TCCTGCCATG CTTGAGGAAG AAGATACTGA 
AGATTAGACA TGGTTGGGAA GACGTCACCA AAATTAACAA AACTGAGATA TGTTCCCAGT 
TGTAGAATTA CTGTTTACAC ACATTTTTGT TCAATATTGA TATATTTTAT CACCAACATT 
TCAAGTTTGT ATTTGTTAAT AAAATGATTA TTCAAGGAAA A

Seq ID NO: C185

Nucleic Acid Accession #: NM_001775.1



CG GGGACAAACC CTGCTGCCGG 
CTCTCTAGGA GAGCCCAACT CTGTCTTGGC GTCAGTATCC TGGTCCTGAT CCTCGTCGTG 
GTGCTCGCGG TGGTCGTCCC GAGGTGGCGC CAGACGTGGA GCGGTCCGGG CACCACCAAG 
CGCTTTCCCG AGACCGTCCT GGCGCGATGC GTCAAGT 
AGACATGTAG ACTGCCAAAG TGTATGGGAT GCTTTCAAGG G 
CCTTGCAACA TTACTGAAGA AGACTATCAG CCACTAATGA AGTTGGGAAC T 
CCTTGCAACA AGATTCTTCT TTGGAGCAGA ATAAAAG 
GTCCAGCGGG ACATGTTCAC CCTGGAGGAC ACGCTGCTAG GCTACCTTGC TGATGACCTC 
ACATGGTGTG GTGAATTCAA CACTTCCAAA ATAAACTATC AATCTTGCCC AGACTGGAGA 
AAGGACTGCA GCAACAACCC TGTTTCAGTA TTCTGGAAAA CGGTTTCCCG CAGGTTTGCA 
GAAGCTGCCT GTGATGTGGT CCATGTGATG CTCAATGGAT CCCGCAGTAA AATCTTTGAC 
AAAAACAGCA CTTTTGGGAG TGTGGAAGTC CATAATTTGC AACCAGACAA GGTTCAGACA 



ATCTACAGAC CTGACAAGTT TCTTCAGTGT GTGAAAAATC CTGAGGATTC A 
TCTGAGATCT GAGCCAGTCG CTGTGGTTGT TTTAGCTCCT TGACTCCTTG TGGTTTATGT 
CATCATACAT GACTCAGCAT ACCTGCTGGT GCAGAGCTGA AGATTTTGGA GGGTCCTCCA 
CAATAAGGTC AATGCCAGAG ACGGAAGCCT TTTTCCCCAA AGTCTTAAAA TAACTTATAT 
CATCAGCATA CCTTTATTGT GATCTATCAA TAGTCAAGAA AAATTATTGT ATAAGATTAG 
AATGAAAATT GTATGTTAAG TTACTTCCTT TAG 

Seq ID NO: C186 DNA Sequence



I I I I I I 

ATGGTGTCAT GCACGTTCTC GGGGCCCCTA CGGGAAACAA ATGAAAACGT GAAAAAGTTC 
TACGCCTTGC GAGCTTTTAT GTTCCGCATG AGCTCAGAGG CCGCGATGCI CGGGGAAAGC 
AGGACCCCAA AGCCCCGTAA ACACCGCGCG ACCACCCGGG CCAAGATCTT CAAGAGGTTC 
TTTTCAGAAG GATCGGAGAG CAATTCCCGA TTGGTAGAAG AACTTGCTGT AATACACACG 
TACTCTGACG ACCCCGCCCC AACGACTAGC CCCTCCTCTG TGCAACCCCG AGAGTTTGGG 
GTCATGCAGG GGGCGCCACG AGCTCGTTTC GGAAGCCGGA CCCCGCC 



ACGGCACAGA CACACACCCT CCCACACGCG CGCACCAGGG CAGACCCGGC GGGCAGGCGG 
CGGAGGCACC CTCGGAGCCC GGCGCCCGGC GGGGAGGGGA CGTGCTCCGA GGGACCGGCC 
CCGAGGCGCC GGATGGAGGA AGAGATGCAG CCGGCAGAGG AGGGGCCCAG CGTCCCCAAA 
ATCTACAAGC AGCGCAGCCC CTACAGCGTC CTCAAGACGT TCCCCAGCAA GAGACCGGCG 
CTGGCCAAGC GCTACGAGCG ACCCACCCTG GTGGAGCTGC CGCACGGCCA CTTGAGGACT 
CCGGCGCAGC CGCCGCCCGC GTCCCCCGCC GCCTCCICGT CGTCTTCGTT CGCCGCTGTC 
GTCAGACTCG GGGCTCCTCC GCGTCCGCCA CGCCGTGGAT TTCGGGCGCG GGGAACCATC 
CCGCCCCTCC TTCCTGCCCC GGGAGTCGCA GGCACTCTGC TCCCACCGCC CACGTCGTCG 
TCCCCGCCAT CCCCTCGTCC CCGTCCGTGG CACGCTGCCG CGCCGCGGGG AGGGACCTCA 
CATACACATA TGTGGAGGTC CCAGTCCACA CTTCCAGGAT CTGACACCAT GGTCTCTGTC
TTTGGATTGA TGGCTCAGAG AAGATGGCAG CATAGATCTT TAAAGCAGTT TGAGTGGGGA 
ATTCTTGGAT CTTGGGGTAC TTGGCCATGT GGACAGGATT GGCTGGAGAA GGAGGGTCAG 
GTGGCGGTCC TGCTGCCAAG GTCTGAGGGT AATACTGCTC CTAAGAAGAG TCGAATGATC 
TTGGATGCCT TTGCCCAGCA GTGCAGTCGA GTTCTTAGCC TCTTAAATTG TGGAGGAAAA 
CTCCTGGACT CCAACCATTC TCAGTCCATG ATTTCTTGCG TAAAGCAGGA AGGCTCAAGT 
TACAACGAAA GACAGGAGCA CTGTCACATT GGGAAAGGGG TCCACAGTCA GACCTCAGAC 
AATGTAGACA TAGAGATGCA GTATATGCAA AGGAAACAAC AAACTTCTGC CTTTTTGAGG 
G ACTCTCTACA AAATTACCTG CTCTCGGGAA GCTTTCCAAC TCCAAACCCC 



CATACATTAG AAAATATTTC A 
TCCAGAGAGC CCCCAGTCAA G 

GGTGCTTCAA GACCATTTTC AGGGGCACAG CAAAGCATCG C 
CTTGAGGATG GCATCCGCAG CCCCGTCCCT TTGAGTTGTC AGGCCTTGGA AATGGATTTG
ACCTCCTTGG GAAGCAAGCA GCTGTTGAAC AACTATCCTG T 
TGGGATGAGG CTGTAAATTC TTCAAAGAAA GATGGGAGAC GGCTCCTTCG A

AGATTTGTTT TCACAACCGA TGAGCTTAAG TACTCATGCG CCCTTGGGAA AAGGAAAAGG 2040

TCAGTGCAGT CAGGAGAGAC AGGTCCCGAA AGACGCCCTC TGGATCCAGT TAAAGTAACA 2100

TGCCTCCGAG GTACTGCATC CTTCCGCTCA GTGTCACCAT CTGTCATCTC ATTTCACCGC 2160

ATTGGCTGTG GCTCTCCCCG TACAAGTGTT CAGCCTTCTG TATTTTGA 2208 






I I I I I I 

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 
GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 
CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 
GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 
CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 
T CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 
T GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 
T GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 
r GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 
h TTGACCGAGA TTCAGGAACC TGCCACT 
CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT G 

C AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 
T ACAACTTCTC CGCTTTGGCA AACACCGTCA CTCTTGCTGG 



TGGAAACCAG GGTAGGAAAA TGTCTGTGTG C 

T ACAAGGCCGG GGTTTCCTCA CAGCCTGTCA GCCTTGCTGA 
TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 
TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 
GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 
AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 
CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 
CCATGCTATC GTCAGCAGCT GTGTGGCTGG G 

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA T 
CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 
CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 
GCTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 
GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 
GATCAAATCA TTTACCTCCA AGCAGCCAGC T 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC A 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860
GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 
ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 
CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 
GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100
TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 
TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 
AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280
GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 
GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCCGG GCGCGGTGGC 2400 
TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCGGG CAGATCATGA GGTCGGGAGA 2460 
TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 
CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2580
GCGTGGGCCT GGAAGGGGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640 
GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 
AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760
AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 
CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880
TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 
CTCAAGTGAT CTGTCCGCCT CGGCCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 
I TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 
= TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 
TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 
GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 
GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 
TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAAOGTCTA GGCTGGCCAA 3480 



A CAAGATTTTC A 
AATCCCAAGG CAGAATGGTG G 
TGAGACCCTG TCTCTAP 
CTCACTATTA AGCCCCTATC TTTCTCT 
AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 
GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 
TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 
CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 
G TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT G 



TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGACCCAGCC TGTGGTCCCT
GGCCATCAAC AGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC
TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA
TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAGCTCCA GCCTAAAGTC 4380

TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 4500 

TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560 

ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620 

GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680 

ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860 

CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATCCT TCTGCTTCAG 4920 

CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAGAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040 

ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100 

CTACGIGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160 

TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

G GAACACGCAG TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280 

T GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT CGAATTTTTT TTCATCAGGG 5460 

5520 
5567 

Seq ID NO: C188 DNA Sequence 

Nucleic Acid Accession #= NM_014324.1 

Coding sequence: 89.. 1237 

1 11 21 31 41 51 

GGCGCCGGGA TTGGGAGGGC TTCTTGCAGG CTGCTGGGCT GGGGCTAAGG GCTGCTCAGT 60 

TTCCTTCAGC GGGGCACTGG GAAGCGCCAT GGCACTGCAG GGCATCTCGG TCGTGGAGCT 120 

GTCCGGCCTG GCCCCGGGCC GTNTCTGTGC TATGGTCCTG GCTGACTTCG GGGCGCGTGT 180 

GGTACGCGTG GACCGGCCCG GCTCCCGCTA CGACGTGAGC CGCTTGGGCC GGGGCAAGCG 240 

CTCGCTAGTG CTGGACCTGA AGCAGCCGCG GGAGCCGCGT GCTGCGGCGT CTGTGCAAGC 300

GGTCGGATGT GCTGCTGGAG CCCTTCCGCC GCGGTGTCAT GGAGAAACTC CAGC1GGGCC 360 
CAGAGATTCT GCAGCGGGAA AATCCAAGGC TTATTTATGC CAGGCTC 
AGTTCAGGAA AGCTTCTGCC GGTTAGCTGG CCACGATATC AACTATTTGG C 
TGTTCTCTCA AAAATTGGCA GAAGTGGTGA GAATCCGTAT GCCCCGCTGA ATCTCGTGGC 
TGACTTTGCT GGTGGTGGCC TTATGTGTGC ACTGGGCATT ATAATGGCTC TTTTTGACCG 
CACACGCACT GACAAGGGTC AGGTCATTGA TGCAAATATG GTGGAAGGAA CAGCATATTT 
AAGTTCTTTT CTGTGGAAAA CTCAGAAATC GAGTCTGTGG GAAGCACCTC GAGGACAGAA 
CATGTTGGAT GGTGGAGCAC CTTTCTATAC GACTTACAGG ACAGCAGATG GGGAATTCAT 
GGCTGTTGGA GCAATAGAAC CCCAGTTCTA CGAGCTGCTG ATCAAAGGAC TTGGACTAAA 
GTCTGATGAA CTTCCCAATC AGATGAGCAC GGATGATTGG CCAGAAATGA AGAAGAAGTT 
TGCAGATGTA TTTGCAAAGA AGACGAAGGC AGAGTGGTGT CAAATCTTTG A 
TGCCTGTGTG ACTCCGGTTC TGACTTTTGA GGAGGTTGTT CATCATt 

ACGGGGCTCG TTTATCACCA GTGAGGAGCA GGACGTGAGC CCCCGCCTTG CACCTCTGCT 1080

GTTAAACACC CCAGCCATCC CTTCTTCCAA AGGGGATCCT TTCATAGGAG AACACACTGA 1140 

GGAGATACTT GAAGAATTTG GATTCAGCCG AGAAGAGATT TATCAGCTTA ACICAGATAA 1200 

AATCATTGAA AGTAATAAGG TAAAAGCTAG TCTCTAACTT CCAGGCCCAC GGCTCAAGTG 1260

AATTTGAATA CTGCATTTAC AGTGTAGAGT AACACATAAC ATTGTATGCA TGGAAACATG 1320 

GAGGAACAGT ATTACAGTGT CCTACCACTC TAATCAAGAA AAGAATTACA GACTCTGATT 1380 

CTACAGTGAT GATTGAATTC TAAAAATGGT TATCATTAGG GCTTTTGATT TATAAAACTT 1440 

TGGGTACTTA TACTAAATTA TGGTAGTTAT TCTGCCTTCC AGTTTGCTTG ATATATTTGT 1500 
TGATATTAAG ATTCTTGACT TATATTTTGA ATGGGTTCTA GTGAAAAAGG AATGATATAT 1560 
TCTTGAAGAC ATCGATATAC ATTTATTTAC ACTCTTGATT CTACAATGTA GAAAATGAGG 1620
AAATGCCACA AATTGTATGG TGATAAAAGT CACGTGAAAC AGAGTGATTG GTTGCATCCA 1680 
GGCCTTTTGT CTTGGTGTTC ATGATCTCCC TCTAAGCACA TTCCAAACTT TAGCAACAGT 1740 
TATCACACTT TGTAATTTGC AAAGAAAAGT TTCACCTGTA TTGAATCAGA ATGCCTTCAA 1800
CTGAAAAAAA CATATCCAAA ATAATGAGGA AATGTGTTGG CTCACTACGT AGAGTCCAGA 1860 
GGGACAGTCA GTTTTAGGGT TGCCTGTATC CAGTAACTCG GGGCCTGTTT CCCCGTGGGT 1920 
CTCTGGGCTG TCAGCTTTCC TTTCTCCATG TGTTTGATTT CTCCTCAGGC TGGTAGCAAG 1980 
C ACACAGCAAC ATCCAGAAAT AAAGATCTCA GGACCCCCCA 2040 




ATG CAAAGGT G 

CCACAAACAG ACATCAGTAT CAGTCCAGCC C 

C TGCTATCGAT ' 

3 TTCTCTGTGG GCAGGAAGTC CGCCAAGCTG 
C ACAGCTGGGA GGAGAATGTC TTTGTATATG ACCTCGTGAA CAGCTGTGTT 
CCCGGCATCC CAGCTGACGT CTGGACAGGC CTTCATGATC ACAGACAGGA AGGGCAGTTT 
GAATGGACTG ATGGCTCATC CTATGACTAC AGCTACTGGG ATGGCAGCCA GCCAGATGAT 
GGCGTCCACG CGGACCCAGA AGAAGAGGAC TGCGTGCAGA TATGGTACAG GCCTACCAGT 
GAGCAGCTAC AGGCCCCAGA GCCCCAGTTA CCCTTATCAA TCTCAGAGGC CACAGATGTC 
TATCTCCCTG AGGATTTCCC AGCTGAGCCC AAGCTCATGG ACCAGTCCTG GGTGTCCAGG 
AAGAGCCTGA AACCATCCAA GAGTCATCTT ATGGAGCCAC CCACTCCAGT GGCCAAGCAC 
CAAAAGGCAA AGACCCGACA TAGGAGCCTG CGCGGCGTCT GGTGGCCATC AGGTAAGGCT 
GGGTCATGGA AAGAAAGAAT GAATGCAGAC TACGGGCGAA GAAAGCGATC GGCCCCGAGG 
CAGGAAGGCC GGCTCCGGTG CAGGGAGCGC CGCCTGCGGG CTGCTTCGGG CCAGGGTCGA
CCCGAGGGCC AGCGCAAGCA GCGGCAACAG GAGCGCCAGG



CAGCGGGGTG CGGCGCCGGA GTGCGGGGAG AACCACCAGG CGCCGGAATT GGGGAGCACG 
TGGAGGGGGC AGCGGCTCCA GCCCCAGACC GCCGCGCTCT GTCACTTTGC ATTAAGAAAG 
CTTCCGGGGA ATGCACACGG CCTGGCCGCC GCCTTCGTGC AGCCCGCCCT GCAGGTGCAG

GAAGAAAAGA ATAATCGCAC CCGTTTCTCA GGTGCTTACT TCACCATGTC CGATCCGACG 
TGTGACCAAG ATAGCAAGGA GCAGTCTTTA AGGCGACACG GCAGAGAGGC AGAAAAAGAT 
GGGCCTTACC GGTTAGTTAA GAAAAAAAGA GGACCTGTTG CCTGTCCCTC TAGCTTTGAA 
CTACAAAGTG GAGGGGAAGT TTGTCTGGAT TTTCCTGTAG AACTGAGGGC AGGGACCTGG 
ATTGCTCGAG AACCTCCATA A

Seq ID NO: C190 DNA Sequence
Nucleic Acid Accession #: XM_054869.2
Coding sequence: 26.. 2902

I I I I I I

TAGACGCGGA GCCCAAGGAG GTAAAATGCA CACTTGCTGC CCCCCAGTAA CTTTGGAACA 
GGACCTTCAC AGAAAAATGC ATAGCTGGAT GCTGCAGACT CTAGCGTTTG CTGTAACATC
TCTCGTCCTT TCGTGTGCAG AAACCATCGA TTATTATGGG GAAATCTGTG ACAATGCATG
TCCTTGTGAG GAAAAGGACG GCATTTTAAC TGTGAGCTGT G 
TCTCTCTGAA ATTAGCCCTC CCCGTTTCCC AATCTAC 

TTTGAACCGT CTCTATCCCA ATGAGTTTGT CAATTACACT GGGGCTTCAA TTTTGCATCT 
AGGTAGCAAT GTTATCCAGG ACATTGAGAC CGGGGCTTTC CATGGGCTAC GGGGTTTGAG 
GAGATTGCAT CTAAACAATA ATAAACTGGA ACTTCTGCGA GATGATACCT TCCTTGGCTT
GGAGAACCTG GAGTACCTAC AGGTCGATTA CAACTACATC AGCGTCATTG A
TTTTGGGAAA CTGCATTTGT TGCAGGTGCT TATCCTCAAT GACAATCTTT T 
ACCCAACAAT CTTTTCCGTT TTGTGCCCTT AACGCACTTG GACCTCCGGG GGAACCGGCT 
GAAACTTCTG CCCTACGTGG GGCTCTTGCA GCACATGGAT AAAGTTGTGG AGCTACAGCT 
GGAGGAAAAC CCTTGGAATT GTTCTTGTGA GCTGATCTCT CTAAAGGATT GGTTGGACAG
CATCTCCTAT TCAGCCCTGG TGGGGGATGT AGTTTGTGAG ACCCCCTICC GCTTACACGG 
AAGGGACTTG GACGAGGTAT CCAAGCAGGA ACTTTGCCCA AGGAGACTTA TTTCTGACTA 
CGAGATGAGG CCGCAGACGC CTTTGAGCAC CACGGGGTAT TTACACACCA C 
AGTGAATTCT GTGGCCACTT CTTCCTCTGC TGTTTAl
GGGGACTCGC CAACCCAACA AGCCCAGGGT GCGCCCCACC T

CTTGGGCTAC AGCAACTATG GCCCCAGCAT CGCCTATCAG ACCAAATCCC C 
GGAGTGTCCC ACCGCGTGCT CTTGCAACCT GCAGATCTCT G 
CTGCCAGGAG CGAAAGATCG AGAGCATCGC TGAACTGCAG CCCAAGCCCT A 
GAAAATGTAT CTGACAGAGA ACTACATCGC TGTCGTGCGC AGGACAGACT TCCTGGAGGC 
3 GACCTCCTGC ACCTGGGGAA TAACCGCATC TCGATGATCC AGGACCGCGC 
T CTCACCAACC TGAGGCGCCT CTACCTI 
GAGCCCGGAG TTATTCTATG GCCTGCAGAG CCTGCAGTAT CTCTTCCTCC A 

CATCCGCGAG ATTCAGTCTG GAACTTTTGA CCCGGTCCCA AACCTCCAGC TGCTATTCTT 1560 
GAATAACAAC CTCCTGCAGG CCATGCCCTC AGGCGTCTTC TCTGGCTTGA CCCTCCTCAG 1620 
GCTAAACCTG AGGAGTAACC ACTTCACCTC CTTGCCAGTG AGTGGAGTTT TGGACCAGCT 1680 
GAAGTCACTC ATCCAAATCG ACCTGCATGA CAATCCTTGG GATTGTACCT GTGACATTGT 1740 
GGGCATGAAG CTGTGGGTGG AGCAGCTCAA AGTGGGCGTC CTAGTGGACG AGGTGATCTG 1800 
TAAGGCGCCC AAAAAATTCG CTGAGACCGA CATGCGCTCC ATTAAGTCGG AGCTGCTGTG 1860
CCCTGACTAT TCAGATGTAG TAGTTTCCAC GCCCACACCC TCCTCTATCC AGGTCCCTGC 1920
GAGGACCAGC GCCGTGACTC CTGCGGTCCG GTTGAATAGC ACCGGGGCCC CCGCGAGCTT 1980
GGGCGCAGGC GGAGGGGCGT CGTCGGTGCC CTTGTCTGTG TTAATTCTCA GCCTCCTGCT 2040 
GGTTTTCATC ATGTCCGTCT TCGTGGCCGC CGGGCTCTTC GTGCTGGTCA TGAAGCGCAG 2100 
GAAGAAGAAC CAGAGCGACC ACACCAGCAC CAACAACTCC GACGTGAGCT CCTTTAACAT 2160 
GCAGTACAGC GTGTACGGCG GCGGCGGCGG CACGGGCGGC CACCCACACG CGCACGTGCA 2220
TCACCGCGGG CCCGCGCTGC CCAAGGTGAA GACGCCCGCG GGCCACGTGT ATGAAIACAT 2280 
CCCCCACCCA CTGGGCCACA TGTGCAAAAA CCCCATCTAC CGCTCCCGAG AGGGCAACTC 2340 
CGTAGAGGAT TACAAAGACC TGCACGAGCT CAAGGTCACC TACAGCAGCA ACCACCACCT 2400 
GCAGCAGCAG CAGCAGCCGC CGCCGCCACC GCAGCAGCCA CAGCAGCAGC CCCCGCCGCA 2460
GCTGCAGCTG CAGCCCGGGG AGGAGGAGAG GCGGGAAAGC CACCACTTGC GGAGCCCCGC 2520 
CTACAGCGTC AGCACCATCG AGCCCCGGGA GGACCTGCTG TCGCCGGTGC AGGACGCCGA 2580 
CCGCTTTTAC AGGGGCATTT TAGAACCAGA CAAACACTGC TCCACCACCC CCGCCGGCAA 2640 
TAGCCTCCCG GAATATCCCA AATTCCCGTG CAGCCCCGCT GCTTACACTT TCTCCCCCAA 2700 
G AGACGCCCCC ATCAGTATTT GCACCCGGGG GCAGGGGACA GCAGGCTACG 2760
?ACAGCC CCCCGAGTGC TGTCTTTGTA GAACCCAACC GGAACGAATA 2820 

_._ JCAAAAC TAAACGTTGA GCCGGACTAC CTCGAAGTGC TGGAAAAACA 2880 

GACCACGTTT AGCCAGTTCT AAAAGCAAAG AAACICTCTT GGAGCTTTTG CATTTAAAAC 2940 

AGGGTGAAGT GCCTTGGCAC GGGATTTCTC AGCTTCGGTG GAAGATACGA AAAGGGTGTG 3060 
CAATTTCCTT TAAAATTTAC ACGTGGGAAA CATTTGTGTA AACTGGGCAC ATCACTTTCT 3120 
CTTCTTGCGT GTGGGGCAGG TGTGGAGAAG GGCTTTAAGG AGGCCAATTT GCTGCGCGGG 3180 

ATGGCAGAGC ATAGATTCTA CTCTTCCTCT TTTGCTTCCT CCCCCTCCCC CGCCCCTGCC 3300
CCACCTCTCT TTCTCCCCTT TTAAGCCATG GGTGGGTCTA ACTGGCTTTT GTGGAGAAAT 3360 
TAGCACACCC CAACTTTAAT AGGAAATTTG TTCTCTTTTT CCGCCCCTCI CCTTCTCTCC 3420 
TCCCCTCCCC TCCCTTCTCA TTCCTTTTCT TTGTT' 

CTGGACATTT GAATTAAAAA AAAAGTATTG TGATCCTGTA AAGGATCACC A 
ACAAATCATT AAAATTACAG AGCTATATGA TCCATAATTG ATTAGTCAAA ATAACTTATT 
GATGAAATAT ACAAATATTT TATTGTAGCA CCTATTTTTA TATGCACATT TAGCATTCCT 
CTTTCCTTCA CTATTTAGCC TATGATTTTG CAGAG 

TTTCTAAAAC TGACGTGGTA TCAGGAAGGC ATTTTCAATC ATTCAAAATG T 

GAAAACAGAG CTGATTGTAT CCCAATGTAT TTTAAAAAAT AGGGCAATTG ATTGGGCCAT 
TCCGAGAGAA TTGTTTGCAA GTTTTGGGTT TTATTAGAAA ATATTTGAAA GTATTTTTAT 
TAATGAACCA AAATGACATG TTCATTTGAC TACTATTGTA GCCGATTTTC GATTGTTTAA
CCAAACCCAG T



Seq ID NO: C191 DNA Sequence 

Nucleic Acid Accession #: NM_000793.2 

Coding sequence: 401.. 1222 

1 11 21 31 41 51 

I I I I I I 

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 
GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG
AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 
GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 
CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 
A AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 
C ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 
2 ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT T 
VTTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCC 
TCGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCIGCGTCT G 
CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCGCCCAATT C 
GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 



CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC G 
GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG C 
AGATGGCTGG GCGATACCGG GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACCAGAA 960
CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG C 
GTGCCGAGTT GTGGCTGACC GCATGGACAA T 

TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 
CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA
GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA G 
ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA A 
TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAATCTC TAACAGCTCC K 
CACTACTCAA ATGGCATTGG G 

GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC A' 
TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560
ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 1620
TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680
TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 
TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 1800 
GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 1860 
AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 
AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 
AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 
GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 
CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 2160 
TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 
TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 2280 
GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 
TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 2400 
ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 2460 
ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 2520 
TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580
ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT 2640 
TCCATTGACA TTTGAATTTG GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 2700
AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 
ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 
AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 2880 
ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 
CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 
AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060
CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG 3120 
A AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 
\ AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 
ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC 3300
TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 
ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 
CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 3480 
TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 3540 
AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTCGT T 
CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG A 

TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 
GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC C 
TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATCGCAA C 
TATGATTTCT TGTGATGTAT TCTTTTATGA A 

AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 
CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA G 
GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT A 
AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 
GGAGCTATCG GTTTAGC 
AGAAATTTTA GATATTATCA T
r AACTGTTOCT

ACTTTGTAT 



G ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA CCAGGAAGTG CTTATTCTCT 4620 

CATCACAGTG GGAAGAATAG CCTATTGTCT TTCATTTTGC CTGAGTGTAT TTTACTATTT 4680 

GGGCTCTGAA ATAAAAATTA TGAAATATGG TGAGGTCACA TGTTGGTGCT GCCTTGCTGC 4740 

ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GGCCTTTCGG GAAAATTCAA 4800 

AGGGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCTATG TGGAAGCAAG 4860 

AAGAATTGAC TGATTTACAG GACTTCTCTT TATGTCAATC TTAAGAGGAT GGATGAATCT 4920 

GGACATTTGT TCCACCCGAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAGGATCA 4980 

TATGACCATT GAAAAAGGAA AAATGTAGAC TCTGACTTCC GTCCCACTGA AGGATTAATG 5040 

AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG 5100 

TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT TTTTTTTTTT TTTTTTCCCC 5160 

TAAATGGTAT GGCCAAAAGT CAGAGTTAAA ATATATATAG TTAGATTCCA ACTTCCTCCT 5220 

TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT GGGGCTTAAG 5280

TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA 5340 

AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG 5400

GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT 5460 

GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 5520 

ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 5580 

CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 5640 

TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT 5700 

CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 5760 

AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 5820 

AACCCACAGA AGGGGATGGG AAATAAAGAA AATTCTCTCA AGACTTCTCC AGGCCCATGT 5880 

CACTGGTCAG CGTGGTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 5940 

AGTACTTTAT AACCAAAGGA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT 6000 

TTAGTTTTGC CATCACATTG TCACCCAGAC CTCACCTAGC CCCAAGTAAT CGGGCGCCCC 6060 

GAAGAGGGAG ACAGAGATGT GCCAGAGTTG ACCCAGTGTG CGGATGATAA CTACTGACGA 6120 

AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 6180 

TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG 6240 

GTGAGAATGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT 6300

GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC 6360

TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA 6420 



I I I I I 

3 AGGTCGAGGG TGCAGCGAGC CGTGATCGTG CTACTGCACT CCAGCCTGGG 
CAACACAGAG AGACCCTGTC TCAAAACAAA CAAACAAACA AACAAACAAA CAAAACAAAG 
T GGGAGTGGGC CGGGCGCGGT GACTCACACC TGTAATCCCA GCACTTTCGG 
'GGATCA CGAGGTCAGG AATTCAAGAT TAGCCTGGAC A 
AACCCCATCT CTACGAAAAA TACAAAAATT AGCCAAGTAT GGTGGCC 

CCAGCTACTC GGGAGACTGA GGCAGAGAAC TGCTTGAACC TGGGAGGCAG AGGTTGCAGT 360 

n GAT CCGAG AT CGCGTCACTG CACTCCAGCG TGGGCGACAG AGCGAGACTC CGTTTCAGAA 420 

AAGAAAAAAA AAAAAAAAAA AAAAAGGGAG TCGGGGTGGA GCTCTCATTG GCTCGTTGCA 480 

TGTGAGTGTC CCTACGGCCT AGAAATACCA GAGAAGCACA TOGGAACGGG CTGGAAATCC 540 

ACCCAGTTAA CTAGAGGGCT TTGAACCTTT TATTAACTTG GAGGTTGACT CTCCTGTCAA 600 

CTCGATTCCC TTTTGGCTGT TTGGCAGGGT CAGTGAGACA TCCCCTGGGT CGCTCGACCC 660 

CGTAGGACGG TTCAGGGAGC CCTCCAGGTC TTCGTTTCTC CTCTTCCCCG CACAGTGCTG 720

TTATCCAGCT GGGGGATCCA ACGCACACTT AAGGCTCCAG CAAAGTGGCT CCGCTGCCGG 780

ATGGGAGTGC CCCAGTGTGC TGGATGAAGC TGGCGCATGC ACCATGTCAT CATGTGTCTC 840 

TAGCCAGCCC AGCAGCAACC GGGCCGCCCC CCAGGATGAG CTGGGGGGCA GGGGCAGCAG 900

CAGCAGCGAA AGCCAGAAGC CCTGTGAGGC CCTGCGGGGC CTCTCATCCT TGAGCATCCA 960 

CCTGGGCATG GAGTCCTTCA TTGTGGTCAC CGAGTGTGAG CCGGGCTGTG CTGTGGACCT 1020 

CGGCTTGGCG CGGGACCGGC CCCTGGAGGC CGATGGCCAA GAGGTCCCCC TTGACTCCTC 1080

CGGGTCCCAG GCCCGGCCCC ACCTCTCCGG TCGCAAGCTG TCTCTGCAAG AGCGGTCCCA 1140 

GGGTGGGCTG GCAGCCGGTG GCAGCCTGGA CATGAACGGA CGCTGCATCT GCCCGTCCCT 1200 

GCCCTACTCA CCCGTCAGCT CCCCGCAGTC CTCGCCTCGG CTGCCCCGGC GGCCGACAGT 1260 

GGAGTCTCAC CACGTCTCCA TCACGGGTAT GCAGGACTGT GTGCAGCTGA ATCAGTATAC 1320 

CCTGAAGGAT GAAATTGGAA AGGGCTCCTA TGGTGTCGTC AAGTTGGCCT ACAATGAAAA 1380

TGACAATACC TACTATGCAA TGAAGGTGCT GTCCAAAAAG AAGCTGATCC GGCAGGCCGG 1440 

CTTTCCACGT CGCCCTCCAC CCCGAGGCAC CCGGCCAGCT CCTGGAGGCT GCATCCAGCC 1500 

CAGGGGCCCC ATTGAGCAGG TGTACCAGGA AATTGCCATC CTCAAGAAGC TGGACCACCC 1560 

CAATGTGGTG AAGCTGGTGG AGGTCCTGGA TGACCCCAAT GAGGACCATC TGTACATGGT 1620 

GTTCGAACTG GTCAACCAAG GGCCCGTGAT GGAAGTGCCC ACCCTCAAAC CACTCTCTGA 1680

AGACCAGGCC CGTTTCTACT TCCAGGATCT GATCAAAGGC ATCGAGTACT TACACTACCA 1740

GAAGATCATC CACCGTGACA TCAAACCTTC CAACCTCCTG GTCGGAGAAG ATGGGCACAT 1800

CAAGATCGCT GACTTTGGTG TGAGCAATGA ATTCAAGGGC AGTGACGCGC TCCTCTCCAA 1860 

CACCGTGGGC ACGCCCGCCT TCATGGCACC CGAGTCGCTC TCTGAGACCC GCAAGATCTT 1920 

CTCTGGGAAG GCCTTGGATG TTTGGGCCAT GGGTGTGACA CTATACTGCT TTGTCTTTGG 1980

CCAGTGCCCA TTCATGGACG AGCGGATCAT GTGTTTACAC AGTAAGATCA AGAGTCAGGC 2040 

CCTGGAATTT CCAGACCAGC CCGACATAGC TGAGGACTTG AAGGACCTGA TCACCCGTAT 2100 

GCTGGACAAG AACCCCGAGT CGAGGATCGT GGTGCCGGAA ATCAAGCTGC ACCCCTGGGT 2160 

CACGAGGCAT GGGGCGGAGC CGTTGCCGTC GGAGGATGAG AACTGCACGC TGGTCGAAGT 2220

GACTGAAGAG GAGGTCGAGA ACTCAGTCAA ACACATTCCC AGCTTGGCAA CCGTGATCCT 2280

GGTGAAGACC ATGATACGTA AACGCTCCTT TGGGAACCCA TTCGAGGGCA GCCGGCGGGA 2340 
GGAACGCTCA CTGTCAGCGC CTGGAAACTT GCTCACCAAA AAACCAACCA G 



CCCCCGTGGG GGAGGAGGAA GTGCTCTTGT GAGAGGCAGT CCCTGCGTGG AAAGTTGCTG
GGCCCCCGCC CCCGGCTCCC CCGCACGCAT GCATCCACTG CGGCCGGAGG AGGCCATGGA 2580

GCCCGAGTAG CTGCCTGGAT CGCTCGACCT GGCATGCGCG CCGCGICGCC TCTGGGG3GC 2640 

TGCTGCACCG CGTTTCCATA GCAGCATGTC CTACGGAAAC CCAGCACGTG TGTAGAGCCT 2700 

CGATCGTCAT CTCTGGTTAT TTGTTTTTTC CTTTGTTGTT TTAAAGGSGA CAAAAAAAAA 2760

AAAAAAAGGA CTTGACTCCA TGACGTCGAC CGTGGCCGCT GGCTGGCTGG ACAGGCGGGT 2820

GTGAGGAGTT GCAGACCCAA ACCCACGTGC ATTTTGGGAA ATTGCTTTTT AAAAGATTTT 2880 

TATGCCAAAA ATCCTTCATT GTGATTTTCA GAACCACGTC AGATATACCA AGTGACTGTG 2940 

TGTGGGGTTT GACAACTGTG GAAAGGCGAG CAGAAAACTC CGGGGGTCTG AGGCCATGGA 3000 

GGTGGTTGCT GCATTTGAGA GGGAGTAGGG GGCTAGATGT GGCTCCTAGT GCAAACCGGA 3060 

AACCATGGCA CCTTCCAGAG CCGTGGTCTC AAGGAGTCAG AGCAGGGCTG GCCCTCAGTA 3120

GCTGCAGGGA GCTTTGATGC AACTTATTTG TAAGAAGGAT TTTTAAATTT TTTATGGGTA 3180 

GAATTGTAGT CAGGAAAACA GAAAGGGCTT GAAATTTAAT AAGTGCTGCT GGAAGGGGAT 3240 

TTTCCAAGCC TGGAAGGGTA TTCAGCAGCT GTGGTGGGGA AACATTTCTC CTGAAAGACT 3300 

GAACGTGTTT CTTCATGACA GCTGCTCAAA GCAGGTTTCT GAGATAGCTG ACCGAGCTCT 3360

GGTAAATCTC TTTGTCAAAT TACGAAAACT TCAGGGTGAA ATCCTATGCT TTCATGTACA 3420

TTACATGGCT TAAGATTAAA CAAAAACATT TTTCAAGTCT CTAACTAGAG TGAACTCTAG 3480 

AGCACAGTAG TTCAGAAACT ATTTAGAGCT TCCAGGATAT ATTTCACAGC TTCAGGCATG 3540

TGATCAGTTA GAGCCGATGA AACCTATGCC CGCCTGTATA TATATTAGCA GCTTAGCTAG 3600 

TTCATAACCT GTATATTCTA AAGACTGCTA AGGTTTTGTT TTCATTTTAA ATCCTAGCTG 3660 

ATTGTTGTGG TCAATGAAAT ACCCAGTTTC TGAAGGGCCA GGTGGGAAAT GCTTTCACTG 3720

GACCAACACA CAAATGATCA TCCTGAGGAT CTGAGCTTCC CTAGACTCCA CACAATAACC 3780 

TTGGGGCACC CTTTTAGAGA AGACTGTTGA AACCCACAGC ACTCGTTGGG GTATGAGGAA 3840 

ACCAGGGCTT GGCACAGGAA GTTCCCCTTT GTAGCTAAAA GTCCAGAAAG AAAGGGTTCA 3900 

TCTTTTTGAC TTCCAACTGA TATTGGGAAG TTTGGTTGAG GTTCAAGTGT GACTCCTTCC 3960 

AGAGCCACAG GTAGGGGAGT GTGAAGTTGA GGGGGAGGAA AGCTGGAAGG ACTCTGCCTT 4020

GGGAGATTCC CAGCTCTGCT TTCCAGCGCT TGGTGGAATC TGGGCTGGGG AAAGACGGCA 4080 

CCGGGAAACT CTGCTTCCCC ATTGTTTCCA TCTGATCAGC TGTGGTGTGA GGACTTCTCA 4140 

GACAAAGGCA AGGCCTCGTG CCCCTGCCCA GCCCATTCAT GGAGCCCTGG GCCTTCTTGG 4200 

CTTCCATAGA TCCTAAGCTC TTGACTGTAG TTTAGCCAGA CTTGTTTTGC TATCTTATAA 4260

GCAGTTCAGA ATTAGGGAAT GCTGGTTTTG AAGAGCAAAG GACAGGTAGT CTAGAGAGGG 4320

TCGTCTGGCC TGCTTGCTGG GTCTTTGTAA CCCAGCACTT CCTCTTGCCC TCCTGGCTTT 4380 

ATGTTTATGG GAGAGGACTC AATAGCTCCA CCCCTTCTGG CACCAGATGG GGCTTGGTTA 4440 

GTTTGCAATA AGCACCTTGC AGAGGTTAAA GCCAGCGGGT CCCTAGTCTT AGGCCCAGCC 4500

TGCTTGTGTG GGCTCTGGCC TGGCCTGGTG GCTGGCCCAG GGGGCAGCAG TGCTTAGAGC 4560 

TTCTGCAGGG CTTCTCTTGT TTACACAGCT GCATCAGACA ATGCCATTTC TCCCCACCAC 4620

GGAACCTTCC ATCTAAGATT TCTTCCAGGG AATGCCAGCA ATCAGGCAGC ACCCAGCTGT 4680

GGGGGCAGTG GGGTGGGGGA GACCCACATT GATGACTTTT TTTTTTTCTT TTAATGAAGA 4740 

AACACCAAAG AAAGCTGTGG AAAGGACCTG CCCCACATGA AAAGGATAAG CCAAGATGGC 4800

TGTAAACACA GAGCATTTGA GCTCCCACTC TTGGAGCACA TTGATTTTTC AAAAGCCAGC 4860

TCTGTCAGGA AAGGAGGTGC TGTTATGACA GCTCTTCCAG TGGGCAAAGA GGACGCCCAT 4920

AATTTCTTCC ATTGCTAGCT CATCTGTGGG ACCAATTTGG TGTAAGCAAC CTGTGGCCTG 4980 

CACTTGTGGC CTCCAAGGAA CCACAAACCC TCCATCCACT TCCCATTTCC TCTGCCCTTT 5040 

TCCACCTCCC CCTTCCATCC CACCAGCTGC CAGTGGCTCC CAGAAAGCCT TATTGAGCCC 5100 

CTTGTTGACA CTTGGGGCTG CGGAGGCCTT TCCCTACTGG TCTGGCCTTT CCTGAGAGGC 5160 

AGGTCTTCCG TCCTCAGAGC CTTTCTGGAA CAAGGAGAAT GCCTGTGCAG GTGGACACAC 5220

AGGCCTGGCC TGTCGCTCTC ACTTGTCTTC CAGCGGGGAG CTTCACGTTG CCGAGTGGAA 5280 

GAACCATGAC CTCCACTTGC TTCCAAGGTG CTAGGGAAGT TTCAGGGTAC GCTGGTTCCC 5340 
CTCTCCAGCT GGAGGCCGAG TTTCTGGGGA CTGCAGATTT TTCTACTCTG TGATCGATTC 5400 
AATGCCCGAT GCTTCTGTTT CATTCCCGAC CCTTTCTACT ATGCATTTTC CTTTTATCAG 5460 
GTGTATAAAG TTAAATACTG TGTATTTATC ACTAAAAAGT ACATGAACTT AAGAGACAAC 5520
TAAGCCTTTC GTGTTTTTCC ACAGGTGTTT AAGCTTCTCT GTACAGTTGA AATAAACAGA 5580 
CAGCAAAATG GTAAAAAAAA AAAAAAAAAA A 5611 

Seq ID NO: C193 DNA sequence 
Nucleic Acid Accession #: NM_018646
Coding sequence: 217.. 2394

1 11 21 31 41 51 

I I I I I I

GCTCTGCCAA GTGTAACAAA CTCACAGCCC TCTCCAAACT GGCTGGGGCT GCTGGGAGAC 60
TCCCAAGGAA CTCGTCAGGA AGGCAGGAGA CAGGAGACGG GACCTCTACA GGGAGACGGT 120 
GGGCCGGCCC TTGGGGGGGC TGATGTGGCC CCAAGGCTGA GTCCCGTCAG GGTCTGGCCT 180 
CGGCCTCAGG CCCCCAAGGA GCCGGCCCTA CACCCCATGG GTTTGTCACT GCCCAAGGAG 240 
AAAGGGCTAA TTCTCTGCCT ATGGAGCAAG TTCTGCAGAT GGTTCCAGAG ACGGGAGTCC 300 

TGGGCCCAGA GCCGAGATGA GCAGAACCTG CTGCAGCAGA AGAGGATCTG GGAGTCTCCT 360
CTCCTTCTAG CTGCCAAAGA TAATGATGTC CAGGCCCTGA ACAAGTTGCT CAAGTATGAG 420 
GATTGCAAGG TGCACCAGAG AGGAGCCATG GGGGAAACAG CGCTACACAT AGCAGCCCTC 480 
TATGACAACC TGGAGGCCGC CATGGTGCTG ATGGAGGCTG CCCCGGAGCT GGTCTTTGAG 540 
CCCATGACAT CTGAGCTCTA TGAGGGTCAG ACTGCACTGC ACATCGCTGT TGTGAACCAG 600 

AACATGAACC TGGTGCGAGC CCTGCTTGCC CGCAGGGCCA GTGTCTCTGC CAGAGCCACA 660

TTTGCTGCCT GTGTGAACAG TGAGGAGATC GTGCGGCTGC TCATTGAGCA TGGAGCTGAC 780 
ATCCGGGCCC AGGACTCCCT GGGAAACACA GTGTTACACA TCCTCATCCT CCAGCCCAAC 840 
AAAACCTTTG CCTGCCAGAT GTACAACCTG TTGCTGTCCT ACGACAGACA TGGGGACCAC 900 

CTGCAGCCCC TGGACCTCGT GCCCAATCAC CAGGGTCTCA CCCCTTTCAA GCTGGCTGGA 960
GTGGAGGGTA ACACTGTGAT GTTTCAGCAC CTGATGCAGA AGCGGAAGCA CACCCAGTGG 1020 
ACGTATGGAC CACTGACCTC GACTCTCTAT GACCTCACAG AGATCGACTC CTCAGGGGAT 1080 
GAGCAGTCCC TGCTGGAACT TATCATCACC ACCAAGAAGC GGGAGGCTCG CCAGATCCTG 1140 
GACCAGACGC CGGTGAAGGA GCTGGTGAGC CTCAAGTGGA AGCGGTACGG GCGGCCGTAC 1200 

TTCTGCATGC TGGGTGCCAT ATATCTGCTG TACATCATCT GCTTCACCAT GTGCTGCATC 1260
TACCGCCCCC TCAAGCCCAG GACCAATAAC CGCACGAGCC CCCGGGACAA CACCCTCTTA 1320 
CAGCAGAAGC TACTTCAGGA AGCCTACATG ACCCCTAAGG ACGATATCCG GCTGGTCGGG 1380 
GAGCTGGTGA CTGTCATTGG GGCTATCATC ATCCTGCTGC TAGAGGTTCC AGACATCTTC 1440
AGAATGGGGG TCACTCGCTT CTTTGGACAG ACCATCCTTG GGGGCCCATT CCATGTCCTC 1500
ATCATCACCT ATGCCTTCAT GGTGCTGGTG ACCATGGTGA TGCGGCTCAT CAGTGCCAGC 1560

GGGGAGGTGG TACCCATGTC CTTTGCACTC GTGCTGGGCT GGTGCAACGT CATGTACTTC 1620

GCCCGAGGAT TCCAGATGCT AGGCCCCTTC ACCATCATGA TTCAGAAGAT GATTTTTGGC 1680 

GACCTGATGC GATTCTCCTG GCTGATGGCT GTGGTCATCC TGGGCTTTGC TTCAGCCTTC 1740 

TATATCATCT TCCAGACAGA GGACCCCGAG GAGCTAGGCC ACTTCTACGA CTACCCCATG 1800

GCCCTGTTCA GCACCTTCGA GCTGTTCCTT ACCATCATCG ATGGCCCAGC CAACTACAAC 1860 

GTGGACCTGC CCTTCATGTA CAGCATCACC TATGCTGCCT TIGCCAICAT CGCCACACTG 1920

CTCATGCTCA ACCTCCTCAT TGCCATGATG GGCGACACTC ACTGGCGAGT GGCCCATGAG 1980 

CGGGATGAGC TGTGGAGGGC CCAGATTGTG GCCACCACGG TGATGCTGGA GCGGAAGCTG 2040 

CCTCGCTGCC TGTGGCCTCG CTCCGGGATC TGCGGACGGG AGTATGGCCT GGGAGACCGC 2100 

TGGTTCCTGC GGGTGGAAGA CAGGCAAGAT CTCAACCGGC AGCGGATCCA ACGCTACGCA 2160 

CAGGCCTTCC ACACCCGGGG CTCTGAGGAT TTGGACAAAG ACTCAGTGGA AAAACTAGAG 2220

CTGGGCTGTC CCTTCAGCCC CCACCTGTCC CTTCCTATGC CCTCAOTGTC TCGAAGTACC 2280 

TCCCGCAGCA GTGCCAATTG GGAAAGGCTT CGGCAAGGGA CCCTGAGGAG AGACCTGCGT 2340 

T GGAGGACGGS GAGAGCTGGG AATATCAGAT CTGACTGCGT 2400 

T CGCTTCCTGG AACTTGCTCT CATTTTCCTG GGTGCATCAA ACAAAACAAA 2460 

G GCCCCAGGGG AGAAAGAGGA GTAGCATGAA 2520 






\ GCCCAGCCCA AGCACGGGGC TGGCAGGGCG TGAGGAACTC T 
TGCTCATCAC CCTTCCGACA GGAGCACTGC ATGTCAGAGC ACTTTAf 
TGCTTGGGCG CTCGGTCTCC ACCCCAGGGT CATAAGTGGG GAGAGAGCCC T 
ACCCAGGCAG GTGCAGGGAA GTGCAGAGCT TGTGGAAAGC GTGTGAGTGA GGGAGACAGG 
AACGGCTCTG GGGGTGGGAA GTGGGGCTAG GTCTTGCCAA CTCCATCTTC AATAAAGTCG 



I I I I I I 

GTTCTCCACA ACTGCCAGCA ATCCTTCCAC CAGGCAAAAC ACATCATCTA A~ 

GTGAGGTTTG CTTAGGGCGT GGCAGCTTCG GATAAACGCA GGACTCCGCC TGGCAGCCCG 120 

ATTTCTCCCG GAACCTCTGC TCAGCCTGGT GAACCACACA GGCCCGAGTT TCACCCAGTC 180

CCCACTCCAC GGTGCAGCTG CGGCTTATCT CTCAGCCCAG CGAGATGCCA GCCTTCCTGT 240 

CCCGGGCCAG CGCTCTGACA TGCAGAAGGT GACCCTGGGC CTGCTTGTGT TCCTGGCAGG 300 

CTTTCCTGTC CTGGACGCCA ATGACCTAGA AGATAAAAAC AGTCCTTTCT ACTATGACTG 360 

GCACAGCCTC CAGGTTGGCG GGCTCATCTG CGCTGGGGTT CTGTGCGCCA TGGGCATCAT 420 

CATCGTCATG AGTGAGTGGA GGAGCTCGGG GGAGCAGGCG GGCCGGGGCT GGGGCTCCCC 480

TCCCCTGACC ACTCAGCTCT CCCCAACAGG TGCAAAATGC AAATGCAAGT TTGGCCAGAA 540

GTCCGGTCAC CATCCAGGGG AGACTCCACC TCTCATCACC CCCGGCTCAG CCCAAAGCTG 600

ATGAGGACAG ACCAGCTGAA ATTGGGTGGA GGACCGTTCT CTCTCCCCAG GTCCTGTCTC 660

TGCACAGAAA CTTGAACTCC AGGATGGAAT TCTTCCTCCT CTGCTGGGAC TCCTTTGCAT 720

GGCAGGGCCT CATCTCACCT CTCGCAAGAG GGTCTCTTTG TTCAATTTTT TTTAATCTAA 780 

AATGATTGTG CCTCTGCCCA AGCAGCCTGG AGACTTCCTA TGTGTGCATT GGGGTGGGGC 840 

TTGGGGCACC ATGAGAAGGT TGGCGTGCCC TGGAGGCTGA CACAGAGGCT GGCACTGAGC 900 

CTGCTTGTTG GGAAAAGCCC ACAGGCCTGT TCCCTTGTGG CTTGGGACAT GGCACAGGCC 960 

CGCCCTCTGC CTCCTCAGCC ATGGGACCTC ATATGCAATT TGGGATTTAC TAGTAGCCAA 1020 

AAGGAATGAA AGAGAGCTCT AACCAGATGG AACACTGGAA CATTCCAGTG GACCCTGGAC 1080 

CATTCCAGGA AAACTGGGAC ATAGGATCGT CCCGCTATGA TGGAAGTGTT CAGACAGTTT 1140 

ATAATAGTAA GCCCCTGTGA CCCTCTCACT TACCCCGAGA CCTCACTTTA TTACAAGATC 1200

TTTCCAAATA CCCAAATATC CCTGCAAGCC CGTTAAATAA TTCCCTATGC TACCCTTAAT 1260

AACATACAAT GACCACATAG TGTGAGAACT TCCAACAAGC CTCAAAGTCC CTTGAGACTC 1320 

CCCAATACCT AATAAGGCAT GCGAAATGTT CTCATGAACT ACCCCACAAC ACGCCTAAAA 1380 

CTCAAAACAC CCAAAAATAT CTCCTCCAAT GTCCTGAGAC ATGAACCCAA AAAGAGACCC 1440 

ACAATAAACT CGTGACTTGT CCCCTC 1466 

Seq ID NO: C195 DNA Sequence

Nucleic Acid Accession #: NM_005971.2

Coding sequence: 176.. 439 



GTTCTCCACA ACTGCCAGCA ATCCTTCCAC CAGGCAAAAC ACATCATCTA AGGAAAAGAA 
GTGAGGTTTG CTTAGGGCGT GGCAGCTTCG GATAAACGCA GGACTCCGCC TGGCAGCCCG
ATTTCTCCCG GAACCTCTGC TCAGCCTGGT GAACCACACA GGCCAGCGCT CTGACATGCA
GAAGGTGACC CTGGGCCTGC TTGTGTTCCT GGCAGGCTTT CCTGTCCTGG ACGCCAATGA 
CCTAGAAGAT AAAAACAGTC CTTTCTACTA TGACTGGCAC AGCCTCCAGG TTGGCGGGCT 
CATCTGCGCT GGGGTTCTGT GCGCCATGGG CATCATCATC GTCATGAGTG CAAAATGCAA 
ATGCAAGTTT GGCCAGAAGT CCGGTCACCA TCCAGGGGAG ACTCCACCTC TCATCACCCC
AGGCTCAGCC CAAAGCTGAT GAGGACAGAC CAGCTGAAAT TGGGTGGAGG ACCGTTCTCT 
GTCCCCAGGT CCTGTCTCTG CACAGAAACT TGAACTCCAG GATGGAATTC T 
rGCATGG CAGGGCCTCA TCTCACCTCT C 



TGGGACATGG CACAGGCCCG CCCTCTGCCT CCTCAGCCAT GGGACCTCAT ATGCAATTTG 



\ ACTGGGACAT AGGATCGTCC CGCTATGATG 
= CCCTGTGACC CTCTCACTTA C 
TCACTTTATT ACAAGATCTT TCCAAATACC CAAATATCCC Tl 
CCCTATGCTA CCCTTAATAA CATACAATGA CCACATAGTG TGAGAACTTC CAACAAGCCT 
CAAAGTCCCT TGAGACTCCC CAATACCTAA TAAGGCATGC GAAATGTTCT CATGAACTAC 
CCCACAACAC GCCTAAAACT CAAAACACCC A 
TGACTTGTCC CCTC



I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC G~ 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 120 

CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCGTGATG TTGTCTATGG CCCCCAGCCC 180

CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 240 

AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 300

GACCACAAAC TGCGCCCTGG CATTGGAGAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 360 

GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 420

TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 480

AATGGCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT TTTTTAGGAA TTCTAAGAGG 540

ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 600 

GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 660 

CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720

ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 780

TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 840 

GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 900 

CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960 

ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 1020 

r TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 1080 

A TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 1140 

T ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 1200

AATAGCCGTG CCCATGCCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCCG CCAACATCAG 1260 

GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 1320 

TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 1380 
AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT G 
A GTACCTGGCA GCAGGGCCGC CTCTGCJ 

CTTTCCC AGTGACTTTC TTCTTCTTCA A 
TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 



AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT 
CTCCCCTACC TGGCCCATTC 



Z TAAGTACAGG CGGATTAGCT A 
AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 



TCAGATTATT ATGTTCTCAG TTCTCTCTCC C 

GA TTATAGTAAC 
A CCCCTGTGAC TCTTTCTGTA
C TAGCCCTGAC CCAGGCACTA GGCCTTGGTG ACTTCCTGGG 
GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 
ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA T. 



GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT G 
GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 
GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA G 
ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT C 
CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 
GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 
GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 
TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 
GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 

Seq ID NO: C197 DNA Sequence 

Nucleic Acid Accession #: NM_021984.1

Coding sequence: 572.. 1753 

1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 
CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 
GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT
CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 
TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 
GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 
CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA
3 CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 
T GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 
CCAGACCTGG TACGACGAAC GCCTCTGTTA CAACGACACC TTTGAGTCTC TTGTTCTGAA 
TGGCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGGAATT CTAAGAGGAC 
CCACGAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG ATGGCAAGGT 
GTTGTACACA ATTAGGATGA CCATTGATGC CGGATGCTCA CTCCACATGC TCAGATTTCC
AATGGATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGAGAT 
GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT GGAAGCTCTT 
CAGTTGGTGA



AAACTATGTC CCTTCTTCCG T 

AGAGTCTGCT CCAGCCCGGA CCTCTCTAGG GATCACCTCT G
GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT A 

TATCGCCATC TGCTTCGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 1320 

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTCGTATCAA 1380

TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCCGCC AACATCAGGA 1440 

AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTG 1500 

CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 1560 

GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TGAGGGCAGT ACCTGGCAGC AGGCCCGCCT CTGCATCCAT GTCTACCGCC TGGATAACTA 1680

CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 1740 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG 1860 

TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 2100 

TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 2160 

TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 2280 

T GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

\ AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400 

T CTCCTTGCTG GCTCCATCTT TCGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CCTGTGACTC TTTCTGTAAC 2580

TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 2640 

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT C

CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC T 
CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC T 



TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCAGATG A 
GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 
CTTCTAGACC ACATGATAGG GCTAGACAGC TCAGTTCACC A 
CTGCTGGCAC ACCAGTGGCA AGGCCCA 
CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 
ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTGT CTGTAATTTG 
TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 
AAATATCTAA ATAAATATAT CACCAAAGC 

Seq ID NO: C198 DNA Sequence 
Nucleic Acid Accession #: NM_021987.1 
Coding sequence: 572.. 1657 

I I' I' 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC T 
CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 
GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 
CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 
TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 
GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 
CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA
CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 
CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 
CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 
CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCCGGATG 
CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 
CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 
TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 
T GGTCATGACG ATTTTCTTCA ATGTGAGCAG
A TGTCCCTTCT TCCGTGACCA CGATGCTCTC 
CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 
CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 
CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 
GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 
TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 
AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 
TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG G 
GGGTCCCCGC AGCCTCTGCT CCAAGC 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG C 
CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 

CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 
CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 
AGGGTTTGTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 
ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ft 
CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG C 
GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT T 
GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 
GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG TATAACAGGA 
GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT C
GGGGGCAGCA






AGAGAGCCTA TTTGGGACAG CATTCCTCTC TCTCTGCTGC TGTGACATCT CCCTCTCCTT

GCTGGCTCCA TCTTTCGTCT GCACTACCAA TTCAATGCCC TTCATCCAAT GGGTATCTAT 

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATG CCACCCTCTT CCTTCTCTTT 

GACCCCTGTG ACTCTTTCTG TAACTTTCCC AGTGACTTCC CCTAGCCCTG A 

AGGCCTTGGT GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TCGGCTTTGC A 
ACTCGCCATT GATTGGTGCC CACCCAGGGC ACACTGTCGG AGTTCT7 

CCCTGGACCC ATAAACCAGT CCACTGTTAT ACCCGGGGCA CTCTAACCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGGAACTTTG GCAAAGCACT TTTGACAAGT 2760 
TGTGTCTGAT TGGAGCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820

GCTATCCAAG AGCCCACTGT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAGTTC 2940 

ACCATGATTC TCTTCTGTCA CCTCTGCTGG CACACCAGTG GCAAGGCCCA GAATGGCGAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCCCCAAGA TCAAATCTCT 3060 

CCTGGCTGTA GTAACCCAGT GGAATGAATT TGGACATGCC CCAATGCTTC TATATGCTAA 3120

GTGAAATCTG TGTCTGTAAT TTGTTGGGGG GTGGATAGGG TGGGGTCTCC ATCTACTTTT 3180

TGTCACCATC ATCTGAAATG GGGAAATATG TAAATAAATA TATCAGCAAA GC 3232 

Seq ID NO: C199 DNA Sequence 
Coding sequence: 1309.. 2490



1 11 21 31 41 51 

I I I I I I

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 
CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 
GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 
CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 
CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT
TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 
TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 
GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 
CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 
CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG
TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 
C TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 
3 TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 
CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 
CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA

C AGGGCAGAGC TTCATTTTTC GTTTGAATGT 
3 AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG TCATGTTCTC 
TATGGCCCCC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 
GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 
A CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 
A TCCTAGACAT G 

A TCTTCTCCCA G 

3 CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 
AGGAATTCTA AGAGGACCCA C 

A TTGATGCCGG A 
A GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 
3 GAAAATTTCA A 

ACAACCCCAG TTGGTGACTT C 

A GCCCGGACCT CTCTAGGGAT C 
CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 
ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 
TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 
CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 
GCCCGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220
GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280
CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 
TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 
TACCGCCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 
CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 
TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 
AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640
GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700
CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760
TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820
TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880
ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940
TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000
ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 
CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 
CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 
G TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 
C CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300
GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360
GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420
ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480
CCCATAAACC AGTCCACTGT TATACCCGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540
ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 
GATTGGAGCT TCATGATAGC CTTGTGACAT CTTTAGGGCA GGATTCTTAT CCCCATTTTG
CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC
AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 
TTCTCTTCTG TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 
TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTGGCT
GTAGTAACCC AGTGGAATGA A 

CTGTGTCTGT AATTTGTTGG GGGGTGGATA GGGTGGGGTC TCCATCTACT T 
ATCATCTGAA ATGGGGAAAT ATGTAAATAA ATATATCAGC A 

Seq ID 

Nucleic Acid Accession 
Coding 



GGGCTGGAAT ACCCTTCTGG AGCCATCATG GAGACGCCAT CCTGGGCCTG GAGGAAGTGC 
GGCTGACGCC ATCCATGAGG AACCGGAGTG GCGCCGTGTG GAGCAGGGCC TCTGTCCCCT 
TCTCTGCCTG GGAAGTAGAG GTGCAAATGA GGGTGACGGG ACTGGGGCGC CGGGGAGCCC 
ACGGCATGGC CGTGTGGTAC ACCCGGGGCA GGGGCCATGT AGGCTCTGTC CTTGGGGGGC 
TGGCTTCGTG GGACGGCATC GGGATCTTCT TTGACTCTCC GGCAGAGGAT ACTCAGGACA 
GTCCTGCCAT CCGTGTGCTG GCCAGCGACG GGCACATCCC CTCTGAGCAG CCTGGGGATG 
3 GAACCGGCCA CACCCCTTCA 
C CTTGAACAGT GGCCTCACTC 
CCAGTGATCC AGGTGAGTTC TGTGTGGATG TGGGGCCCCT GCTTTTGGTC CCTGGAGGTT 
Z ACCGGCACCC TGGCAGATGA TCATGATGTC CTGTCCTTCC 



AGCAGCTCCG CCTGGCGAGG CAGCTGGAAG GGCTGTGGGC AAGGCTGGGC 
& TCAGACTCTG AAGCTCAAGG 



GACAGTGGAC TCTGCTCCAG GCCCTGCAAG AGATGAGGGA TGCAGCTGTC CGCATGGCTG 
CAGAAGCCCA GGTCTCCTAC CTGCCTGTGG GCATTGAGCA TCATTTCTTA GAGCTGGACC
ACATCCTGGG CCTCCTGCAG GAGGAGCTTC GGGGCCCGGC GAAGGCAGCA GCCAAGGCCC
CCCGCCCACC TGGCCAGCCC CCAAGGGCCT CCTCGTG 
TCTACCTCCT CATTCAGACT GTAGGCTTCT TCGGCTACCT C 
ACAAGAGCCT TCAGGAGTGT CTGTCCACAG GCAGCCTTCC TCTGGGTCCT GCACCACACA 1560 
CCCCCAGGGC CCTGGGGATT CTGAGGAGGC AGCCTCTCCC TGCCAGCATG CCTGCCTGAC 1620 
CCACCTCACA GCCTGCTTTG CATCACTGGG AAGCAGGCAG TGTCTTGGGT GGGGGCTTGG 1680
TCAGTATCCT CTCCGTCTGG GTGCCCAGCT CCCACGCACA CCTGAGCTTT CGGCATGCTC 1740 



AGCCAAGAGA GGGGGGACAG ACAGATGGAA AGACGGACAC GGGGAGCTCT GGGGAGCAGG 60 

CGGCCCCCAC CCCCACTTCC AGCTCTGAGA CATTTGTGCA CAGGCCTGCA GGCGGCTGGG 120 

ATGGCCTGGC CGGGCACGCT CTGGAGACAC ACATGCCAAG GCAGGGCCNG GGCAGCCGAG 180 

GGCCCGTGGG GGCTGTTCCG ACCACACAGG TGCCCGAGGG AGGCAGGGCA G3CCCCGGTG 240 



Mi^a™ «™,^*W3 CGGCGCCTAC GCAATGCACG TTACGCCGCG GTGGAAGGGG 
TGTCACCGCC ACAGCGGCCG CACCGTGCGT GGCTCTGTTT CCTGGAAACG TCCAGAACAG
GCAGCCCCCG AGACAGGGAG GGGCCCAGCG GTCGCCAGGG GAAGTGGGGA TGGGAATGAG 
TGCGGATGGG GCTGAGGTTC TTCCTGAAGG ATGAATGCTC TGAAACTGTG AATAGACACA 
CGGAACTGTG CACGTTCACN GG1GAACCGT GTGGGGAGTG AATCCATCTC AACAGAGCTG 
TTACACAGTG AACAAGAGCA CGANGGGTGG AGGCCNCTTG TCTGCTCCTG CAGATCGAGC 
CGGCTGCTCA GAGCCGGGAG ATGGCCACTC AGCTCCTTTA ACTG 

Seq ID NO: C202 DNA Sequence
Nucleic Acid Accession #: XM_167803.2
Coding sequence: 1162.. 1488 






G ACCCCGAGGC CTCGTGGACG TGGGAGAGAG TGTGGTGGGA AGTCTTGAGC 
GGAGGAGGGG ATCTGCCCTT CTCCACTCCT CTCTTGGATC CGCCTCGGTT TCCTGTCCCC 
CCACCACCCG CCTGCCCCGC GGAAGACCGC CCAGTGAGCC AGCCCCCACC TTCCAGGCGC 
CTTCGCCCTG GGGATCCAAC CAACTTGTAT CGAGTGG3CG 3GGCAACGGC TCCCCATTTT 
TCCCGAGCCC CGCCCACAGA GCTCTTAGCC AATCCTATGC AGAGAGCATC TCCTGGCAGG 
GGTCTCTTCC CAACCAGACC CCACCO 

CGCCCCACCA CCACCACGTG CAGGTGGAGC TCTGGGATGC T. 
TGGGCCGAGG GCCGGGTAGG CTAGCACGGG AGGTAAGGGT GGTATGGGAT 0 
CGGTCTAGGG CAATAGGAGA GCAGAGAATG GGGGAACTTG AGGGTGGGGG GAGGGCACCG 
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G GACTTTGGGC A 

CCCCATCTGT AAAATGATGG TAATAATACT TCACCTACCT CATAGGGGAG GTTGTGAGGC 
CACCATCACC TGACCTGGGG GTCAAGGCAG GAGGACTCCG AAGGTGCTAC CCGTGAGCAA 









GCACCTGCCC TCTGGCACCA GATGCCAGGG AAGGGACAGA GGAAAACAGC CACAAACAAG 1200 

CCAGGGGGGC TCCCCGGAGC CCCAGGGGTG GGGATTGGTG GCCACTGTTT GTATGTTCTT 1260 

GAGTGCAAGT GTTTTATAAA AAATAAAACA AAAACCCACC ATCACAAAAA AAAAAATTTT 1320

GCAGCGAAGA GAAATGAAGA AAAACTGAAG AAAAAAAAAA AACAGGAAAA AAAGAACCAT 1380 

ACAAAATTTT TCCACCACAC ATACCCTCTA AGCCAGCAAG ATTTCCTCTT TGCAAAATCA 1440 

TATTTTTGTG GGAATGGGCC CTGCTTTTTG TGGCAAGGCC TGTTCTGATT AATAAAGGAT 1500 

CGTGAAAAAG T 1511

Seq ID NO: C203 DNA Sequence 

Nucleic Acid Accession #: NM_024780.1 

Coding sequence: 31.. 1023 

i i 1 r i 1 i 1 r 

ATTAATCTGG CCGTGCCATG CATCTACTCC ATGTTCAGGC TTGTGGAGAG GTACGAGATG 60

CCACGGCACG AAGTCTACGT TCTCCTGATC CGAAACATCT TTTTGAAAAT ATCAATCATT 120 

GGCATTCTTT GTTACTATTG GCTCAACACC GTGGCCCTGT CTGGTGAAGA GTGTTGGGAA 180 

ACCCTCATTG GCCAGGACAT CTACCGGCTC CTTCTGATGG ATTTTGTGTT CTCTTTAGTC 240

AATTCCTTCC TGGGGGAGTT TCTGAGGAGA ATCATTGGGA TGCAACTGAT CACAAGTCTT 300 

GGCCTTCAGG AGTTTGACAT TGCCAGGAAC GTTCTAGAAC TGATCTATGC ACAAACTCTG 360 

GTGTGGATTG GCATCTTCTT CTGCCCCCTG CTGCCCTTTA TCCAAATGAT TATGCTTTTC 420 




ATTTCTTCCA TGCCACCTGT G 
CAGGAGCGGA AACTGAC 

TTTAATACCT GGATTGCTGA TTTTTCAAGA C 
GTTGTAATAT TGAAATGAGC CTACAAAAAC C 
ATTATCTTCA AGAAGTGTGT GCAGGAATGA TTGGTTCTTA GAAATCTCTC CTGCCAGACT 
TCCCAGACCT GGCAAAGGTT TAGAAACTGT TGCTAAGAAA A 



ACTTTCAACT AAAGACATCC CCCATGTCCC AAAGGTGGAA TACAACCAGA GGTCTCATCT 
CTGAACTTTC TTGCGTACTG ATTACATGAG TCTTTGGAGT CGGGGATGGA GGAGGTTCTG
FTATACA TGACCATCAA AGTCCTACGT CAAGCTAGCT TTGCAGTGGC 
3 CCAATGAGAT TTATCCGAGA CGCGATTATT GCTAATTGGA AATTTTCCCA 
ATACCCCACC GTGATGACTT GAAATATAAT CAGCGCTGGC AATTTTTGAC AGTCTCTACG 
GAGACTGAAT AAG 

Seq ID NO: C204 Protein Sequence 



CCCTGCGAGT CCCAATCAAT GCGGGTGTGA CTGCTGTATG AAGTCTCAGG AAGCCCACCC 
CAGTGGAGAA GAAGGCTTGC AGGAGGCAGG AGATGCTGTC CGATGACCAC GTGAATGAAA
TCATCATACA GGTTGAGAAT GTTTCCTCTG GGGTCCAAAG CCACCCATCC TCAAAICAGA 
TTTTTCAAGA AAAGGTGCTG CTAGACTCAA GCATCAACAT GGTTTTGTCA ATATCTGACA
TTGATGTGAT AGACTCTCAG ACAGTCAGCA AAAGGAATGA CCAAAAGGGT AACCAGGTGC
TGCGGTTTTC AACATCTTTG AATGAGTCGA TGTCTCAGAC CCTTCATAGC CTAGAATGCA 
TGGGCATAGA CACTCCTGGT TCTTCACATG AAACTGTTCA AGGACAGAAG TTAATCGCAT 
CCCTTATACC CATGACATCC AGAGACAGAA TTAAAGCCAT CAGGAACCAG CCAAGGACCA 
TGGAAGAGAA AAGGAACCTT AGGAAAATAG TTGACAAAGA AAAAAGCAAA CAGACCCATC 
GTATCCTTCA GCTCAATTGC TGTATTCAGT GTCTGAACTC CATTTCCCGG GCTTATCGGA 
GATCCAAGAA CAGCCTGTCG GAAATTCTGA ATTCCATCAG CCTGTGGCAG AAGACGCTGA
AGATCATTGG AGGCAAGTTT GGAACCAGCG TCCTCTCCTA TTTCAACTTT CTGAGATGGC 
TTTTGAAGTT CAACATTTTC TCATTCATCC TGAACTTCAG CTTCATCATA ATCCCTCAGT 
TTACCGTGGC CAAAAAGAAC ACCCTCCAGT TCACTGGGCT GGAGTTTTTC ACTGGGGTGG 
GTTATTTTAG GGACACAGTG ATGTACTATG GCTTTTACAC CAATTCCACC A 
GGAACAGCGG GGCATCCTAC AACATGCAGC TGGCCTACAT CTTCACf 

3 CTTCTTCAGT TTGCTGTTCA GCATGGCCAA GTATTTCCGG A 



3 AAGCTAAAAC AGAAGAATCT T 
A GCTCCGTCAG GAGAATTCCA AGTTGACGTT C 
TCTCTGCCTA CATGGTAGCC TGGGTTGTCT CTACAGGAGT GGCCATAGCC TGCTGTGCAG 
CCGTGGCCCT GTCTGGTGAA GAGTGTTGGG AAACCCTCAT TGGCCAGGAC A
TCCTTCTGAT GGATTTTGTG TTCTCTTTAG TCAATTCCTT CCTGGGGGAG TTTCTGAGGA
iCAACTG ATCACAAGTC TTGGCCTTCA G 
A ACTGATCTAT GCACAAACTC TGGTGTGGAT TGGCATCTTC T
TGCTGCCCTT TATCCAAATG ATTATGCTTT TCATCATGTT CTACTCCAAA AATATCAGCC 



G GAACCTCATT GGAAGTGTGC ACTTCTTTTT C 
TAATCATCAC CTATCTTTAC TGGCAGATCA CAGAGGGAAG GAAGATTATG ATAAGGCTGC 
TCCATGAGCA GATCATTAAT GAGGGCAAAG ATAAAATGTT CCTGATAGAA AAATTGATCA 
AGCTGCAGGA TATGGAGAAG AAAGCAAACC CCAGCTCACT TGTTCTGGAA AGGAGAGAGG 
TGGAGCAACA AGGCTTTTTG CATTTGGSGG AACATGATGG CAGTCTTGAC TTGCGATCTA 
GAAGATCAGT TCAAGAAGGT AATCCAAGGG CCTGATGACT CTTTTGGTAA CCAGACACCA 
ATCAAATAAG GGGAGGAGAY GAAAATGGAA TGATTTCTTC CATGCCACCT GTGCCTTTAG 
GAACTGCCCA GAAGAAAATC CAAGGCTTTA GCCAGGAGCG GAAACTGACT ACCATGTAAT
TATCAAAGTA AAATTGGGCA TTCCATGCTA TTTTTAATAC CTGGATTGCT GATTTTTCAA 



ACCTAGGAAG AGATAACTAG GGAATAATGT ATATTATCTT CAAGAAGTGT GTGCAGGAAT 2820 



2940 

CCTCTGAATT GTAGAACCTG CATTTATTTG TGACTTTGAA CTAAAGACAT CCCCCATGTC 3000 

CCAAAGGTGG AATACAACCA GAGGTCTCAT CTCTGAACTT TCTTGCGTAC TGATTACATG 3060 

AGTCTTTGGA GTCGGGGATG GAGGAGGTTC TGCCCCTGTG AGGTGTTATA CATGACCATC 3120 

AAAGTCCTAC GTCAAGCTAG CTTTGCAGTG GCAGTACCGT AGCCAATGAG ATTTATCCGA 3180 

GACGCGATTA TTGCTAATTG GAAATTTTCC CAATACCCCA CCGTGATGAC TTGAAATATA 3240 

ATCAGCGCTG GCAATTTTTG ACAGTCTCTA CGGAGACTGA ATAAG 3285 



CTTGCTCTGC CCCAAGACAC ACCCTAGCCC CTCCTTATTC TCAAAAGGGG GAGCTGGGGA 
GCCTCCCCCT ACCCTGGGGC CTCCCCTGCC CCTCCCCGCC C 
CCCACAGGCC ACAGCGCTCT CCTCTCCCTC ACACCAAAAG TCCCAGAGCC A 
GCTGACGACC TGCAAGCCAC AGTGGCTGCC C 

GCAGGAAGCT GGCTGAGCCC CAAGACCCCG GGGGCCATGG GCGGGGATCT GGTGCTTGGC 
CTGGGGGCCT TGAGACGCCG AAAGCGCTTG CTGGAGCAGG AGAAGTCTCT GGCCGGCTGG 
GCACTGGTGC TGGCAGGAAC TGGCATTGGA CTCATGGTGC TGCATGCAGA GATGCTGTGG 
TTCGGGGGGT GCTCGTGGGC GCTCTACCTG TTCCTGGTTA AATGCACGAT CAGCATTTCC 
ACCTTCTTAC TCCTCTGCCT CATCGTGGCC TTTCATGCCA AAGAGGTCCA GCTGTTCATG 
ACCGACAACG GGCTGCGGGA CTGGCGCGTG GCGCTGACCG GGCGGCAGGC G 
GTGCTGGAGC TGGTGGTGTG TGGGCTC 

CAGGATTTAG GGGCGCCGCT GACCTCCCCG CAGCCCTGGC 0 
GAAGCGCTGC TGTCCCTGGC CATGCTGCTG CGTCTCTACC T( 
CTGCGCAGCG GCGTCCTGCT CAACGCTTCC TACCGCAGCA TCGGCGCTCT CAATCAAGTC 960 
CGCTTCCGCC ACTGGTTCGT GGCCAAGCTT TACATGAACA CGCACCCTGG CCGCCTGCTG 1020 
CTCGGCCTCA CGCTTGGCCT CTGGCTGACC ACCGCCTGGG TGCTGTCCGT GGCCGAGAGG 1080 
CAGGCTGTTA ATGCCACTGG GCACCTTTCA GACACACTTT GGCTGATCCC CATCACATTC 1140 
CTGACCATCG GCTATGGTGA CGTGGTGCCG GGCACCATGT GGGGCAAGAT CGTCTGCCTG 1200
TGCACTGGAG TCATGGGTGT CTGCTGCACA GCCCTGCTGG TGGCCGTGGT GGCCCGGAAG 1260 
CTGGAGTTTA ACAAGGCAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAGTATACC 1320
AAAGAGATGA AGGAGTCCGC TGCCCGAGTG CTACAAGAAG CCTGGATGTT CTACAAACAT 1380
ACTCGCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAAGCTGCT GGCCGCCATC 1440 
AACGCGTTCC GCCAGGTGCG GCTGAAACAC CGGAAGCTCC GGGAACAAGT GAACTCCATG 1500 
GTGGACATCT CCAAGATGCA CATGATCCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 1560 
CACCGGGCCC TGGAGAAACA GATTGACACG CTGGCGGGGA AGCTGGATGC CCTGACTGAG 1620 
CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 1680 
CTGGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTGGTG GACATCGTCT 1740
CTGCCACTCC TGACCCAGCC CTGAACAAAG CACCTCAAGT GCAAGGACCA AAGGGGGCCC 1800 
TGGCTTGGAG TGGGTTGGCT TGCTGATGGC TGCTGGAGGG GACGCTGGCT AAAGTGGGTA 1860 
GGCCTTGGCC CACCTGAGGC CCCAGGTGGG AACATGGTCA CCCCCACTCT GCATACCCTC 1920 
ATCAAAAACA CTCTCACTAT GCTGCTATGG ACGACCTCCA GCTCTCAGTT ACAAGTGCAG 1980 
GCGACTGGAG GCAGGACTCC TGGGTCCCTG GGAAAGAGGG TACTAGC-GGC CCGGATCCAG 2040 
GATTCTGGGA GGCTTCAGTT ACCGCTGGCC GAGCTGAAGA ACTGGGTATG AGGCTGGGGC 2100 

2160 



TTCCTGCTCT TCATTCTAGG TTACATCGTG GTGGGGATTG TGGCCTGGTT GTATGGAGAC 180 
CCCCGGCAAG TCCTCTACCC CAGGAACTCT ACTGGGGCCT ACTGTGGCAT GGGGGAGAAC 240

C CGTATCTCCT GTACTTCAAC ATCTTCAGCT GCATCCTGTC CAGCAACATC 300 

3 CTGAGAACGG CCTACAGTGC CCCACACCCC AGGTGTGTGT GTCCTCCTGC 360 

CCGGAGGACC CATGGACTGT GGGAAAAAAC GAGTTCTCAC AGACTGTTGG GGAAGTCTTC 420 

TATACAAAAA ACAGGAACTT TTGTCTGCCA GGGGTACCCT GGAATATGAC GGTGATCACA 480

AGCCTGCAAC AGGAACTCTG CCCCAGTTTC CTCCTCCCCT CTGCTCCAGC TCTGGGACGC 540

TGCTTTCCAT GGACCAACAT TACTCCACCG GCGCTCCCAG GGATCACCAA TGACACCACC 600 

ATACAGCAGG GGATCAGCGG TCTTATTGAC AGCCTCAATG CCCGAGACAT CAGTGTTAAG 660 

ATCTTTGAAG ATTTTGCCCA GTCCTGGTAT TGGATTCTTG TTGCCCTGGG GGTGGCTCTG 720 

GTCTTGAGCC TACTGTTTAT CTTGCTTCTG CGCCTGGTGG CTGGCCCCCT GGTGCTGGTG 780

CTGATCCTGG GAGTGCTGGG CGTGCTGGCA TATGGCATCT ACTACTGCTG GGAGGAGTAC 840 

CGAGTGCTGC GGGACAAGGG CGCCTCCATC TCCCAGCTGG GTTTCACCAC CAACCTCAGT 900 

GCCTACCAGA GCGTGCAGGA GACCTGGCTG GCCGCCCTGA TCGTGTTGGC GGTGCTTGAA 960 

GCCATCCTGC TGCTGGTGCT CATCTTCCTG CGGCAGCGGA TTCGTATTGC CATCGCCCTC 1020 

CTGAAGGAGG CCAGCAAGGC TGTGGGACAG ATGATGTCTA CCATGTTCTA CCCACTGGTC 1080 

ACCTTTGTCC TCCTCCTCAT CTGCATTGCC TACTGGGCCA TGACTGCTCT GTATCCTCTG 1140 

CCCACGCAGC CAGCCACTCT TGGATATGTG CTCTGGGCAT CCAACATCAG CTCCCCCGGC 1200 

TGTGAGAAAG TGCCAATAAA TACATCATGC AACCCCACGG CCCACCTTGT GAACTCCTCG 1260 

TGCCCAGGGC TGATGTGCGT CTTCCAGGGC TACTCATCCA AAGGCCTAAT CCAACGTTCT 1320

GTCTTCAATC TGCAAATCTA TGGGGTCCTG GGGCTCTTCT GGACCCTTAA CTGGGTACTG 1380 

GCCCTGGGCC AATGCGTCCT CGCTGGAGCC TTTGCCTCCT TCTACTGGGC CTTCCACAAG 1440 

h TCCCTACCTT CCCCTTAATC TCTGCCTTCA TCCGCACACT CCGTTACCAC 1500 

3 AGCCCTCATC CTGACCCTTG TGCAGATAGC CCGGGTCATC 1560 

ZAGAGGA GTGCAGAACC CTGTAGCCCG CTGCATCATG 1620 

TGCTGTTTCA AGTGCTGCCT CTGGTGTCTG GAAAAATTTA TCAAGTTCCT AAACCGCAAT 1680
GCATACATCA TGATCGCCAT CTACGGGAAG AATTTCTGTG T 
ATGCTACTCA TGCGAAACAT TGTCAGGGTG GTCGTCC 

CTGTTCTTTG GGAAGCTGCT GGTGGTCGGA GGCGTGGGGG TCCTGTCCTT CTTTTTTTTC 1860 

TCCGGTCGCA TCCCGGGGCT GGGTAAAGAC TTTAAGAGCC CCCACCTCAA CTATTACTGG 1920 

CTGCCCATCA TGACCTCCAT CCTGGGGGCC TATGTCATCG CCAGCGGCTT CTTCAGCGTT 1980

TTCGGCATGT GTGTGGACAC GCTCTTCCTC TGCTTCCTGG AAGACCTGGA GCGGAACAAC 2040 

GGCTCCCTGG ACCGGCCCTA CTACATGTCC AAGAGCCTTC TAAAGATTCT GGGCAAGAAG 2100 

AACGAGGCGC CCCCGGACAA CAAGAAGAGG AAGAAGTGAC AGCTCCGGCC CTGATCCAGG 2160 

ACTGCACCCC ACCCCCACCG TCCAGCCATC CAACCTCACT TCGCCTTACA GGTCTCCATT 2220 

TTGTGGTAAA AAAAGGTTTT AGGCCAGGCG CCGTGGCTCA CGCCTGTAAT CCAACACTTT 2280

GAGAGGCTGA GGCGGGCGGA TCACCTGAGT CAGGAGTTCG AGACCAGCCT GGCCAACATG 2340 

Seq ID NO: C207 DNA Sequence 

Nucleic Acid Accession #: NM_016100.1

Coding sequence: 26.. 1618 



CACGAACCTT CCTCTCCCAG TGGCCATGGG TAGCAACAGT GGGCAGGCTG GCCGCCACAT 
CTATAAATCC CTAGCTGATG ATGGCCCCTT TGACTCTGTG GAGCCGCCTA AAAGACCCAC 
" 3VGACTC ATCATGCACA GCATGGCCAT GTTCGGAAGA GAGTTCTGCT ACGCGGTGGA 
GGCAGCGTAT GTGACCCCAG TCCTGCTCAG CGTAGGTCTG CCCAGCAGCC TGTACAGCAT 
TGTGTGGTTC CTCAGCCCCA TCCTGGGATT CCTGCTGCAG CCCGTGGTCG GATCGGCCAG 
CGACCACTGC CGGTCCAGGT GGGGCCGCCG GAGACCCTAC ATCCTCACCC TGGGAGTCAT
GATGCTCGTG GGCATGGCTC TGTACCTCAA TGGGGCTACT GTTGTAGCAG CTTTGATTGC 
TAACCCAAGG AGGAAGCTGG TTTGGGCCAT AAGTGTCACC ATGATAGGTG TCGTTCTCTT 
TGATTTTGCT GCCGACTTCA TTGATGGGCC CATCAAAGCC TACTTATTTG ATGTCTGCTC 
CCATCAGGAC AAGGAGAAGG GCCTCCACTA CCATGCCCTC TTCACAGGTT TTGGAGGTGC 
CCTGGGTTAC CTTTTGGGTG CTATAGACTG GGCCCATCTG GAGCTGGGAA GACTGTTGGG 
TACAGAATTC CAGGTCATGT TCTTCTTCTC TGCATTGGTG CTCACTTTGT GTTTTACTGT 
TCATCTGTGC AGTATCTCTG AAGCCCCACT TACAGAGGTT GCAAAGGGCA TTCCCCCACA 
GCAAACCCCT CAGGACCCTC CATTGTCATC AGATGGAATG TACGAGTATG GTTCTATCGA 
GAAAGTTAAA AATGGTTACG TAAATCCAGA GCTGGCAATG CAGGGAGCAA AAAACAAAAA 
TCATGCTGAA CAGACTCGCA GGGCAATGAC ATTAAAGTCA CTGCTGAGAG CACTGGTGAA
CATGCCTCCT CACTACCGCT ACCTTTGCAT CAGCCACCTC ATTGGATGGA CGGCCTTCCT 
GTCCAACATG CTGTTCTTCA CAGATTTCAT GGGCCAGATT GTGTACCGCG GGGATCCCTA
TAGTGCACAC AACTCCACAG AGTTTCTCAT CTACGAAAGA GGAGTCGAGG TTGGATGTTG 
C ATCAACTCCG TGTTTTCCTC ACTTTATTCT TACTTTCAGA AAGTTTTGGT 

3 GTCTTTACTT CACGGGATAT TTGCTGTTTG GCCTGGGGAC 

T GGGCTCTTCC CGAATGTCTA CTCCACCCTG GTCCTGTGCA GCCTGTTTGG 
TGTAATGTCC AGCACCCTGT ACACTGTGCC CTTTAACCTC ATTACTGAGT ACCACCGCGA 
GGAAGAAAAG GAGAGGCAGC AGGCCCCAGG AGGGGACCCA GACAACAGCG TGAGAGGGAA 
GGGCATGGAC TGCGCCACCC TCACATGCAT GGTGCAGCTG GCTCAGATCC TGGTCGGAGG 
TGGCCTGGGC TTTCTGGTCA ACACAGCCGG GACCGTTGTC GTCGTGGTGA TCACAGCGTC 
TGCGGTGGCA CTGATAGGCT GTTGCTTTGT CGCTCTCTTT GTTAGATATG TGGATTAGGT 
CAATAAAGAG ACAATGACC 



Nucleic Acid Accession #
Coding sequence 



CGCCGCGGGG CCGGATCCTC C 
TGTTTCCT 

CCCACTCAGG CCCCCGGGCC CCGCTGGAAT TCGGAGGGCC CCTGGGTAAT 3GGGCAGAGA 
GATGGGACCT GGGGCAAAGG CTAAGCGAAG GAGAGCTGGA GCGGGTGAAC TAAGAGCGGG 
GGCGAGATCT GAGGATGGAA GGCTTTGGGG GTGTCGGAGG CAGAGGGACC CGGGGGTTTG 
3 GACGCCTGGG AAGGGAACTG GGAGGCAGCG G

CTGAGGACGG GGTGCGGAGG CGCACTCTGG GAATGCCGAG AGGGTCCCGC AGAGACGTCA 
GGGCGCCGTG CGGGCCGGCG GGGAGCTGGG GGGCTAGGGG CGGACCCCGA CGTGATGGCC
CTTCCCGCAG GCGCCGCGGC TCTGCTACTG CTGCTGCCCG CCACCATGTT C 
CTGGCGGCCC GTTCGGGCCC CGCGCGCCTG CTGGGTCCAC CCGCGTCCCT G 

GAGGTGCTGT GGAGCCCACG GGCGCTGCTG CTGTGGCTCG CCTGGCTCGC CCTGCAGGCG 780

GCGCTCTACC TACTGCCGGC GCGCAAGGTG CGGGCCCCGC TCGCGGACGC TCGGGGGAGG 840 

GAAGCGAATG GGCTCGGCGA GGGAAAGGAC GCCCCGGGCC TTATCAGAGC CCCCTTGGAC 900 

CCGCAGTGGC CGAGGGGCAG GAATTGAAGG ACAAGAGTCG CCTGCGCTAT CCTATTAACG 960

GCTTCCAGGC CCTGGTGCTG ACAGCCCTGT TGGTGGGGCT GGGGATGTCA GCGGGGCTGC 1020 

CTCTGGGGGC GCTCCCGGAA ATGCTCCTGC CCTTGGCGTT TGTCGCCACC CTCACCGCTT 1080

TCATCTTCAG CCTCTTTCTC TACATGAAGG CGCAGGTAGC CCCAGTTTCG GCCCTGGCAC 1140 

CTGGGGGGAA CTCAGGCAAT CCGATTTACG ACTTTTTTCT GGGACGAGAG CTCAACCCTC 1200 

GTATCTGTTT CTTCGACTTC AAATATTTCT GTGAACTGCG ACCCGGCCTC ATCGGCTGGG 1260 

TCCTCATCAA CCTGGCCCTG TTGATGAAGG AGGCAGAGCT TCGAGGCAGT CCCTCACTGG 1320 

CCATGTGGCT GGTCAATGGC TTCCAGTTGC TCTACGTGGG TGATGCCCTC TGGCACGAGG 1380 

AGGCCGTCCT CACCACCATG GATATCACAC ATGACGGGTT TGGCTTCATG CTGGCGTTTG 1440

GGGACATGGC CTGGGTGCCC TTCACCTACA GCCTGCAGGC CCAGTTCCTG CTGCACCACC 1500

CGCAGCCCCT GGGGTTGCCC ATGGCCTCTG TCATCTGCCT CATCAATGCT ACTGGTTACT 1560 

ACATCTTCCG TGGGGCGAAT TCCCAGAAAA ACACTTTCCG AAAGAATCCT TCTGACCCCA 1620 

GAGTGGCTGG GCTTGAGACC ATCTCTACAG CCACAGGGCG GAAACTGCTG GTGTCTGGGT 1680 

GGTGGGGTAT GGTCCGCCAT CCCAACTATC TTGGAGACCT CATCATGGCT CTGGCTTGGT 1740 

CCTTGCCCTG CGGGGTGTCA CACCTGCTGC CCTACTTCTA CCTCCTCTAC TTCACCGCGC 1800 

TGCTGGTGCA CCGTGAGGCC CGGGATGAGC GGAGTGCCTG CAGAAGTACG GCCTGGCCTG 1860 

GCAGGAGTAC TGCCGGCGTG TGCCTTACCG CATCATGCCC TACATCTACT GAAGCGGCTC 1920 
A GGTGGGGCAT GTGCCCACTC ATCCACCAGC ACACCCAGGA C 
T GGGACTCAAG GGCTTGCACC CCACCC 



I I I I I I 

CCAGTTCCGG CACGAGGACC ATGGGCCGGC TGCTGCGGGC CGCCCGGCTG CCGCCGCTGC 60

TTTCGCCGCT GCTGCTTCTG CTGGTTGGGG GAGCGTTCCT GGGTGCCTGT GTGGCTGGGT 120 

CTGATGAGCC TGGCCCAGAG GGCCTCACCT CCACCTCCCT GCTAGACCTC CTCCTGCCCA 180 

CTGGCTTGGA GCCACTGGAC TCAGAGGAGC CTAGTGAGAC CATGGGCCTG GGAGCTGGGC 240 

TGGGAGCCCC TGGCTCAGGC TTCCCCAGCG AAGAGAATGA AGAGTCTCGG ATTCTGCAGC 300

CACCACAGTA CTTCTGGGAA GAGGACGAAG AGCTGAATGA CTCAAGTCTG GACCTGGGAC 360

CCACTGCAGA TTATGTTTTT CCTGACTTAA CTGAGAAGGC AGGTTCCATT GAAGACACTA 420 

GCCAGGCTCA AGAGCTGCCA AACCTCCCCT CTCCCTTGCC CAAGATGAAT CTGGTTGAGC 480 

CTCCCTGGCA TATGCCTCCC AGAGAGGAGG AACAAGAGGA AGAGGAAGAG GAGGAGAGGG 540 

AGAAGGAAGA GGTAGAGAAA CAAGAGGAGG AGGAAGAGGA GGAGCTGCTC CCTGTGAATG 600 

GATCCCAAGA AGAAGCCAAG CCTCAGGTCC GTGACTTTTC TCTCACCAGC AGCAGCCAGA 660 

CCCCAGGGGC CACCAAAAGC AGGCATGAAG ACTCCGGGGA CCAGGCCTCA TCAGGTGTGG 720 

AGGTGGAGAG CAGCATGGGG CCCAGCTTGC TGCTGCCTTC AGTCACCCCA ACTACAGTGA 780 

CTCCGGGGGA CCAGGACTCC ACCAGCCAAG AGGCAGAGGC CACAGTGCTG CCAGCTGCAG 840

GGCTTGGGGT AGAGTTCGAG GCTCCTCAGG AAGCAAGCGA GGAAGCCACT GCAGGAGCAG 900 

CTGGTTTGTC TGGCCAGCAC GAGGAGGTGC CGGCCTTGCC TTCATTCCCT CAAACCACAG 960 

CTCCCAGTGG GGCCGAGCAC CCAGATGAAG ATCCCCTTGG CTCTAGAACC TCAGCCTCTT 1020 

CCCCACTGGC CCCTGGAGAC ATGGAACTGA CACCTTCCTC TGCTACCTTG GGACAAGAAG 1080 

ATCTCAACCA GCAGCTCCTA GAAGGGCAGG CAGCTGAAGC TCAATCCAGG ATACCCTGGG 1140 

ATTCTACGCA GGTGATCTGC AAGGACTGGA GCAATCTGGC TGGGAAAAAC TACATCATTC 1200 

TGAACATGAC AGAGAACATA GACTGTGAGG TGTTCCGGCA GCACCGGGGG CCACAGCTCC 1260

TGGCCCTGGT GGAAGAGGTG CTGCCCCGCC ATGGCAGTGG CCACCATGGG GCCTGGCACA 1320 

TCTCTCTGAG CAAGCCCAGC GAGAAGGAGC AGCACCTTCT CATGACACTG GTGGGCGAGC 1380 

AGGGGGTGGT GCCCACTCAA GATGTCCTTT CCATGCTGGC TGACATCCGC AGGAGCCTGG 1440

AGGAGATTGG CATCCAGAAC TATTCCACAA CCAGCAGCTG CCAGGCGCGG GCCAGCCAGG 1500 

TGCGCAGCGA CTACGGCACG CTCTTCGTGG TGCTGGTGGT CATTGGGGCC ATCTGCATCA 1560 

TCATCATTGC GCTTGGCCTG CTCTACAACT GCTGGCAGCG CCGGCTCCCC AAGCTCAAGC 1620

ACGTGTCGCA CGGCGAGGAG CTGCGCTTCG TGGAGAACGG CTGCCACGAC AACCCCACGC 1680 

TGGACGTGGC CAGCGACAGC CAGTCGGAGA TGCAGGAGAA GCACCCCAGC CTGAACGGCG 1740 

GCGGGGCCCT CAACGGCCCG GGGAGCTGGG GGGCGCTCAT GGGGGGCAAG CGGGACCCCG 1800 

AGGACTCGGA CGTGTTCGAG GAGGACACGC ACCTGTGAGC GCAGCGAGGC GCAGGCCGAG 1860 

TGGGCCGCCA GGACCAAGCG AGGTGGACCC CGAAACGGAC GGCCCGGAGC CCGCACCAGC 1920 

CCCGCGCCTA CCCGGCCGCC CCCGCGCCTG GCCCTCGGCG CGGGCTCCTT CCCGCTTCCC 1980 

CCGACTTCAC ACGGCGGCTT CGGACCAACT CCCTCACTCC CGCCCGAGGG GCAGGCCICA 2040 

AAGCCCGCCT TGGCCCCGCT TTCCCGCCCC TGAACCCCGG CCCCGCGGGC GGCGGGCGCG 2100 

CTTCCTGCGC CCCGGGACTC AATTAAACCC GCCCGGAGAC CACGCGGGCC CAGCGAAAAA 2160 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2220 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 2269 

Seq ID NO: C210 DNA Sequence 
CTCCTGGAAC CCCCGACCAT GGAGGTTCTT GGCATGACTG ACTCIGAAGA G
CCTATGGAGG ACTTCGATTC TTTGGAATGC ATGGAGGGCA GTGACGCATT G3CCCTGCGG 
3GGACGA GATGGACGTG AGCCTCAGGG CCCCGCGCCT GGCCCAGCTC 
G CCATGCACAG CCTGGGTCTG GCTTTCATCT ACGACCAGAC TGAGGACATC 
C TTAGAAGTTT CATGGACGGT TTCACCACAC TTAAGG2 
T CCCCGAACCC CGGGTCCTGG GTGTCCTGCG AACAGGTGCT G 
CTGCTGCTGC TGGCGCTGCT GCTGCCGCTG CTCAGCGGGG GCCTGCACCT GCTGCTCAAG 
TGAGGCCCCG GCGGCTCAGG GCGGGGCTGG CCCCACCCCC ATGACCACTG CCCTGC-AGGT 
GGCGGCCTGC TGCTGTTATC TTTTTAACTG TTTTCTCATG ATGCCTTTTT ATATTTAAAC 
CCCGAGATAG TGCTGGAACA CTGCTGAGGT TTTATACTCA GGTTTTTTGT TTTTTTTTTA 
TTCCAGTTTT CGTTTTTTCT AAAAGATGAA TTCCTATGGC TCTGCAATTG TCACCGGTTA 
ACTGTGGCCT GTGCCCAGGA AGAGCCATTC ACTCCTGCCC CTGCCCACAC GGCAGGTAGC 
AGGGGGAGTG CTGGTCACAC CCCTGTGTGA TATGTGATGC CCTCGGCAAA GAATCTACTG 
GAAT AGATT C CGAGGAGCAG GAGTGCTCAA TAAAATGTTG GTTTCCAGCA AAAAAAAAAA 



Seq ID NO: C211 E 
Nucleic Acid Acce 
Coding sequence: 



GCTGCTCCCG ACGCGGAGCC CGGAGCCCGC GCCGAGCCCC TGGCCTCGCG GTGCCATGCT 
GCCCCGGCGG CGGCGCTGAA GGATGGCGAC GCCGCTGCCT CCGCCCTCCC CGCGGCACCT 
GCGGCTGCTG CGGCTGCTGC TCTCCGGCCT CGTCCTCGGC GCCGCCCTGC GTGGAGCCGC 
CGCCGGCCAC CCGGATGTAG CCGCCTGTCC CGGGAGCCTG GACTGTGCCC TGAAGAGGCG 
GGCAAGGTGT CCTCCTGGTG CACATGCCTG TGGGCCCTGC CTTCAGCCCT TCCAGGAGGA 



G ACTTCCTGGC CCAGGAGCTT GCCCGGAAGG AGTCTGGACA 
ATCAACTCCG CCCCTACCCA AGGACCGACA GCGGCTCCCG GAGCCTGCCA CCCTGGGCTT 
CTCGGCACGG GGGCAGGGGC TGGAGCTGGG CCTCCCCTCC ACTCCAGGAA CCCCCACGCC 
CACGCCCCAC ACCTCCCTGG GCTCCCCTGT GTCATCCGAC CCGGTGCACA TGTCGCCCCT 
GGAGCCCCGG GGAGGGCAAG GCGACGGCCT CGCCCTTGTG CTGATCC 
GGCCGGTGCA GCCGCCCTCT CCGTAGCCTC CCTCTGCTGG TGCAGGCTGC A 
CCGCCTGACT CAGAAGGCCG ACTACGCCAC TGCGAAGGCC CCTGGCTCAC CTGCAGCTCC 
CCGGATCTCG CCTGGGGACC AGCGGCTGGC ACAGAGCGCG GAGATGT" — 
CCAACGGCAA CAGATGCTGT GCCTGGAGCG GCATAAAGAG CCACCCAAGG A 

S AGACTTCACG GTGTACGAGT GCCCGGGCCT 



G GCCCCCAGCT CACCGCCTGC ACTGCCATGA CCTGGAGGCA GACAGACGCC 
CACCTGCTCC CCGACCTCGA GGCCCCCGGG GAGGGGCAGG GCCTGGAGCT TCCCACTAAA 
T GCTTTTGGCT GGGCCTCGGG CTCCAGGCCC TGGGACCCCT 
C CTTTGTGCCA GGACA; TC < CCCTC CACCTCTCCT 
GTTCGGTTTA GACCCCCAAA CTGGAGGGGG CATGGAGAAC CGTAGAGCGC AGGAACGGGT 
GGGTAATTCT AGAGACAAAA GCCAATTAAA GTCCATTTCA G. 

Seq ID NO: C212 DNA Sequence 
Nucleic Acid Accession #: NM_004445.1



CGGAGGGGGC GGGCCGGGCT GCGTTCGCTC CAGCCGCGGC TCTACAGCAG C 
GACCCGGGAC CCAGCTTGGC GACGGCGATT CTCGACGCGG GCCCCCAGGA TTCTCCCGGC 
=A GCGTCAGGTC CACCCCGGAA TCCCAGGGAC 



GTGGCACCGT GCGAGCTCCA GGAGCCCCGG GTCCACTGCG AGGCCTCGGG GGGCGCAGAC 
CTGCAGAGAC TGCGGCCAAC GGGAAGAAAT AAAGGGATTA TAGTCCACCC AATTCACAGA 
CTTCTGAGAC TCAGACACGA GGAGAGATAG AGAACCGCCA ATCTCTAGAT CAACAAGCAA 
AGGAGGTGCC AAGCCTGTTT GTCTTCATTG TGACACTGGA GTCTAGATGC TGGGAAGTCC 
AAGATCAGGG TGCCGGCATG GTCAGTTCCT GGCGAAGCCT CICTTCTAGG TTTCAGACTG 



ACCCAGGAGC AGGGTGGTGG 720



GGGAACAGAG TGGCGGGCAT GGTGTGTAGC CTATGGGTGC TG' 

CTGGCTCTGG AAGAGGTATT GCTGGACACC ACCGGAGAGA CATCTGAGAT TGGCTG3CTC 900 

ACCTACCCAC CAGGGGGGTG GGACGAGGTG AGTGTTCTGG ACGACCAGCG ACGCCTGACT 960 

CGGACCTTTG AGGCATGTCA TGTGGCAGGG GCCCCTCCAG GCACCGGGCA GGACAATTGG 1020 

TTGCAGACAC ACTTTGTGGA GCGGCGCGGG GCCCAGAGGG CGCACATTCG ACTCCACTTC 1080

TCTGTGCGGG CATGCTCCAG CCTGGGTGTG AGCGGCGGCA CCTGCCGGGA GACCTTCACC 1140 

CTTTACTACC GTCAGGCTGA GGAGCCCGAC AGCCCTGACA GCGTTTCCTC CTGGCACCTC 1200 

AAACGCTGGA CCAAGGTGGA CACAATTGCA GCAGACGAGA GCTTTCCCTC CTCCTCCTCC 1260 

TCCTCCTCCT CTTCTTCCTC TGCAGCGTGG GCTGTGGGAC CCCACGGGGC TGGGCAGCGG 1320 

GCTGGACTGC AACTGAACGT CAAAGAGCGG AGCTTTGGGC CTCTCACCCA ACGCGGCTTC 1380 

TACGTGGCCT TCCAGGACAC GGGGGCCTGC CTGGCCCTGG TCGCTGTCAG GCTCTTCTCC 1440 

TACACCTGCC CTGCCGTGCT CCGATCCTTT GCTTCCTTTC CAGAGACGCA GGCCAGTGGG 1500 

iCCTGTG TGGCTCATGC AGAGCCAGAG 1560 

C AGCCCCCCCA GGCTGCACTG CAACGGGGAG 1620 

GGCAAGTGGA TGGTAGCTGT CGGGGGCTGC CGCTGCCAGC CTG3ATACCA ACCAGCACGA 1680 

GGAGACAAGG CCTGCCAAGC CTGCCCACGG GGGCTCTATA AGTCTTCTGC TGGGAATGCT 1740 

CCCTGCTCAC CATGCCCTGC CCGCAGTCAC GCTCCCAACC CAGCAGCCCC CGTTTGCCCC 1800 

TGCCTGGAGG GCTTCTACCG GGCCAGTTCC GACCCACCAG AGGCCCCCTG CACTGGTCCT 1860

CCATCGGCTC CCCAGGAGCT TTGGTTTGAG GTGCAAGGCT CAGCACTCAT GCTACACTGG 1920 






CGCCTOCCTC GGGAGCTOGG GGGTCGAGGG GACCTGCTlT ^AATGTCGT GTGCAAGGAG 1980 

TGTGAAGGCC GCCAGGAACC TGCCAGCGGT GGTGGGGGCA CTTGTCACCG CTGCAGGGAT 2040 

GAGGTCCACT TCGACCCTCG CCAGAGAGGC CTGACTGAGA GCCGAGTGTT AGTGGGGGGA 2100

A CATCTTAGAG GTGCAGGCTG TTAATGGGGT GTCTGAGCTC 2160

C CTCCTCAGGC TGCAGCCATC AATGTCAGCA CCAGCCATGA AGTGCCCTCT 2220 

A GGTGAGCCGG GCATCCAACA GCATCACGGT GTCCTGGCCG 2280 

G GACTATCAGC TCCGCTACTA TGACCAGGCA 2340 

GAAGACGAAT CCCACTCCTT CACCCTGACC AGCGAGACCA ACACTGCCAC CGTGACACAG 2400 

CTGAGCCCTG GCCACATCTA TGGTTTCCAG GTGCGGGCCC GGACTGCTGC CGGCCACGGC 2460

CCCTACGGGG GCAAAGTCTA TTTCCAGACA CTTCCTCAAG GGGAGCTGTC TTCCCAGCTT 2520 

I CCTCCTGCTG 2580
2640 

ACCTACGAGG ACCCCTGTCA GGCCATCCGA GAACTTGCCC GGGAAGTCGA TCCTGCTTAT 2760 

ATCAAGATTG AGGAGGTCAT TGGGACAGGC TCTTTTGGAG AAGTGCGCCA GGGCCGCCTG 2820

CAGCCACGGG GACGGAGGGA GCAGACTGTG GCCATCCAGG CCCTGTGGGC CGGGGGCGCC 2880 

GAAAGCCTGC AGATGACCTT CCTGGGCCGG GCCGCAGTGC TGGGTCAGTT CCAGCACCCC 2940 

AACATCCTGC GGCTGGAGGG CGTGGTCACC AAGAGCCGAC CCCTCATGGT GCIGACGGAG 3000 

TTCATGGAGC TTGGCCCCCT GGACAGCTTC CTCAGGCAGC GGGAGGGCCA GTTCAGCAGC 3060 

CTGCAGCTGG TGGCCATGCA GCGGGGAGTG GCTGCTGCCA TGCAGTACCT GTCCAGCTTT 3120

GCCTTCGTCC ATCGCTCGCT GTCTGCCCAC AGCGTGCTGG TGAATAGCCA CTTGGTGTGC 3180 

AAGGTGGCCC GTC1TGGCCA CAGTCCTCAG GGCCCAAGTT GTTTGCTTCG CTGGGCAGCC 3240 

CCAGAGGTCA TTGCACATGG AAAGCATACA ACATCCAGTG ATGTCTGGAG CTTTGGGATA 3300

CTCATGTGGG AAGTGATGAG TTATGGAGAA CGGCCTTACT GGGACATGAG TGAGCAGGAG 3360 

GTACTAAATG CAATAGAGCA GGAGTTCCGG CTGCCCCCGC CTCCAGGCTG TCCTCCTGGA 3420

TTACATCTAC TTATGTTGGA CACTTGGCAG AAGGACCGTG CCCGGCGGCC TCATTTTGAC 3480 

CAGCTGGTGG CTGCATTTGA CAAGATGATC CGCAAGCCAG ATACCCTGCA GGCTGGCGGG 3540 

3 AAAGGCCTTC CCAGGCCCTT CTGACCCCTG TGGCCCTGGA CTTTCCTTGT 3600 

C CCCAGGCCTG GCTTTCAGCC ATTGGACTGG AGTGCTACCA GGACAACTTC 3660 

3720 

GCCCTGGGCA TCACCCTGGC TGGCCACCAG AAGAAGCTGC TGCACCACAT CCAGCTCCTT 3780 

CAGCAACACC TGAGGCAGCA GGGCTCAGTG GAGGTCTGAG AATGACGATA CCCGTGACTC 3840 

AGCCCTGGAC ACTGGTCCGA GAAGGGACAT GTGGGACGTG AGCCGGGCTC CAACAGCCTC 3900 

TGTGAGAGAT GCCCCACACC AAACCCAACC CTCCCGATGG CTGCATTCCC TGGTCCTCCG 3960 

CCTCTCCACC AGCCCCCTCC TCATTAAAGG GAAAGAAGGG AATTTGCAAA 4010



Seq ID NO: C213 DNA Sequence
Nucleic Acid Accession #:
Coding sequence: 195..1067



CGCGGAGCGT CACTTCCCGG CAGCGGGAGG CGAGTGG CGA 
CTCGGACCCA CGCGGCGCCG CGGCCCGCCT GGCCTGCAGC GCTCCCACCC 




GTTTACCCAA TCTGCACTGC GAGTTGGTTG GGCTTGACAT TCAGGATGAG A 
ACGAAGTGGG CCACATCGAC AACTCCATGA AGATCCCGCT GAACAATGGG GCAGGCTGCC 
GCTTCGAGGG GCAGTTCAGC ATCAACAAGG TCCCCGGCAA C 
GTGCCACAGC CCAGCCACAG AACCCAGACA TGACGCATGT CATCCACAAG C 
GGGACACGCT ACAGGTCCAG AACATCCACG GAGCTTTCAA TGCTCTCGGG GGAGCAGACA 
GACTCACCTC CAACCCCCTG GCCTCCCACG ACTACATCCT GAAGATTGTG CCCACGGTTT 



ACGTCGCCTA CAGCCACACG GGCCGCATCA TCCCTGCAAT CTGGTTCCGC TACGACCTCA 900 

GCCCCATCAC GGTCAAGTAC ACAGAGAGAC GGCAGCCGCT GTACAGATTC ATCACCACGA 960 

TCTGTGCCAT CATTGGCGGG ACCTTCACCG TCGCCGGCAT CCTGGACTCA TGCATCTTCA 1020 

CAGCCTCTGA GGCCTGGAAG AAGATCCAGC TGGGCAAGAT GCATTGACGC CACACCCAGC 1080

CTAATGGCCG AGGACCCTGG GCATCGCCAG CC1TGCCTCC AGTGCCCTGT CTCCTTTGGC 1140 

CCTCAATCTG GTCCCAAATC TGGCTGTGTC CCAAAGGGTG TGTGGGAAGT GGGGGGAAAG 1200 

TAGAGGATGG CTCGATGTTT TGCAGCTACC TCTTTTCCCC GTGnTCTIT TTAGACAAAT 1260 

TACACTGCCT GAAGTTGCAG TTCCCCTTTC CCTGGGGAGC CCCAAGAACA GAGTCAGGCA 1320 

AGGGGTGGGG AGTCCAGGGG AACATCCCAG AATGCATATC GATCAGCTCT CAGCCAGGCT 1380

TCGACAATCT CGCAGCCCCC ACTAGGTGGA CACATTAATG ATTIGGTTTC TCCCCTGGGC 1440

AGCCAACCTG CCCCAGAGGC ACCAGACCTG GGCTTTCAGC TTTGGGACCA GGCTGCCCAA 1500 

AGGTACTCCT TTATACACCC GGCACCTTCC ACGAAAGATG GTACTTCCCA AGCAAGCCCC 1560 

TATGATTTGT CACTATAGAT GGAAATGTGT GGCATGCACA TGAGTTGAAA TTCTTTTATG 1620 

CATTTTTTTG AAGAAAAAAA AAAAAACAAC TCTGAGGACA TAGGGGAIGT CAGTTTCCTA 1680

TGGAAGAGAC ACCTCTGACC CGTTATTCTT ATAATCAAAA TCTGAAGGGA AAAAAATGTT 1740 

TTAGTTCTTT CCCCACTCGT TGGGTTCAAC TAGATTAAAA GGCTGATTTT CAG 1793 
Seq ID NO: C214 DNA Sequence
C TGCGGGGACC CTGCTACTTC TGACAGCCAT CGGGGCGGCA TCCTGGGCCA
TTGTGGCTGT TCTCCTCAGG AGTGAC CAGG AGCCGCTGTA CCCAGTGCAG GTCAGCTCTG 
CGGACGCTCG GCTCATGGTC TTTGACAAGA CGGAAGGGAC GTGGCGGCTG CTGTGCTCCT 
CGCGCTCCAA CGCCAGGGTA GCCGGACTCA GCTGCGAGGA GATGGGCTTC CTCAGGGCAC 
TGACCCACTC CGAGCTGGAC GTGCGAACGG CGGGCGCCAA TGGCACGTCG GGCTTCTTCT 
GTGTGGACGA GGGGAGGCTG CCCCACACCC AGAGGCTGCT GGAGGTCATC TCCGTGTGTG 
ATTGCCCCAG AGGCCGTTTC TTGGCCGCCA TCTGCCAAGA CTGTGGCCGC AGGAAGCTGC 
3 CATCGTGGGA GGCCGGGACA CCAGCTTGGG CCGGTG3CCG TGGCAAGTCA 



TGACAGCCGC CCACTGCTTC CCGGAGCGGA ACCGGGTCCT GTCCCGATGG CGAGTGTTTG 900 

CCGGTGCCGT GGCCCAGGCC TCTCCCCACG GTCTGCAGCT GGGGGTGCAG GCTGTGGTCT 960 

ACCACGGGGG CTATCTTCCC TTTCGGGACC CCAACAGCGA GGAGAACAGC AACGATATTG 1020

CCCTGGTCCA CCTCTCCAGT CCCCTGCCCC TCACAGAATA CATCCAGCCT GTGTGCCTCC 1080 

CAGCTGCCGG CCAGGCCCTG GTGGATGGCA AGATCTGTAC CGTGACGGGC TGGGGCAACA 1140 

CGCAGTACTA TGGCCAACAG GCCGGGGTAC TCCAGGAG3C TCGAGTCCCC ATAATCAGCA 1200 

ATGATGTCTG CAATGGCGCT GACTTCTATG GAAACCAGAT CAAGCCCAAG ATGTTCTGTG 1260

CTGGCTACCC CGAGGGTGGC ATTGATGCCT GCCAGGGCGA CAGCGGTGGT CCCTTTGTGT 1320 

GTGAGGACAG CATCTCTCGG ACGCCACGTT GGCGGCTGTG TGGCATTGTG AGTTGGGGCA 1380 

CTGGCTGTGC CCTGGCCCAG AAGCCAGGCG TCTACACCAA AGTCAGTGAC TTCCGGGAGT 1440 

GGATCTTCCA GGCCATAAAG ACTCACTCCG AAGCCAGCGG CATGGTGACC CAGCTCTGAC 1500 

CGGTGGCTTC TCGCTGCGCA GCCTCCAGGG CCCGAGGTGA TCCCGGTGGT GGGATCCACG 1560 

CTGGGCCGAG GATGGGACGT TTTTCTTCTT GGGCCCGGTC CACAGGTCCA AGGACACCCT 1620 

CCCTCCAGGG TCCTCTCTTC CACAGTGGCG GGCCCACTCA GCCCCGAGAC CACCCAACCT 1680 

CACCCTCCTG ACCCCCATGT AAATATTGTT CTGCTGTCTG GGACTCCTGT CTAGGTGCCC 1740

" " 3 GATGCTCTTT AAATAATAAA GATGGTTTTG ATT 1783 



| | | | | |

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300

TGTGGAGACG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 360 

CACTACCACG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420

CTACACCAAT CACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCCGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600

CACAATTCTG AAACCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACCGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GT CACTGACC TCOGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA CAGCCTGTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 1620

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GC CAAATAT A CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCI GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAAICTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGIAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCCGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCGGG CAGAICATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2580

GCGTGGGCCT GGAAGGCGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820

CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 

CTCAAGTGAT CTGTCCGCCT CGGCCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 

CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 
TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA 1

GAACCAGGTA AGCCCCAAAC CCAGATGTTC C 

GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC C 

GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGOCTGAG G 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA G 
AAAAGGAACT GAATCCCAGG C 



T G AG ACCCT G TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC 
CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG 
AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 
GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 
TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCITCTTT CTTCTACTGG 
CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTC 

C CTCTCTCCAG TGACCATTTG GTGACCAGAT G 

!i ATGCTCATCT GATTAGCCAC 4140 

Z AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 

3 ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260 

TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320

TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAGCTCCA GCCTAAAGTC 4380 

TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 4500 

TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560 

ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620 

GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680 

ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860 

CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATGCT TCTGCTTCAG 4920 

CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAGAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040 

ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100 

CTACGTGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160 

TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

TTGAGTCCTG GAACACGCAG TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280 

AT ACAAATG A AGATTGTGGT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT CGAATTTTTT TTCATCAGGG 5460 

CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGCGTTAA TAGTTTGTTA GAAGATGGGT 5520 

TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567 

Seq ID NO: C216 DNA Sequence 
Coding sequence: 26-952 



ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 
CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 
AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 



AGCCTTGCAA GACCCCAAGC 



CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCICATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGT CTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTBAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 

AAAA 1204 

Seq ID NO: C432 DNA Sequence



GGCACGAGGT GTTGCCCTCA GGTCGCTCCC GGGCGCGGAC ACGGAACCCG GCCATGGAAG 
ATCCGTCGGG GGCTCGCGAG CCCCGGGCCC GGCCGAGAGA GCGGGACCCG GGACGGCGCC 
CCCACCCAGA CCAAGGCCGC ACCCACGATC GACCGCGGGA CCGACCCGGG GACCCGCGCA 
GGAAGCGAAG CAGCGACGGG AACCGGCGAA GGGACGGGGA CCGGGACCCG AAGAGAGACC 
AGGAGAGGGA CGGGAACCGC GACCGGAACC GGGACCGGGA GAGGGAGAGA GAGAGGGAAA 
GAGACCCGGA CCGAGGCCCC CGCCGGGACA CACACAGGGA CGCGGGCCCT CGCGCAGGTG 
AACACGGAGT TTGGGAAAAA CCGCGCCAAA GCCGGACGCG GGACGGAGCC CGGGGACTGA 
CCTGGGACGC AGCCGCGCCT CCTGGGCCCG CGCCCTGGGA AGCCCCGGAG CCGCCGCAGC 
A GGGAGACCCC GGGCGCCGCA GACCCGAAAG TGAACCCCCT TCGGAGAGAT
ATCTGCCCTC GACCCCCAGG CCTGGACGAG AGGAGGTGGA ATATTACCAG TCAGAGGCGG 
AAGGACTCCT GGAATGCCAC AAATGCAAAT ACTTGTGCAC TGGGAGAGCC TGCTGCCAAA 
TGCTGGAGGT TCTCCTGAAC TTGCTGATCC TGGCCTGCAG CTCTGTGTCT TACAGTTCCA 
CAGGGGGCTA CACGGGCATC ACCAGCTTGG GGGGCATTTA CTACTATCAG TTCGGAGGGG 
CTTACAGTGG CTTTGATGGT GCTGACGGGG AGAAGGCCCA GCAACTGGAT GTCCAGTTCT 
ACCAGCTAAA GCTGCCCATG GTCACTGTGG CAATGGCCTG TAGTGGAGCC CTCACAGCCC 
TCTGCTGCCT CTTCGTTGCC ATGGGTGTCC TGCGGGTCCC GTGGCATTGT CCACTGTTGC 
TGGTGACCGA AGGCTTGTTG GACATGCTCA TCGCGGGGGG GTACATCCCG GCCTTGTACT 



TGTACCAAAG CAAAGGCTAC AGCGGTTTCG GCTGCAGTTT CCACGGAGCA GATATAGGAG 1140 

CTGGAATCTT TGCTGCCCTG GGCATTGTGG TCTTTGCCCT GGGGGCGGTC CTGGCCATAA 1200 

AGGGCTACCG AAAAGTTAGG AAGCTAAAAG AGAAGCCAGC AGAAATGTTT GAATTTTAAG 1260 

GGTTTCTAAA ACGCTCTGAC AGATGCAAGT GGTGGTGGAA GGTAGTCTGA GCCACTGCCT 1320 

TTCCCAAGAA TCCCTTGTTG TGGAAGTTTC CAATGCTGGA AAAGCAGCGA GCCAGCGTTG 1380

GTGTGGTGGG CGGAGCTCCC AGTCGCATGG AGCGGTGTTC ATGGATGCAA CAGACCCTGG 1440 

CTTCTGGAGT CCTCTGTGAG TGAGGGACCA ATCAAAATTA TTTTTCAAAA AGCAAAAAAA 1500 

TGGCCGGCCT CGGCGGCTCA CACCTGTAAC CCCAGCACTT TGGGAGGCIG AGGTGGGTGG 1560 

ATCACTTGAG GTCAGGAGCT CGAGACCAGC TTGGCCAACA TGGTGAGCCC CCGTCTCTAC 1620 

TAAAATACAA AAAAATTAGC CAGGCGTGGT GGCGGGCGCC TGTAATCCCA GCTACTTGGG 1680

AGGCTGAGGC AGGAGAATCG CTTGAATCTG GGAGGCGGAG ATTGCAGTGA GCCGAGATCC 1740 

CGCCACTGCA CTCCAGCCCA GGTGACAGAG CGAGACTCCA TCTCAAAAAA AAAAAAAAAA 1800 



Seq ID NO: C434 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 



GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG
CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGT CT GAACCCAGAT TTCCCAGACT 
AG CTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT C 
ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC G 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC T 
TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC A
CTGGGTGCCC TGACCAGAGC CCTGAGTTGC A 
ACCATGTGCA TATCGGCCAG CGCAAGACAC TGCTGCTCAC C 


CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT AT ACAGG CCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGIACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTT C 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860

T TAAGGCAG C ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTC1TTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC C1CCTTGTCA 2160

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220

ACTCCTACCC AGGGTACAT C CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760



CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG C 
TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 
CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 
G GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 
A CACCACATTG CCTCAACAAC CGGCCCCAGA 
AAATGGGGAC AAGTCCCCTC 
2 CCACCTTATA GAGAGCCCAA A

A GTCTCCCTGG 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC T 

G CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 
C TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 
GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGT CACA 
GGAAGGACTT CTT CCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 
TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 
GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 
TTCATTTAAA AAAGATAT CT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 
AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT AT CT AGG CAT 
TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 
TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 
TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 
TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 
ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 
GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 



MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK 
LLLTHTERW IWPFSNKNYI HGELYKNRVS ISNNAEQSDA SITIDQLTMA DNGTYECSVS 
LMSDLEGNTK SRVRLLVLVP PSKPECGIEG ETIIGNNIQL TCQSKEGSPT PQYSWKRYNI 
LNQEQPLAQP ASGQPVSLKN ISTDTSGYYI CTSSNEEGTQ FCNITVAVRS PSMNVALYVG 
IAVGWAALI IIGIIIYCCC CRGKDDNTED KEDARPNREA YEEPPEQLRE LSREREEEDD 
YRQEEQRSTG RESPDHLDQ 



| | | | | |

L LLLPPPPRVG GI 

V LQGGAVYLCP KGASPTQCTP 
- EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS 

i GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQILSATQEQ 
, INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 
GYVTILNG3D IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 
EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 
QQGWFVFPG GPGGLGSKPS QVLQPLWAAS H 
VDKAWYRGR PIVSASASLT IFPAMFNPEE R: 
GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA R 
LSPIHIALNF SLBPQAPVDS HGLRPALHYQ SKSRIEDKAQ I LLDCGEDNI CVPDLQLEVF 
GEQNHVYLGD KNALNLTFHA QKVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 
FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 
SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 
SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 
SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 
YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGXj LLLGLLIYIL 
YKLGFFKRSL P YGTAME KAQ LKPPATSDA 

Seq ID NO: C219 Protein Sequence 



1 11 21 31 41 51

| | | | | |

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRKSGPV 
VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRHE QTHLTYRIEN 
YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY
TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 
FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 
PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA N 



MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 
VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRHE QTHLTYRIEN 
YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 
LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 
TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTPDAITTI K
FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGONVLHGY 
PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA N! 



1 11 21 31 41 51 

| | | | | |

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 
GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYASL GTTIKKSGGH YTYILEVFGP 
LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITWMVLN 
SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQM FKDAFSGRDS SITRLPLAFY 
YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS 
NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 
RKHTPLPAVI VLHPLTMIHL FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 
FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 
SEKITRTLQI ILEWPEEDK L 



Seq I 



17 PDDKFODLVD AVRAEKGFLL LASLRQMKKT RGTLLALERK DHSGQVFSW 120 

SNGKAGTLDL SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENAELD 180 

VPIQSVFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVFG TTPEDILRNK GCSSSTSVLL 240 

TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDELSSM VLELRGLRTI VTTLQDSIRK 300 

VTEENKELAN ELRRPPLCYH NGVQYRNNEE WTVDSCTECH CQNSVTICKK VSCPIHPCSH 3 60 

ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGEARE 480 

TKACKKDACP INGGWGPWSP WDICSVTCGG GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGIIYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

LPNSGQEDYD KDGIGDACDD DDDNDKIPDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN 780

HNPDQADTDN MGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDii 900 

DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISETDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKELVQTV NCDPGLAVGY DEFNAVDFSG TFFINTERDD 1020 

DYACFVFCYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHLRNAL 1080 

WHTGNTPGQV RTLWHDPRHI GWKDFTAYRW RLSIIRPKTGF IRWMYSGKK IMADSGPIYD 1140 

j FVFSQEMVFF SDLKYECRDP 1170 






SEIITFAESG TARKTLHFEI SKEGSDLSW ERAEVWLFLK VPKANRTRTK VTIRLFQQQK 
HPQGSLDTGE EAEEVGLKGE RSELLLSEKV VDARKSTWHV FPVSSSIQRL LDOGKSSLDV 
RIACEQCQES GASLVLLGKK KKKEEEGEGK KKGGGEGGAG ADEEKEQSHR PFLMLQARQS 
EDHPHRRRRR GLECDGKVNI CCKKQFFVSF KDIGWNDWII APSGYHANYC EGECPSHIAG 
TSGSSLSFHS TVINHYRMRG HSPFANLKSC CVPTKLRPMS MLYYDDGQNI IKKDIQNMIV 



Seq ID NO: C224 Protein Sequence 
Protein Accession #: NP_000086.1

1 11 21 31 41 51 

| | | | | |

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 
NGSHCTDWE CKAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRKPDQRNTD EDKWGDACDN CRSQKNUDQK 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWLD 

PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT V 

FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL R 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSK V 
RLGVFCFSQE NI IWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: C225 Protein Sequence 
MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YKGMCLQGPA
GVPGRDGSPG ANGIPGTPGI PGRDGFKGBK GECLRESFEE SNTPNYKQCS WSSLNYGIDL 
K MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 
M IHRTSSVEGL CEGIGAGLVD V 



1 11 21 31 41 51 

| | | | | |

MPKRAHWGAL SWLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 
NLGFHSIQAL SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 
QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 
RLSTIRHLYL AEHMVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 
CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRONRSR SIEEEQEQEE 
DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNOT DPPDIDINAT 
VALDFECPHT RENYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 
VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 
PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 
RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 
ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 
HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 
EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 
ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 
SSADVPLLGE EEIWLGTISS ASMGLEHNHN GVILVEPEVT STPLEEV 
TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG S 
EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 
STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 
RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 
TMSTHESRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 
VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 
PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 
SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVE TMGEFKEESS PVGFPGTPTW 
NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLESLTVST PFHQEEAGSS 
TTLSSIKVEV ASSQAETTTL DQDIILETTVA ILLSETRPQN IITPTAARMKE PASSSPSTIL 
t S TTK PALPSPRISQ ASRDSKRNVF LNYVGN PETE ATPVNNEGTO HMSGPNELST 

PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVK ASHQLTRVPA KPILPTATVR 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 
PSIPSKFTDR RTDQFNGYSK VFGNWNIPEA RNPVGKPPSP RIPHYSNGRL PFFTNKTLSF 
PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPQTTGSPS 
TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFKSLG EKPQILTKSP 
QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 
QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECLAKGTPAP 
QISHIFPDRR WQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 
HVAALPPVIH QEKLENISLF PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 
VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 
GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLWKS VTDKDAGDYL 
CVARKKVGDD YWLKVDWM KPAKIEHKEE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 
DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG DYTCFAENQV GKDEHRVRVK 
WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 
GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 
RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 
RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 
GTDLQSGQQL QRFYHKADGM LHISGLSSVD AGAYRCVARK AAGHTERLVS LKVGLKPEAN 
KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 
ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 
PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 
KTTYIHVF 

Seq ID NO: C228 Protein Sequence



I I I I 

K TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKWA 
K QIANRRVLIE NLIPDTVYEF AVRI SQGERD GKWSTSVFQR 
G KPTWAASWD ALPETEGKVK VCLLDTGLFS VSSFQPSAKS 
FQNTFFHTPR LSNHLEQSPS PILETLLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 
SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF KLDIGGFSFI 
MCYEDP7VSS LTGNSLKSVA ASKADVQQNT EDNGKPEKPE PSSPSPRAPA SSQHPSVPAS 
PQGRNAKDLL LDLKNKILAN GGAPRKPQLR AKKAEELDLQ STEITGEEEL GSREDSPHSP 
SDTQDQKRTL RPPSRHGHSV VAPGRTAVRA RMPALPRREG VDKPGFSLAT QPRPGAPPSA 
SASPAHHAST QGTSHRPSLP ASLNDNDLVD SDEDERAVGS LHPKGAFAQP RPALSPSRQS 
PSSVLRDRSS VHPGAKPASP ARRTPHSGAA EEDSSASAPP SRLSPPHGGS SRLLPTQPHL 
SSPLSKGGKD GEDAPATNSN APSRSTMSSS VSSHLSSRTQ VSEGAEASDG ESHGDGDRED 



GGRQAEATAQ TLRAR P ASGH FHLLRHKPFA ANGRSPSRFS IGRGPRLQPS SSPQSTVPSR 
AHPRVPSHSD SHPKLSSGIH GDEEDEKPLP ATWNDHVPS SSRQPISRGW EDLRRSPQRG 
ASLHRKEPIP ENPKSTGADT HPQGKYSSLA SKAQDVQQST DADTEGHSPK AQPGSTDRHA 
SPARPPAARS QQHPSVPRRM TPGRAPEQQP PPPVATSQHH PGPQSRDAGR SPSQPRLSLT 
QAGRPRPTSQ GRSHSSSDPY TASSRGMLPT ALQNQDEDAQ GSYDDDSTEV EAQDVRAPAH 
AARAKEAAAS LPKHQQVESP TGAGAGGDHR SQRGHAASPA RPSRPGGPQS RARVPSRAAP 
GKSEPPSKRP LSSKSQQSVS AEDEEEEDAG FFKGGKEDLL SSSVPKWPSS STPRGGKDAD 
GSLAKEEREP AIALAPRGGS LAPVKRPIjPP PPGSSPRASH VPSRPPPRSA ATVSPVAGTH 
PWPRYTTRAP PGHFSTTPML SLRQRMMHAR FRNPLSRQPA RPSYRQGYNG RPNVEGKVLP 
GSNGKPNGQR IINGPQGTKW WDLDRGLVL NAEGRYLQDS HGNPLRIKLG GDGRTIVDLE 
GTPWSPDGL PLFGQGRHGT PLANAQDKPI LSLGGKPLVG LEVIKKTTHP PTTTMQPTTT 
TTPLPTTTTP RPTTATTMQP TTTTTPLPTT TPRPTTATTR RTTTRRPTTT VRTTTRTTTT 
TTPKPTTPIP TCPPGTLERH DDDGNL IMSS NGIPECYAEE DEFSGLETDT AVPTEEAYVI 
YDEDYEFETS RPPTTTEPST TATTPRVIPE EGAISSFPEE EFDLAGRKRP VAPYVTYLNK 
DPSAPCSLTD ALDHFQVDSL DEI IPNDLKK SDLPPQHAPR NITWAVEGC HSFVIVDWDK 
ATPGDLVTGY LVYSASYEDF IRNKFSTQAS SVTHLPIENL KPNTRYYFKV QAQNPHGYGP 
ISPSVSFVTE SDNPLLWRP PGGELSGSHS LSNMIPATRT AMDGNM 

Seq ID NO: C229 Protein Sequence 



I I I I I I 

ILALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST OENAILAIEQ 

1FFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP 

V YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRLSPDRCKC 

Y LSKNYSYVIH AKIKAVQRSG CKEVTTWDV KEIFKSSSPI PRTQVPLITM 
SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 



I I I I I I 

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WIB 
PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRHEKIDL TYRILRFPWQ 
LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA P.YWHGDDLPF DGPGGILAHA 
FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGIIV LGLOHTTAAK ALMSAFYTFR 
rPLSL,SPDD( RGVQHLYGQP WPTVTSRTPA LGPOAGIDTN EIAPLEPDAP PDACEASFDA 
VSTIRCELFF FKAGFWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 
QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 
PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKAIiEGFP RLVGPDFFGC 
AEPANTFL 

Seq ID NO: C231 Protein Sequence 
Protein Accession #: NP_076927

1 11 21 31 41 51

I I I I I I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQ I LSLLPLKFFP IIVIGIIALI 
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 
TAASWKTMCS DDW KGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 
VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 



Seq ID NO: C232 Protein Sequence
Protein Accession #: NP_



G TSNGTARLPQ LGTVGQSPYT SAPPLSHTPN ADFQPPYFPP 60

PYQPIYPQSQ DPYSHVNDPY SLNPLHAQPQ PQHPGWPGQR QSQESGLLHT HRGLPHQLSG 120 

LDPRRDYRRH EDLLHGPHAL SSGLGDLSIH SLPHAIEEVP HVEDPGINIP DQTVI KKGPV 180 

SLSKSNSNAV SAIPINKDNL FGGWNPNEV FCSVPGRLSL LSSTSKYKVT VAEVQRRLSP 240 



VCAAVTALQN YLTEALKAMD KMYLSNNPNS 




= AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK Y 
MILQAHLHSL CLLMLYLATG YGQEGKFSGP LKPMTFSIYE GQEPSQIIFQ FKAHPPAVTF
ELTGETDNIF VIEREGLLYY NRALDRETRS THNLQVAALD ANGIIVEGPV PITIEVKDIN 
DNRPTFLQSK YEGSVRQNSR PGKPFLYVNA TDLDDPATPN GQLYYQIVIQ LPMINNVMYF 
QINNKTGAIS LTREGSQELN PAKNPSYNLV ISVKDMGGQS ENSFSDTTSV DIIVTENIWK 
APKPVEMVEN STDPHPIKIT QVRWNDPGAQ YSLVDKEKLP RFPFSIDQEG DIYVTQPLDR 
EEKDAYVFYA VAKDEYGKPL SYPLEIHVKV KDINDNPPTC PSPVTVFEVQ ENERLGHSIG 
TLTAHDRDEE NTANSFLNYR IVEQTPKLPM DGLFLIQTYA GMLQLAKQSL KKQDTPQYNL 
TIEVSDKDFK TLCFVQINVI DINDQIPIFE KSDYGNLTLA 3DTNIGSTIL TIQATDADEP 
FTGSSKILYH IIKGDSEGRL GVDTDPHTNT GYVIIKKPLD FETAAVSNIV FKAENPEPLV 
FGVKYNASSF AKFTLIVTDV HEAPQFSQHV FQAKVSEDVA IGTKVGNVTA KDPEGLDISY 
>LDREAG SPYRVQWAT EVGGSSLSSV SEFHLILMDV 
L SAPGSLIFEA TDDDQHLFRG PHFTFSLGSG SLQNDVIEVSK 
jIRINDGG RPPLEGIVSL PVTFCSCVBG SCFRPAGHQT 
[AWFIRI KKDKGKDNVE SAQASEVKPL RS 

Seq ID NO: C235 Protein Sequence 



I I I I I I 
MARARPPPPP SPPPGLLPLL PPLLLLPLLL LPAGCRALEE TLMDTKWVTS ELAWTSHPES 
GWEEVSGYDE AMNPIRTYQV CNVRESSQNN WLRTGFIWRR DVQRVYVELK F 
NIPGSCKETF NLFYYEADSD VASASSPFWM ENPYVKVDTI A ' 
FGPLSKAGFY LAFQDQGACM SLISVRAFYK KCASTTAGFA L 
CIPNAVEVSV PLKLYCMGDG EWMVPVGACT CATGHEPAAK ESQCRPCPPG SYKAKQGEGP 
CLPCPPNSRT TSPAASICTC HNNFYRADSD SADSACTTVP SPPRGVISNV NETSLILEWS 
EPRDLGGRDD LLYNVICKKC HGAGGASACS RCDDNVEFVP RQLGLTERRV HISHLLAHTR 
YTFEVQAVNG VSGKSPLPPR YAAVN ITTNQ AAPSEVPTLR LHSSSGSSLT LSWAPPERPN 
GVILDYEMKY FEKSEGIAST VTSQMNSVQL DGLRPDARYV VQVRARTVAG YGQYSRPAEF 
ETTSERGSGA QQLQEQLPLI VGSATAGLVF WAVWIAIV CLRKQRHGSD SEYTEKLQQY 
IAPGMKVYID PFTYEDPNEA VREFAKEIDV SCVKIEEVIG AGEFGEVCRG RLKQPGRREV 
FVAIKTLKVG YTERQRRDFL SEASIMGQFD HPNIIRLEGV VTKSRPVMIL TEFM3NCALD 
SFLRLNDGQF TVIQLVGMLR GIAAGMKYLS EMNYVHRDLA ARNILVNSNL VCKVSDFGLS 
RFLEDDPSDP TYTSSLGGKI PIRWTAPEAI AYRXFTSASD VWSYGIVMWE VKSYGERPYW 
DMSNQDVINA VEQDYRLPPP MDCPTALHQL MLDCWVRDRN LRPKFSQIVN TLDKLIRNAA 
SLKVIASAQS GMSQPLLDRT V ' ~ - 



MYVGYVLDKD SPVYPGPARP ASLGLGPANY GPPAPPPAPP QYPDFSSYSH VEPAPAPPTA 
WGAPFPAPKD DWAAAYGPGP AAPAASPASL AFGPPPDFSP VPAPPGPGPG LLAQPLGGPG 
TPSSPGAQRP TPYEWMRRSV AAGGG3GSGK TRTKDKYRW YTDHQRLEI 



1 11 21 31 41 51 

I I I I I I 

MGSDRARKGG GGPKDFGAGL KYHSRHEKVN GLEEGVEFLP VNNVKKVEXH G 

VLIGLLLVIjL GIGFLVWHLQ yrdvrvqkvf NGYMRITNEN fveayensns TEFVSLASKV 

KDALKLLYSG VPFLGPYHKE SAVTAFSEGS VIAYYWSEFS IPQHLVEEAE RVMAEERWM 

LPPRARSLKS fwtswafp tdsktvqrtq dnscsfglha rgvelmrftt PGFPDSPYPA 
HARCQWALRG DADSVLSLTF RSFDLASCDE RGSDLVTVYN TLSPMEPHAL VQLCGTYPPS 
YNLTFHSSQN VliLITLITNT ERRHPGFEAT FFQLPRMSSC GGRLRKAQGT FNSPYYPGHY 
I EVPNNQHVKV RFKFFYLLEP GVPAGTCPKD T 



SDELNCSCDA GHQFTCKNKF CKPLFWVCDS V 
SQQCNGKDDC GDGSDEASCP KVNWTC 

T RQARWGGTD ADEGEWPWQV SLHALGQGHI CGASLISE 
T QWTAFLGLHD QSQRSAPGVQ ERRLKRIISH PFFNDFTFDY DIALLELEKP 
AEYSSHVRPI CLPDASHVFP AGKAIWTGW GHTQYGGTGA LILQKGEIRV INQTTCEHLL 
PQQITPRMMC VGFLSGGVDS CQGDSGGPLS SVEADGRIFQ AGWSHGDGC AQRNKPGVYT 



1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGkl^ A SKMM / EIMFLLDGSN 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 

PCEHRTLEMV REFAGNAPCW RGSRP-TLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKL3LEC RVDLLFLLDS SAGTTLDGFL 
J RAVLSEDSKA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT

LTGSALRQAA ERGFGSATRT GQDRPRRWV LLTESHSEDE VAGPARHARA RELLLLGVGS 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTXPT RAAMLRAISQ 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 
SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK RPV1ILCKP3P 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 

Seq ID NO: C239 Protein Sequence 



I I I I I I 



SVGKGSFERS KHFAI TVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YP.TTCPGPCD 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 
SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCEKR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 



I I I I I I 

MLSSTDFTFA SWELWRVDH PNEEQQKDVT LRVSGDLHVG GVMLKLVEQI NISQDWSDFA 
LWWEQKIICWL LKTIIWTLDKY GVQADAKLLF TPQIIKMLRLR LPNLKMVRLR VSFSAWFKA 
VSDICKILNI RRSEELSLLK PSGDYFKKKK KKDKNNKEPI IEDILNLESS PTASGSSVSP 
GLYSKTMTPI YDPINGTPAS STMTWFSDSP LTEQNCSILA FSQPPQSPEA LADMYQPRSL 
VDKAKLNAGW LDSSRSLMEQ GIQEDEQLLL RFKYYSFFDL NPKYDAVRIN QLYEQARWAI 
LLEEIDCTEE EMLIFAALQY HISKLSLSAE TQDFAGESEV DEIEAALSNL EVTLEGGKAD 

RGCEWPDVN VAGRKFGIKL LIPVADGMNE MYLRCDHENQ YAQWMAACML ASKGKTMADS 
SYQPEVLNIL SFLRMKNRNS ASQVASSLEN MDMNPECFVS PRCAKKHKSK QLAARILEAH 
QNVAQMPLVE AKLRFIQAWQ SLPEFGLTYY LVRFKGSKKD DILGVSYNKL IKIDAATGIP 
VTTWRFTNIK QWNVNWETRQ WIEFDQNVF TAFTCLSADC KIVHEYIGGY IFLSTRSKDQ 
NETLDEDLFH KLTGGQD 



Seq ID NO: C242 DNA Sequence 
Nucleic Acid Accession #: NM_005170 
Coding sequence: 337..918

1 11 21 31 41 51 

I I I I I I 

GGGCGTGAGA AAGGCGACGG CGGCGGCGCG GAGGAGGGTT ATCTATACAT TTAAAAACCA 

GCCGCCTGCG CCGCGCCTGC GGAGACCTGG GAGAGTCCGG CCGCACGCGC GGGACACGAG
CGTCCCACGC TCCCTGGCGC GTACGGCCTG CCACCACTAG GCCTCCTATC CCCGGGCTCC 
AGACGACCTA GGACGCGTGC CCTGGGGAGT TGCCTGGCGG CGCCGTGCCA GAAGCCCCCT 
TGGGGCGCCA CAGTTTTCCC CGTCGCCTCC GGTTCCTCTG CCTGCACCTT CCTGCGGCGC 
GCCGGGACCT GGAGCGGGCG GGTGGATGCA GGCGCGATGG ACGGCGGCAC ACTGCCCAGG 

TCCGCGCCCC CTGCGCCCCC CGTCCCTGTC GGCTGCGCTG CCCGGCGGAG ACCCGCGTCC
CCGGAACTGT TGCGCTGCAG CCGGCGGCGG CGACCGGCCA CCGCAGAGAC CGGAGGCGGC 
GCAGCGGCCG TAGCGCGGCG CAATGAGCGC G AGCGCAAC C GCGTGAAGCT GGTGAACTTG 
GGCTTCCAGG CGCTGCGGCA GCACGTGCCG CACGGCGGCG CCAGCAAGAA GCTGAGCAAG 
GTGGAGACGC TGCGCTCAGC CGTGGAGTAC ATCCGCGCGC TGCAGCGCCT GCTGGCCGAG 

CACGACGCCG TGCGCAACGC GCTGGCGGGA GGGCTGAGGC CGCAGGCCGT GCGGCCGTCT
GCGCCCCGCG GGCCGCCAGG GACCACCCCG GTCGCCGCCT CGCCCTCCCG CGCTTCTTCG 
TCCCCGGGCC GCGGGGGCAG CTCGGAGCCC GGCTCCCCGC GTTCCGCCTA CTCGTCGGAC 
GACAGCGGCT GCGAAGGCGC GCTGAGTCCT GCGGAGCGCG AGCTACTCGA CTTCTCCAGC 

^ TGGTTAGGGG GCTACTGAGC GCCCTCGACC TA 

Seq ID NO: C243 Protein Sequence 
Protein Accession #: NP_0S0233.1
Q LQQPLGLTWP WLIWGKDA2 KLMEFVYKKQ
RI RCRPRHSP.PD PLQQRTAWAI 
SQLATRRYQA SCRQARGEWP DSGSSCSSAP VCAICLEEFS EGQEHRVISC L 
PWLHQHRTCP LCVPNITEGD SFSQSLGPSR SYQEPGRRLH LIRQHPGHAH Y 
PSRSAVARPP RPGPFLPSQE PGMGPRHHRF PRAAHPRAPG EQQRLAGAQH PYAQGWGMSH 
LQSTSQHPAA CPVPLRRARP PDSSGSGESY CTERSGYLAD GPASDSSSGP CHGSSSDSW 
NCTDISLQGV HGSSSTFCSS LSSDFDPLVY CSPKGDPQRV DKQPSVTSRP RSLDSWPTG 
ETQVSSHVHY HRHRHHHYKK RFQWHGRKPG PETGVPQ3RP PIPRTQPQPE PPSPDQQVTG 
SNSAAPSGRL SNPQCPRALP EPAPGPVDAS SICPSTSSLF NLQKSSLSAR HPQRKRRGGP 
SEPTPGSRPQ DATVHPACQI FPHYTPSVAY PWSPEAHPLI CGPPGLDKRL LPETPGPCYS 
NSQPVWLCLT PRQPLEPHPP GEGPSEWSSD TAEGRPCPYP HCQVLSAQPC- SSEELEELCE 
QAV 

Seq ID NO: C244 DNA Sequence 
Nucleic Acid Accession #: NM_004289 
Coding sequence: 493..1695

I I I I I I

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGG CAG CCAGGCTGTG 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 
C GACGGCTCAG GTGCCGGACG CTGGCGGATG TGCGAGCGAG 
A AAAGCACGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 
: CCAGAAGGAG AACTCACTTC AGCAGAATGA TGATGATGAA 
3 CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC T 



T GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 480 

2 TTCAGCACAT TATCATGTAA ACTTCAGCCA O 
\ GGCCATCTTG CTTTGTCCCA ACAATACATT T 
\ GTCACAAGAA CCATTTCTGC AGTTAAATTC T 

C AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 

3 CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 
GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAA1 
CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA C 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 
ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTACCCA 
GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 
TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 
TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 
ACAGATACAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 
GTAGATGAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 
CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 
CCTCCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 
ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 
GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTC 
AGTATCTTGA TAGTACCCAA AGAACTGGTG GCCTCAGGCC ACAAAAAGGA A 
GGAAAGAGAA AGTGAGAAGA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 1740 
GAAACTGATT ATTTGGATCA GAAACCAT TG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 1800 
CTGCTACTTG AATAACTCAG TTAACGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 1860 
CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 1920 
ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 1980 
CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 2040 
GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 2100 
AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 2160 
AAATTTTGCT TTCT 2174 
MALRRLGAAL LLLPLLAAVE ETLMDSTTAT AELGWMVHPP SGWEEVSGYD ENMNTIRTYQ
VCNVFESSQN NWLRTKF I RR RGAHRIHVEM KFSVRDCSSI PSVPGSCKET FNLYYYEADF 
DSATKTFPNW MENPWVKVDT IAADESFSQV DLGGRVMKIN TEVRSFGPVS RSGFYLAFQD 
YGGCMSLIAV RVFYRKCPRI IQNGAIFQET LSGAESTSLV AARGSCIANA EEVDVPIKLY 
CNGDGEWLVP IGRCMCKAGF EAVENGTVCR GCPSGTFKAN QGDEACTHCP INSRTTSSGA 
TNCVCRNGYY RADLDPLDMP CTTIPSAPQA VISSVNETSL MLEWTPPRDS GGREDLVYNI 
ICKSCGSGRG ACTRCGDNVQ YAPRQLGLTE PRIYISDLLA HTQYTFEIQA VMGVTDQSPF 
SPQFASVNIT TNQAAPSAVS IMHQVSRTVD SITLSWSQPD QPNGVILDYE LQYYEKELSE 
YNATAIKSPT NTVTVQGLKA GAIYVFQVRA RTVAGYGRYS GKKYFQTMTE AEYQTSIQEK 
LPLIIGSSAA GLVFLIAVW IAIVCNRRRG FERADSEYID KLQHYTSGH14 TPGMKIYIDP 
FTYEDPNEAV REFAKEIDIS CVKIEQVIGA GEFGEVCSGH LKLPGKREIF VAIKTLKSGY 
TEKQRRDFLS EASIMGQFDH PNVIHLEGW TKSTPTOIIT SFMEN3SLDS FLRQNDGQFT 
VIQLVGMLRG IAAGMKYLAD MNYVHRDIiAA RNILVNSNLV CKVSDFGLSR FLEDDTSDPT 
YTSALGGKIP IRWTAPEAIQ YRKFTSASDV WSYGIVHWEV MSYG3RPYWD MTNQDVINAI 
EQDYRLPPPM DCPSALHQLH LDCWQKDRNH RPKFGQIVNT LDKMIRNPNS LKAMAPLSSG 
INLPLLDRTI PDYTSFNTVD EWLEAIKMGQ YKESFANAGF TSFDWSQMM MEDILRVGLT 
LAGHQKKILN SIQVMRAQMN QIQSVEV 
MDARRVPQKD LRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDNFANTP. LQSVREGCRT
RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDS DI L IF L 

NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 
TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 
PVEEITEGGV GERLQQFSKR RYITVHWALL VINAVRRLLI PKQTAETQTA GAFEASSVAP 
AFETVMVKRS GMLCWIRTGI ARLVEESATA VSAGSEMDGV RLGSLCI 
MENPSPAAAL GKALCALLLA TLGAAGQPLG GESICSARAP AXYSITFTGK WSQTAFPKQY

PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDFAER GEAWALKKEI EAAGEALQSV 

HAVFSAPAVP SGTGQTSAEL EVQRRHSLVS FWRIVPSPD WFVGVDSLDL CDGDRHREQA 

ALDLYPYDAG TDSGFTFSSP KFATIPQDTV TEITSSSPSH PANSFYYPRL KALPPIARVT 

LVRLRQSPRA FIPPAPVLPS RDNEIVDSAS VPETPLDCEV SLWSSWGLCG G 

RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 

Protein Accession #: NP_063947.1

I I I I I I

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIW VI.] 
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 
GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FKPMYPKDND IALMKLQFPL 
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 
AYQGEVTEKM MCAGIPEGGV DTOQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 
KVSAYLNWIY NVWKAEL 
MGCKVLLNIG QQMLRRKWD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE
NAGPAIVISF LIAALASVLA GLCYGEFGAR VPKTGSAYLY SYVTVGELWA FITGHNLILS 
YIIGTSSVAR AWSATFDELI GRPIGEFSRT HMTLNAPGVL AENPDIFAVI IIliILTGLLT 
LGVKESAMVN KIFTCINVLV LGFIMVSCFV KGSVKNWQLT EEDFGNTSGR LCLNNDTKEG 
KPGVGGFMPF GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIPVGI VASLLICFIA 
YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GWEGAKYAVA VGSLCALSAS LLGSMFPMPR 
VIYAMAEDGL LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD LMSIGTLLAY 
SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADQNELAS TNDSQLGFLP EAEMFSLKTI 
LSPKNMEPSK ISGLIVMIST SLIAVLIITF CIVTVLGREA LTKGALWAVF LLAGSALLCA 
WTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ LDQGTWVRFA VWMLIGFIIY 
FGYGLWHSEE ASLDADQART PDGNLDQCK 



I I I I I I 

MRAPHLHLSA ASGARALAKL LPLLMAQLWA AEAALLPQND TRLDPEAYGA PCARGSQPWQ 

VSLFNGLSFH CAGVLVDQSW VLTAAHCGNK PLWARVGDDH LLLLQGEOLR RTTRSWHPK 

YHQGSGPILP RRTDEHDLML LKLARPWPG PRVRALQLPY RCAQPGDQCQ VAGWGTIAAR 

RVKYNKGLTC SSITILSPKE CEVFYPGWT NNMICAGLDR GQDPCQSDSG GPLVCDETLQ 

GILSWGVYPC GSAQHPAVYT QICKYMSWIN KVIRSN 



I I I I I I 

MTRAATAEPG RVSPASPARS TAGLPRAFLQ SLRTLLDILD DHQRGCVHLR SIQSLWVEAR 
ELPSGVLEGL SQRRGPQPGA AVRSRRGGAV PRGARAVPER CAGTETRRGP. RCSGLQRLGG 
GFRGCPADPC ARGEHRRHTI TSGVDCGLLK QMKELEQEKE VLLQGLEMMA QGRDWYQQQL 
QQVQERQCRL GQSRASADFG AVGSPRPLGR LLPKVQEVAR WLGELLAEAC AGRALPTSSS 
GPPCSALTST SSPGWQQQII LMLKEQNRLL TQEVTEKSER ITQLEQKSAL IKQLFEARAL 
SQQDGGLSPA GPHIEPLTRF RLPVLTWAGA LLSPHSPQIA LPLSADSGGP LKELPDTWFP 
AVLLWVPSPG KRTAHARLHF HQRPAEGAWQ LGCGAEAAPE TCGTLPHFES HKTTCEPDSL 
GGPCPQEGDR SWSHLGAAFD VAPAVAKVTP NREDAAGSRH GDICPLCPKG LLTFRDIAIE 
FSLAEWQCLD HAQQNLYRDV MLENYRNLFS LGMTVSKPDL IACLEQNKEP QNIKRNEMAA 
KHPVTCSHFN QDLQPEQSIK DSLQKVIPRT YGKCGHENLQ LKKCCKRVDE CEVHKGGYND 
LNQCLSNTQN KIFQTHKCVK VFSKFSNSNR HNARYTt f F F CHF3HLNQI 

IIHTKEKSYK CEECGKSFNH SSSGTTHKRI LTGEKPYRCE ECGKAFRKPS NLTRHKRIHT 
GEKPYACEEC GQAFRRSSTL TNHKRIHTGE RPYKCEECGK AFSVSSALIY KKRIHTGEKP 
YTCEECGKAF NCSSTLKTHK IIHTGEKPYT CEECGRTFNC SSTVKAHKRI H 
ECDKAFKWHS SLAKHKIIHT GEKPYKCSDS KALAKSSEVQ KVYSC-DGENG IRVHKKKETQ

GWLVRNKNEN RTGLFQIRAA VRPNRDPSWG QQEGSLTDPI QRK3EPDLQN HYDHQNALED 

QRNTGVGGLL TFRDWIEFS LBBWQCLDHA QQNLYRDVML ENYRI LVSL I" PDLI1 

CLEQNKEPWN IKRNEMVTKH PDLPPELGIK DSLQKVIPRR YGK3GHDNLQ VKTCKSMGEC 

EVQKGGCIJEV NQCLSTTQNK IFQTHKCVKV FGKFSNSNRH KTPrfT KHF KC 3 2 

MVSQLHQHQI IHTRENSYQC EECGKPFNCS STLSKHKRIH TGEKPYRCEE CGKAFTWSST 

LTKHRRIHTG EKPYTCEECG QAFSRSSTLA NHKRIHTGEK PYTCEECGKA FSLSSSLTYH 
KRIHTGEKPY TCEECGKAFH CSSTLKKHKI IHTGEKPYKC KECGKAFAFS STLNTHKRIH 

TGEEPYKCEE CDKAFKWSSS LANHKSMHTG EKPYKCE 

Seq ID NO: C252 Protein Sequence
Protein Accession #: NP_114433.1



MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG A" 
KSMGGNKHCA E 



QVKVAGKPMY L 
CNFFDFDTFG GHIKFALGFK AAHLEGTELK 



HMGQQLVGQY PIHFHLAGDV D 
YNSLGHCFFT EDGPEERNTP DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 
DCNAVSTFWM ANPNNHLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 
NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHFI 
AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN S 
GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF V 
SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMEX3DKT 8 
NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSMLR MKIIKNDFPS 
HPLYLEGALT RSTHYQQYQP WTLQKGYTI HHDQTAPAEL AIWLINFNKG DWIRVGLCYP 
R rFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 
NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 
TKDHFLEVKM ESSKQIIFFIIL WNDFAYIEVD GKKYPSEEDG IQVWIDGNQ GRWSHTSFR 
NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 
GFKGSFRPIW VTLDTEDHKA KIFQVVPIPV VKKKKt, 

Seq ID NO: C254 Protein Sequence
MTALSSENCS FQYQLRQTNQ PLDVNYLLFL IILGKILLNI LTLGMRRKNT CQNFMEYFCI
S LAFVDLLLL VNISIILYFR DFVLLSIRFT KYKICLFTQI ISFTYGFLHY PVFLTACIDY 
CLNFSKTTKL SFKCQKLFYF FTVILIWISV LAYVLGDPAI YQSLKAQNAY SRHCPFYVSI 
QSYWLSFFMV MILFVAFITC WEEVTTLVQA IRITSYMMET ILYFPFSSHS SYTVRSKKIF 
LSKLIVCFLS TWLPFVLLQV IIVLLKVQIP AYIEMNIPWL YFVNSFLIAT V ' " 

LKDIGLPLDP FVNWKCCFIP LTIPNLEQIE KPISIMIC 

Seq ID NO: C255 Protein Sequence 
MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH
IFLYWQPRD FPEDTLRKFL QKAYESKIDY DKIVYYEAGI ILCCVLGLLF IILMPLVGYF 
FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVETRIKR 
SRKIiADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 
IPVLDEIKSM ATAIKETKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 
HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 
RVQRQTTTW AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 
YDSYWWLGGL VICSLLTLI V I FYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 
SFLFCWILMI IWLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 
SKMKLTFEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 
LGAAGRKNLQ DFAACGIDRM NYDSYLAQTG KSPAGVNLLS FAYDLHAKAN SLPPGNLRNS 
LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 
NTSSVIIEET KKYGRTIIGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 
7FLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM ENGNNGYHKD 
M TSPSQH 



V AAADTTETNF P 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT K 
ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 
PCADNSLCVK LHNTSFCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP B3KESMAYQD 
DVFGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI L
RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 
CNAQHKQCLI KKSGGAPECA CVPGYQEDAN GNCQKCAFGY SGLDCKDKFQ 
TKHIEEENLI DEDFQNLKLR STGFTNLGAE 



PC-ESSDNCTA LVQTEDNPRV 
VRCEHFFLTV HQPLSKEYVA 
SGDPELPQV 



1 11 21 31 41 51 

I I I I I I 

MDRSKENCIS GPVKATAPVG GPKRVLVTQQ IPCQNPLPVN SGQAQRVLCP SNSSQRUPLQ 60 

AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLNNTQKSKQ PLPSAPENNP EEELASKQKN 120 

EESKKRQWAL EDFEIGRPLG KGKFGNVYLA REKQSKFILA LKVLFKAQLE KAGVEHQLRR 180 

EVEIQSHLRH PNILRLYGYF HDATRVYLIL EYAPLGTVYR ELQKLSKFDE QRTATYITEL 240 

ANALSYCHSK RVIHRDIKPE MLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300 

IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVIEGARDLI 360

SRLLKHNPSQ RPMLREVLEH PWITANSSKP SNCQNKESAS KQS 403 



Protein Accession #: NP_037504.1



MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 
GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 
QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 
IDLD 



1 11 21 31 41 

I I I I I 

MKVGVLWLIS FFTFTDGHGG FL 

EKRDLRNFLK LLKPPLDWSH GL 
NCYLHTAGAL PSCECHLNNL SQSVNFCERT K 
IEIQLKKAYE RIQGFESVQV TQFRNGS IV 
LFPLEDGSFR VFGKAQCNDI VFGFGSKDD 
LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST T 
ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA S 
PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 
SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 
HLQWNDAGCH LVNETQDIVT CQCTHLTS - ~ 
ILCLIIEALF WKQIKKSQTS HTRRICMV 

TQPSNTYKRK DVCWLNWSNG SKPLLAFWP ALAIVAVNFV WMA/LTKLW R 
DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNIiAWH VIFALLNAFQ GFFILCFGIL 
LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 
IMLTQFVSNE 



Seq ID NO: C262 Protein Sequence 
Protein Accession #: NP_005594.1 

1 11 21 31 41 51 

I I I I I I 

MSTERDSETT FDEDSQPNDE WPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPFRK 
ECTWQVKAND RKYHEQPHFM NTKFLCI KES KYANNAIKTY KYNAFTFIPM NLFEQFKRAA 
NLYFLALLIL QAVPQISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 
IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSSF NSLCYVSTAE LDG3TMLKFK 
MSLEITDQYL QREDTLATFD GFIECEEPNN RLDKFTGTLF WRNTSFPLDA DKILLRGCVI 
RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFWLIL LSAGLAIGHA 
YWEAQVGNSS WYLYDGEDDT PSYRGFLIFW GYI I VLNTMV PISLYVSVEV IRLGQSHFIN 
WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLI QNIMTFKKCC INGQIYGDHR 
DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL L 
RTDGQLHYQA ASPDEGALVN AARNFGFAFL ARTQNTITIS ELGTERTYMV L 
KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFAKET L 
J KKPMAASVAS TNRDEALDKV Y
if LTGDKKETAE NIGFACELLT E 
KFAPPVQESF FPPGGNRALI ITGSWLNEIL LEKKTKRNKI LKLKFPI 
RLEAKKEQRQ KNFVDLACEC SAVICCRVTP KQKAMWDLV KRYKKA: 
I SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG R 

P PNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRPPGLYI 
FVSLLHG VLTSMILFFI PLGAYLQTVG QDGEAPSDYQ SFAVTIASAL 
?GSIALY FGIMFDFHSA GIHVLFPSAF QPTGTASNAL 
RQPYIWLTII LTVAVCLLPV VAIRFLSMTI WPSESDKIQK HRKRLKAEEQ HQRRQQVFRR 
GVSTRRSAYA FSHQRGYADL ISSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 

Seq ID NO: C263 Protein Sequence 



SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY I 

?ALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SMLQDPAF 
L GSLOGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 
TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 
FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 
NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 
PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 
PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 
FQPNTVNTLA CPLLSNLATR LWLRNGAPUN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 
EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEPLV 
MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 

Seq ID NO: C264 Protein Sequence



MASQNRDPAA TSVAAARKGA EPSGGAARGP VGKRLQQELM T 

ft GTVYEDLRYK LSLEFPSGYP YNAPTVKFLT PCYHPNVDTQ GNICLDILKE 

ILLSIQSLLG EPNIDSPLNT HAASLKKNPT AFKKYLQETY SKQVTSQEP 



1 11 21 31 41 51 

I I I I I I 

MGRGWGFLFG LLGAVWLLSS GHGEEQPPET AAQRCFCQVS GYLDDCT 

LFPRLQKLLE SDYFRYYKVN LKRPCPFWND ISQCGRRDCA VKPCQSDEVP D 

LIEE CEQAERLGAV DESLSEETQK AVLQWTKHDD SSDNFCEADD IQSPEAEYVD 

fEENCFK PQTIKRPLNP LASGQGTSEE N 
V HLSARYLLQE TWLEKKWGHN ITEFQQRFDG I 
[ ELRALSKVLP FFERPDFQLF TGNKIQDEEN KMLLLEILHE I 
NSFFAGDKKE AHKLKEDFRL HFRNISRIMD CVGCFKCRLW GKLQTQGLGT ALKILFSEKL 
IANNPESGPS YEFHLTRQEI VSLFNAFGRI STSVKELENF RNIiLQNIH 



MQPRRQRLPA PWSGPRGPRP TAPLLALLLL LAPVAAPAGS GGPDDPGQPQ DAGVPRRLLQ 60 

QKARAALHFF NFRSGSPSAL RVLAEVQEGR AWINPKEGCK VHWFSTERY NPESLLQEGE 120 

GRLGKCSARV FFKNQKPRPT INVTCTRLIE KKKRQQEDYL LYKQMKQLKN PLEIVSIPDN 180 

HGHIDPSLRL IWDLAFLGSS YVMWEMTTQV SHYYLAQLTS V 



Seq ID NO: C267 Protein 



MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV K 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 



MLRQVLRRGL QSFCHRLGLC VSRHPVFFLT VPAVLTITFG LSALNRFQPE GDLERLVAPS 

HSLAKIERSL ASSLFPUDQS KSQLYSDLHT PGRYGRVILL SPTGDNILLQ AEGILQTHRA 

VLEMKVNHKG YNYTFSHLCV LRNQDKKCVL DDIISVLEDL RQAAVSNKTT ARVQVRYPNT 
KLKVCSFCML LPIKEAALHF LP 



I I I I I I 

MRLPLLLVFA SVIPGAVLLL DTRQFLIYNE DHKRCVDAVS PSAVQTAACN QDAESQKFRW 
VSESQIMSVA FKLCLGVPSK TDWVAITLYA CDSKSEFQKW ECKNDTLLGI KGEDLFFNYG
NRQEKNIMLY KGSGLWSRWK IYGTTDNLCS RGYEAMYTLL GNANGATCAF PFKFENKWYA 
DCTSAGRSDG WLWCGTTTDY DTDKLFGYCP LKFEGSESLW NKDPLTSVSY QINSKSALTW 
HQARKSCQQQ NAELLSITEI HEQTYLTGLT SSLTSGLWIG LNSLSFNSGW QWSDRSPFRY 
UJWLPGSPSA EPGKSCVSLN PGKNAKWENL ECVQKLGYIC KKGNTTLNSF VIPSESDVPT 

HCPSQWWPYA GHCYKIHRDE KKIQRDALTT CRKEGGDLTS IHTIEELDFI I SQLGYEPND
ELWIGLNDIK IQMYFEWSDG TPVTFTKWLR GEPSHENNRQ EDCWMKGKD GYWADRGCEW 
PLGYICKMKS RSQGPEIVEV EKGCRKGWKK HHFYCYMIGH TLSTFAEANQ TCNNEMAYLT 
TIEDRYEQAF LTSFVGLRPE KYFWTGLSDI QTKGTFQWTI EEEVRFTHWN SDMPGRKPGC 
VAMRTGIAGG LWDVLKCDEK AKFVCKHWAE GVTHPPKPTT TPEPKCPEDW GASSRTSLCF 

KLYAKGKHEK KTWFESRDFC RALGGDLASI NNKEEQQTIW RLITASGSYH KLFWLGLTYG
SPSEGFTWSD GSPVSYENWA YGEPNNYQNV EYCGELKGDP TMSWNDINCE HLNNWICQIQ 
KGQTPKPEPT PAPQDNPPVT EDGHVIYKDY QYYFSKEKET MDNARAFCKR NFGDLVSIQS 
ESEKKFLWKY VNRNDAQSAY FIGLLISLDK K " ----- -- 



TMYSNSGFWN DINCGYPNAF ICQRHNSSIN ATTVMPTMPS V 
GFMEEERKNW QEARKACIGF GGNLVSIQNE KEQAFLTYHM KDSTFSAWTG LNDVNSEHTF 
LWTDGRGVHY TNWGKGYPGG RRSSLSYEDA DCWIIGGAS NEAGKWMDDT CDSKRGYICQ 
TRSDPSLTNP PATIQTDGFV KYGKSSYSLM RQKFQWHEAE TYCKLHHSLI ASILDPYSNA 
M ERVWIALNSN LTDNQYTWTD KWRVRYTNNA ADEPKLKSAC VYLDLDGYWK 
? LCKRSDEIPA TEPPQLPGRC PESDHTAWIP FHGHCYYIES SYTRMWGQAS 
U VSIESAAESS FLSYRVEPLK SKTNFWIGLF RNVEGTWLWI NKSPVSFVNW 
S DCVALHASSG FWSNIHCSSY KGYICKRPKI IDAKPTHELL TTKADTRKMD 
PSKPSSNVAG WIIVILLIL TGAGLAAYFF YKKRRVHLPQ EGAFENTLYF SSQSSPGTSD 
MKDLVGNIEQ NEHSVI 



I I I I I I 

MVLLHWCLLW LLFPLSSRTQ KLPTRDEELF QMQIRDKAFF H 
PKRYFFWEE DNTPLSVTVT PCDAPLEWKL SLQELPEDRS G 
EGTELFSYKG NDVEYFISSS SPSGLYQLDL L 
RVDVTSLGRT TVTLAWKPSP TASLLKOPIO YCWINKEHK FKSLCAVEAK LSADDAFMMA 
PKPGLDFSPF DFAHFGFPSD NSGKERSFQA KPSPKLGRHV YSRPKVDIQK I 
VSDLEPDTQY YFDVFWNIN SNMSTAYVGT FARTKEEAKQ K 
KFLRFAPVSS HQKVTFFIHS CLDAVQIQVR RDGKLLLSQN VEGIQQFQLR G 
LKGNKKGASM LKILATTRPT KQSFPSLPED TRIKAFDKLR TCSSATVAWL GTQERNKFCI 
YKKEVDDNYN EDQKKREQNQ CLGPDIRKKS EKVLCKYFHS QNLQKAVTTE TIKGLQPGKS 
H GGHSVKYQSK V 



MEKVQLEFEN QEMEKKLQEF R 



GEDYCSGCFA NVHQKGALKL HRTTLLQARS QILFNVLDVA HQFIKDVNPD EPKEENNSTK 
ETSKIQHKPK SVLLQRSSSE VEITTMKRAQ RTKPRKSLLC EGSFDEEASA QSFQEVLSQW 
RTGNHDDNKK QNLHAAVKDS LEECEVQTNL KIWREPLNIE LKEDILSYME K 
PQEQLFKCYQ IRSHIHMKPL VMHSVLKMKT MKIVMVRRPK YNTQLFYCQ 



X RTPQEQLFKM LSDTFPHPHE TTGDAQCSQN ENDED SDGEE TKVQHTALLL 
PVETLNIERP EPSLKIVELD DTYEEEFEEA ENIVPYKVKL ADADSQRSCA FHDCQKNSFP 
YENGIHQHHV FDKGKRDFLN LCLRNSSTYY KDNSKGETSN TDFDNIVDPD VYSSDIEKIE 
ESTSFERNLK EKNIGLESNQ KSDDSCVSLE SKDTLLGRDL EKAPIEEKLS QDIKESLELS 
NLYKRPSFEE SKTTKSSLLL QEIACRSKPI TKQYQGLERF FIFDTNERLN LLPSHRLECN 
TONAIVL GVLQGAQSPS SSRKQQKK.GQ KSQRPSTANF 
S RSAAAQSSSR AASEISEIEY IDITDQNELS LDDTTDQHTL 
DHIiEKELQVL RSLADTSEKL YSLTSEEFPD FSSQSLNISQ ISTDFLKTSH VRGPCGVEEL 
SCSGRDTKIQ SLLSLSESST DEEEEDFLNK QHVITLPWSK ST 

Protein Accession #: NP_005399.1 
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I I I I I 

MFLLTGGVSL KSAEKNPDPT WLQDKSWEEI CRASEFEAFR GLRQHFCEHI 

LDSNCTIPLI FVLSPGADPM ASLLKFANDK SMSGNKFQAI SLGQGQGPIA AKMIKAAIEE 

GTWVCLQNCH LAVSWMPMLE KICEDFTSET CNSSFRLWLT SYPSSKFPVT I LQNGVKMTN 

EPPTGLRLNL LQSYLTDPVS DPEFFKGCRG KELLFINEYD TIPFEAISYL TGECNYGGRV 

TDDWDRRLLL TMLADFYNLY IVENPHYKFS PSGNYFAPPK GTYEDYIEFI KKLPFTQHPE 

IFGLHENVDI SKDLQQTKTL FESLLLTQGG SKQTGASGST DQILLEITKD ILNKLPSDFD 

IEMALRKYPV RYEESMNTVL VQEMERFNNL IITIRNTLRD LEKAIKGVW MDSALEALSS 

SLLVGKVPEI WAKRSYPSLK PLGSYITDFL ARLNFLQDWY NSGKPCVFWL SGFFFTQAFL 

TGAMQNYARK YTTPIDLLGY EFEVIPSDTS DTSPEDGVYI HGLYLDGARW DRESGLLAEQ 

YPKLLFDLMP IIWIKPTQKS RIIKSDAYVC PLYKTSERKG TLSTTGHSTN FVIAMLLKTD 
QPTRHWIKRG VALLCQLDD 

Seq ID NO: C275 Protein Sequence



I I I I I I 

MAESHLLQWL LLLLPTLCGP GTAAWTTSSL ACAQGPEFWC QSLEQALQCR ALGHCLQEVW 

GHVGADDLCQ ECEDIVHILN KMAKEAIFQD TMRKFLEQEC NVLPLKLLMP QOJQVLDDYF 

PLVIDYFQNQ TDSNGICMHL GLCKSRQPEP EQEPGMSDPL PKPLRDPLPD PLLDKLVLPV 

LPGALQARPG PHTQDLSEQQ FPIPLPYCWL. CRALIKRIQA MIPKGALAVA VAQVCRWPL 

VAGGICQCLA ERYSVILLDT LLGRMLPQLV CRLVLRCSMD DSAGPRSPTG EWLPRDSECH 

LCMSVTTQAG NSSEQAIPQA MLQACVGSWL DREKCKQFVE QHTPQLLTLV PRGWDAHTTC 

QALGVCGTMS SPLQCIHSPD L 

Seq ID NO: C27S Protein Sequence



LLLLLQGSAD GNGIQGFFYP WSCEGDIWDR ESCGGQAAID SPNLCLRLRC 



NILPFMDPLK LLLKTLGISV 



I I I I I I 

MPLSYAYKNA ETLAGRHTSS WMSRGAYQRR NTRAAGRPEE CTDRNWHAGR TRGIWLGQLE 
ERCSDVFGVS FFWWVRGLAG SGAKLQTFTP AQEGAPTVQR QAEALLKCRQ SGRPGRGGAE 
SERARDASML SPLSAAMRNY PTSSTIPPRR SYSPTEIAHK SYSCSLPDMK ISMAESGPSL 
DSLDI LEDGE SGSPFLVTHL YFLGWTTGM EQLDFETGPN IFDLQIYVKD EVGVTDLQVL 
TVQVTDVNEP PQFQGNLAED HLRADQPHFN AHSHTYVRW ATALARHRLR SSIGSPFLGT 
FCVWGMQYF LISPPKSFRM SANGTLFSTT ELDFEAGHRS FHLIVEVRDS GGLKASTELQ 
VNIVNLNDEV PRFTSPTRVY TVLEELSPGT IVANITAEDP DDEGFPSHLL YSITTVSKYF 
MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 
TCQKFTFRSS LHPALCSKTL TWMDTVLDCF HAADKDIPVT GRFTKERGLI GLTVPHGWGS 
LTIMAEGKEE QVTSYMDGSR QRDRACVGKL LLIKPSDLMR LSHYHENNSG KTCPHDSISS 
YQVPPTTCRN SRIQATNHED TSSVTVTTOI LEENDEKPIC TPNSYFLALP VDLKVGTNIQ 



MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 
TCQKFTFSIM VPERTAKGTL LLDLNKFCFD DDSEAPNNRF NFTMPSGVGS GSRFLQDPAG 
SGKIVLIGDL DYENPSNDAA GNKYTVIIQV QDVAPPYYKN NVYVYILTSP ENEFPLIFDR 
PSYVFDVSER RPAQGHLSGP EEKRLLSICM VRAVCHKFGL HIASGSPRV? GRPIGQSHPQ 
TLPLQDWEEQ GTSDKERRNE DCRERRRGGN YPDEHYL 
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K AELSCWEEGN GRIALQGTLL NTYVCSILIR TTMHVGFIVG QYFIYGIFLT 
raCYVSR PTEKNVFIVF MLAVAALSLL LSLAELYHLG WKKIRQRFVK 
PRQHMAKCQL SGPSVGIVQS CTPPPDFNQC LENGPGGKFF NPFSNNMASQ QNTDNLVTEQ 
VRGQEQTPGE GFIQVRYGQK PEVPNGVSPG HRLPHGYHSD KRRLSKASSK ARSDDLSV 



I I I I I I 

MYLSICCCFL LWAPALTLKN LNYSVPEEQG AGTVIGNIGR DARLQPGLPP AERGGGGRSK 
SGSYRVLENS APHLLDVDAD SGLLYTKQRI DRESLCRHNA KCQLSLSVFA NDKEICMIKV 
EIQDINDNAP SFSSDQIEHD ISENAAPGTR FPLTSAHDPD AG2NGLRTYL LTRDDHGLFG 
LDVKSRGDGT KFPELVIQKA 'LDREQQNHHT LVLTALDGGE PPRSATVQIN VKVIDSNDNS 
PVFEAPSYLV ELPENAPLGT WIDLNATDA DEGPNGEVLY SFSSYVPDRV RELFSIDPKT 
GLIRVKGNLD YEENGMLE ID VQARDLGPNP IPAHCKVTVK LIDRNDNAPS IGFVSVRQGA 
LSEAAPPGTV IALVRVTDRD SGKNGQLQCR VLGGGGTGGG GGLGGPGGSV PFKLEENYDN 
FYTWTDRPL DRETQDEYNV TIVARDGGSP PLNSTKSFAI KILDENDNPP RFTKGLYVLQ 
VHENNIPGEY LGSVLAQDPD LGQNGTVSYS ILPSHIGDVS IYTYVSVNPT NGAIYALRSF 
NFEQTKAFEF KVLAKDSGAP AHLESNATVR VTVLDVNDNA PVIVLPTLQN DTAELQVPRN 
AGLGYLVSTV RALDSDFGES GRLTYEIVDG NDDHLFEIDP SSGEIRTLHP FWEDVTPWE 
LWKVTDHGK PTLSAVAKLI IRSVSGSLPE GVPRVNGEQH HWDMSLPLIV TLSTISIILL 
AAMITIAVKC KRENKEIRTY NCRIAEYSHP QLGGGKGKKK KINKNDIMLV QSEVEERNAM 
NVMHWSSPS LATSPMYFDY QTRLPLSSPR SEVMYLKPAS NNLTVPQGHA GCHTSFTGQG 
TNASETPATR MSIIQTDNFP AEPNYMGSRQ QFVQSISVAP RLRTQKEPA 

Seq ID NO: C282 Protein Sequence 



MELCRSLALL GGSLGLMFCL IALSTDFWFE AVGPTHSAHS GLWPTGHGDI ISGYIHVTQT 
FSIMAVLWAL VSVSFLVLSC FPSLFPPGHG PLVSTTAAFA AAISMWAMA VYTSERWDQP 
S WSFYLGWVSA I 



□ LARAHLRDEE KSCPCLAQEG PQGDLLTKTQ 
ELGRDYRTCL TIVQKLKKMV DKPTQRSVSN AATRVCRTGR SRWRDVCRNF MRRYQSRVTQ 
GLVAGETAQQ ICEDLRLCIP STGPL 



MKVSAAALAV ILIATALCAP ASASPYSSDT TPCCFAYIAR PLPRAHIKEY FYTSGKCSNP 



1 11 21 31 41 51 

I I I I I I 

MAPGRAVAGL LLLAAAGLGG VAEGPGLAPS EDVLSVFGAN LSLSAAQLQH LLEQMGAASR 
VGVPEPGQLH FNQCLTAEEI FSLHGFSNAT QITSSKFSVI CPAVLQQLNF HPCEDRPKHK 
TRPSHSEVWG YGFLSVTIIN LASLLGLILT PLIKKSYFPK ILTFFVGLAI GTLFSNAIFQ
LIPEAFGFDP KVDSYVEKAV AVFGGFYLLF FFERMLKMLL KTYGQMGHTH FGNDNFGPQE 
KTHQPKALPA INGVTCYANP AVTEANGHIH FDNVSWSLQ DGKKEPSSCT CLKGPKLSEI 
GTIAWMITLC DALHNFIDGL AIGASCTLSL LQGLSTSIAI LCEEFPKELG DFVILLNAGM 
STRQALLFNF LSACSCYVGL AFGILVGNNF APNIIFALAG GMFLYISLAD MFPEMNDMLR 
EKVTGRKTDF TFFMIQNAGM LTGFTAILLI TLYAGEIELE 



Seq ID NO: C28S Protein Sequence 



I I I I I I 

MPNSEPASLL ELFNSIATQG ELVRSLKAGN ASKDEIDSAV KMLVSLKMSY KAAAGEDYKA 
DCPPGNPAPT SNHGPDATEA EEDFVDPWTV QTSSAKGIDY DKLIVP.FGSS KIDKELINRI 
ERATGQRPHH FLRRGIFFSH RDMNQVLDAY ENKKPFYLYT GRGPSSEAMH VGHLIPFIFT 
KWLQDVFNVP LVIQMTDDEK YLWKDLTLDQ AYGDAVENAK DIIACGFDIN KTFIFSDLDY 
MGMSSGFYKN WKIQKHVTF NQVKGIFGFT DSDCIGKISF PAIQAAPSFS NSFPQIFRDR 
TDIQCLIPCA IDQDPYFRMT RDVAPRIGYP KPALLHSTFF PALQGAQTKM SASDPNSSIF 
LTDTAKQIKT KVNKHAFSGG RDTIEEHRQF GGNCDVDVSF MYLTFFLEDD DKLEQIRKDY 
TSGAMLTGEL KKALIEVLQP LIAEHQARRK E 



Seq ID NO: C287 Protein Sequence






WO 03/042661 PCT/US02/36810 



Protein Acces 



I I I I I I 

MTVFRQENVD DYYDTGEELG SGQFAWKKC REKSTGLQYA AKFIKKRRTK SSRRGVSRBD 

IEREVSILKE IQHPNVITLH EVYENKTDVI LILELVAGGE LFDFLAEKES LTEEEATEFL 

KQILNGVYYL HSLQIAHFDL KPENIMLLDR NVPKPRIKII DFGLAHKIDF GNEFKNIFGT 

PEFVAPEIVN YEPLGLEADM WSIGVITYIL LSGASPFLGD TKQETIANVS AVHYEFEDEY 

FSNTSALAKD FIRRLLVKDP KKRMTIQDSL QHPWIKPKDT QQALSRKASA VNMEKFKKFA 

ARKKWKQSVR LISLCQRLSR SFLSRSNMSV ARSDDTLDEE DSFVMKAIIH AINDDNVPGL 

QHLLGSLSNY DVNQPNKHGT PPLLIAAGCG NIQILQLLIK RGSRIDVQDK GGSNAVYWAA 

RHGHVDTLKF LSENKCPLDV KDKSGEMALH VAARYGHADV AQVTCAASAQ IPISRTKEEE 



CDKDGHIALH LAVRRCQMEV IKTLLSQGCF VDYQDRHGNT PLHVACKDGN MPIWALCEA 
NCNLDISNKY GRTPLHLAAN NGILDWRYL CLMGASVEAL TTDGKTAEDL ARSEQHEHVA 
GLLARLRKDT HRGLFIQQLR PTQNLQPRIK LKLFGHSGSG KTTLVESLKC GLLRSFFRRR 
RPRLSSTNSS RFPPSPLASK PTVSVSINNL YPGCENVSVR SRSMMFEPGL TKGMLEVFVA 
PTHHPHCSAD DQSTKAIDIQ NA YLNGVGD F SVWEFSGNPV YFCCYDYFAA NDPTSIHVW 
FSLEEPYEIQ LNPVIFWLSF LKSLVPVEEP IAFGGKLKMP LQWLVATHA DIMNVPRPAG 
GEFGYDKDTS LLKEIRNRFG NDLHISMKLF VLDAGASGSK DMKVLRNHLQ EIRSQIVSVC 
PPMTHLCEKI ISTLPSWRKL NGPNQLMSLQ QFVYDVQDQL NPLASEEDLR RIAQQLHSTG 
EINIMQSETV QDVLLLDPRW LCTNVLGKLL SVETPRALHH YRGRYTVEDI QRLVPDSDVE 
ELLQILDAMD ICARDLSSGT MVDVPALIKT DNLHRSWADE EDEVMUYGGV RIVPVEHLTP 
FPCGIFHKVQ VNLCRWIHQQ STEGDADIRL WVNGCKLANR GAELLVLLVN HGQGIEVQVR 
GLETEKIKCC LLLDSVCSTI ENVMATTLPG LLTVKHYLSP QQLREHHEPV MIYQPRDFFR 
AQTLKETSLT NTMGGYKESF SSIMCFGCHD VYSQASLGMD IHASDLNLLT RRKLSRLLDP 
PDPLGKDWCL LAMNLGLPDL VAKYNTNNGA PKDFLPSPLH ALLREWTTYP ESTVGTLMSK 
LRELGRRDAA DLLLKASSVF KINLDGNGQE AYASSCNSGT SYNSISSWS R 



1 11 21 31 41 51 

I I I I I I 

MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR
ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFXQLHP 
QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVOGLGVA SDWRKVAQV 
E IAJ MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 
TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 
RGKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKMALSTA SDDRCWNGMA 
3 DGIjANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YKGIIDVDFQD 



1 11 21 31 41 51 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAE
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKKTS TTSTGNKNFL AGRFSHL 



I I I I I I 

MRANDALQVIi GLLFSLARGS EVGNSQAVCP GTLNGLSVTG DAENQYQTLY KLYERCEWM 
GNIiEIVLTGH NADLSFLQWI REVTCYVLVA MNEFSTLPLP NLRWRGTQV YDGKFAIFVM 
LNYNTNSSHA LRQLRLTQLT EILSGGVYIE KNDKLCHMDT IDWRDIVRDR DAE I WKDNG 
RSCPPCHEVC KGRCWGPGSE DCQTLTKTIC APQCNGHCFG PKPNQCCHDE CAGGCSGPQD 
TDCFACRHFN DSGACVPRCP QPLVYNKLTF QLEPNPHTKY QYGGVCVASC PHNFWDQTS 
CVRACPPDKM EVDKNGLKMC EPCGGLCPKA CEGTGSGSRF QTTO5SNIBG FTOiCTKILGN 
LDFLITGLNG DPWHKIPALD PEKLNVFRTV REITGYLKIQ SWPPHMHNFS VFSNLTTIGG 
RSLYNRGFSL LIMKNLHVTS LGFRSLKEIS AGRIYISANR QLCYHKSUW TKVLRGPTEE 
RLDIKHNRPR RDCVAEGKVC DPLCSSGGCW GPGPGQCLSC RNYSRGGVCV THCNFLNGEP 
REFAHEAECF SCHPECQPMG GTATCNGSGS DTCAQCAHFR DGPHCVSSCP HGVLGAKGPI 
YKYPDVQNEC RPCHENCTQG CKGPELQDCL GQTLVLIGKT HLTMALTVIA GLWIFMMLG 
GTFLYWRGRR IQNKRAMRRY LERGESIEPL DPSEKANKVL ARIFKETELR KLKVLGSGVF 
GTVHKGVWIP EGESIKIPVC IKVIEDKSGR QSFQAVTDHM LA I GSLDHAH IVRLLGLCPG 

SSLQLVTQYL PLGSLLDHVR QHRGALGPQL LLNWGVQIAK GMYYLEEHGM V 

LLKSPSQVQV ADFGVADLLP PDDKQLLYSE AKTPIKH 
VWELMTFGAE PYAGLRLAEV PDLLEKGERL AQPQICTIDV Y 



NLATTTLGSA LSLPVGTLNR PRGSQSLLSP SSGYMPMNQG NLG3SCQESA VSGSSERCPR 1080 

PVSLHPMPRG CLASESSEGH VTGSEAELQE KVSMCRSRSR SRSPRPRGDS AYHSQRHSLL 1140 

TPVTPLSPPG LEEEDVNGYV MPDTHLKGTP SSREGTLSSV GLSSVLGTEE EDEDSEYEYM 1200 

NRRRRHSPPH PPRPSSLEEL GYEYMDVGSD LSASLGSTQS CPLHPVPIMP TAGTTPDEDY 1260 
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Seq ID NO: C2 91 Protein Sequence 
Protein Accession #: NP_001207.1 

1 11 21 31 41 

I I I I I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG G 
GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 
DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 
ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLI L J V3RI EHT 

VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLE EGPSENSAYE QLLSRLEEIA 
EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKOLKTLS 
DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 



1 11 21 31 41 51 

I I I I I I 

MGGAWDEGP TGVKAPDGGW GWAVLFGCFV ITGFSYAFPK AVSVFFKELI OEFGIGYSDT 
AWISSILLAM LYGTGPLCSV CVNRFGCRPV MLVGGLFASL GMVAASFCRS IIQVYLTTGV 
ITGLGLALNF QPSLIMLNRY FSKRRPMANG LAAAGSPVFL CALSPLGQLL QDRYGKRGGF 
LILGGLLLNC CVCAALMRPL WTAQPGSGP PRPSRRLLDL SVFRDRGFVL YAVAASVMVL 
GLFVPPVFW SYAKDLGVPD TKAAFLLTIL GFIDIFARPA AGFVAGLGKV RPYSVYLFSF 
SMFFNGLADL AGSTAGDYGG LWFCIFFGI SYGMVGALQF EVLMAIVGTH KFSSAIGLVL 
LMEAVAVLVG PPSGGKLLDA THVYMYVFIL AGAEVLTSSL ILLLGNFFCI RKKPKEPQPE 
VAAAEEEKLH KPPADSGVDL REVEHFLKAE 

Seq ID NO: C293 Protein Sequence



MALFVRLLAI: ALALALGPAA TLAGPAKSPY QLVLQHSRLR GRQHGPNVCA VQKVIGTNRK 
YFTNCKQWYQ RKICGKSTVI SYECCPGYEK VPGEKGCPAA LPLSNLYETL GWGSTTTQL 
YTDRTEKLRP EMEGPGSFTI FAPSNEAWAS LPAEVLE 
LTDELKHGMT LTSMYQNSNI QIHHYPNGIV TVNCARL 
NNIQQIIEIE DTFETLRAAV AASGLNTMLE GNGQYTLLAP T 
EALRDLLNNH ILKSAMCAEA IVAGLSVETL EGTTLEij 
NGVIHYIDEL LIPDSAKTLF ELAAESDVST AIDLFRQAGL GNHLSGSERL TLLAPLNEVF 
KDGTPPIDAH TKNLLRNHII KDQKASKYLY HGQTLETLGG KKLRVFVYRN SLCIENSCIA 
AHDKRGRYGT LFTMDRVLTP PMGTVMDVLK GDNRFSMLVA AIQSAGLTST LNREGVYTVF 
APTNEAFRAL PPRERSRLLG DAKELANILK YIIIGDEILVS GGIGALVRLK SLQGDKLEVS 
LKNNWSVNK EPVAEPDIMA TNGWHVITN VLQPPANRPQ ERGDELADSA LEIFKQASAF 
SRASQRSVRL APVYQKLLER MKH 

Seq ID NO: C294 Protein Sequence

1 11 21 31 41 51 

I I I I I I

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN
IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 
GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 
KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 
MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 
TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 
RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKMiGKA YGSVMILVTS 
GDDKLLGNCL PTVLSSGSTI HS I ALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 
SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTNFLVTW QASGPPEIIL 
FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 
AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 
AGADVIKNDG IYSRYFFSFA ANGRYSLKVB VNHSPSISTP AHSIPGSKAM YVPGYTANGN 
IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 
LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 
PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 
I WTHHTLSRK KRADKKENGT KLL 



1 11 21 31 41 

I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF Y 
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR K 
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 
IiAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAV " 
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNS 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD K
EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM D 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC . 

a Sequence 
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MKFLLILLLQ AT ASGAL PLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 
LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHAI GHSLGLGHSS DPKAVMFPTY 
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNOVFLFXDD KYWLISNLRP 
EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC

Seq ID NO: C297 Protein Sequence 



; FQGLLIFGNV I IGCCGI ALT AECIFFVSDQ HSLYPLLEAT DKDDIYGAAW 
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TORDFFTPNL 
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 
3 TMFYWSRIEY 



1 11 21 31 41 51

I I I I I 

3 LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 
KGPFPQRLNQ LKSMKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 
!l GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 
r TAVAWE I LD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 
3 GDDGDHFTIT THPESNOGIL TTRKGLDFEA KNQHTLYVEV THEAPFVLKL 
PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQXISYRILR 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 
VMDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDI Y WTAEVNEEGD 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVSTCPGP WKGGFILPVL 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 



DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



I I I I I I 

MAKKSAENGI YSVSGDEKKG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRETWTRQMD 

FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGIPIFFLEI SLGQFMKAGS 

INVWNICPLF KGLGYASMVI VFYCNTYYIM VLAWGFYYLV KSFTTTLPKA TCGHTWNTPD 

CVEIFRHEDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLWLLVRGV LLPGALDGI I YYLKPDWSKL 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNMNCYKD AIILALINSG TSFFAGFWF 

SILGFMAAEQ GVHISKVAES GPGLAFIAYP RAVTLMPVAP LWAALFFFML LLLGLDSQFV 

GVEGFITGLL DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 
TTLLWQAFWE C 



I I I I I I 

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEFYNQTWVH RYGESILPTT 
LTTLWSLSVA IFSVGGMIGS FSVGLFVNRF GRRNSMLMMN LLAFVSAVLM GFSKLGKSFE 
MLILGRFIIG VYCGLTTGFV PMYVGEVSPT AFRGALGTLH QLGIWGILI AQVFGLDSIM 
GNKDLWPLLL SIIFIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 
LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAWLQL SCQLSGINAV FYYSTSIFEK 
AGVQQPVYAT IGSGIVNTAF TWSLFWER AGRRTLHLIG IAGMAGCAIL MTIALALLEQ 
LPWMSYLSIV AIFGFVAFFE VGPGPIPWFI VAELFSQGPR PAAIAVAGFS NWTSNFIVGM 
CFQYVEQLCG PYVFIIFTVL LVLFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 
ELFHPLGADS QV 




MAGAGPKRRA LAAPAAEEKE EAREKMLAAK 
GTIIGSGIFV TPTGVLKEAG SPGLALWWA 
LEVYGSLPAF LKLWIELLII RPSSQYIVAL 
LLTAVNCYSV KAATRVQDAF AAAKLLALAL 



SADGSAPAGE GEGVTLQRNI TLLNGVAI IV 60 

ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 

VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180

IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 
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3 LFAYGGWNYL NFVTEEMINP YRNLPLMII SLPIVTLVYV L 

STEQMLSSEA VATOFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 

RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 

KPKWLLQGIF STTVLCQKLM QWPQET 507 



MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERWSDDHKD FDCNTRQPGC 60 

SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120

GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 180 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 

DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 273 



MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKF 
VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKQVARNL 
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP RESIKGKETT 
PSSLAVTQTM ATKAPECVED PDMANQRKTA LEFCGETWSS LCTFFLSIVQ DTSC 



MCSEIILRQE VLKDGFHRD L LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 

RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 

ASIWNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 

WNKMKYKSVY KNVILQVPVG LTUHTSLVCS VTLLITILCS KKKKK 



1 11 21 31 41 51 

I I I I I I 

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 
RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 
ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 
FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVWPT RFGNADGAAC HFPFIFEGRS 
YSACTTDGRS DGLPHCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 

" ~ ^ATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 

ITTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 
' LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 

TGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGKQ LYLFKDGKYW 

P QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 
LDKLGLGADV AQVTGALRSG RGKHLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED 



Protein Accession #: NP_000204

1 11 21 31 41 51 

I I I I I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60 

CNTQAEL1AA GCQRESIWM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGESRHFEL 120 

EVFEPLESPV DLYILHDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180 

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 

QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300 

TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360 

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420

THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCICELQ K3VRSARCSF NGDFVCGQCV 480 

CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540 

FQCPRTSGFL CNDRGRCSMG QCVCEPGHTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600 

CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GSKKGRTCEE CNFKVKMVDE 660 

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 

SGNLKGRDW RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTS NLLKPDTREC 840 

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960

LVEAIDVPAG TATLGRRLVN ITIIKEQARD WSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020 

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080 

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 
RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKV 
AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW A 

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 
KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTG CGWKFEPLLG 
EELDLRRVTW RLPPELIPRL SASSGRSSDA EAPTAPRTTA ARAGRAAAVP RSATPGPPGE 
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30 



HLVNGRMDFA FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH 
SHSTTLPRDY STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY 
SVEYQLLNGG ELHRLNI PNP AQTSVWEDL LPHHSYVFRV RAQSQEGWGR EREGVITIES 
QVHPQSPLCP LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPHGD IVGYLVTCEM 
AQGGGPATAF RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG 
PFPQLGSRAG LFQHPLQSEY SSITTTHTSA TEPFLVDGLT LGAQHLEAGG SLTRHVTQEF 
VSRTLTTSGT LSTHMDQQFF QT 

Protein Accession #: NP_07S404.1



MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL I 

GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPFRIVH DAGFGPWYFK F 

FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 



Seq ID NO: C308 Protein Sequence



1 11 21 31 41 51

| | | | | |

MVWCLGLAVL SLVISQGADG RGKPEWSW GRAEESWLG COLLPPAGRP PLHVIEHLRF 

GFLLPIFIQF GLYSPRIDPD YVGRVRLQKG ASLQIEGLRV EDQGWYECRV FFLDQHIPED 

DFANGSWVHL TVNSPPQFQE TPPAVLEVQE LEPVTLRCVA RGSPLPHVTW KLRGKDLGQG 

QGQVQVQNGT LRIRRVERGS SGVYTCOASS TEGSATHATQ LLVLGPPVIV VPPKNSTVNA 

SODVSLACHA EAYPANLTYS WFQDNINVFH ISRLQPRVQI LVDGSLRLLA TQPDDAGCYT 

CVPSNGLLHP PSASAYLTVL CMPGVIRCPV RANPPLLFVS WTKDGKALQL DKFPGWSQGT 

EGSLIIALGN EDALGEYSCT PYNSLGTAGP SPVTRVLLKA PPAFIERPKE EYFQEVGREL 

LIPCSAQGDP PPWSWTKVG RGLQGQAQVD SNSSLILRPL TKEAHGHWEC SASNAVARVA 

TSTNVYVLGT SPHWTKVSV VALPKGANVS WEPGFDGGYL QRFSVWYTPL AKRPDRMHHD 

WVSLAVPVGA AHLLVPGLQP HTOYQFSVLA QNKLGSGPFS EIVLSAPEGL PTTPAAPGLP 

PTEIPPPLSP PRGLVAVRTP RGVLLHWDPP ELVPKRLDGY VLEGROGSQG WEVLDPAVAG 

TETELLVPGL IKDVLYEFRL VAFAGSFVSD PSNTANVSTS GLEVYPSRTQ LPGLLPQPVL 

AGWGGVCFL GVAVLVSILA GCLLNKKRAA RRRRKRLROD PPLIFSPTGK SAAPSALGSG 

SPDSVAKLKL QGSPVPSLRQ SLLWGDPAGT PSPKPDPPSS RGPLPLEPIC RGPDGRFVMG 

PTVAAPQERS GREQAEPRTP AQRLARSFDC SSSSPSGAPQ PLCIEDISPV APPPAAPPSP 

LPGPGPLLQY LSLPFFREMH VDGDWPPLEE PSPAAPPDYM DTRRCPTSSF LRSPETPPVS 

PRESLPGAVV GAGATAEPPY TALADWTLRE RLLPGLLPAA PRGSLTSQSS GRGSASFLRP 

PSTAPSAGGS YLSPAPGDTS SWASGPERWP RREKWTVSK RRNTSVDENY EWDSEFPGDM 

ELLETLHLGL ASSRLRPEAE TELGVKTPEE GCLLNTAHVT GPEARCAALR EEFLAFRRRR 
Y RQPVPHPEQA ILL 



Seq ID NO: C309 Protein Sequence 



I I I I I I 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGFGTVFAGH RLTDRLQVAI 

KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGV I RLLDWFETQE GFMLVLERPL 

PAQDLFDYIT EKGPLGEGPS RCFFGQWAA IQHCHSRGW HRDIKDENIL IDLRRGCAKL 

IDFGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT WSLGILLYD MVCGDIPFER 
DQEILEAELH FPAHVSPDCC ALIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 
RPCPFGLVIA TLSLAWPGIA PNGQKSHPMA MSQG 

Seq ID NO: C310 Protein Sequence 



I I I I I I 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 
VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVOT, IFPRCQKEDA NGNIVYEKNC 
RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 
GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY WTDQIPVFV 
TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 
HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 
RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF WTCQGSIPT EVCTIISDPT 
CEITQNTVCS PVCVDEMCLL TVRRTFNGSG TYCVMLTLGD DTSLALTSTL ISVPDRDPAS 
PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVF 
FPGNQEKDPL LKNQEFKGVS 



MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG K 

QSPINIDEDL TQVNVNLKKL KFQGKDKTSL ENTFIHNTGK TVEINLTNDY R __ 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFSEAV KGKGKLRALS 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFRSQQY KFSRQVFSSY 



1354 
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TGKEEIHEAV CSSEPENVQA DPEHYTSLLV TWERPRWYD TMIEKFAVLY QQLDOEDQTK 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VSC-TSASL1ID 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG HVWFPSSTDI 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 
TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 
LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 
IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 
KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 
NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 
GRWTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKREAVG PVWECSAGV GRTGTYIVLD 
SMLQQIQEEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 
YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSP.VGI 
SSLSGEGTDY INASYIMGYY QSNEFIITQE PLLHTIKDFW RMIWDIMAQL WMIPDGQNM 
AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 
CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHGLEKEN 
SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 



1 11 21 31 41 51
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MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIEEAV CSSEPENVQA DPEtTYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840

ILPQVTSATE SDKVPLIIASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVY'PSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE IJNFSVQPTIIT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAIIPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR 1680 

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920

AKRHAVGPW VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040 

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 
HTIKDFWRMI WDHNAQLWM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 
NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220
VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 2315 

Seq ID NO: C313 Protein Sequence
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MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTKYNRI GTKYNEAKTN 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS F ' 



ENISQGYIFS S 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 
TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNA3ASNS SHESRIGLAE GLESEKKAVI 
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PLVIVSALTF ICLWLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDKSRVKL AQLA3KDGKL TDYINANYVD GYNRPKAYIA AQCrLKSTAE DFWRMIWEHN 960

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLW 1260

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEKGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 

AALPDGNIAE SLESLV 1456

Seq ID NO: C314 Protein Sequence 
Protein Accession #: Eos sequence






F HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF D. 
3FKAIID GVESVSRFGK QAALDPFILL N: 
PCTDTDWIVF KDTVSISESQ LAVFCEVLTM QQSGYVMLMD YLQNNFREQQ YKFSRQVFS3 



KHEFLTDGYQ DLGAILNNLL PNMSYVLQIV AICTNGLYGK YSDQLIVDMP T 

ELIGTEEIIK EEEEGKDIEE GAIVNPG 

SGKGDVPNTS LNSTSQPVTK L 
HMNLSGTAES LNTVSITEYE E 
SENISQGYIF SSENPETITY DVLIPESARN ASEDSTSSGS E 
ITAQPDVGSG RESFLQTNYT EIRVDESEKT TKSFSAGPVM SQGPSVTDLE MPHYSTFAYF
PTEVTPHAFT PSSRQQDLVS TVNWYSQTT QPVYNEASNS SHESRIGLAE GLESEKKAVI 
PLVIVSALTF ICLWLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 
PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 
VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE D 
VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV
GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA 
TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 
VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 
VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LIITIKDFWRM IWDIINAQLW 
MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY
VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVEDEHGGVT AGTFCALTTL 
MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 



1 11 21 31 41 51 
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50 MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKHWG KKYPTCNSPK 60 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360
Y SDQLIVDMPT DNPELDLFPE 420 



RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSS ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

P SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

7LVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

iSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

A QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960

IRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP 1320

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 

RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 1494 



MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG K 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY E 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 
ILFEVGTEEN LDFKAI IDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 
K DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDHPT DNP3ASNSSH 
ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LWLVGILIY WRKCFQTAHF YLEDSTSPRV 
ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEl. r 5TLKEFYQEV (JSCTVDLGIT 
ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 
GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV 
LAYYTVRNFT LRNTKIKKGS QKGRPSGRW TQYHYTQWPD MGVP3YSI I I RKAAYA 
KEHAVGPVW HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLXHIRSQ RNYLVQTEEQ 
YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 
AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 
TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL MAEEHKCLSN 
EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIKEE AANRDGPMIV 
HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 
LVSTRQEENP STSLDSHGAA LPDGNIAESL ESL 

Seq ID NO: C317 Protein Sequence



MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNKG 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY 
S EHSLEGQKFP LEMQIYC 



K DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 
V CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 
TEVTPHAFIP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 
LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 
IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 
AHDHSRVKLA QIJXKKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIMEHNV 
EVIVMITNLV EKCRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 
SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRIIAVGPW VHCSAGVGRT 
GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QY\'FIHDTLV EAILSKETEV 
LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 
SRVARTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 
YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP 
INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 
FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 
RPGVFADIEQ YQFLYKVILS LVSTRQEENP STSLDSNGAA LPDGNIAESL ESL 



FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 
TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP

TjVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 

D LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 
A QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 
V EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 
SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 
GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 
LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY SAALKQCNRE KNRTSSIIPV 
ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL HTIKDFWRMI WDHNAQLWM 
IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS NEEKLIIQDF ILEATQAWRS 
DGRNFLCSDN PYAPTRKRKF RGCLPGSQDD QSDEARSLC 

Seq ID NO: C319 Protein Sequence



MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 



VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV 
2 LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTE

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDC-QRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQS ITVGVAPIW VIASWTFSV 420

KMTLGFDLTA AQAFTWTVF HSMIFALKVT PFSVKSLSEA SVAVDRFKSL FLM3EVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPSEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWKL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SHAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLL I QE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 1437

Seq ID NO: C320 Protein Sequence 



MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLF 
VAHKKGEL3M EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV V
LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI V 
ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 
PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 
VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 
HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK
NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 
VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD E 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIY. 
NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTI

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPSEGQL VQLEEKGQGS 
VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSl.i L SYWIKQGS J TTVTRGNETS 
VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 
KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 
LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL
LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 
GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 
RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 
GLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 



I I I I I I 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 
DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 
SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 
DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 
RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMRET/EGEL ERKELEFDTN MDAVQMVITE

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 
R QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL SQQ 
MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GI\
NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 
NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 
SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 
NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 
DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 

Seq ID NO: C324 Protein Sequence 
Protein Accession #: NP_00S171.1

I I I I I I

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 
NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 
NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 
SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 
NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 
DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 
LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 
KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 
VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 
AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 
PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 
EVYELMLGCW QREPIIMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: C325 Protein Sequence 
Protein Accession #: Eos sequence 
MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP
NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 
NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 
SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 
NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPSI 
DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 

Seq ID NO: C326 Protein Sequence 
MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60

LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120 

FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEY I PDGAFD HLVGLTKLNL 180 

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300 

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360

G LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420 

" N VAVPSVHVPE 4 80 

W GMTQAQSGLA 540 
MRALLARLLL CVLWSDSKG SNELHQVPSN CDCLNG3TCV SNKYFSNIHW CNCPKKFGGQ

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PESLKFQCGQ KTLRPRFKII 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPKY YGSEVTTKML 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 
SHTKEENGLA L 

Seq ID NO: C328 Protein Sequence
Protein Accession #: XP_087254.1 
MQFRECSING MKYQEINGRL VPEGPTPD3S EGNLSYLSSL SHLNNLSHLT TSSSFRTSPE
NETELIKEHD LFFKAVSLCH TVQISNVQID CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 
AAARIGIVFI GNSEETMEVK TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGEKLLFAKG 
AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 
REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRETIEALR MAGIKWVLT GDKHETAVSV 
SLSCGHFHRT MNILELINQK SDSECAEQLR QLARRITEDH VIQHGLWDG TSLSLALREH 
EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 
GIMGKEGRQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 
QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDISKNRLL 
SIKTFLYWTI LGFSHAF IFF FGSYLLIGKD TSLLGNGQMF GNWTFGTLVF TVMVITVTVK 
MALETHFWTW INHLVTWGSI IFYFVF5LFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 
C LDSMCCFPEG EAACASVGRM 



\ AACPLPPVRG GAADAPGLLG VPSNASVNAS SAASPSPRGC KPRRPPGPPS 
R RRLCNISVQR QMLSSLLVRW GRPRGFQCDL LLFSTNAKGR AFFAAAFHRV 
GPPLLIEHLG LAAGGAQQDL RLCVGCGWVR GRRTGRL 
AAEPPGPLWL QGEPLHFCCL DFSLEELQGE PGWRLHJ 
LIWPVPIIAG FLPNGMEQRR TTASTTAATP AAVPAGTTAA A 



I I I I I I

MDLHLFDYSE PGNFSDISWP CNSSDCIWD TVMCPNMPNK SVLLYTLSFI YIFIFVIGMI 
ANSWVWVNI QAKTTGYDTH CYILNLAIAD LWWLTIPVW WSLVQHNQW PMGELTCKVT 
HLIFSINLFG SIFFLTCMSV DRYLSITYFT NTPSSRKKMV RRWCILVWL KAFCVSLPDT 
YYLKTVTSAS NNETYCRSFY PEIISIKEWLI CMELVSWLG FAVPFSIIAV FYFLLARAIS 
AS DQEKH3S RKIIFSY* r ' FLVCKLPYH" AVLLDIFSIL EYIPFTCRLE HALFTALItVT 
QCLSLVHCCV NPVLYSFINR NYRYELHKAF IFKYSAKTGL TKLIDASRVS ETEYSALEQS 



MGFVRQIQLL LWKNWTLRKR QKIRFWELV WPLSLFLVLI WLRNANPLYS HHECHFPNKA 
MPSAGMLPWL QGIFCNVNNP CFQSPTPGES PGIVSNYNNS ILARVYRDFQ ELLMNAPESQ 
HLGRIWTELH ILSQFMDTLR THPERIAGRG IRIRDILKDE ETLTLFLIKN IGLSDSWYL 
LINSQVRPEQ FAHGVPDLAL KDIACSEALL ERFIIFSQRR GAKTVRYALC SLSQGTLQWI 
EDTLYANVDF FKLFRVLPTL LDSRSQGINL RSWGGILSDM SPRIQEFIHR PSHQDLLWVT 
RPLMQNGGPE TFTKLMGILS DLLCGYPEGG GSRVLSFNWY EDNNYKAFLG IDSTRKDPIY 
SYDRRTTSFC NALIQSLESN PLTKIAWRAA KPLLMGKILY TPDSPAARRI LKNANSTFEE 
LEHVRKLVKA WEEVGPQIWY FFDNSTQMNM IRDTLGNPTV KDFLNRQLGE EGITAEAILN 
FLYKGPRESQ ADDMANFDWR DIFNITDRTL RLVNQYLECL VLDKFESYND ETQLTQRALS 
LLEENMFWAG WFPDMYPWT SSLPPHVKYK IRMDIDWEK TNKIKDRYWD SGPRADPVED 
FRYIWGGFAY LQDMVEQGIT RSQVQAEAPV GI YLQQMPYP CFVDDSFMII LNRCFPIFMV 
LAWIYSVSMT VKSIVLEKEL RLKETLKNQG VSNAVIWCTW FLDSFSIMSM SIFLLTIFIM 
HGRILHYSDP FILFLFLLAF STATIMLCFL LSTFFSKASL AAACSGVIYF TLYLPHILCF 
AWQDRMT AE L KKAVSLLSPV AFGFGTEYLV RFEEQGLGLQ WSNIGNSPTE GDEFSFLLSM 
QMMLLDAACY GLLAWYLDQV FPGDYGTPLP WYFLLQESYW LSGEGCSTRE ERALEKTEPL 
TEETEDPEHP EGIHDSFFER EHPGWVPGVC VKNLVKIFEP CGRPAVDRLN ITFYENQITA 
FLGHNGAGKT TTLSILTGLL PPTSGTVLVG GRDIETSLDA VRQSLGMCPQ HNILFHHLTV 
AEHMLFYAQL KGKSQEEAQL EMEAMLEDTG LHHKRNEEAQ DLSGGMQRKL SVAIAFVGDA 
KWILDEPTS GVDPYSRRSI WDLLLKYRSG RTIIMPTKHM DEADHQGDRI AIIAQGRLYC 
SGTPLFLKNC FGTGLYLTLV RKMKNIQSQR KGSEGTCSCS SKGFSTTCPA HVDDLTPEQV
LDGDVNELMD WLHHVPEAK LVECIGQELI FLLPNKNFKH RAYASLFREL EETLADLGLS 
E EIFLKVTEDS DSGPLFAGGA QQKRENVNPR HPCLGPREKA GQTPQDSHVC 
E GQPPPEPECP GPQLNTGTQL VLQHVQALLV KRFQKTIRSH KDFLAQIVLP 
ATFVFLALML SIVILPFGEY PALTLHPWIY GQQYTFFSMD EPGSEQFTVL ADVLLNKPGF 
GNRCLKEGWL PEYPCGNSTP WKTPSVSPNI TQLFQKQKWT QVNPSPSCRC STREKLTMLP 
ECPEGAGGLP PPQRTQRSTE ILQDLTDRNI SDFLVKTYPA LIRSSLKSKF WVNEQRYGGI 
SIGGKLPWP ITGEALVGFL SDLGRIMNVS GGPITREASK EIPDFLKHLE TEDNIKVWFN 
R ASLPKDRSPE EYGITVISQP LNLTKEQLSE ITVLTTSVDA 
Y LIQERVNKSK HLQFISGVSP TTYNVTNFLW DIHNYSVSAG 
LWGIFIGFQ KKAYTSPENL PALVALLLLY GWAVIPMMYP ASFLFDVPST AYVALSCANL 
FIGINSSAIT FILELFDNNR TLLRFNAVLR KLLIVFPHFC LGRGLIDLAL SQAVTDVYAR 
FGEEHSANPF HWDLIGKNLF AMWEGWYF LLTLLVQRHF FLSQWIAEPT K 
VAEERQRI IT GGNKTDILRL HELTKIYLGT SSPAVDRLCV GVRPGEC 
TFKMLTGDTT VTSGDATVAG KSILTNISEV HQNMGYCPQF DAIDELI 
GVPAEEIEKV ANWSIKSLGL TVYADCLAGT YSGGNKRKLS TAIALIGCPP LVLLDEPTTG 
MDPQARRMLW NVIVSIIRKG RAWLTSHSM EECEALCTRL AIMVKGAFRC MGTIQHLKSK 
FGDGYIVTMK IKSPKDDLLP DLNPVEQFFQ GNFPGSVQRE RHYNMLQFQV SSSSLARIFQ 
LLLSHKDSLL IEEYSVTQTT LDQVFVNFAK QQTESHDLPL HPRAAGASRQ AQD 
MVPHAILARG RDVCRRNGLL ILSVLSVIVG CLLGFFLRTR RLSPQEISYF QFPGELLMRM
LKMMILPLW SSLMSGLASL DAKTSSRLGV LTVAYYLWTT FMAVIVGIFM VSIIHPGSAA 
QKETTEQSGK PIMSSADALL DLIRNMFPAN LVEATFKQYR TKTTPWXSP KVAPEEAPPR 
RILIYGVQEE NGSHVQNFAL DLTPPPEWY KSEPGTSDGM NVLGIVFFSA TMGIHLGRMG 
DSGAPLVSFC QCLNESVMKI VAVAVWYFPF GIVFLIAGKI LEMDDPRAVG KKLGFYSVTV 
VCGLVLHGLF ILPLLYFFIT KKNPIVFIRG ILQALLIALA TSSSSATLPI TFKCLLENNH 
IDRRIARFVL PVGATINMDG TALYEAVAAI FIAQVNNYEL DFGQIITISI TATAA9IGAA 
GIPQAGLVTM VIVLTSVGLP TDDITLI IAV DWALDRFRTM INVLGDALAA GIMAHICRKD 
FARDTGTEKL LPCETKPVSL QEIVAAQQNG CVKSVAEASE LTLGPTCPKH VPVQVERDEE 
LPAASLNHCT IQISELETNV 

Seq ID NO: C333 Protein Sequence
MVTVGNYCEA EGPVGPAWMQ DGLSPCFFFT LVPSTRMALG TLALVLALPC RRI
SLSWGAGPRI SPYVLQLLLA TLQAALPLAG LAGRVGTARG APLPSYLLLA SVL3SLAGAC 
GLWLLWERS QARQRLAMGI WIKFRHSPGL LLLWTVAFAA ENLALVSWWS PQWWWARADL 
GQQVQFSLWV LRYWSGGLF VLGLWAPGLR PQSYTLQVHE EDQDVERSQV RSAAQQSTWR 
DFGRKLRLLS GYLWPRGSPA LQLWLICLG LMGLERALNV LVPIFYRNIV NLLTEKAPWN 
SLAWTVTSYV FLKFLQGGGT GSTGFVSNLR TFLWIRVQQF TSRRVELLIF SHLHELSLRW 
HLGRRTGEVL RIADRGTSSV TGLLSYLVFN VIPTLADIII GIIYFSMFFN AWFGLIVFLC 
MSLYLTLTIV VTEWRTKFRR AMNTQENATR ARAVDSLLNF ETVKYYNAES YEVERYREAI 
IKYQGLEWKS SASLVLLNQT QNLVIGLGLL AGSLLCAYFV TEQKLQVGDY VLFGTYIIQL 
YMPLNWFGTY YRMIQTNFID MENMFDLLKE ETEVKDLPGA GPLRFQKGRI EFENVHFSYA 
DGRETLQDVS FTVMPGQTLA LVGPSGAGKS TILRLLFRFY DISSGCIRID GODISQVTQA 
SLRSHIGWP QDTVLFNDTI ADNIRYGRVT AGNDEVEAAA QAAGIHDAIM AFPEGYRTQV 
GERGLKLSGG EKQRVAIART ILKAPGIILL DEATSALDTS NERAIQASLA KVCANRTTIV 
VAHRLSTWN ADQILVIKDG CIVERGRHEA LLSRGGVYAD MWOLQQGOEE TSEDTKPQTM 



I I I I I 

MLLETQDALY VALELVIAAI, SVAGNVLVCA AVGTANTLQT PTNYFLVSLA A 
IPFAITISLC FCTDFYCCLF LACFVLVLTQ SSIFSLLAVA VDRYLAICVP LRYKSLVTGT 
RARGVIAVLW VIiAFGIGLTP FLGWNSKDSA TNNCTEPWDG TTNESCCLVK CLFENWPMS 
F ^F(,( VLPPLLIMLV IYIKIFLVAC RQLQRTELMD HSRTTLQREI HAAKSLAMIV 
GIFALCWLPV HAVNCVTLFQ PAQGKNKPKW AMNMAILLSH ANSWNPIVY AYRNRDFRYT 
FHKI ISRYLL CQADVKSGNG QAGVQPALGV GL 
MGLGARGAWA ALLLGTLQVL ADLGAAHESA AMAETLQHVP SDHTNETSNS TVKPPTSVAS
DSSNTTVTTM KPTAASNTTT PGMVSTNNTS TTLKSTPKTT SVSQNTSQIS TSTMTVTHNS 
SVTSAASSVT ITTTMHSEAK KGSKFDTGSF VGGIVLTLGV LSILYIGCKM YYSRRGIRYR 
T IDEHDAI I 



1 11 21 31 41 51 

I I I I I I 

MAQHGAMGAF RALCGLALLC ALSLGQRPTG GPGCGPGRLL LGTGTDARCC RVKTTRCCRD 
YPGEECCSEW DCMCVQPEFH CGDPCCTTCR HHPCPPGQGV QSQGKFSFGF QCIDCASGTF 
SGGHEGHCKP WTDCTQFGFL TVFPGNKTHN AVCVPGSPPA EPLGWLTWL LAVAACVLLL 
TSAQLGLHIW QLRSQCMWPR ETQLLLEVPP STEDARSCQF PEEERGERSA EEKGRLGDLW 



I I I I I I 

MGCDGRVSGL LRRNLQPTLT YWSVFFSFGL CIAFLGPTLL DLRCQTHSSL PQISWVFFSC 
QLCLLLGSAL GGVFKRTLAQ SLWALFTSSL AISLVFAVIP F 
CIDTVANMQL VRMYQKDSAV FLQVLHFFVG FGALLSPL ~ 
RGHLFHVSRV LGQHHVDAKP WSNQTFPGLT PKDGAGTK 
LSKERLLTCC PQRRPLLLSA DELALETQPP EKEDASSLPP KFQSHLGEED LFSCCQRKNL 
RGAPYSFFAI HITGALVLFM TDGLTGAYSA FVYSYAV3KP LSVGHKVAGY LPSLFWGFIT 
LGRLLSIPIS SRMKPATMVF INWGWVTF LVLLIFSYNV VFLFVGIASL GLFLSSTFPS 
MLAYTEDSLQ YKGCATTVLV TGAGVGEMVL QMLVGSIFQA QGSYSFLVCG VIFGCLAFTF 
HPGLPSVPTQ DRSIGMENSE CYQR



RNMGTGGFLT CGPLWAQQCG NQYYTTGVCS DISPDFQLSA SFSPATQPCP S 
ESNSIYPWDA VKNFLEKPVQ GLDIGPTKTQ VGLIQYANNP RWFNLNTYK T 

QTSQYGGDLT NTFGAIQYAR KYAYSAASGG RRSATKVMW VTDGESHDGS MLKAVIDQCN 300 

HDNILRFGIA VLGYLNRNAL DTKNLIKEIK AIASIPTERY FFNVSDEAAL LEKAGTLGEQ 360 

IFSIEGTVQG GDNFQMEMSQ VGFSADYSSQ NDILMLGAVG AFGWSGTIVQ KTSHGHLIFP 420 

KQAFDQILQD RNHSSYLGYS VAAISTGEST HFVAGAPRAN YTGQIVLYSV NENGNITVIQ 480 

AHRGDQIGSY FGSVLCSVDV DKDTITDVLL VGAPMYMSDL KKEEGRVYLF TIKKGILGQH 540 

QFLEGPEGIE NTRFGSAIAA LSDINMDGFN DVIVGSPLEN QNSGAVYIYN GHQGTIRTKY 600 

SQKILGSDGA FRSHLQYFGR SLDGYGDLNG DSITDVSIGA FGOVVQLWSQ SIADVAIEAS 660 

FTPEKITLVN KNAQIILKLC FSAKFRPTKQ NNQVAIVYNI TLDADGFSSR VTSRGLFKEN 720 

NERCLQKNMV VNQAQSCPEH IIYIQEPSDV VNSLDLRVDI SLENPGTSPA LEAYSETAKV 780 

FSIPFHKDCG EDGLCISDLV LDVRQIPAAQ EQPFIVSNQN 5RLTFSVTLI I ESAYNTG 840 

IWDFSENLF FASFSLPVDG TEVTCQVAAS QKSVACDVGY PALKREQQVT FTINFDFNLQ 900 

NLONQASLSF QALSESQEEN KADNLVNLKI PLLYDAEIHL TRST.MINFYE ISSDGNVPSI 960 

VHSFEDVGPK FIFSLKVTTG SVPVSMATVI IHIPQYTKEK NPLMYLTGVQ TDKAGDISOJ 1020 

ADINPLKIGQ TSSSVSFKSE NFRHTKELNC RTASCSNVTC WLKDVKMKGE YFVNVTTRIW 1080 

NGTFASSTFQ TVQLTAAAE I NTYNPEIYVI EDNTVTIPLM IMKPDEKAEV PTGVIIGSII 1140 

AGILLLLALV AILWKLGFFK RKYEKMTKNP DEIDETTELS S 1181 

I I I I I I

MYRPRARAAP EGRVRGCAVP STVLLLLAYL AYLALGTGVF WTLEGRAAQD SSRSFQRDKW 60 

ELLQNFTCLD RPALDSLIRD WQAYKHGAS LLSNTTSMGR WELVGSFFFS VSTITTIGYG 120 
NLSPNTMAAR LFCIFFALVG IPLNLWLNR LGHLMQQGVN HWASRLGGTW QDPDKARWLA 180 

GSGALLSGLL LFLLLPPLLF SHMEGWSYTE GFYFAFITLS TVGFGDYVIG MNPSQRYPLW 240 
YKHMVSLWIL FGMAWLAL 1 1 KLILSQLETP GRVCSCCHHS SKEDFKSQSW RQGPDREPES 300 
HSPQQGCYPE GPMGI IQHLE PSAHAAGCGK OS 332 



MEWDNGTGQA LGLPPTTCVY RENFKQLLLP PVYSAVIiARG LPLNICVITQ I 
TAVYTLNLAL ADLLYACSLP LLIYNYAQGD HWPFGDFACR LVRFLFYANL » 
SFORYLGICH PLAPWHKRGG RRAAWLVCVA WLAVTTQCL PTAIFAATGI QRNRTVCYDL 
SPPA1.ATHYM PYGMALTVIG FLLPFAALLA CYCLLACRLC RQDGPAEPVA QERRGKAARM 
AWVAAAF A I SFLPFHITKT AYLAVRSTPG VPCTVLEAFA AAYKGTRPFA SANSVLDPIL 



MQRPGPRLWL VLQVMGSCAA ISSMDMERPG DGKCQPIEIP MCKDIGYNMT RMPNLMGHEN 
QREAAIQLHE FAPLVEYGCH GHLRFFLCSL YAPMCTEQVS TPIPACRVNC EQARLKCSPI 
MEQFNFKWPD SLDCRKLPNK NDPNYLCMEA PtJNGSBEPTR GSGLFPPLFR PQRPHSAQEH 
PLKDGGPGRG GCDNPGKFHH VEKSASCAPL CTPGVDVYWS REDKRFAWW LAIWAVLCFF 
SSAFTVLTFL IDPARFRYPE RPIIFLSMCY CVYSVGYLIR LFAGAESIAC DRDSGQLYVI 
QEGLESTGCT LVFLVLYYFG MASSLWWWL TLTWFLAAGK KWGHEAIEAN SSYFHLAAWA 
IPAVKTILIL VMRRVAGDEL TGVCYVGSMD VNALTGPVLI PLACYLVIGT SFILSGFVAL 
FHIRRVMKTG GENTDKLEKL MVRIGLFSVL YTVPATCVIA CYFYERLNMD YWKILAAQHK 
CKMNNQTKTL DCLMAASIPA VEIFMVKIFM LLWGITSGM WIWTSKTLQS WQQVCSRRLK 
KKSRRKPASV ITSGGIYKKA QHPQKTHHGK YEIPAQSPTC V 



Protein Accession #: NP_005752.1 

1 11 21 31 41 51 

I I I I I I 

MEVSRRKAP P RPPRPAAPLP LLAYLLALAA PGRGADEPVW RSEQAIGAIA ASQEDGVFVA 
SGSCLDQLDY SLEHSLSRLY RDQAGNCTEP VSLAPPARPR PGSSFSKLLL PYREGAAGLG 
GLLLTGWTFD RGACEVRPLG NLSRNSLRNG TEWSCHPQG STAGWYRAG RNNRWYLAVA 
ATYVLPEPET ASRCNPAASD HDTAIALKDT EGRSLATQEL GRLKLCEGAG SLHFVDAFLW 
NGSIYFPYYP YNYT SGAATG WPSMARIAQS TEVLFQGQAS LDCGHGHPDG RRLLLSSELV 
EALDVWAGVF SAAAGEGQER RSPTTTALCL FRHSEIQARA KRVSWDFKTA ESHCKEGEQP 
ERVQPIASST LIHSDLTSVY GTWMNRTVL FLGTGDGQLL KVILGENLTS NCPEVIYEIK 
EETPVFYKLV PDPVKNIYIY LTAGKEVRRI RVANCNKHKS CSECLTATDP HCGWCHSLQR 
CTFQGDCVHS ENLENWLDIS SGAKKCPKIQ IIRSSKEKTT VTMVGSFSPR HSKCMVKNVD 
SSRELCQNKS QPNRTCTCSI PTRATYKDVS WNVMFSFGS WNLSDRFNFT NCSSLKECPA 
CVETGCAWCK SARRCIHPFT ACDPSDYERN QEQCPVAVEK TSGGGRPKEN KGNRTNOALQ 
VFYIKSIEPQ KVSTLGKSNV IVTGANFTRA SNITMILKGT STCDKDVIQV SHVLNDTHMK 
PSLPSSRKBM KDVCIQFDGG NCSSVGSLSY IALPHCSLIF PATTWISGGQ NITMMGRNFD
VIDNLIISHE LKGNINVSEY CVATYCGFLA PSLKSSKVRT NVTVKLRVQD TYLDCGTLQY 
REDPRFTGYR VESEVDTELE VKIQKENDNF NISKKDIEIT LFHGENGQLN CSFENITRNQ 
DLTTILCKIK GIKTASTIAN SSKKVRVKLG NLELYVEQES VPSTWYFLIV LPVLLVIVIF 
AAVGVTRHKS KELSRKQSQQ LELLESELRK EIRDGFAELQ MDKLDWDSF GTVPFLDYKH 
FALETFFPES GGFTHIFTED MHNRDANDKN ESLTALDALI CNKSFLVTVI HTLEKQKNFS 
VKDRCLFASF LTIALQTKLV YLTSILEVLT RDLMEQCSNM QPKLMLRRTE SWEKLLTNW 
MSVCLSGFLR ETVGEPFYLL VTTLNQKINK GPVDVITCKA LYTLNEDWLL WQVPEFSTVA 
LNWFEKIPE NESADVCRNI SVNVLDCDTI GQAKEKIFQA FLSKNGSPYG LQLNEIGLEL 
QMGTRQKELL DIDSSSVILE DGITKLNTIG HYEISNGSTI KVFKKIANFT SDVEYSDDHC 
HLILPDSEAF QDVQGKRHRG KHKFKVKEMY LTKLLSTKVA IHSVLEKLFR SIWSLPNSRA 
PFAIKYFFDF LDAQAENKKI TDPDWHIWK TNSLPLRFWV NILKNPQFVF DIKKTPKIDG 
CLSVIAQAFM DAFSLTEQQL GKEAPTNKLL YAKDIPTYKE EVKSYYKAIR DLPPLSSSEM 
EEFLTQESKK HENEFNEEVA LTEIYKYIVK YFDEILNKLE RERGLEEAQK QLLHVKVLFD 



1 11 21 31 41 51

I I I I I I 

MTILGTTFGM VFSLLQWSG ESGYAQNGDL EDAELDDYSF SCYSQLEVNG SQHSLTCAEE 
DPDVNTTNLE FEICGALVEV KCLNFRKLQE IYFIETKKFL LIGKSNICVK VGEKSLTCKK 
IDLTTIVKPE APFDLSVIYR EGANDFWTF NTSHLQKKYV KVLMHDVAYR QEKDENKWTH 
VNLSSTKLTL LQRKLQPAAM YEIKVRSIPD HYFKGFWSEW SPSYYFRTPE INNSSGEMDP 
ILLTISILSF FSVALLVILA CVLWKKRIKP IVWPSLPDHK KTLEHLCKKP RKNLNVSFNP 
ESFLDCQIHR VDDIQARDEV EGFLQDTFPQ QLEESEKQRL GGDVQSPNCP SEDVWTPES 
FGRDSSLTCL AGNVSACDAP ILSSSRSLDC RESGKNGPHV YQDLLLSLGT TNSTLPPPFS 
LQSGILTLNP VAQGQPILTS LGSNQEEAYV TMSSFYQNQ 



MTWRHHVRLL FTVSLALO 1 1 NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE 
VTGSAEGWGP EEPLPYSHAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEEDQRR 
SECGALEIIGA WTLRACHLCR CIFCALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 
LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL 



NP_006S24.1 



MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 
TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 
TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 
Y MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 



MTQVTEKSTE HPEKTTSTTE K 

TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 

TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGSND SFPAWAIVIV VLVAVILLLV 

FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL M — 
MWPRLAFCCW GLALVSGWAT PQQMSPSRNF S

VERLGQAFRR RVRLLRELSE RLELVFLVDD SSSVGEVNPR SELMFVRKLL SDFPWPTAT
K NYWPRVDYI STRRARQHKC ALLLQEIPAI SYRGGGTYTK GAFQQAAQIL
:DGYSN GGDPRPIAAS LRDSGVEIFT FGIWQGHIRE LNDMASTPKE
JARRAL HEDLPSGSFI QDDMVHCSYL CDEGKDCCDR MGSCKCGTHT
S YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS P

G QTCELVHCPA LKPPENGYFI QNTCNNHFNA A 
LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREMLYKT TCLVACDEGY R 
QGNSQWDGPE PRCVERHCST FQMPKDVIIS PHNCGKQPAK FGTICYVSCR QGFILSGVKE 
MLRCTTSGKW NVGVQAAVCK DVEAPQINCP KDIEAKTLEQ QDSANVTWQI PTAKDNSGSK 
VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDAEPPVI DWCRSPPPVQ 
VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDLFPQGET IVQYTATDPS GKNRTCDIHI 
VIKGSPCEIP FTPVNGDFIC TPDNTGVNCT LTCLEGYDFT EGSTDKYYCA YEDGWKPTY 
TTEWPDCAKK RFANHGFKSF EMFYKAARCD DTDLMKKFSE AFETTLGKMV PSFCSDAEDI 
DCRLEENLTK KYCLEYNYDY ENGFAIGPGG WGAANRLDYS YDDFLDTVQE TATSIGNAKS 
SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTbEWENQQ RLLQTLSTIT NKLKRTLNKD 
PMYSFQLASE ILIADSNSLE TKKASPFCRP GSVLRGRMCV NCPLGTYYNL EHFTCESCRI 
GSYQDEEGQL ECKLCPSGMY TEYIHSRNIS DCKAQCKQGT YSYSGLETCE SCPLGTYQPK 
FGSRSCLSCP ENTSTVKRGA VNISACGVPC PEGKFSRSGL MPCHPCPRDY YQPNAGKAFC 
LACPFYGTTP FAGSRSITEC STSVLNITIF GGFGHLELLN CPSEVFHECF FNPCHNSGTC 
QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCLNNGVCKD LVGEFICECP SGYTGQRCEE 
NINECSSSPC LNKGICVDGV AGYRCTCVKG FVGLHCETEV NECQSNPCLN NAVCEDQVGG 
FLCKCPPGFL GTRCGKNVDE CLSQPCKNGA TCKDGANSFR CLCAAGFTGS HCELNINECQ 
SNPCRNQATC VDELKSYSCK CQPGFSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSL 
HALTCTFWMK SSDDMNYGTP ISYAVDNGSD NTLLLTDYNG WVLYVNGREK ITNCPSVNDG 
RWHHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KXGEGFSPAE 
SFVGSISQLN LWDYVLSPQQ VKSLATSCPE ELSKGNVLAW PDFLSGIVGK VKIDSKSIFC 
SDCPRLGGSV PHLRTASEDL KPGSKVNLFC DPGFQLVGNP VQYCLNQGQW TQPLPHCERI 
SCGVPPPLEN GFHSADDFYA GSTVTYQCNN GYYLLGDgRM FCTDNGSWNG VSPSCLDVDE 
CAVGSDCSEH ASCLNVDGSY ICSCVPPYTG DGKNCAEPIK CKAPGNPENG HSSGEIYTVG 
AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSCG KPAIPENGCI EELAFTFGSK 
VTYRCNKGYT LAGDKESSCL ANSSWSHSPP VCEPVKCSSP ENINNGKYIL SGLTYLSTAS 
YSCDTGYSLQ GPSIIECTAS GIWDRAPPAC HLVFCGEPPA IKDAVITGNN FTFRNTVTYT 
CKEGYTLAGL DTIECLADGK WSRSDQQCLA VSCDEPPIVD HASPETAHRL FGDIAFYYCS 
DGYSLADNSQ LLCNAQGKWV PPEGQDMPRC IAHFCEKPPS VSYSILESVS KAKFAAGSW 
SFKCMEGFVL NTSAKIECMR GGQWNPSPMS IQCIPVRCGE PPSIMKGYAS GSNYSFGAMV 
AYSCNKCFYI KGEKKSTCEA TGQWSSPIPT CHPVSCGEPP KVENGFLEHT TGRIFSSEVR 
YQCNPGYKSV GSPVFVCQAN RHWHSESPLM CVPLDCGKPP PIQNGFMKGE NFEVGSKVQF 
F NEGi LVG r 5 rCQKSG KWNKKSNPKC MPAKCPEPPL LENQLVLKEL T 
CKEGHVLQGP SVLKCLPSQQ WNDSFPVCKI V " " 
VGGFFLRGNS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN G 
GFELVGNTTT LCGENGHWLG GKPTCKAIEC L 
RLEGPSALTC LETGDWDVDA PSCNAIHCDS PQPIENGFVE GADYS YGAI I IYSCFPGPQV 2580 
AGHAMQ1CEE SGWSSSIPTC MPIDCGLPPII IDFCDCTKLK DDQGYFEQED DMMEVPYVTP 2640 
HPPYHLGAVA KTWENTKESP ATHSSNFLYG TKVSYTCNPG YELLGNPVLI CQEDGTWNGS 2700 
APSCISIECD LPTAPEKGFL RFTETSMGSA VQYSCKPGHI LAGSDLRLCL ENRKWSGASP 2760 
RCEAISCKKP NPVMNGSIKG StlYTYLSTLY YECDPGYVUJ GTERRTCQDD KNWDEDEPIC 2820
IPVDCSSPPV SANGOVRGDE YTFQKEIEYT CNEGFLLEGA RSRVCLANGS WSGATPDCVP 2880
VRCATPPQLA NGVTEGLDYG FMKEVTFHCH EGYILiHGAPK LTCQSDGNWD AEIPLCKPVN 2940 
CGPPEDLAHG FPNGFSFIHG GHIQYQCFPG YKLHGNSSRR CLSNGSWSGS SPSCLPCRCS 3000 
TPVIEYGTVN GTDFDCGKAA RIQCFKGFKL LGLSEITCEA DGQWSSGFPH CEHTSCGSLP 3060 
MIPNAFISET SSWKENVITY SCRSGYVIQG SSDLICTEKG WSQPYPVCE PLSCGSPPSV 3120 
ANAVATGEAH TYESEVKLRC LEGYTMDTDT DTFTCQKDGR WFPERISCSP KKCPLPENIT 3180
HILVHGDDFS VNRQVSVSCA EGYTFEGVNI SVCQLDGTWE PPFSDESCSP VSCGKPESPE 3240 
HGFWGSKYT FESTI IYQCE PGYELEGNRE RVCQENRQWS GGVAICKETR CETPLEFLNG 3300 
KADIEHRTTG PNWYSCNRG YSLEGPSEAH CTENGTWSHP VPLCKPNPCP VPFVIPENAL 3360 
LSEKEFYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR 3420 
GVHYQYGDMI TYSCYSGYML EGFLRSVCLE NGTWTSPPIC RAVCRFPCQN GGICQRPNAC 3480 
SCPEGWMGRL CEEPICILPC LNGGRCVAPY QCDCPPGWTG SRCHTAVCQS PCLNGGKCVR 3540 
JCSR 3557 



Protein Access 



A DELGGQISVL QRKGISRYEG ILANGVLSRD 
S SSWQSVKESG DQVHWSPGQA RLQDQGPSWA 
SGDSSNNHKP REWLEIDLGE KKKITGIRTT GSTQSNFNFY VKSFVMNFKH NNSKWKTYKG 
IVNNEEKVFQ GNSNFRDPVQ NNFIPPIVAR YVRWPQTWH QRIALKVELI GCQITQGNDS 
LVWRKTSQST SVSTKKEDET ITRPIPSEET STGINITTVA IPLVLLWLV ~ ~ " " 
RKKKKKGSPY GSAEAQKTDC WKQIKYPFAR HQSAEFTISY DNEKBMTQKL 

1364 
I I I I I I

MGFGAGQRLR PVPAPRSSAE EAARPGQLRL GIRRGEAELA KLAPSGVMVP GARGGGALAR 
AAGRGLLALL LAVSAPLRLQ AEELGDGCGH LVTYQDSGTM TSKNYPGTYP NHTVCEKTIT 
VPKGKRLILR LGDLDIESQT CASDYLLFTS SSDQYGPYGG SMTVPKELLL NTSEVTVRPE 
SGSHISGRGP LLTYASSDHP DLITCLERAS HYLKTEYSKF CPAGCRDVAG DISGNMVDGY 
RDTSLLCKAA IHAGIIADEL GGQISVLQRK GISRYEGILA NGVLSRDGSL SDKRFLFTSN 
GCSRSLSFEP DGQIRASSSW QSVHESGDQV HNSPGOARLQ BQGPSWASGD SSNNHKPREW 
LEIDLGEKKK ITGIRTTGST QSNFNFYVKS FVMNFKNNNS KWKTYKGIVN NSEKVFQGNS 
NFRDPVQNNF IPPIVARYVR WPQTWHQRI ALKVELIGCQ ITQGNESLVK RKTSQSTSVS 
TKKEDETITR PIPSEETSTG INITTVAIPL VLLWLVFAG MGIFAAFRKK KXKGSPYGSA 
EAQKTDCWKQ IKYPFARHQS AEFTISYDNE KEMTQKLDLI TSDMAG 



1 11 21 31 41 51

I I I I I I 

MFQRQERFLD LSSAEAVAAW ILHQHPDIIN KGDGCGHLVT YQDSGTMTSK NYPGTYPNHT 
VCEKTITVPK GKRLILRLGD LDIESOTCAS DYLLFTSSSD QYGMQK3EET EVLCLSVAGA 
QRVDIPVQLL PSFLEGWKGH ADARGPYCGS MTVPKELLLN TSEVTVRFES GSHISGRGFL 
LTYASSDHPD LITCLERASH YLKTEYSKFC PAGCRDVAGD ISGNMVDGYR DTSLLCKAAI 
HAGIIADELG GQISVLQRKG ISRYEGILAN GVLSRDGSLS DKRFLFTSNG CSRSLSFEPD 
GQIRASSSWQ SVNESGDQVH WSPGQARLQD QGPSWASGDS SNNHKPREWL EIDLGEKKKI
TGIRTTGSTQ SNFNFYVKSF VMNFKNNNSK WKTYKGIVNN EEKVFQGNSN FRDPVQNNFI 
PPIVARYVRV VPQTWHQRIA LKVELIGCQI TQGNDSLVWR KTSQSTSVST KKEDETITRP 
IPSEETSTDA MPVQIVGDHT QMISQRENLG PDEGKIPFKG TAESMVRWF AVWNDLGML 
FLAHTPEEDI DHYCWKQIKY PFARHQSAEF TISYDNEKEM TQKLDLITSD MADYQQPLMI 
GTGTVTRKGS TFRPMDTDAE EAGVSTDAGG HYDCPQRAGR HEYALPLAPP EPEYATPIVE 
RHVLRAHTFS AQSGYRVPGP QPGHKHSLSS GGFSPVAGVG AQDGDYQRPH SAQPADRGYD 
RPKAVSALAT ESGHPDSQKP PTHPGTSDSY SAPRDCLTPL NOTAMTALL 

Seq ID NO: C354 Protein Sequence



Y SMFTFNFLFW LCGILILALA IWVRVSNESQ AIFGSEDVGS SSYVAVDILI 

AVGAI IMILG FLGCCGAIKE SRCMLLLFFI GLLLILLLQV ATGILGAVFK SKSDRIVKET 

LYENTKLLSA TGESEKQFQE AIIVFQEEFK CCGLVNGAAD WGNNFQHYPE LCACLDKQRP 

CQSYNGKQVY KETCISFIKD FLAKNLI IVI GISFGLAVIE ILGLVFSMVL YCQIGNK 



MCTGGCARCL GGTLIPLAFF GFLANILLFF PGGKVIDDND HLSQEIWFFG GILGSGVLMI 
FPALVFLGLK NNDCCGCCGN EGCGKRFAMF TSTIFAWGF LGAGYSFIIS AISINKGPKC 
LMANSTWGYP FHDGDYLNDE ALWNKCREPL NWPWNLTLF SILLWGGIQ MVLCAIQWN 
GLLGTLCGDC QCCGCCGGDG PV 

Seq ID NO: C356 Protein Sequence 
Protein Accession #: NP_002372.1

1 11 21 31 41 51 

I I I I I I 

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 
PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTFVSRII DTLDIGPADT 
RVAWNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGLAIQTAM DEAFTVEAGA 
REPSSNIPKV AIIVTDGRPQ DQWEVAARA QASGIELYAV GVDRADMASL KMMASEPLEE
HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQCQHVCI SDGEGKHHCE CSQGYTLNAD 
KKTCSALDRC ALNTHGCEHI CVMDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 
HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG OQHICVSDGA ASYHCDCYPG 
YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVSS YLQRLNTKLD D 

Seq ID NO: C357 Protein Sequence



L RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPISETGGE 
GCPAVALIQ
MQPTLLLSLL GAVGLAAVNS MPVDNRNHNE GMVTRCIIEV LSNALSKSSA PPITPECRQV
LKTSRKDVKD KETTENENTK PEVRLLRDPA DASEAHESSS RC-EAGAPGES DIQGPTKADT 
EKWAEGGGHS RERADEPQWS LYPSDSQVSE EVKTRHSEKS QREDEEEEEG ENYQKGERGE 
DSSEEKHLEE PGETQNAFLN ERKQASAIKK EELVARSETH AAGHSQEKTH SRSKSSQESG 
EEAGSQENHP QESKGQPRSQ EESEEGEEDA TSEVDKRRTR PRHHHGRSRP DRSSQGGSLP 
SEEKGHPQEE SEESNVSMAS LGEKRDHHST HYRASEEEPE YGEEIKGYPG VQAPEDLEWE
RYRGRGSEEY RAPRPQSEES WDEEDKRNYP SLELDKMAHG YGEESEEERC- LEPGKGRHHR 
GRGGEPRAYP MSDTREEKRF LGEGHHRVQE NQMDKARRHP QGAWKELDRN YLNYGEEGAP 
GKWQQQGDLQ DTKENREEAR FQDKQYSSHH TAEKRKRLGE LFNPYYDPLQ WKSSHPERRD 
NMNDNFLEGE EENELTLNEK NFFPEYNYDW WEKKPFSEDV NWGYEKRNLA RVPKLDLKRQ 
YDRVAQLDQL LHYRKKSAEF PDFYDSEEPV STHQEAENEK DRADQTVLTE DEKKELENLA 
AMDLELQKIA EKFSQRG 



1 11 21 31 41 51 

I I I I I I 

MKLLCEGLKQ PNCVLQTLRW YRCLISSASC GALAAVLSTS QWLTELEFSE TKLEASALKL 
LYGGLKDPNC KLQKLNLQFS LSVTAAKLPV GMVGNCSGFS GSLVQSHFGY CQDSSFKCDL 
CKLLWPSTRV AAAKDCGSPK SFLSEGLNWA GRLEAVEEVL GLGVLVQPGD PASQGGGHCE 
NYGSFRDLVD LEVKAEPSLR KGGMDLQRPT LQWLLCKIF SLKLFLFIAL PNSPGQVSW 
QVTIPDGFVN VTVGSNVTLI CIYTTTVASR EQLSIQWSFF HKKEMEPISS PWEEGKWPDV 
EAVKGTLDGQ QAELQIYFSQ GGQAVAIGQF KDRITGSKDP GNASITISHM QPADSGIYIC 
DVNNPPDFLG QNQGILNVSV LVKPSKPLCS VQGRPETGHT ISLSCLSALG TPSPVYYWHK 
LEGRDIVPVK ENFNPTTGIL VIGNLTNFEQ GYYQCTAINR LGNSSCEIDL TSSHPEVGII 
VGALIGSLVG AAIIISWCF ARNKAKAKAK ERNSKTIAEL EPMTKINPRG ESEAMPREDA 
TQLEVTLPSS IHETGPDTIQ EPDYEPKPTQ EPAPEPAPGS EPMAVPDLDI ELELEPETQS 
ELEPEPEPEP ESEPGWVEP LSEDEKGWK A 



I I I I I I 

MV FAFWKVFL ILSCL.AGQVS VVQVTIPDGF VHVTOGSNVT LICIYTTTVA SRSOLSIQWS 
FFHKKEMEPI SSPWEEGKWP DVEAVKGTLD GQQAELQIYF SQGGQAVAIG QFKDRITGSN 
DPGNASITIS HMQPADSGIY ICDVNNPPDF LGQNQGILNV SVLVKPSKPL CSVQGRPETG 
HTISLSCLSA LGTPSPVYYW HKLEGRDIVP VKENFNPTTG ILVIGNLTNF EQGYYQCTAI 
NRLGNSSCE I DLTSSHPEVG IIVGALIGSL VGAAIIISW CFARNKAKAK AKERNSKTI A 
ELEPMTKINP RGESEAMPRE DATQLEVTLP SSIHETGPDT IQEPDYEPKP TQEPAPEPAP
GSEPMRVPDL DIELELEPET QSELEPEPEP EPESEPGWV EPLSEDEKGV VKA 



Seq I 



MVSRMVSTHL SGLLFWLASG WTPAFAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 
PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDFSED QGYPDPPNPC 
PVGKTDDGCL ENTPDTAEFS REFQLHQHLF DPEHDYPGLG KWNKKLLYEK MKGGERRKRR 
SVWPYLQGQR LDNWAKKSV PHFSDEDKDP E 



MTTMQGMEQA MPGAGPGVPQ LGNMAVIHSH LWKGLQEKFL KGEPKVLGW QILTALMSLS 
MGITMMCMAS NTYGSNPISV YIGYTIWGSV MFIISGSLSI AAGIRTTKGL VRGSLGHNIT 
SSVLAASGIL INTFSLAFYS FHHPYCNYYG NSNNCHGTMS ILMGLDGMVL LLSVLEFCIA 
/ LCCTPGGWL ILPSHSHMAE TASPTPLNEV 



MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 
R LSAPGSQREG RNPQLNQQ 



I I I I I I

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFW RENGTTCLMA 60

EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 120 

FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180 

ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 240 

LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG 280
SANQCAVPAK DRVDCGYPHV TPKECHNRGC CFDSRIPGVP









M RSSKGRSLIG KVDGTSEVTG KGVTVETVFS 60 



T GKLTTVPLPI VYTIVFWGL PSMGMALWVF LFRTKKKHPA VIYKANLALA 
DLLSVIWFPL KIAYHIHANN WIYGEALCNV LIGFFYGNMY CSILFMTCLG VQRYWVIVNP 
MGHSRKKANI AIGISIAIWL LILLVTIPLY WKQTIFIPA LNITTCHDVL PEQLLVGDMF 
NYFLSLAIGV FLFPAFLTAS AYVLMIRMLR SSAMDENSEK KEKRAIKLIV TVLAMYLICF 
TPSNLLLWH YFLIKSQGQS HVYALYIVAL CL3TLNSCID PFVYYFVSHD FRDHAKNALL 
CRSVRTVKQM QVSLTSKKHS RKSSSYSSSS TTVKTSY 



I I I I I 

3 AALSLIPLIF LISGAEAASF QRNQLLQKEP DLRLENVQKF PSI 
YIENLRQQAH KEESSPDYNP YQGVSVPLQQ KENGDSSKLP ERDSLSEEDW MRIILEALRQ 
AENEPQSAPK ENKPYALNSE KNFPMDMSDD YETQQWPERK LKHMQFPPMY SENSRDNPFK 
RTNEIVEEQY TPQSLATLES VFQELGKT.TG PNNQKRERMD EEQKLYTDDE DDIYKANNIA 
YEDWGGEDW NPVEEKIESQ TQEEVRDSKE NIGKNEQIND EMKRSGQLGI QEEDLRKESK

DQLSDDVSKV IAYLKRLVNA AGSGRLQNGQ NGERATRLFE KPLDSQSIYQ LIEISRNLQI 
PPEDLIEMLK TGEKPNGSVE PERELDLPVD LDDISEADLD HPDLFQNRML SKSGYPKTPG 
RAGTEALPDG LSVEDILNLL GMESAANQKT SYFPNPYNQE KVLPRLPYGA GRSRSNQLPK 
AAWIPHVENR QMAYENLNDK DQELGEYLAR MLVKYPEIIN SNQVKRVPGQ CSSEDDLQEE 
EQIEQAIKEH LNQGSSQETD KT.APVSKRFP VGPPKNDDTP NRQYWDEDLL KKVLEYLNQE

KAEKGREHIA KRAMENM 



I I I I I I 

HPCAQRSWLA NLSWAQLLN FGALCYGRQP QPGPVRFPDR RQEHFIKGLP EYHWGPVRV 60 

DASGHFLSYG LHYPITSSRR KRDLDGSEDW VYYRISHEEK DLFFNLTVNQ GFLSNSYIME 120 

KRYGNLSHVK MMASSAPLCH LSGTVLQQGT RVGTAAL SAC HGLTGFFQLP HGDFFIEPVK 180

KHPLVEGGYH PHIVYRRQKV PETKEPTCGL KDSWISQKQ ELWREKWERH NLPSRSLSRR 240 

SISKERWVET LWADTKMIE YHGSENVESY ILTIMNMVTG LFHNPSIGNA IHIVWRLIL 300 

LEEEEQGLKI VHHAEKTLSS FCKWQKSINP KSDLNPVHHD VAVLLTRKDI CAGFNRPCET 360 

LGLSHLSGMC QPHRSOIINE DSGLPLAFTI AHELGHSFGI QHDGKENDCE PVGRHPYIMS 420 

RQLQYDPTPL TWSKCSEEYI TRFLDRGWGF CLDDIPKKKG LKSKVIAPGV IYDVHHQCQL 480

QYGPNATFCQ EVENVCQTLW CSVKGFCRSK LDAAADGTQC GEKKWCMAGK CITVGKKPES 540 

IPGGWGRWSP WSHCSRTCGA GVQSAERLCN NPEPKFGGKY CTGERKRYRL CNVHPCRSEA 600 

PTFRQMQCSE FDTVPYKNEL YHWFPIFNPA HPCELYCRPI DGQFSEKMLD AVIDGTPCFE 660 

GGNSRNVCIN GICKMVGCDY EIDSNATEDR CGVCLGDGSS CQTVRKMFKQ KEGSGYVDIG 720 

LIPKGARDIR VMEIEGAGNF LAIRSEDPEK YYLNGGFIIQ WNGNYKLAGT VFQYDRKGDL 780

EKLMATGPTN ESVWIQLLFQ VTNPGIKYEY T IQKDGLDND VEQWYFWQYG HWTECSVTCG 840

TGIRROTAHC IKKGRGMVKA TFCDPETQPN GRQKKCHEKA CPPRWWAGEVI EACSATCGPH 900 

GEKKRTVLCI QTMVSDEQAL PPTDCQHLLK PKTLLSCNRD ILCPSDWTVG NWSECSVSCG 960 

GGVRIRSVTC AKNHDEPCDV TRKPNSRALC GLCjQCPSSRR VLKPNKGTIS NGKNPPTLKP 1020 

VPPPTSRPRM LTTPTGPESM STSTPAISSP SPTTASKEGD LGGKQWQDSS TQPELSSRYL 1080 

ISTGSTSQPI LTSQSLSIQP SEENVSSSDT GPTSEGGLVA TTTSGSGLSS SRNPITHPVT 1140 

PFYNTLTKGP EMEIHSGSGE EREQPEDKDE SNPVIWTKIR VPGNDAPVES TEMPLAPPLT 1200 

PDLSRESWWP PFSTVMEGLL PSQRPTTSET GTPRVEGMVT EKPANTLLPL GGDHQPEPSG 1260 

KTAKRNHLKL PNNMNQTKSS EPVLTEEDAT SLITEGFLLN ASNYKQLTNG HGSAHWIVGN 1320 

WSECSTTCGL GAYWKRVECT TQMDSDCAAI QRPDPAKRCH LRPCAGWKVG NKSKCSRNCS 1380 

GGFKIREIQC VDSRDHRNLR PFHCQFLAGI PPPLSMSCHP EPCEAWQVEP WSQCSRSCGG 1440 

GVQERGVFCP GGLCDWTKRP TSTMSCNEHL CCHNATGNWD LCSTSCGGGF QKRIVQCVPS 1500 
EGNKTEDQDQ CLCDHKPRPP EFKKCNQQAC KKSADLLCTK DKLSASFCQT L 
TVRAECCFSC PQTHITHTQR QRRQRLLQKS KEL
I I I I I I

MRQSHQLPLV GLLLPSPIPS QLCEICEVSE ENYIRLKPLL NTMIQSNYNR GTS 
LKLVGIQIQT LMQKMIQQIK YNVKSRLSDV SSQELALIIL ALCVCRNAEE NLIYDYKLTD 
KLENKFQAEI ENMEAHNGTP LTNYYQLSLD VLALCLFNGN YSTAEWNHF TPENKNYYFG 
SQFSVDTGAM AVLALTCVKK SLINGQIKAD EGSLKNISIY TKSLVEKILS SKKENC-LIGN 
TFSTGEAMQA LFVSSDYYNE NDWNCQQTLN TVLTEISQGA PSNPNAAAQV LPALMGKTPL 
DINKDSSCVS ASGNFNISAD EPITVTPPDS QSYISVNYSV RINETYFTKV TVLNGSVFLS 
VMEKAQKNND TIFGFTMEER SWGPYITCIQ GLCANNNDRT YWELLSGGEP LSQGAGSYW 
RNGENLEVRW SKY 

Seq ID NO: C371 Protein Sequence



Seq ID NO: C372 Protein Sequence
Protein Accession #: NP_037403.1

1 11 21 31 41 51 

I I I I I I 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAOLL RVWGAPRNSD
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAAItRPR PP V YDDGPAGPDA 

EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL P 

RVKRLETPAP QVPARRLLPP 

Se

Protein Accession #: NP_002236.1

1 11 21 31 41 51 

I I I I I I

MLQSLAGSSC VRLVERHRSA WCFGFLVLGY LI LVI S n ' FSSVELPYED LLRQELRKLK

RRFLEEHECL SEQQLEQFLG RVLEASNYGV SVLSNASGNW NWDFTSALFF ASTVLSTTGY 
GHTVPLSDGG KAFCIIYSVI GIPFTLLFLT AWQRITVHV TRRPVLYFHI RWGFSKQWA 
IVHAVLLGFV TVSCFFFIPA AVFSVLEDDW NFLESFYFCF ISLSTIGLGD YVPGEGYNQK 
FRELYKIGIT CYLLLGLIAM LWLETFCEL HSLKKFRK 

PSSITDQAAG MKEDQKQNEP FVATQSSACV DGPAHH

Seq ID NO: C374 Protein Sequence 



I I I I I I 

MGRHLALLLL LLLLFQHFGD SDGSQRLEQT PLQFTHLEYN VTVQENSAAK T~ 
VYITHPAWEV RYKIVSGDSE NLFKAEEYIL GDFCFLRIRT KGGNTAILNR EVKDHYTLIV 
KALEKNTNVE ARTKVRVQVL DTNDLRPLFS PTSYSVSLPE NTAIRTSIAR VSATDADIGT 
NGEFYYSFKD RTDMFAIHPT SGVIVLTGRL DYLETKLYEM EILAADRGMK LYGSSGISSM 
AKLTVHIEQA NECAPVITAV TLSPSELDRD PAYAIVTVDD CDQGANGDIA SLSIVAGDLL 
QQFRTVRSFP GSKEYKVKAI GDIDWDSHPF GYNLTLQAKD KGTPPQFSSV KVIHVTSPQF 
KAGPVKFEKD VYRAEISEFA PPNTPWMVK AIPAYSHLRY VFKRTPGKAK FSLNYNTGLI 
SIL-EPVKRQQ AAHFELEVTT SDRKASTKVL VKVLGAHSNP PEFTQTAYKA AFDENVPIGT 
TIMSLSAVDP DEGENGYVTY SIANMHVPF AIDHFTGAVS TSENLDYELM PRVYTLRIRA 
SDWGLPYRRE VEVLATITLN NLNDNTPLFE KINCEGTIPR DLGVGEQITI VSAIDADELQ 
LVQYQIEAGN ELDLFSLNPN SGVLSLKRSL MDGLGAKVSF HSLRITATDG ENFATPLYIN 
I TVAASHKLV NLQCEETGVA KMIAEKLLQA NKIjHNOGEVE DIFFDSHSVN AHIPQFRSTL 
PTGIQVKENQ PVGSSVIFHN STDLDTGFNG KLVYAVSGGN EDSCFMIDME TGMLKILSPL 
DRETTDKYTL NITVYDLGIP QKAAWRLLHV VWDANDNPP EFLQESYFVE VSEDKEVHSE 
IIQVEATDKD LGPNGHVTYS ILTDTDTFSI DSVTGWNIA RPLDRELQHE HSLKIEARDQ 
AREEPQLFST VWKVSLEDV NDNPPTFIPP NYRVKVREDL PEGTVIMWLE AHDPDLGQSG 
QVRYSLLDHG EGNFDVDKLS GAVRIVQQLD FEKKQVYNLT VRAKDKGKPV SLSSTCYVEV 
EWDVNENLH PPVFSSFVEK GTVKEDAPVG SL-VMTVSAHD EDAGRDGEIR YSIRDGSGVG 
VFKIGEETGV IETSDRLDRE STSHYWLTVP ATDQGWPLS SFIEIYIEVE DVNDNAPQTS 
EPVYYPEIME NSPKDVSWQ IEAFDPDSSS NDKLMYKITS GNPQGFFSIH PKTGLITTTS 
RKLDREQQDE HILEVTVTDN GSPPKSTIAR VIVKILDEND NKPQFLQKFY KIRLPEREKP 
DRERNARREP LYRVIATDKD EGPNAEISYS IEDGN^F"! F FIEPKTGV, 1 
DILSIKAVDN GRPQKSSTTR LHIEWISKPK QSLEPISFEE SFFTFTW.ES DPVAHMIGVI 
SVEPPGIPL-W FDITGGNYDS HFDVDKGTGT IIVAKPLDAE QKSNYNLTVE ATDGTTTILT 
QVPIKVIDTN DHRPQFSTSK YEWIPEDTA PETEILQISA VDQDEKNKU YTLQSSRDPL 
SLKKFRLDPA TGSLYTSEKL DHEAVSPAHL TVMVRDQDVP VKRKFARIW NVSDIKDHAP 
WFTASSYKGR VYESAAVGSV VLQVTALDKD KGKNAEVLYS IESGNIGNI ' I FMIDPVLG 
SIKTAKELDR SNQAEYDLMV KATDKGSPPM SEITSVRIFV TIADNASPKF TSKEYSVELS 
ETVSIGSFVG MVTAHSQSSV VYEIKDGNTG DAFDINPHSG THTQKALDF E*~ ~ " 
QGTNMAGLST NTTVLVHLQD ENDNAPVFMQ AEYTGLISES ASINSWLTD R
3 TYFAIDSSTG AIHTVI.SLDY EETSIPHPTV QVHDMGTPRL 1860
P VFAKPLYEAS LLLPTYKGVK VITVNATDAD SSAFSQLIYS 1920 
ITEGNIGEKF SMDYKTGALT VQNTTQLRSR YELTVRASDG RFACL T r ^EHL 1980

KFTQDVYSAV VKENSTEAET LAVITAIGSP INEPLFYHIL NPDRRFKISR TSGVLSTTGT 2040 
PFDREQQEAF DVWEVIEEH KPSAVAHVW KVIVEDQNDN APVFVRLPYY AWKVDTEVG 2100 
HVIRYVTAVD RDSGRNGEVH YYLKEHHEHF QIGPLGEISL KKQFELDTLN KEYLVTWAK 2160 
DGGNPAFSAE VIVPITVMNK AMPVFEKPFY SAEIAESIQV HSPWHVQAN SPEGLKVFYS 2220 
ITDGDPFSQF TINFNTGVIN VIAPLDFEAH PAYKLSIRAT DSLTGAHAEV FVDIIVDDIN 2280 
DNPPVFAQQS YAVTLSEASV IGTSWQVRA TDSDSEPNRG ISYQMFGNHS KSHDHFHVDS 2340 
STGLISLLRT LDYEQSRQHT IFVRAVDGGM PTLSSDVIVT VDVTDLNGNP PLFEQQIYEA 2400 
RISEHAPHGH FVTCVKAYDA DSSDIDKLQY SILSGNDHKH FVIDSATGII TLSNLHRHAL 2460 
KPFYSLNLSV SDGVFRSSTQ VHVTVIGGNL HSPAFLQNEY EVELAENAPL HTLVMEVKTT 2520
DGDSGIYGHV TYHIVNDFAK DRFYINERGQ IFTLEKLDRE TPAEKVISVR U1AKDAGGKV 2580 
AFCTVNVILT DDNDNAPQFR ATKYEVNIGS SAAKGTSWK SASDADEGSN ADITYAIEAD 2640 
SESVKENLEI NKLSGVITTK ESLIGLENEF FTFFVRAVDN GSPSKESWL VYVKILPPEM 2700 
QLPKFSEPFY TFTVSEDVPV GTEIDLIRAE HSGTVLYSLV KGNTPESNRD ESFVIDRQSG 2760 
RLKLEKSLDH ETTKWYQFSI LARCTQDDHE MVASVDVSIQ VKDAMDNSPV FESSPYEAFI 2820 
VENLPGGSRV IQIRASDADS GTNGQVMYSL DQSQSVEVIE SFAINMSTGW ITTLKELDHE 2880 
KRDNYQIKW ASDHGEKIQL SSTAIVDVTV TDVNDSPPRF TAEIYKGTVS EDDPQGGVIA 2940 
ILSTTDADSE EINRQVTYFI TGGDELGQFA VETIQNEWKV YVKKPLDREK RDNYLLTITA 3000 
TDGTFSSKAI VEVKVLDAND NSPVCEKTLY SDTIPEDVLP GKLIMQISAT DADIRSNAEI 3060 
TYTLLGSGAE KFKLNPDTGE LKTSTPLDRE EQAVYHLLVR ATDGGGRFCQ ASIWTLEDV 3120 
NDNAPEFSAD PYAITVFENT EPGTLLTRVQ ATDADAGLNR KILYSLIDSA DGQFSINELS 3180 
GIIQLEKPLD RELQAVYTLS LKAVDQGLPR RLTATGTVIV SVLDINDNPP VPEYREYGAT 3240 
VSEDILVGTE VLQVYAASRD IEANAEITYS IISGNEHGKF SIDSKTGAVF IIENLDYESS 3300 
HEYYLTVEAT DGGTPSLSDV ATVNVNVTDI MDNTPVFSQD TYTTVISEDA VLEQSVITVM 3360
ADDADGPSNS HIHYSIIDGN QGSSFTIDPV RGEVKVTKLL DRETISGYTL TVQASDNGSP 3420 
PRVNTTTVNI DVSDVNDNAP VFSRGKYSVI IQENKPVGFS VLQLWTD ED SSHNGPPFFF 3480 
TIVTGNDEKA FEVHPQGVLL TSSAIKRKEK DHYLLQVKVA DNGKPQLSSL TYIDIRVIEE 3540 
SIYPPAILPL EIFITGSGEE YSGGVIGKIH ATDQDVYDTL TYSLDPQMDM LFSVSSTGGK 3600 
LIAHKKLDIG QYLLNVSVTD GKFTTVADIT VHIRQVTQEM LNHTIAIRFA NLTPEEFVGD 3660
YWRNFQRALR NILGVRRNDI QIVSLQSSEP HPHLDVLLFV EKPGSAQIST Kl 
VTDIEEIIGV RILNVFQKLC AGLDCPWKFC DEKVSVDESV MSTHSTARLS F 
VCLCKEGRCP PVHHGCEDDP CPEGSECVSD PWEEKHTCVC PSGRFGQCPG " 
YVKYRLTENE NKLEMKLTMR LRTYSTHAW MYARGTE 
VSVQSIQVND GQWHAVALEV NGNYARLVLD QVHTASGTAP G 
QGTRHGRSPQ VGNGFRGCMD SIYLNGQELP LNSKPRf 

SNPCQNGGVC NPSPAGGYYC KCS AL«Y IGTH CEISVNPCSS NPCLYGGTCV VDNGGFVCQC 
RCLYTCQRCQ LSPYCKDEPC KNGGTCFDSL DGAVCQCDSG FRGERCQSDI DECSGNPCLH 
GAIiCENTHGS YHCNCSHEYR GRHCEDAAPN QYVSTPWNIG LAEGIGIWF VACIFLLWV 
FVLCRKMISR KKKHQAEPKU KHLGPATAFL QRPYFDSKLN KNIYEDIPPO VPVRPISYTP 
SIPSDSRNNL DRNSFEGSAI PEHPEFSTFN PESVHGHRKA VAVCSVAPNL PPPPPSNSPS 
DSDSIQKPSW DFDYDTKWD LDPCLSKKPL EEKPSQPYSA RESLEEVQEL SSFQSESCDD 
N H T^Dtl MPSVPLPDIQ EFPNYEVIDE QTPLYSADPN AIDTDYYPGG YDIESDFPPP 
PEDFPAADEL PPLPPEFSNQ FESIHPPRDM PAAGSLGSSS RNRQRFNI.NQ YLPNFYPLDM 
SEPQTKGTGE NSTCREPHAP YPPGYQRHFE APAVESMPMS VYASTASCSD VSACCEVESS 
VMMSDYESGD DGHFEEVTIP PLDSQQHTEV 



MCYGKCARCI GHSLVGLALL CIAANILLYF PNGETKYASE NHLSRFVWFF SGIVGGGLLM 
LLPAFVFIGL EQDDCCGCCG HENCGKRCAM LSSVLAALIG IAGSGYCVIV AALGLAEGPL
CLDSLGQWNY TFASTEGQYL LDTSTWSECT EPKHIVEWNV SLFSILLALG GIEFILCLIQ 
VINGVLGGIC GFCCSHQQQY DC 


1 11 21 31 41 51 

I I I I I I 

MSTENVEGKP SNLGERGRAR SSTFLRWQP MFNHSIFTSA VSPAAERIRF ILGEEDDSPA 
PPQLFTELDE LLAVDGQEME WKETARWIKF EEKVEQGGER WSKPHVATLS LHSLFELRTC
MEKGSIMLDR EASSLPQLVE HIVDHQIETG LLKPELKDKV TYTLLRKHRH QTKKSKLRSL 
ADIGKTVSSA SRMFTNPDNG SPAMTHRNLT SSSLMDISDK PEKDQLKNKF MKKLPRDAEA 
SNVLVGEVDF LDTPFIAFVR LQQAVMLGAL TEVPVPTRFL FILLGPKGKA KSYHEIGRAI 
ATLMSDEVFH DIAYKAKDRH DLIAGIDEFL DEVIVLPPGE WDPAIRIEPP KSLPSSDKRK 
NMYSGGENVQ MNGDTPHDGG HGGGGHGDCE ELQRTGRFCG GLIXDIKRKA PFFASDFYDA
LNIQALSAIL FIYLATVTNA ITFGGLLGDA TDNMQGVLES FLGTAVSGAI FCLFAGQPLT 
ILSSTGPVLV FERLLFNFSK DNNFDYLEFR LWIGLWSAFL CLILVATDAS FLVQYFTRFT 
EEGFSSLISF IFIYDAFKKM IKLADYYPIN SNFKVGYNTL FSCTCVPPDP ANISISNDTT 
LAPEYLPTMS STDMYHNTTF DWAFLSKKEC SKYGGNLVGN NCNFVPDITL MSFILFLGTY 
TSSMALKKFK TSPYFPTTAR KLISDFAIIL SILIFCVIDA LVGVDTPKLI VPSEFKPTSP
NRGWFVPPFG ENPWWVCLAA AIPALLVTIL IFMDQQITAV IVNRKEHKLK KGAGYHLDLF 
WVAILMVICS LMALPWYVAA TVISIAHIDS LKMETETSAP GEQFKFLGVR EQRVTGTLVF 
ILTGLSVFMA PILKFIPMPV LYGVFLYMGV ASLNGVQFMD RLKLLLMPLK HQPDFIYLRH 
VPLRRVHLFT FLQVLCLALL WILKSTVAAI IFPVMILALV AVRKGMDYLF SQHDLSFLDD 
SLDSDNDDS DCPYSEKVPS IKIPMDIMEQ QPFLSDSKPS 



DRERSPTFLE RHTSC 

Seq ID NO: C378 Protein Sequence
I I I I I I

MSTPGVNSSA SLSPDRLNSP VTIPAVMFIF GWGNLVAIV VLCKSr.^ I ITTFYTL CC 
LAVTDLLGTL LVSPVTIATY MKGQWPGGQP LCEYSTFILL FFSLSGLSII CAMSVERYLA 
INHAYFYSHY VDKRLAGLTL FAVYASNVLF CALPNMGLGS SRLQYPDTWC FIDWTTNVTA 
HAAYSYMYAG FSSFLILATV LCNVLVCGAL LRMHRQFMRR TSLGTEQHHA AAAASVASRG 
HPAASPALPR LSDFRRRRSF RRIAGAEIQM VILLIATSLV VLICSIPLW RVFVNQLYQ? 
SLEREVSKNP DLQAIRIASV NPILDPWIYI LLRKTVLSKA IEKIKCLFCR IGGSRREP.SG 
QHCSDSQRTS SAMSGHSRSF ISRELKEISS TSQTLLPDLS LPDLSENGLG GRNLLPGVPG 
MGLAQEDTTS LRTLRISETS DSSQGQDSES VLLVDEAGGS GRAC-PAPKGS 3LQVTFPSET 
LNLSEKCI 



MGHPPLLPLL LLLHTCVPAS WGLRCMQCKT NGDCRVEECA LGQDLCRTTI VRLWEEGEEL 60 

ELVEKSCTHS EKTNRTLSYR TGLKITSLTE WCGLDLCNQ GNSGRAVTYS RSRYLECISC 120 

GSSDMSCERG RHQSLQCRSP EEQCLDWTH WIQEGEEGRP KDDRHLRGCG YLPGCPGSMG 180 

FHNNDTFHFL KCCNTTKCNE GPILELENLP QNGRQCYSCK GNSTHGCSSS ETFLIDCRGP 240 

3 HAHLGDAFSM NHIDVSCCTK SGCNEPDLDV 300 



1 11 21 31 41 51 

I I I I I I 

MDFESGQVDP LASVILPPNL LENLSPEDSV LVRRAQFTFF NKTGLFQDVG PQRKTLVSYV 
MACSIGNITI QNLKDPVQIK IKHTRTQEVH HPICAFWDLN KNKSFGGKNT SGCVAHRDSD 
ASETVCLCNH FTHFGVLMDL PRSASQLDAR NTKVLTFISY IGCGISAIFS AATLLTYVAF 
EKLRRDYPSK ILMNLSTALL FLNLLFLLDG WITSFHUDGL CIAVAVLLHF FLLATFTWMG 
LEAIHMYIAL VKVFNTYIRR YILKFCIIGW GLPALWSW LASRNNNEVY GKESYGKEKG 
DEFCWIQDPV I FYVTCAGYF GVMFFLNIAM FIWMVQICG RNGKRSNRTL REEVLRNLRS 
WSLTFLLGM TWGFAFFAWG PLNIPFMYLF SIFNSLQGLF IFIFHCAMKE NVQKQWRRHL 
CCGRFRLADN SDWSKTATNI IKKSSDNLGK SLSSSSIGSN STYLTSKSKS SSTTYFKRNS 
S FNKSGSLRQC FHGQVLVKTG PC 



1 11 21 31 41 

I I I I I 

MTVARPSVPA ALPLLGELPR LLLLVLLCLP AVWGDCGLPP DVPNAQPALE 



it KIPGEKDSVI CLKGSQWSDI EEFCNKSCEV PTRLNSASLK QPYITQNYFP 
VGTWEYECR PGYRREPSLS PKLTCLQNLK WSTAVEFCKK KSCPNPGEIR NGQIDVPGGI 
LFGATISFSC NTGYKLFGST SSFCLISGSS VQWSDPLPEC REIYCPAPPQ IDNGIIQGER 
C I (JSVT YACMKGFTMI GEHSIYCTVN NDEGEWSGPP PSCRGKSLTS KVPPTVQKPT 
TVNVPTTEVS PTSQKTTTKT TTPNAQATRS TPVSRTTKHF HETTPNKGSG T — 
GHTCFTLTGL LGTLVTMGLL T 



I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRKLLRV DCSDLGLSEL 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 
AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 
LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDL 
TGTANLESLT LTGAQI SSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ " " " 

EIYEIKVDTF QQLLSLRSLN LAWNKIAIIH PNAFSTLPSL IKLDLSSNLL
LTHLKLTGNH ALQSLISSEN FPELKVIEMP YAYQCCAFGV CENAYKISNQ
DLHKKDAGMF QAQDERDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCSHL 
WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA 
ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGF 



TKLYCNLDKG 
LLVWPLPAC LNPLLYILFN 
TQALVTFTSS 



I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 
PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 
LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 
AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN R 
LNYNNLDEFP T. ""
FQHLPELRTL TLNGASQITE FPDLTGTANL ESLTLTGAQI SSLPOTVCNQ L
YNLLEDLPSF SVCQKLQKID LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 
LPSLIKLDLS SNLLSSFPIT GLHGLTHLKL TGNHALQSLI SSENFPELKV IEMPYAYQCC 
AFGVCENAYK ISNQNNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 
CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTCNALVTS TVFRSPLYIS PIKIiLIGVIA 
AVNMLTGVSS AVLAGVDAFT FGSFARHGAW WENGVGCHVI GFLSIFASES SVFLLT1AAL 
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 
STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKKIAL LLFTNCILNC 
PVAFLSFSSL INLTFISPEV IKFILLWVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS 



LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ
CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMLLY FFTMAGTVWW 
VILTITWFLA AGRKWSCEAI EQKAVW FHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRHQEKL KKFMIRIGVF 

SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY

LMTLIVGISA VFWVGSKKTC TEWAGFFKRK RKRDPISESR RVLQESCEFF LKHNSKVKHK 
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 


Seq ID NO: C385 Protein Sequence



I I I I I I 

MKIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 

PSKSNESHDH MDDMDDEDDD DHVDSODSID SND3DDVDDT DDSIIQSDESH HSDESDELVT 120 

DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLREKSKKF RRPDIQYPDA TDEDITSHME 180 

SEELNGAYKA IPVAQDLNAP SDKDSRGKDS YETSQLDDQS AETHSHKQSR LYKRK ANDES 240 

NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 300 



1 11 21 31 41 51 

M RG PAR PRRLPliliSVL LLPLLGGTQT AIVFIKQPSS CDALQGRRAL LRCEVEAPGP 
VHVYNLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARS ANAS FN 
IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 
KERN fLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARWLA PQDVWARYE 
EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 
CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 
VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 
SQLEEGKPGY LDCLTQATPK PTWWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 
CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 
GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 
TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 
RYTCIAGNSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 
GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIOEEVALT SLGSGPAATN 
KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAQGLSE GVAETLVLVK SLQTKDEQQQ 
LDFRRELEMF GKLNHANWR LLGLCREAEP HYIWLEYVDL GDLKQFLRIS KSKDEKLKSQ 
PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 
YHFRQAWVPL RKMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEHPHGGQ ADDEVLADLQ 
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP 



V NATCAIRHPC HNNLMNQIRS QLAQLNGSAN 
ALFILYYTAQ GEPFPNNLDK LCGPNVTDFP PFHANGTEKA KLVELYRIW YLGTSLGNIT 
RDQKILNPSA LSLHSKLNAT ADILRGLLSN VLCRLCSKYH VGHVDVTYGP DTSGKDVFQK 
KKLGCQLLGK Yr 



MGRPRLTLVC HVSIIISARD LSMNNLTELQ PGLFHHLP.FL EELRLSGNHL SHIPGQAFSG 60 

LYSLKILMLQ NNQLGGIPAE ALWELPSLQS LRLDANLISL VPSRSFEGLS SLRH^WLDDN 120 

ALTEIPVRAL NNLPALQAMT LALNRISHIP DYAFQNLTSL WLHLHNNRI QKLGTHSFEG 180 

LHNLETLDLN YNKLQEFPVA IRTLGRLQEL GFHNNNIKAI PEKAFMGNPL LQTIHFYDNP 240
IQFVGRSAFQ YLPKLHTLSL NGAMDIQEFP DLKGTTSLEI LTLTRAGIRL LPSGMCQQLP
QIEELPSLHR CQKLEEIGLQ HNRIWEIGAD TFSQLSSLQA LDLSWNAIRS 
SLVKLDLTDN QLTTLPLAGL GGLMHLKLKG NLALSQAFSK DSFPKLRILE 
VPYAYQCCPY GMCASFFKAS GQWEAEDLHL DDEESSKRPL GLLARQAENH YDQDLDELQL 
EMEDSKPHPS VQCSPTPGPF KPCEYLFESW GIRLAVWAIV LLSVLCNGLV LLTVFAGGPV 
PLPPVKFWG AIAGAMTLTG ISCGLLASVD ALTFGQFSEY GARWETGLGC RATGFLAVLG 
L AAVQCSVSVS CVRAYGKSPS LGSVRAGVLG CLALAGLAAA LPLASVGEYG 
? PEGQPAALGF TVALVMMNSF CFLWAGAYI KLYCDLPRGD FEAVWD CAMV 
RHVAWLI FAD GLLYCPVAFL SFASMLGLFP VTPEAVKSVL LWLPLPACL NPLLYLLFNP 
HFRDDLRRLR PRAGDSGPLA YAAAGELEKS SCDSTOALVA FSDVDLILEA SEAGRPPGLE 
TYGFPSVTLI SCQOPGAPRL EGSHCVEPEG NHFGNPQPSM DGELLLRAEG STPAGGGLSG 
GGGFQPSGLA FASHV 

Seq ID NO: C389 Protein Sequence 

1 11 21 31 41 51

I I I I I I 

MASLVSLELG LLLAVLWTA TASPPAGLLS LLTSGQGALD QEALGGLLNT LADRVHCTNG 
PCGKCLSVED ALGLGEPEGS GLPPGPVLEA RYVARLSAAA VLYLSNPEGT CEDTRAGLWA 
SHADHLLALL ESPKALTPGL SWLLQRMOAR AAGQTPKTAC VDIPQLLEEA VGAGAPGSAG 
GVLAALLDHV RSGSCFHALP SPQYFVDFVF QQHSSEVPMT IiAELSALMQR LGVGREAHSD 
HSHRHRGASS RDPVPLISSS NSSSVWDTVC LSARDVMAAY GLSEQAGVTP EAWAQLSPAL 
LQQQLSGACT SQSRPPVQDQ LSQSERYLYG SLATLLICLC AVFGLLLLTC TGCRGVAHYI 
LQTFLSLAVG ALTGDAVLHL TPKVLGLHTH SEEGLSPQPT WRLLAHLAGL YAFFLFENLF 
NLLLPRDPED LEDGPCGHSS HSHGGHSHGV SLQLAPSELR QPKPPHEGSR ADLVAEESPE 
LLNPEPRRLS PELRLLPYMI TLGDAVHNFA DGLAVGAAFA SSWKTGLATS LAVFCHELPH 
ELGDFAALLH AGLSVRQALL LNLASALTAF AGLYVALAVG VSEESEAWIL AVATGLFLYV 
ALCDMLPAML KVRDPRPWLL FLLHNVGLLG GWTVLLLLSL YEDDITF 

Seq ID NO: C390 Protein Sequence 



I I I I I 

MANASEPGGS GGGEAAALGL KLATLSLLLC VSLAGNVLFA LL] 
CLADGLRALA CLPAVMLAAR RAAAAAGAPP GALGCKLLAF LAALFCFHAA FLLLGVGVTR 
I AERLAGWPCA AMLVCAAWAL ALAAAFPPVL DGGGDDEDAP CALEQRPDGA 
/GATHLV YLRLLFFIHD RRKMRPARLV PAVSHDWTFH GPGATGQAAA 
NWTACFCRCF TPPALVGIRP AGPCRGARRL LVLESFKTEK RLCKMFYAVT LLFLLLWGPY 
WASYLRVLV RPGAVPQAYL TASVWLTFAQ AGINPWCFL FNRELRDCFR AQFPCCQSPR 
TTQATHPCDL KGIGL 



I I I I I I 

MAAARPARGP ELPLLGLLLL LLLGDPGRGA ASSGNATGPG PRSAGGSARR S" ' 
LSHCGRAAPC EPLRYNVCLG SVLPYGATST LLAGDSDSQE EAHGKLVLWS G 
VIQPLLCAVY MPKCENDRVE LPSRTLCQAT RGPCAIVERE RGWPDFLRCT PDRFPEGCTN 
EVQNIKFNSS GQCEVPLVRT DNPKSWYEDV EGCGIQCQNP LFTEASHQDM HSYIAAFGAV 
TGLCTLFTLA TFVADWRNSN RYPAVILFYV NACFFVGSIG WLAQFMDGAR REIVCRADGT 
MRLGEPTSNE TLSCVIIFVI VYYALMAGW WFWLTYAWH TSFKALGTTY QPLSGKTSYF 
HLLTWSLPFV LTVAILAVAQ VDGDSVSGIC FVGYKNYRYR AGFVLAP I GL VIiIVGGYFLI 
RGVMTLFSIK SNHPGLLSEK AASKINETML RLGI FGFLAF GFVLITFSCH FYDFFNQAEW 
ERSFRDYVLC QANVTIGLPT KQPIPDCEIK NRPSLLVEKI NLFAMFGTGI AMSTWVWTKA 
TLLIWRRTWC RLTGQSDDEP KRIKKSKMIA KAFSKRHELL QNPGQELSFS MHTVSHDGFV 
AGLAFDLNEP SADVSSAWAQ HVTKMVARRG AILPQDISVT PVATPVPPEE QANLWLVEAE 
ISPELQKRLG RKKKRRKRKK EVCPLAPPPE LHPPAPAPST IPRLPQLPRQ KCLVAAGAWG 
AGDSCRQGAW TLVSNPFCPE PSPPQDPFLF SAPAPVAWAH GRRQGLGPIH SRTNLMDTEL 



I I I I I I 

MGARSGARGA LLLALLLCWD PRLSQAGRKR SGEVLPDSFP SAPAEPLPYF LQEPQDAYIV 
KNKPVELRCR AFPATQIYFK CNGEWVSQND HVTQEGLDEA TLGARGGLRV REVQIEVSRQ 
QVEELFGLED YWCQCVAWSS AGTTKSRRAY VRIAYLRKNF DOEPLGKEVP LDHEVLLQCR 
PPEGVPVAEV EWLKNEDVID PTQDTNFLLT IDHNLIIRQA RLSDTANYIC VAKNIVAKRR 
STTATVIVYV NGGWSSWAEW SPCSNRCGRG WQKRTRTCTN PAPLNGGAFC EGOAFQKTAC 
S TECAHWRSRE CMAPPPQNGG RDCSGTLLDS KNCTDGLCMQ 



ALTGGFHPVN FKTARPSNPQ L 

3 LADGADLLGV LPPGTYPSDF ARDTHFLHLR SASLGSQQLL 



EIPFYHIWSG SQKALHCTFT LERHSLASTE LTCKICVRQV EGEGQIFQLH T 
LDTLCSAPGS TVTTQLGPYA FKIPLSIRQK ICNSLDAPNS RGNDWRMLAQ KLSMDRYLNY 
FATKASPTGV ILDLWEALQQ DDGDLNSLAS ALEEMGKSEM LVAVATDGDC
MNRKARRCLG HLFLSLGMVY LRIGGFSSW ALGASI ICMK IPGLAPHQRA ICQSRPDAII
VIGEGSQMGL DECQFQFRNG RWNCSALGER TVFGKELKVG SREAAFTYAI IAAGVAHAIT 
AACTQGNLSD CGCDKEKQGQ YHRDEGWKWG GCSADIRYGI GFAKVFVDAR EIKQNARTLM 
NLHNNEAGRK ILEENMKLEC KCHGVSGSCT TKTCWTTLPO. FRELGYVLKD KYNEAVHVEP 



1 11 21 31 41 51

I I I I I I 

MDALCGSGEL GSKFWDSNLS VHTENPDLTP CFQNSLLAWV PCIYLKVALP CYLLYLRHHC 
RGYIILSHLS KLKMVLGVLL WCVSWADLFY SFHGLVHGRA PAPVFFVTPL WGVTMLLAT 
LLIQYERLQG VQSSGVLIIF WFLCWCAIV PFRSKILLAK AEGSISDPFR FTTFYIHFAL 
VLSALILACF REKPPFFSAK MVDPMPYPET SAGFLSRLFF WWFTKMAIYG YRHPLEEKDL 
WSLKEEDRSQ MWQQLLEAW RKQEKQTARH KASAAPGKNA SGEDEVLLGA RPRPRKPSFL 
KALLATFGSS FLISACFKLI QDLLSFINPQ LLSILIRFIS NPMAPSWWGF LVAGLMFLCS 
MMQSLILQHY YHYI FVTGVK FRTGIMGVIY RKALVITNSV KRASTVGSIV NLMSVDAQRF 
MDLAPFLNLL WSAPLQIILA I YFLWO.NLGP SVLAGVAFMV LLIPLNGAVA VKMRAFQVKQ 
MKLKDSRIKL MSEILNGIKV LKLYAWEPSF LKOVEGIRQG ELQLLRTAAY LHTTTTFTWM 
CSPFLVTLIT LWVYVYVDPN NVLDAEKAFV SVSLFNILRL PLNKLPQLIS NLTQASVSLK 
RIQQFLSQEE LDPQSVERKT ISPGYAITIH SGTFTWAQDL PPTIflSLDIQ VPKGALVAW 
GPVGCGKSSL VSALLGEMEK LEGKVHMKGS VAYVPQOAWI QNCTLQEblVL FGKALNPKRY 
QQTLEACALL ADLEMLPGGD QTEIGEKGIN LSGGQRQRVS ItARAVYSDAD IFLLDDPLSA 
VDSHVAKHIF DHVIGPEGVL AGKTRVLVTH GI SFLPQTDF IIVLADGQVS EMGPYPALLQ 
RNGSFANFLC NYAPDEDOGH LEDSWTALEG AEDKEALLIE DTLSNHTDLT DNDPVTYWQ 
KQFMRQLSAL SSDGEGQGRP VPRRHLGPSE KVQVTEAKAD GALTQSEKAA I' """" 
DYAKAVGLCT TLAICLLYVG QSAAAIGANV WLSAWTNDAM ADSRQNNTSL R 
LQGFLVMLAA MAMAAGGIQA ARVLHQALLH NKIRSPQSFF D 
EVLAPVILML LNSFFNAIST LWIMASTPL FTWILPLAV LYTLVQRFYA ATSROLKRLE 
SVSRSPIYSH FSETVTGASV IKAYHRSRDF EI ISDTKVDA NQRSCYPYII SNRWLSIGVE 
FVGNCWLFA ALFAVIGRSS LNPGLVGLSV SYSLQVTFAL NWMIRMMSDL ESNIVAVERV 
KEYSKTETEA PWWEGSRPP EGWPPRGEVE FRKYSVRYRP GLDLVLRDLS LHVHGGEKVG 
I VGRTGAGKS SMTLCI.FRIL EAAKGEIRID GLNVAD I GLH DLRSQLTIIP ODPILFSGTL 
RMNLDPFGSY SEEDIWWALE LSHLHTFVSS QPAGLDFQCS SGGENLSVGQ RQLVCLARAL 
LRKSRILVLD EATAAIDLET DNLIQATIRT QFDTCTVLTI AHRLNTIMDY TRVLVLDKGV 
VAEFDSPAKL I, 



I I I I I I

MRARPQVCEA LLFALALQTG VCYGIKWLAL SKTPSALALN QTQHCKQLEG LVSAQVQLCR 
SNLELMHTW HAAREVMKAC RRAFADMRWN CSSIELAPNY LLDLERGTRE SAFVYALSAA 
AISHAIARAC TSGDLPGCSC GPVPGEPPGP GNRWGGCADN LSYGLLHGAK FSDAPMKVKK 
TGSQANKLMR LHNSEVGRQA LRASLEMKCK CHGVSGSCSI RTCWKGLQEL QDVAADLKTR 
YLSATKWHR PMGTRKHLVP KDLDIRPVKD SELVYLQSSP DFCMKNEKVG SHGTQDRQCN 
KTSNGSDSCD LMCCGRGYNP YTDRWERCH CKYHWCCYVT CRRCERTVER YVCK 

Seq ID NO: C396 Protein Sequence



I I I I I I 

MEWGYLLEVT SLLAALALLQ RSSGAAAASA KELACQEITV PLCKGIGYKY TYMPNQFNHD 60 

TQDEAGLEVH QFWPLVEIQC SPDLKFFLCS MYTPICLEDY KKPLPPCRSV CERAKAGCAP 120 

LMRQYGFAWP DRMRCDRLPE QGNPDTLCMD YNRTDLTTAA PSPPRRLPPP PPGEQPPSGS 180

GHGRPPGARP PHRGGGRGGG GGDAAAPPAR GGGGGGKARP PGGGAAPCEP GCQCRAPMVS 240 

VSSERHPLYN RVKTGQIANC ALPCHNPFFS QDERAFTVFW IGLWSVLCFV STFATVSTFL 300 

IDMERFKYPE RPIIFLSACY LFVSVGYLVR LVAGHEKVAC SGGAPGAGGB GGAGGAAAGA 360 

GAAGAGAGGP GGRGEYEELG AVEQHVEYET TGPALCTWF LLVYFFGMAS SIWWVILSLT 420 

WFLAAGMKWG NEAIAGYSQY FHLAAWLVPS VKSIAVLALS SVDGDPVAGI CYVGNQSLDN 480

LRGFVLAPLV IYLFIGTMFL LAGFVSLFRI RSVIKQQDGP TKTHKLEKLM IRLGLFTVLY 540 

C LFYEQHNRPR WEATHNCPCL RDLQ.PDQAF.R PDYAVFMLKY FMCLWGITS 600 



GGGSLYSDVS TGLTWRSGTA SSVSYPKQMP LSQV 



MLQGPGSLLL LFLASHCCLG SARGLFLFGQ PDFS/KKS Z iir LQI HGIEYQNMRL 

PNLLGHETMK EVLEQAGAWI PLVMKQCHPD TKKFLCSLFA PVCLDDLDET ICPCHSLCVQ 

VKDRCAPVMS AFGFPWPDMb ECDRFPQDND LCIPLASSDH LLPATESAPK VCEACKNKMD 

DDNDIMETLC KNDFALKIKV KEITYINRDT KIILETKSKT IYKLNGVSER D
DSLQCTCEEM NDINAPYMM GQKQGGELVI TSVKRWQKGQ REFKRISRSI RKLQC



IVCCALPMWR VSAFIGSNII TSQr IWEGLM MW WQSTGQ 60
APGLLVALVG AQCTNCVQDD TAKAKITIVA 120 
VPEAQKREMG AGLYVGWAAA ALQLLGGALL 180



Seq ID NO: C399 Protein Sequence
Protein Accession #: NP_036581.1 

1 11 21 31 41 51

I I I I I I 

MESRKDITNQ EELWKMKPRR NLEEDDYLHK DTGETSMLKR PVLLHLHQTA HADEFDCPSE 
LQHTQELFPQ WHLPIKIAAI IASLTFLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 
VSITLLALVY LPGVIAAIVQ LHNGTKYKKF PHWLDKWMLT RKQFGLLSFF FAVLHAIYSL 
SYPMRRSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL ALLAVTSIPS 
VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 
VLIFKSILFL PCLRKKILKI RHGWEDVTKI NKTEICSQL 



1 11 21 31 41 51 

I I I I I I 

MANCEFSPVS GDKPCCRLSR RAQLCLGVSI LVLILVWLA WVPRWRQTW SGPGTTKRFP 
ETVLARCVKY TEIHPEMRHV DCQSVWDAFK GAFISKHPCM ITEEDYQPLM KLGTQTVPCN 
KILLWSRIKD LAHQFTQVQR DMFTLEDTLL GYLADDLTWC GEFNTSKINY QSCPDWRKDC 
SHNPVSVFWK TVSRRFAEAA CDWHVMLNG SRSKIFDKNS TFGSVEVHNL QPEKVQTLEA 
WVIHGGREDS RDLCQDPTIK ELESIISKRM IQFSCKNIYR PDKFLQCVKN PEDSSCTSEI 



I I I I I I 

MVSCTFSGPL RETNENVKKF YALRAFMFRM SSEAAKLGSS RTPKPRKHRA TTRAKIFKRF 
FSEG E NOP LVEELAVIIIT YSDDPAPTTS PSSVQPREFG VMQGAPRARF GSRTPPAAAE 
ASSPHLGIGE AACQSGARAA APRAGARRCQ PQRQAAAAAA TAQTHTLPHA RTRADPAGRR 
RRHPRSPAPG GEGTCSEGPA PRREMEEEMQ PAEEGPSVPK IYKORSPYSV LKTFPSKRPA 
LAKRYERPTL VELPIIGIILRT PAQPPPASPA ASSSSSFAAV VRLGAPPRPP RRGFRARGTI 
PPLLPAPGVA GTLLPPPTSS SPPSPRPRPW HAAAPRGGTS HTHMWRSQST LPGSDTMVSV 
FGI.MAQRRWQ HRSLKQFEWG ILGSWGTWPC GQDWLEKEGQ VAVLLPRSEG NTAPKKSRMI 
LDAFAQQCSR VLSLLNCGGK LLDSNHSQSM ISCVKQEGSS YNERQSHCHI GKGVHSQTSD 
NVDIEMQYMQ RKQQTSAFLR VFTDSLQNYL LSGSFPTPNP SSASEYC 
HTLENISLDS TASLCKSRHL SREPPVKSDF PNPLQQALAG GASRPFSGAQ QSIA 
LEDGIRSPVP LSCEALEMDL TSLGSKQLLN NYPVYITSKQ WDEAVNSSKK D 
RFVFTTDELK YSCGLGKRKR SVQSGETGPE RRPLDPVKVT CLRGTASFRS VSPSVISFHR 
IGCGSPRTSV QPSVF 



GASDNDFMIL TLWPGFRPP QSVMADTENK 
EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE ENTTTSFTWA 
FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 

FSALANTVTL AGGPSFTSKG LKYFHHFTLS LCGNQGRKMS VCTDNVTDLR IPEGESGFSK 
SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 
PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL FGTCSDGTCD GCNFHFLWES 
AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 
: KNQKLEYKYS KLVM 
! PAPVTISLSE DS 






I I I I I

MALQGISWE LSGLAPGRXC AMVIoADFGAR WRVDRPGSR " 
REPRAAASVQ AVGCAAGALP PRCHGETPAG PRDSAAGKSK 
GHDINYLALS GVLSKIGRSG ENPYAPLNLV ADFAGGGLMC 
DANMVEGTAY LSSFLWKTQK SSLWEAPRGQ NMLDGGAPFY 
YELLIKGLGL KSDELPNQMS TDDWPEMKKK FADVFAKKTK AEWCQIFDGT
EEWHHDHNK ERGSFITSEE QDVSPRLAPL LLNTPAIPSS KGDPFIGEKT 
REEIYQLNSD KIIESNKVKA SL 

Seq ID NO: C404 Protein Sequence
I I I I I

MQRWTLWAAA FLTLHSAQAF PQTDISISPA LPELPLPSLC P 
WAEADLYCSE FSVGRKSAKL ASIHSWEENV FVYDLVNSCV PGIPADVWTG LHDHRQEGQF 
ST SYWDGSQPDD GVHADPEEED CVQIWYRPTS EQLQAPEPQL P 
P KLMDQSNVSR KSLKPSKSHL MEPPTPVAKH QKAKTRHRSL R 
GSWKERMNAD YGRRKRSAPR QEGRLRCRER RLRAASGQGR PEGQRKORQQ ERQERGWEEL 
GGVSPMRGAQ AWQHGLGAGS QRGAAPECGE NHQAPELGST WRGQRLQPQT AALCHFALRK 
LPGNAHGLAA AFVQPALQVQ EEKNNRTRFS GAYFTMSDPT CDQDSKEQSL RRHGREAEXD 
GPYRLVKKKR GPVACPSSFE LQSGGEVCLD FPVELRAGTW IAREPP 

Seq ID NO: C405 Protein Sequence
Protein Accession #: XP_054869.2



VLILNDNLLS SLPNNLFRFV PLTHLDLRGN RLKLLPYVGL LQHMDKWEL OLEENPWNCS 
CELISLKDWL DSISYSALVG DWCETPFRL HGRDLDEVSK QELCPRRLIS DYEMRPQTPL 
STTGYLHTTP ASVNSVATSS SAVYKPPLKP PKGTRQPNKP RVRPTSRQPS KDLGYSNYGP 
SIAYOTKSPV PLECPTACSC NLQISDLGLN VNCQERKIES IAELQPKPYN PKKMYLTENY 
IAWRRTDFL EATGLDLLHL GNKRISMIQD RAFGDLTNLR RLYLNGNRIE RLSPELFYGL 
QSLQYLFLQY MLIREIQSGT FDPVPNLQLL FLMNMLLQAM P 
TSLPVSGVLD OLKSLIQIDL HDNPWDCTCD IVGMKLWVEQ L 



ELKVTYSSNH HLQQQQQPPP PPQOPQO.OPP POLQLO.PGEE ERRESHHLRS PAYSVSTIEP 
REDLLSPVQD ADRFYRGILE PDKHCSTTPA GNSLPEYPKF PCSPAAYTFS PNYDLRRPHQ 
P SAVFVEPNRN EYLELKAKLN VEPDYDEVLE KQTTFSQF 



I I I I 

I TLQILPVFFS NCLFLALYDS VILLKHWLL LSRSKSTRGE WRRMLTSEGL 

RCVWKSFLLD AYKOVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 

ERPLWNFGS ATUPPFTSQL PAFRKLVEEF SSVADFLLVY IDEAHPSDGW AIPGDSSLSF 

EVKKHQNQED RCAAAQQLLE RFSLPPOCRV VADRMDNNAN IAYGVAFERV CI VORQKIAY 
SI LQEVRHWLEK NFSKRUKKTR LAG 

1 Protein Sequence 



I I I I I 

MSSCVSSQPS SNRAAPQDEL GGRGSSSSES QKECEALRGL SSLSIHLGKE S 
GCAVDLGLAR DRPLEADGQE VPLDSSGSQA RPHLSGRKLS LQERSQGGLA AGGSLDMHGR 
CICPSLPYSP VSSPQSSPRL PRRPTVESHH VSITGMQDCV Q.LNQYTLKDE IGKGSYGWK 
LAYNENDNTY YAMKVLSKKK LIRQAGFPRR PPPRGTRPAP GGCIQPRGPI EQVYQEIAIL 
KKLDHPNWK LVEVLDDPNE DHLYMVFELV NQGPVMEVPT LKPLSEDQAR FYFQDLIKGI 
EYLHYQKI IH RDIKPSNLLV GEDGHIKIAD FGVSNEFKGS DALLSNTVGT PAFMAPESLS 
ETRKIFSGKA LDVWAMGVTL YCFVFGQCPF MDERIMCLHS KIKSQALEFP DQPDIAEDLK 
DLITRMLDKN PESRIWPEI KLHPWVTRHG AEPLPSEDEN CTLVEVTEEE VENSVKHIPS 
LATVILVKTM IRKRSFGNPF EGSRREERSL SAPGNLLTKK PTRECESLSE LKEARQRRQP 
PGHRPAPRGG GGS ALVRGS P CVESCWAPAP GSPARMHPLR PEEAMEPE 



Seq ID NO: C408 Protein Sequence 



1 11 21 31 41 51

I I I I I I 

MGLSLPKEKG LILCLWSKFC RWFQRRESWA QSRDEQNLLQ C.KRIWESELL LAAKDNDVQA 
LNKLLKYEDC KVHQRGAMGE TALHIAALYD NLEAAMVLME AAPELVFEPM TSELYEGQTA 
LHIAWNQNM NLVRALLARR ASVSARATGT AFRRSPCNLI YFGEHPLSFA ACVNSEEIVR 
LLIEHGADIR AQDSLGNTVL HILILQPNKT FACQMYNLLL SYDRHGDHLQ PLDLVPNHQG 
LTPFKLAGVE GNTVMFQHLM QKRKHTQWTY GPLTSTLYDL TEIDSSGDEQ SLLELIITTK 
KREARQILDQ TPVKELVSLK WKRYGRPYFC MLGAIYLLYI ICFTMCCIYR PLKPRTNNRT 
SPRDNTLLQQ KLLQEAYMTP KDDIRLVGEL VTVIGAIIIL LVEVPDIFRM GVTRFFGQTI 
LGGPFHVLI I TYAFMVLVTM VMRLISASGE WPMSFALVL GWOJVMYFAR GFQMLGPFTI 
MIQKMIFGDL MRFCWLMAW ILGFASAFYI IFQTEDPEEL GHFYDYPMAL FSTFELFLTI 
IDGPANYNVD LPFMYSITYA AFAI I ATLLH LNLLIAMMGD THWRVAKERD ELWRAQIVAT 
TVMLERKLPR CLWPRSGICG REYGLGDRWF LRVEDRQDLK RQRIQRYAQA FHTRGSEDLD 
KDSVEKLELG CPFSPHLSLP MPSVSRSTSR SSANWERLRQ GTLRRDLRGI INRGLEDGES 

Seq ID Ni 
MGIIIVMSEW



Protein Acce 



I I I I I I 

MLSKVLPVLL GILLILQSRV EGPQTESKNE ASSRDWYGP QPQPLET" ' — 



TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTWTVE IAVNSLGPLS ILDMEYTIDI 
IFSQTWYDER LCYNDTFESL VLNGNWSQL WIPDTFFRNS KRTHEHEITM PNQMVRIYKD 
GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSFSSFSYPE NEMIYKWENF KLEINEKNSW 
KLFQFDFTGV SNKTEIITTP VGDFMVMTIF FNVSRRFGYV AFQNYVPSSV TTMLSWVSFW 
IKTESAPART SLGITSVLTM TTLGTFSRKN FPRVSYITAL DFYIAICFVF CFCALLEFAV 
LNFLIYNQTK AHASPKLRHP RINSRAHART RARSRACARQ KQEAFVCQIV TTEGSDGEER 
PSCSAQQPPS PGSPEGPRSL CSKLACCEWC KRFKKYFCMV PDCSGSTWQQ GRLCIHVYRL 
DNYSRWFPV TFFFFNVLYW LVCLNL 

Seq ID NO: C412 Protein Sequence
Protein Accession #: NP_068819.1

1 11 21 31 41 51 

I I I I I I 

IIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSK — 



MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM I 

F QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 
?ARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 
F LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 
: SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQOARL 



3 QTWNSKRTHE HEITI4PNQMV RIYKDGKVLY TIRMTIDAGC 
SHSCPLSFSS FSYPENEHIY KWENFKLEIN EKNSWKLFQF DFTGVSNKTS 



1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 
MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 
INEKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 
LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 
ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 
GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQGRL 



MPAVSGPGPL FCLLLLLLDP HSPETGCPPL RRFEYKLSFK 
ILGLEEVRLT PSMRNRSGAV WSRASVPFSA WEVEVQMRVT 



VGSVLGGLAS WDGIGIFFDS PAEDTQDSPA IRVLASDGHI PSEQPGDGAS QGLGSCHWDF 180 

RNRPHPFRAR ITYWGQRLRM SLNSGLTPSD PGEFCVDVGP LLLVPGGFFG VSAATGTLAD 240 

F SLSEPSPEVP PQPFLEMQQL RLARQLEGLW ARLGLGTRED VTPKSDSEAQ 300

E ETLGRHRRIL QALRGLSKQL AQAERQWKKQ LGPPGQARPD GGWALDASCQ 360 

IPSTPGRGGH LSMSLNKDSA KVGALLHGQW TLLQALQ3MR DAAVR14AAEA QVSYLPVGIE 420

HHFLELDHIL GLLQEELRGP AKAAAKAPRP PGQPPRASSC LQPGIFLFYL LIQTVGFFGY 480 

VHFRQELNKS LQECLSTGSL PLGPAPHTPR ALGILRRQPL PASMPA 526 



LRHLCTGLQA AGMAWPGTLW RHTCQGRAXA AEGPWGLFRP 
HRCPREAGQA PVGPSPETQG VAHVCSRARV SVDEREPGGG AYAMHVTPRW K 
EQAAPETGRG P 



I I I I I



MPRLVERYEM P 

LLMDFVFSLV NSFLGEFLRR I IGMQLITSL GLQEFDIARN VLELIYAQTL VWIGIFFCPL 
LPFIQMIMLF IMFYSKNISL MMNFQPPSKA WRASQMMTFF IFLLFFPSFT GVLCTLAITI 
WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWK IYRNLIGSVH FFFILTLIVL 

ITYLYWQIT EGRKIMIRLL HEQIINEGKD K 
EQQGFLHLGE HDGSLDLRSR RSVQEGNPRA 



FLIEKLIK LQDMEKKANP SSLVLERREV 



MLSDDHVNE I IIQVENVSSG VQSHPSSNQI F 
RFSTSLNESM SQTLHSLECM G 
DKEKSKQTHR I 



3 INMVLSISDI DVIDSQTVSK SO 



KAIRNQPRTM 
SISLWQKTLK 
TGLEFFTGVG 
MAKYFRNNFI 
/TFNQLLTRF 
[NLAVPCIYS 
TLIGQDIYRL 



IIGGKFGTSV LSYFNFLRWL L 
YFRDTVMYYG FYTNSTIQHG N 
NPHIYSGGIT K 



GVLCTLAITI 



MFRLVERYEM PRHEVYVLLI 
LLMDFVFSLV NSFLGEFLRR 
LPFIQMIMLF IMFYSKNISL 
WRLKPSADCG PFRGLPLFIH 
I ITYLYWQIT 
EQQGFLHLGE 



SIYSWIDTLS TRPGYLWWW IYRNLIGSVH 
HEQIINEGKD KMFLIEKLIK LQDMEKKANP 
RSVQEGNPRA 



Protein Accession #:

Protein Sequence



I I I



MGGDLVLGLG ALRRRKRLLE QEKSLAGWAL VLAGTGIGLM VLHAEMLWFG GCSWALYLFL 
VKCTISISTF LLLCLIVAFH AKEVQ L FMTD NGLRDWRVAL TGRQAAQIVL ELWCGLHPA 
PVRGPPCVQD LGAPLTSPQP WPGFLGQGEA LLSLAMLLRL YLVPRAVLLR SGVLLNASYR 
SIGALNQVRF RHWFVAKLYM NTHPGRLLLG LTLGLWLTTA WVLSVAERQA VI 
LWLIPITFLT IGYGDWPGT MWGKIVCLCT GVMGVCCTAL L 
FMMDIQYTKE MKESAARVLQ EAWMFYKHTR RKESHAf 

ALTELL STALGPRQLP 



PRQVLYPRNS 
PEDPWTVGKN 
CFPWTNITPP 
VLSLLFILLL 
AYQSVQETWL 
TFVLLLICIA 
CPGLMCVFQG 
PQDIPTFPLI 
CCFKCCLWCL 
LFFGKLLWG 
FGMCVDTLFL 



EAYGKPVKYD 
TGAYCGMGEN 
EFSQTVGEVF 
ALPGITNDTT 
RLVAGPLVLV 
AALIVLAVLE 
YWAMTALYPL 
YSSKGLIQRS 
SAFIRTLRYH 



PSFRGPIKNR SCTDVICCVL F 
KDKPYLLYFN IFSCILSSNI ISVAENGLQC PTPQVCVSSC 
YTKNRNFCLP GVPWNMTVIT SLQQELCPSF LLPSAPALGR 
IQQGISGLID SLNARDISVK IFEDFAQSWY W I LVALGVAL 
LILGVLGVLA YGIYYCWEEY RVLRDKGASI SQLGFTTNLS 



GVGVLSFFFF 



PTQPATLGYV LWASNISSPG CEKVPINTSC NPTAHLVNSS 
VFNLQIYGVL GLFWTLNWVL ALGQCVLAGA FASFYWAFHK 
TGSLAFGALI LTLVQIARVI LEYIDHKLRG VQNPVARCIM 
AYIMIAIYGK NFCVSAKNAF MLLMRNIVRV WLDKVTDLL 
SGRIPGLGKD FKSPHLNYYW LPIMTSILGA YVIASGFFSV 
GFLLQPWGS ASDHCRSRWG RRRPYILTLG VMMLVGMALY



A FLSNMLFFTD FMGQ I VYRGD PYSAHNSTEF 

G CWGFCINSVF SSLYSYFQKV LVSYIGLKGL YFTGYLLFGL GTGFIGLFPN 

VYSTLVLCSL FGVMSSTLYT VPFNLITEYH REEEKERQQA PGGDPDNSVR GKGMDCATLT 
CMVQLAQILV G 



I I I I I 

MEGFGGVGGR GTRGFAAKGV WRGRAEEGPV LGAAERGFMV STGSRRRVFE G 
PGKGTGRQRG AWGPRAEDGV RRRTLGMPRG SRRDVRAPCG PAGSWGARGG R 
RGSATAAARH HVPPAPGGPF GPRAPAGSTR VPARAGGAVE PTGAAAVARL ARPAGGALPT 
AGAQGAGPAR GRSGEGSEWA RRGKGRPGPY QSPLGPAVAE GQELKDKSRL RYPINGFQAL 
VLTALLVGLG MSAGLPLGAL PEMLLPLAFV ATLTAFIFSL FLYMKAQVAP VSALAPGGNS 
GNPIYDFFLG RELNPRICFF DFKYFCELRP GLIGHVLINL ALLMKEAELR GSPSLAMWLV 
NGFQLLYVGD ALWHEEAVLT TMDITHDGFG FMLAFGDMAW VPFTYSLQAQ FLLHHPQPLG 
LPMASVICLI NATGYYIFRG ANSQKNTFRK NPSDPRVAGL ETISTATGRK LLVSGWWGMV 
RHPNYLGDLI MALAWSLPCG VSHLLPYFYL LYFTALLVHR EARDERSACR STAWPGRSTA 
GVCLTASCPT STEAAPPPQV GHVPTHPPAH PGPGASTHLG LKGLHPTQP 



I I I I I I 

MGRLLRAARL PPLLSPLLLL LVGGAFLGAC VAGSDEPGPE GLTSTSLLDL LLPTGLEPLD 
SEEPSETMGL GAGLGAPGSG FPSEENEESR ILQPPQYFWE EEEELNDSSL DLGPTADYVF 
PDLTEKAGSI EDTSQAQELP NLPSPLPKMN LVEPPWHMPP REEEEr 2 EE Z_ EI 3EVE 
QEEEEEEELL PVNGSQEEAK PQVRDFSLTS SSQTPGATKS RHEDSGDQAS SGVEVESSMG 
PSLLLPSVTP TTVTPGDQDS TSQEAEATVL PAAGLGVEFE APQEASEEAT AGAAGLSGQH 
EEVPALPSFP QTTAPSGAEH PDEDPLGSRT SASSPLAPGD MELTPSSATL GQEDLNQQLL 
EGQAAEAQSR IPWDSTQVIC KDWSNLAGKN YIILNMTENI DCEVFRQHRG PQLLALVEEV 
LPRHGSGHHG AWHISLSKPS EKEQHLLMTL VGEQGWPTQ DVLSKLGDIR RSLEEIGIQ.M 
YSTTSSCQAR ASOVRSDYGT LFWLWIGA ICIIIIALGL LYNCKQRRLP KLKHVSKGEE 
LRFVENGCHD NPTLDVASDS QSEMQEKHPS LNGGGALNG? GSW6ALMGGK RDPEDSDVFE 



Protein Access 



ILMETLLYEQ LLEPPTMEVL GMTDSEEDLD PMEDFDSLEC MEGSDALALR 
SLRAPRLAQL SEVAMHSLGL AFIYDQTEDI RDVLRSFMDG FTTLKENIMR 
VSCEQVLIiAIj LLLLALLLPL LSGGLHLLLK 



I I I I I I 

MATPLPPPSP RHLRLLRLLL SGLVLGAALR GAAAGHPDVA ACPGSLDCAI, KRi 
HACGPCLQPF QEDQQGLCVP RMRRPPGGGR PQPRLEDEID FLAQELARKE SGQSTPPLPK 
DRQRLPEPAT LGFSARGQGL ELGLPSTPGT PTPTPHTSLG SPVSSDPVHM SPLEPRGGQG 
DGIiALVLILA FCVAGAAALS VASLCWCRLQ REIRLTQKAD YATAKAPGSP AAPRISPGDQ 

RLAQSAEMYH YQHQRQQHLC LERHKEPPKE LDTASSDEEN EDGDFTVYEC P 

VRNPLFDHAA LSAPLPAPSS PPALP 

Seq^I ^ 

1 11 21 31 41 51 

I I I I I I 

MVCSLWVLLL VSSVLALEEV LLDTTGETSE IGWLTYPPGG WDEVSVU3DQ RRLTRTFEAC 
HVAGAPPGTG QDNVJLQTHFV ERRGAQRAHI RLHFSVRACS SLGVSGGTCR ETFTLYYRQA 
EEPDSPDSVS SWHLKRWTKV DTIAADESFP SSSSSSSSSS SAAWAVGPHG AGQRAGLQLN 
VKERSFGPLT QRGFYVAFQD TGACLALVAV RLFSYTCPAV LRSFASFPET QASGAGGASL 
VAAVGTCVAH AEPEEDGVGG QAGGSPPRLH CNGEGKWMVA VGGCRCQPGY QPARGDKACQ 
ACPRGLYKSS AGNAPCSPCP ARSHAPNPAA PVCPCLEGFY RASSDPPEAP CTGPPSAPQE 
LWFEVQGSAL MLHWRLPREL GGRGDLLFNV VCKECEGRQE PASGGGGTCH RCRDEVHFDP 
RQRGLTESRV LVGGLRAHVP YILEVQAVNG VSELSPDPPQ AAAINVSTSH EVPSAVPWH 
QVSRASNSIT VSWPQPDQTN GNILDYQLRY YDQAEDESHS FTLTSETNTA TVTQLSPGHI 
YGFQVRARTA AGHGPYGGKV YFQTLPQGEL SSQLPERLSL VIGSILGALA FLLLAAITVL 
AWFQRKRRG TGYTEQLQQY SSPGLGVKYY IDPSTYEDPC QAIRELAREV DPAYIKIEEV 
IGTGSFGEVR QGRLQPRGRR EQTVAIQALW AGGAESLQMT FLGRAAVLGQ FQHPNILRLE 
GWTKSRPLM VLTEFMELGP LDSFLRQREG QFSSLQLVAK QRGVAAAKQY LSSFAFVHRS 
LSAHSVLVNS HLVCKVARLG HSPQGPSCLL RWAAPEVIAH GKHTTSSDVN SFGILMWEVM 
SYGERPYWDM SEQEVLNAIE QEFRLPPPPG CPPGLHLLML DTWQKDRARR PHFDQLVAAF 
DKMIRKPDTL QAGGDPGERP SQALLTPVAL DFPCLDSPQA WLSAIGLECY QDNFSKFGLC 
AGHQKKLLHH IQLLQQHLRQ QGSVEV
Protein Sequence 



I YRKVPKDLTQ PTYTGAIISI CCCLFILFLF LSELTGFITT EWNELYVDD 
PDKDSGGKID VSLNISLPNL HCELVGLDIQ DEMGRHEVGH IDHSMKIPLN NGAGCRFEGQ 
FSINKVPGNF HVSTHSATAQ PQNPDHTHVI HKLSFGDTLO. VQNIKGAFNA LGGADRLTSN 
PLASHDYILK IVPTVYEDKS GKQRYSYQYT VANKEYVAYS HTGRIIPAIW FRYDLSPITV 
KYTERRQPLY RFITTICAII GGTFTVAGIL DSCIFTASEA WKKIQLGKMH 



Protein Accession #: NP_002142.1

1 11 21 31 41 51

I I I I I I 

MAQKEGGRTV PCCSRPKVAA LTAGTLLLLT AIGAASWAIV AVLLRSDQEP LYPVQVSSAD 
ARLMVFDKTE GTWRLLCSSR SNARVAGLSC EEMGFLRALT HSELDVRTAG ANGTSGFFCV 
DEGRLPHTQR LLEVISVCDC PRGRFLAAIC QDCGRRKLPV DRIVGGRDTS LGRWPWQVSL 
RYDGAHLCGG SLLSGDWVLT AAHCFPERNR VLSRWRVFAG AVAOASPKGL QLGVQAWYH 
GGYLPFRDPN SEENSNDIAL VHLSSPLPLT EYIQPVCLPA AGQALVDGKI CTVTGWGNTQ 
YYGQQAGVLQ EARVPIISND VCNGADFYGN QIKPKMFCAG YPEGGIDACQ GDSGGPFVCE 
DSISRTPRWR LCGIVSWGTG CALAQKPGVY TKVSDFREWI FQAIKTHSEA SGMVTQL 



1 11 21 31 41 51 

I I I I I I 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 
EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE EMTTTSFTWA 
FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 
RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 
FSALANTVTL AGGPSFTSKG LKYFHHFTLS LCGNQGRKMS VCTDNVTDLR IPEGESGFSK 
SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 
PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHFLWES 
AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 
ISAGTCTAIL LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVEDD 
LIFTSKKSLF GKIKSFTSKQ PAPVTISLSE DS 

Seq ID NO: C431 Protein Sequence 
Protein Accession #: NP_004855.1

I I I I I I

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF P 
EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGS 
HRALFRLSPT ASRSWDVTRP LRRQLSIARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 
ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADKVL SPREVQVTMC 
IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 
LAKDCHCI 



1 11 21 31 41 51 

I I I I I I 

MEDPSGAREP RARPRERDPG RRPHPDQGRT HDRPRDRPGD PRRKRSSDGN R~" 
RDQERDGNRD RNRDRERERE RERDPDRGPR RDTHRDAGPR AGEHGVWEKP RQSRTRDGAR 
GLTWDAAAPP GPAPWEAPEP PQPQRKGDPG RRRPESEPPS ERYLPSTPRP GREEVEYYQS 
EAEGLLECHK CKYLCTGRAC CQMLEVLLNL LILACSSVSY SSTGGYTGIT SLGGIYYYQF 
GGAYSGFDGA DGEKAQQLDV QFYQLKLPMV TVAMACSGAL TALCCLFVAM GVLRVPWHCP 
LLLVTEGLLD MLIAGGYIPA LYFYFHYLSA AYGSPVCKER QALYQSKGYS GFGCSFHGAD 
IGAGIFAALG IWFALGAVL AIKGYRKVRK L] 



1 11 21 31 41 51 

I I I I I I 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPKNPGK DQDHHVHIGQ 
GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 
TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 
ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 
SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 
KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 
IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVR 
STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVK 
EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ? 
GTELKHMGQQ LVGQYPIHFH DAGDVDERGG YDPPTYIRDL S 
D NGVKTTEASA KDKRPFLSII SAF.YSPHQDA DPLKPR2PAI
IRHFIAYKNQ DHGAWL RGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 
FSPPCRCLLK LVTTGSPFAH VSLAHS 
It is understood that the examples described above in no way serve to limit the true
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication, accession number, or patent 
application were specifically and individually indicated to be incorporated by reference. 
WHAT IS CLAIMED IS:



1. A method for determining the presence or absence of a pathological cell in a patient,
said method comprising detecting a nucleic acid comprising a sequence at least 80% identical
to a sequence as described in Tables 2A-80 in a biological sample from said patient, thereby
determining the presence or absence of said pathological cell.

2. The method of Claim 1, wherein:

a) said pathology is described in Table 1, including a cancer; and/or

b) said biological sample comprises isolated nucleic acids.

3. The method of Claim 1, wherein said biological sample is tissue from an organ which is
affected by said pathology of Table 1, including a cancer.

4. The method of Claim 2, wherein said nucleic acids are mRNA.

5. The method of Claim 2:

a) further comprising a step of amplifying nucleic acids before said step of detecting said
nucleic acid; or

b) where said detecting is of a protein encoded by said nucleic acid.

6. The method of Claim 1, wherein said nucleic acid comprises a sequence as described in
Tables 2A-80.

7. The method of Claim 2, wherein:

a) said detecting step is carried out by:

i) using a labeled nucleic acid probe;

ii) utilizing a biochip comprising a sequence at least 80% identical to a sequence as
described in Tables 2A-80; or

iii) detecting a polypeptide encoded by said nucleic acid; or

b) said patient is:

i) undergoing a therapeutic regimen to treat said pathology of Table 1; or

ii) suspected of having said pathology or cancer.

8. An isolated nucleic acid molecule comprising a sequence as described in Tables 2A-80.


9. The nucleic acid molecule of Claim 8, which is labeled.

10. An expression vector comprising the nucleic acid of Claim 8.

11. A host cell comprising the expression vector of Claim 10.

12. An isolated polypeptide which is encoded by a nucleic acid molecule comprising a
sequence as described in Tables 2A-80.

13. An antibody that specifically binds a polypeptide of Claim 12.

14. The antibody of Claim 13:

a) conjugated to an effector component;

b) conjugated to a detectable label, including a fluorescent label, a radioisotope, or a
cytotoxic chemical;

c) which is an antibody fragment; or

d) which is a humanized antibody.

15. A method for specifically targeting a compound to a pathological cell in a patient,
said method comprising administering to said patient an antibody of Claim 13, thereby
providing said targeting.

16. A method for determining the presence or absence of a pathological cell in a patient,
said method comprising contacting a biological sample with an antibody of Claim 13.

17. The method of Claim 16, wherein:

a) said antibody is conjugated to:

i) an effector component; or

ii) a fluorescent label; or

b) said biological sample is a blood, serum, urine, or stool sample.

18. A method for identifying a compound that modulates a pathology-associated polypeptide,
said method comprising the steps of:

a) contacting said compound with a pathology-associated polypeptide, said
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 
at least 80% identical to a sequence as described in Tables 2A-80; and 

b) determining the functional effect of said compound upon said polypeptide. 

19. A drug screening assay comprising the steps of: 

a) administering a test compound to a mammal having a pathology of Table 1 or a 
cell isolated therefrom; and 

b) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as described in 
Tables 2A-80 in a treated cell or mammal with the level of gene expression of said 
polynucleotide in a control cell or mammal, wherein a test compound that 
modulates said level of expression of the polynucleotide is a candidate for the 
treatment of said pathology. 